Acessibilidade / Reportar erro

Predictive Modeling of Surface Tension in Chemical Compounds: Uncovering Crucial Features with Machine Learning

Surface tension (SFT) can shape the behavior of liquids in industrial chemical processes, influencing variables such as flow rate and separation efficiency. This property is commonly measured with experimental approaches such as Du Noüy ring and Wilhelmy plate methods. Here, we present machine learning (ML) methodologies that can predict the SFT of hydrocarbons. A comparative analysis encompassing k-nearest neighbors, random forest, and XGBoost (extreme gradient boosting) methods was done. Results from our study reveal that XGBoost is the most accurate in predicting hydrocarbon SFT, with a mean squared error (MSE) of 4.65 mN22 Massarweh, O.; Abushaikha, A. S.; Energy Rep. 2020, 6, 3150. [Crossref]
Crossref...
m-2 and a coefficient of determination (R2) score of 0.89. The feature importance was evaluated with the permutation feature importance method and Shapley analysis. Enthalpy of vaporization, density, molecular weight and hydrogen content are key factors in accurately predicting SFT. The successful integration of these methodologies holds the potential to impact efficiency in different industry processes.

Keywords:
machine learning; physicochemical properties; surface tension; organic molecules; regression


Sociedade Brasileira de Química Instituto de Química - UNICAMP, Caixa Postal 6154, 13083-970 Campinas SP - Brazil, Tel./FAX.: +55 19 3521-3151 - São Paulo - SP - Brazil
E-mail: office@jbcs.sbq.org.br