Acessibilidade / Reportar erro

3D-QSARpy: Combining variable selection strategies and machine learning techniques to build QSAR models

Abstract

Quantitative Structure-Activity Relationship (QSAR) is a computer-aided technology in the field of medicinal chemistry that seeks to clarify the relationships between molecular structures and their biological activities. Such technologies allow for the acceleration of the development of new compounds by reducing the costs of drug design. This work presents 3D-QSARpy, a flexible, user-friendly and robust tool, freely available without registration, to support the generation of QSAR 3D models in an automated way. The user only needs to provide aligned molecular structures and the respective dependent variable. The current version was developed using Python with packages such as scikit-learn and includes various techniques of machine learning for regression. The diverse techniques employed by the tool is a differential compared to known methodologies, such as CoMFA and CoMSIA, because it expands the search space of possible solutions, and in this way increases the chances of obtaining relevant models. Additionally, approaches for select variables (dimension reduction) were implemented in the tool. To evaluate its potentials, experiments were carried out to compare results obtained from the proposed 3D-QSARpy tool with the results from already published works. The results demonstrated that 3D-QSARpy is extremely useful in the field due to its expressive results.

Keywords:
Drug Design; 3D-QSAR; Machine learning; Variable selection

Universidade de São Paulo, Faculdade de Ciências Farmacêuticas Av. Prof. Lineu Prestes, n. 580, 05508-000 S. Paulo/SP Brasil, Tel.: (55 11) 3091-3824 - São Paulo - SP - Brazil
E-mail: bjps@usp.br