Acessibilidade / Reportar erro

Use of Biochemical Tests and Machine Learning in the Search for Potential Diagnostic Biomarkers of COVID-19, HIV/AIDS, and Pulmonary Tuberculosis

This study aims to develop, validate, and evaluate machine learning algorithms for predicting the diagnosis of coronavirus disease (COVID-19), human immunodeficiency virus/acquired immunodeficiency syndrome (HIV/AIDS), pulmonary tuberculosis (TB), and HIV/TB co-infection. We also investigated potential biomarkers associated with the diagnosis. Data from biochemical and hematological tests of infected and controls were collected in a single general hospital, totalizing 6,418 patients. The discriminant analysis by partial least squares (PLS-DA) model had the highest performance in predicting the diagnosis of COVID-19, HIV/AIDS, TB, and HIV/TB co-infection with an accuracy of 94, 97, 95, and 96%, respectively. The biomarkers calcium, lactate dehydrogenase, red blood cells (RBC), white blood cells, neutrophils, basophils, eosinophils, hemoglobin, and hematocrit were associated with COVID-19. HIV infection was associated with mean corpuscular volume, platelets, neutrophils, and mean platelet volume. Red blood cell distribution width and urea were associated with infection by Mycobacterium tuberculosis. The following biomarkers were associated with HIV/TB co-infection: lymphocytes, RBC, hematocrit, hemoglobin, aspartate transaminase, alanine transaminase, and glycemia. The PLS-DA model can optimize COVID-19, HIV/AIDS, TB, and HIV/TB co-infection diagnostics. Some biomarkers were potential diagnostic indicators and could be evaluated during the screening of these diseases.

Keywords:
COVID-19; HIV/AIDS; tuberculosis; co-infection; diagnosis; machine learning


Sociedade Brasileira de Química Instituto de Química - UNICAMP, Caixa Postal 6154, 13083-970 Campinas SP - Brazil, Tel./FAX.: +55 19 3521-3151 - São Paulo - SP - Brazil
E-mail: office@jbcs.sbq.org.br