Acessibilidade / Reportar erro

Investigating artificial intelligence models for predicting joint pain from serum biochemistry

SUMMARY

OBJECTIVE:

The study used machine learning models to predict the clinical outcome with various attributes or when the models chose features based on their algorithms.

METHODS:

Patients who presented to an orthopedic outpatient department with joint swelling or myalgia were included in the study. A proforma collected clinical information on age, gender, uric acid, C-reactive protein, and complete blood count/liver function test/renal function test parameters. Machine learning decision models (Random Forest and Gradient Boosted) were evaluated with the selected features/attributes. To categorize input data into outputs of indications of joint discomfort, multilayer perceptron and radial basis function-neural networks were used.

RESULTS:

The random forest decision model outperformed with 97% accuracy and minimum errors to anticipate joint pain from input attributes. For predicted classifications, the multilayer perceptron fared better with an accuracy of 98% as compared to the radial basis function. Multilayer perceptron achieved the following normalized relevance: 100% (uric acid), 10.3% (creatinine), 9.8% (AST), 5.4% (lymphocytes), and 5% (C-reactive protein) for having joint pain. Uric acid has the highest normalized relevance for predicting joint pain.

CONCLUSION:

The earliest artificial intelligence-based detection of joint pain will aid in the prevention of more serious orthopedic complications.

KEYWORDS:
Joint pain; Arthritis; Random forest; Perceptrons; C-reactive protein; Uric acid; Creatinine

INTRODUCTION

Machine learning (ML) is rapidly being applied in numerous disciplines of medicine for illness detection, stratification, and prediction in both at-risk groups and diseases11 Kalweit M, Walker UA, Finckh A, Müller R, Kalweit G, Scherer A, et al. Personalized prediction of disease activity in patients with rheumatoid arthritis using an adaptive deep neural network. PLoS One. 2021;16(6):e0252289. https://doi.org/10.1371/journal.pone.0252289
https://doi.org/10.1371/journal.pone.025...
. Joint redness, soreness, swelling, limping, freezing, weakness, etc. are the common signs of joint pain. Arthritis affects 24% of all adults in the United States. It is the biggest cause of work disability22 Shiel WC. MedicineNet. Joint pain: signs & symptoms. 2019. Available from: https://www.medicinenet.com/joint_pain/symptoms.htm
https://www.medicinenet.com/joint_pain/s...
. In Pakistan, the average age of osteoarthritis patients is 58 years old. Women are more likely to suffer from osteoarthritis. Diabetes, hypertension, obesity, cardiovascular disease, chronic renal failure, use of diuretics, a diet high in purine-rich foods, and alcohol usage are risk factors for gout-related joint discomfort and swelling33 Ma L, Cranney A, Holroyd-Leduc JM. Acute monoarthritis: what is the cause of my patient's painful swollen joint? CMAJ. 2009;180(1):59-65. https://doi.org/10.1503/cmaj.080183
https://doi.org/10.1503/cmaj.080183...
. A variety of ailments, including infections and autoimmune diseases like rheumatoid arthritis and inflammatory bowel disease, can result in inflammation. Increased CRP in the presence of joint pain suggests the probability of autoimmune arthritis or joint infection44 Rifbjerg-Madsen S, Christensen AW, Christensen R, Hetland ML, Bliddal H, Kristensen LE, et al. Pain and pain mechanisms in patients with inflammatory arthritis: a Danish nationwide cross-sectional DANBIO registry survey. PLoS One. 2017;12(7):e0180014. https://doi.org/10.1371/journal.pone.0180014
https://doi.org/10.1371/journal.pone.018...
.

We included a variety of models to evaluate how artificial intelligence (AI) interfaces in predicting clinical outcomes when we train the models with different attributes or when models pick features depending on their algorithms. Early detection of joint pain can help prevent more serious complications like arthritis, bursitis, fibromyalgia, osteoarthritis, tendinitis, and so on. For this purpose, we included those subjects who were suffering from mild joint edema or myalgia and visited the orthopedic department. We tested two ML decision trees (random forest, and gradient boosted) to report and compare their efficiencies in predicting the target variable (joint pain) based on the clinical biochemistry data of individuals. Clinical variables such as risk factors (such as uric acid, CRP, etc.) were taken to predict the diagnostic outcome for the indication of joint pain in the patients. We also employed multilayer perceptron (MLP) and radial basis function (RBF) to match the results and determine the relative percentage of normalized relevance of each clinical/biochemistry variable in influencing joint pain. Decision trees are nonparametric supervised learning approaches used in ML for classification55 RapidMiner9. 2022. Available from: https://docs.rapidminer.com/latest/studio/operators/rapidminer-studio-operator-reference.pdf
https://docs.rapidminer.com/latest/studi...
. Artificial neural networks (ANNs) research is available that can focus on the biomechanics of the spine by elucidating joint moments, spinal stresses, and muscle forces. Zeng et al.66 Zeng J, Zhang J, Li Z, Li T, Li G. Prediction model of artificial neural network for the risk of hyperuricemia incorporating dietary risk factors in a Chinese adult study. Food Nutr Res. 2020;64. https://doi.org/10.29219/fnr.v64.3712
https://doi.org/10.29219/fnr.v64.3712...
created a prediction model to predict the risk of hyperuricemia in Chinese individuals. Machine learning fills a big void in clinical experience learning, diagnosing predictions, and advising accurate treatment. Therefore, the primary goal was to determine if decision trees constructed using machine learning techniques and neural networks could predict joint pain based on biochemical data.

METHODS

A cross-sectional study was conducted after ethical approval from the University of Lahore (UOL), Mansoorah Teaching Hospital, Lahore. The study was conducted from January 2022 to June 2022. A total of 650 patients were included. This determined that machine learning decision models and neural networks can correctly predict joint pain under given biochemistry variables. Patients aged 20–65 years who visited outpatient department (OPD) with moderate joint swelling or myalgia were included. Cases of severe comorbidities, autoimmune, or degenerative joint conditions were not included. Through a predesigned proforma, the basic and clinical data were collected. The patients were also inquired about the presence of joint pain, which was documented on proforma. Clinical laboratory findings were conducted at the affiliated UOL's Mansoorah Hospital, Lahore. A chi-square test was applied to see the significant or insignificant difference between the expected and observed values of variables.

ML models Random Forest and Gradient Boosted were evaluated. The attributed dataset of outdoor patients was included to build the models. Observations about a parameter are provided in branches of a predictive decision tree model, which leads to inferences about the targets of the parameter. Sorting the other examples from the tree's root to the leaf nodes allowed for the categorization of each instance. Gender, age, uric acid, and CRP are considered attributes representing risk factors, and outcome in terms of identification of joint pain was considered the target variable.

A random forest operator was used to create a random forest model for classification. These trees were built or trained using bootstrapped subsets of the dataset sent over the input. A tree node represented a splitting rule for a single attribute. For splitting rule selection, only a subset of attributes determined by the subset ratio requirement was evaluated. The process of creating additional nodes was repeated until the halting requirements were fulfilled. Pruning was used to simplify the model by substituting leaves for sub-trees that had weak prediction abilities55 RapidMiner9. 2022. Available from: https://docs.rapidminer.com/latest/studio/operators/rapidminer-studio-operator-reference.pdf
https://docs.rapidminer.com/latest/studi...
. Gradient Boosted was utilized for trees that run the gradient boosted (GBT) algorithm using H2O 3.30.0.1. For the prediction of the labeled attribute, this classification model was used for data sets55 RapidMiner9. 2022. Available from: https://docs.rapidminer.com/latest/studio/operators/rapidminer-studio-operator-reference.pdf
https://docs.rapidminer.com/latest/studi...
.

The joint pain indication was labeled as yes or no. To build the model, the data sets were accessed, identified the target output, assigned the training (70%) and testing (30%) portions, trained the model under supervised learning methods, generated the model, and evaluated the performance variables (accuracies and errors) with cross-validations (10-fold).

ANN MLP model was developed to predict the presence or absence (as an outcome) of joint discomfort by evaluating data from recorded scaled variables. We began with the input layer and worked our way forward to the output layer while creating MLP. The error was determined based on the output. These stages were repeated numerous times to learn the optimal weights. The result was created using functions to predict the designated class. A four-layered ANN, through a supervised learning algorithm with batch training, was generated to classify input data (factors and covariates) into output categories as clinical outcomes. Of this, 70.2% of the data were segregated as a training set, and 29.8% were taken as a testing set. The stopping rule was used for training. Biochemistry parameters, along with age and gender, were used as predictor variables to predict dependent variable outcomes. We used the "standardized" rescaling method of covariates for the input layer. A technique for supervised back-propagation learning was used to train the MLP model. To control the synaptic weights, a batch training mode with a scaled conjugate gradient was chosen. We used the following values: 0.4 starting learning rate, 0.9 momentum, 0 interval center, and 0.5 interval offset. The use of linear combinations of terms based on a single univariate function to approximate multivariable functions is known as the radial basis function. We incorporated the radial basis-neural network (RB-NN) by adding the radial basis function as a neuron and compared the input data to the training data. The percentage normalized importance was calculated with all independent predictor variables for both models.

RESULTS

The chi-square test showed significant differences (p-value<0.050) between the expected and observed values in the following variables: hematocrit, eosinophils, CRP, blood urea nitrogen, creatinine, total bilirubin, ALT, AST, and ALP (Table 1). Random forest and gradient-boosted models both achieved 97.44% accuracy with 96.92 and 96.77% accuracy on validation. Although the percentage accuracy was the same at 97.44% in the gradient-boosted model; however, its relative errors were quite greater as compared to the random forest model. The random forest decision model performed best (Figure 1).

Table 1
Clinical statistics.
Figure 1
The decision trees analyze the uric acid first (as a root node) with values above or below the normal range. If uric acid is found above the normal range, the decision model looks for the age factor, and with increased age, it evaluates the C-reactive protein values (above or below the normal range), etc. for the prediction of joint pain (as leaf nodes) in yes/no for random forest, and leaf nodes (average of all residuals for the specified loss function) as the final output (joint pain) prediction, comprising the sum of all previous predictions and the overall prediction of the current tree.

The multilayer perceptron-neural network (MLP-NN) model produced just 2% inaccurate predictions for training and 4.6% for testing datasets, whereas the radial basis function-neural network (RBF-NN) model yielded 7.8% (training) and 7% (testing). For predicted classifications, the MLP-NN achieved 98% for training and 95.4% for testing datasets, whereas the RBF-NN showed 92.2% for training and 93% for testing datasets. The MLP model showed better performance and accuracy as compared to the RBF model. MLP and RBF both predicted the same percentage for hematocrit (HCT) (5.1%), red blood cells (2.9%), age (1.7%), and uric acid (100%). In all other biochemistry variables (except lymphocytes), RBF predicted more normalized importance percentages than MLP. Model MLP achieved the following percentages: 10.3% (creatinine), 9.8% (AST), 5.4% (lymphocytes), and 5% (CRP) for having joint pain. RBF yielded 24.5% (creatinine), 19.8% (AST), 3.2% (lymphocytes), and 8.2% (CRP) for joint pain. The analysis concludes that uric acid has the highest normalized importance for the anticipation of joint pain (Figure 2).

Figure 2
Independent variable importance chart.

DISCUSSION

It is known that uric acid is linked with the indication of joint pain. With 97% accuracy and minimal errors, the random forest decision model outscored all others. Regarding neural networks, MLP obtained 98% for training and 95.4% for testing datasets for predicted classifications. Based on both MLP and RBF results, the research suggests that uric acid has the highest normalized relevance for the prediction of joint discomfort. ANN and ML systems can provide reliable and optimum performance for clinical diagnosis, prognosis, and personalized therapy by employing mathematical algorithms and computational approaches. Researchers developed methods to detect the earliest pain progression to avoid knee osteoarthritis. Researchers have applied machine learning-based models that anticipated knee pain with 84% accuracy. They offered a technique that can identify the advancement of pain early on, hence enhancing future attempts to avoid knee osteoarthritis77 Alexos A, Kokkotis C, Moustakidis S, Papageorgiou E, Tsaopoulos D. Prediction of pain in knee osteoarthritis patients using machine learning: data from osteoarthritis initiative. In: Proceedings of the 2020 11th international conference on information, intelligence, systems and applications (IISA), IEEE; 2020. p. 1-7.. Recently, the systemic risk factors were evaluated in knee-osteoarthritis onsets from various combinations of the DeepLIFT gradient. Their methods predicted diabetes and smoking as the highest gradient factors88 Chan LC, Li HHT, Chan PK, Wen C. A machine learning-based approach to decipher multi-etiology of knee osteoarthritis onset and deterioration. Osteoarthr Cartil Open. 2021;3(1):100135. https://doi.org/10.1016/j.ocarto.2020.100135
https://doi.org/10.1016/j.ocarto.2020.10...
. Another study employed a support vector machine with 74% accuracy to predict knee-osteoarthritis progression99 Kokkotis C, Moustakidis S, Giakas G, Tsaopoulos D. Identification of risk factors and machine learning-based prediction models for knee osteoarthritis patients. Appl Sci. 2020;10(19):6797. https://doi.org/10.3390/app10196797
https://doi.org/10.3390/app10196797...
. Abnormal joint moments during walking are proven predictors of osteoarthritic knee discomfort. In another experiment, four machine-learning methods were utilized. The random forest method fared best in terms of prediction performance, but it was also the slowest by a factor of ten. The multilayer perceptron neural network technique outperformed the others in terms of root mean square error and area under the recall curve1010 Aljaaf AJ, Hussain AJ, Fergus P, Przybyla A, Barton GJ. Evaluation of machine learning methods to predict knee loading from the movement of body segments. In: Proceedings of the 2016 international joint conference on neural networks (IJCNN), IEEE; 2016. p. 5168-73.. Chronic impairment in temporomandibular joint osteoarthritis worsens with age, and the primary objective is to detect the disease before morphological degradation develops. Bianchi et al.1111 Bianchi J, Oliveira Ruellas AC, Gonçalves JR, Paniagua B, Prieto JC, Styner M, et al. Osteoarthritis of the temporomandibular joint can be diagnosed earlier using biomarkers and machine learning. Sci Rep. 2020;10(1):8012. https://doi.org/10.1038/s41598-020-64942-0
https://doi.org/10.1038/s41598-020-64942...
tackled this difficulty by processing and analyzing 52 clinical, biochemical, and high-resolution cone beam computed tomography (CBCT) (radiomics) variables from temporomandibular joint osteoarthritis patients and controls using sophisticated data science. They included machine-learning models with good accuracy for diagnosing temporomandibular joint osteoarthritis. In a study, deep neural networks were found effective for the classification of the complicated problem of knee osteoarthritis diagnosis. Gender, age, and obesity were used to categorize the subgroups studied1212 Christodoulou E, Moustakidis S, Papandrianos N, Tsaopoulos D, Papageorgiou E. Exploring deep learning capabilities in knee osteoarthritis case study for classification. In: Proceedings of the 2019 10th international conference on information, intelligence, systems and applications (IISA), IEEE; 2019. p. 1-6.. Hafezi-Nejad et al.1313 Hafezi-Nejad N, Guermazi A, Roemer FW, Hunter DJ, Dam EB, Zikria B, et al. Prediction of medial tibiofemoral compartment joint space loss progression using volumetric cartilage measurements: data from the FNIH OA biomarkers consortium. Eur Radiol. 2017;27(2):464-73. https://doi.org/10.1007/s00330-016-4393-4
https://doi.org/10.1007/s00330-016-4393-...
investigated the relationship between baseline cartilage volume measures and medial compartment joint space-loss advancement. The most relevant estimators for medial joint-space-loss progression in the MLP-ANN studies were baseline and interval changes in lateral-femoral cartilage volume. K-means algorithms are also used to predict rheumatoid arthritis patients with 84% accuracy1414 Yoo J, Lim MK, Ihm C, Choi ES, Kang MS. A study on prediction of rheumatoid arthritis using machine learning. Int J Appl Eng Res. 2017;12(20):9858-62.. Soin et al.1515 Soin A, Hirschbeck M, Verdon M, Manchikanti L. A pilot study implementing a machine learning algorithm to use artificial intelligence to diagnose spinal conditions. Pain Physician. 2022;25(2):171-8. PMID: 35322974 achieved 72% accuracy in detecting spinal pain and concluded that AI decision tools are quite effective. ML models should ideally integrate acquired information into clinical evidence, with computers capable of predicting clinical outcomes, identifying illness patterns, detecting disease signs, and improving treatment procedures1616 Hügle M, Omoumi P, Laar JM, Boedecker J, Hügle T. Applied machine learning and artificial intelligence in rheumatology. Rheumatol Adv Pract. 2020;4(1):rkaa005. https://doi.org/10.1093/rap/rkaa005
https://doi.org/10.1093/rap/rkaa005...
.

CONCLUSION AND RECOMMENDATIONS

The random forest decision model performed better overall in anticipating joint pain with biochemistry parameters. Uric acid has the highest normalized relevance for predicting joint pain. The platforms of AI models should be used in hospitals with integrated computerized medical systems.

ETHICAL APPROVAL

The study was conducted in accordance with the Declaration of Helsinki and followed the ethical standards of Pakistan. The study was conducted after getting ethical approval from the hospital's ethical review committee. The date of approval was March 3, 2022. The study conformed to institutional ethical standards.

  • Funding:

    none.

REFERENCES

  • 1
    Kalweit M, Walker UA, Finckh A, Müller R, Kalweit G, Scherer A, et al. Personalized prediction of disease activity in patients with rheumatoid arthritis using an adaptive deep neural network. PLoS One. 2021;16(6):e0252289. https://doi.org/10.1371/journal.pone.0252289
    » https://doi.org/10.1371/journal.pone.0252289
  • 2
    Shiel WC. MedicineNet. Joint pain: signs & symptoms. 2019. Available from: https://www.medicinenet.com/joint_pain/symptoms.htm
    » https://www.medicinenet.com/joint_pain/symptoms.htm
  • 3
    Ma L, Cranney A, Holroyd-Leduc JM. Acute monoarthritis: what is the cause of my patient's painful swollen joint? CMAJ. 2009;180(1):59-65. https://doi.org/10.1503/cmaj.080183
    » https://doi.org/10.1503/cmaj.080183
  • 4
    Rifbjerg-Madsen S, Christensen AW, Christensen R, Hetland ML, Bliddal H, Kristensen LE, et al. Pain and pain mechanisms in patients with inflammatory arthritis: a Danish nationwide cross-sectional DANBIO registry survey. PLoS One. 2017;12(7):e0180014. https://doi.org/10.1371/journal.pone.0180014
    » https://doi.org/10.1371/journal.pone.0180014
  • 5
    RapidMiner9. 2022. Available from: https://docs.rapidminer.com/latest/studio/operators/rapidminer-studio-operator-reference.pdf
    » https://docs.rapidminer.com/latest/studio/operators/rapidminer-studio-operator-reference.pdf
  • 6
    Zeng J, Zhang J, Li Z, Li T, Li G. Prediction model of artificial neural network for the risk of hyperuricemia incorporating dietary risk factors in a Chinese adult study. Food Nutr Res. 2020;64. https://doi.org/10.29219/fnr.v64.3712
    » https://doi.org/10.29219/fnr.v64.3712
  • 7
    Alexos A, Kokkotis C, Moustakidis S, Papageorgiou E, Tsaopoulos D. Prediction of pain in knee osteoarthritis patients using machine learning: data from osteoarthritis initiative. In: Proceedings of the 2020 11th international conference on information, intelligence, systems and applications (IISA), IEEE; 2020. p. 1-7.
  • 8
    Chan LC, Li HHT, Chan PK, Wen C. A machine learning-based approach to decipher multi-etiology of knee osteoarthritis onset and deterioration. Osteoarthr Cartil Open. 2021;3(1):100135. https://doi.org/10.1016/j.ocarto.2020.100135
    » https://doi.org/10.1016/j.ocarto.2020.100135
  • 9
    Kokkotis C, Moustakidis S, Giakas G, Tsaopoulos D. Identification of risk factors and machine learning-based prediction models for knee osteoarthritis patients. Appl Sci. 2020;10(19):6797. https://doi.org/10.3390/app10196797
    » https://doi.org/10.3390/app10196797
  • 10
    Aljaaf AJ, Hussain AJ, Fergus P, Przybyla A, Barton GJ. Evaluation of machine learning methods to predict knee loading from the movement of body segments. In: Proceedings of the 2016 international joint conference on neural networks (IJCNN), IEEE; 2016. p. 5168-73.
  • 11
    Bianchi J, Oliveira Ruellas AC, Gonçalves JR, Paniagua B, Prieto JC, Styner M, et al. Osteoarthritis of the temporomandibular joint can be diagnosed earlier using biomarkers and machine learning. Sci Rep. 2020;10(1):8012. https://doi.org/10.1038/s41598-020-64942-0
    » https://doi.org/10.1038/s41598-020-64942-0
  • 12
    Christodoulou E, Moustakidis S, Papandrianos N, Tsaopoulos D, Papageorgiou E. Exploring deep learning capabilities in knee osteoarthritis case study for classification. In: Proceedings of the 2019 10th international conference on information, intelligence, systems and applications (IISA), IEEE; 2019. p. 1-6.
  • 13
    Hafezi-Nejad N, Guermazi A, Roemer FW, Hunter DJ, Dam EB, Zikria B, et al. Prediction of medial tibiofemoral compartment joint space loss progression using volumetric cartilage measurements: data from the FNIH OA biomarkers consortium. Eur Radiol. 2017;27(2):464-73. https://doi.org/10.1007/s00330-016-4393-4
    » https://doi.org/10.1007/s00330-016-4393-4
  • 14
    Yoo J, Lim MK, Ihm C, Choi ES, Kang MS. A study on prediction of rheumatoid arthritis using machine learning. Int J Appl Eng Res. 2017;12(20):9858-62.
  • 15
    Soin A, Hirschbeck M, Verdon M, Manchikanti L. A pilot study implementing a machine learning algorithm to use artificial intelligence to diagnose spinal conditions. Pain Physician. 2022;25(2):171-8. PMID: 35322974
  • 16
    Hügle M, Omoumi P, Laar JM, Boedecker J, Hügle T. Applied machine learning and artificial intelligence in rheumatology. Rheumatol Adv Pract. 2020;4(1):rkaa005. https://doi.org/10.1093/rap/rkaa005
    » https://doi.org/10.1093/rap/rkaa005

Publication Dates

  • Publication in this collection
    16 Sept 2024
  • Date of issue
    2024

History

  • Received
    16 May 2024
  • Accepted
    31 May 2024
Associação Médica Brasileira R. São Carlos do Pinhal, 324, 01333-903 São Paulo SP - Brazil, Tel: +55 11 3178-6800, Fax: +55 11 3178-6816 - São Paulo - SP - Brazil
E-mail: ramb@amb.org.br