Abstract
Objective:
To determinate the accuracy of computed tomography (CT) imaging assessed by deep neural networks for predicting the need for mechanical ventilation (MV) in patients hospitalized with severe acute respiratory syndrome due to coronavirus disease 2019 (COVID-19).
Materials and Methods:
This was a retrospective cohort study carried out at two hospitals in Brazil. We included CT scans from patients who were hospitalized due to severe acute respiratory syndrome and had COVID-19 confirmed by reverse transcriptionpolymerase chain reaction (RT-PCR). The training set consisted of chest CT examinations from 823 patients with COVID-19, of whom 93 required MV during hospitalization. We developed an artificial intelligence (AI) model based on convolutional neural networks. The performance of the AI model was evaluated by calculating its accuracy, sensitivity, specificity, and area under the receiver operating characteristic (ROC) curve.
Results:
For predicting the need for MV, the AI model had a sensitivity of 0.417 and a specificity of 0.860. The corresponding area under the ROC curve for the test set was 0.68.
Conclusion:
The high specificity of our AI model makes it able to reliably predict which patients will and will not need invasive ventilation. That makes this approach ideal for identifying high-risk patients and predicting the minimum number of ventilators and critical care beds that will be required.
Keywords:
COVID-19; Tomography; X-ray computed; Artificial intelligence.
Resumo
Objetivo:
Determinar a acurácia da tomografia computadorizada (TC), avaliada por redes neurais profundas, na ventilação mecânica, de pacientes hospitalizados por síndrome respiratória aguda grave por COVID-19.
Materiais e Métodos:
Trata-se de estudo de coorte retrospectivo, realizado em dois hospitais brasileiros. Foram incluídas TCs de pacientes hospitalizados por síndrome respiratória aguda grave e COVID-19 confirmada por RT-PCR. O treinamento consistiu em TC de tórax de 823 pacientes com COVID-19, dos quais 93 foram submetidos a ventilação mecânica na hospitalização. Nós desenvolvemos um modelo de inteligência artificial baseado em redes de convoluções neurais. A avaliação do desempenho do uso da inteligência artificial foi baseada no cálculo de acurácia, sensibilidade, especificidade e área sob a curva ROC.
Resultados:
A sensibilidade do modelo foi de 0,417 e a especificidade foi de 0,860. A área sob a curva ROC para o conjunto de teste foi de 0,68.
Conclusão:
Criamos um modelo de aprendizado de máquina com elevada especificidade, capaz de prever de forma confiável pacientes que não precisarão de ventilação mecânica. Isso significa que essa abordagem é ideal para prever com antecedência pacientes de alto risco e um número mínimo de equipamentos de ventilação e de leitos críticos.
Unitermos:
COVID-19; Tomografia computadorizada; Inteligência artificial.
INTRODUCTION
Since coronavirus disease 2019 (COVID-19) was declared a pandemic by the World Health Organization, on March 11, 2020, various measures have been implemented worldwide in order to promote early diagnosis and containment of the disease(11 World Health Organization. Naming the coronavirus disease (CO-VID-19) and the virus that causes it. [cited 2021 Oct 28]. Available from: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease(covid-2019)-and-the-virus-that-causes-it.
https://www.who.int/emergencies/diseases...
,22 Javor D, Kaplan H, Kaplan A, et al. Deep learning analysis provides accurate COVID-19 diagnosis on chest computed tomography. Eur J Radiol. 2020;133:109402.). In a study conducted in China(33 Ai T, Yang Z, Hou H, et al. Correlation of chest CT and RT-PCR testing for coronavirus disease 2019 (COVID-19) in China: a report of 1014 cases. Radiology. 2020;296:E32-E40.), the sensitivity of reverse transcription-polymerase chain reaction (RT-PCR) tests to identify infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was found to range from 37% to 71%. In another study, Fang et al.(44 Fang Y, Zhang H, Xie J, et al. Sensitivity of chest CT for COVID-19: comparison to RT-PCR. Radiology. 2020;296:E115-E117.) demonstrated that the sensitivity of chest computed tomography (CT) was signifi cantly greater than was that of RT-PCR (98% vs. 71%; p < 0.001). Therefore, imaging came to be recognized as an important additional diagnostic tool during the pandemic.
According to the Fleischner Society, the indications for CT scans in patients with suspected COVID-19 include moderate to severe clinical features, regardless of laboratory test results, and worsening respiratory status in patients testing positive for infection with SARS-CoV-2(55 Rubin GD, Ryerson CJ, Haramati LB, et al. The role of chest im-aging in patient management during the COVID-19 pandemic: a multinational consensus statement from the Fleischner Society. Radiology. 2020;296:172-80.). In the early phase of COVID-19, CT typically shows bilateral ground-glass opacities, with a predominantly peripheral, subpleural distribution. Several days after the onset of symptoms, linear consolidations or areas with the reverse halo sign can appear, suggesting organizing pneumonia, which is associated with a poorer prognosis in older patients(66 Revel MP, Parkar AP, Prosch H, et al. COVID-19 patients and the radiology department - advice from the European Society of Radiology (ESR) and the European Society of Thoracic Imaging (ESTI). Eur Radiol. 2020;30:4903-9.).
In scenarios in which there is limited availability of radiologists, there can be a signifi cant delay in providing chest CT reports, which are helpful to emergency physicians and clinicians engaged in the management of COVID-19. Therefore, it is important to develop a method to help physicians predict the severity of the viral disease, which we argue could be done through the use of artifi cial intelligence (AI).
Studies have shown that AI algorithms, particularly deep learning algorithms, perform remarkably well in classifying lung disease(77 Hwang S, Park S. Accurate lung segmentation via network-wise training of convolutional networks. In: Cardoso MJ, Arbel T, editors. Deep learning in medical image analysis and multimodal learning for clinical decision support. Cham, Switzerland: Springer; 2017. p. 92-9.
8 Lakhani P, Sundaram B. Deep learning at chest radiography: auto-mated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology. 2017;284:574-82.-99 Zhu B, Luo W, Li B, et al. The development and evaluation of a computerized diagnosis scheme for pneumoconiosis on digital chest radiographs. Biomed Eng Online. 2014;13:141.). Deep learning is characterized as a subset of machine learning that is based on a neural network structure loosely inspired by the human brain. Convolutional neural networks (CNNs) currently represent the most prevalent deep learning architecture in medical imaging. These networks successively map image inputs to desired endpoints while learning increasingly reliable imaging features. Deep learning solutions have been proposed for the analysis of various imaging modalities, including CT(88 Lakhani P, Sundaram B. Deep learning at chest radiography: auto-mated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology. 2017;284:574-82.,1010 Prevedello LM, Erdal BS, Ryu JL, et al. Automated critical test findings identification and online notification system using artificial intelligence in imaging. Radiology. 2017;285:923-31.).
The aim of the present study was to determinate the accuracy of CT imaging assessed by deep neural networks in predicting the need for mechanical ventilation (MV) in patients hospitalized with SARS due to COVID-19.
MATERIALS AND METHODS
This was a retrospective cohort study carried out at two tertiary hospitals in Brazil between April 1, 2020 and May 31, 2020. This study was approved by the institutional ethics committees of both hospitals.
We included CT scans from patients who were hospitalized due to SARS and had a diagnosis of COVID-19, as confi rmed by RT-PCR. To identify SARS, we used the criteria established by the Brazilian National Ministry of Health(1111 Brasil. Ministério da Saúde. Saiba como é feita a definição de casos suspeitos de Covid-19 no Brasil. [cited 2021 Oct 28]. Available from: https://www.gov.br/saude/pt-br/coronavirus/artigos/definicao-e-casos-suspeitos .
https://www.gov.br/saude/pt-br/coronavir...
): fl u symptoms with dyspnea; persistent chest tightness; oxygen saturation less than 95% on room air; or cyanosis of the lips or face. Patients for whom CT images were incomplete or unavailable were excluded from the study. The indications for MV included excessive respiratory effort, with evidence of muscle fatigue. The model predicted the risk for requiring MV within the fi rst 72 h after admission.
Patients and dataset
The initial dataset consisted of 947 CT scans of 833 consecutive inpatients. All of the cases were anonymized before inclusion in the study. Ten patients were excluded because a soft-tissue kernel was not identifi ed in the CT dataset. The fi nal sample comprised 937 CT scans, with a training set of 823 patients and a test set of 114 patients.
The training set consisted of chest CT examinations of 823 patients with COVID-19, of whom 93 required MV during hospitalization. We included only the fi rst CT scan for each patient. The total number of slices in the training set was 189,290. We used k-fold cross-validation (k = 5) to compute the validation metrics (Figure 1). In this validation procedure, we trained the model fi ve times, each time with different patients composing the training and validation sets, 80% of the data being used for training and 20% being used for validation. Each patient appeared in the validation fold once and in the training fold four times. The test set contained CT scans from 114 patients, of whom 67 required MV, with a total number of slices of 28,500. The model used in order to compute the metrics on the test set was trained over all the samples of the training set, rather than over samples from a particular fold.
CT techniques
All chest CT examinations were performed in 64-slice scanners-LightSpeed VCT (GE Healthcare, Milwaukee, WI, USA) or Somatom Sensation 64 (Siemens AG, Forchheim, Germany)-and were acquired and reconstructed with soft-kernel reconstruction as axial images, with the following parameters: slice thickness, 1.25 mm; interslice gap, 1.25 mm; voltage, 120 kVp; and current, 200 mAs.
AI model design
We developed an AI model based on CNNs, one of the most successful deep learning architectures to date(1212 LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436- 44.). In the past few years, CNNs achieved state-of-the-art results in several medical image analysis tasks(1313 Litjens G, Kooi T, Bejnordi BE, et al. A survey on deep learning in medical image analysis. Med Image Anal. 2017;42:60-88.). By using a mathematical operation called convolution, which leads to local connections between neurons of adjacent layers and shared weights, CNNs exploit spatially-local correlations on the input data(1414 Lecun Y, Bottou L, Bengio Y, et al. Gradient-based learning applied to document recognition. Proc IEEE. 1998;86:2278-324.), making them an excellent option for automated image analysis. Each convolutional layer has matrices of weights, also called filters or kernels, that are convolved with the inputs. Each resulting matrix is called a feature map, which summarizes the features of the input image in a lower-dimensional space (Figure 2). The filters within each convolutional layer are optimized during the training process to learn the best features to represent the desired output.
A: Axial unenhanced CT scan showing ground-glass opacities and consolidation in both lungs, findings typical of COVID-19. B: Heatmap of the same image, in which areas of red indicate activation of the algorithm related to prediction of the need for MV.
DenseNet-121(1515 Huang G, Liu Z, van der Maaten L, et al. Densely connected convolutional networks. Proceedings of the IEEE conference on computer vision and pattern recognition; 2017.) is a CNN architecture with connectivity patterns that allow it to eliminate redundancies and thus has fewer parameters than do similar networks. CheXNet(1616 Rajpurkar P, Irvin J, Zhu K, et al. CheXNet: radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv: 1711.05225v3 [cs.CV].), a CNN based on DenseNet-121, has been shown to achieve radiologist-level performance for detecting pneumonia on chest radiographs. In our proposed approach, each CT slice is an individual input during training and testing, which increases the number of input samples. To perform transfer learning from other computer vision tasks, we used a model pre-trained on the ImageNet dataset and then trained it on our dataset of CT slices for eight epochs. The input image size is 224 × 224 pixels. For image augmentation purposes, the slices may go through a random horizontal flip with a probability of 0.5. The network outputs a score between 0 and 1, representing the risk that the patient will require MV.
Statistical analysis
Receiver operating characteristic curves, with their areas under the curve and 95% confidence intervals, were used in order to quantify the performance of the AI prediction models. Because the model evaluates the slices individually, we first computed the metrics for individual slices. Optimal thresholds (0.001) were obtained to describe the sensitivity, specificity, positive predictive value, negative predictive value, positive likelihood ratio, and negative likelihood ratio.
All statistical tests used were two-tailed, and a significance level of 5% was established. The analyses were performed with the Predictive Analytics Software package, version 18.0 (SPSS Inc., Chicago, IL, USA).
RESULTS
As can be seen in Table 1, the AI model had an overall sensitivity of 0.417 and an overall specificity of 0.860. The receiver operating characteristic curve is shown in Figure 3. The corresponding area under the curve for the test set was 0.68. We prioritized specificity metrics, meaning that the model will rarely classify as “positive” patients who do not need MV.
DISCUSSION
We have created a machine learning model with high specificity, capable of reliably predicting which patients will and will not require invasive ventilation. That makes this approach ideal for identifying high-risk patients and predicting the minimum number of ventilators and critical care beds that will be needed, which is extremely important during a pandemic, when intensive care units can be overwhelmed. Our study was conducted using only tomographic data. We see this as a strong point, because there is some difficulty in obtaining clinical data and complete medical records in a real-world setting, especially in low- and middle-income countries.
Previous studies have shown that the use of AI combining tomographic and clinical data has good accuracy for predicting critical evolution. Wang et al.(1717 Wang R, Jiao Z, Yang L, et al. Artificial intelligence for prediction of COVID-19 progression using CT imaging and clinical data. Eur Radiol. 2022;32:205-12.) employed an AI system to evaluate a sample of 1,051 patients with COVID-19, of whom 282 eventually required intensive care, required MV, or evolved to death. In that study, the AI concordance index for predicting critical illness was 0.8. The authors found that the AI system successfully stratified the patients into high-risk and low-risk groups with significantly different risks of progression. Another study, conducted at a single hospital in Mexico, with the objective of developing a multivariable prognostic model, evaluated clinical and chest CT data from 166 patients with COVID-19(1818 Kimura-Sandoval Y, Arévalo-Molina ME, Cristancho-Rojas CN, et al. Validation of chest computed tomography artificial intelligence to determine the requirement for mechanical ventilation and risk of mortality in hospitalized coronavirus disease-19 patients in a tertiary care center in Mexico City. Rev Invest Clín. 2021;73:111-9.). The authors found that a CT severity score had an area under the curve of 0.88 for predicting the need for MV, with a sensitivity of 65% and a specificity of 92%.
During the emerging COVID-19 pandemic, radiology departments faced a substantial increase in the number of requests for chest CT scans, together with the new demand for quantification of pulmonary opacities(1919 Anastasopoulos C, Weikert T, Yang S, et al. Development and clinical implementation of tailored image analysis tools for COVID-19 in the midst of the pandemic: the synergetic effect of an open, clinically embedded software development platform and machine learning. Eur J Radiol. 2020;131:109233.). With overwhelming demands on medical resources, risk-based stratification of patients is essential. Given the large number of examinations in high case-load scenarios, an automated tool could facilitate and save critical time in the diagnosis and risk stratification of the disease. The AI model created for the present study could also facilitate hospital management and resource allocation.
Our study has some limitations. First, it used a retrospective design, with a likely selection bias. Second, a disadvantage of all deep learning methods is the lack of transparency and interpretability-e.g., it is currently quite difficult to determine what imaging features are being used in order to determine the output(2020 Li L, Qin L, Xu Z, et al. Using artificial intelligence to detect COVID-19 and community-acquired pneumonia based on pulmonary CT: evaluation of the diagnostic accuracy. Radiology. 2020;96:E65-E71.).
In conclusion, our findings demonstrate that a deep learning model can reliably predict which patients will require invasive ventilation, with accuracy similar to that reported in the literature for other methods and without the need for clinical data assessment. Albeit promising, our AI model should be validated in multiple cohorts to evaluate its performance across populations and settings.
REFERENCES
-
1World Health Organization. Naming the coronavirus disease (CO-VID-19) and the virus that causes it. [cited 2021 Oct 28]. Available from: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease(covid-2019)-and-the-virus-that-causes-it
» https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease(covid-2019)-and-the-virus-that-causes-it -
2Javor D, Kaplan H, Kaplan A, et al. Deep learning analysis provides accurate COVID-19 diagnosis on chest computed tomography. Eur J Radiol. 2020;133:109402.
-
3Ai T, Yang Z, Hou H, et al. Correlation of chest CT and RT-PCR testing for coronavirus disease 2019 (COVID-19) in China: a report of 1014 cases. Radiology. 2020;296:E32-E40.
-
4Fang Y, Zhang H, Xie J, et al. Sensitivity of chest CT for COVID-19: comparison to RT-PCR. Radiology. 2020;296:E115-E117.
-
5Rubin GD, Ryerson CJ, Haramati LB, et al. The role of chest im-aging in patient management during the COVID-19 pandemic: a multinational consensus statement from the Fleischner Society. Radiology. 2020;296:172-80.
-
6Revel MP, Parkar AP, Prosch H, et al. COVID-19 patients and the radiology department - advice from the European Society of Radiology (ESR) and the European Society of Thoracic Imaging (ESTI). Eur Radiol. 2020;30:4903-9.
-
7Hwang S, Park S. Accurate lung segmentation via network-wise training of convolutional networks. In: Cardoso MJ, Arbel T, editors. Deep learning in medical image analysis and multimodal learning for clinical decision support. Cham, Switzerland: Springer; 2017. p. 92-9.
-
8Lakhani P, Sundaram B. Deep learning at chest radiography: auto-mated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology. 2017;284:574-82.
-
9Zhu B, Luo W, Li B, et al. The development and evaluation of a computerized diagnosis scheme for pneumoconiosis on digital chest radiographs. Biomed Eng Online. 2014;13:141.
-
10Prevedello LM, Erdal BS, Ryu JL, et al. Automated critical test findings identification and online notification system using artificial intelligence in imaging. Radiology. 2017;285:923-31.
-
11Brasil. Ministério da Saúde. Saiba como é feita a definição de casos suspeitos de Covid-19 no Brasil. [cited 2021 Oct 28]. Available from: https://www.gov.br/saude/pt-br/coronavirus/artigos/definicao-e-casos-suspeitos .
» https://www.gov.br/saude/pt-br/coronavirus/artigos/definicao-e-casos-suspeitos -
12LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436- 44.
-
13Litjens G, Kooi T, Bejnordi BE, et al. A survey on deep learning in medical image analysis. Med Image Anal. 2017;42:60-88.
-
14Lecun Y, Bottou L, Bengio Y, et al. Gradient-based learning applied to document recognition. Proc IEEE. 1998;86:2278-324.
-
15Huang G, Liu Z, van der Maaten L, et al. Densely connected convolutional networks. Proceedings of the IEEE conference on computer vision and pattern recognition; 2017.
-
16Rajpurkar P, Irvin J, Zhu K, et al. CheXNet: radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv: 1711.05225v3 [cs.CV].
-
17Wang R, Jiao Z, Yang L, et al. Artificial intelligence for prediction of COVID-19 progression using CT imaging and clinical data. Eur Radiol. 2022;32:205-12.
-
18Kimura-Sandoval Y, Arévalo-Molina ME, Cristancho-Rojas CN, et al. Validation of chest computed tomography artificial intelligence to determine the requirement for mechanical ventilation and risk of mortality in hospitalized coronavirus disease-19 patients in a tertiary care center in Mexico City. Rev Invest Clín. 2021;73:111-9.
-
19Anastasopoulos C, Weikert T, Yang S, et al. Development and clinical implementation of tailored image analysis tools for COVID-19 in the midst of the pandemic: the synergetic effect of an open, clinically embedded software development platform and machine learning. Eur J Radiol. 2020;131:109233.
-
20Li L, Qin L, Xu Z, et al. Using artificial intelligence to detect COVID-19 and community-acquired pneumonia based on pulmonary CT: evaluation of the diagnostic accuracy. Radiology. 2020;96:E65-E71.
Publication Dates
-
Publication in this collection
10 Mar 2023 -
Date of issue
Mar-Apr 2023
History
-
Received
19 Apr 2022 -
Accepted
22 July 2022