Acessibilidade / Reportar erro

Comparison of the accuracy of residents, senior physicians and surrogate decision-makers for predicting hospital mortality of critically ill patients

ABSTRACT

Objective:

To compare the predictive performance of residents, senior intensive care unit physicians and surrogates early during intensive care unit stays and to evaluate whether different presentations of prognostic data (probability of survival versus probability of death) influenced their performance.

Methods:

We questioned surrogates and physicians in charge of critically ill patients during the first 48 hours of intensive care unit admission on the patient’s probability of hospital outcome. The question framing (i.e., probability of survival versus probability of death during hospitalization) was randomized. To evaluate the predictive performance, we compared the areas under the ROC curves (AUCs) for hospital outcome between surrogates and physicians’ categories. We also stratified the results according to randomized question framing.

Results:

We interviewed surrogates and physicians on the hospital outcomes of 118 patients. The predictive performance of surrogate decisionmakers was significantly lower than that of physicians (AUC of 0.63 for surrogates, 0.82 for residents, 0.80 for intensive care unit fellows and 0.81 for intensive care unit senior physicians). There was no increase in predictive performance related to physicians’ experience (i.e., senior physicians did not predict outcomes better than junior physicians). Surrogate decisionmakers worsened their prediction performance when they were asked about probability of death instead of probability of survival, but there was no difference for physicians.

Conclusion:

Different predictive performance was observed when comparing surrogate decision-makers and physicians, with no effect of experience on health care professionals’ prediction. Question framing affected the predictive performance of surrogates but not of physicians.

Keywords:
Prognosis; Internship and residency; Critical illness; Decision-making; Delivery of health care; Hospitalization; Intensive care units; Mortality

RESUMO

Objetivo:

Comparar o desempenho preditivo de residentes, médicos seniores de unidades de terapia intensiva e decisores substitutos dos pacientes logo no início da internação na unidade de terapia intensiva e avaliar se diferentes apresentações de prognóstico (probabilidade de sobrevida versus probabilidade de óbito) influenciaram seus desempenhos.

Métodos:

Os decisores substitutos e os médicos responsáveis pelos pacientes críticos foram questionados durante as primeiras 48 horas de internação na unidade de terapia intensiva sobre a probabilidade do desfecho hospitalar do paciente. O enquadramento da pergunta (isto é, a probabilidade de sobrevida versus a probabilidade de óbito durante a internação) foi randomizado. Para avaliar o desempenho preditivo, comparou-se a área sob a curva ROC para desfecho hospitalar entre as categorias decisores substitutos e médicos. Também estratificaram-se os resultados de acordo com o enquadramento da pergunta randomizado.

Resultados:

Entrevistaram-se decisores substitutos e médicos sobre os desfechos hospitalares de 118 pacientes. O desempenho preditivo dos decisores substitutos foi significativamente inferior ao dos médicos (área sob a curva de 0,63 para decisores substitutos, 0,82 para residentes, 0,80 para residentes de medicina intensiva e 0,81 para médicos seniores de unidade de terapia intensiva). Não houve aumento no desempenho preditivo quanto à experiência dos médicos (ou seja, médicos seniores não previram desfechos melhor que médicos juniores). Os decisores substitutos pioraram seu desempenho de previsão quando perguntados sobre a probabilidade de óbito ao invés da probabilidade de sobrevida, mas não houve diferença entre os médicos.

Conclusão:

Observou-se desempenho preditivo diferente ao comparar decisores substitutos e médicos, sem qualquer efeito da experiência no prognóstico dos profissionais de saúde. O enquadramento da pergunta afetou o desempenho preditivo dos substitutos, mas não o dos médicos.

Descritores:
Prognóstico; Internato e residência; Estado crítico; Tomada de decisões; Atenção à; saúde; Hospitalização; Unidades de terapia intensiva; Mortalidade hospitalar

INTRODUCTION

Prognostic assessment in critically ill patients is performed at intensive care unit (ICU) admission and daily thereafter for decisions concerning invasive treatments and for discussions with surrogate decision-makers (usually a family member). Although uncertainty and imprecisions are frequent in prognostication, most surrogates of patients understand this as unavoidable and still want a discussion of uncertain prognoses.(11 Evans LR, Boyd EA, Malvar G, Apatira L, Luce JM, Lo B, et al. Surrogate decision-makers’ perspectives on discussing prognosis in the face of uncertainty. Am J Respir Crit Care Med. 2009;179(1):48-53.)

However, this same uncertainty might hinder some physicians, especially those early in their career or in training, from engaging in further discussions. Therefore, a better understanding of physicians’ accuracy of prognostication could be informative.

Some previous studies suggested that ICU physicians were more accurate than severity-of-illness scores in predicting mortality.(22 Sinuff T, Adhikari NK, Cook DJ, Schunemann HJ, Griffith LE, Rocker G, et al. Mortality predictions in the intensive care unit: comparing physicians with scoring systems. Crit Care Med. 2006;34(3):878-85.) Surrogates might also be accurate but less accurate than physicians. This discordance about the patient’s prognosis might lead to and fuel conflicts.(33 White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.) However, heterogeneity among these studies about the time of assessment (from less than 24 hours until 128 hours of ICU admission) and the outcome (from ICU mortality to 180-day mortality) makes this comparison problematic.(22 Sinuff T, Adhikari NK, Cook DJ, Schunemann HJ, Griffith LE, Rocker G, et al. Mortality predictions in the intensive care unit: comparing physicians with scoring systems. Crit Care Med. 2006;34(3):878-85.,33 White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.) The effect of previous experience and training on the discriminative performance of physicians has not been adequately evaluated. Finally, the format of prognostic data presentation might influence the surrogate’s perceived risks,(44 Chapman AR, Litton E, Chamberlain J, Ho KM. The effect of prognostic data presentation format on perceived risk among surrogate decision makers of critically ill patients: a randomized comparative trial. J Crit Care. 2015;30(2):231-5.) but whether it influences the physician’s perceived risk is unknown.

Therefore, we conducted the present study to compare the prognostic ability of residents, senior ICU physicians, surrogates and severity of illness score early during ICU stay and to evaluate if different presentations of prognostic data (probability of survival versus probability of death) influences the prognostic ability of physicians’ categories and surrogates.

METHODS

Study design and patient population

This was a prospective cohort study of patients admitted to our medical-surgical ICU (a 14-bed unit at Hospital das Clínicas, a public tertiary universityaffiliated hospital located in São Paulo, Brazil) carried out from August 2017 to December 2019. The staffing model during the daytime included two senior ICU physicians, two ICU fellows and six internal medicine residents (who stayed for one month in the ICU during their internal medicine rotation). The inclusion criteria were admitted patients whose surrogate decision-makers could be interviewed ≤ 48 hours of ICU admission and one person ≥ 18 years old who identified as a surrogate decision-maker (not necessarily a first-degree family member). During the study period, each patient could be visited by two persons daily from 4 p.m. to 5 p.m., when physicians also approached them to discuss the patients’ clinical status. The exclusion criteria were patients younger than 18 years, pregnant patients and readmission in the same hospitalization.

Data collection and study procedures

We enrolled patients whose surrogates had visited them at least once in the first 48 hours after ICU admission and had discussed the patient’s clinical status with the patients’ physician (usually the internal medicine resident and/or the ICU fellow). We interviewed participants only on weekdays (physicians could be interviewed from 4 p.m. to 5 p.m. and surrogates from 4 p.m. to 5 p.m. too, during the visit time). Research investigators approached the patients’ surrogates about study participation during the visitation period. Similar to a previous study, if more than one possible surrogate decision-maker was identified during the visitation, we interviewed the one who self-reported as having a significant amount of responsibility for decision-making.(33 White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.) All participating surrogates and physicians provided written informed consent; for patients’ data, the ethical committee waived informed consent since all study information was collected from the administrative database already collected. The local Ethical Committee approved all study procedures (Ethics Committee approval 2.222.797, CAAE 69174717.4.0000.0068), and the study was performed according to national legislation (Resolução n° 466, December 12, 2012, Brazilian Ministry of Health).

Prior to the interview, we randomized the question framing (i.e., probability that the patient would survive the hospitalization or probability that the patient would die during the hospitalization) in a random sequence of two possibilities (i.e., simple randomization). The interview procedure consisted of asking the internal medicine resident, the ICU fellow and ICU senior physician in charge of the patient, as well as their surrogate decision-maker, to estimate the probability that the patient would survive/die the hospitalization. They were asked the validated question: “What do you think are the chances that the patient will survive/die this hospitalization?”(33 White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.) They answered using a 0% to 100% scale in strata of 10% each (i.e., 0%, 10%, 20% and so on until 100%).(33 White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.) All answers were blinded to the others’ responses. The interval for the research staff to obtain all answers did not exceed 1 hour.

We retrieved patients’ baseline characteristics and outcomes from our administrative database that was prospectively collected in a cloud-based software database (Sistema Epimed™) by trained staff.(55 Zampieri FG, Soares M, Borges LP, Salluh JI, Ranzani OT. The Epimed Monitor ICU Database: a cloud-based national registry for adult intensive care unit patients in Brazil. Rev Bras Ter Intensiva. 2017;29(4):418-26.) The data recorded included age, sex, date of ICU admission, Simplified Acute Physiology Score III (SAPS III),(66 Metnitz PG, Moreno RP, Almeida E, Jordan B, Bauer P, Campos RA, Iapichino G, Edbrooke D, Capuzzo M, Le Gall JR; SAPS 3 Investigators. SAPS 3--From evaluation of the patient to evaluation of the intensive care unit. Part 1: Objectives, methods and cohort description. Intensive Care Med. 2005;31(10):1336-44.,77 Moreno RP, Metnitz PG, Almeida E, Jordan B, Bauer P, Campos RA, Iapichino G, Edbrooke D, Capuzzo M, Le Gall JR; SAPS 3 Investigators. SAPS 3--From evaluation of the patient to evaluation of the intensive care unit. Part 2: Development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med. 2005;31(10):1345-55.) referring facility, admission diagnosis, surgical procedures before admission, Charlson index for comorbidities,(88 Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373-83.) Eastern Cooperative Oncology Group (ECOG), resource utilization during ICU stay (mechanical ventilation, vasoactive drugs or renal replacement therapy), oncological status (locoregional or metastatic) and hospital mortality. At the time of the interview, we also collected demographic data (age and sex) from the physicians and surrogate decision-makers. For the residents and ICU fellows, we also collected information about previous medical specialty, current residency and years since graduation. All participants’ senior ICU physicians were intensivists with complete training for more than five years and were certified by the National Board of Critical Care Medicine (Associação de Medicina Intensiva Brasileira).

Sample size and statistical analysis

A previous systematic review suggested a discriminative performance estimated by a summary receiver operating characteristic (ROC) curve for physicians of 0.85 and 0.63 for severity-of-illness scores.(22 Sinuff T, Adhikari NK, Cook DJ, Schunemann HJ, Griffith LE, Rocker G, et al. Mortality predictions in the intensive care unit: comparing physicians with scoring systems. Crit Care Med. 2006;34(3):878-85.) We estimated that a sample size of 108 interviews would be required to detect a significant (i.e., significantly different from 0.5) discriminative performance of 0.63 (the lower value of the ROC curve in the previous systematic review), assuming a statistical power of 80%, an α = 0.05 and hospital mortality of 35% (hospital mortality of our ICU in the previous year).(99 Goksuluk D, Korkmaz S, Zararsiz G, Karaagaoglu E. easyROC: an interactive web-tool for ROC curve analysis using R language environment. R J. 2016;8(2):213-30.) This sample size is also adequate to detect a significant difference between these predicted ROC values (0.85 versus 0.63) with the same hospital mortality, assuming a statistical power of 80% and α = 0.05 (in this case, 94 interviews would be required).(99 Goksuluk D, Korkmaz S, Zararsiz G, Karaagaoglu E. easyROC: an interactive web-tool for ROC curve analysis using R language environment. R J. 2016;8(2):213-30.) To account for possible patients’ or surrogates’ consent withdrawal, we decided to enroll 10% more, for the final sample size estimation of 118 patients.

Continuous variables are presented as the median and interquartile range (IQR) and were evaluated using the Wilcoxon or Mann-Whitney tests. Categorical variables are presented as numbers and percentages and were evaluated using the chi-squared test.

To assess the accuracy of each group prediction, we obtained areas under the ROC curves (AUCs) for hospital outcome and compared them with the DeLong method without correction for multiple comparisons.(1010 DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837-45.) Calibration was assessed by the calibration belt method as described by the Gruppo Italiano per la Valutazione degli Interventi in Terapia Intensiva (GiViTI). This method applies a generalized polynomial logistic function between the outcome and the logit transformation of the estimated predicted probability, with the respective 95 and 80% confidence intervals (CI) boundaries.

A statistically significant deviation from the bisector (the line of perfect calibration) occurs when the 95%CI boundaries of the calibration belt do not include the bisector.(1111 Finazzi S, Poole D, Luciani D, Cogo PE, Bertolini G. Calibration belt for quality-of-care assessment based on dichotomous outcomes. PLoS One. 2011;6(2):e16110.) To calculate the calibration belt, 0% probability was converted to 0.01% and 100% probability was converted to 99.99%, since the method does not handle 0% and 100% probabilities from the questionnaire. A post hoc analysis was performed stratified by the use of invasive mechanical ventilation at the time of interview and comparing groups with SAPS III prediction using the standard equation for the probability of death.(77 Moreno RP, Metnitz PG, Almeida E, Jordan B, Bauer P, Campos RA, Iapichino G, Edbrooke D, Capuzzo M, Le Gall JR; SAPS 3 Investigators. SAPS 3--From evaluation of the patient to evaluation of the intensive care unit. Part 2: Development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med. 2005;31(10):1345-55.) Statistical analyses were performed by using the software Statistical Package for the Social Sciences - SPSS (IBM Corp., Armonk, New York, USA), MedCalc for Windows, version 19.6 (MedCalc Software, Ostend, Belgium) and R (http://www.r-project.org).

RESULTS

We enrolled 118 patients from August 2017 to December 2019 (Table 1). No consent withdrawals occurred. At enrollment, patients had a median age of 54 (IQR, 38.75 - 65.25), a median SAPS III score of 53 (IQR, 42.75 - 63), and 45 patients (38.1%) ultimately died in the hospital. At the time of the interview, a large proportion required invasive organ support (46.6% on mechanical ventilation and 50.8% on vasopressors), but no end-of-life discussion had taken place (Table 1). Two-thirds of the surrogates were women (74 of 118 surrogates) with a median age of 44.5 (IQR, 34 - 54). Most of them were spouses (31.4%) or siblings (35.6%, Table 2).

Table 1
Patient characteristics
Table 2
Surrogate characteristics

We interviewed 18 residents, 8 ICU fellows and 5 senior ICU physicians. Residents who were interviewed had a median age of 26 (IQR, 25 - 27), and the majority were male (72.2%) with less than two years of graduation (84.7%). Intensive care unit fellow residents had a median age of 28 (IQR, 28 - 31), the majority were male (75%), 77.6% had less than four years of graduation and all had a previous residency in Internal Medicine. Intensive care unit senior physicians had a median age of 37 (IQR, 33.5 - 41.5), were mostly male (80.0%), and had a median time since graduation of 12 years (IQR, 10 - 19).

Hospital outcome prediction

We did not have any missing data about the predicted outcomes. The predicted probability of hospital outcomes differed between those who were interviewed (Figure 1 and Table 1S - Supplementary material).

Figure 1
Estimated probabilities of hospital outcomes for patients by surrogates (A), residents (B), intensive care unit fellows (C) and intensive care unit senior physicians (D).

Two correlated modes for the surrogates’ answers could be observed: a high percentage of “100% survival” and a high percentage of “0% dying”. For physicians, an even distribution in types of answers was observed.

Overall, all groups demonstrated a significant discriminative performance (i.e., different from 0.50), including SAPS III (Figure 2). Pairwise comparison showed a significantly higher discriminative performance between each physicians’ categories and surrogates’ (p < 0.001), but not between physicians’ categories themselves. In our post hoc analysis, the presence of invasive mechanical ventilation deteriorated the discriminative performance of physicians (Figure 1SA and 1SB - Supplementary material).

Figure 2
Discriminative performance for hospital outcomes by surrogates, residents, intensive care unit fellows, intensive care unit senior physicians and Simplified Acute Physiology Score III.

In all analyses, SAPS III performance was significantly different from 0.50 but was different from the surrogate prediction only in those without mechanical ventilation (Figure 1SB - Supplementary material). There was no difference between the SAPS III prediction and any of the physician groups.

Calibration analysis demonstrated that physicians’ estimates were calibrated (Figure 2S - Supplementary material). It also showed that surrogates’ estimates were miscalibrated over a wide range of predicted probabilities. SAPS III underestimated the mortality in the lower-mid range of predicted probabilities.

Influence of question framing

All physicians’ categories’ predictive performance was different from chance regardless of the question framing. However, when surrogates were asked about the probability of dying, they were not accurate. ICU senior physicians demonstrated a higher AUC for both question framing compared to the surrogates (Table 3).

Table 3
Area under the receiver operating characteristics of interviewed groups to predict survival or death during hospitalization

DISCUSSION

The main findings of our study are that (1) the predictive performance of both surrogate decision-makers and physicians for hospital mortality in our cohort of critically ill patients was significantly better than chance, but all physician groups (internal medicine residents, ICU fellows and ICU senior physicians) had better accuracy compared to surrogates, (2) physicians’ experience was not related to improvements in predictive performance, and (3) different question framing affected the surrogates’ predictive performance, notably if asked about the occurrence of death.

Sinuff et al. had previously performed a systematic review of studies that showed physicians’ predictive performance was better than that of severity-of-illness scores to predict outcomes after ICU admission for individual patients.(22 Sinuff T, Adhikari NK, Cook DJ, Schunemann HJ, Griffith LE, Rocker G, et al. Mortality predictions in the intensive care unit: comparing physicians with scoring systems. Crit Care Med. 2006;34(3):878-85.) Physicians’ observed predictive performance in our study was in the excellent range (0.80 - 0.90), as suggested by Hosmer et al.(1212 Hosmer DW Jr, Lemeshow S, Sturdivant RX. Applied Logistic Regression. 3rd ed. New Jersey: John Wiley & Sons; 2013.) Although most studies are from the 1990s - 2000s, some recent observations also demonstrated physicians’ predictive performance similar to those in our study.(33 White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.) Although different case mixes between studies could be observed [e.g., White et al. only studied patients on the fifth day of invasive mechanical ventilation,(33 White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.) and the systematic review cited before enrolled studies with hospital mortality ranging from 18% to 46%(22 Sinuff T, Adhikari NK, Cook DJ, Schunemann HJ, Griffith LE, Rocker G, et al. Mortality predictions in the intensive care unit: comparing physicians with scoring systems. Crit Care Med. 2006;34(3):878-85.)], it seems that physicians’ estimates of prognosis are accurate.

Although we could not demonstrate a statistically significant difference between SAPS III performance and the physicians’ performance as previously described,(22 Sinuff T, Adhikari NK, Cook DJ, Schunemann HJ, Griffith LE, Rocker G, et al. Mortality predictions in the intensive care unit: comparing physicians with scoring systems. Crit Care Med. 2006;34(3):878-85.) this might be due to a lack of power in our post hoc analysis. Nevertheless, in all conditions, the point estimate of physicians’ performance was higher than that observed for SAPS III.

We did not observe significant differences between senior physicians and junior residents. Previous publications did not demonstrate that experienced ICU physicians demonstrate better predictive performance than inexperienced physicians.(22 Sinuff T, Adhikari NK, Cook DJ, Schunemann HJ, Griffith LE, Rocker G, et al. Mortality predictions in the intensive care unit: comparing physicians with scoring systems. Crit Care Med. 2006;34(3):878-85.) Gusmão Vicente et al. did not observe a statistically significant difference between senior and junior ICU physicians in predicting longer ICU stays and ICU outcomes.(1313 Gusmão Vicente F, Polito Lomar F, Melot C, Vincent JL. Can the experienced ICU physician predict ICU length of stay and outcome better than less experienced colleagues? Intensive Care Med. 2004;30(4):655-9.) Even when comparing only trainees, Kruse et al. observed the same ROC curve for fellows, residents and interns.(1414 Kruse JA, Thill-Baharozian MC, Carlson RW. Comparison of clinical assessment with APACHE II for predicting mortality risk in patients admitted to a medical intensive care unit. JAMA. 1988;260(12):1739-42.) Our results are similar since experienced (senior physicians) and inexperienced (fellows and residents) physicians had similar predictive performances and they were all well calibrated.

We also observed that the overall predictive performance of surrogates was lower than those observed from physicians in our study. Notably, the calibration was poor in a wide range of predicted estimates, as observed in the calibration belt. This is noteworthy and highly anticipated, since it is not expected that a nonmedically educated person (who also has feelings involved for the person they are representing) can be compared to health care professionals. However, this fact is clinically relevant, as White et al. demonstrated that this divergent predictive performance is associated with discordances between prognostic points of view.(33 White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.) Family members are usually overoptimistic due to both misunderstanding of the prognostic information conveyed by physicians and optimistic biases,(33 White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.,1515 Zier LS, Sottile PD, Hong SY, Weissfield LA, White DB. Surrogate decision makers’ interpretation of prognostic information: a mixed-methods study. Ann Intern Med. 2012;156(5):360-6.) and these factors might lead to conflict and the overuse of inappropriate invasive support. Thus, it is highly suggested to start any serious discussion by asking the patient and family members’ point of view and their actual perceptions.(1616 Baile WF, Buckman R, Lenzi R, Glober G, Beale EA, Kudelka AP. SPIKES-A six-step protocol for delivering bad news: application to the patient with cancer. Oncologist. 2000;5(4):302-11.) Our data emphasize this possible source of discordance, since we also observed that surrogates usually conveyed optimistic answers, with a high number of “100% chance of surviving” and “0% chance of dying” answers.

An interesting finding of our study was the effect of question framing on the predictive performance among surrogates but not on physicians. It was formerly demonstrated that a prognostic data presentation format might influence how surrogate decision-makers perceive risk, especially when the data are qualitatively explained,(44 Chapman AR, Litton E, Chamberlain J, Ho KM. The effect of prognostic data presentation format on perceived risk among surrogate decision makers of critically ill patients: a randomized comparative trial. J Crit Care. 2015;30(2):231-5.) but this effect was not previously studied for physicians.

Most publications presented the question in a survival frame (i.e., the probability that the patient will survive the hospitalization).(1414 Kruse JA, Thill-Baharozian MC, Carlson RW. Comparison of clinical assessment with APACHE II for predicting mortality risk in patients admitted to a medical intensive care unit. JAMA. 1988;260(12):1739-42.,1717 Brannen AL 2nd, Godfrey LJ, Goetter WE. Prediction of outcome from critical illness. A comparison of clinical judgment with a prediction rule. Arch Intern Med. 1989;149(5):1083-6.,1818 Garrouste-Orgeas M, Montuclard L, Timsit JF, Misset B, Christias M, Carlet J. Triaging patients to the ICU: a pilot study of factors influencing admission decisions and patient outcomes. Intensive Care Med. 2003;29(5):774-81.) However, the effect of question framing was not previously studied. It is well known that risk perception is highly variable among subjects due to multiple factors.(1919 Slovic P. The psychology of risk. Saude Soc. 2010;19(4):731-47.) In the psychometric paradigm, the individual quantitatively judges the current and desired riskiness.(2020 Lipkus IM. Numeric, verbal, and visual formats of conveying health risks: suggested best practices and future recommendations. Med Decis Making. 2007;27(5):696-713.) However, in situations where no a priori experience exists (such as ICU admission), personal beliefs and feelings are likely to be influenced by the framing of presentations.(2020 Lipkus IM. Numeric, verbal, and visual formats of conveying health risks: suggested best practices and future recommendations. Med Decis Making. 2007;27(5):696-713.) We believe that this should be taken into account when presenting prognostic information to surrogates and should also be acknowledged when designing future qualitative studies.

Another unexpected finding was the deterioration in the physicians’ category accuracy when faced with patients under mechanical ventilation. The reasons for the deterioration are unknown. Although a type I error cannot be excluded in this post hoc analysis, previous studies demonstrated inaccuracy in prognostication for some aspects of mechanically ventilated patients. Figueroa-Casas et al. demonstrated that physicians’ accuracy in predicting the duration of mechanical ventilation was limited.(2121 Figueroa-Casas JB, Connery SM, Montoya R, Dwivedi AK, Lee S. Accuracy of early prediction of duration of mechanical ventilation by intensivists. Ann Am Thorac Soc. 2014;11(2): 182-5.) Young et al. also suggested that the ability of physicians to predict extended ventilation and the requirement for a tracheostomy was poor.(2222 Young D, Harrison DA, Cuthbertson BH, Rowan K; TracMan Collaborators. Effect of early vs late tracheostomy placement on survival in patients receiving mechanical ventilation: the TracMan randomized trial. JAMA. 2013;309(20):2121-9.) Therefore, additional care might be taken when predicting outcomes for patients on mechanical ventilation, since some judgment bias could be present.

Our study has some limitations. First, it is a singlecenter study performed in a university-affiliated public hospital. Therefore, our results are influenced by our local case mix of patients, physicians and surrogates. This should be taken into account when comparing our data with others. Second, we did not interview health care professionals other than physicians to predict survival. A previous publication suggested that the perception of nurses is also relevant.(2323 Detsky ME, Harhay MO, Bayard DF, Delman AM, Buehler AE, Kent SA, et al. Discriminative accuracy of physician and nurse predictions for survival and functional outcomes 6 months after an ICU admission. JAMA. 2017;317(21):2187-95.) Third, the lack of a predictive performance difference between senior physicians and residents might be due to a lack of power to detect smaller differences than the one observed. However, previous publications documented similar findings.(1313 Gusmão Vicente F, Polito Lomar F, Melot C, Vincent JL. Can the experienced ICU physician predict ICU length of stay and outcome better than less experienced colleagues? Intensive Care Med. 2004;30(4):655-9.,1414 Kruse JA, Thill-Baharozian MC, Carlson RW. Comparison of clinical assessment with APACHE II for predicting mortality risk in patients admitted to a medical intensive care unit. JAMA. 1988;260(12):1739-42.) Finally, we presented the prognostic question as strata of numerical risk. Numeracy is critical to proper understanding of such a presentation format,(2424 Peters E, Hart PS, Fraenkel L. Informing patients: the influence of numeracy, framing, and format of side effect information on risk perceptions. Med Decis Making. 2011;31(3):432-6.,2525 Keller C, Siegrist M. Effect of risk communication formats on risk perception depending on numeracy. Med Decis Making. 2009;29(4):483-90.) and it is particularly relevant in low- and middle-income countries where low educational attainments are unfortunately frequent.

CONCLUSION

In conclusion, we observed higher predictive performance for hospital outcomes among physicians than among surrogate decision-makers, who are usually overly optimistic. Experience did not lead to better predictive performance among the included physicians. On the other hand, question framing (i.e., probability of death versus probability of survival) influenced surrogate performance.

REFERÊNCIAS

  • 1
    Evans LR, Boyd EA, Malvar G, Apatira L, Luce JM, Lo B, et al. Surrogate decision-makers’ perspectives on discussing prognosis in the face of uncertainty. Am J Respir Crit Care Med. 2009;179(1):48-53.
  • 2
    Sinuff T, Adhikari NK, Cook DJ, Schunemann HJ, Griffith LE, Rocker G, et al. Mortality predictions in the intensive care unit: comparing physicians with scoring systems. Crit Care Med. 2006;34(3):878-85.
  • 3
    White DB, Ernecoff N, Buddadhumaruk P, Hong S, Weissfeld L, Curtis JR, et al. Prevalence of and factors related to discordance about prognosis between physicians and surrogate decision makers of critically ill patients. JAMA. 2016;315(19):2086-94.
  • 4
    Chapman AR, Litton E, Chamberlain J, Ho KM. The effect of prognostic data presentation format on perceived risk among surrogate decision makers of critically ill patients: a randomized comparative trial. J Crit Care. 2015;30(2):231-5.
  • 5
    Zampieri FG, Soares M, Borges LP, Salluh JI, Ranzani OT. The Epimed Monitor ICU Database: a cloud-based national registry for adult intensive care unit patients in Brazil. Rev Bras Ter Intensiva. 2017;29(4):418-26.
  • 6
    Metnitz PG, Moreno RP, Almeida E, Jordan B, Bauer P, Campos RA, Iapichino G, Edbrooke D, Capuzzo M, Le Gall JR; SAPS 3 Investigators. SAPS 3--From evaluation of the patient to evaluation of the intensive care unit. Part 1: Objectives, methods and cohort description. Intensive Care Med. 2005;31(10):1336-44.
  • 7
    Moreno RP, Metnitz PG, Almeida E, Jordan B, Bauer P, Campos RA, Iapichino G, Edbrooke D, Capuzzo M, Le Gall JR; SAPS 3 Investigators. SAPS 3--From evaluation of the patient to evaluation of the intensive care unit. Part 2: Development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med. 2005;31(10):1345-55.
  • 8
    Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373-83.
  • 9
    Goksuluk D, Korkmaz S, Zararsiz G, Karaagaoglu E. easyROC: an interactive web-tool for ROC curve analysis using R language environment. R J. 2016;8(2):213-30.
  • 10
    DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837-45.
  • 11
    Finazzi S, Poole D, Luciani D, Cogo PE, Bertolini G. Calibration belt for quality-of-care assessment based on dichotomous outcomes. PLoS One. 2011;6(2):e16110.
  • 12
    Hosmer DW Jr, Lemeshow S, Sturdivant RX. Applied Logistic Regression. 3rd ed. New Jersey: John Wiley & Sons; 2013.
  • 13
    Gusmão Vicente F, Polito Lomar F, Melot C, Vincent JL. Can the experienced ICU physician predict ICU length of stay and outcome better than less experienced colleagues? Intensive Care Med. 2004;30(4):655-9.
  • 14
    Kruse JA, Thill-Baharozian MC, Carlson RW. Comparison of clinical assessment with APACHE II for predicting mortality risk in patients admitted to a medical intensive care unit. JAMA. 1988;260(12):1739-42.
  • 15
    Zier LS, Sottile PD, Hong SY, Weissfield LA, White DB. Surrogate decision makers’ interpretation of prognostic information: a mixed-methods study. Ann Intern Med. 2012;156(5):360-6.
  • 16
    Baile WF, Buckman R, Lenzi R, Glober G, Beale EA, Kudelka AP. SPIKES-A six-step protocol for delivering bad news: application to the patient with cancer. Oncologist. 2000;5(4):302-11.
  • 17
    Brannen AL 2nd, Godfrey LJ, Goetter WE. Prediction of outcome from critical illness. A comparison of clinical judgment with a prediction rule. Arch Intern Med. 1989;149(5):1083-6.
  • 18
    Garrouste-Orgeas M, Montuclard L, Timsit JF, Misset B, Christias M, Carlet J. Triaging patients to the ICU: a pilot study of factors influencing admission decisions and patient outcomes. Intensive Care Med. 2003;29(5):774-81.
  • 19
    Slovic P. The psychology of risk. Saude Soc. 2010;19(4):731-47.
  • 20
    Lipkus IM. Numeric, verbal, and visual formats of conveying health risks: suggested best practices and future recommendations. Med Decis Making. 2007;27(5):696-713.
  • 21
    Figueroa-Casas JB, Connery SM, Montoya R, Dwivedi AK, Lee S. Accuracy of early prediction of duration of mechanical ventilation by intensivists. Ann Am Thorac Soc. 2014;11(2): 182-5.
  • 22
    Young D, Harrison DA, Cuthbertson BH, Rowan K; TracMan Collaborators. Effect of early vs late tracheostomy placement on survival in patients receiving mechanical ventilation: the TracMan randomized trial. JAMA. 2013;309(20):2121-9.
  • 23
    Detsky ME, Harhay MO, Bayard DF, Delman AM, Buehler AE, Kent SA, et al. Discriminative accuracy of physician and nurse predictions for survival and functional outcomes 6 months after an ICU admission. JAMA. 2017;317(21):2187-95.
  • 24
    Peters E, Hart PS, Fraenkel L. Informing patients: the influence of numeracy, framing, and format of side effect information on risk perceptions. Med Decis Making. 2011;31(3):432-6.
  • 25
    Keller C, Siegrist M. Effect of risk communication formats on risk perception depending on numeracy. Med Decis Making. 2009;29(4):483-90.

Edited by

Responsible editor: Jorge Ibrain Figueira Salluh

Publication Dates

  • Publication in this collection
    08 Aug 2022
  • Date of issue
    Apr-Jun 2022

History

  • Received
    26 Aug 2021
  • Accepted
    20 Dec 2021
Associação de Medicina Intensiva Brasileira - AMIB Rua Arminda, 93 - Vila Olímpia, CEP 04545-100 - São Paulo - SP - Brasil, Tel.: (11) 5089-2642 - São Paulo - SP - Brazil
E-mail: rbti.artigos@amib.com.br