Skip to main content

GENERAL COMMENTARY article

Front. Med., 21 November 2023
Sec. Infectious Diseases: Pathogenesis and Therapy

Response: Commentary: Modeling mortality risk in patients with severe COVID-19 from Mexico

\r\nGerald Stanley Zavorsky
&#x;&#x;Gerald Stanley Zavorsky1*Arturo Cortes-Telles&#x;&#x;Arturo Cortes-Telles2Esperanza Figueroa-Hurtado&#x;Esperanza Figueroa-Hurtado2Diana Lizbeth Ortiz-Farias&#x;Diana Lizbeth Ortiz-Farias2
  • 1Department of Physiology and Membrane Biology, University of California, Davis, Davis, CA, United States
  • 2Respiratory and Thoracic Surgery Unit, Hospital Regional de Alta Especialidad de la Península de Yucatan, Yucatan, Mexico

A response to
Commentary: Modeling mortality risk in patients with severe COVID-19 from Mexico

by Sanjari, E., Toosizadeh, S., and Raeisi Shahraki, H. (2023). Front. Med. 10:1247741. doi: 10.3389/fmed.2023.1247741

We appreciate the commentary from Sanjari et al. (1) about our recent work (2). We want to respond:

A. Large odds ratios are not necessarily the best predictors

They mention that sizable odds ratios don't always pinpoint the top predictor, suggesting that we should be reporting the area under the receiver operating characteristic curve (AUC). We concur that a variable with the highest odds ratio in regression can indicate a strong association but might not be the most important predictor. To gauge how the model fits to the outcome (1 = death, 0 = alive at discharge), we can separately assess the Bayesian Information Criterion (BIC) for each predictor. These predictors can be arranged in order of significance based on the BIC: the lowest BIC is the most important, while the highest BIC points to the least important in the model.

In our study, the predictor boasting the largest odds ratio, which was the requirement for mechanical ventilation (1 meaning yes, 0 meaning no), was also the one with the smallest BIC (BIC = 153.7; log-likelihood = −71.4) when stacked against the other four predictors in our model. Following that, the second most vital predictor when analyzed individually was the pulse oximetry saturation upon admission (BIC = 258.8; log-likelihood = −123.9), succeeded by the logarithm of the derived neutrophil-to-lymphocyte ratio (dNLR; BIC = 296.9; log-likelihood = −142.9), age (BIC = 307.9; log-likelihood = −148.4), and lastly, the logarithm of platelet counts upon admission (BIC = 340.9 and log-likelihood = −164.9). Additionally, a “random forests” visualization (3) demonstrated that the need for mechanical ventilation emerged as the paramount predictor among the five. This conclusion was drawn from the marked decrease in accuracy in our model presented in Table 3 of our publication when this predictor was switched out randomly. The significance rank of the subsequent four variables aligned with their BIC rankings.

In the discussion segment of our paper (2), we highlight that the AUC of a receiver operating characteristic should no longer be used as an indicator for screening performance (4). The AUC's value isn't intuitive for many; for instance, an AUC of 0.70 doesn't immediately provide insights into the detection rate (also known as sensitivity) for a given rate of false positives (which is the inverse of specificity), or the other way around. Furthermore, when standard deviations of screening markers vary between affected and unaffected subjects, the AUC can become ambiguous or even give a false impression. Matthew's correlation coefficient offers a more accurate approach for classification (57).

B. LASSO is not appropriate for explanation modeling

In our paper's methods section (2), we employed LASSO for pinpointing the key predictors linked to mortality. LASSO not only selects essential variables but also reduces other coefficients to zero, effectively guarding against model overfitting. This approach melds both variable selection and regularization into a robust regression methodology. After applying LASSO logistic regression, we transitioned to standard logistic regression for the identified predictors. This step was taken to offer readers clear coefficient estimates, enabling them to assess the predictors without the influence of the L1 penalty. Indeed, as seen in Table 3 of our study (2), all five predictors had a Variance Inflation Factor under 2.0, indicative of minimal multicollinearity.

C. The presence of sparse data bias

Our study encompassed 247 participants (2). Out of these, 138 (or 56%) had to undergo mechanical ventilation. Furthermore, 101 out of the 247 individuals, accounting for 40%, unfortunately passed away. Notably, there wasn't any instance of variables with scant observations, ensuring data robustness.

Sanjari et al. (1) state that due to the exponentiation of some of our coefficients, there were enormous odds ratios, specifically for baseline platelet counts and dNLR. Due to the evident skewness in these values (1.6 and 4.5 respectively), we employed logarithmic transformations on these predictors to mitigate the impact of potential outliers. As a result, this adjustment offered a more harmonized model fit, as indicated by the Bayesian Information Criterion (BIC). When comparing our chosen model showcased in Table 3 of our publication (2) to its counterpart without the logarithmic adjustments on platelets and dNLR, the evidence ratio stands at 1.44. This suggests that the data backs our presented model in Table 3 approximately 1.44 times more than the alternative model. It's an indication that our model choice is on the right track.

Yet, in the spirit of complete disclosure, when we exhibit the model analogous to what's in Table 3 of our paper (2) but without logarithmic adjustments for the baseline platelet counts or dNLR, the odds ratios for age (1.07, 95% CI = 1.03–1.12), pulse oximetry saturation during admission (0.96, 95% CI = 0.92–0.997), and the need for mechanical ventilation (164, 95% CI = 39–700) hold steady. Breaking it down: each yearly increase in age increases the odds of death by 3% to 12%; each 1% elevation in pulse oximetry saturation at the point of admission curtails the odds of death by 0.3%−8%; and the necessity for mechanical ventilation amplifies the odds of death from 39 to as much as 700 times. When it comes to baseline platelet counts (expressed as × 109 cells/L), the odds ratio was 0.99 (95% CI = 0.985–0.997), and for the baseline dNLR, it was 1.21 (95% CI = 1.04–1.39). Hence, with every increment of 1 × 109 cell per liter in platelets, mortality odds are reduced by 0.3%−1.5%. On the flip side, every unit hike in dNLR pushes the death odds up by 4%−39%.

Firth's bias-reduced regression (8) is a nice thought, but implementing this statistical technique resulted in coefficients like our model presented in Table 3. We used the R “logistic” package (version 1.26.0), and there was no practical difference between the model result from Firth's biased reduced regression and our model in Table 3.

We appreciate the commentary by Sanjari et al. (1) concerning our paper highlighting our contribution to the field.

Author contributions

AC-T: Writing – review & editing. EF-H: Writing – review & editing. DO-F: Writing – review & editing. GZ: Formal analysis, Methodology, Writing – original draft, Writing – review & editing.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Sanjari E, Toosizadeh S, Raeisi Shahraki H. Commentary: Modeling mortality risk in patients with severe COVID-19 from Mexico. Front Med. (2023) 10:1247741. doi: 10.3389/fmed.2023.1247741

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Cortes-Telles A, Figueroa-Hurtado E, Ortiz-Farias DL, Zavorsky GS. Modeling mortality risk in patients with severe COVID-19 from Mexico. Front Med. (2023) 10:1187288. doi: 10.3389/fmed.2023.1187288

CrossRef Full Text | Google Scholar

3. Breiman L. Random forests. Mach Learn. (2001) 45:5–32. doi: 10.1023/A:1010933404324

CrossRef Full Text | Google Scholar

4. Wald NJ, Bestwick JP. Is the area under an ROC curve a valid measure of the performance of a screening or diagnostic test? [Corrigendum: 2020. vol 27(3), page 170]. J Med Screen. (2014) 21:51–6. doi: 10.1177/0969141313517497

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Chicco D, Warrens MJ, Jurman G. The Matthews Correlation Coefficient (MCC) is more informative than Cohen's Kappa and Brier Score in binary classification assessment. IEEE Access. (2021) 9:78368. doi: 10.1109/ACCESS.2021.3084050

CrossRef Full Text | Google Scholar

6. Chicco D, Totsch N, Jurman G. The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation. BioData Min. (2021) 14:13. doi: 10.1186/s13040-021-00244-z

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Chicco D, Jurman G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics. (2020) 21:6. doi: 10.1186/s12864-019-6413-7

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Firth D. Bias reduction of maximum-likelihood-estimates. Biometrika. (1993) 80:27–38. doi: 10.1093/biomet/80.1.27

CrossRef Full Text | Google Scholar

Keywords: LASSO, least absolute shrinkage and selection operator, COVID-19, mechanical ventilation, biostatistics

Citation: Zavorsky GS, Cortes-Telles A, Figueroa-Hurtado E and Ortiz-Farias DL (2023) Response: Commentary: Modeling mortality risk in patients with severe COVID-19 from Mexico. Front. Med. 10:1301349. doi: 10.3389/fmed.2023.1301349

Received: 24 September 2023; Accepted: 01 November 2023;
Published: 21 November 2023.

Edited by:

Neftali Eduardo Antonio-Villa, National Institute of Cardiology Ignacio Chavez, Mexico

Reviewed by:

Nicolas Padilla-Raygoza, Institute of Public Health of the State of Guanajuato (ISAPEG), Mexico

Copyright © 2023 Zavorsky, Cortes-Telles, Figueroa-Hurtado and Ortiz-Farias. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Gerald Stanley Zavorsky, Z3N6YXZvcnNreSYjeDAwMDQwO3VjZGF2aXMuZWR1

These authors share first authorship

ORCID: Gerald Stanley Zavorsky orcid.org/0000-0002-4473-1601
Arturo Cortes-Telles orcid.org/0000-0002-5322-5604
Esperanza Figueroa-Hurtado orcid.org/0000-0003-1388-0572
Diana Lizbeth Ortiz-Farias orcid.org/0000-0002-3212-1929

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.