Response: Commentary: Modeling mortality risk in patients with severe COVID-19 from Mexico
- 1Student Research Committee, Shahrekord University of Medical Sciences, Shahrekord, Iran
- 2Department of Epidemiology and Biostatistics, Faculty of Health, Shahrekord University of Medical Sciences, Shahrekord, Iran
A Commentary on
Modeling mortality risk in patients with severe COVID-19 from Mexico
by Cortes-Telles, A., Figueroa-Hurtado, E., Ortiz-Farias D. L., and Zavorsky, G. S. (2023). Front. Med. 10:1187288. doi: 10.3389/fmed.2023.1187288
Introduction
We meticulously read the paper by Cortes-Telles et al. (1) that was published online in Frontiers in Medicine in May 2023. The study was conducted to determine significant predictors of mortality among hospital-admitted COVID-19 patients. Finally, they ranked the five crucial predictors of death as (1) need to a mechanical ventilator, (2) platelet concentration at admission, (3) increased derived neutrophil to lymphocyte ratio, (4) age and (5) pulse oximetry saturation respectively (1). Undoubtedly, their study makes a valuable contribution to the area, but some methodological issues need to be taken into account to avoid misinterpretation of the study's results.
The higher OR doesn't necessarily show the best predictor
The odds ratio is a valid metric to investigate any association between the quantitative independent variables and a binary outcome but the presence of such an association has no information about the prediction capability. The OR is affected by variables' scales and may not be comparable due to the fact that they have different types of units. Instead, the standardized ORs that are extracted from the standardized regression coefficients have the same unit and are comparable (2). Moreover, to compare the prediction accuracy of models, the area under the cure (AUC) is highly suggested (3).
LASSO is not appropriate for explanation modeling
Using regression models for the causal explanation is very different from the empirical prediction aims. If highly correlated variables exist, the lasso retains only one variable and sets the others to zero. That will possibly lead to misleading results for the explanation aims. So, like all greedy algorithms, LASSO is good for prediction aims and not appropriate for explanation aims. More information about the difference between the explanation and prediction models, reading an article entitled “to explain or to predict” is helpful (4).
The presence of sparse data bias
The lack of adequate case numbers for some of the variables in the logistic regression leads to a phenomenon called sparse data biased. A further upward bias is expected due to the fact that odds are obtained by taking the exponentiation of the coefficients which leads to impossibly huge odds (5). Cortes-Telles et al. (1) report the need for a mechanical ventilator equal to 193 (43 to 878) and the logarithm of platelets counts as 0.002 (0.0003 to 0.09) and the logarithm of dNLR as 14.1 (1.2 to 169.5) which are not reliable.
Discussion
The take-home message of this note for the readers is that using true statistical analysis and an appropriate interpretation is critical in medical investigations. To avoid sparse data bias, using Firth's bias-reduced logistic regression which uses penalized maximum likelihood estimation, the exact logistic regression and Bayesian approaches are recommended.
Author contributions
ES, ST, and HR: conception and design and drafting the article. Also, the manuscript has been read and approved by all the authors. All authors contributed to the article and approved the submitted version.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Cortes-Telles A, Figueroa-Hurtado E, Ortiz-Farias DL, Zavorsky GS. Modeling mortality risk in patients with severe COVID-19 from Mexico. Front Med. (2023) 10:1187288. doi: 10.3389/fmed.2023.1187288
2. Peng CY, So TS. Logistic regression analysis and reporting: A primer. Underst Stat. (2002) 1:31–70. doi: 10.1207/S15328031US0101_04
3. Chatterjee A, Woodruff H, Wu G, Lambin P. Limitations of only reporting the odds ratio in the age of precision medicine: A deterministic simulation study. Front Med. (2021) 8:640854. doi: 10.3389/fmed.2021.640854
Keywords: prediction, regression, variable importance, odds ratio, statistical modeling
Citation: Sanjari E, Toosizadeh S and Raeisi Shahraki H (2023) Commentary: Modeling mortality risk in patients with severe COVID-19 from Mexico. Front. Med. 10:1247741. doi: 10.3389/fmed.2023.1247741
Received: 03 July 2023; Accepted: 14 September 2023;
Published: 28 September 2023.
Edited by:
Neftali Eduardo Antonio-Villa, National Institute of Cardiology Ignacio Chavez, MexicoReviewed by:
Nicolas Padilla-Raygoza, Institute of Public Health of the State of Guanajuato (ISAPEG), MexicoEfrain Navarro-Olivos, Institute of Public Health of the State of Guanajuato (ISAPEG), Mexico
Ashuin Kammar-García, Instituto Nacional de Geriatría, Mexico
Copyright © 2023 Sanjari, Toosizadeh and Raeisi Shahraki. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Hadi Raeisi Shahraki, cmFlaXNpLnNoYWhyYWtpX2hhZGlAeWFob28uY29t