A machine learning-derived risk score to predict left ventricular diastolic dysfunction from clinical cardiovascular magnetic resonance imaging

Zhou, Qingtao; Wang, Lin; Craft, Jason; Weber, Jonathan; Passick, Michael; Ngai, Nora; Khalique, Omar K.; Goldfarb, James W.; Barasch, Eddy; Cao, J. Jane

doi:10.3389/fcvm.2024.1382418

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 06 June 2024

Sec. Cardiovascular Imaging

Volume 11 - 2024 | https://doi.org/10.3389/fcvm.2024.1382418

A machine learning-derived risk score to predict left ventricular diastolic dysfunction from clinical cardiovascular magnetic resonance imaging

Qingtao Zhou¹

Lin Wang^1,2

Jason Craft^1,2

Jonathan Weber¹

Michael Passick^1,2

Nora Ngai^1,2

Omar K. Khalique^1,2

James W. Goldfarb¹

Eddy Barasch^1,2

J. Jane Cao^1,2*

¹DeMatteis Cardiovascular Institute, St. Francis Hospital & Heart Center, Roslyn, NY, United States
²Division of Cardiac Imaging, St. Francis Hospital & Heart Center, Roslyn, NY, United States

Introduction: The evaluation of left ventricular diastolic dysfunction (LVDD) by clinical cardiac magnetic resonance (CMR) remains a challenge. We aimed to train and evaluate a machine-learning (ML) algorithm for the assessment of LVDD by clinical CMR variables and to investigate its prognostic value for predicting hospitalized heart failure and all-cause mortality.

Methods: LVDD was characterized by echocardiography following the ASE guidelines. Eight demographic and nineteen common clinical CMR variables including delayed enhancement were used to train Random Forest models with a Bayesian optimizer. The model was evaluated using bootstrap and five-fold cross-validation. Area under the ROC curve (AUC) was utilized to evaluate the model performance. An ML risk score was used to stratify the risk of heart failure hospitalization and all-cause mortality.

Results: A total of 606 consecutive patients underwent CMR and echocardiography within 7 days for cardiovascular disease evaluation. LVDD was present in 303 subjects by echocardiography. The performance of the ML algorithm was good using the CMR variables alone with an AUC of 0.868 (95% CI: 0.811–0.917), which was improved by combining with demographic data yielding an AUC 0.895 (95% CI: 0.845–0.939). The algorithm performed well in an independent validation cohort with AUC 0.810 (0.731–0.874). Subjects with higher ML scores (>0.4121) were associated with increased adjusted hazard ratio for a composite outcome than subjects with lower ML scores (1.72, 95% confidence interval 1.09–2.71).

Discussion: An ML algorithm using variables derived from clinical CMR is effective in identifying patients with LVDD and providing prognostication for adverse clinical outcomes.

1 Introduction

Left ventricular diastolic dysfunction (LVDD) is an important risk factor for heart failure (HF) and all-cause mortality (1, 2). Effectively and promptly identifying patients with LVDD may decrease their associated risk of adverse outcomes, improve survival, and reduce the heavy social and economic burden of heart failure (3). Echocardiography is the most commonly used modality to assess LVDD where standardized guidelines have been established and adopted for clinical use (4).

While cardiac magnetic resonance (CMR) is highly valuable in the evaluation of LV function and myocardial tissue properties, integrated assessment of LVDD using available parameters captured in routine clinical CMR examination remains challenging (5, 6), albeit individual variables derived from clinical CMR such as left atrial size and strain, LV mass and longitudinal strain are closely associated with LVDD (7, 8). In addition, measurements beyond routine CMR images have shown early promise in LVDD evaluation (9–11). The goal of the present study was to test a hypothesis that the machine learning (ML) risk score can determine the probability of LVDD using CMR variables captured in routine clinical examinations. In addition, we aimed to investigate the prognostic value of our ML score to estimate the risk of heart failure (HF) hospitalization and all-cause mortality.

2 Materials and methods

2.1 Study subjects

We identified consecutive patients who underwent echocardiography and CMR as clinically indicated with gadolinium contrast within 7 days of each other from a single center between January 2004 and December 2014. Demographic information and cardiovascular history were collected prospectively at the time of CMR. A total of 702 patients were identified initially. After excluding 96 subjects who had missing echocardiographic data for adequate evaluation of LVDD or inadequate CMR image quality using real time imaging due to arrhythmia, there were 606 subjects included in the analysis. Of those, 395 were prospectively enrolled with echocardiography and CMR obtained on the same day and the remaining 211 were identified from the clinical patient population. The study was approved by the St. Francis Hospital Institutional Review Board and a waiver was granted for the analysis of retrospective data. An additional set of subjects following the same inclusion/exclusion criterea were identified from a separate hospital within the same health system (30 miles apart representing a different catchment area) and were used as an external validation cohort. The Good Samaritan University Hospital Institutional Review Board granted approval for our use of subjects for the validation cohort and provided a waiver of informed consent for retrospective data collection.

LVDD by echocardiography was characterized following ASE guidelines (4). Details about these subjects were published in a prior study from our group (12). The outcome of HF hospitalization was identified from electronic medical records of the Catholic Health System of Long Island, NY. All-cause mortality data was obtained from both the electronic medical records and National Death Index between 2004 and 2018. All-cause mortality and HF hospitalization were combined to form a composite time-to-event outcome.

2.2 Imaging acquisition—transthoracic echocardiography

Comprehensive echocardiographic examination was performed using a multi-frequency transducer ultrasound system (Philips IE 33, Andover, MA) as described previously (12). Briefly, from the apical window, pulsed wave Doppler was used to interrogate mitral inflow from 3 to 5 cardiac cycles at the level of the mitral valve annulus and at the mitral leaflets' tips. Subjects with atrial fibrillation were excluded. Tissue Doppler was applied to record mitral annular velocities at the septal and lateral corners of the annulus, and the results are given as an average of both from 3 to 5 cardiac cycles. Tricuspid regurgitation velocity was recorded by continuous wave Doppler from multiple windows. Two-dimensional measurements were performed according to recommendations of the American Society of Echocardiography (13) and indexed to body surface area. Echocardiograms were utilized only for the diagnosis of presence and severity of LVDD in our analysis. Mitral inflow early (E) and late (A) peak velocities, early diastolic annular myocardial longitudinal velocity (e′), tricuspid regurgitation and left atrial volume (LA) index were measured in order to determine the DD grade. Subjects with indeterminate DD diagnosis (N = 7) were excluded; however, subjects with DD but indeterminate of grade following current guidelines were included and were considered to have DD. These subjects with DD present but indeterminate of grade were not further classified. The scan and analysis protocols were kept the same for the validation cohort.

2.3 Imaging acquisition—CMR

All subjects underwent CMR on a 1.5T scanner (Avanto, Siemens, Malvern, PA) with an 8-element phased array surface coil. Cine imaging of the long axis planes (2-, 3-, and 4-chamber views) and a stack of 8–12 short axis planes (contiguous 8 mm slice), starting from the mitral annulus, were acquired using balanced steady state free precession sequence with 30 phases per cardiac cycle. The average temporal resolution was 50 ms, with a typical field of view of 240 mm, flip angle of 70 degrees, repetition time (TR) of 3.1 ms and echo time (TE) 1.3 ms. Phase sensitive inversion recovery late gadolinium enhancement (LGE) imaging was performed 10–15 min after the administration of 0.15 mmol/kg of gadopentetate dimenglumine on a stack of LV diastolic short axis slices with the following parameters; TE 3.17 ms, TR = 1 × RR interval, flip angle 25°, voxel size 1.9 mm² × 1.4 mm² × 8 mm³ and FOV 360 mm² × 290 mm². The inversion time was selected from TI scout imaging. CMR performed on the validation cohort followed the same imaging protocol on a 1.5T scanner (Aera, Siemens, Malvern, PA) with an exception of using gadobutrol at 0.15 mmol/kg for LGE imaging.

2.4 Image post processing

The volumetric analysis of the cine images from CMR was analyzed using QMASS software (Version: 7.2. Medis, Leiden, The Netherlands). LA volume was analyzed following the area and length method using 2- and 4-chamber long axis cine images with commercial software (Circle Cardiovascular Imaging Inc, version: 5.11, Calgary, Canada) and indexed to body surface area (BSA). LA maximum volume (LAV_max) was assessed at LV end systole and LA minimum volume (LAV_min) at LV end diastole. Regional wall motion abnormalities (RWMA) were assessed and expressed as present vs. absent. Global LV peak strain was measured in the longitudinal, circumferential and radial directions by feature tracking software (Circle Cardiovascular Imaging Inc, Calgary, Canada). The epi- and endocardial contours were drawn manually on the end diastolic phase of the 2-, 3-, and 4-chamber long-axis cine images for longitudinal strain and short-axis segmented cine images for the circumferential and radial strain and propagated automatically to calculate global peak systolic strains. All image analyses were performed by experienced operators. LGE was interpreted by cardiologists who had 6–20 years of experience. LGE was assessed as binary variable as well as characterized as either absent, ischemic or non-ischemic fibrotic patterns.

2.5 Machine learning algorithm and evaluation

We explored three tree-based approaches including Random Forest, XGBoost and AdaBoost. Random Forest was selected due to its superior model performance, evaluated by the highest R-squared value (data not shown).

Five Random Forest models were evaluated whose variables included (1) demographic variables only; (2) clinical CMR variables only; (3) clinical CMR + demographic variables; (4) clinical CMR + LV peak systolic strain variables; (5) clinical CMR + LV peak systolic strain + demographic variables (list of all variables was included in Table 1). Area under the curve (AUC) from the Receiver Operating Characteristic analysis was used to evaluate each model's performance. The model with the best performance was chosen to compute the ML risk score. The ML risk score was defined as the model's predicted probability of of LVDD diagnosis, conditional on the risk factors in the selected model. The Random Forest model was developed using python (v. 3.10.2) and its associated libraries [Random Forest (https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html); SHAP (https://github.com/slundberg/shap) and Bayesian optimizer (https://pypi.org/project/bayesian-optimization)].

Table 1

Table 1. Baseline characteristics of the training cohort between subjects with and without left ventricular diastolic dysfunction.

A bootstrap approach was utilized to calculate 95% confidence intervals (CI). Each of the clinical datasets was stratified into five folds where four folds were used as training (80%) and the remaining (20%) as a testing dataset. This process was repeated four times. A nested cross-validation procedure (14, 15) was incorporated into the ML algorithms where the training dataset was further divided into five sub-folds and four of which were used for sub-training and the remaining for validation. In addition, a Bayesian optimizer was utilized to identify the best hyper-parameters for the random forest models. The model output for the probability of LVDD diagnosis was defined as the ML risk score.

2.6 Model interpretation and feature importance

We leveraged the Random Forest models with the Shapley values (16) to explore the individual contributions of the clinical variables to the probability of LVDD diagnosis. Briefly, we examine the pathway each individual subject takes through the model. The subject reaches a decision point (variable split) where the value of each contributing variable either increases or decreases the subject's probability of LVDD. Each option contributes a weight, and that weight is associated with the variables used to determine the decision point. By aggregating these decision point weights, we can identify the important features among all risk factors.

2.7 Statistical analysis

Continuous variables were summarized as mean ± SD and the categorical variables were expressed as numbers or percentage (%). Random forest-derived models generated scores demonstrating risk of LVDD diagnosis. Youden's index, (calculated as sensitivity + specificity − 1) was used to identify the optimal threshold for the ML risk score, which was subsequently used to stratify the subjects into having high or low ML risk scores or higher or lower probability of LVDD. Kaplan–Meier plots, generated using MedCalc (v20.110, MedCalc Software Ltd, Belgium) were developed to compare event-free survival between those with high and low probability of LVDD based on ML risk scores (evaluated with log-rank tests). Additionally, Cox proportional hazards models were created in order to evaluate the prognostic role of the probable LVDD based on our ML risk score in the context of traditional risk factors. To validate the models, we used an external dataset. The previously calibrated models based on the training dataset were tested on the validation dataset. Models were evaluated based on demographic data alone, CMR parameters alone and the combination of demographic and CMR parameters. AUC were used to validate the model performance. Analyses were performed using SigmaPlot (v14, Systat Software Inc., Palo Alto, CA). All two-sided p values <0.05 were considered statistically significant.

3 Results

Of the 606 subjects included, the average age was 66 ± 16 years and 62% were female. Of those, 305 subjects were diagnosed with LVDD by echocardiography with 113 subjects classified with grade 1, 63 grade II, 31 grade III and 98 with indeterminate LVDD grade. Compared to the subjects with no LVDD, those with LVDD were older and more likely to have cardiovascular risk factors such as hypertension, diabetes mellitus and hyperlipidemia (Table 1). In addition, the left and right ventricular ejection fractions and stroke volumes were lower, LGE prevalence was higher (64% vs. 29%, p < 0.001) and the LV peak systolic strain values were reduced.

We tested ML model performance based on different sets of variables. The CMR variables alone were predictive of LVDD strongly with an AUC of 0.868 (95% CI: 0.811, 0.917) higher than demographic variables alone, which had an AUC 0.717 (0.641, 0.787) (p for comparison <0.001) (Figure 1). Combined CMR and demographical variables yielded improved model performance with an excellent AUC of 0.895 (0.846, 0.939) compared with CMR or demographical variables alone (p < 0.001). Conversely, the addition of LV peak strain parameters did not further improve the model performance whether in combination with CMR or with CMR plus demographic variables (0.868 vs. 0.870, p = 0.835 and 0.895 vs. 0.893, p = 0.803, respectively).

Figure 1

Figure 1. Area under the ROC curves (95% confidence intervals) from random forest algorithm predicting left ventricular diastolic dysfunction based on different variable datasets.

We ranked the importance features following the Shapley values based on the model with the best performance from the combination of CMR and demographic variables. The top 5 important features were LVEF, age, LV mass, LAEF and LAV_min (Figure 2). Next, we calculated a ML risk score for each subject. Shown in Figure 3 are two examples how Shapley values of each variable varies from subject to subject, both with normal LV ejection fraction contributing to drastically different ML risk scores.

Figure 2

Figure 2. Global importance features of the random forest model.

Figure 3

Figure 3. Examples of two patients with similar left ventricular ejection fraction but different machine learning scores. The first patient (A) was a 58 year-old female with left ventricular ejection fraction of 54%. The high machine-learning score of 0.9986 was due to unfavorable left ventricular mass, body mass index, right ventricular size and the presence of late gadolinium enhancement. The second patient (B) was a 53 year-old male with a left ventricular ejection fraction of 54%. The low machine-learning risk score of 0.000392 was attributed to his favorable body mass index, left ventricular mass, left and right ventricular size.

Subjects were divided into high-risk and low-risk groups at a cut-point (0.4121) defined by Youden's index. Correspondingly, at the cut-off values, the ML risk score for identifying the LVDD were associated with a sensitivity of 0.82, a specificity of 0.80, a positive predictive value of 0.81, a negative predictive value of 0.79, and an accuracy of 0.80. Among those with low ML risk score (N = 309) 65 (21%) subjects were misclassified chiefly due to grade I (8%) or undetermined (11%) LVDD grade, which was 88% of all those misclassified. The remainder were due to misclassification of the advanced LVDD (grade II or III). Conversely, among those with high ML risk score (N = 297) 57 (19%) subjects with no LVDD were misclassified.

After a mean follow up of 4.8 years, 123 subjects were hospitalized for HF, 99 died and 182 had a composite outcome of either HF or all-cause death. Subjects with higher ML risk scores were more likely to be hospitalized for HF, experience all-cause mortality, or had a composite outcome than subjects with lower ML risk scores as seen in the Kaplan–Meier curves (Figure 4) (log-rank p < 0.001 for all 3 comparisons). Cox proportional hazards models revealed that after adjustment for traditional risk factors including age, gender, BMI, history of hypertension, diabetes, coronary artery disease, as well as LVEF and presence of scar measured on CMR, having a high LVDD risk score was associated with a 72% increase in hazards of our composite outcome [hazard ratio 1.72, (95% CI: 1.09–2.71)], (Table 2). Additionally, compared to a model including the traditional risk factors, adding the LVDD ML risk score significantly improved model fit (likelihood ratio test p = 0.015).

Figure 4

Figure 4. The Kaplan–Meier curves showing reduced probability of event-free survival among those with high machine-learning score than those with low score for the outcome of hospitalized heart failure (A), all-cause mortality (B) and the composite outcome (C) (log-rank p < 0.001 for all three plots).

Table 2

Table 2. Hazards of composite all-cause mortality and inpatient heart failure admission associated with machine-learning predicted probability of LVDD.

For the validation cohort (N = 96) the average age was 55 ± 17, younger than the training cohort of 66 ± 16 years. Participants were 41% female, fewer than the training cohort of 62%. Out of these subjects, 55 were identified as having LVDD. The subjects with LVDD were older, and had a higher prevalence of cardiovascular risk factors such as hypertension and hyperlipidemia (Table 3). Additionally, the left and right ventricular ejection fractions and stroke volumes were lower and LGE prevalence was higher. Notably the prevalence of LGE and RWMA was higher in this cohort than the training cohort, 57% vs. 46% and 79% vs. 29%, respectively. Similar to the training cohort, the model performance of the validation cohort was better using CMR variables alone than using demographic variables, AUC of 0.786 (0.710–0.856) vs. 0.724 (0.634–0.814). Combined CMR and demographic variables showed a stronger model performance with AUC 0.810 (0.731–0.874).

Table 3

Table 3. Demographic characteristics of the validation cohort between subjects with and without left ventricular diastolic dysfunction.

4 Discussion

In the present study, we trained and evaluated ML algorithms to identify patients with LVDD based on CMR variables that are routinely captured for clinical purposes. The model using only CMR variables was robust and was further improved when combined with the demographic information. The addition of myocardial strains had a neutral effect on the model performance. The model performed well on an independent external dataset. The ML risk score defining the probability of LVDD diagnosis was effective at differentiating subjects at risk of hospitalized HF and all-cause mortality.

To date, echocardiography remains the most important modality for the diagnosis of LVDD despite the complexity of the diagnostic algorithm and commonly undetermined classification. On the other hand, CMR is still limited in evaluating LVDD even though it is invaluable in assessing cardiac systolic function and myocardial tissue properties. It is worth noting that a few small CMR studies have demonstrated the feasibility of using innovative imaging or post processing techniques to evaluate LVDD (9–11, 17). Nevertheless, most of those protocols require either additional image acquisitions beyond standard clinical CMR protocols or complex post processing in need of proprietary software. In contrast, the LVDD diagnostic algorithm by echocardiography is developed based on variables that are captured during routine clinical studies. The presence or absence of LVDD can therefore be successfully defined in most cases when image quality is appropriate, albeit in a subset of patients in whom LVDD status is undetermined largely due to discordant findings among the essential variables or lack of measurement such as tricuspid regurgitation velocity when the regurgitation is absent. In the present study we tested the ML algorithm based on CMR variables that are clinically acquired, similar to the echocardiographic approach, and found the performance of the algorithm is excellent at identifying LVDD especially after combining CMR variables with common clinical variables yielding an AUC 0.895 (0.845, 0.939). More importantly, the success of the ML approach can be achieved from using variables that are clinically acquired without the need of additional image acquisitions or post processing. Interestingly, each of the top 5 most important features including LVEF, age, LV mass, LAEF and LAV_min have all been previously demonstrated to be associated with LVDD thereby supporting the validity of our ML algorithm (4, 7, 18). To validate the algorithm we applied the model to an independent dataset obtained from a community hospital where echocardiography and CMR were aquired by the local team for clinically referred patients, who were much more prevalent in male gender, the presence of LGE and RWMA, suggesting a different patient population from the training cohort. Nonetheless, the model performed well in predicting LVDD despite the small cohort size.

LVDD is an important condition to recognize, as it is associated with significantly elevated risk of HF and mortality. At the present time, the clinical diagnosis of LVDD is largely based on echocardiographic criteria recommended by ASE guidelines and will continue to evolve towards greater simplicity and accuracy (19, 20), which is a shared goal for the field of CMR. In this study we included only the most commonly assessed variables from clinical CMR such as cardiac chamber volumes, regional and global systolic function, and LGE, which seem to effectively support the strong performance of the ML algorithm. In addition to common clinical variables we have also examined the LV peak systolic strains in the longitudinal, circumferential and radial directions since strains have been linked to LVDD (8, 21). In our cohort, LV strains were significantly reduced in subjects with LVDD although strain impairment did not appear to further improve model performance when combined with CMR assessment. We speculate that the ML approach helps to efficiently utilize multiple clinical variables simultaneously thereby maximizing the value of each variable and consequently simplifying the evaluation. It appears feasible based on our observation, not to mention highly desirable to offer LVDD evaluation as part of the routine clinical CMR examination to compliment the assessment of systolic function and tissue characterization.

ML has often been criticized as a black box because the output data is not always easily interpretable in a clinically familiar manner. To ensure our findings are clinically relevant we applied Shapley values to change this “black box” into a “glass box” by using the explanation functions (22, 23). The Shapley values are especially helpful when they are applied to individual subjects (24). Such applications can be appreciated in two examples shown in our study where Shapley values of the same variables differed significantly between two patients, as each value is unique to the patient. Consequently, the ML risk score differentiated two patients having high or low likelihood of LVDD despite the similar LVEF between them (Figure 3). While the overall model performance of an ML algorithm is important, the true clinical value of any ML algorithm lies in the function of assessing individual patients for diagnostic or prognostic purposes. Our approach of using ML risk score is promising in assessing individual patients for the diagnosis of LVDD.

There are several limitations in the present study. The algorithm provided dichotomized probability of having or not having LVDD. The binary evaluation appears to be effective in differentiating the risk of adverse clinical outcomes providing important prognostication. However, there is lack of a grading system to further delineate the severity of LVDD, which will require additional validation and is beyond the scope of current study. We did not have invasive hemodynamis available to assist in the validation of our algorithm although LVDD may present in individuals with normal LV end diastolic pressure. While the overall performance of the ML algorithm is strong there are misclassifications in the diagnosis of LVDD. In the low ML risk score group the misclassified cases were largely made up of subjects with grade I or undetermined LVDD grade. The current guideline classifies LVDD for anyone who has reduced LV ejection fraction, an approach that continues to draw debate which may in part contribute to the misclassification. In addition, the undetermined LVDD grade by the reference standard of echocardiography is undoubtedly a source of challenge and a cause of misclassification for CMR. Nevertheless, the adverse outcome risk was significantly lower among those with low ML risk scores than those with high scores thereby supporting the validity of the LVDD diagnostic probability assessment. We included the LV peak systolic strain in the analysis. However, the diastolic strain rate would have been a better alternative in assessing LVDD although it requires additional post processing. To add variables in need of extensive post-processing would have defeated our intention to make the evaluation clinically friendly. The validation cohort size was small and the time lapse was 30 days between echocardiography and CMR as opposed to 7 days in the training cohort. Despite the constrains the ML model performed well supporting the robustness of the algorithm. Nonetheless, it is still highly desirable to have a multi-center study design with a more diverse patient population to test the generalizability of the ML algorithms.

To conclude, the proposed ML algorithm using variables derived from clinical CMR is effective in identifying individual patients with LVDD and providing important prognostic value for adverse clinical outcome assessment.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by St. Francis Institutional Review Board. The studies were conducted in accordance with the local legislation and institutional requirements. The Ethics Committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because Research was retrospective and did not require patient contact. The research involves no more than minimal risk and involves no procedures for which written consent is normally required outside of the research context.

Author contributions

QZ: Conceptualization, Data curation, Formal Analysis, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. LW: Conceptualization, Data curation, Methodology, Writing – review & editing. JC: Investigation, Methodology, Resources, Writing – review & editing. JW: Formal Analysis, Methodology, Project administration, Resources, Software, Writing – review & editing. MP: Data curation, Investigation, Software, Writing – review & editing. NN: Data curation, Investigation, Software, Writing – review & editing. OK: Supervision, Writing – review & editing. JG: Methodology, Resources, Software, Writing – review & editing. EB: Investigation, Methodology, Software, Writing – review & editing. JC: Conceptualization, Investigation, Project administration, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article.

This study was funded by an endowment from the St. Francis Hospital Foundation.

Acknowledgments

The authors acknowledge great contributions of technologists, research nurses, coordinators and research volunteers from St Francis Hospital for their support with this work.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

AUC, area under the curve; CAD, coronary artery disease; CI, confidence interval; CMR, cardiac magnetic resonance; DD, diastolic dysfunction; LAEF, LA emptying fraction; LAVmax, left atrium maximum volume; LAVmin, LA minimum volume; LGE, late gadolinium enhancement; LVEDV, left ventricular end diastolic volume; LVEF, LV ejection fraction; LVESV, LV end systolic volume; LV mass, LV mass; LVSV, LV stroke volume; RWMA score, number of segments with regional wall movement abnormalities; RVEDV, right ventricular end diastolic volume; RVEF, RV ejection fraction; RVESV, RV end systolic volume; RVSV, RV systolic volume; RWMA score, number of segments with regional wall movement abnormalities; TE, echo time; TR, repetition time.

References

1. Tennøe AH, Murbræch K, Andreassen JC, Fretheim H, Garen T, Gude E, et al. Left ventricular diastolic dysfunction predicts mortality in patients with systemic sclerosis. J Am Coll Cardiol. (2018) 72(15):1804–13. doi: 10.1016/j.jacc.2018.07.068

Crossref Full Text | Google Scholar

2. Lam CSP, Gamble GD, Ling LH, Sim D, Leong KTG, Yeo PSD, et al. Mortality associated with heart failure with preserved vs. reduced ejection fraction in a prospective international multi-ethnic cohort study. Eur Heart J. (2018) 39(20):1770–80. doi: 10.1093/eurheartj/ehy005

PubMed Abstract | Crossref Full Text | Google Scholar

3. TRUST Participants, Testa M, Cappuccio A, Latella M, Napolitano S, Milli M, et al. The emotional and social burden of heart failure: integrating physicians’, patients’, and caregivers’ perspectives through narrative medicine. BMC Cardiovasc Disord. (2020) 20(1):522. doi: 10.1186/s12872-020-01809-2

PubMed Abstract | Crossref Full Text | Google Scholar

4. Nagueh SF, Smiseth OA, Appleton CP, Byrd BF III, Dokainish H, Edvardsen T, et al. Recommendations for the evaluation of left ventricular diastolic function by echocardiography: an update from the American society of echocardiography and the European association of cardiovascular imaging. Eur Heart J Cardiovasc Imaging. (2016) 17(12):1321–60. doi: 10.1093/ehjci/jew082

PubMed Abstract | Crossref Full Text | Google Scholar

5. Kramer CM, Barkhausen J, Flamm SD, Kim RJ, Nagel E. Standardized cardiovascular magnetic resonance imaging (cmr) protocols, society for cardiovascular magnetic resonance: board of trustees task force on standardized protocols. J Cardiovasc Magn Reson. (2008) 10(1):35. doi: 10.1186/1532-429X-10-35

PubMed Abstract | Crossref Full Text | Google Scholar

6. Society for Cardiovascular Magnetic R, Board of Trustees Task Force on Standardized P, Kramer CM, Barkhausen J, Flamm SD, Kim RJ, Nagel E. Standardized cardiovascular magnetic resonance (cmr) protocols 2013 update. J Cardiovasc Magn Reson. (2013) 15(1):91. doi: 10.1186/1532-429X-15-91

PubMed Abstract | Crossref Full Text | Google Scholar

7. Abhayaratna WP. Characteristics of left ventricular diastolic dysfunction in the community: an echocardiographic survey. Heart. (2006) 92(9):1259–64. doi: 10.1136/hrt.2005.080150

PubMed Abstract | Crossref Full Text | Google Scholar

8. Nguyen J, Weber J, Hsu B, Mulyala RR, Wang L, Cao JJ. Comparing left atrial indices by cmr in association with left ventricular diastolic dysfunction and adverse clinical outcomes. Sci Rep. (2021) 11(1):21331. doi: 10.1038/s41598-021-00596-w

PubMed Abstract | Crossref Full Text | Google Scholar

9. Ramos JG, Fyrdahl A, Wieslander B, Thalén S, Reiter G, Jin N, et al. Comprehensive cardiovascular magnetic resonance diastolic dysfunction grading shows very good agreement compared with echocardiography. J Am Coll Cardiol Imgaging. (2020) 13(122):2530–42. doi: 10.1016/j.jcmg.2020.06.027

Crossref Full Text | Google Scholar

10. Kermer J, Traber J, Utz W, Hennig P, Menza M, Jung B, et al. Assessment of diastolic dysfunction: comparison of different cardiovascular magnetic resonance techniques. ESC Heart Failure. (2020) 7(5):2637–49. doi: 10.1002/ehf2.12846

PubMed Abstract | Crossref Full Text | Google Scholar

11. Saba SG, Chung S, Bhagavatula S, Donnino R, Srichai MB, Saric S, et al. A novel and practical cardiovascular magnetic resonance method to quantify mitral annular excursion and recoil applied to hypertrophic cardiomyopathy. J Cardiovasc Magn Reson. (2014) 16(1):35. doi: 10.1186/1532-429X-16-35

PubMed Abstract | Crossref Full Text | Google Scholar

12. Wang L, Singh H, Mulyala RR, Weber J, Barasch E, Cao JJ. The association between left ventricular diastolic dysfunction and myocardial scar and their collective impact on all-cause mortality. J Am Soc Echocardiogr. (2020) 33(2):161–70. doi: 10.1016/j.echo.2019.09.022

PubMed Abstract | Crossref Full Text | Google Scholar

13. Mitchell C, Rahko PS, Blauwet LA, Canaday B, Finstuen JA, Foster MC, et al. Guidelines for performing a comprehensive transthoracic echocardiographic examination in adults: recommendations from the American society of echocardiography. J Am Soc Echocardiogr. (2019) 32(1):1–64. doi: 10.1016/j.echo.2018.06.004

PubMed Abstract | Crossref Full Text | Google Scholar

14. Sabovčik F, Cauwenberghs N, Kouznetsov D, Haddad F, Alonso-Betanzos A, Vens C, et al. Applying machine learning to detect early stages of cardiac remodelling and dysfunction. Eur Heart J Cardiovasc Imaging. (2021) 22(10):1208–17. doi: 10.1093/ehjci/jeaa135

Crossref Full Text | Google Scholar

15. Krstajic D, Buturovic LJ, Leahy DE, Thomas S. Cross-Validation pitfalls when selecting and assessing regression and classification models. J Cheminform. (2014) 6(1):10. doi: 10.1186/1758-2946-6-10

PubMed Abstract | Crossref Full Text | Google Scholar

16. Shapley L. Notes on the N-Person Game—II: The Value of an N-Person Game”. Santa Monica, CA: RAND Corporation (1951).

Google Scholar

17. Chamsi-Pasha MA, Zhan Y, Debs D, Shah DJ. CMR in the evaluation of diastolic dysfunction and phenotyping of hfpef. JACC Cardiovasc Imaging. (2020) 13(1):283–96. doi: 10.1016/j.jcmg.2019.02.031

PubMed Abstract | Crossref Full Text | Google Scholar

18. Posina K, McLaughlin J, Rhee P, Li L, Cheng J, Schapiro W, et al. Relationship of phasic left atrial volume and emptying function to left ventricular filling pressure: a cardiovascular magnetic resonance study. J Cardiovasc Magn Reson. (2013) 15:99. doi: 10.1186/1532-429x-15-99

PubMed Abstract | Crossref Full Text | Google Scholar

19. Oh JK, Miranda WR, Bird JG, Kane GC, Nagueh SF. The 2016 diastolic function guideline: is it already time to revisit or revise them? JACC Cardiovasc Imaging. (2020) 13(1 Pt 2):327–35. doi: 10.1016/j.jcmg.2019.12.004

PubMed Abstract | Crossref Full Text | Google Scholar

20. Vasan RS, Larson SMG, Benjamin EJ, Evans JC, Reiss CK, Levy D. Congestive heart failure in subjects with normal versus reduced left ventricular ejection fraction. J Am Coll Cardiol. (1999) 33(7):1948–55. doi: 10.1016/S0735-1097(99)00118-7

PubMed Abstract | Crossref Full Text | Google Scholar

21. Colak A, Muderrisoglu H, Pirat B, Eroglu S, Aydinalp A, Sezgin A, et al. Longitudinal strain and strain rate for estimating left ventricular filling pressure in heart transplant recipients. Am J Cardiol. (2020) 137:63–70. doi: 10.1016/j.amjcard.2020.09.037

PubMed Abstract | Crossref Full Text | Google Scholar

22. Roder J, Maguire L, Georgantas R, Roder H. Explaining multivariate molecular diagnostic tests via shapley values. BMC Med Inform Decis Mak. (2021) 21(1):211. doi: 10.1186/s12911-021-01569-9

PubMed Abstract | Crossref Full Text | Google Scholar

23. Rai A. Explainable ai: from black box to glass box. J Acad Mark Sci. (2020) 48(1):137–41. doi: 10.1007/s11747-019-00710-5

Crossref Full Text | Google Scholar

24. Wang K, Tian J, Zheng C, Yang H, Ren J, Liu Y, et al. Interpretable prediction of 3-year all-cause mortality in patients with heart failure caused by coronary heart disease based on machine learning and shap. Comput Biol Med. (2021) 137:104813. doi: 10.1016/j.compbiomed.2021.104813

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: machine learning, LVDD, heart failure, echocardiography, cardiac magnetic resonance imaging

Citation: Zhou Q, Wang L, Craft J, Weber J, Passick M, Ngai N, Khalique OK, Goldfarb JW, Barasch E and Cao JJ (2024) A machine learning-derived risk score to predict left ventricular diastolic dysfunction from clinical cardiovascular magnetic resonance imaging. Front. Cardiovasc. Med. 11:1382418. doi: 10.3389/fcvm.2024.1382418

Received: 5 February 2024; Accepted: 27 May 2024;
Published: 6 June 2024.

Edited by:

Ernst Wellnhofer, Berlin Technical University of Applied Sciences, Germany

Reviewed by:

Akhmetzhan Galimzhanov, Semey State Medical University, Kazakhstan
Inga Voges, University Medical Center Schleswig-Holstein, Germany

© 2024 Zhou, Wang, Craft, Weber, Passick, Ngai, Khalique, Goldfarb, Barasch and Cao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: J. Jane Cao, amFuZS5jYW9AY2hzbGkub3Jn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.