- 1Division of Pediatrics III – Cardiology, Pulmonology, Allergology and Cystic Fibrosis, Department of Child and Adolescent Health, Medical University of Innsbruck, Innsbruck, Austria
- 2Division Pediatrics I – Inherited Metabolic Disorders, Department of Child and Adolescent Health, Medical University of Innsbruck, Innsbruck, Austria
- 3Center of Pediatric Cardiology and Congenital Heart Disease, Heart and Diabetes Center North Rhine-Westphalia, Ruhr-University of Bochum, Bad Oeynhausen, Germany
- 4Clinic for Pediatrics, Medizin Campus Bodensee, Friedrichshafen, Germany
Introduction: It is increasingly common to simultaneously determine a large number of metabolites in order to assess the metabolic state of, or clarify biochemical pathways in, an organism (“metabolomics”). This approach is increasingly used in the investigation of the development of heart failure. Recently, the first reports with respect to a metabolomic approach for the assessment of patients with complex congenital heart disease have been published. Classical statistical analysis of such data is challenging.
Objective: This study aims to present an alternative to classical statistics with respect to identifying relevant metabolites in a classification task and numerically estimating their relative impact.
Methods: Data from two metabolomic studies on 20 patients with complex congenital heart disease and Fontan circulation and 20 controls were reanalysed using random forest (RF) methodology. Results were compared to those of classical statistics.
Results: RF analysis required no elaborate data pre-processing. The ranking of the variables with respect to classification impact (subject diseased, or not) was remarkably similar irrespective of the evaluation method used, leading to identical clinical interpretation.
Conclusion: In metabolomic classification in adult patients with complex congenital heart disease, RF analysis as a one-step method delivers the most adequate results with minimum effort. RF may serve as an adjunct to traditional statistics also in this small but crucial-to-monitor patient group.
Introduction
In biology and medicine, the metabolome, i.e., the concentration of metabolites in tissue, body fluids, excrements, exhaled gas, or other biomaterial, reflects the metabolic state of an organism (1). Modern to a high degree automated laboratory technology allows for the simultaneous determination of a vast multitude of metabolites from minute specimens at moderate costs in order to assess the metabolic state of, or clarify biochemical pathways in, an organism (“metabolomics”) (2). One approach to interpret the data is to assess both diseased subjects (“patients”) and healthy controls. Any difference in the metabolomic pattern between groups gives a hint at the metabolic pathways altered in the patient group. Mathematically speaking, the individual metabolomic pattern classifies the subject as either patient or control. Using traditional statistical approaches such as regression modeling, this study aims to rank various metabolites with respect to their classification impact and to identify those metabolites most promising to enlighten biochemical pathways and pathophysiology underlying the disease. The ultimate goal is to develop new concepts for diagnostics, monitoring, and therapy. Such concepts play an important role in cardiology, particularly with regard to the risk of the development of heart failure in the small but increasing group of adult patients with complex congenital heart disease (CHD). Classical statistical analysis of such data typically comprising hundreds of variables is a challenge, especially if the number of subjects to compare is low, if relations between dependent and independent variables are not linear, or if normal distributions or same variance are not met, and while classical statistics' strength is their power to evaluate the accuracy of correlations, they are less accurate for outcome prediction. With machine learning (ML) tools, which can be able to learn from the actual data to analyse instead of pre-programmed assumptions and instructions, there are alternatives for the evaluation of big data, especially allowing for outcome prediction.
Pattern recognition is a strength of artificial neural networks (3) that perform well as a classification tool. However, presenting to the user as “black box,” even from the underlying theory, it is difficult to identify those input variables contributing most to the classification task (3).
The random forest (RF) approach seems to be more promising (4–6). Following a sophisticated algorithm, a large number of individual decision trees with respect to the relationship between an input variable and classification result are generated. Then, each subject's metabolomic pattern is subjected in sequence to all the decision trees, internally noting each classification made. The majority vote of the trees is the class the subject is ultimately assigned to (7). In addition to subject classification, the algorithm delivers both the rank order of the several input variables' classification impact and their numerical classification impact score (8). Thus, it should be easy to identify the most promising metabolites deserving further (statistical) analysis and interpretation.
Machine learning algorithms, among the RFs, have been reported to offer an alternative approach to standard prognostic modeling, especially in the field of adult cardiology (9), e.g., for patients with heart failure for readmission prediction (10) (n = 977 patients, 472 input variables), or patients with angina for ischemia prediction (11) (n = 932 patients, 43 input variables). Data for the application of ML tools, including RF applied to the growing but a much smaller group of adult patients with CHD, are scarce. To date, the London group reported the only large study on ML (neural network, no RF) applied to >10,000 adult patients with various forms of CHD (<20% of them with complex CHD) (12), which showed that ML algorithms trained on large datasets are suitable to estimate prognosis and potentially guide therapy in adult patients with CHD. Chu et al. recently reported the benefit of ML tools, including RF on pregnant women with CHD, to successfully predict maternal or neonatal adverse events (13) (n = 318 patients; heterogeneous cardiac diagnoses, <20% of the patients with complex CHD; 22 input variables). For adult patients with complex CHD (n = 386, among them 40% with Fontan circulation), the very first report on the successful application of ML tools (recursive partitioning) to predict executive function (14) has been recently published. The group of adults with Fontan circulation, where the blood flows passively into the lungs and where the single ventricle pumps the blood into the system, is at the highest risk of all patients with CHD to develop organ dysfunction over time. Thus, this group of patients is crucial to focus on, and it is crucial to monitor and predict outcomes to accurately develop timely interventions. However, the major issue is that these patients are scarce and that uniform monitoring among centers is challenging, as is the application of standard (especially imaging) diagnostic tools, which are designed for a biventricular heart. Using a traditional regression approach (heatmap correlation), we recently reported distinct patterns of serum analytes as derived by metabolomics, taking into account more than hundreds of analytes (“big data”). The hope is, with this large amount of serum data, to ultimately find analyte patterns that early delineate adverse function, hemodynamic, or organ alterations. Taking a step back to the classification of Fontan patients vs. controls, we aim at applying RF, which has not been done before, and proving the RF's performance and feasibility on real metabolomic data. We hypothesized that for research on the metabolism of this small but crucial to scrutinize patient group, RF may present a simple adjunct, or even alternative, to classical statistics with respect to identifying the most relevant variables in a classification task and numerically estimating their relative impact.
Materials and methods
Data from two recent metabolomic studies on 20 adult Fontan patients (univentricular palliation of an underlying complex CHD) and 20 healthy controls matched for age and sex (15, 16) were reanalysed. While in our previous analysis (15, 16), we used a heatmap correlation model to evaluate our hypothesis, in this study, we used RF methodology for classification (programme RF++ rel. 1.0) (17). We developed two training models. Input data were all available raw data from our recently published work, comprising 30 [amino acids (15)] and 110 [phospholipids (16)] metabolomic variables, respectively. Output (prediction) was the classification of patients as diseased (Fontan patient) vs. non-diseased (control proband). The parameter settings—number of variables used, number of candidate predictors randomly drawn for a split (mtry, candidate predictors that are randomly drawn for a split) (n = 6 for the amino acid model; n = 11 for the phospholipid model), and number of trees (n = 12,000 for both the amino acid and the phospholipid model)—were chosen as suggested by the software used (8, 17, 18). On each dataset, we performed several runs: for amino acids, we performed 9 runs: mtry, range 4–12 (default: 6); trees, range 1,000–40,000 (default: 12,000). For lipids, we performed 14 runs: mtry, range 6–15 (default: 11); trees, range 1,000–40,000 (default: 12,000) (Supplementary material 1).
All calculations were done using an ordinary notebook personal computer. The results were compared to those of classical statistics (15, 16). Finally, the method's feasibility was judged.
Results
With respect to amino acids and derivatives, the RF run comprised all 30 metabolites analyzed by classical statistics (15). Subject classification (patient vs. control) was correct in 80% (accuracy score). The ranking of the variables with regard to classification impact given both as “permutation-based proportion” (8) and the slightly more expressive “mean decrease in the margin (MDM)” (19) was strikingly similar irrespective of the evaluation method used: the metabolites with an RF rank of 1–3 appeared among the topmost three metabolites as ranked by classical statistics (Table 1, top), leading to identical clinical interpretation (15).
With respect to phospholipids (16), the RF run simultaneously comprising all 110 metabolites that had been subjected to classical statistics resulted in a correct subject classification of 85%. Compared to classical statistics, RF analysis resulted in a similar ranking of the top variables with regard to classification impact: the RF ranks 1–8 comprised the topmost 3 metabolites ranked by classical statistics (Table 1, bottom), again suggesting identical clinical interpretation (16).
The cumulative impact of amino acids and phospholipids is shown in Figures 1A,B. With a cumulative impact normalized to 100% (per group), it becomes obvious that already a small number of analytes (the topmost ranked 6/30 as for amino acids and the topmost ranked 21/110 as for phospholipids) account for 80% of the classification optimum.
Figure 1. Cumulative classification impact (diseased vs. non-diseased)—normalized to 100%—of the respective metabolites (dotted curves) vs. absolute impact rank. The linear line indicates the analyte's rank normalized to 100%. %, percentage; MDM, mean decrease in the margin (variable importance with respect to classification into diseased vs. non-diseased); NormCumMDM, normalized cumulated mean decrease in the margin; rank, rank with respect to classification impact MDM, starting with the highest impact as rank 1. (A) Amino acids. The topmost ranked 6/30 amino acids (20% of analytes; open circle) account for 80% of the classification total. (B) Phospholipids. The topmost ranked 21/110 phospholipids (19% of analytes; open circle) account for 80% of the classification total.
Doing repeated RF runs, all results proved robust with regard to even wide variations in the RF parameter setting (Supplementary material 1). Each run takes only a few seconds to complete.
Classical statistics required elaborate pre-processing of the data (logspline imputing of values < limit of detection; log2 transformation/normalization; outlier detection and elimination), and p-correction for multiple testing thus losing sensitivity (15, 16). No such data pre-processing was required for the RF analyses (8). While classical statistics required the participation of an expert statistician as a prerequisite for dependable results and sound interpretation, there was no such requirement with respect to RF analysis.
Discussion
Machine learning algorithms have been reported to successfully estimate prognosis and guide therapy for adult patients with various CHD (9–14). As we have shown, also in patients with complex CHD and Fontan circulation, where patient numbers are low and input variable numbers may be large, both ML algorithms (RF analysis in our case) and classical statistics are potent classification tools. With respect to amino acids (phospholipids), the RF topmost ranking 6 (21) metabolites inherit more than 80% of the classification competence. Both methods depict a heavily overlapping (especially true for amino acids, functioning more heterogeneously than phospholipids) albeit not identical bunch of metabolites as most promising for classification (diseased vs. non-diseased), deserving further analysis and interpretation. Thus, the clinical conclusions to be drawn (15, 16) are identical. The topmost ranked metabolites—ranked by either RF or classical statistics—(Met-SO/Met, Met-SO, Glu; phospholipids with medium and long alkyl chains) suggest that in Fontan patients, the signaling and inflammatory pathways hinting at oxidative stress and endothelial dysfunction, as well as energy and structural metabolism are affected, as reported for other cardiovascular circumstances as for patients with heart failure (15, 16, 20, 21). The metabolites ranking ≥5 do not show but small differences in their absolute fold change values between successive ranks; beyond rank 20—particularly with the lipids (as with a set of proteomics data, see performance analysis as mentioned under limitations)—the differences are minimal, rendering further ranking meaningless for clinical interpretation (biological variability, small sample size).
By no means, classical heatmap correlation analysis is an objective method. Actually, its results to some degree depend on the expert operator's parameter setting according to clinical judgement (22). Thus, it must not be regarded as the golden standard for the analysis of such types of data as given here. It might well be that RF analysis reveals interdependencies ignored by traditional regression analysis and vice versa. Hence, both types of analysis are complementing each other.
Although RF analysis is technically much easier to perform than sophisticated classical statistics, it delivers dependable results even to the statistically less experienced. RF parameter settings are uncritical, even if the parameters are well beyond the range suggested by the programme used. Contrary to classical statistics, RF can handle huge numbers of variables per case without performance loss (8). Its output is easy to interpret. While each variable's classification impact is numerically scored as “permutation-based proportion” and “mean decrease in margin (MDM)” (8, 19), classical indicators of the grade of the significance of the various results—equivalent to eta2 and p-value in classical statistics or the here used fold change—are missing. If this is of importance, it is debatable as long as the overall interpretation of the results is the same. Insofar, metabolomic RF analysis should be considered an alternative to classical statistics. In case one feels uneasy about the lack of such measures of determination and prefers classical statistics, in the first step one could perform RF analysis as an adjunct in order to identify the most promising variables, and in the next step subject a set of selected variables to classical statistics thus reducing processing effort.
Limitations, confounders
The analysis presented in this study depends on data of only 40 subjects, hence one should be careful with generalizing our findings on methodology. Independent internal validation would have been desirable for both RF and classical statistics. Two features might, however, compensate to some degree for this missing internal validation: For RF, we used out-of-bag (OOB) prediction, meaning that every tree in the RF results from only a fraction of the data points in the dataset. OOB predictions with respect to a specific variable are generated by using only those trees that did not use that variable to generate the predictions. This provides a more unbiased estimate of the prediction error of the random forest and should give a similar error rate to that when making predictions on a new and independent dataset. Still, while the OOB error rate in RF is quite acceptable, due to the lack of an independent validation sample for classical statistics, an error rate cannot be provided, thus precluding the comparison of the two methods in this regard.
For example, in another performance estimate on the same 40 subjects and a set of 526 proteomics-derived metabolites per subject (work submitted), RF analysis tagged 4 of the 6 most promising metabolites according to classical statistics as most important (the odds of such a hit by guessing are <1 in a million), underlining the potential of RF. As for the cumulative impact, the topmost ranked 27/526 analytes accounted for 80% of the classification total.
With RF, we used easy-to-operate software freely available online and fully meeting our demands (8, 17). But, in other settings, more sophisticated RF software might be more adequate (23).
Conclusion
Our preliminary data on a small patient group implies that RF is applicable to the scarce but crucial to monitor group of patients with complex diseases which impact metabolism. In a metabolomic classification setting, RF analysis as a one-step method delivers the most adequate results with a minimum of effort. RF analysis is both a compelling stand-alone tool and an adjunct to classical statistics. These conclusions have to be proven by analysis of a larger patient number.
Study registration number
Study no. 20210803-2653 (registered at the Competence Center for Clinical Trials, Medical University of Innsbruck).
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors. This study is a subwork of the main study protocol (Trial registration number: ClinicalTrials.gov Identifier NCT03886935).
Ethics statement
The studies involving human participants were reviewed and approved by Local Ethics Committee of the Medical University of Innsbruck, Austria (AN2015-0303 357/4.3 4507a) and of the Heart and Diabetes Center North Rhine-Westphalia, Ruhr-University of Bochum, Germany (AZ 53/2016). The methods were carried out in accordance with the Helsinki Declaration and the International Conference on Harmonization Good Clinical Practice Guidelines. The patients/participants provided their written informed consent to participate in this study.
Author contributions
Conceptualization, validation, data curation, writing—original draft preparation, supervision, and project administration: MM and EM. Methodology, formal analysis, and visualization: EM. Investigation and writing—review and editing: MM, EM, KL, K-OD, and SS-B. Resources and funding acquisition: MM. All authors have read and agreed to the published version of the manuscript.
Funding
MM received funding from the Austrian Science Fund (no. KLI1036-B), the Austrian Society of Pediatrics (no. 04/2021), the Tiroler Wissenschaftsförderung (no. UNI-0404-2126), and the Medizinischer Forschungsfonds Tirol (no. 327). The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2022.994068/full#supplementary-material
References
1. Tweeddale H, Notley-McRobb L, Ferenci T. Effect of slow growth on metabolism of Escherichia coli, as revealed by global metabolite pool (“metabolome”) analysis. J Bacteriol. (1998) 180:5109–16. doi: 10.1128/JB.180.19.5109-5116.1998
2. Gonzalez-Covarrubias V, Martinez-Martinez E, Del Bosque-Plata L. The potential of metabolomics in biomedical applications. Metabolites. (2022) 12:194. doi: 10.3390/metabo12020194
3. Michel E, Zernikow B, Wichert SA. Use of an artificial neural network (ANN) for classifying nursing care needed, using incomplete input data. Med Inform Internet Med. (2000) 25:147–58. doi: 10.1080/14639230050058329
4. Ho TK. Random Decision Forests. In: Proceedings of the 3rd International Conference on Document Analysis and Recognition. Montreal, QC; New York, NY: IEEE (1995). p. 278–82. doi: 10.1109/ICDAR.1995.598994
5. Amit Y, Geman D. Shape quantization and recognition with randomized trees. Neural Computation. (1997) 9:1545–88. doi: 10.1162/neco.1997.9.7.1545
7. Fawagreh K, Gaber MM, Elyan E. Random forests: from early developments to recent advancements. Syst Sci Control Eng. (2014) 2:602–9. doi: 10.1080/21642583.2014.956265
8. Karpievitch YV, Hill EG, Leclerc AP, Dabney AR, Almeida JS. An introspective comparison of Random Forest-based classifiers for the analysis of cluster-correlated data by way of RF. PLoS ONE. (2009) 4: e7087. doi: 10.1371/journal.pone.0007087
9. Krittanawong C, Zhang H, Wang Z, Aydar M, Kitai T. Artificial intelligence in precision cardiovascular medicine. J Am Coll Cardiol. (2017) 69:2657–64. doi: 10.1016/j.jacc.2017.03.571
10. Mortazavi BJ, Downing NS, Bucholz EM, Dharmarajan K, Manhapra A, Li SX, et al. Analysis of machine learning techniques for heart failure readmissions. Circ Cardiovasc Qual Outcomes. (2016) 9:629–40. doi: 10.1161/CIRCOUTCOMES.116.003039
11. Hae H, Kang SJ, Kim WJ, Choi SY, Lee JG, Bae Y, et al. Machine learning assessment of myocardial ischemia using angiography: development and retrospective validation. PLoS Med. (2018) 15:e1002693. doi: 10.1371/journal.pmed.1002693
12. Diller GP, Kempny A, Babu-Narayan SV, Henrichs M, Brida M, Uebing A, et al. Machine learning algorithms estimating prognosis and guiding therapy in adult congenital heart disease: data from a single tertiary centre including 10019 patients. Eur Heart J. (2019) 40:1069–77. doi: 10.1093/eurheartj/ehy915
13. Chu R, Chen W, Song G, Yao S, Xie L, Song L, et al. Predicting the risk of adverse events in pregnant women with congenital heart disease. J Am Heart Assoc. (2020) 9:e016371. doi: 10.1161/JAHA.120.016371
14. Majeed V A, Rofeberg, Bellinger DC, Wypij D, Newburger JW. Machine learning to predict executive function in adolescents with repaired d-transposition of the great arteries, tetralogy of Fallot, and Fontan palliation. J Pediatr. (2022) 246:145–53. doi: 10.1016/j.jpeds.2022.03.021
15. Michel M, Dubowy KO, Entenmann A, Karall D, Adam MG, Zlamy M, et al. Targeted metabolomic analysis of serum amino acids in the adult Fontan patient with a dominant left ventricle. Sci Rep. (2020) 10:8930. doi: 10.1038/s41598-020-65852-x
16. Michel M, Dubowy KO, Zlamy M, Karall D, Adam MG, Entenmann A, et al. Targeted metabolomic analysis of serum phospholipid and acylcarnitine in the adult Fontan patient with a dominant left ventricle. Ther Adv Chronic Dis. (2020) 11:1–25. doi: 10.1177/2040622320916031
17. Karpievitch YV, Leclerc AP, Hill EG, Almeida JS. RF++: Improved Random Forest for Clustered Data Classification. Available online at: http://sourceforge.net/projects/rfpp/files/rfpp/1.0.0 (accessed July 03, 2021).
18. Karpievitch YV, Leclerc AP. RF++ User Manual rel. 1.0.0. Available online at: http://sourceforge.net/projects/rfpp (accessed July 03, 2021).
19. Yang F, Li X, Zhou Q, Luo L. Margin based variable importance for random forest. In: 2011 6th International Conference on Computer Science & Education (ICCSE) New York, NY: IEEE IEEE. doi: 10.1109/ICCSE.2011.6028885
20. Neubauer S. The failing heart - an engine out of fuel. N Engl J Med. (2007) 356:1140–51. doi: 10.1056/NEJMra063052
21. Arimura K, Egashira K, Nakamura R, Ide T, Tsutsui H, Shimokawa H, et al. Increased inactivation of nitric oxide is involved in coronary endothelial dysfunction in heart failure. Am J Physiol Heart Circ Physiol. (2001) 280:H68–75. doi: 10.1152/ajpheart.2001.280.1.H68
22. Horne E, Tibble H, Sheik A, Tsanas A. Challenges of clustering multimodal clinical data: review of applications in asthma subtyping. JMIR Med Inform. (2020) 5:e16452. doi: 10.2196/16452
Keywords: congenital heart disease, Fontan, metabolomics, random forest, statistics
Citation: Michel M, Laser KT, Dubowy K-O, Scholl-Bürgi S and Michel E (2022) Metabolomics and random forests in patients with complex congenital heart disease. Front. Cardiovasc. Med. 9:994068. doi: 10.3389/fcvm.2022.994068
Received: 14 July 2022; Accepted: 12 September 2022;
Published: 05 October 2022.
Edited by:
Umberto Morbiducci, Politecnico di Torino, ItalyReviewed by:
Michela Sperti, Politecnico di Torino, ItalyPaul Clift, University Hospitals Birmingham NHS Foundation Trust, United Kingdom
Copyright © 2022 Michel, Laser, Dubowy, Scholl-Bürgi and Michel. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Miriam Michel, bWlyaWFtLm1pY2hlbCYjeDAwMDQwO2ktbWVkLmFjLmF0