Skip to main content

ORIGINAL RESEARCH article

Front. Endocrinol., 11 June 2024
Sec. Reproduction

Prediction model of preeclampsia using machine learning based methods: a population based cohort study in China

Taishun Li,&#x;Taishun Li1,2†Mingyang Xu&#x;Mingyang Xu3†Yuan WangYuan Wang1Ya WangYa Wang1Huirong TangHuirong Tang1Honglei DuanHonglei Duan1Guangfeng ZhaoGuangfeng Zhao1Mingming Zheng*Mingming Zheng1*Yali Hu*Yali Hu1*
  • 1Department of Obstetrics and Gynecology, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China
  • 2Medical Statistics and Analysis Center, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China
  • 3Information Management Division, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China

Introduction: Preeclampsia is a disease with an unknown pathogenesis and is one of the leading causes of maternal and perinatal morbidity. At present, early identification of high-risk groups for preeclampsia and timely intervention with aspirin is an effective preventive method against preeclampsia. This study aims to develop a robust and effective preeclampsia prediction model with good performance by machine learning algorithms based on maternal characteristics, biophysical and biochemical markers at 11–13 + 6 weeks’ gestation, providing an effective tool for early screening and prediction of preeclampsia.

Methods: This study included 5116 singleton pregnant women who underwent PE screening and fetal aneuploidy from a prospective cohort longitudinal study in China. Maternal characteristics (such as maternal age, height, pre-pregnancy weight), past medical history, mean arterial pressure, uterine artery pulsatility index, pregnancy-associated plasma protein A, and placental growth factor were collected as the covariates for the preeclampsia prediction model. Five classification algorithms including Logistic Regression, Extra Trees Classifier, Voting Classifier, Gaussian Process Classifier and Stacking Classifier were applied for the prediction model development. Five-fold cross-validation with an 8:2 train-test split was applied for model validation.

Results: We ultimately included 49 cases of preterm preeclampsia and 161 cases of term preeclampsia from the 4644 pregnant women data in the final analysis. Compared with other prediction algorithms, the AUC and detection rate at 10% FPR of the Voting Classifier algorithm showed better performance in the prediction of preterm preeclampsia (AUC=0.884, DR at 10%FPR=0.625) under all covariates included. However, its performance was similar to that of other model algorithms in all PE and term PE prediction. In the prediction of all preeclampsia, the contribution of PLGF was higher than PAPP-A (11.9% VS 8.7%), while the situation was opposite in the prediction of preterm preeclampsia (7.2% VS 16.5%). The performance for preeclampsia or preterm preeclampsia using machine learning algorithms was similar to that achieved by the fetal medicine foundation competing risk model under the same predictive factors (AUCs of 0.797 and 0.856 for PE and preterm PE, respectively).

Conclusions: Our models provide an accessible tool for large-scale population screening and prediction of preeclampsia, which helps reduce the disease burden and improve maternal and fetal outcomes.

1 Introduction

Pre-eclampsia (PE) is one of the great obstetrical syndromes (13) and affects 2–5% of pregnancies worldwide. PE is a major cause of maternal and perinatal morbidity and mortality (4, 5), accounting for 70000 maternal deaths and 500000 fetal deaths worldwide every year (6). The pathogenesis of PE remains unclear and curative treatments are limited in clinical practice, with placental ischemia, endothelial dysfunction, and immune maladaptation being possible mechanisms leading to PE (79). Previous researches showed that early intervention with aspirin given from 12 gestational weeks can effectively reduce the occurrence of PE (10, 11). Thus, accurately predicting and identifying high-risk groups of PE during the first trimester of pregnancy is beneficial for timely prevention strategies and improving maternal and fetal outcomes. The prevailing strategy for PE screening involves identifying risk factors based on maternal demographic characteristics and medical history (12, 13). As outlined in the guidelines from the American College of Obstetricians and Gynecologists (ACOG), Obstetrics and Gynecology branch of the Chinese Medical Association, if the pregnant women exhibit any high-risk factors (like the history of PE, chronic hypertension, renal disease, type 1 or 2 diabetes and autoimmune disease etc.) or if they have at least two moderate-risk factors (such as age≥40 years, nulliparity, etc.) they should take asplin (14, 15). An increasing body of evidence suggests that the incorporation of maternal history with some physical signs, such as mean arterial pressure (MAP), uterine artery pulsatility index (UtA-PI), and biomarkers such as serum pregnancy-associated plasma protein A (PAPP-A), and serum placental growth factor (PLGF) will improve the prediction efficiency of PE. The Competing risks model in screening for PE maternal characteristics and medical history established by wright indicates that the model-based Bayes theorem using the combination of a priori risk from maternal characteristics and the biomarkers results greatly improved the overall screening performance of PE (16, 17).

Recently, Machine Learning (ML), a subset of artificial intelligence, has emerged as a revolutionary tool in the realm of complex diseases prediction and diagnosis (18, 19). With the capacity to process vast amounts of data and extract meaningful patterns, ML algorithms have been instrumental in early disease detection, enhancing diagnostic accuracy, and offering insights beyond the capabilities of traditional methods. Ansbacher-Feldman Z (19) and Gil MM (20) research have found that machine-learning models utilizing neural networks can effectively screen for PE with high accuracy, using maternal characteristics and raw biomarker data. Melinte-Popescu AS’s study (21) included four machine learning-based models: decision tree (DT), naïve Bayes (NB), support vector machine (SVM), and random forest (RF) for PE screening in the first trimester, the study indicates that machine learning-based models could be useful tools for PE prediction in the first trimester of pregnancy. Torres-Torres J (22) study also finds that elastic net regression offers a potential solution for developing accurate and efficient prediction models for PE and offers significant clinical benefits. The predictive model performance of PE may vary among different ethnic groups. Currently, there are few studies on developing machine learning algorithm-based prediction models specifically for Chinese population cohorts. Liu M study (23) shows that machine learning, particularly using RF, accurately predicts PE by using clinical history and prenatal screening results in a retrospective cohort study in China.

In this study, we develop predictive models for PE using new ML techniques such as the Extra Trees Classifier (ETC), Voting Classifier (VC), Gaussian Process Classifier (GPC), and Stacking Classifier (SC) based on prospective cohort study in China. These advanced algorithms offer more nuanced and potentially more accurate predictive capabilities compared to traditional machine learning algorithms.

2 Materials and methods

2.1 Study population

This was a prospective cohort longitudinal study from early pregnancy within 14 weeks of gestation (with a crown-rump length of 45–84 mm) to childbirth for PE studybased on the combined screening for fetal aneuploidy in early pregnancy. This study included 5116 singleton pregnant women who underwent PE screening and fetal aneuploidy at the affiliated Nanjing Drum Tower Hospital of Nanjing University Medical School from January 2017 to September 2020. This study excluded patients with incomplete information (including 275 missing PLGF data and 34 missing UtA-PI data), and those who experienced natural miscarriages before 28 weeks (56 cases). Also, patients who terminated their pregnancy for personal reasons (6 cases), those who terminated due to fetal malformations or chromosomal abnormalities (18 cases), and those lost to follow-up (83 cases) were excluded. A total of 4,644 participants were ultimately included in this analysis. This study protocol was approved by the institutional review board of Nanjing Drum Tower Hospital (2016–113–01). This study followed the TRIPOD statement for reporting (24).

2.2 Model covariates and outcome

Covariates in the prediction model included (1) maternal demographic characteristics (maternal age, height, pre-pregnancy weight, nulliparous, method of conception [natural; ovulation induction; in-vitro fertilization-embryo transfer], family history of PE, and smoking); (2) past medical history (history of PE, history of chronic hypertension, history of chronic kidney disease, type 1 or type 2 diabetes, history of systemic lupus erythematosus and/or antiphospholipid syndrome); (3) Biophysical markers (MAP, UtA-PI); (4) Biochemical markers (PAPP-A and PLGF). The measurement of biophysical and biochemical markers was conducted between 11 weeks and 13 + 6 weeks of gestation.

The reasons for choosing four biophysical or biochemical markers as predictive factors are as follows: UtA-PI is a measure of the resistance to blood flow in the uterine arteries. High resistance (high UtA-PI) suggests poor placentation, as it reflects the inadequate remodeling of spiral arteries. The uterine blood supply consists of a vascular structure decreasing in size as it progresses through the myometrium and endometrium, culminating in spiral arteries. During early pregnancy, >100 spiral arteries are remodeled into high-flow uteroplacental vessels with low resistance, to ensure the provision of an adequate blood supply to the developing fetus. These vascular changes are therefore crucial for decreasing maternal vascular resistance and increasing uteroplacental blood flow by up to ten-fold during this time (from ~50 ml per minute pre-pregnancy to ~500 ml per minute upon completion of placentation). Impaired or incomplete spiral artery remodeling is implicated in PE, intrauterine growth restriction and recurrent miscarriage, due to various degrees of insufficient blood flow to the fetus. In PE pregnancies, abnormal spiral artery remodeling with incomplete placentation and poor placental perfusion, can lead to maternal systemic hypoxia and hypertensive pathology, activation of the maternal renal and cardiovascular systems with endothelial damage, and potential end-organ damage (2528). MAP is a composite measure of cardiac output and systemic vascular resistance. Systemic vascular resistance is increased due to endothelial dysfunction in PE, which can be reflected in elevated MAP early in pregnancy before clinical symptoms appear (29); PAPP-A is a protein produced by the placenta. Low levels in the first trimester have been associated with poor placental development and function, leading to increased risk of PE (30); PLGF is an angiogenic factor that promotes placental blood vessel development. Low levels of PLGF are indicative of placental insufficiency and have been linked to the development of PE (31). The outcome in this study was the development of PE, PE is defined as the occurrence of a systolic blood pressure of ≥140 mmHg and/or a diastolic blood pressure of ≥90 mmHg in pregnant women after 20 weeks of gestation, accompanied by any one of the following: a urinary protein quantitation of ≥0.3 g/24 h, a urine protein/creatinine ratio of ≥0.3, or a random urinary protein level of ≥ (+) (as a testing method when protein quantitation is not conducted unconditionally). We divided PE into preterm PE (delivery gestational week <37 weeks) and term PE (delivery gestational week ≥37 weeks) (32). Gestational week in this study was determined by measurements of fetal crown-rump length (CRL) within the first trimester of pregnancy (33).

2.3 Quality control

All selected pregnant women were interviewed on-site by researchers to collect their medical history. The pregnancy outcomes for women who gave birth in our hospital were obtained from medical records (accounting for 90.8%), while those for women who gave birth in other hospitals were collected through dedicated telephone follow-ups (accounting for 9.2%). All research data were collected with the Viewpoint 6.0 software by data administrators.

Quality control standards for the detections of biophysical and biochemical markers were as follows (1). MAP: Blood pressure was measured on-site in a standardized manner using an automatic blood pressure measuring device (3BTO-A2, Microlife Corporation, Taiwan, China) by trained designated doctors. The blood pressure monitor was regularly calibrated by the hospital’s quality inspection department. Before measuring blood pressure, pregnant women were seated comfortably for at least 5 minutes, and a cuff of appropriate size was selected based on the arm circumference. The blood pressure of both arms were measured simultaneously, recorded every 1 minute, until the difference in consecutive readings was within 10 mmHg for systolic pressure and within 6 mmHg for diastolic pressure. The MAP for both arms were calculated based on the average of the last two stable measurements. The final blood pressure was determined by the higher average MAP of the two arms (34); (2) UtA-PI: The measurement of UtA-PI was conducted using the Voluson E8 color Doppler ultrasound diagnostic device from GE, USA, with the probe models RAB6-D/0B and RAB4–8-D/OB, and a frequency of 4–6 MHz. The measurement was performed by ultrasound doctors who had participated in FMF ultrasound technology training and passed the uterine artery monitoring qualification certification. In accordance with FMF measurement standards, pulse wave Doppler was used to obtain three similar continuous waveforms from the ascending uterine artery at the level of the cervical internal. The UtA-PI of both sides was measured and the average UtA-PI was calculated (35); (3) PAPP-A: PAPP-A was derived from the records of serum screening for Down syndrome during early pregnancy in our hospital. It was detected using the AutoDELFIA 1235 automatic immune analysis system (time-resolved fluorescence immunoassay method, AutoDELFIA PAPP-A reagent kit, PerkinElmer Company, Finland); (4) PLGF: Blood samples from all subjects were collected on the day of enrollment and sent to the sample bank for centrifugation within 2 hours without anticoagulants. After drawing blood for PlGF, the sample was inverted at least 5 times, then the clot was allowed to sit at room temperature for about 30 minutes before centrifugation. After centrifugation, the serum was stored at -80°C. The PlGF value was measured using the AutoDELFIA 1235 automatic immune analysis system (time-resolved fluorescence immunoassay, PerkinElmer AutoDELFIA PlGF reagent kit, Finland) or Cobas e602 system (Roche Diagnostics, Germany). Quality control requirements stipulate that the coefficient of variation for measurements of quality control materials with different concentrations in each batch must be less than 5%; and the measurement range for quality control materials in each test should be within two standard deviations.

2.4 Model development

In our model, five classification algorithms including Logistic Regression (LR)[2], Extra Trees Classifier (ETC), Voting Classifier (VC), Gaussian Process Classifier (GPC)[3] and Stacking Classifier (SC) were applied. To enhance the overall accuracy and robustness of our model, we incorporated certain ensemble learning techniques [6], the VC and SC which are frequently deployed in algorithm competitions. As for VC, multiple models were trained on the same dataset, and their predictions were combined to make a final prediction. The VC can be used for classification problems, where each individual model is an estimator, and the final prediction is made by combining the predictions of all the classifiers using a voting strategy. For our model, we chose Random Forest (RF) and ETC as its estimators. The voting criteria we used was soft voting, which calculates the voting based on the estimators’ predicted probabilities.

For the SC, multiple models, known as base models, were trained on the same dataset. Instead of directly combining their predictions, these outputs served as input features for another model. This subsequent model, the ‘meta’ classifier or ‘meta learner’, is particularly important in the stacking approach. In our setup, we employed the GPC as the meta classifier. This model was trained to make the final prediction, informed by the outputs of the base models. The unique advantage of the SC is its capability to discern nuanced patterns and relationships in the predictions of individual base models. For our base models, we opted for Support Vector Machines (SVM), ETC, and GPC. By strategically leveraging these models through the SC, we aim to harness the individual strengths of each model, potentially surpassing the accuracy and consistency that any single model could achieve on its own.

Since our dataset was unbalanced with a 1:23 positive and negative weights, we set class weight of LR, ETC, and all the estimators of VC to balanced, this automatically adjusted the weights inversely proportional to their frequencies in the data. Before training, we normalized all the data in the range from 0 to 1, which could help improve the performance and stability of training.

To optimize the performance of each algorithmic model, we implemented Bayesian Optimization, conducting hyperparameter tuning across 20 repetitions of 5-fold cross-validation. The Area Under the Receiver Operating Characteristic (AUC-ROC) curve was chosen as the metric to evaluate the effectiveness of tuning for each model. Given the complex nature of voting classifiers and stacking classifiers, which, as ensemble learning techniques, amalgamate diverse models and thus exhibit a vast hyperparameter space, we strategically tuned the hyperparameters of each constituent estimator in isolation. Following this meticulous individual optimization, we then proceeded to integrate these finely tuned estimators.

2.5 Model evaluation

We evaluated the performance of our model using both discrimination and calibration metrics. We used the area under the receiver operating characteristic curve (AUC-ROC), sensitivity, specificity to evaluate the discrimination of our model. We also determined the cut-off value for classification using the Youden Index[8], which is a commonly used algorithm for calculating the optimal cut-off point. This algorithm determines the cut-off value when sensitivity = 1 - specificity in the ROC-curve. We evaluated the calibration of our model using the Brier score, calibration slope, and calibration intercept. The Brier score measures the accuracy of probabilistic predictions, while the calibration slope and intercept indicate the reliability of the predicted probabilities.

ML models are often considered “black boxes,” making it difficult to interpret their results. In our study, we introduced SHAP (SHapley Additive exPlanations) values[9] to break down our models and explain their predictions. We plotted a beeswarm chart and a pie chart for the VC model to analyze the distribution of the feature values and the contribution of each feature.

2.6 Model validation

To validate the performance of our model, we applied a 5-fold cross-validation with an 8:2 train-test split. We conducted 200 repetitions of cross-validation with random seeds for all the models individually. After each fold in every repetition, we recorded the performance measurements. Upon concluding 1000 evaluations (5 folds * 200 repetitions), we employed the bootstrapping method to calculate the 95% confidence interval of the aggregate results, thus providing a robust measure of the model’s reliability.

2.7 Statistical analysis

Our model was developed using Scikit-learn 1.2.0 (sklearn), a widely recognized open-source ML library in Python. To ensure rigorous and transparent evaluation, all performance metrics were presented as point estimates accompanied by 95% confidence intervals (CIs). This interval estimation approach was pivotal in comparing the performance of different models. The selection of the best model was grounded on its Area Under the Receiver Operating Characteristic (AUROC), ascertained from the validation set. For the statistical description of continuous metrics, we employed mean and standard deviation, whereas categorical data were characterized using frequencies and percentages.

2.8 Comparison to previous studies

We have also evaluated our optimal algorithm for predicting PE or Pre-term PE, contrasting it with algorithms from previous studies conducted over the past five years. Studies were selected based on the following criteria: (1) The document type was required to be an original article; (2) The model predictive factors were required to include maternal baseline and clinical biomarkers; (3) Clinical biomarkers were required to include at least two of PLGF, PAPP-A, UtA-PI and MAP; (4) Preterm PE was required to be defined as occurring before 37 weeks of gestation; (5) The compared algorithm was required to be the best-performing one in the study; (6) All predictive indicators was required to be obtained early in pregnancy (within 14 weeks of gestation). All selected publications needed to report the sample size, data sources, model algorithms, predictive factors used in the model, and model performance indicators.

3 Results

3.1 Study population characteristics

A total of 5116 singleton pregnant women participated in this cohort, 472 were excluded for various reasons (Figure 1), and 4644 were ultimately included for analysis. Among them, 210 pregnant women developed PE, with an incidence of 4.5%. This includes 49 cases of preterm PE and 161 cases of term PE. A detailed flow chart of the study is presented in Figure 1. Table 1 shows maternal characteristics and biophysical markers for study subjects with and without PE (used in the prediction model of PE). Subjects who developed PE had older age and higher weight than those without PE. Participants with a history of PE, chronic hypertension, or a family history of PE were more likely to develop PE. The MAP in those who developed PE was higher than those who did not develop PE, but with lower level of PLGF and PAPP-A. There was statistically significant difference in maternal age, pre-pregnancy weight, nulliparous, history of diabetes mellitus, history of chronic hypertension, family history of PE, MAP, UtA-PI and PAPP-A between the preterm PE group and no preterm PE group.

Figure 1
www.frontiersin.org

Figure 1 The flow chart of the study.

Table 1
www.frontiersin.org

Table 1 Study population characteristics.

3.2 Model performance

The reported area under the AUC, sensitivity, specificity for PE model on the test set are summarized in Table 2. The prediction was performed based on the maternal demographic characteristics and medical history data and those characteristics plus MAP, PAPP-A, UtA-PI and PLGF respectively. The AUC increased consistently with the addition of the biomarkers. Specifically, the AUC for PE model in VC algorithms increased from 0.746 to 0.814 when MAP was added in the model, the sensitivity increased from 0.678 to 0.755, but the specificity was remained unchanged at 0.866. When adding the other biomarkers for VC algorithms, the results were slightly improved (AUC of 0.831 and sensitivity of 0.770). Compared to other algorithms, the LR algorithm performed better in terms of AUC in most model scenarios for all PE prediction, especially achieving the best AUC in the scenario of Maternal Characteristics plus MAP (LR: AUC=0.816; ETC: AUC=0.811; VC: AUC=0.814; GPC: AUC=0.814; SC: AUC=0.811).ROC plots with respective AUC of LR, ETC, VC, GPC and SC algorithms for PE model with maternal characteristics plus all biomarkers are shown in Figure 2. The AUC value for each of the LR, ETC, VC, GPC and SC algorithms was 0.824, 0.817, 0.832, 0.828 and 0.825 respectively. The performance results of the term PE model are shown in the Supplementary Table S1.

Table 2
www.frontiersin.org

Table 2 Performance of machine learning algorithms in the preeclampsia model.

Figure 2
www.frontiersin.org

Figure 2 Receiver operating characteristics (ROC) curves for the five ML algorithms in the PE prediction (Maternal Characteristics + MAP + UtA-PI + PLGF + PAPP-A). The plot displays the ROC curve results from the validation set for each of the five model, differentiated by distinct colors. The mean ROC representing the average outcome across all folds within a single epoch of cross-validation. The dashed line is a reference line. AUC is the area under the ROC curve.

The performance results of the preterm PE model are presented in Table 3. Compared to the PE prediction model, the evaluation indicators of preterm PE predictive model in VC algorithms based on the maternal characteristics plus MAP have increased (AUC of 0.842 and sensitivity of 0.762). Notably, the AUC value for predicting PE with maternal characteristics plus the all biomarkers we used was highest in the VC and GPC (Figure 3). When the prediction performances were compared among the five algorithms, the VC algorithms had the best performance for predicting preterm PE. The AUC of the VC algorithms was 0.884, sensitivity was 0.860, specificity was 0.834. Details about the model’s calibration are provided in the Appendix, specifically within Supplementary Tables S2-5.

Table 3
www.frontiersin.org

Table 3 Performance of machine learning algorithms in the preterm preeclampsia model.

Figure 3
www.frontiersin.org

Figure 3 Receiver operating characteristics (ROC) curves for the ML learning algorithms in the preterm PE prediction (Maternal Characteristics + MAP + UtA-PI + PLGF + PAPP-A). The plot displays the ROC curve results from the validation set for each of the five model, differentiated by distinct colors. The mean ROC representing the average outcome across all folds within a single epoch of cross-validation. The dashed line is a reference line. AUC is the area under the ROC curve.

The FMF competing risk models for predicting PE and preterm PE, incorporating maternal characteristics plus all biomarkers, exhibit AUCs of 0.797 and 0.856, respectively. The performance for PE or preterm PE using ML algorithms in this study was similar to that achieved by the FMF competing risk model under the same predictive factors. Details about the performances of FMF competing risk model for PE and preterm PE prediction are provided in Supplementary Table S6.

3.3 Contribution of variables to prediction accuracy

The influence of each variable on prediction accuracy was evaluated by Shapley values, a large absolute Shapley value indicates that the input variable typically contributes more to decision making. The Shapley values represent the average contribution to the score of each input variable when computed with different combinations of the other variables. Contributions of variables to PE prediction are shown in Figure 4. The highest contribution was provided by MAP, for which a high value led to a high risk for preterm PE, followed in order by PLGF, pre-pregnancy weight, PAPP-A, nulliparous, UtA-PI, history of chronic hypertension, assisted reproduction, maternal age, and previous history of PE (Figures 4A, C). From the feature contribution distribution chart (Figure 4B), the contribution ratio of MAP for PE prediction is 44.3%, with the maternal characteristics contributing 29.6%. This is followed by three biomarkers: PLGF, PAPP-A, and UtA-PI, which account for 11.9%, 8.7%, and 5.5%, respectively.

Figure 4
www.frontiersin.org

Figure 4 Contribution of variables to PE prediction based on Shapley values. (A) The mean Shapley values of predictors of PE based on the VC model. Input variables are ordered by their mean Shapley value, which reflect the relative importance of the predictors in PE predictive model. (B) The pie chart of the feature group contribution based on the VC model for PE prediction. the average feature group contribution was calculated based on the mean absolute Shapley values for each feature group. (C) Shapley summary plot for PE predictors based on the VC model. The input predictors (Y-axis) are ordered by the mean absolute Shapley values, which represent their average influence on the model output. Values of the predictor for each subject are colored by their Shapley values, red indicating high contribution and blue indicating low contribution. Positive Shapley values indicate increased risk for PE and negative Shapley values indicate decreased risk for PE.

The relative importance of the selected variables in the preterm PE prediction model is described in Figure 5. The most important predictive factor is still MAP, followed by PAPP-A, pre-pregnancy weight, UtA-PI, maternal age, and PLGF. Other indicators, such as smoking, had a very limited contribution (Figures 5A, C). The contributions of MAP and maternal characteristics to the prediction of preterm PE are 37.0% and 29.8% respectively. Among the three biomarkers, PAPP-A has the highest proportion at 16.5%, while the remaining PLGF and UtA-PI account for 7.2% and 9.5% respectively (Figure 5B).

Figure 5
www.frontiersin.org

Figure 5 Contribution of variables to preterm PE prediction based on Shapley values. (A) The mean Shapley values of predictors of preterm PE based on the VC model. Input variables are ordered by their mean Shapley value, which reflect the relative importance of the predictors in preterm PE predictive model. (B) The pie chart of the feature group contribution based on the VC model for preterm PE prediction. the average feature group contribution was calculated based on the mean absolute Shapley values for each feature group. (C) Shapley summary plot for preterm PE predictors based on the VC model. The input predictors (Y-axis) are ordered by the mean absolute Shapley values, which represent their average influence on the model output. Values of the predictor for each subject are colored by their Shapley values, red indicating high contribution and blue indicating low contribution. Positive Shapley values indicate increased risk for preterm PE and negative Shapley values indicate decreased risk for preterm PE.

3.4 Comparison to previous studies

We searched PubMed for articles on prediction models for PE published between 2019 and 2023, including both developed and validated models. We retrieved a total of 242 publications within the past five years from the PubMed database using the search terms (Preeclampsia [Title]) AND (prediction [Title]). Ultimately, nine relevant publications were selected based on the inclusion and exclusion criteria (Table 4). In the articles predicting all PE, we identified that the Ansbacher-Feldman (19) study from the UK and the Gil, M. M (20) study from Spain employed the same clinical biomarkers as we did. The predictive model AUCs in these two studies were 0.817 and 0.848, respectively, and the detection rates at a 10% false positive rate were 0.529 and 0.548, respectively, which are comparable to the levels of VC algorithm in our study (AUC=0.831, DR at 10%FPR=0.513). Similarly, the two aforementioned studies used the same predictive factors to forecast pre-term PE, with AUCs of 0.909 and 0.912, respectively. The detection rates at a 10% false positive rate were 0.753 and 0.778, slightly higher than VC algorithm in our study (AUC=0.884, DR at 10%FPR=0.625).

Table 4
www.frontiersin.org

Table 4 Predictive performances of the Voting Classifier model for the PE compared to those from previous studies.

4 Discussion

In this study, we successfully developed a fully automated prediction model for all PE and preterm PE by using various ML algorithms. Compared with other prediction algorithms, the AUC and detection rate at 10% FPR of the VC algorithm showed better performance in the prediction of PE (AUC=0.831, DR at 10%FPR=0.513) and preterm PE (AUC=0.884, DR at 10%FPR=0.625). For predicting PE and preterm PE, the most crucial predictive factors were MAP and maternal characteristics. In predicting all PE, the contribution of PLGF was higher than PAPP-A (11.9% versus 8.7%), whereas the situation was reversed in the prediction of preterm PE (7.2% versus 16.5%).

The guidelines of both the International Society for the Study of Hypertension in Pregnancy (39) and the International Federation of Gynecology and Obstetrics (FIGO) (6) emphasized the critical importance of early prediction and prevention of PE for reducing the incidence of PE, and directly improving the health outcomes of the maternal and newborns. FMF competing risk model had provided significant insights and an effective method for PE prediction (40). Our study showed that the AUC for the PE prediction model, based on the competing risk model with maternal characteristics plus all biomarkers, was 0.797, while the AUC for the preterm PE prediction model was 0.856. Some researchers had adopted other predictive algorithms for PE prediction, demonstrating similar model performance. For example, Tarca, A. L. et al. achieved good performance in predicting PE using a multivariate Poisson regression model based on maternal baseline, biophysical and biochemical biomarkers (36). Our study constructed prediction models for PE using several ML algorithms and obtained similar model performance. However, the biomarkers values in the ML algorithms do not need to be expressed as multiples of the median (MoM) and adjusted for gestational age and various maternal factors, the application scenarios for ML algorithms may be more extensive. ML algorithms can particularly utilize raw clinical data directly and are capable of analyzing complex, nonlinear, and high-dimensional data, aligning with the practical scenarios of PE clinical predictions. Additionally, ML algorithms learn and adapt to new data, leading to continuous improvement in predictions over time. Therefore, establishing rapid and simple prediction tools, such as online web pages, based on ML algorithms is suitable for carrying out large-scale screening and prediction of PE.

Apart from the prediction model algorithms, the predictive factors included are also crucial for predicting PE. The models that are overly complex or costly are not practical for screening large populations at the community level. A required predictive model should be both cost-effective and highly sensitive, meaning that the predictors within the model should be low-cost and yet acceptable in terms of accuracy (41). Our research findings indicate that in the prediction model for PE, the combination of maternal characteristics, MAP, PAPP-A as predictors can achieve an AUC of 0.82. For preterm PE, this combination can achieve an AUC of 0.87. The increase in predictive effect was limited by adding the other parameters. PLGF and UtA-PI could only increase the AUC to 0.83 for PE prediction and to 0.88 for preterm PE prediction. J. Torres-Torres’s study (22) also found that in PE prediction, the AUC was 0.786 when the predictive factors included maternal characteristics and MAP. However, when PLGF and UtA-PI were added, the AUC decreased to 0.778. In Benkő, Z’s study (42) on the prediction of preterm PE in twin pregnancy, a similar finding was observed. The AUC was 0.742 when the prediction was based on maternal characteristics and MAP alone. However, with the inclusion of PLGF, UtA-PI, and PAPP-A, the AUC only increased to 0.776. In summary, based on the performance of the predictive model, the combination of maternal baseline, MAP, and PAPP-A meets the expectations for predicting PE and preterm PE. Considering the interpretability and parsimony of the model, as well as the real-world requirements for low-cost effectiveness analysis in clinical applications, we recommend the combination of maternal baseline characteristics with MAP and PAPP-A as predictive factors for the prediction of PE and preterm PE.

The strengths of this study are listed as follows: Firstly, the study was designed as a prospective cohort study. Strict quality control was maintained in data collection throughout the research process to ensure that the data used for modeling were objective and credible; Secondly, the model underwent rigorous validation using a 5-fold cross-validation with an 8:2 train-test split. This stringent testing ensures that the model is not only accurate but robust, instilling confidence in its practical application; Finally, despite the often “black box” nature of ML models, we employed SHAP values to enhance the interpretability of our model. This transparency is crucial for clinical adoption, offering insights into the underlying factors driving the predictions. The main limitation of this study is the relatively small sample size from a single center, lacking external data validation. It is essential to test the model on a more varied dataset to confirm its applicability across different populations. It is anticipated that it will be necessary to assess the model performance in diverse real-world settings to confirm its efficacy and reliability.

5 Conclusions

In conclusion, our study offers several automated machine-learning algorithms to make PE and preterm PE predictions more accessible, cost-effective, and reliable in the first-trimester. The integration of maternal baseline, MAP, and PAPP-A into the predictive model could potentially revolutionize PE screening, making it more accessible and reliable, especially in developing countries where resources and specialized training are limited.

Data availability statement

The datasets used during the current study are available from the corresponding author on reasonable request.

Ethics statement

The studies involving humans were approved by the institutional review board of Nanjing Drum Tower Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

TL: Formal analysis, Software, Writing – original draft. MX: Formal analysis, Methodology, Validation, Writing – review & editing. YuW: Data curation, Project administration, Writing – review & editing. YaW: Data curation, Project administration, Writing – review & editing. HT: Data curation, Project administration, Writing – review & editing. HD: Data curation, Project administration, Writing – review & editing. GZ: Conceptualization, Funding acquisition, Writing – review & editing. MZ: Conceptualization, Supervision, Writing – review & editing. YH: Conceptualization, Supervision, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was funded by grants from the National Key R&D Program of China (2021YFC2701603).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2024.1345573/full#supplementary-material

Abbreviations

PE, preeclampsia; ACOG, American College of Obstetricians and Gynecologists; MAP, mean arterial pressure; UtA-PI, uterine artery pulsatility index; PAPP-A, pregnancy-associated plasma protein A; PLGF, placental growth factor; ML, Machine Learning; LR, Logistic Regression; ETC, Extra Trees Classifier; VC, Voting Classifier; GPC, Gaussian Process Classifier; SC, Stacking Classifier; RF, Random Forest; SVM, Support Vector Machines; AUC, area under the receiver operating characteristic curve; CI, confidence intervals; FPR, false positive rate; DR, Detection rate; MoM, multiples of the median.

References

1. Nissaisorakarn P, Sharif S, Jim B. Hypertension in pregnancy: defining blood pressure goals and the value of biomarkers for preeclampsia. Curr Cardiol Rep. (2016) 18:131. doi: 10.1007/s11886–016-0782–1

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Aye I, Aiken CE, Charnock-Jones DS, Smith GCS. Placental energy metabolism in health and disease-significance of development and implications for preeclampsia. Am J Obstet Gynecol. (2022) 226:S928–S44. doi: 10.1016/j.ajog.2020.11.005

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Melchiorre K, Giorgione V, Thilaganathan B. The placenta and preeclampsia: villain or victim? Am J Obstet Gynecol. (2022) 226:S954–S62. doi: 10.1016/j.ajog.2020.10.024

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Rana S, Lemoine E, Granger JP, Karumanchi SA. Preeclampsia: pathophysiology, challenges, and perspectives. Circ Res. (2019) 124:1094–112. doi: 10.1161/circresaha.118.313276

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Abalos E, Cuesta C, Grosso AL, Chou D, Say L. Global and regional estimates of preeclampsia and eclampsia: A systematic review. Eur J Obstet Gynecol Reprod Biol. (2013) 170:1–7. doi: 10.1016/j.ejogrb.2013.05.005

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Poon LC, Shennan A, Hyett JA, Kapur A, Hadar E, Divakar H, et al. The international federation of gynecology and obstetrics (Figo) initiative on pre-eclampsia: A pragmatic guide for first-trimester screening and prevention. Int J gynaecology obstetrics: Off Organ Int Fed Gynaecology Obstetrics. (2019) 145 Suppl 1:1–33. doi: 10.1002/ijgo.12802

CrossRef Full Text | Google Scholar

7. Dimitriadis E, Rolnik DL, Zhou W, Estrada-Gutierrez G, Koga K, Francisco RPV, et al. Pre-eclampsia. Nat Rev Dis Primers. (2023) 9:8. doi: 10.1038/s41572–023-00417–6

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Phipps EA, Thadhani R, Benzing T, Karumanchi SA. Pre-eclampsia: pathogenesis, novel diagnostics and therapies. Nat Rev Nephrol. (2019) 15:275–89. doi: 10.1038/s41581–019-0119–6

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Opichka MA, Rappelt MW, Gutterman DD, Grobe JL, McIntosh JJ. Vascular dysfunction in preeclampsia. Cells. (2021) 10:3055. doi: 10.3390/cells10113055

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Roberge S, Bujold E, Nicolaides KH. Aspirin for the prevention of preterm and term preeclampsia: systematic review and metaanalysis. Am J Obstet Gynecol. (2018) 218:287–93 e1. doi: 10.1016/j.ajog.2017.11.561

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Henderson JT, Vesco KK, Senger CA, Thomas RG, Redmond N. Aspirin use to prevent preeclampsia and related morbidity and mortality: updated evidence report and systematic review for the us preventive services task force. JAMA: J Am Med Assoc. (2021) 326:1192–206. doi: 10.1001/jama.2021.8551

CrossRef Full Text | Google Scholar

12. Committee opinion no. 638: first-trimester risk assessment for early-onset preeclampsia. Obstet Gynecol. (2015) 126:e25–e7. doi: 10.1097/01.AOG.0000471176.72443.88

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Force USPST, Davidson KW, Barry MJ, Mangione CM, Cabana M, Caughey AB, et al. Screening for gestational diabetes: us preventive services task force recommendation statement. JAMA: J Am Med Assoc. (2021) 326:531–8. doi: 10.1001/jama.2021.11922

CrossRef Full Text | Google Scholar

14. Acog committee opinion no. 743 summary: low-dose aspirin use during pregnancy. Obstet Gynecol. (2018) 132:254–6. doi: 10.1097/AOG.0000000000002709

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Hypertensive Disorders in Pregnancy Subgroup CSoOaG, Chinese Medical Association. Diagnosis and treatment of hypertension and pre-eclampsia in pregnancy: A clinical practice guideline in China(2020). Chin J Obstet Gynecol. (2020) 55:227–38. doi: 10.3760/cma.j.cn112141-20200114-00039

CrossRef Full Text | Google Scholar

16. O'Gorman N, Wright D, Syngelaki A, Akolekar R, Wright A, Poon LC, et al. Competing risks model in screening for preeclampsia by maternal factors and biomarkers at 11–13 weeks gestation. Am J Obstet Gynecol. (2016) 214:103.e1–.e12. doi: 10.1016/j.ajog.2015.08.034

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Wright D, Tan MY, O'Gorman N, Poon LC, Syngelaki A, Wright A, et al. Predictive performance of the competing risk model in screening for preeclampsia. Am J Obstet Gynecol. (2019) 220:199.e1–.e13. doi: 10.1016/j.ajog.2018.11.1087

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Chen Y, Huang X, Wu S, Guo P, Huang J, Zhou L, et al. Machine-learning predictive model of pregnancy-induced hypertension in the first trimester. Hypertens Res. (2023) 46:2135–44. doi: 10.1038/s41440–023-01298–8

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Ansbacher-Feldman Z, Syngelaki A, Meiri H, Cirkin R, Nicolaides KH, Louzoun Y. Machine-learning-based prediction of pre-eclampsia using first-trimester maternal characteristics and biomarkers. Ultrasound obstetrics gynecology: Off J Int Soc Ultrasound Obstetrics Gynecology. (2022) 60:739–45. doi: 10.1002/uog.26105

CrossRef Full Text | Google Scholar

20. Gil MM, Cuenca-Gomez D, Rolle V, Pertegal M, Diaz C, Revello R, et al. Validating a machine-learning model for first-trimester prediction of pre-eclampsia using the cohort from the preval study. Ultrasound Obstet Gynecol. (2023) 63:68–74. doi: 10.1002/uog.27478

CrossRef Full Text | Google Scholar

21. Melinte-Popescu AS, Vasilache IA, Socolov D, Melinte-Popescu M. Predictive performance of machine learning-based methods for the prediction of preeclampsia-a prospective study. J Clin Med. (2023) 12:418. doi: 10.3390/jcm12020418

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Torres-Torres J, Villafan-Bernal JR, Martinez-Portilla RJ, Hidalgo-Carrera JA, Estrada-Gutierrez G, Adalid-Martinez-Cisneros R, et al. Performance of a machine learning approach for the prediction of pre-eclampsia in a middle-income country. Ultrasound Obstet Gynecol. (2023) 63:350–57. doi: 10.1002/uog.27510

CrossRef Full Text | Google Scholar

23. Liu M, Yang X, Chen G, Ding Y, Shi M, Sun L, et al. Development of a prediction model on preeclampsia using machine learning-based method: A retrospective cohort study in China. Front Physiol. (2022) 13:896969. doi: 10.3389/fphys.2022.896969

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (Tripod): the tripod statement. BMJ. (2015) 350:g7594. doi: 10.1136/bmj.g7594

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Tay J, Masini G, McEniery CM, Giussani DA, Shaw CJ, Wilkinson IB, et al. Uterine and fetal placental doppler indices are associated with maternal cardiovascular function. Am J Obstet Gynecol. (2019) 220:96 e1– e8. doi: 10.1016/j.ajog.2018.09.017

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Cui Y, Wang W, Dong N, Lou J, Srinivasan DK, Cheng W, et al. Role of corin in trophoblast invasion and uterine spiral artery remodelling in pregnancy. Nature. (2012) 484:246–50. doi: 10.1038/nature10897

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Yang X, Ding Y, Sun L, Shi M, Zhang P, Huang Z, et al. Ferritin light chain deficiency-induced ferroptosis is involved in preeclampsia pathophysiology by disturbing uterine spiral artery remodelling. Redox Biol. (2022) 58:102555. doi: 10.1016/j.redox.2022.102555

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Schiffer V, Evers L, de Haas S, Ghossein-Doha C, Al-Nasiry S, Spaanderman M. Spiral artery blood flow during pregnancy: A systematic review and meta-analysis. BMC Pregnancy Childbirth. (2020) 20:680. doi: 10.1186/s12884–020-03150–0

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Noori M, Donald AE, Angelakopoulou A, Hingorani AD, Williams DJ. Prospective Study of Placental Angiogenic Factors and Maternal Vascular Function before and after Preeclampsia and Gestational Hypertension. Circulation. (2010) 122:478–87. doi: 10.1161/CIRCULATIONAHA.109.895458

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Conti E, Zezza L, Ralli E, Caserta D, Musumeci MB, Moscarini M, et al. Growth factors in preeclampsia: A vascular disease model. A failed vasodilation and angiogenic challenge from pregnancy onwards? Cytokine Growth Factor Rev. (2013) 24:411–25. doi: 10.1016/j.cytogfr.2013.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

31. MacDonald TM, Walker SP, Hannan NJ, Tong S, Kaitu'u-Lino TJ. Clinical tools and biomarkers to predict preeclampsia. EBioMedicine. (2022) 75:103780. doi: 10.1016/j.ebiom.2021.103780

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Chaemsaithong P, Pooh RK, Zheng M, Ma R, Chaiyasit N, Tokunaka M, et al. Prospective evaluation of screening performance of first-trimester prediction models for preterm preeclampsia in an asian population. Am J Obstet Gynecol. (2019) 221:650.e1–.e16. doi: 10.1016/j.ajog.2019.09.041

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Spencer K, Cowans NJ, Chefetz I, Tal J, Meiri H. First-trimester maternal serum pp-13, papp-a and second-trimester uterine artery doppler pulsatility index as markers of pre-eclampsia. Ultrasound obstetrics gynecology: Off J Int Soc Ultrasound Obstetrics Gynecology. (2007) 29:128–34. doi: 10.1002/uog.3876

CrossRef Full Text | Google Scholar

34. Roberts L, Chaemsaithong P, Sahota DS, Nicolaides KH, Poon LCY. Protocol for measurement of mean arterial pressure at 10–40weeks' Gestation. Pregnancy hypertension. (2017) 10:155–60. doi: 10.1016/j.preghy.2017.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Plasencia W, Maiz N, Bonino S, Kaihura C, Nicolaides KH. Uterine artery doppler at 11 + 0 to 13 + 6 weeks in the prediction of pre-eclampsia. Ultrasound obstetrics gynecology: Off J Int Soc Ultrasound Obstetrics Gynecology. (2007) 30:742–9. doi: 10.1002/uog.5157

CrossRef Full Text | Google Scholar

36. Tarca AL, Taran A, Romero R, Jung E, Paredes C, Bhatti G, et al. Prediction of preeclampsia throughout gestation with maternal characteristics and biophysical and biochemical markers: A longitudinal study. Am J Obstet Gynecol. (2022) 226:126.e1–.e22. doi: 10.1016/j.ajog.2021.01.020

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Zhang J, Han L, Li W, Chen Q, Lei J, Long M, et al. Early prediction of preeclampsia and small-for-gestational-age via multi-marker model in chinese pregnancies: A prospective screening study. BMC Pregnancy Childbirth. (2019) 19:304. doi: 10.1186/s12884–019-2455–8

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Rezende KBC, Cunha A, Amim J Jr., Oliveira WM, Leao MEB, Menezes MOA, et al. Performance of fetal medicine foundation software for pre-eclampsia prediction upon marker customization: cross-sectional study. J Med Internet Res. (2019) 21:e14738. doi: 10.2196/14738

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Magee LA, Brown MA, Hall DR, Gupte S, Hennessy A, Karumanchi SA, et al. The 2021 international society for the study of hypertension in pregnancy classification, diagnosis & Management recommendations for international practice. Pregnancy hypertension. (2022) 27:148–69. doi: 10.1016/j.preghy.2021.09.008

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Wright D, Syngelaki A, Akolekar R, Poon LC, Nicolaides KH. Competing risks model in screening for preeclampsia by maternal characteristics and medical history. Am J Obstet Gynecol. (2015) 213:62.e1–.e10. doi: 10.1016/j.ajog.2015.02.018

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Sufriyana H, Wu YW, Su EC. Artificial intelligence-assisted prediction of preeclampsia: development and external validation of a nationwide health insurance dataset of the bpjs kesehatan in Indonesia. EBioMedicine. (2020) 54:102710. doi: 10.1016/j.ebiom.2020.102710

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Benkő Z, Wright A, Rehal A, Cimpoca B, Syngelaki A, Delgado JL, et al. Prediction of pre-eclampsia in twin pregnancy by maternal factors and biomarkers at 11–13 Weeks' Gestation: data from events trial. Ultrasound obstetrics gynecology: Off J Int Soc Ultrasound Obstetrics Gynecology. (2021) 57:257–65. doi: 10.1002/uog.23531

CrossRef Full Text | Google Scholar

Keywords: preeclampsia, machine learning, cohort, voting classifier, competing risk model

Citation: Li T, Xu M, Wang Y, Wang Y, Tang H, Duan H, Zhao G, Zheng M and Hu Y (2024) Prediction model of preeclampsia using machine learning based methods: a population based cohort study in China. Front. Endocrinol. 15:1345573. doi: 10.3389/fendo.2024.1345573

Received: 28 November 2023; Accepted: 27 May 2024;
Published: 11 June 2024.

Edited by:

Jan I. Olofsson, Karolinska Institutet (KI), Sweden

Reviewed by:

Rupsha Fraser, University of Edinburgh, United Kingdom
Zheng Liu, Peking University, China

Copyright © 2024 Li, Xu, Wang, Wang, Tang, Duan, Zhao, Zheng and Hu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mingming Zheng, ZHJtaW5nbWluZ3poZW5nQDE2My5jb20=; Yali Hu, eWFsaWh1QG5qdS5lZHUuY24=

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.