Predictive analytics in bronchopulmonary dysplasia: past, present, and future

McOmber, Bryan G.; Moreira, Alvaro G.; Kirkman, Kelsey; Acosta, Sebastian; Rusin, Craig; Shivanna, Binoy

doi:10.3389/fped.2024.1483940

REVIEW article

Front. Pediatr., 20 November 2024

Sec. Pediatric Pulmonology

Volume 12 - 2024 | https://doi.org/10.3389/fped.2024.1483940

This article is part of the Research TopicBronchopulmonary Dysplasia: Latest Advances-Volume IIView all 12 articles

Predictive analytics in bronchopulmonary dysplasia: past, present, and future

Bryan G. McOmber¹

Alvaro G. Moreira¹

Kelsey Kirkman²

Sebastian Acosta³

Craig Rusin³

Binoy Shivanna^2*

¹Division of Neonatology, Department of Pediatrics, University Hospital, University of Texas Health Science Center at San Antonio, San Antonio, TX, United States
²Division of Neonatology, Department of Pediatrics, Texas Children’s Hospital, Baylor College of Medicine, Houston, TX, United States
³Division of Pediatric Cardiology, Department of Pediatrics, Texas Children’s Hospital, Baylor College of Medicine, Houston, TX, United States

Bronchopulmonary dysplasia (BPD) remains a significant complication of prematurity, impacting approximately 18,000 infants annually in the United States. Advances in neonatal care have not reduced BPD, and its management is challenged by the rising survival of extremely premature infants and the variability in clinical practices. Leveraging statistical and machine learning techniques, predictive analytics can enhance BPD management by utilizing large clinical datasets to predict individual patient outcomes. This review explores the foundations and applications of predictive analytics in the context of BPD, examining commonly used data sources, modeling techniques, and metrics for model evaluation. We also highlight bioinformatics’ potential role in understanding BPD's molecular basis and discuss case studies demonstrating the use of machine learning models for risk prediction and prognosis in neonates. Challenges such as data bias, model complexity, and ethical considerations are outlined, along with strategies to address these issues. Future directions for advancing the integration of predictive analytics into clinical practice include improving model interpretability, expanding data sharing and interoperability, and aligning predictive models with precision medicine goals. By overcoming current challenges, predictive analytics holds promise for transforming neonatal care and providing personalized interventions for infants at risk of BPD.

1 Introduction

Bronchopulmonary dysplasia (BPD) is a chronic lung disease primarily caused by inflammation and lung injury due to mechanical ventilation and supplemental oxygen therapy (1). This condition disrupts the growth and development of alveoli and pulmonary vasculature and plays a significant role in the prognosis and long-term outcomes of premature neonates (2). Despite advances in neonatal care and improved survival of premature infants, BPD rates remain high, affecting as many as 18,000 infants annually in the US (1). Definitions of BPD have evolved since it was first described in 1967, with the National Institute of Child Health and Human Development (NICHD) and the Neonatal Research Network (NRN) adopting severity-based grading systems based on respiratory support at 36 weeks postmenstrual age (PMA) (3, 4). Some studies suggest defining BPD at 40 weeks PMA may better predict serious long-term respiratory outcomes (5).

Managing BPD remains challenging due to the rising survival rates of extremely premature infants and variability in clinical practices. Current strategies include non-invasive ventilation, surfactant therapy, and early caffeine administration (6). Although postnatal steroid therapy aids in weaning neonates from mechanical ventilation, studies have not demonstrated a significant reduction in overall BPD rates (7). Individualized care is crucial given the multifactorial nature of BPD, with factors such as oxygen therapy, ventilation methods, medications, nutrition, and genetics influencing outcomes. Predictive analytics holds promise for developing more targeted and personalized therapies to improve the management and prognosis of BPD.

2 Foundations and techniques of predictive analytics

Predictive analytics in healthcare encompasses a variety of statistical and machine learning techniques aimed to forecast predictions about future outcomes based on historical data (8, 9). The central idea is to utilize past clinical trajectories of large cohorts to predict outcomes for patients with similar clinical characteristics. In this regard, supervised machine learning techniques are commonly used for predictive modeling, where algorithms are trained on labeled datasets to understand the relationship between input features (e.g., clinical data) and known outcomes. In contrast, unsupervised machine learning is used to discover patterns or clusters within data without predefined labels, offering insights into developing clinical phenotypes (8, 10).

2.1 Commonly used data sources and types in predictive modeling

Predictive modeling leverages multi-modal data sources to enhance clinical outcomes and optimize care strategies (Table 1). Primary data sources are from electronic health records (EHRs) (9), which provide comprehensive, longitudinal data on patient demographics, medical history, treatment plans, and clinical outcomes. Imaging data, such as ultrasounds and MRI scans (11, 12), also play a critical role in diagnosing and predicting the progression of various disorders. Additionally, real-time monitoring data, capturing vital signs such as heart rate, respiratory rate, and oxygen saturation, are increasingly being integrated into predictive models (13–15). This high frequency, time-series data is instrumental in the early detection of conditions like sepsis or respiratory distress. Other valuable data sources include genomic data and other bioinformatics, which can provide insights into congenital conditions and inform personalized treatment plans.

Table 1

Table 1. Key components of predictive analytics in BPD management.

2.2 Techniques and algorithms for predictive analytics

Multiple methods can be utilized to develop predictive algorithms in healthcare. A common approach involves defining a time period before the event is predicted (usually called the pre-deterioration class) and identifying a time period where no event occurs (the control class) (13). Standard classification algorithms can then be applied to distinguish between these two classes, resulting in an algorithm capable of detecting the pre-deterioration state that occurs before the event of interest.

Linear regression, logistic regression, and Cox proportional hazards models are commonly used as a first step (10). For larger and more complicated datasets, more sophisticated machine learning approaches, such as decision trees, random forests, and support vector machines, offer enhanced robustness and accuracy in predictions (8, 9). Deep learning, a subset of neural network machine learning, has also gained prominence due to its ability to model intricate patterns in data through neural networks (16–18). Convolutional neural networks (CNNs) are particularly effective in image analysis (11, 12, 19), while recurrent neural networks (RNNs) are particularly effective in handling time-series data (20). Additionally, ensemble methods, which combine multiple algorithms to improve predictive performance, are also frequently utilized (17). The choice of predictive model depends on the amount and complexity of data, with more advanced models typically required for larger and more intricate datasets.

2.3 Evaluation metrics for predictive models

Evaluating the performance of predictive models is critical to ensuring their reliability and efficacy in clinical settings. Common evaluation metrics include alert rate, accuracy, sensitivity, specificity, precision, and the area under the receiver operating characteristic curve (AUC-ROC) (13, 17, 18). Accuracy measures the proportion of correct predictions out of the total predictions, providing a general sense of model performance. Sensitivity (or recall) assesses the model's ability to identify true positive clinical events correctly, to ensure that at-risk patients are accurately detected. Specificity evaluates the model's ability to identify true negatives correctly, preventing the misdiagnosis of healthy individuals. Precision, which is the proportion of true positives among the predicted positives, reflects the model's reliability in identifying true positive predictions.

The AUC-ROC provides a comprehensive measure of model performance across different threshold settings, balancing sensitivity and specificity to give an overall picture of the model's discriminatory ability. In clinical settings, an AUC value above 0.70–0.75 is generally considered acceptable, while values above 0.80–0.85 are preferred, indicating a strong ability to differentiate between patients who will and will not experience the outcome. Alert rate is the number of alerts that are generated per patient per day, which is important to understand from a clinical workload management perspective (i.e., you can have high performance, but if it comes at the cost of hundreds of alarms per patient per day, the model is not clinically viable). These metrics offer a robust framework for assessing and validating predictive models, ensuring their clinical utility and effectiveness.

Performance metrics are calculated and reported on both a training data set and a separate independent data set for validation purposes. Importantly, the validation data set is not used in the initial training of the model. Validation testing is required to ensure that these machine learning models are not overfitting the training data. Model overfitting occurs when the training process of the algorithm essentially allows the model to memorize the inputs and outputs for a given data set. This can occur if there are a large number of input variables (also called model input features) and a low number of observations. A model that performs well on both training data and independent validation data sets at the same time is considered to have good generalization, which is critical to a successful clinical deployment. A workflow for predictive analytics in healthcare can be seen in Figure 1.

Figure 1

Figure 1. Overview of the machine learning (ML) pipeline in healthcare, illustrating the key stages from data collection to model deployment and continuous improvement. The process begins with Data Collection and Sources (Step 1), including clinical data such as MRIs, genomics, and oxygen saturation metrics. Data Preprocessing (Step 2) involves cleaning datasets and addressing missing values across healthcare institutions. In Model Development and Training (Step 3), algorithms like Random Forests, CNNs, RNNs, and gradient boosting decision trees (GBDTs) are utilized, with ensemble methods aiding in model selection. Validation and Evaluation (Step 4) ensures performance reliability through metrics such as AUC, sensitivity, and specificity, alongside bias audits. Deployment in Clinical Settings (Step 5) introduces real-time monitoring tools and alerts for high-risk patients. Feedback and Continuous Improvement (Step 6) and Ongoing Model Calibration (Step 7) ensure sustained performance and adaptability in evolving clinical environments.

3 Predictive modeling and bioinformatics

3.1 Predictive models for risk stratification and prognosis

Previous studies have attempted to develop functional predictive models for BPD, with varied results (Table 2) (49–51). These models usually consisted of formulas or web calculators based on demographic factors such as birth weight, gestational age, and sex. While many of these studies described a high predictive performance, they were often inconsistent with one another for various reasons. Among these were differences in their definitions of BPD, such as the outcome, small sample sizes, and poor handling of missing data. Of note, a recent meta-analysis of seven studies using lung ultrasound scores in neonates to predict BPD showed significant diagnostic accuracy in predicting BPD at 7 & 14 days of life (52).

Table 2

Table 2. Examples of predictive modeling approaches for BPD.

In 2022, the National Institutes for Child Health and Development updated a previous model for calculating BPD in infants with gestational ages between 23 and 28 weeks (31, 53). This estimator now includes factors such as antenatal steroid administration and the occurrence of surgical necrotizing enterocolitis. Unfortunately, these predictive models do not yet incorporate omics data, as this study area is still under research. However, integrating omics data could significantly enhance the predictive power of these models.

In recent years, researchers have begun using artificial intelligence (AI) or high computational analyses to decipher large data (54, 55). These AI models serve as rapid tools that can be leveraged for predictive modeling that can optimize accuracy. This capability could also be applied to precision medicine on a patient-by-patient basis. For example, an AI-powered predictive model integrated into a hospital's electronic health system could quickly and affordably assess an individual patient's genome, vital sign capture, or mechanical ventilator data for BPD risk factors, offering real-time decision support based on personalized risk factors.

3.2 Bioinformatics

Bioinformatics plays a significant role in understanding the molecular mechanisms underpinning BPD. Omics could participate in precision medicine in two ways: prediction and treatment. The integration of omics data enables the development of predictive models that can identify preterm infants at higher risk for BPD. By using genomic changes observed in the first few days of life, researchers can categorize neonates based on their risk profiles. This stratification can help during clinical trial selection by recruiting neonates at high risk for BPD, making sure that treatments are tested on individuals most likely to benefit. In addition, with a better understanding of the dysregulated pathways, new therapies can be developed to specifically target modifiable mechanisms, aiding in the development of new therapies. The following are examples of bioinformatic studies used to better understand BPD:

• Transcriptomics: Genes involved in ferroptosis and T-cell immunity have been associated with BPD (25, 56).

• Metabolomic: Pathways, including the citrate cycle, alanine, aspartate, and glutamate metabolism have been linked with BPD (57).

• Microbiomics: Increased Proteobacteria and decreased Firmicutes in the gut modulate systemic inflammatory levels of IL-1β, IL-6, and TNF-α, impacting lung health (58).

4 Case-Based example of using predictive analytics in BPD

Two recent studies illustrate the practical application of ML models in predicting BPD risk and severity.

4.1 Gradient boosting model for early BPD prediction

This study presented the application of machine learning, specifically gradient boosting decision trees (GBDT), to predict the immediate postnatal risk of death or BPD in very preterm and very low birth weight infants using data from a nationwide Japanese database (59). The GBDT algorithm, trained on clinical variables available at birth (e.g., gestational age, birth weight, Apgar scores, and maternal factors), accurately identified infants at high risk for adverse outcomes, enabling for potential early intervention strategies.

The clinical implications of this study are significant, as early and accurate risk stratification can be a powerful tool for clinicians. The ability to predict death or BPD shortly after birth enables healthcare providers to better allocate resources, such as assigning high-risk infants to specialized teams, tailoring respiratory support, and optimizing nutritional strategies. Moreover, the GBDT model's interpretability aids clinicians in identifying key risk factors, enhancing decision-making and parental counseling. This level of personalized risk assessment also aligns with the push towards precision medicine in neonatology, where interventions can be targeted to those who need them the most. Implementing such predictive models in clinical practice could improve outcomes by enabling earlier interventions for infants at risk of BPD or death. However, successful integration requires model validation across diverse populations beyond Japan, and seamless implementation into clinical workflows with proper staff training.

4.2 Two-stage neural network model for BPD severity prediction

Using a two-stage machine learning approach, Hwang et al. took data from a nationwide cohort of VLBWs to build predictive models that help in the early identification of those at high risk of developing BPD (60). The first stage involved training models to predict moderate to severe BPD, and the second stage focused on predicting the presence and severity of BPD based on new clinical data gathered later in the neonatal period. By incorporating longitudinal data, the model aimed to enhance prediction accuracy by reflecting changes in the infant's condition over time.

The study used clinical and demographic data to create its prediction model. They included gestational age, birth weight, respiratory support requirements, and other indicators known to influence the risk of BPD in preterm infants. The results demonstrated that the two-stage learning-based approach could outperform traditional single-stage models, achieving higher predictive accuracy and identifying at-risk infants with good sensitivity and specificity.

This study's clinical relevance lies in early BPD prediction, enabling targeted interventions and resource allocation in NICUs. The model helps clinicians identify infants for preventive strategies like optimized ventilation and early treatments. Its two-stage approach mirrors real-world decision-making, making it adaptable and valuable in dynamic NICU settings, ultimately improving patient outcomes with personalized care.

5 Challenges and limitations of predictive analytics

Predictive modeling in healthcare holds significant promise; however, various challenges and limitations can affect the accuracy and reliability of these models.

5.1 Data gaps

5.1.1 Incomplete and unstandardized data

Missing data, often resulting from incomplete records or data entry errors, can introduce bias into model predictions (61–65), reducing its accuracy and reliability. Further, the data may vary greatly in size, form, and format; they are often complex, heterogeneous, poorly annotated, and frequently unstructured, preventing effective modeling (66).

5.1.2 Data collection bias

Certain populations may be underrepresented in datasets, causing biased predictions that fail to generalize effectively across diverse patient groups (67). As a result, the trained model may perform poorly for these underrepresented populations.

5.1.3 Infrequent updates

Medical records or patient data may be updated inconsistently. If a model is trained on outdated information, it might not accurately capture current trends or changes in patient conditions.

5.2 Real-Time data integration challenges

5.2.1 Heterogeneous data

Heterogeneous data poses another challenge; variations in data entry practices, the use of different types of data sources (e.g., EHRs, wearable devices, and medical imaging), and incompatible formats and standards from different healthcare systems can hinder the seamless exchange and integration of data (64, 65, 68–70).

5.2.2 High volume and high velocity data

The high volume of data can overwhelm traditional data processing systems, while the high velocity of real-time data from wearable devices and monitoring systems demands rapid processing and analysis to maintain clinical relevance (70).

5.2.3 Delay in data availability

The laboratory results or diagnostic imaging reports may experience processing delays, hindering the ability to make accurate real-time predictions.

5.2.4 Constantly changing data

Additionally, the dynamic nature of healthcare data introduces further complexity; as healthcare knowledge and practices evolve, maintaining up-to-date models becomes challenging (71). Moreover, patient health conditions can also change rapidly, requiring models to adapt in real time (71, 72).

5.2.5 Heterogeneous data source platforms

Often, the data source platforms used by the institutions vary, creating a barrier for effective real-time data integration (73).

5.2.6 Data privacy and security

Real-time integration of data involves handling sensitive healthcare data, which raises concerns about privacy and security (74). Strict compliance with regulations like the US Health Insurance Portability and Accountability Act (HIPAA) can complicate data usage, sharing, and integration. Ensuring compliance with regulations like HIPAA is essential to mitigate risks of unauthorized access or breaches of sensitive data. Institutions must also incorporate legal safeguards that align with emerging global frameworks, such as the EU's General Data Protection Regulation (GDPR), which emphasizes data minimization and user control.

5.3 Model complexity and interpretability

While predictive modeling has a significant impact on healthcare, the complexity and interpretability of these models pose challenges and limitations. Advanced predictive models, particularly those built with AI and ML, often result in “black-boxes” with intricate internal workings that are not easily interpretable by clinicians (75). This lack of clarity raises concerns about trust, as it becomes difficult to understand or validate how these models generate predictions. As a result, healthcare professionals may hesitate to adopt these tools at the bedside, fearing errors, misjudgments, or liability issues without being able to challenge or explain the recommendations.

The interpretability problem becomes especially critical in high-stakes environments, such as diagnosing life-threatening conditions or determining optimal treatment plans (76). A predictive model's lack of transparency can undermine clinician confidence, especially when it suggests withholding treatment or recommending experimental interventions. This uncertainty affects decision-making and complicates patient consent, as families may be uneasy trusting decisions without clear human explanations.

Another challenge with these complex models is the risk of overfitting, where a model learns patterns from training data too precisely, including noise or irrelevant correlations (77). While this may produce highly accurate predictions on known datasets, the model's performance may worsen when applied to new, real-world data. This poor generalization can lead to biased predictions, especially if the training data does not represent the diversity of patient populations adequately. For example, a model trained primarily on data from urban hospitals might struggle to provide accurate predictions for rural or underserved communities.

Ensuring robustness and reliability in predictive models also requires balancing complexity with simplicity. While deep learning models may offer superior predictive power, simpler algorithms—such as logistic regression or decision trees—might be more suitable in clinical settings because they are easier to interpret. Explainability plays a crucial role in integrating predictive analytics into healthcare.

5.4 Resource, cost constraints, disparities in care, and regulatory and ethical concerns

Predictive analytics in healthcare introduces various ethical and regulatory challenges, including patient consent, data privacy, and the potential misuse of sensitive information. As noted by Carini and Seyhan, the effectiveness of predictive models relies on access to high-quality data, raising questions about data ownership, security, and interoperability (78). Institutions must invest in infrastructure that supports ethical AI development while managing the growing burden of regulatory obligations. Developing, implementing, and maintaining predictive analytics tools is resource- and cost-intensive, demanding significant investment in technology, data infrastructure, and skilled personnel.

Beyond initial implementation, organizations must budget for the continuous upkeep of these systems, which involves periodic recalibration of algorithms to address evolving patient demographics and medical practices. Furthermore, as Marques et al. emphasize, smaller healthcare providers or facilities in underserved areas may struggle to afford these advanced technologies, potentially widening disparities in access to care (79). Additionally, the integration of predictive analytics requires robust training programs to equip healthcare professionals with the necessary skills to interpret and utilize AI-generated recommendations effectively. Institutions must also account for potential disruptions and interoperability issues when incorporating these tools into existing workflows, as outdated or fragmented systems may impede seamless adoption.

There are also hidden costs associated with data management. Predictive analytics depends on large, well-annotated datasets, which require significant storage capacity and security protocols. Ensuring compliance with privacy regulations such as the GDPR adds another layer of complexity, as it may necessitate specialized software for data anonymization and consent management. Moreover, these tools often rely on cloud-based infrastructure, incurring recurring costs for data hosting and cybersecurity measures to prevent breaches.

Finally, the high initial and operational costs may lead institutions to rely heavily on third-party vendors, raising additional concerns about dependency and data sovereignty. Over-reliance on external partners can also complicate accountability in the event of errors or data breaches, further increasing the burden on institutions to establish clear governance frameworks that align with ethical standards and regulatory requirements.

5.5 Data bias and misuse risks

Predictive models, though powerful, can unintentionally perpetuate biases if not carefully designed. Marques et al. emphasize that AI models are only as reliable as the data they are trained on, and these datasets often reflect existing social biases (79). This can lead to inequitable care or discrimination, particularly against marginalized populations. To address these concerns, predictive systems must undergo regular auditing to detect bias, and the development process should include input from diverse stakeholders to ensure fair outcomes.

Another emerging concern is the potential misuse of predictive tools, particularly when these models are deployed without appropriate oversight. In some cases, predictive analytics could be misused to ration care, denying necessary services to patients deemed “low risk” by an algorithm. Regulatory frameworks must ensure that predictive tools are not only effective but also used ethically, with clear accountability mechanisms to prevent misuse. Transparency is essential—healthcare providers and patients should have access to detailed explanations of how predictions are generated, as well as the ability to contest decisions based on algorithmic outputs.

Finally, mitigating risks requires a culture of accountability in AI development, with governance frameworks for bias assessments and addressing unintended consequences. Engaging diverse stakeholders throughout the AI lifecycle ensures fairness and equity, improving model reliability and fostering trust among patients and providers for more just, effective healthcare.

5.6 Accountability and trust in predictive systems

Predictive analytics in healthcare complicates accountability, especially when algorithm-driven decisions cause harm. The lack of transparency makes it difficult to assign responsibility, posing challenges for both developers and healthcare providers (78). Trust in predictive systems requires transparency, ongoing validation, and explainable AI to help clinicians understand outputs. Clear ethical guidelines outlining stakeholder responsibilities are essential for public trust and safe technology integration.

5.7 Challenges and limitations of BPD predictive models

Despite the presence of many BPD predictive models (25, 50, 51, 80, 81), only a few (31, 53) are widely available to clinicians, and practically none are used regularly in clinical practice (50). Most models have methodological flaws (small sample size, poor calibration and validation, and increased bias) and lack dynamism (50, 51). Further, most models lack comprehensive, continuous, high-definition physiological data and a mechanistic biomarker panel. Finally, the data gaps and real-time data integration challenges discussed above significantly impact BPD predictive models.

Several of the limitations discussed in this section also pose significant challenges in translating research-based predictive models into clinical practice. The next section offers strategies to address these challenges.

6 Future directions and opportunities

6.1 Enhancing model interpretability

A key priority for advancing predictive analytics is improving AI model interpretability. Complex algorithms like deep learning, despite their predictive power, face limited clinical adoption due to their “black-box” nature. Clinicians hesitate to trust opaque recommendations, especially in high-stakes areas like neonatal care. Explainable AI (XAI) frameworks are crucial to addressing this challenge (75). These frameworks aim to provide transparency by breaking down how predictions are generated, offering clinicians meaningful insights into which variables contributed to the outcome. Additionally, human-in-the-loop systems, where clinicians interact with the model to refine its predictions, can further improve adoption and create feedback loops that enhance the model's performance over time. Efforts to integrate visualization tools into clinical workflows, where model outputs are presented in intuitive dashboards, will also empower healthcare providers to use predictive analytics with greater confidence.

6.2 Improving data sharing and interoperability

A second key priority is improving data sharing and interoperability across healthcare institutions. Fragmented data sources, ranging from clinical records and genetic data to wearable sensor outputs—limit the effectiveness of predictive models. Interoperable systems that standardize how data is collected, stored, and exchanged will make it easier for researchers and clinicians to build robust, generalizable models. Developing shared data formats and platforms, such as Fast Healthcare Interoperability Resources (FHIR), is a step toward seamless data exchange across hospitals and research institutions (80). Large-scale collaborations supported by secure data-sharing agreements will enable the aggregation of more comprehensive datasets, increasing the statistical power of predictive models (81). However, these efforts must also navigate regulatory challenges, ensuring compliance with frameworks such as HIPAA and GDPR while maintaining the privacy and security of patient data. Institutions will need to invest in governance structures and advanced encryption technologies to enable responsible and compliant data sharing.

6.3 Expanding access and addressing bias

Equitable access to predictive analytics is a critical priority for the future. AI-driven models, if not carefully designed, risk exacerbating existing disparities by benefiting only well-resourced healthcare settings. Institutions in rural areas or those serving low-income populations often lack the infrastructure to adopt these advanced technologies, which may leave their patients underserved. It is crucial to ensure that predictive models are validated across diverse patient populations to prevent bias and ensure fair outcomes. This involves expanding the datasets used to train these models to include underrepresented groups, such as racial minorities and economically disadvantaged populations, who are often excluded from medical research. Regular audits for algorithmic bias must become a standard practice, with adjustments made to correct any disparities in performance. Additionally, stakeholder engagement—bringing in perspectives from underrepresented communities and clinicians working in underserved areas—will ensure the models align with the needs of all patient populations. Future policies should also focus on ensuring that these tools remain accessible and affordable, preventing cost from becoming a barrier to equitable healthcare delivery.

6.4 Workforce training and interdisciplinary collaboration

A skilled workforce is vital for integrating predictive analytics in healthcare. Interdisciplinary collaboration among clinicians, data scientists, engineers, and informaticians is key. Clinicians must develop basic competencies in data science and AI to actively engage with predictive tools and interpret model outputs effectively (82). This will ensure that clinical decisions informed by predictive analytics are made thoughtfully and responsibly. At the same time, data scientists and engineers need to develop a deeper understanding of clinical contexts, such as the unique challenges faced in neonatal care, to design models that align with clinical realities. Interdisciplinary training programs that bridge these gaps will foster collaboration and innovation, helping professionals from different fields work together seamlessly. Cross-disciplinary programs, like joint fellowships or certification courses, can equip future healthcare professionals and data scientists to effectively implement and advance predictive technologies.

Developing leadership in clinical informatics will also be essential to manage the intersection of technology and clinical practice (83). Clinical informaticians are key to integrating predictive models into EHRs and creating actionable decision-support tools. By designing workflows that incorporate AI predictions, they help clinicians make faster, informed decisions without being overwhelmed by data. Institutions fostering multidisciplinary collaboration will better harness these technologies to improve patient outcomes.

6.5 Aligning predictive analytics with precision medicine

The ultimate goal of predictive analytics in neonatal care and other fields is to advance precision medicine, where treatment plans are tailored to the individual needs of each patient. Predictive models can identify patient-specific risk factors and predict disease trajectories based on genetic, environmental, and clinical data (84). For example, models that integrate genomic data with real-time monitoring can identify biomarkers linked to early signs of respiratory complications, guiding clinicians toward more targeted interventions to reduce the severity or incidence of conditions like BPD. As Kim et al. suggest, combining genomics with continuous physiological data can yield new insights and improve individualized care strategies (83).

However, to fully harness predictive analytics in precision medicine, large, well-annotated datasets representing diverse populations are essential. Investments in data collection initiatives like biobanks and healthcare registries are crucial. Ethical considerations such as fairness, transparency, and patient autonomy must guide model deployment, with clear communication and informed consent to maintain trust. Policies governing ethical use will ensure these innovations benefit all, not just a select few.

7 Conclusions

The integration of predictive analytics into the management of BPD represents a transformative shift towards more personalized and precise neonatal care. By leveraging advanced machine learning techniques and incorporating diverse data sources such as EHRs, imaging, and bioinformatics, predictive models have the potential to significantly enhance risk stratification, tailor interventions, and ultimately improve outcomes for premature infants. Despite notable advancements, challenges remain, including data integration issues, model interpretability, and ethical considerations surrounding privacy and equity. Addressing these challenges through interdisciplinary collaboration, among data scientists, neonatologists, and ethicists, alongside robust validation, and ongoing research, is essential for realizing the full potential of predictive analytics in BPD management. Moving forward, a concerted effort to refine predictive models, integrate real-time data, and adhere to ethical standards will be crucial in advancing precision medicine and delivering better, individualized care for neonates at risk of BPD.

Author contributions

BM: Data curation, Formal Analysis, Methodology, Validation, Writing – original draft, Writing – review & editing. AM: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Methodology, Supervision, Validation, Writing – original draft, Writing – review & editing. KK: Data curation, Formal Analysis, Methodology, Validation, Writing – original draft, Writing – review & editing. SA: Data curation, Formal Analysis, Methodology, Validation, Writing – original draft, Writing – review & editing. CR: Conceptualization, Data curation, Formal Analysis, Methodology, Supervision, Validation, Writing – original draft, Writing – review & editing. BS: Conceptualization, Data curation, Formal Analysis, Methodology, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by Parker B. Francis; the National Institutes of Health (NIH) Eunice Kennedy Shriver National Institute of Child Health and Human Development K23HD101701; NIH National Heart Lung and Blood Institute 5R25HL126140-06 Subaward to AM.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

AI, artificial intelligence; AUC-ROC, area under the receiver operating characteristic curve; BPD, bronchopulmonary dysplasia; CNNs, convoluted neural networks; CPAP, continuous positive airway pressure; EHRs, electronic health records; GBDTs, Gradient Boosting Decision Trees; GDPR, general data protection regulation; HIPPA, health insurance portability and accountability Act; ML, machine learning; NICHD, national Institute of Child Health and Human Development; NRN, neonatal research network; PMA, postmenstrual age; RNNs, recurrent neural networks; XAI, explainable AI.

References

1. Thébaud B, Goss KN, Laughon M, Whitsett JA, Abman SH, Steinhorn RH, et al. Bronchopulmonary dysplasia. Nat Rev Dis Primer. (2019) 5(1):78. doi: 10.1038/s41572-019-0127-7

PubMed Abstract | Crossref Full Text | Google Scholar

2. Jobe AH, Bancalari E. Bronchopulmonary dysplasia. Am J Respir Crit Care Med. (2001) 163(7):1723–9. doi: 10.1164/ajrccm.163.7.2011060

PubMed Abstract | Crossref Full Text | Google Scholar

3. Higgins RD, Jobe AH, Koso-Thomas M, Bancalari E, Viscardi RM, Hartert TV, et al. Bronchopulmonary dysplasia: executive summary of a workshop. J Pediatr. (2018) 197:300–8. doi: 10.1016/j.jpeds.2018.01.043

PubMed Abstract | Crossref Full Text | Google Scholar

4. Jensen EA, Dysart K, Gantz MG, McDonald S, Bamat NA, Keszler M, et al. The diagnosis of bronchopulmonary dysplasia in very preterm infants. An evidence-based approach. Am J Respir Crit Care Med. (2019) 200(6):751–9. doi: 10.1164/rccm.201812-2348OC

PubMed Abstract | Crossref Full Text | Google Scholar

5. Isayama T, Lee SK, Yang J, Lee D, Daspal S, Dunn M, et al. Revisiting the definition of bronchopulmonary dysplasia: effect of changing panoply of respiratory support for preterm neonates. JAMA Pediatr. (2017) 171(3):271. doi: 10.1001/jamapediatrics.2016.4141

PubMed Abstract | Crossref Full Text | Google Scholar

6. Hwang JS, Rehan VK. Recent advances in bronchopulmonary dysplasia: pathophysiology, prevention, and treatment. Lung. (2018) 196(2):129–38. doi: 10.1007/s00408-018-0084-z

PubMed Abstract | Crossref Full Text | Google Scholar

7. Harris C, Greenough A. The prevention and management strategies for neonatal chronic lung disease. Expert Rev Respir Med. (2023) 17(2):143–54. doi: 10.1080/17476348.2023.2183842

PubMed Abstract | Crossref Full Text | Google Scholar

8. Sanchez-Pinto LN, Luo Y, Churpek MM. Big data and data science in critical care. Chest. (2018) 154(5):1239–48. doi: 10.1016/j.chest.2018.04.037

PubMed Abstract | Crossref Full Text | Google Scholar

9. Liang H, Tsui BY, Ni H, Valentim CCS, Baxter SL, Liu G, et al. Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence. Nat Med. (2019) 25(3):433–8. doi: 10.1038/s41591-018-0335-9

PubMed Abstract | Crossref Full Text | Google Scholar

10. Ramgopal S, Sanchez-Pinto LN, Horvat CM, Carroll MS, Luo Y, Florin TA. Artificial intelligence-based clinical decision support in pediatrics. Pediatr Res. (2023) 93(2):334–41. doi: 10.1038/s41390-022-02226-1

PubMed Abstract | Crossref Full Text | Google Scholar

11. Papp D, Castillo T JM, Wielopolski PA, Ciet P, Veenland JF, Kotek G, et al. Deep learning for improving ZTE MRI images in free breathing. Magn Reson Imaging. (2023) 98:97–104. doi: 10.1016/j.mri.2023.01.019

PubMed Abstract | Crossref Full Text | Google Scholar

12. Shen L, Zheng J, Lee EH, Shpanskaya K, McKenna ES, Atluri MG, et al. Attention-guided deep learning for gestational age prediction using fetal brain MRI. Sci Rep. (2022) 12(1):1408. doi: 10.1038/s41598-022-05468-5

PubMed Abstract | Crossref Full Text | Google Scholar

13. Rusin CG, Acosta SI, Vu EL, Ahmed M, Brady KM, Penny DJ. Automated prediction of cardiorespiratory deterioration in patients with single ventricle. J Am Coll Cardiol. (2021) 77(25):3184–92. doi: 10.1016/j.jacc.2021.04.072

PubMed Abstract | Crossref Full Text | Google Scholar

14. Rusin CG, Acosta SI, Shekerdemian LS, Vu EL, Bavare AC, Myers RB, et al. Prediction of imminent, severe deterioration of children with parallel circulations using real-time processing of physiologic data. J Thorac Cardiovasc Surg. (2016) 152(1):171–7. doi: 10.1016/j.jtcvs.2016.03.083

PubMed Abstract | Crossref Full Text | Google Scholar

15. Miller C, Manious M, Portnoy J. Artificial intelligence and machine learning for anaphylaxis algorithms. Curr Opin Allergy Clin Immunol. (2024) 24(5):305–12. doi: 10.1097/ACI.0000000000001015

PubMed Abstract | Crossref Full Text | Google Scholar

16. Ghazal S, Sauthier M, Brossier D, Bouachir W, Jouvet PA, Noumeir R. Using machine learning models to predict oxygen saturation following ventilator support adjustment in critically ill children: a single center pilot study. PloS One. (2019) 14(2):e0198921. doi: 10.1371/journal.pone.0198921

PubMed Abstract | Crossref Full Text | Google Scholar

17. Kong D, Tao Y, Xiao H, Xiong H, Wei W, Cai M. Predicting preterm birth using auto-ML frameworks: a large observational study using electronic inpatient discharge data. Front Pediatr. (2024) 12:1330420. doi: 10.3389/fped.2024.1330420

PubMed Abstract | Crossref Full Text | Google Scholar

18. Jiang H, Guo J, Li J, Li C, Du W, Canavese F, et al. Artificial neural network modeling to predict neonatal metabolic bone disease in the prenatal and postnatal periods. JAMA Netw Open. (2023) 6(1):e2251849. doi: 10.1001/jamanetworkopen.2022.51849

PubMed Abstract | Crossref Full Text | Google Scholar

19. Santomartino SM, Putman K, Beheshtian E, Parekh VS, Yi PH. Evaluating the robustness of a deep learning bone age algorithm to clinical image variation using computational stress testing. Radiol Artif Intell. (2024) 6(3):e230240. doi: 10.1148/ryai.230240

PubMed Abstract | Crossref Full Text | Google Scholar

20. Ho LV, Aczon M, Ledbetter D, Wetzel R. Interpreting a recurrent neural network’s predictions of ICU mortality risk. J Biomed Inform. (2021) 114:103672. doi: 10.1016/j.jbi.2021.103672

PubMed Abstract | Crossref Full Text | Google Scholar

21. Chioma R, Healy DB, Finn D, Walsh BH, Reynolds C, O’Sullivan D, et al. The bronchopulmonary dysplasia score: a predictive model for bronchopulmonary dysplasia or death in high-risk preterm infants. Acta Paediatr Oslo Nor 1992. (2024) 113(8):1781–90. doi: 10.1111/apa.17304

PubMed Abstract | Crossref Full Text | Google Scholar

22. Chou HY, Lin YC, Hsieh SY, Chou HH, Lai CS, Wang B, et al. Deep learning model for prediction of bronchopulmonary dysplasia in preterm infants using chest radiographs. J Imaging Inform Med. (2024) 37(5):2063–73. doi: 10.1007/s10278-024-01050-9

PubMed Abstract | Crossref Full Text | Google Scholar

23. Gao Y, Liu D, Guo Y, Cao M. Risk prediction of bronchopulmonary dysplasia in preterm infants by the nomogram model. Front Pediatr. (2023) 11:1117142. doi: 10.3389/fped.2023.1117142

PubMed Abstract | Crossref Full Text | Google Scholar

24. Kostekci YE, Bakırarar B, Okulu E, Erdeve O, Atasay B, Arsan S. An early prediction model for estimating bronchopulmonary dysplasia in preterm infants. Neonatology. (2023) 120(6):709–17. doi: 10.1159/000533299

PubMed Abstract | Crossref Full Text | Google Scholar

25. Moreira A, Tovar M, Smith AM, Lee GC, Meunier JA, Cheema Z, et al. Development of a peripheral blood transcriptomic gene signature to predict bronchopulmonary dysplasia. Am J Physiol Lung Cell Mol Physiol. (2023) 324(1):L76–87. doi: 10.1152/ajplung.00250.2022

PubMed Abstract | Crossref Full Text | Google Scholar

26. Ou W, Lei K, Wang H, Ma H, Deng X, He P, et al. Development of a blood proteins-based model for bronchopulmonary dysplasia prediction in premature infants. BMC Pediatr. (2023) 23(1):304. doi: 10.1186/s12887-023-04065-3

PubMed Abstract | Crossref Full Text | Google Scholar

27. Shen X, Patel N, Zhu W, Chen X, Lu K, Cheng R, et al. A nomogram for predicting the risk of bronchopulmonary dysplasia in premature infants. Heliyon. (2023) 9(8):e18964. doi: 10.1016/j.heliyon.2023.e18964

PubMed Abstract | Crossref Full Text | Google Scholar

28. Ahmed S, Odumade OA, van Zalm P, Smolen KK, Fujimura K, Muntel J, et al. Urine proteomics for noninvasive monitoring of biomarkers in bronchopulmonary dysplasia. Neonatology. (2022) 119(2):193–203. doi: 10.1159/000520680

PubMed Abstract | Crossref Full Text | Google Scholar

29. Alonso-Ojembarrena A, Méndez-Abad P, Alonso-Quintela P, Zafra-Rodríguez P, Oulego-Erroz I, Lubián-López SP. Lung ultrasound score has better diagnostic ability than NT-proBNP to predict moderate-severe bronchopulmonary dysplasia. Eur J Pediatr. (2022) 181(8):3013–21. doi: 10.1007/s00431-022-04491-y

PubMed Abstract | Crossref Full Text | Google Scholar

30. Bhattacharjee I, Das A, Collin M, Aly H. Predicting outcomes of mechanically ventilated premature infants using respiratory severity score. J Matern-Fetal Neonatal Med Off J Eur Assoc Perinat Med Fed Asia Ocean Perinat Soc Int Soc Perinat Obstet. (2022) 35(23):4620–7. doi: 10.1080/14767058.2020.1858277

PubMed Abstract | Crossref Full Text | Google Scholar

31. Greenberg RG, McDonald SA, Laughon MM, Tanaka D, Jensen E, Van Meurs K, et al. Online clinical tool to estimate risk of bronchopulmonary dysplasia in extremely preterm infants. Arch Dis Child Fetal Neonatal Ed. (2022) 107(6):638–43. doi: 10.1136/archdischild-2021-323573

PubMed Abstract | Crossref Full Text | Google Scholar

32. Kindt ASD, Förster KM, Cochius-den Otter SCM, Flemmer AW, Hauck SM, Flatley A, et al. Validation of disease-specific biomarkers for the early detection of bronchopulmonary dysplasia. Pediatr Res. (2023) 93(3):625–32. doi: 10.1038/s41390-022-02093-w

PubMed Abstract | Crossref Full Text | Google Scholar

33. Umapathi KK, Muller B, Sosnowski C, Thavamani A, Murphy J, Awad S, et al. A novel patent ductus arteriosus severity score to predict clinical outcomes in premature neonates. J Cardiovasc Dev Dis. (2022) 9(4):114. doi: 10.3390/jcdd9040114

PubMed Abstract | Crossref Full Text | Google Scholar

34. Zayat N, Truffert P, Drumez E, Duhamel A, Labreuche J, Zemlin M, et al. Systemic steroids in preventing bronchopulmonary dysplasia (BPD): neurodevelopmental outcome according to the risk of BPD in the EPICE cohort. Int J Environ Res Public Health. (2022 ) 19(9):5600. doi: 10.3390/ijerph19095600

PubMed Abstract | Crossref Full Text | Google Scholar

35. Aldecoa-Bilbao V, Velilla M, Teresa-Palacio M, Esponera CB, Barbero AH, Sin-Soler M, et al. Lung ultrasound in bronchopulmonary dysplasia: patterns and predictors in very preterm infants. Neonatology. (2021) 118(5):537–45. doi: 10.1159/000517585

PubMed Abstract | Crossref Full Text | Google Scholar

36. Alonso-Ojembarrena A, Serna-Guerediaga I, Aldecoa-Bilbao V, Gregorio-Hernández R, Alonso-Quintela P, Concheiro-Guisán A, et al. The predictive value of lung ultrasound scores in developing bronchopulmonary dysplasia: a prospective multicenter diagnostic accuracy study. Chest. (2021) 160(3):1006–16. doi: 10.1016/j.chest.2021.02.066

PubMed Abstract | Crossref Full Text | Google Scholar

37. Baud O, Laughon M, Lehert P. Survival without bronchopulmonary dysplasia of extremely preterm infants: a predictive model at birth. Neonatology. (2021) 118(4):385–93. doi: 10.1159/000515898

PubMed Abstract | Crossref Full Text | Google Scholar

38. Dai D, Chen H, Dong X, Chen J, Mei M, Lu Y, et al. Bronchopulmonary dysplasia predicted by developing a machine learning model of genetic and clinical information. Front Genet. (2021) 12:689071. doi: 10.3389/fgene.2021.689071

PubMed Abstract | Crossref Full Text | Google Scholar

39. Gerull R, Neumann RP, Atkinson A, Bernasconi L, Schulzke SM, Wellmann S. Respiratory morbidity in preterm infants predicted by natriuretic peptide (MR-proANP) and endothelin-1 (CT-proET-1). Pediatr Res. (2022) 91(6):1478–84. doi: 10.1038/s41390-021-01493-8

PubMed Abstract | Crossref Full Text | Google Scholar

40. Khurshid F, Coo H, Khalil A, Messiha J, Ting JY, Wong J, et al. Comparison of multivariable logistic regression and machine learning models for predicting bronchopulmonary dysplasia or death in very preterm infants. Front Pediatr. (2021) 9:759776. doi: 10.3389/fped.2021.759776

PubMed Abstract | Crossref Full Text | Google Scholar

41. Liu X, Lv X, Jin D, Li H, Wu H. Lung ultrasound predicts the development of bronchopulmonary dysplasia: a prospective observational diagnostic accuracy study. Eur J Pediatr. (2021) 180(9):2781–9. doi: 10.1007/s00431-021-04021-2

PubMed Abstract | Crossref Full Text | Google Scholar

42. Mohamed A, Mohsen N, Diambomba Y, Lashin A, Louis D, Elsayed Y, et al. Lung ultrasound for prediction of bronchopulmonary dysplasia in extreme preterm neonates: a prospective diagnostic cohort study. J Pediatr. (2021) 238:187–192.e2. doi: 10.1016/j.jpeds.2021.06.079

PubMed Abstract | Crossref Full Text | Google Scholar

43. Shim SY, Yun JY, Cho SJ, Kim MH, Park EA. The prediction of bronchopulmonary dysplasia in very low birth weight infants through clinical indicators within 1 hour of delivery. J Korean Med Sci. (2021) 36(11):e81. doi: 10.3346/jkms.2021.36.e81

PubMed Abstract | Crossref Full Text | Google Scholar

44. Song M, Lei M, Luo C, Shi Z, Cheng X, Ding W, et al. Development of a nomogram for moderate-to-severe bronchopulmonary dysplasia or death: role of N-terminal pro-brain natriuretic peptide as a biomarker. Front Pediatr. (2021) 9:727362. doi: 10.3389/fped.2021.727362

PubMed Abstract | Crossref Full Text | Google Scholar

45. Soullane S, Patel S, Claveau M, Wazneh L, Sant’Anna G, Beltempo M. Fluid status in the first 10 days of life and death/bronchopulmonary dysplasia among preterm infants. Pediatr Res. (2021) 90(2):353–8. doi: 10.1038/s41390-021-01485-8

PubMed Abstract | Crossref Full Text | Google Scholar

46. Ushida T, Moriyama Y, Nakatochi M, Kobayashi Y, Imai K, Nakano-Kobayashi T, et al. Antenatal prediction models for short- and medium-term outcomes in preterm infants. Acta Obstet Gynecol Scand. (2021) 100(6):1089–96. doi: 10.1111/aogs.14136

PubMed Abstract | Crossref Full Text | Google Scholar

47. Woods PL, Stoecklin B, Woods A, Gill AW. Early lung ultrasound affords little to the prediction of bronchopulmonary dysplasia. Arch Dis Child Fetal Neonatal Ed. (2021) 106(6):657–62. doi: 10.1136/archdischild-2020-320830

PubMed Abstract | Crossref Full Text | Google Scholar

48. Zhang R, Xu FL, Li WL, Qin FY, Jin XY, Zhang Y, et al. Construction of early risk prediction models for bronchopulmonary dysplasia in preterm infants. Zhongguo Dang Dai Er Ke Za Zhi Chin J Contemp Pediatr. (2021) 23(10):994–1001. doi: 10.7499/j.issn.1008-8830.2107035

PubMed Abstract | Crossref Full Text | Google Scholar

49. Peng HB, Zhan YL, Chen Y, Jin ZC, Liu F, Wang B, et al. Prediction models for bronchopulmonary dysplasia in preterm infants: a systematic review. Front Pediatr. (2022) 10:856159. doi: 10.3389/fped.2022.856159

PubMed Abstract | Crossref Full Text | Google Scholar

50. Kwok TC, Batey N, Luu KL, Prayle A, Sharkey D. Bronchopulmonary dysplasia prediction models: a systematic review and meta-analysis with validation. Pediatr Res. (2023) 94(1):43–54. doi: 10.1038/s41390-022-02451-8

PubMed Abstract | Crossref Full Text | Google Scholar

51. Romijn M, Dhiman P, Finken MJJ, van Kaam AH, Katz TA, Rotteveel J, et al. Prediction models for bronchopulmonary dysplasia in preterm infants: a systematic review and meta-analysis. J Pediatr. (2023) 258:113370. doi: 10.1016/j.jpeds.2023.01.024

PubMed Abstract | Crossref Full Text | Google Scholar

52. Pezza L, Alonso-Ojembarrena A, Elsayed Y, Yousef N, Vedovelli L, Raimondi F, et al. Meta-Analysis of lung ultrasound scores for early prediction of bronchopulmonary dysplasia. Ann Am Thorac Soc. (2022) 19(4):659–67. doi: 10.1513/AnnalsATS.202107-822OC

PubMed Abstract | Crossref Full Text | Google Scholar

53. Laughon MM, Langer JC, Bose CL, Smith PB, Ambalavanan N, Kennedy KA, et al. Prediction of bronchopulmonary dysplasia by postnatal age in extremely premature infants. Am J Respir Crit Care Med. (2011) 183(12):1715–22. doi: 10.1164/rccm.201101-0055OC

PubMed Abstract | Crossref Full Text | Google Scholar

54. Palit S, Shrestha AK, Thapa S, Grimm S L, Coarfa C, Theis F, et al. Leveraging integrated RNA sequencing to decipher adrenomedullin’s protective mechanisms in experimental bronchopulmonary dysplasia. Genes (Basel). (2024) 15(6):806. doi: 10.3390/genes15060806

PubMed Abstract | Crossref Full Text | Google Scholar

55. Pammi M, Aghaeepour N, Neu J. Multiomics, artificial intelligence, and precision medicine in perinatology. Pediatr Res. (2023) 93(2):308–15. doi: 10.1038/s41390-022-02181-x

PubMed Abstract | Crossref Full Text | Google Scholar

56. Hu Z, Liu C, Mao Y, Shi J, Xu J, Zhou G, et al. Integration of transcriptomics reveals ferroptosis-related signatures and immune cell infiltration in bronchopulmonary dysplasia. Heliyon. (2023) 9(10):e21093. doi: 10.1016/j.heliyon.2023.e21093

PubMed Abstract | Crossref Full Text | Google Scholar

57. You Y, Wang L, Liu C, Wang X, Zhou L, Zhang Y, et al. Early metabolic markers as predictors of respiratory complications in preterm infants with bronchopulmonary dysplasia. Early Hum Dev. (2024) 190:105950. doi: 10.1016/j.earlhumdev.2024.105950

PubMed Abstract | Crossref Full Text | Google Scholar

58. Li Y, He L, Zhao Q, Bo T. Microbial and metabolic profiles of bronchopulmonary dysplasia and therapeutic effects of potential probiotics limosilactobacillus reuteri and Bifidobacterium bifidum. J Appl Microbiol. (2022) 133(2):908–21. doi: 10.1111/jam.15602

PubMed Abstract | Crossref Full Text | Google Scholar

59. Yoneda K, Seki T, Kawazoe Y, Ohe K, Takahashi N. Neonatal research network of Japan. Immediate postnatal prediction of death or bronchopulmonary dysplasia among very preterm and very low birth weight infants based on gradient boosting decision trees algorithm: a nationwide database study in Japan. PloS One. (2024) 19(3):e0300817. doi: 10.1371/journal.pone.0300817

PubMed Abstract | Crossref Full Text | Google Scholar

60. Hwang JK, Kim DH, Na JY, Son J, Oh YJ, Jung D, et al. Two-stage learning-based prediction of bronchopulmonary dysplasia in very low birth weight infants: a nationwide cohort study. Front Pediatr. (2023) 11:1155921. doi: 10.3389/fped.2023.1155921

PubMed Abstract | Crossref Full Text | Google Scholar

61. Picard M, Scott-Boyer MP, Bodein A, Périn O, Droit A. Integration strategies of multi-omics data for machine learning analysis. Comput Struct Biotechnol J. (2021) 19:3735–46. doi: 10.1016/j.csbj.2021.06.030

PubMed Abstract | Crossref Full Text | Google Scholar

62. Baena-Miret S, Reverter F, Vegas E. A framework for block-wise missing data in multi-omics. PloS One. (2024) 19(7):e0307482. doi: 10.1371/journal.pone.0307482

PubMed Abstract | Crossref Full Text | Google Scholar

63. Davies EM, Buckley BJR, Austin P, Lip GYH, Oni L, McDowell G, et al. Routine cardiac biomarkers for the prediction of incident major adverse cardiac events in patients with glomerulonephritis: a real-world analysis using a global federated database. BMC Nephrol. (2024) 25(1):233. doi: 10.1186/s12882-024-03667-y

PubMed Abstract | Crossref Full Text | Google Scholar

64. Tang AS, Woldemariam SR, Miramontes S, Norgeot B, Oskotsky TT, Sirota M. Harnessing EHR data for health research. Nat Med. (2024) 30(7):1847–55. doi: 10.1038/s41591-024-03074-8

PubMed Abstract | Crossref Full Text | Google Scholar

65. Kim MK, Rouphael C, McMichael J, Welch N, Dasarathy S. Challenges in and opportunities for electronic health record-based data analysis and interpretation. Gut Liver. (2024) 18(2):201–8. doi: 10.5009/gnl230272

PubMed Abstract | Crossref Full Text | Google Scholar

66. Mirza B, Wang W, Wang J, Choi H, Chung NC, Ping P. Machine learning and integrative analysis of biomedical big data. Genes (Basel). (2019) 10(2):87. doi: 10.3390/genes10020087

PubMed Abstract | Crossref Full Text | Google Scholar

67. Franklin G, Stephens R, Piracha M, Tiosano S, Lehouillier F, Koppel R, et al. The sociodemographic biases in machine learning algorithms: a biomedical informatics perspective. Life Basel Switz. (2024) 14(6):652. doi: 10.3390/life14060652

PubMed Abstract | Crossref Full Text | Google Scholar

68. Ashton JJ, Young A, Johnson MJ, Beattie RM. Using machine learning to impact on long-term clinical care: principles, challenges, and practicalities. Pediatr Res. (2023) 93(2):324–33. doi: 10.1038/s41390-022-02194-6

PubMed Abstract | Crossref Full Text | Google Scholar

69. Heneghan JA, Walker SB, Fawcett A, Bennett TD, Dziorny AC, Sanchez-Pinto LN, et al. The pediatric data science and analytics subgroup of the pediatric acute lung injury and sepsis investigators network: use of supervised machine learning applications in pediatric critical care medicine research. Pediatr Crit Care Med J Soc Crit Care Med World Fed Pediatr Intensive Crit Care Soc. (2024) 25(4):364–74. doi: 10.1097/PCC.0000000000003425

PubMed Abstract | Crossref Full Text | Google Scholar

70. Wilson CG, Altamirano AE, Hillman T, Tan JB. Data analytics in a clinical setting: applications to understanding breathing patterns and their relevance to neonatal disease. Semin Fetal Neonatal Med. (2022) 27(5):101399. doi: 10.1016/j.siny.2022.101399

PubMed Abstract | Crossref Full Text | Google Scholar

71. Obermeyer Z, Emanuel EJ. Predicting the future—big data, machine learning, and clinical medicine. N Engl J Med. (2016) 375(13):1216–9. doi: 10.1056/NEJMp1606181

PubMed Abstract | Crossref Full Text | Google Scholar

72. Mpanya D, Celik T, Klug E, Ntsinjana H. Machine learning and statistical methods for predicting mortality in heart failure. Heart Fail Rev. (2021) 26(3):545–52. doi: 10.1007/s10741-020-10052-y

PubMed Abstract | Crossref Full Text | Google Scholar

73. Sedlakova J, Daniore P, Horn Wintsch A, Wolf M, Stanikic M, Haag C, et al. Challenges and best practices for digital unstructured data enrichment in health research: a systematic narrative review. PLOS Digit Health. (2023) 2(10):e0000347. doi: 10.1371/journal.pdig.0000347

PubMed Abstract | Crossref Full Text | Google Scholar

74. Li Y, Bai C, Reddy CK. A distributed ensemble approach for mining healthcare data under privacy constraints. Inf Sci. (2016) 330:245–59. doi: 10.1016/j.ins.2015.10.011

PubMed Abstract | Crossref Full Text | Google Scholar

75. Ghasemi A, Hashtarkhani S, Schwartz DL, Shaban-Nejad A. Explainable artificial intelligence in breast cancer detection and risk prediction: a systematic scoping review. Cancer Innov. (2024) 3(5):e136. doi: 10.1002/cai2.136

PubMed Abstract | Crossref Full Text | Google Scholar

76. Barda AJ, Horvat CM, Hochheiser H. A qualitative research framework for the design of user-centered displays of explanations for machine learning model predictions in healthcare. BMC Med Inform Decis Mak. (2020) 20(1):257. doi: 10.1186/s12911-020-01276-x

PubMed Abstract | Crossref Full Text | Google Scholar

77. Liu H, Zhong Z, Sebe N, Satoh S. Mitigating robust overfitting via self-residual-calibration regularization. Artif Intell. (2023) 317:103877. doi: 10.1016/j.artint.2023.103877

Crossref Full Text | Google Scholar

78. Carini C, Seyhan AA. Tribulations and future opportunities for artificial intelligence in precision medicine. J Transl Med. (2024) 22(1):411. doi: 10.1186/s12967-024-05067-0

PubMed Abstract | Crossref Full Text | Google Scholar

79. Marques M, Almeida A, Pereira H. The Medicine Revolution Through Artificial Intelligence: Ethical Challenges of Machine Learning Algorithms in Decision-Making. Cureus. 2024 September 14 (cited 2024 October 24). Available online at: https://www.cureus.com/articles/288300-the-medicine-revolution-through-artificial-intelligence-ethical-challenges-of-machine-learning-algorithms-in-decision-making

Google Scholar

80. Leigh RM, Pham A, Rao SS, Vora FM, Hou G, Kent C, et al. Machine learning for prediction of bronchopulmonary dysplasia-free survival among very preterm infants. BMC Pediatr. (2022) 22(1):542. doi: 10.1186/s12887-022-03602-w

PubMed Abstract | Crossref Full Text | Google Scholar

81. Mandel JC, Kreda DA, Mandl KD, Kohane IS, Ramoni RB. SMART On FHIR: a standards-based, interoperable apps platform for electronic health records. J Am Med Inform Assoc. (2016) 23(5):899–908. doi: 10.1093/jamia/ocv189

PubMed Abstract | Crossref Full Text | Google Scholar

82. Ohno-Machado L. Sharing data for the public good and protecting individual privacy: informatics solutions to combine different goals. J Am Med Inform Assoc. (2013) 20(1):1–1. doi: 10.1136/amiajnl-2012-001513

PubMed Abstract | Crossref Full Text | Google Scholar

83. Sullivan BA, Beam K, Vesoulis ZA, Aziz KB, Husain AN, Knake LA, et al. Transforming neonatal care with artificial intelligence: challenges, ethical consideration, and opportunities. J Perinatol. (2024) 44(1):1–11. doi: 10.1038/s41372-023-01848-5

PubMed Abstract | Crossref Full Text | Google Scholar

84. Kim J, Villarreal M, Arya S, Hernandez A, Moreira A. Bridging the gap: exploring bronchopulmonary dysplasia through the Lens of biomedical informatics. J Clin Med. (2024) 13(4):1077. doi: 10.3390/jcm13041077

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: bronchopulmonary dysplasia, predictive analytics, artificial intelligence, machine learning, personalized medicine, bioinformatics

Citation: McOmber BG, Moreira AG, Kirkman K, Acosta S, Rusin C and Shivanna B (2024) Predictive analytics in bronchopulmonary dysplasia: past, present, and future. Front. Pediatr. 12:1483940. doi: 10.3389/fped.2024.1483940

Received: 20 August 2024; Accepted: 29 October 2024;
Published: 20 November 2024.

Edited by:

Jing Liu, Capital Medical University, China

Reviewed by:

Daniel Vijlbrief, University Medical Center Utrecht, Netherlands
Jonathan Michael Davis, Tufts University, United States

Copyright: © 2024 McOmber, Moreira, Kirkman, Acosta, Rusin and Shivanna. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Binoy Shivanna, c2hpdmFubmFAYmNtLmVkdQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.