Respiratory distress syndrome prediction at birth by optical skin maturity assessment and machine learning models for limited-resource settings: a development and validation study

Reis, Zilma Silveira Nogueira; Pappa, Gisele Lobo; Nader, Paulo de Jesus H.; do Vale, Marynea Silva; Silveira Neves, Gabriela; Vitral, Gabriela Luiza Nogueira; Mussagy, Nilza; Norberto Dias, Ivana Mara; Romanelli, Roberta Maia de Castro

doi:10.3389/fped.2023.1264527

CLINICAL TRIAL article

Front. Pediatr., 15 November 2023

Sec. Neonatology

Volume 11 - 2023 | https://doi.org/10.3389/fped.2023.1264527

This article is part of the Research Topic: Technologies for Neonatal Care in LMICs

Zilma Silveira Nogueira Reis¹*

Gisele Lobo Pappa²

Paulo de Jesus H. Nader³

Marynea Silva do Vale⁴

Gabriela Silveira Neves⁵

Gabriela Luiza Nogueira Vitral⁶

Nilza Mussagy⁷

Ivana Mara Norberto Dias⁷

Roberta Maia de Castro Romanelli¹

¹Faculdade de Medicina, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
²Departamento de Ciência da Computação, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
³Pediatrics and Neonatology Department, University Hospital, ULBRA, Canoas, Brazil
⁴Neonatal Intensive Care Unit, University Hospital, UFMA, São Luis, Brazil
⁵Hospital Sofia Feldman, Belo Horizonte, Brazil
⁶Faculdade de Medicina da Ciências Médicas de Minas Gerais, Belo Horizonte, Brazil
⁷Hospital Central de Maputo, Maputo, Mozambique

Background: A handheld optical device was developed to evaluate a newborn's skin maturity by assessing the photobiological properties of the tissue and processing it with other variables to predict early neonatal prognosis related to prematurity. This study assessed the device's ability to predict respiratory distress syndrome (RDS).

Methods: To assess the device's utility, we enrolled newborns at childbirth in six urban perinatal centers from two multicenter single-blinded clinical trials. All newborns had inpatient follow-up until 72 h of life. We trained supervised machine learning models with data from 780 newborns in a Brazilian trial and provided external validation with data from 305 low-birth-weight newborns from another trial that assessed Brazilian and Mozambican newborns. The index test measured skin optical reflection with an optical sensor and adjusted the acquired values with clinical variables such as birth weight, antenatal corticosteroid exposure for lung maturation, maternal diabetes, and hypertensive disorders. The performance of the models was evaluated using intrasample k-fold cross-validation and external validation in an independent sample.

Results: Models with three predictors (skin reflection, birth weight, and antenatal corticosteroid exposure) and models with five predictors, which additionally included maternal diabetes and hypertensive diseases, performed similarly. The best global accuracy was 89.7% (95% CI: 87.4 to 91.8), with a high sensitivity of 85.6% (95% CI: 80.2 to 90.0) and specificity of 91.3% (95% CI: 88.7 to 93.5). The test correctly discriminated RDS newborns in external validation, with 82.3% (95% CI: 77.5 to 86.4) accuracy. Our findings demonstrate a new way to assess a newborn's lung maturity, providing potential opportunities for earlier and more effective care.

Trial registration: RBR-3f5bm5 (online access: http://www.ensaiosclinicos.gov.br/rg/RBR-3f5bm5/), and RBR-33rnjf (online access: https://ensaiosclinicos.gov.br/rg/RBR-33rnjf/).

Introduction

Infant mortality is a critical human development indicator since it reflects the quality of assistance as well as social, economic, and environmental factors (1). Most child deaths are due to prematurity, with lung immaturity as the main underlying cause (2). Approximately 11% of newborns worldwide are preterm, born earlier than 37 weeks of gestational age, of whom 6% are late preterm, born between 34 and 37 weeks of gestational age (3), and require specialized care (4). Respiratory distress syndrome (RDS) is a common reason for neonatal intensive care unit (NICU) admission and neonatal mortality. Since lung immaturity due to surfactant deficiency is the cause of the disease, respiratory failure occurs soon after birth. However, most respiratory insufficiency at birth is not accurately evaluated, leading to poor outcomes because of delays in appropriate treatment (4, 5). Indeed, on many occasions, the respiratory picture at birth can be confused with an adaptive syndrome such as transient tachypnea of the newborn (TTN), as well as with non-respiratory causes, which may be cardiac, neurological, metabolic, or hematological, among others (6). Clinical history, lung image assessment, and blood lab tests are clues to discriminate between RDS and other respiratory distress, identifying newborns at higher risk of severe complications (7). Beyond clinical manifestation, assessing lung maturity is supported by biochemical and biophysical tests on amniotic fluid, genetic approaches, and microbubble evaluation in gastric aspirates (8). Unfortunately, the lack of healthcare technologies is most acute in low- and middle-income countries (LMICs), in scenarios with limited neonatal assistance, where the burden of preterm birth is higher than in other countries (4).

To lower infant morbidity and mortality on the day of birth, early identification of risk related to lung immaturity enhances the chances of survival, even when care depends on safe referral transportation between facilities. Nevertheless, very often, late preterm infants in particular are inappropriately classified as full-term newborns, which delays their care (9). Improvements centered on equity of technology access and quality of antenatal and childbirth care can therefore reduce neonatal health disparities among birth scenarios with or without full support for the identification and treatment of preterm children (1, 10). The search for an affordable approach to quickly identify premature infants according to the degree of lung maturity remains a relevant target for health systems. Early intervention to manage respiratory distress in a newborn could improve survival and, possibly, reduce mortality (11).

Lungs develop linearly before childbirth; however, the maturational competence for extrauterine breathing occurs later in pregnancy or under stressful influences such as maternal disease, placental dysfunction, and drug exposure (12). There is extensive evidence concerning the influence of prenatal corticosteroid exposure in preparing fetuses for life after birth (13). At the same time, the skin is a tissue with late maturation, with the protective external barrier forming only at near-term and term gestation (14, 15). Meanwhile, there is a direct relationship between epidermal layer competence and neonatal survival, given the risks of hypothermia, water loss, and infections (16, 17). Likewise, in this organ, antenatal corticotherapy induces cytodifferentiation and keratinization, enhancing the chances of survival (13). Beyond visual inspection of skin appearance, which characterizes preterm newborns (18), an objective measure of skin reflectance with a photometer was correlated with gestational age (19). Based on a multicenter clinical trial, a new medical device was able to assess gestational age by adjusting a machine learning model for optical skin maturity to antenatal corticosteroid therapy for fetal maturation (ACTFM) and birth weight, discriminating preterm newborns from term newborns (37 weeks of gestational age or more) with an area under the ROC curve of 0.970 (95% CI: 0.959–0.981) (20). The present study explored new machine learning algorithms on the same optical device to evaluate its ability to predict RDS in the first 72 h of life, even in places with scarce resources.

Methods

Cohorts

We analyzed two birth scenarios, one to provide predictive models and the other to apply them to a more realistic picture of the usage of the model. Accordingly, both studies were multicenter prospective, concurrent cohorts comprised of six urban referral perinatal centers. Five Brazilian urban referral centers for high-complexity perinatal care took part in the study: Clinical Hospital—Universidade Federal de Minas Gerais (as coordinator), Minas Gerais State; Sofia Feldman Hospital—Minas Gerais State; Hospital da Universidade Luterana do Brasil—Rio Grande do Sul State; Hospital Materno-infantil de Brasília—Federal District; and Hospital Universitário da Universidade Federal do Maranhão—Maranhão State. One referral center in Mozambique, the Maputo Central Hospital, the largest in the country, is headquartered in its capital.

Both cohorts shared inclusion criteria: live newborns enrolled within the first day of life, with an available reference standard gestational age, and childbirth after 24 weeks of gestation. Combining the last menstrual period with obstetric ultrasound assessment, we assessed gestational age at birth following international consensus for the due date (21). Anhydramnios, edema, congenital skin diseases, or chorioamnionitis were the exclusion criteria because they could modify skin structure, affecting the optical properties of the tissues. Teams of trained and certified health professionals and research assistants enrolled newborns and evaluated skin optical reflectance and clinical data at birth. All newborns had inpatient follow-up within the first 72 h of life to monitor immediate neonatal outcomes, with an early ending when discharge or death occurred, according to clinical trial protocols deposited in protocols.io (22). However, the clinical characteristics of the newborns differ between cohorts, reflecting the eligibility criterion of birth weight below 2.5 kg in the validation cohort (Figure 1).

Figure 1. Database, birth scenarios, and index test (outcomes). LBW, low birth weight; RDS, respiratory distress syndrome.

For transparency, the clinical trials register and details of enrollment remain public. From clinical trial 1, registered under the number RBR-3f5bm5 (23), we evaluated Brazilian newborns with a gestational age of 24 weeks or more and with any birth weight. The enrollment occurred from 2 January 2019 to 30 May 2021. Data from this study grounded the modeling process of machine learning prediction, thus being the baseline cohort. From clinical trial 2, registered under the number RBR-33rnjf (24), we assessed only newborns with birth weights under 2.5 kg in Brazil and Mozambique. The enrollment occurred from 15 February 2019 to 11 December 2021, and the dataset was used as the validation cohort. Most of the newborns were Mozambican (n = 177, 58.0%).

Primary outcome

The primary outcome was the prediction of RDS. The reference standard for RDS diagnosis is based on clinical, laboratory, and radiological findings and respiratory outcomes (7). However, concerning the reference standard in the scenario of LBW Mozambican newborns, when a radiological exam was absent, the diagnosis was based on clinical findings such as tachypnea, nasal flaring, retractions, and grunting with the possibility of progression to respiratory failure (24). In such a scenario, where propaedeutics and other resources are unavailable, the maternal and delivery context and the clinical progress of respiratory failure were considered, based on clinical priority during the 72 h of follow-up. Transient tachypnea of the newborn (TTN) was a differential diagnosis of respiratory complications at birth. Although RDS was the target outcome, we introduced an exploratory modeling step that discriminated between RDS, TTN, or none. The TTN diagnosis was based on clinical findings and respiratory outcomes (7). Again, TTN was diagnosed by exclusion in the Mozambican center, typically with clinical evidence of tachypnea shortly after birth, grunting, nasal flaring, retractions, and occasionally cyanosis (24). The procedures for clinical evaluation, complementary exams of the newborn, and RDS diagnosis are available in the Supplementary Material. Subgroup analyses of LBW and very-LBW newborns, with a birth weight of less than 2.5 kg and 1.5 kg, respectively, provided a potential picture of the application according to ranges of birth weight.

The index test

The development of the optical device made the assessment of newborns' skin maturity possible. We previously observed high agreement between the gestational age calculated by this device and the best available gestational age as a reference, as well as accurate discrimination of preterm from term infants (24). The error of the optical component had a prior evaluation, resulting in an intraobserver error of 1.97% (95% CI: 1.84–2.11) and an interobserver error of 2.6% (95% CI: 2.1–3.1) (24). The present analysis focused on RDS prediction as an additional value beyond the gestational age. Here, the index test was intended to analyze newborn lung maturity, clinically represented by RDS, as an unprecedented association with the optical skin maturity measurement in a machine learning algorithm.

In this study, predictor data were collected on the first day of life, when the user did not receive the RDS prediction result, which ensured test blinding. Alongside skin reflectance, automatically acquired when the device touches the sole of the newborn, clinical variables were entered by the user, and machine learning algorithms delivered the RDS prediction, which was stored in the processor (Figure 2). In the future, the RDS prediction will be available on the device's screen.

Figure 2. Steps of the testing process. (1) The device touches the skin over the sole of a newborn. (2) The sensor acquires skin maturity by assessing the photobiological properties of the tissue when measuring the reflection portions of the light beam incident on the skin. (3) The user inputs clinical data. (4) The data processor uses machine learning algorithms to predict respiratory distress syndrome.

The testing steps were standardized and supported by prior proof-of-concept publications. The sole was the body site with the highest correlation between skin reflection and pregnancy dating, with the advantage of fulfilling the patient safety recommendation for minimum manipulation of newborns (19). The influence of skin color and environmental conditions such as humidity, temperature, and ambient light motivated an enhanced sensor design, achieving a prediction model that does not need to adjust for them (19, 25). This approach met patient safety requirements, including disinfection of the device with 70% alcohol and minimum manipulation of the child wherever they were: inside an incubator, in a warm crib, or in the mother's lap.

Standard and data collection

According to recommendations for good clinical practices involving human research with medical devices, and according to the International Organization for Standardization (ISO 14155:2011), trained research assistants collected data on 65 demographic and clinical features and 25 skin variables. The framework of variables is available in a previous report (20). Clinical information was collected through structured questionnaires using software developed for the clinical trials and, simultaneously, on paper forms containing the same items. In the data curation process, senior researchers double-checked the paper and electronic records before the outcomes were unblinded. Checks of data consistency and completeness resulted in only one exclusion.

Data availability

Data is available upon reasonable request and after anonymization to ensure ethical and legal data sharing, thus preserving the confidentiality of the persons who participated in this study.

Ethics and dissemination

The studies involving humans had independent ethical board approval at each hospital. The Brazilian National Research Council approved the clinical trials under numbers 81347817.6.1001.5149 and 91134218.4.0000.5149. In Mozambique, ethical approval was under the number IRB00002657, according to the National Bioethics Council. Parents signed an informed consent form on behalf of the newborns as recommended by the Regulatory Bodies for Good Clinical Research Practice, and copies were retained in case they should be needed. Patients were not involved in the design of clinical trials. However, participants' parents received oral explanations and an illustrated printed leaflet describing the purpose of the studies. Besides scientific articles, the results are continuously disseminated by non-scientific publications in media and on the project website: http://skinage.medicina.ufmg.br.

Methods for estimating or comparing measures of diagnostic accuracy

Model development

We trained the models for binary prediction of RDS occurrence until 72 h of life with the five variables and, additionally, for the three-class outcome RDS, TTN, or none. The variables were: skin reflection, birth weight, ACTFM, diabetes, and hypertensive disorders. The choice of independent variables took into account the easy access to data in the delivery scenario, the biological plausibility, and the graphical analysis of feature importance. Furthermore, we compared models based on three or five independent variables, that is, without or with maternal diseases. A wide range of models was tested, and the best results were obtained by the XGBoost Regressor model (26).

Model validation

The model was created using data from Clinical Trial 1. Two experiments were performed. In the first one, a ten-fold cross-validation procedure was used to assess the robustness of the model. This procedure was repeated 30 times, generating a total of 300 models that had their metrics of accuracy averaged and reported together with confidence intervals. The second experiment used data from Clinical Trial 1 to generate the model and from Clinical Trial 2 to validate the model.
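The repeated cross-validation described above can be sketched as follows. This is a minimal illustration using only the Python standard library, not the authors' pipeline: the function name `repeated_kfold_indices`, the shuffling scheme, and the seed are assumptions, and the source does not state whether the folds were stratified by outcome.

```python
import random

def repeated_kfold_indices(n_samples, n_splits=10, n_repeats=30, seed=0):
    """Yield (repeat, fold, train_idx, test_idx) for repeated k-fold CV.

    Hypothetical helper: each repeat reshuffles the sample indices and
    deals them into n_splits folds; every fold is held out exactly once.
    """
    rng = random.Random(seed)
    for repeat in range(n_repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)
        # Deal the shuffled indices into n_splits roughly equal folds.
        folds = [indices[i::n_splits] for i in range(n_splits)]
        for held_out in range(n_splits):
            test_idx = folds[held_out]
            train_idx = [i for f, fold in enumerate(folds)
                         if f != held_out for i in fold]
            yield repeat, held_out, train_idx, test_idx

# 780 newborns in the development cohort, 10 folds repeated 30 times.
splits = list(repeated_kfold_indices(780))
print(len(splits))  # 300
```

Each of the 300 splits would train one model on `train_idx` and score it on `test_idx`; the per-split metrics are then averaged. How the confidence intervals around those averages were computed is not specified in the text.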

Statistical analysis

For descriptive analysis of variables, we used mean (SD) and median (IQR) to describe continuous variables with symmetric and asymmetric distributions, respectively. We used frequencies (%) for categorical variables. The Student's t-test and the Mann-Whitney U test were used to compare means or medians between two groups of interest (RDS yes or no), according to the parametric or non-parametric distribution of the variables. For comparisons between frequencies, the chi-square test evaluated the independence hypothesis between categorical variables, such as preterm status vs. RDS yes or no, and the likelihood ratio chi-square statistic was the alternative when more than 20% of expected values were below five. ANOVA or Kruskal-Wallis tests compared the three groups (RDS, TTN, and none) according to the parametric or non-parametric distribution of the variables.

The set of machine learning models provided outcomes for binary RDS (yes or no) and three classes (RDS, TTN, none). The choice of the best models occurred by means of reliability analysis. The accuracy of the prediction of best models was evaluated using sensitivity, specificity, positive predictive value, negative predictive value, positive likelihood ratio, and negative likelihood ratio. P-values of <0.05 were considered suggestive of statistical significance. SPSS software (version 19.0; IBM Corp) was used for statistical data analysis.
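The accuracy measures listed above all derive from the four cells of a 2×2 confusion matrix. The sketch below shows the standard definitions; the function name `diagnostic_metrics` is hypothetical, and the counts in the usage lines are an assumption, back-calculated from the reported 215 RDS cases among 780 newborns and the reported sensitivity and specificity rather than taken from Table 2.

```python
import math

def diagnostic_metrics(tp, fp, fn, tn):
    """Standard diagnostic accuracy measures from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)   # true positive rate
    specificity = tn / (tn + fp)   # true negative rate
    return {
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": tp / (tp + fp),     # positive predictive value
        "npv": tn / (tn + fn),     # negative predictive value
        # Likelihood ratios; guard against division by zero.
        "lr_positive": sensitivity / (1 - specificity) if specificity < 1 else math.inf,
        "lr_negative": (1 - sensitivity) / specificity if specificity > 0 else math.inf,
    }

# ASSUMED counts, reconstructed from the reported percentages (not from Table 2).
metrics = diagnostic_metrics(tp=184, fp=49, fn=31, tn=516)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Unlike sensitivity and specificity, the predictive values depend on how common RDS is in the sample, which matters when comparing cohorts with different RDS frequencies.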

Results

Description of newborns

Newborns from the two clinical trials totaled 1,085 tests with the medical device. From the baseline scenario dataset, where we set the RDS predictive models, we analyzed data from 702 Brazilian pregnant women who gave birth to 781 newborns with gestational ages older than 24 weeks (scenario 1). One exclusion occurred due to uncertainty between a TTN and an RDS diagnosis. Among 780 included newborns, 325 (41.7%) were low-birth-weight (LBW), and 27.6% (n = 215) had RDS. In the validation scenario, we analyzed data from 263 pregnant women who gave birth to 308 newborns with birth weights under 2.5 kg (scenario 2). Three exclusions occurred due to incorrect enrollment. Among the 305 included newborns, 37.7% (n = 112) had RDS. An overview of participants, according to development and model validation steps with respective birth scenarios and test outcomes, for the best models of prediction, is shown in Figure 3.

Figure 3. Flowchart of participants using STARD diagram, according to development and model validation birth scenarios.

The participants' baseline demographic and clinical characteristics are shown in Table 1, considering subgroups of newborns with and without RDS in the birth scenarios of the study. Regarding prenatal data, newborns with RDS had a higher frequency of mothers with diabetes (p < 0.001) and hypertensive disease (p < 0.001) in birth scenario 1, but not in scenario 2 (p = 0.086 and p = 0.453, respectively). An important baseline characteristic to highlight is the no-RDS subgroup profile with high maternal disease frequency, ventilatory support, and NICU admission. For instance, the no-RDS subgroup of LBW newborns in the validation scenario comprised 102 (53.1%) newborns with mothers affected by hypertensive diseases and 115 (59.6%) newborns admitted to NICU. In both scenarios, children with RDS had higher ACTFM exposure (p < 0.001), lower gestational age (p < 0.001), lower birth weight (p < 0.001), and lower first-minute Apgar score (p < 0.001) than those without RDS.

Table 1. Baseline demographic and clinical characteristics of the pregnancy and newborns of the baseline and validation cohorts.

Comparing birth scenarios, the newborns had similar characteristics concerning rupture of membranes for more than 18 h (p = 0.421), positive-pressure ventilation (p = 0.844), intubation at birth (p = 0.131), surfactant resuscitation steps (p = 0.697), and mechanical ventilation (p = 0.864) until 72 h of life. However, the LBW newborns in birth scenario 2 had higher morbidity and mortality rates (p < 0.001) than newborns in birth scenario 1.

Despite the primary outcome being RDS prediction, we still provided a more detailed analysis in the Supplementary Material, comparing three subgroups: RDS newborns, TTN newborns, and newborns without RDS or TTN.

Primary outcome

The machine learning modeling incorporated combinations of maternal and newborn characteristics associated with RDS to develop predictive algorithms that are useful at birth. Analyzing the feature importance given by XGBoost (Figure 4) and the metrics of accuracy, precision, and recall (Supplementary Material), we considered the gain insufficient when maternal disease variables were inserted into the model. Models including hypertensive disease and diabetes data for the binary RDS outcome had similar accuracy and F1 scores to models with the three baseline variables: skin reflection, birth weight, and ACTFM. ACTFM was the variable with the highest importance in predicting RDS, followed by birth weight and skin reflection acquired by the optical component of the medical device, in both model 1 and model 2 (Figure 4).

Figure 4. Attribute importance given by XGBoost when considering information gain that a variable brings when inserted into the model. (A) Model 1: trained with skin reflection + birth weight + Antenatal corticosteroid therapy for lung maturation, for the binary outcome RDS vs. non-RDS. (B) Model 2: trained with Skin reflection + birth weight + Antenatal corticosteroid therapy for lung maturation + diabetes + hypertensive diseases for the binary outcome RDS vs. non-RDS.

In relation to discriminating among RDS, TTN, and neither of them using three classes of outcome modeling (models 3 and 4, Supplementary Material), the performance was worse than binary RDS yes/no prediction (models 1 and 2, Supplementary Material). When applying the models in the scenario of LBW newborns for external validation, metrics of prediction performance confirmed the advantages of the three-variable model with a binary RDS yes or no outcome, with an accuracy of 89.4% (95% CI: 88.6 to 90.3) and 82.3% in the cross-validation and external validation, respectively (model 1, Supplementary Material). As detailed in Supplementary Material, we chose the most parsimonious models for complete accuracy analysis.

There were no adverse events when performing the index test. The prediction accuracy of the test using the medical device at birth for RDS occurrence until 72 h of life is detailed in Table 2. Using cross-validation in the birth scenario used for modeling, algorithms with three or five independent variables delivered similar predictions regarding RDS discrimination, 89.7% (95% CI: 87.4 to 91.8) and 89.4% (95% CI: 87.0 to 91.4), respectively. Such accuracy occurred with high sensitivity and specificity, and the likelihood ratio for RDS was increased by approximately 10 times when the index test was positive. According to LBW and very-LBW newborns subgroup analysis, RDS prediction occurred with a high accuracy of 91.9% (95% CI: 86.0 to 95.9) despite a low specificity of 9.1% (95% CI: 0.23 to 41.3) when using model 1. Model 2, obtained with five variables, had no utility for RDS prediction in very-LBW newborns.
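The statement that a positive index test raises the likelihood of RDS roughly tenfold can be checked against the standard likelihood ratio definitions. The worked figures below assume that the sensitivity (85.6%) and specificity (91.3%) reported in the abstract for the best model apply to the cross-validated model discussed here; Table 2 remains the authoritative source.

```latex
\[
LR^{+} = \frac{\text{sensitivity}}{1 - \text{specificity}}
       = \frac{0.856}{1 - 0.913}
       = \frac{0.856}{0.087} \approx 9.8,
\qquad
LR^{-} = \frac{1 - \text{sensitivity}}{\text{specificity}}
       = \frac{0.144}{0.913} \approx 0.16
\]
```

Under these assumptions, a positive result multiplies the pre-test odds of RDS by about 10, and a negative result reduces them to roughly one-sixth.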

Table 2. Accuracy for respiratory distress syndrome during the first 72 h of life, according to the predictive algorithms with binary outcomes.

Using the models for external validation in LBW newborns, algorithms with or without maternal diseases included had similar performance in predicting RDS as RDS occurrence was correctly predicted in 82% of newborns (95% CI: 77.5 to 86.4). The likelihood ratio for RDS increased approximately five times when the index test was positive (Table 2). Regarding the subgroup analysis of very-LBW newborns, global accuracy was similar to the overall group: 84.9% (95% CI: 74.6 to 92.2) for the model with or without maternal diseases as predictors.
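Confidence intervals for a proportion such as the external-validation accuracy can be obtained with an exact binomial (Clopper-Pearson) interval. The sketch below is illustrative only: the source does not state which interval method was used, the function names are hypothetical, and the count of 251 correct predictions out of 305 is an assumption inferred from the reported 82.3%.

```python
import math

def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p), summed in log space for stability."""
    if k < 0:
        return 0.0
    if k >= n:
        return 1.0
    total = 0.0
    for i in range(k + 1):
        log_pmf = (math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
                   + i * math.log(p) + (n - i) * math.log1p(-p))
        total += math.exp(log_pmf)
    return min(1.0, total)

def clopper_pearson(k, n, alpha=0.05):
    """Exact two-sided (1 - alpha) interval for k successes in n trials."""
    def bisect(func):
        # func is increasing in p; find its root on (0, 1) by bisection.
        lo, hi = 1e-12, 1 - 1e-12
        for _ in range(100):
            mid = (lo + hi) / 2
            if func(mid) < 0:
                lo = mid
            else:
                hi = mid
        return (lo + hi) / 2

    # Lower bound: P(X >= k | p) = alpha / 2.
    lower = 0.0 if k == 0 else bisect(
        lambda p: (1 - binom_cdf(k - 1, n, p)) - alpha / 2)
    # Upper bound: P(X <= k | p) = alpha / 2.
    upper = 1.0 if k == n else bisect(
        lambda p: alpha / 2 - binom_cdf(k, n, p))
    return lower, upper

# ASSUMED count: 251 of 305 correct, inferred from the reported 82.3%.
low, high = clopper_pearson(251, 305)
print(f"accuracy = {251 / 305:.3f}, 95% CI = ({low:.3f}, {high:.3f})")
```

The exact interval is conservative, which suits the small subgroups analyzed here, such as the very-LBW newborns, where normal approximations tend to be unreliable.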

Analyzing the confusion matrix for RDS prediction according to gestational age at birth (Figure 5), we found false positives and false negatives more frequently around 33 and 34 weeks of gestation in both birth scenarios. However, it is relevant to notice that, in external validation, the three-variable model (model 1) discriminated most of the LBW newborns with (true positive) and without (true negative) RDS in the range of 29 to 37 weeks of gestation.

Figure 5. Confusion matrix for respiratory distress syndrome prediction until 72 h of life, according to gestational age at birth, using the three-variable model. (A) Incorrect prediction in birth scenario 1 - Cross-validation (n = 780). (B) Incorrect prediction in birth scenario 2, LBW - External validation (n = 305). (C) Correct prediction in birth scenario 1 - Cross-validation (n = 780). (D) Correct prediction in birth scenario 2, LBW - External validation (n = 305).

In order to inspect similarities and differences between newborns with or without correct RDS prediction, we compared the clinical characteristics in the validation scenario (Supplementary Material). Gestational age, birth weight, maternal diseases, and TTN occurrence were statistically similar between subgroups. Only NICU admission within the first 72 h occurred more frequently in newborns with an incorrect prediction (90.7% vs. 70.9%, p = 0.002).

Discussion

Main findings

Improving healthcare equity is a primary goal of the United Nations, and this aim makes the reduction of infant mortality a priority (27). Digital health, including affordable and valuable medical devices and artificial intelligence, has brought hope to improve health for everyone (28, 29). The main outcome of the present study was a promising predictive model that uses a medical device with an embedded AI algorithm. Of every 100 newborns assessed, 90 were correctly classified as being at higher risk or not for RDS until 72 h of life, considering the dataset used to develop the predictive models. The prediction accuracy remained high in the LBW newborns that composed the validation scenario, 82 in every 100, where RDS and other neonatal morbidities and mortality were more frequent than in the model development scenario.

Similar studies have integrated computational technology to identify predictors of neonatal mortality, such as the lecithin and sphingomyelin ratio by machine learning applied to mid-infrared spectra (30) or acoustic features of the crying of newborns (31). Reviews have highlighted the importance of birth weight, Apgar score, and antenatal steroids (28). Our approach has the advantage of using only three predictive variables, obtained prospectively in clinical trials, to provide a prediction before the disease occurs. Models with five predictive variables, including maternal diseases (i.e., diabetes and hypertensive diseases), did not show advantages over models based on skin maturity optical assessment, birth weight, and steroids. This finding will certainly facilitate the use of the device by caregivers who deliver care at birth in LMICs.

Comparisons and subgroups of analysis

Considering the very-LBW subgroup of analysis, our results with a three-variable predictive model achieved an accuracy of 84.9% (95% CI, 74.6 to 92.2). In comparison, using an extensive historical 14-year inpatient dataset and many predictive variables, Jaskari et al. classified bronchopulmonary dysplasia in a retrospective dataset of very-LBW newborns, with an AUROC of around 0.899 (32). Furthermore, analyzing a prospective dataset of newborns older than 24 weeks of gestation, our modeling achieves an accuracy of 89.7% (95% CI, 87.4 to 91.8), while Betts et al. reported RDS prediction with an accuracy of 0.923 (0.917, 0.928) among inpatients younger than 39 weeks of gestation (33), using the same dataset as Jaskari et al. (32). So far, our study is the first to use a physical measurement of skin maturity, previously described (16, 19, 20), with a prospective dataset from clinical trials, reaching an accuracy close to that of other, more complex models.

Early detection of severe neonatal morbidities such as RDS is critical to halt disease progression and prevent further complications or death. Risk identification of the occurrence might provide means for opportune diagnosis and due care with surfactant access, enhancing chances of survival with minimal sequelae, even with the referral of newborns (5). In LMICs, the availability of a NICU in a center of excellence is often far from the place of birth of this preterm infant (4). The limited number of intensive care beds that can receive real RDS-risk newborns justifies a reliable and helpful predictive test to support low-risk newborns' retention decisions, optimizing resources. By analyzing the confusion matrix, the outcome of the present study showed early and promising discrimination of RDS even in late preterm newborns in the development and LBW validation scenarios.

Worldwide, hard decisions in scenarios with scarce resources are taken daily based on birth weight, with particular attention to late preterm births that account for most preterm births (34). Birth weight is the most accessible and significant determinant of the likelihood of survival at birth, but it alone is not enough to predict neonatal outcomes. Placental dysfunction, maternal-fetal conditions affecting lung maturation such as smoking, cardiovascular diseases, and prenatal exposure to drugs such as steroids are also determinants (35). Known antenatal predictors of RDS, such as prenatal Doppler velocimetry and the lamellar body count test on gastric aspirates have limitations in LMICs due to high costs and a lack of professionals with the necessary skills (8, 36).

Implications for practice and the role of the index test

The role of the index test used to predict RDS might be to give a prompt risk indication immediately at birth, anticipating best management practices in scenarios with limited resources or optimizing access to existing facilities. This study is a premarket approach using data from two clinical trials to validate the algorithm for real-time RDS prediction at birth. The skin reflection is acquired by the device, and the user quickly enters a few clinical variables, as presented in Figure 1. Facilities without neonatologists, mobile emergency services, and caregivers in primary units where a preterm birth can occur are the potential targets of this device. The approach is intended to quickly offer a prediction based on variables easily accessible at birth, added to the skin maturity assessment, even outside hospitals. In the same way, professionals in maternity and NICU settings could use this prediction to manage the clinical follow-up of newborns and bed occupancy.

Despite recent advances in the perinatal management of RDS, controversies still exist. There is now less emphasis on radiographic diagnosis and classification of RDS, such as a ground-glass appearance with air bronchograms. Definitions based on blood gas analyses are also becoming redundant, as management has moved towards a preventive surfactant treatment approach based on clinical assessment of the work of breathing and oxygen requirement, to avoid worsening of the syndrome. Current RDS management aims to maximize survival while minimizing complications such as air leaks and bronchopulmonary dysplasia (5).

Sources of potential bias and generalizability

Despite the development of a new technology that allows skin maturity associated with birth data to be used as a marker of lung maturity, sources of potential bias can limit the generalizability of the outcomes. The development and validation scenarios had relevant differences regarding RDS frequency in newborns, morbidity, and mortality. Moreover, the accuracy of the machine learning models was sustained by a high specificity of 91.3% (95% CI, 88.7 to 93.5). With false-positive RDS predictions in LBW newborns, unnecessary interventions such as transfer to a referral center could occur in approximately 18% of newborns. Nonetheless, assuming the test is implemented as a screening tool, with point-of-care prediction used in conjunction with clinical protocols, this approach has the potential to enhance neonatal care. Future studies are necessary to measure the influence of disease incidence on the generalizability of the models, as in primary care birth scenarios or low-complexity hospitals, where the incidence of preterm birth and RDS is lower than in our samples. The performance of the prediction in subgroup analyses by ranges of gestational age and birth weight might still require larger samples.
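
The dependence of a screening test on disease incidence can be illustrated with Bayes' rule. The sketch below is only a worked illustration under stated assumptions: the specificity of 0.913 is the value reported above, whereas the sensitivity of 0.80 and the three prevalence values are hypothetical and were chosen solely to show the trend. The function name is also hypothetical.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a test applied at a given disease prevalence."""
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    true_neg = specificity * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Specificity from the text; sensitivity and prevalences are hypothetical.
for prevalence in (0.30, 0.10, 0.02):
    ppv, npv = predictive_values(0.80, 0.913, prevalence)
    print(f"prevalence={prevalence:.2f}  PPV={ppv:.3f}  NPV={npv:.3f}")
```

With sensitivity and specificity held fixed, the positive predictive value falls as prevalence falls, so a positive result in a low-incidence birth setting is more likely to be a false positive than in the referral centers studied here.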

Regarding the importance of skin maturity in the model, the rationale, which relies on a direct relationship between epidermal barrier competence and neonatal survival, faces limitations after 35 weeks of gestation, when the epidermis is complete (37). Therefore, the test may perform better in preterm newborns than in term newborns, as in previous studies in which we used the device to predict gestational age (38). Finally, there is a potential bias associated with suboptimal pregnancy dating in the validation scenario, since the inclusion criteria admitted obstetric ultrasound examinations before 24 weeks or a reliable last menstrual period alone, as already reported (38). At the same time, data from the clinical trials in Brazil and Mozambique provided a picture of the test being used under natural conditions with barriers to high-cost technologies.

Conclusions

The objective measurement of skin maturity alongside machine learning models opens new opportunities to recognize complex patterns among variables in RDS outcome prediction. The models adjusted for skin reflection, birth weight, and ACTFM at birth as predictors of RDS within 72 h of life achieved high accuracy in both development and validation using clinical trial datasets. This study demonstrates a new way to assess neonatal lung immaturity, providing potential opportunities for earlier and more effective care with an automated medical device.

Data availability statement

The data analyzed in this study are subject to the following licenses/restrictions: data are available upon reasonable request and after anonymization to ensure ethical and legal data sharing, thus preserving the confidentiality of the persons who participated in this study. Requests to access these datasets should be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by the Brazilian National Research Council under numbers 81347817.6.1001.5149 and 91134218.4.0000.5149. In Mozambique, ethical approval was under the number IRB00002657, according to the National Bioethics Council. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

ZR: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Methodology, Supervision, Writing – original draft, Writing – review & editing. GP: Writing – original draft, Formal Analysis, Methodology, Software, Validation, Writing – review & editing. PN: Writing – review & editing, Conceptualization, Data curation, Investigation, Supervision, Validation, Writing – original draft. MM: Writing – review & editing, Conceptualization, Data curation, Investigation, Supervision, Writing – original draft. GS: Validation, Data curation, Investigation, Writing – original draft, Writing – review & editing. GV: Writing – original draft, Data curation, Investigation, Validation, Writing – review & editing. NM: Data curation, Investigation, Methodology, Writing – review & editing, Writing – original draft. IN: Data curation, Investigation, Supervision, Validation, Writing – review & editing, Writing – original draft. RR: Conceptualization, Data curation, Formal Analysis, Methodology, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article.

This study was funded by Fundação Oswaldo Cruz—Rio de Janeiro, Brazil (grant number VPPIS-002-FEX-20), and Grand Challenges Canada's Programs, Government of Canada—Toronto, Canada (grant number R-ST-POC-1807-13515). ZSNR is a researcher with a grant from the Conselho Nacional de Pesquisa (CNPq 305837/2021-4), Brazil. No sponsor had any role in the study design, data collection, data analysis, data interpretation, writing, or decision to submit the manuscript.

Acknowledgments

This work was supported, in whole or in part, by the Bill & Melinda Gates Foundation [Grant Number OPP1128907].

Conflict of interest

The authors declare a patent deposit on behalf of the Universidade Federal de Minas Gerais and Fundação de Amparo a Pesquisa de Minas Gerais, Brazil (http://www.fapemig.br/en/), under number BR1020170235688 (CTIT-PN862).

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fped.2023.1264527/full#supplementary-material

References

1. World Health Organization. Preterm birth. (2023). https://www.who.int/news-room/fact-sheets/detail/preterm-birth (Accessed May 31, 2023).

2. Kumaran G, Sethu G, Ganapathy D. Respiratory distress syndrome in infants-an overview. PalArch's J Archaeol Egypt/Egyptol. (2020) 17(7):1902–11. Available from: https://archives.palarch.nl/index.php/jae/article/view/1432

3. Delnord M, Zeitlin J. Epidemiology of late preterm and early term births–an international perspective. Semin Fetal Neonatal Med. (2019) 24(1):3–10. doi: 10.1016/j.siny.2018.09.001

4. Walani SR. Global burden of preterm birth. Int J Gynaecol Obstet. (2020) 150(1):31–3. doi: 10.1002/ijgo.13195

5. Sweet DG, Carnielli V, Greisen G, Hallman M, Ozek E, Plavka R, et al. European Consensus guidelines on the management of neonatal respiratory distress syndrome in preterm infants–2010 update. Neonatology. (2010) 97(4):402–17. doi: 10.1159/000297773

6. Elfarargy MS, Al-Ashmawy GM, Abu-Risha S, Khattab H. Novel predictor markers for early differentiation between transient tachypnea of newborn and respiratory distress syndrome in neonates. Int J Immunopathol Pharmacol. (2021) 35:20587384211000554. doi: 10.1177/20587384211000554

7. Reuter S, Moser C, Baack M. Respiratory distress in the newborn. Pediatr Rev. (2014) 35(10):417–28; quiz 29. doi: 10.1542/pir.35.10.417

8. da Silva Daniel IWB, Fiori HH, Piva JP, Munhoz TP, Nectoux AV, Fiori RM. Lamellar body count and stable microbubble test on gastric aspirates from preterm infants for the diagnosis of respiratory distress syndrome. Neonatology. (2010) 98(2):150–5. doi: 10.1159/000279887

9. De Luca D. Respiratory distress syndrome in preterm neonates in the era of precision medicine: a modern critical care-based approach. Pediatr Neonatol. (2021) 62:S3–9. doi: 10.1016/j.pedneo.2020.11.005

10. Reichman V, Brachio SS, Madu CR, Montoya-Williams D, Peña M-M. Using rising tides to lift all boats: equity-focused quality improvement as a tool to reduce neonatal health disparities. Semin Fetal Neonatal Med. (2021) 26(1):101198. PMID: 33558160

11. World Health Organization. WHO recommendations on interventions to improve preterm birth outcomes. Geneve: WHO Library Cataloguing-in-Publication Data (2015).

12. Stocks J, Hislop A, Sonnappa S. Early lung development: lifelong effect on respiratory health and disease. Lancet Respir Med. (2013) 1(9):728–42. doi: 10.1016/S2213-2600(13)70118-8

13. Ballard PL, Ballard RA. Scientific basis and therapeutic regimens for use of antenatal glucocorticoids. Am J Obstet Gynecol. (1995) 173(1):254–62. doi: 10.1016/0002-9378(95)90210-4

14. Darmstadt GL, Dinulos JG. Neonatal skin care. Pediatr Clin N Am. (2000) 47(4):757–82. doi: 10.1016/S0031-3955(05)70239-X

15. Hardman MJ, Moore L, Ferguson MW, Byrne C. Barrier formation in the human fetus is patterned. J Invest Dermatol. (1999) 113(6):1106–13. doi: 10.1046/j.1523-1747.1999.00800.x

16. de Souza IMF, Vitral GLN, Caliari MV, Reis ZSN. Association between the chronology of gestation and the morphometrical skin characteristics at childbirth: a development of predictive model. BMJ Health Care Inform. (2021) 28(1):e100476. doi: 10.1136/bmjhci-2021-100476

17. Menon GK, Cleary GW, Lane ME. The structure and function of the stratum corneum. Int J Pharm. (2012) 435(1):3–9. doi: 10.1016/j.ijpharm.2012.06.005

18. Ballard J, Khoury J, Wedig K, Wang L, Eilers-Walsman B, Lipp R. New ballard score, expanded to include extremely premature infants. J Pediatr. (1991) 119(3):417–23. doi: 10.1016/S0022-3476(05)82056-6

19. Reis ZSN, Vitral GLN, de Souza IMF, Rego MAS, Guimaraes RN. Newborn skin reflection: proof of concept for a new approach for predicting gestational age at birth. A cross-sectional study. PLoS One. (2017) 12(9):e0184734. doi: 10.1371/journal.pone.0184734

20. Reis ZSN, Romanelli RMC, Guimarães RN, Gaspar JS, Neves GS, do Vale MS, et al. Newborn skin maturity medical device validation for gestational age prediction: clinical trial. J Med Internet Res. (2022) 24(9):e38727. doi: 10.2196/38727

21. Committee on Obstetric Practice, the American Institute of Ultrasound in Medicine, and the Society for Maternal-Fetal Medicine. Committee opinion No 700: methods for estimating the due date. Obstet Gynecol. (2017) 129(5):e150–e4. doi: 10.1097/AOG.0000000000002046

22. Reis Z, Vitral GLN, Guimarães RN, Aguiar RAPLD, Romanelli RMC. The preemie-test for the assessment of the newborn skin maturity. Protocols.io. (2019). doi: 10.17504/protocols.io.7ynhpve

23. Reis ZSN, Guimarães RN, Rego MAS, Maia de Castro Romanelli R, Gaspar JDS, Vitral GLN, et al. Prematurity detection evaluating interaction between the skin of the newborn and light: protocol for the preemie-test multicentre clinical trial in Brazilian hospitals to validate a new medical device. BMJ Open. (2019) 9(3):e027442. doi: 10.1136/bmjopen-2018-027442

24. Reis Z, Vitral G, Guimarães R, Gaspar J, Colosimo E, Taunde S, et al. Premature or small for gestational age discrimination: international multicenter trial protocol for classification of the low-birth-weight newborn through the optical properties of the skin. JMIR Res Protoc. (2020) 9(7):e16477. doi: 10.2196/16477

25. Silva PC, Guimarães RN, Souza RG, Reis ZSN. A quantitative cross-sectional analysis of the melanin index in the skin of preterm newborns and its association with gestational age at birth. Skin Res Technol. (2020) 26(3):356–61. doi: 10.1111/srt.12810

26. Chen T, Guestrin C. XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. (2016).

27. Liu L, Oza S, Hogan D, Chu Y, Perin J, Zhu J, et al. Global, regional, and national causes of under-5 mortality in 2000–15: an updated systematic analysis with implications for the sustainable development goals. Lancet. (2016) 388(10063):3027–35. doi: 10.1016/S0140-6736(16)31593-8

28. Mangold C, Zoretic S, Thallapureddy K, Moreira A, Chorath K, Moreira A. Machine learning models for predicting neonatal mortality: a systematic review. Neonatology. (2021) 118(4):394–405. doi: 10.1159/000516891

29. World Health Organization. WHO compendium of innovative health technologies for low-resource settings, 2016, 2017. Geneva: World Health Organization (2018).

30. Ahmed W, Veluthandath AV, Rowe DJ, Madsen J, Clark HW, Postle AD, et al. Prediction of neonatal respiratory distress biomarker concentration by application of machine learning to mid-infrared spectra. Sensors. (2022) 22(5):1744. doi: 10.3390/s22051744

31. Khalilzad Z, Hasasneh A, Tadj C. Newborn cry-based diagnostic system to distinguish between sepsis and respiratory distress syndrome using combined acoustic features. Diagnostics. (2022) 12(11):2802. doi: 10.3390/diagnostics12112802

32. Jaskari J, Myllärinen J, Leskinen M, Rad AB, Hollmén J, Andersson S, et al. Machine learning methods for neonatal mortality and morbidity classification. IEEE Access. (2020) 8:123347–58. doi: 10.1109/ACCESS.2020.3006710

33. Betts KS, Kisely S, Alati R. Predicting neonatal respiratory distress syndrome and hypoglycaemia prior to discharge: leveraging health administrative data and machine learning. J Biomed Inform. (2021) 114:103651. doi: 10.1016/j.jbi.2020.103651

34. Miller L, Wanduru P, Santos N, Butrick E, Waiswa P, Otieno P, et al. Working with what you have: how the East Africa preterm birth initiative used gestational age data from facility maternity registers. PLoS One. (2020) 15(8):e0237656. doi: 10.1371/journal.pone.0237656

35. Wilcox AJ. On the importance—and the unimportance—of birthweight. Int J Epidemiol. (2001) 30(6):1233–41. doi: 10.1093/ije/30.6.1233

36. Moety G, Gaafar H, El Rifai N. Can fetal pulmonary artery Doppler indices predict neonatal respiratory distress syndrome? J Perinatol. (2015) 35(12):1015–9. doi: 10.1038/jp.2015.128

37. Kalia YN, Nonato LB, Lund CH, Guy RH. Development of skin barrier function in premature infants. J Invest Dermatol. (1998) 111(2):320–6. doi: 10.1046/j.1523-1747.1998.00289.x

38. Vitral GLN, de Castro Romanelli RM, Reis ZSN, Guimarães RN, Dias I, Mussagy N, et al. Gestational age assessed by optical skin reflection in low-birth-weight newborns: applications in classification at birth. Front Pediatr. (2023) 11:1141894. PMID: 37056944

Keywords: respiratory distress syndrome, newborn, prematurity, childbirth, skin physiological phenomena, machine learning, equipment and supplies, medical device

Citation: Reis ZSN, Pappa GL, Nader PdJH, do Vale MS, Silveira Neves G, Vitral GLN, Mussagy N, Norberto Dias IM and Romanelli RMdC (2023) Respiratory distress syndrome prediction at birth by optical skin maturity assessment and machine learning models for limited-resource settings: a development and validation study. Front. Pediatr. 11:1264527. doi: 10.3389/fped.2023.1264527

Received: 20 July 2023; Accepted: 23 October 2023;
Published: 15 November 2023.

Edited by:

Tina Marye Slusher, University of Minnesota Twin Cities, United States

Reviewed by:

Gabriela Corina Zaharie, University of Medicine and Pharmacy Iuliu Hatieganu, Romania
Matthew Nudelman, Santa Clara Valley Medical Center, United States

© 2023 Reis, Pappa, Nader, do Vale, Silveira Neves, Vitral, Mussagy, Norberto Dias and Romanelli. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zilma Silveira Nogueira Reis

Abbreviations: LMICs, low- and middle-income countries; ACU, accuracy; ACTFM, antenatal corticosteroid therapy for fetal maturation; CI, confidence interval; CPAP, continuous positive airway pressure; DB, diabetes; HD, hypertensive disease; IQR, interquartile range; LBW, low birth weight; LR+, likelihood ratio positive; LR−, likelihood ratio negative; NICU, neonatal intensive care unit; NPV, negative predictive value; RDS, respiratory distress syndrome; SEN, sensitivity; SPE, specificity; TTN, transient tachypnea of the newborn; PPV, positive-pressure ventilation; PPV, positive predictive value.