Skip to main content

ORIGINAL RESEARCH article

Front. Pediatr., 22 May 2023
Sec. Pediatric Immunology

Machine learning clinical decision support systems for surveillance: a case study on pertussis and RSV in children

\r\nKimberly A. Mc Cord&#x;De IacoKimberly A. Mc Cord—De IacoFrancesco Gesualdo
Francesco Gesualdo*Elisabetta PandolfiElisabetta PandolfiIleana CrociIleana CrociAlberto Eugenio Tozzi\r\nAlberto Eugenio Tozzi
  • Predictive and Preventive Medicine Research Unit, Bambino Gesù Children's Hospital IRCCS, Rome, Italy

We tested the performance of a machine learning (ML) algorithm based on signs and symptoms for the diagnosis of RSV infection or pertussis in the first year of age to support clinical decisions and provide timely data for public health surveillance. We used data from a retrospective case series of children in the first year of life investigated for acute respiratory infections in the emergency room from 2015 to 2020. We collected data from PCR laboratory tests for confirming pertussis or RSV infection, clinical symptoms, and routine blood testing results, which were used for the algorithm development. We used a LightGBM model to develop 2 sets of models for predicting pertussis and RSV infection: for each type of infection, we developed one model trained with the combination of clinical symptoms and results from routine blood test (white blood cell count, lymphocyte fraction and C-reactive protein), and one with symptoms only. All analyses were performed using Python 3.7.4 with Shapley values (Shap values) visualization package for predictor visualization. The performance of the models was assessed through confusion matrices. The models were developed on a dataset of 599 children. The recall for the pertussis model combining symptoms and routine laboratory tests was 0.72, and 0.74 with clinical symptoms only. For RSV infection, recall was 0.68 with clinical symptoms and laboratory tests and 0.71 with clinical symptoms only. The F1 score for the pertussis model was 0.72 in both models, and, for RSV infection, it was 0.69 and 0.75. ML models can support the diagnosis and surveillance of infectious diseases such as pertussis or RSV infection in children based on common symptoms and laboratory tests. ML-based clinical decision support systems may be developed in the future in large networks to create accurate tools for clinical support and public health surveillance.

Introduction

Public health strategies for controlling and preventing infectious diseases are based on surveillance to continuously collect, analyze, and interpret data from the healthcare system (1). As surveillance systems should ideally collect data from different settings, case definitions are used to standardize the detection and monitoring of events to allow for trend interpretation (2). Specificity of case definitions depends on the combination of clinical symptoms and on the use of laboratory tests for confirmation (3). Clinical case definitions are valuable at the point of hospital care for rapid triage, for determining the need of laboratory tests, and for deciding the most appropriate management and treatment (2). Syndromic surveillance has been developed in response to emerging threats to timely recognize unexpected patterns of symptoms (4), and has been extensively tested for influenza (5). However, there is much less effort on extending syndromic surveillance to other diseases and to settings where a timely management of cases is needed and where laboratory facilities are not immediately available.

Pertussis is an endemic and highly transmissible disease occurring worldwide (6). Although efficacious vaccines are routinely used for controlling the disease and high immunization coverage rates have been achieved over time, B. pertussis has continued to circulate, possibly due to waning immunity after immunization and natural infection (6). Respiratory Syncytial Virus (RSV) is extremely common: it is estimated that almost every child experiences an infection by RSV by the age of 2 (7, 8). The epidemiology of pertussis and RSV infection is traditionally seasonal, with peaks of pertussis in spring-summer and peaks of RSV infections during autumn-winter (9, 10). Despite their distinct etiology, pertussis and RSV infections in infants may be a diagnostic challenge for clinicians due to similar clinical features (11, 12). The increased availability of laboratory tests for confirming the etiology of these infections has been helpful to counteract the low specificity of respiratory symptoms.

The epidemiology of acute respiratory infections, including pertussis and RSV infection (12, 13), has been disrupted by the SARS-CoV-2 pandemic. Social distancing policies, face mask wearing and possibly some ecological pressure exerted by SARS-CoV2 have contributed to a decrease in the circulation of other respiratory pathogens (14). After the easing of containment policies, respiratory infections other than COVID-19 were expected to re-emerge (12).

Artificial intelligence (AI) has been indicated as a powerful tool for improving the timeliness and accuracy in surveillance systems gathering information from heterogeneous sources (15). There is also the need to move towards pre-syndromic surveillance, to identify space-time clusters of novel syndromic entities not yet recognised by traditional systems, which may be detected with the support of AI-based techniques (16, 17). In essence, AI algorithms developed for syndromic surveillance on large datasets may be more accurate and flexible than traditional case definitions.

Clinical decision support systems (CDSS) based on AI models provide physicians with patient-specific assessments or recommendations towards a decision tailored on the specific characteristics of each patient and based on current evidence (18). In a post-COVID scenario, developing such systems to provide diagnosis support for acute respiratory diseases before laboratory confirmation may have important public health and clinical implications.

As clinical case definitions for some respiratory infections currently have a low specificity, we focused on the hypothesis that AI algorithms may be trained to recognize specific infections to support clinical decisions and public health surveillance. For these reasons, we tested the performance of a ML algorithm for the diagnosis of RSV infection or pertussis in the first year of age at the emergency department, and developed models suitable for different settings.

Methods

Population and settings

Italy has a universal, free of charge healthcare system, which includes primary pediatric care for all children. Family pediatricians are available for pediatric visits during office hours. Clinical requests outside office hours or holidays and urgent clinical problems are usually referred to Emergency Rooms, where services are offered continuously (19).

The population included in this study consisted of patients enrolled in previous surveillance studies coordinated by the Bambino Gesù Children's Hospital, a pediatric research hospital located in Rome, Italy (20). In previous surveillance activities, we enrolled children less than 1 year of age presenting to the emergency department with respiratory symptoms between August 2015 and June 2020 and, after obtaining a signed informed consent, a research nurse collected, through a standardized questionnaire, demographic information and respiratory signs and symptoms. For each patient, we also recorded, through patients' hospital records, full white blood cell count, lymphocyte fraction and the result of a multiplex PCR test on nasopharyngeal aspirates for respiratory pathogens, including B. pertussis and RSV. We did not include asymptomatic patients or those with other symptoms. Fever was defined as follows: single > 37.8°C oral/tympanic membrane (21). To evaluate hypoxia, we considered the arterial oxygen saturation (SaO2) [which refers to the amount of oxygen bound to hemoglobin in arterial blood]. The measurement is given as a percentage. Resting SaO2 less than or equal to 95% was considered abnormal. “Cough” and “paroxysmal cough” were treated as two independent variables. This is consistent with the clinical observations in the clinic, where these two symptoms may be observed independently. Only one child had a paroxysmal cough but no “normal” cough (i.e., most patients experiencing paroxysmal coughs also report “normal” cough).

Laboratory confirmation

Nasopharyngeal aspirates were collected and processed using a standardized protocol (22).

The presence of B. pertussis was investigated using a Bordetella Real Time PCR kit targeting IS481 (Bordetella R-gene assay Argene, Biomerieux, Marcy l'Etoile, France). For RSV identification, we used a commercial multiplex RT-PCR kit (AllplexTM Respiratory Full Panel Assay) for 16 respiratory viruses including RSV A and B, influenza virus A and B, human coronavirus OC43, 229E, NL-63 and HUK1, adenovirus, human rhinovirus, parainfluenza virus 1–2–3–4, human metapneumovirus and human bocavirus.

Feature selection

The diagnostic outcome for pertussis and RSV infection data was already labeled, and there was no missing data for this variable. We defined a binary outcome (normal or high) for three laboratory values based on standard pediatric cutoffs, as follows. For C-reactive protein (CRP), any value greater than 1.0 ml/dl was considered high. White blood counts were considered high if they were greater than 34.0 × 103/μl at 30 days of age and 19.5 × 103/μl over 30 days of age (23). The lymphocyte fraction was considered high if greater than 50% at any age. Symptoms recorded for this study included Cough, Cyanosis, Emesis, Petechiae, Paroxysmal cough, Fever, Conjunctival petechiae, Stridor, Hypoxia, Dehydration, Convulsions, Difficulty feeding and Pre-existing conditions, which were also classified as binary variables.

Machine learning approach

We performed a retrospective binary classification to identify patients diagnosed with pertussis (model 1) or RSV infection (model 2) with a set of symptoms, either with (model 1–2 a) or without (model 1–2 b) routine blood testing results (CRP, white blood cell count, lymphocyte fraction). Rather than developing a complex model to identify the possible respiratory pathogen, we focused on developing simple models to discern between the 2 pathogens most likely to require additional interventions or early treatment in clinical care. For pertussis, the aim was to identify patients early in order to initiate antibiotic treatment as soon as possible; whereas for RSV infection the aim was to identify patients with this viral illness in order to exclude other respiratory pathogens and orient proper management.

Since different clinical sites may have different needs and resources, we developed 4 models so that we could 1) identify patients presenting with pertussis a) using clinical symptoms and routine blood testing, or b) simply using clinical symptoms; or 2) to identify RSV infection a) using clinical symptoms and routine blood testing, or b) simply using clinical symptoms.

To enable estimation of pertussis and RSV infection case predictions with minimal clinical data, we conducted a retrospective diagnostic classification analysis by training gradient-boosting decision tree binary classifiers. We compared several ML models (Logistic Regression, Gaussian Naive Bayes, Random Forest and Light Gradient Boosting Machine—LightGBM) before choosing LightGBM as the highest performing at a moderate time cost as well as being able to ingest missing values (which may be useful for point-of-care CDSS implementation). All analyses were performed using Python 3.7.4 with Shapley values (Shap values) visualization package for predictor visualization.

In order to reduce the complexity of the models as much as possible, we included 14 binary clinical symptoms (Cough, Cyanosis, Emesis, Petechiae, Paroxysmal cough, Fever, Conjunctival petechiae, Stridor, Hypoxia, Dehydration, Convulsions, Difficulty feeding and Pre-existing conditions) and four demographic binary features (gender male/female, age less than/greater than 30 days, prematurity yes/no, duration of illness greater than/less than 3 days) in the basic models (models b) and included the most commonly performed testing for emergency departments or clinics in the primary model (lymphocyte count high/normal, white blood cell count high/normal, CRP high/normal—models a).

While there was a significant class imbalance, we did not wish to introduce bias and thus avoided resampling the data. Due to the class imbalance and the nature of the application of the model, we focused on the recall metric for model selection and hyperparameter tuning (as to minimize false negatives).

Hyperparameter tuning of each LightGBM model was performed using the verstack package (Optuna) and feature importance graphs were obtained with the Shap package.

Results

Data and patient characteristics

The sample size was 599 children, but 14 observations were dropped since children were positive to both B. pertussis and RSV. Out of 585 final observations, 154 (26%) patients had a positive PCR test for B. pertussis, 97 (17%) had a positive PCR test for RSV, in 123 patients (21.0%) the RT-PCR was not able to identify a pathogen among the 16 included in the diagnostic panel, while the remaining patients with respiratory symptoms had evidence of other viral infections. 313 (54%) patients were males and 272 (46%) females, 514 (88%) were over a month of age and 71 (12%) were less than 30 days old at diagnosis (median age in days 63, IQR 42–108 days). The symptom duration was less than or equal to 3 days in 272 children (46%) and greater than 3 days in 313 (54%) children (median duration in days 9, IQR 2–11). A total of 179 children (14%) were born preterm (less than 37 weeks of gestation). White blood cell count was normal in 508 (87%) children, whereas lymphocyte fraction was elevated in 318 (54%). The frequency of symptoms was the following: cough 85%, paroxysmal cough 56%, emesis 39%, fever 38%, cyanosis 34%, stridor 25%, hypoxia 11%,difficulty feeding 11%, petechiae 5%, conjunctival petechiae 5%, dehydration 2% and convulsions 1%. Fifty-six children (10%) had pre-existing conditions, mostly chronic diseases.

Model performance

The performance of our four ML models was fair (Table 1). For pertussis (model 1) recall was 0.72 including laboratory testing and 0.74 with clinical symptoms only. For RSV infection (model 2), recall was 0.68 including laboratory testing and 0.71 with clinical symptoms only. The F1 score for the pertussis model was 0.72 in both models a and b, and, for RSV infection, it was 0.69 and 0.75 for models a and b, respectively. The area under the receiver operating characteristic curve (AUC) was 0.69 for model 1a, 0.71 for model 1b, 0.59 for model 2a, and 0.62 for model 2b.

TABLE 1
www.frontiersin.org

Table 1. Model performance.

For pertussis model 1a, the principal driving feature was elevated lymphocyte fraction, followed by stridor and cyanosis (Figure 1); whereas for pertussis model 1b, the most important features were stridor, cyanosis and absence of fever (Figure 2). For RSV model 2a the most important features were cough, stridor and hypoxia (Figure 3); and for RSV model 2b they were hypoxia, emesis and stridor (Figure 4). Most of the feature's Shap values contributed fairly equally among the two classes.

FIGURE 1
www.frontiersin.org

Figure 1. Pertussis model A (with laboratory testing).

FIGURE 2
www.frontiersin.org

Figure 2. Pertussis model B (syndromic only).

FIGURE 3
www.frontiersin.org

Figure 3. RSV model A (with laboratory testing).

FIGURE 4
www.frontiersin.org

Figure 4. RSV model B (syndromic only).

Overall the model which performed the best was the pertussis prediction without the inclusion of laboratory testing, and for both illnesses the addition of these three features did not improve the prediction in a substantial way. Generally, negative patients to both infections had better prediction scores (hence it was easier for the model to predict children who did not have pertussis or RSV infection than those that were sick), likely due to the class imbalance. We only had 4.5% missing data. Due to the nature of the data collection (prospective, active, by research nurses) we didn't feel comfortable assuming that missing data would equate to a “0” (symptom not present, which would be a safe assumption in a clinical note). Hence, these rows were dropped to avoid imputation and potential addition of bias to the analysis.

Confusion matrices

The confusion matrices are reported in Table 2. Briefly, the sensitivity and specificity were 83% and 61% for pertussis model a and 75% and 72% for model b. For the RSV model, the sensitivity was 69% and the specificity 67% for model a, and 74% and 69% for model b.

TABLE 2
www.frontiersin.org

Table 2. Confusion matrix.

Accordingly, in our holdout set of 117 children, we would have missed 14 cases of pertussis and classified another 14 with an incorrect pertussis diagnosis (model 1a). For RSV infection, we would have missed only six children with the virus but would have incorrectly diagnosed 31 with having the virus when in fact they were negative. Depending on the preference of the hospital or pediatrician's office, the standard threshold of 0.5 (where any predicted probability above would be classified as a 1 and inversely as a 0) could be adjusted to further reduce false positives or false negatives.

Discussion

We developed four models for the diagnosis of pertussis and RSV infection that, based on clinical symptoms and routine blood tests, accurately predict the laboratory test confirmation for these pathogens. These models represent a prototype which could be deployed in an application (eg. for smartphones or for electronic health records) that may be integrated in routine clinical care to assist clinicians in the proper management of these diseases and to provide information that may be immediately available for public surveillance. To our knowledge, no Machine Learning-Clinical Decision Support Systems (ML-CDSS) exists yet to support the diagnosis of pertussis or RSV infection in children.

Our work suggests that AI-based decision support systems may be developed with little clinical data to identify pertussis and RSV infection in children < 1 year of age. The same approach may be easily applied to other infectious diseases that would benefit from surveillance.

Interestingly, the performance of the two models was not improved by the addition of routine blood tests. Regarding pertussis, there was a slight improvement in the detection of the cases with the inclusion of laboratory testing in terms of recall performance, which is consistent with the clinical practice of considering lymphocytosis for the differentiation of pertussis vs. other respiratory infections. However, the overall model performance was not strong enough to justify its deployment, considering that classifying patients with symptoms features only is less complex and more scalable.

The models for pertussis yielded a higher classification performance than those for RSV infection, however, the sample size was lower for RSV infection and the class imbalance more pronounced, therefore, it is possible that the performance could improve basing the analysis on a larger dataset. While these predictions alone may not replace a laboratory test for confirmation, a CDSS integrating these models at the point of care could help to timely support the clinician's judgment in further testing or initiating early treatment when necessary. The simplicity of a CDSS based only on syndromic data makes it invaluable during periods of changing epidemiology, such as the one we are facing with RSV infections in the aftermath of COVID-19, providing information immediately suitable for public health surveillance.

Indeed, other studies attempted COVID-19 case predictions based on AI algorithms (24), but required additional sources of information, such as data obtained from patients through surveys, possibly leading to delays as well as selection biases, which would be minimized with a system embedded at the clinical point-of-care.

A further development of this system could consist in a model for identification of novel clinical syndromes. This kind of model should be fed with unlabelled data collected prospectively and should be based on unsupervised learning.

In a hypothetical future scenario, the identification of individual cases of infectious diseases within the community in a rapid timeframe, as well as the ability to flag novel syndromes or pathogens that pose a threat to public health systems, will rely on fully automated and integrated systems across hospitals and primary care providers, similar to the one conceptualized in this study.

We chose two case studies that have relevance both from the point of view of the clinician and for public health. A pertussis clinical case definition for surveillance is available, although it is not very helpful at the point of care, as it includes a long duration of cough that may only be assessed after early medical encounters (19). We already demonstrated that a decision tree algorithm outperforms the accuracy of a clinical case definition for pertussis (19).

The interest for RSV surveillance has recently increased, as new candidates of RSV vaccines and monoclonal antibodies are in late-stage clinical trials (2527). However, a universal surveillance system for RSV does not exist yet. A standard algorithmic approach may be scaled up to multiple institutions and support timely surveillance systems.

Indeed, syndromic surveillance at the emergency department has been frequently used to complement existing surveillance systems (5) and may remain in place for a long time (28, 29). These systems have been classically developed for monitoring emerging threats from bioterrorism (30).

ML-CDSS for infectious diseases are drawing increased interest, but a minority of them have been developed in emergency care or primary care settings (31). A recent review showed that ML-CDSS for diagnosis have been developed for tuberculosis, healthcare associated infections, surgical site infections, infections at admission into hospital, bacterial or viral meningitis (31).

The next steps for this pilot would be the deployment of our algorithms in an EHR system, making predictions but not displaying them to the physicians (shadow mode). This way, we could manually validate a subsample of these predictions and compare them to a gold standard (such as a PCR result). Only then would we release the AI algorithm monitoring changes in clinical performance (quality control study).

The importance of our work is in representing a proof of concept of ML-CDSS for diagnosis of infectious diseases that can be collectively developed and trained by multiple sites increasing performance and generalizability. These systems may be deployed either into electronic health records and in simple applications for smartphones to enhance or create sustainable and timely surveillance systems for events of interest in hospital and primary care settings.

Limitations

One of the main limitations of our study is its retrospective nature. Our model is based on data collected for two previous surveillance studies on respiratory infections. Therefore, although the quality of our data is high, the data collection had not been specifically designed with the purpose of developing a ML-CDSS. An improvement of the model could be achieved through a prospective study design, which would enable us to collect additional data to improve the diagnostic evaluation and increase the accuracy of the model. Additional clinical information that might be considered for a future model are respiratory rate, chest retractions and breath sounds (rales, wheezing), which are typically found in children with RSV infections. An improved model to be applied in high resource settings might also include audio tracks of respiratory auscultation.

Another limitation concerns the fact that we transformed continuous variables (such as oxygen saturation, fever, leukocyte/lymphocyte counts, CRP) into binary variables using cut-off values based on the published literature. A possible improvement of the model should include an assessment of the variability in the accuracy based on different thresholds of the continuous variables.

Another limitation of our analysis is that it was difficult to determine if the predictive abilities of our models were magnified or minimized by our data. On the one hand, since our data was collected in the emergency room, there likely was a selection of more severe (and possibly easier to identify) patients. On the other hand, the relatively small sample size, particularly with class imbalance, may have underestimated the real signal. However, our example can be improved with additional data from multiple sites and serves the purpose of describing the potential of a CDSS in the Italian healthcare system.

Another limitation regards the elimination of missing data. We decided to adopt a conservative approach, and to drop the records that included missing data. Although this may have biased the result, the proportion of missing data was very small (4.5%), therefore we do not expect it to have had a major impact on our results.

Moreover, we decided not to include observations with coinfections in our model, in order to minimize the influence of coinfections on our findings. The number of coinfections in our dataset was actually small (2.3%), but the fact that they were not included in the model might limit its applicability. A potential amelioration of our model should better take into account the possibility of having patients with coinfections.

The high rate of false positive for RSV yielded by our model may seem worrying. However, this may suggest that such tools may be more helpful in epidemic settings, when the prevalence of the condition is high. Again, a better performance of the model may be achieved including a larger set of parameters in the model, as previously discussed.

Finally, generalizability could be also limited by the high quality of our data, due to the research setting in which it was gathered. Validation of the model in a wider, real world scenario would better elucidate its potentials.

Conclusions

ML-CDSS may represent valuable tools for surveillance of infectious diseases. We present here a proof-of-concept which could serve as a basis for the future development of AI-based surveillance systems. The model we tested in this feasibility study included a limited set of high-quality clinical and laboratory parameters, obtained in a specific research setting, which might limit its performance and generalizability. To improve performance and generalizability, future research steps should aim at a collective effort to: enlarge the dataset used for training and validation, including data from different settings; increase number of parameters used to train and validate the model; testing the model in a real life scenario, e.g., shadow testing on EHRs. These perspectives deserve to be further explored to extend the application of artificial intelligence to public health surveillance.

Data availability statement

The datasets presented in this article are not readily available due to institutional constraints. Requests to access the datasets should be directed to Alberto Eugenio Tozzi, albertoeugenio.tozzi@opbg.net.

Ethics statement

The studies involving human participants were reviewed and approved by Bambino Gesù Children's Hospital IRCCS Ethical Committee. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin.

Author contributions

KAMC-DL conceptualized and wrote the manuscript and carried out the data analysis; FG collected and curated the data, interpreted the results from a critical point of view, and drafted the manuscript; EP collected and curated the data, interpreted the results from a clinical point of view (as above), and edited the manuscript; IC performed the data management, the preliminary analysis of the results and edited the manuscript; AET coordinated the work, interpreted the results, wrote and edited the manuscript. All authors contributed to the article and approved the submitted version.

Conflict of interest

AET has been working as an expert consultant on vaccine strategies for Pfizer and Sanofi. FG received support for participating in MSD and Moderna meetings.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Morgan OW, Pinner RW. Surveillance of infectious diseases. Encyclopedia of Microbiol. (2009):759–74. doi: 10.1016/B978-012373944-5.00209-1

CrossRef Full Text | Google Scholar

2. Baruch J, Rojek A, Kartsonaki C, Vijayaraghavan BKT, Gonçalves BP, Pritchard MG, et al. Symptom-based case definitions for COVID-19: time and geographical variations for detection at hospital admission among 260,000 patients. Influenza Other Respir Viruses. (2022) 16:1040–50. doi: 10.1111/irv.13039

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Ghanaie RM, Karimi A, Sadeghi H, Esteghamti A, Falah F, Armin S, et al. Sensitivity and specificity of the world health organization pertussis clinical case definition. Int J Infect Dis. (2010) 14:e1072–5. doi: 10.1016/j.ijid.2010.07.005

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Henning KJ. What is syndromic surveillance? MMWR Suppl. (2004) 53:5–11.15714620

PubMed Abstract | Google Scholar

5. Hughes HE, Edeghere O, O’Brien SJ, Vivancos R, Elliot AJ. Emergency department syndromic surveillance systems: a systematic review. BMC Public Health. (2020) 20:1891. doi: 10.1186/s12889-020-09949-y

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Saadatian-Elahi M, Plotkin S, Mills KHG, Halperin SA, McIntyre PB, Picot V, et al. Pertussis: biology, epidemiology and prevention. Vaccine. (2016) 34:5819–26. doi: 10.1016/j.vaccine.2016.10.029

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Griffiths C, Drews SJ, Marchant DJ. Respiratory syncytial virus: infection, detection, and new options for prevention and treatment. Clin Microbiol Rev. (2017) 30:277–319. doi: 10.1128/CMR.00010-16

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Barr R, Green CA, Sande CJ, Drysdale SB. Respiratory syncytial virus: diagnosis, prevention and management. Ther Adv Infect Dis. (2019) 6:2049936119865798. doi: 10.1177/2049936119865798

PubMed Abstract | CrossRef Full Text | Google Scholar

9. De Greeff SC, Dekkers ALM, Teunis P, Rahamat-Langendoen JC, Mooi FR, De Melker HE. Seasonal patterns in time series of pertussis. Epidemiol Infect. (2009) 137:1388–95. doi: 10.1017/S0950268809002489

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Wildenbeest JG, Billard M-N, Zuurbier RP, Korsten K, Langedijk AC, van de Ven PM, et al. The burden of respiratory syncytial virus in healthy term-born infants in Europe: a prospective birth cohort study. Lancet Respir Med. (2022) S2213-2600(22):00414–3. doi: 10.1016/S2213-2600(22)00414-3

CrossRef Full Text | Google Scholar

11. Walsh PF, Kimmel L, Feola M, Tran T, Lim C, De Salvia L, et al. Prevalence of Bordetella pertussis and Bordetella parapertussis in infants presenting to the emergency department with bronchiolitis. J Emerg Med. (2011) 40:256–61. doi: 10.1016/j.jemermed.2008.04.048

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Zheng Z, Pitzer VE, Shapiro ED, Bont LJ, Weinberger DM. Estimation of the timing and intensity of reemergence of respiratory syncytial virus following the COVID-19 pandemic in the US. JAMA Network Open. (2021) 4:e2141779. doi: 10.1001/jamanetworkopen.2021.41779

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Falkenstein-Hagander K, Appelqvist E, Cavefors A-SF, Källberg H, Nilsson LJ, Silfverdal S-A, et al. Waning infant pertussis during COVID-19 pandemic. Arch Dis Child. (2022) 107:e19. doi: 10.1136/archdischild-2021-323055

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Li H, Ling F, Zhang S, Liu Y, Wang C, Lin H, et al. Comparison of 19 major infectious diseases during COVID-19 epidemic and previous years in zhejiang, implications for prevention measures. BMC Infect Dis. (2022) 22:296. doi: 10.1186/s12879-022-07301-w

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Zeng D, Cao Z, Neill DB. Artificial intelligence–enabled public health surveillance—from local detection to global epidemic monitoring and control. Artif Intell Med. (2021):437–53. doi: 10.1016/B978-0-12-821259-2.00022-3

CrossRef Full Text | Google Scholar

16. Faigen Z, Deyneka L, Ising A, Neill D, Conway M, Fairchild G, et al. Cross-Disciplinary consultancy to bridge public health technical needs and analytic developers: asyndromic surveillance use case. Online J Public Health Inform. (2015) 7:e228. doi: 10.5210/ojphi.v7i3.6354

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Lake IR, Colón-González FJ, Barker GC, Morbey RA, Smith GE, Elliot AJ. Machine learning to refine decision making within a syndromic surveillance service. BMC Public Health. (2019) 19:559. doi: 10.1186/s12889-019-6916-9

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Sim I, Gorman P, Greenes RA, Haynes RB, Kaplan B, Lehmann H, et al. Clinical decision support systems for the practice of evidence-based medicine. J Am Med Inform Assoc. (2001) 8:527–34. doi: 10.1136/jamia.2001.0080527

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Tozzi AE, Gesualdo F, Rizzo C, Carloni E, Russo L, Campagna I, et al. A data driven clinical algorithm for differential diagnosis of pertussis and other respiratory infections in infants. PLoS One. (2020) 15:e0236041. doi: 10.1371/journal.pone.0236041

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Pandolfi E, Gesualdo F, Rizzo C, Russo L, Campagna I, Carloni E, et al. The impact of pertussis in infants: insights from a hospital-based enhanced surveillance system, lazio region, Italy, 2016 to 2019. Eurosurveillance. (2021) 26:2000562. doi: 10.2807/1560-7917.ES.2021.26.24.2000562

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Healthcare-associated infections and antimicrobial use in European long-term care facilities (HALT-3) CASE DEFINITIONS OF INFECTIONS. https://www.ecdc.europa.eu/sites/default/files/media/en/publications/Report%20Assets/hai-net/HALT3ValidationProtocol-Annex4-CaseDefinitions.pdf (Accessed February 13, 2023)

22. Merdrignac L, Aït El Belghiti F, Pandolfi E, Jané M, Murphy J, Fabiánová K, et al. Incidence and severity of pertussis hospitalisations in infants aged less than 1 year in 37 hospitals of six EU/EEA countries, results of PERTINENT sentinel pilot surveillance system, December 2015 to December 2018. Euro Surveill. (2021) 26:1900762. doi: 10.2807/1560-7917.ES.2021.26.4.1900762

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Andropoulos DB. “Appendix B: pediatric normal laboratory values.,” In: Gregory GA, Andropoulos DB, editors. Gregory's pediatric anesthesia. San Francisco, CA: Wiley (2012). p. 1300–14 doi: 10.1002/9781444345186.app2

24. Astley CM, Tuli G, Mc Cord KA, Cohn EL, Rader B, Varrelman TJ, et al. Global monitoring of the impact of the COVID-19 pandemic through online surveys sampled from the Facebook user base. Proc Natl Acad Sci USA. (2021) 118:e2111455118. doi: 10.1073/pnas.2111455118

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Roberts JN, Graham BS, Karron RA, Munoz FM, Falsey AR, Anderson LJ, et al. Challenges and opportunities in RSV vaccine development: meeting report from FDA/NIH workshop. Vaccine. (2016) 34:4843–9. doi: 10.1016/j.vaccine.2016.07.057

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Higgins D, Trujillo C, Keech C. Advances in RSV vaccine research and development—a global agenda. Vaccine. (2016) 34:2870–5. doi: 10.1016/j.vaccine.2016.03.109

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Griffin MP, Yuan Y, Takas T, Domachowske JB, Madhi SA, Manzoni P, et al. Single-Dose nirsevimab for prevention of RSV in preterm infants. N Engl J Med. (2020) 383:415–25. doi: 10.1056/NEJMoa1913556

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Heffernan R, Mostashari F, Das D, Karpati A, Kulldorff M, Weiss D. Syndromic surveillance in public health practice, New York city. Emerg Infect Dis. (2004) 10:858–64. doi: 10.3201/eid1005.030646

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Fouillet A, Bousquet V, Pontais I, Gallay A, Schönemann CC. The French emergency department OSCOUR network: evaluation after a 10-year existence. Online J Public Health Inform. (2015) 7:e74. doi: 10.5210/ojphi.v7i1.5740

CrossRef Full Text | Google Scholar

30. Gould DW, Walker D, Yoon PW. The evolution of BioSense: lessons learned and future directions. Public Health Rep. (2017) 132:7S–11S. doi: 10.1177/0033354917706954

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Peiffer-Smadja N, Rawson TM, Ahmad R, Buchard A, Georgiou P, Lescure F-X, et al. Machine learning for clinical decision support in infectious diseases: a narrative review of current applications. Clin Microbiol Infect. (2020) 26:584–95. doi: 10.1016/j.cmi.2019.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: machine learning, artificial intelligence, surveillance, clinical decision support systems, pertussis, respiratory syncytial virus, infectious diseases

Citation: Mc Cord—De Iaco KA, Gesualdo F, Pandolfi E, Croci I and Tozzi AE (2023) Machine learning clinical decision support systems for surveillance: a case study on pertussis and RSV in children. Front. Pediatr. 11:1112074. doi: 10.3389/fped.2023.1112074

Received: 7 December 2022; Accepted: 19 April 2023;
Published: 22 May 2023.

Edited by:

Luis Ignacio Gonzalez-Granado, University Hospital October 12, Spain

Reviewed by:

Padmanabhan Ramnarayan, Imperial College London, United Kingdom
Francesca Musto, Sapienza University of Rome, Italy

© 2023 Mc Cord—De Iaco, Gesualdo, Pandolfi, Croci and Tozzi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Francesco Gesualdo ZnJhbmNlc2NvLmdlc3VhbGRvQG9wYmcubmV0

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.