AUTHOR=Guo Jing-yan TITLE=Machine learning for screening and predicting the availability of medications for children: a cross-sectional survey study JOURNAL=Frontiers in Pediatrics VOLUME=12 YEAR=2024 URL=https://www.frontiersin.org/journals/pediatrics/articles/10.3389/fped.2024.1341199 DOI=10.3389/fped.2024.1341199 ISSN=2296-2360 ABSTRACT=Objective

The aim of the study was to explore the factors influencing the availability of medications for children, and establish a machine learning model to provide an empirical basis for the subsequent formulation and improvement of relevant policies.

Methods

Design: Cross-sectional survey. Setting: 12 provinces, China. Medical doctors from 25 public hospitals were enrolled. All data were randomly divided into a training set and a validation set at a ratio of 7:3. Three prediction models, namely random forest (RF), logistic regression (LR), and extreme gradient boosting (XGBoost), were developed and compared. The receiver operating characteristic curve (ROC) and the associated area under the curve (AUC) were used to evaluate the three models. A nomogram and clinical impact curve (CIC) for availability of medication were developed.

Results

Fifteen of 29 factors in the database that were most likely to be selected were considered to establish the prediction model. The XGBoost model (AUC = 0.915) demonstrated better performance than the RF model (AUC = 0.902) and the LR model (AUC = 0.890). According to the Shapley additive explanation values, the five factors that most significantly affected the availability of medications for children in the XGboost model were as follows: the relatively small number of specialized dosage forms for children; unaffordable medications for children; public education on the accessibility and safety of medication for children; uneven distribution of medical resources, leading to insufficient access to medication for children; and years of service as a doctor. The CIC was used to assess the practical applicability of the factor prediction nomogram.

Conclusions

The XGBoost model can be used to establish a prediction model to screen the factors associated with the availability of medications for children. The most important contributing factors to the models were the following: the relatively small number of specialized dosage forms for children; unaffordable medications for children; public education on the accessibility and safety of medication for children; uneven distribution of medical resources, leading to insufficient access to medication for children; and years of service as a doctor.