Skip to main content

ORIGINAL RESEARCH article

Front. Dent. Med, 12 April 2024
Sec. Pediatric Dentistry
This article is part of the Research Topic Improving Children's Oral Health View all 4 articles

Identifying early permanent teeth caries factors in children using random forest algorithm

\r\nFatemeh MasaebiFatemeh Masaebi1Zahra GhorbaniZahra Ghorbani2Mehdi Azizmohammad LoohaMehdi Azizmohammad Looha3Marzie DeghatipourMarzie Deghatipour4Morteza MohammadzadehMorteza Mohammadzadeh5Mitra Ghazizadeh AhsaieMitra Ghazizadeh Ahsaie6Fariba AsadiFariba Asadi1Farid Zayeri
\r\nFarid Zayeri7*
  • 1Department of Biostatistics, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
  • 2Department of Community Oral Health, Dental School, Shahid Beheshti University of Medical Sciences, Tehran, Iran
  • 3Basic and Molecular Epidemiology of Gastrointestinal Disorders Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
  • 4Department of Community Oral Health, Dental School, Zahedan University of Medical Sciences, Zahedan, Iran
  • 5Department of Biostatistics, School of Public Health, Iran University of Medical Sciences, Tehran, Iran
  • 6Department of Oral and Maxillofacial Radiology, School of Dentistry, Shahid Beheshti University of Medical Sciences, Tehran, Iran
  • 7Proteomics Research Center and Department of Biostatistics, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran

Introduction: Early permanent dental caries can pose a serious threat to oral health in the coming years. This study aimed to investigate the key factors influencing early dental caries in permanent teeth among first-grade Iranian children.

Methods: A cross-sectional study involving 778 randomly selected first-grade children from public schools in Tehran, Iran, was conducted between November 2017 and January 2018. The oral health of the children, evaluated by two trained dentists, was recorded based on the DMFT index. Information on maternal education, gender, dmft index, brushing frequency, dental visits, flossing, and sweet consumption was also collected. The Random Forest method was employed to identify factors associated with early permanent dental caries, and its performance was compared with logistic regression using the Area Under the Curve (AUC) index.

Results: Logistic regression, represented by odds ratios (OR), revealed a significant association between early permanent dental caries and dmft index [OR = 1.13, 95% CI (1.07, 1.20), p-value <0.001], maternal education [OR = 2.04, 95% CI (1.15, 3.62), p-value <0.05], and sweet consumption [OR = 0.59, 95% CI (0.36, 0.98), p-value <0.05]. Random Forest analysis indicated that male gender, higher maternal education, and lower sweet consumption were associated with increased likelihood of being caries-free. Notably, Random Forest demonstrated superior performance (AUC = 0.81) compared to logistic regression (AUC = 0.72).

Conclusion: Early permanent dental caries can be effectively managed by caring primary teeth and reducing consumption of sweets. Maternal education emerged as a pivotal factor in mitigating the risk of early permanent dental caries. Therefore, prioritizing these factors and preventing permanent teeth caries in childhood can be remarkably influential in reducing future caries. The usage of the Random Forest algorithm is highly recommended for identifying relevant risk factors associated with early permanent teeth.

Introduction

Dental caries represents a significant global oral health concern affecting individuals across diverse age groups. Epidemiological estimates indicate a prevalence of 64.6 million cases in permanent teeth and 62.9 million cases in deciduous teeth worldwide (1). This disease is a multifactorial problem caused by the interaction of specific bacteria in the mouth, a susceptible tooth surface, or a diet high in sugars. Dental caries involves the demineralization and breakdown of the tooth structure, leading to the formation of cavities or holes in the enamel, dentin, and eventually reaching the pulp of the tooth (2). According to the World Health Organization (WHO), dental caries is one of the most prevalent non-communicable diseases among school-aged children, affecting 60%–70% of them in both developing and developed countries (3). The ramifications extend beyond oral health, encompassing pain, infection, impediments to eating and speaking, thereby impacting growth and overall well-being in children. Additionally, untreated caries may disrupt school attendance and performance due to pain and dental appointments.

Dental caries in permanent teeth is commonly evaluated using the DMFT index, which is a valuable tool utilized by dental health professionals to assess oral health status. This index, encompassing decayed, missing, and permanently filled teeth, plays a pivotal role in enhancing children's oral health and alleviating the burden of dental caries in communities (4).

The risk factors associated with dental caries in permanent teeth, as expressed by the DMF index, are multifaceted. Inadequate oral hygiene practices, marked by infrequent brushing and insufficient flossing, contribute to the accumulation of dental plaque and bacteria, fostering tooth decay. High consumption of sugary foods and beverages provides a favorable environment for acid-producing bacteria and leads to an increased risk of dental caries (5). Furthermore, having a higher dmft score in primary teeth can be considered as a risk factor for an increased DMFT score in permanent teeth (6). This may be due to the fact that caries in primary teeth suggests a higher susceptibility to tooth decay, which may persist in permanent teeth. The presence of dental caries in primary teeth can also indicate poor oral hygiene practices, dietary habits, or other factors that may continue to contribute to the development of caries in permanent teeth. Notably, an underexplored risk factor highlighted in the literature is the manifestation of dental caries in permanent dentition during early childhood, specifically at 6–7 years of age. This can be considered as a serious alarm for immediate attention and preventive actions to avoid the development of further caries in the coming years.

In the exploration of factors associated with the DMFT index in children, previous studies have employed diverse statistical methodologies, encompassing traditional univariate approaches and multivariable regression models such as loglinear regression for count responses and logistic regression for binary outcomes when considering the DMFT index. In recent years, advances in statistical methods have led to the utilization of more sophisticated techniques, such as machine learning algorithms, in the field of dental caries research. These methods enable analysts to capture intricate relationships among variables, yielding more precise predictions for diverse health outcomes, including the manifestation of dental caries, and facilitating a more robust identification of associated risk factors. Notably, in the realm of machine learning algorithms, the random forest technique has gained prominence. The random forest approach employs multiple decision trees for prediction, with each tree being trained on a random subset of the dataset. The outcomes from these individual trees are subsequently amalgamated to generate a final prediction. This supervised learning strategy not only enhances predictive accuracy but also provides users with results that are more comprehensible and interpretable (7, 8).

Reviewing the published research in this area demonstrates that there are various studies about the DMFT index and its associated risk factors among school children in different regions of the world. Notably, a study conducted in Korea reported the DMFT index of 2.08 in 12-year-old children. This index was reported as 1.19, 0.70, and 0.70, respectively in the United States, Germany, and Finland. Among 12- and 15-year-old Iranian children, this index was 2.09 and 3.29, respectively (9). In a recent 2021 study involving seventh grade children (age 11–12 years), the researchers found that 48.2% had caries in their permanent teeth, with a mean DMFT of 1.7. This condition was associated with various behavioral and socioeconomic factors, including the screening site, mother's employment, brushing, the status of dental treatment, and visits (10). Furthermore, based on reviewing the statistical methods applied to the published articles, the majority of them have used univariate approaches or simple regression models to discover the relationship between risk factors and dental caries in children. Regarding this, and given the promising attributes of Random Forest, we decided to conduct the present study to assess the performance of random forest for a binary response (DMFT = 0 or DMFT>0) to identify the factors associated with the DMFT index among fist-grade children in Tehran, Iran. Additionally, the accuracy of this advanced method was compared with the binary logistic regression model, which is commonly used in such investigations.

Methods

Ethical approval

The present study has been approved by the Ethics Committee of Shahid Beheshti University of Medical Sciences (Ethics code: IR.SBMU.RETECH.REC.1402.136).

Study design and participants

This cross-sectional study was conducted on 778 first-grade children attending public schools in Pishva and Pakdasht cities, located in the southern regions of Tehran Province, Iran. The study sample was selected using a single-stage cluster sampling technique, wherein first-grade students were randomly chosen from a comprehensive list of both urban and rural students in Pishva and Pakdasht. Subsequently, all first-grade students between the ages of 6 and 7 years were included in the study from November 2017 to January 2018. Inclusion criteria for first-grade children necessitated adherence to specific conditions, including being within the age range of 6 to 7 years, obtaining parental consent, and having no history of congenital or genetic issues. The study consent form provided a detailed explanation of the study's purpose, characteristics, and significance.

Methods of examination

In the present study, the oral health of children was evaluated by two dentists with specialized training in conducting examinations. To reduce interobserver variability, the dental examiners underwent periodic calibration of their procedures. The examination of the children's teeth was performed using essential tools such as a dental mirror, a round-tip periodontal probe, rubber gloves, cloth or paper hand towels, gauze, and halogen lights. Additionally, standardized WHO forms were employed in collecting information about the children's oral health status and medical records.

Main outcome and explanatory variables

The primary objective of the study was to assess the occurrence of caries in permanent teeth, with the dependent variable dichotomized as a binary outcome represented by the DMFT index (DMFT = 0 or DMFT>0). In the statistical analysis process, a variety of explanatory variables associated with dental caries were divided into demographic characteristics (including gender and mothers’ educational level; medium (Under 12 years of education) and high (12 or more years of education)) and dental-related information (such as dmft index, having dental visits, frequency of dental flossing and brushing, and sweet consumption frequency).

Statistical analysis

Frequencies, percentages and box plots were used to summarize and visualize qualitative variables, while means and standard deviations were computed for describing continuous features. The association between the DMFT index and categorical variables was evaluated using the chi-square test, whereas continuous variables were assessed through independent sample t-tests.

In order to identify the important variables associated with the DMFT index in first-grade children, the Random Forest method was employed. Based on this statistical approach, the covariate space is recursively partitioned by splitting rules derived from the features. For a given node, the best split is chosen among all possible binary splits obtained from a covariate. Generally, the Random Forest algorithm is one of the most popular approaches, created by drawing bootstrap samples from the original data, so that for each bootstrap dataset, a tree with a selected splitting criterion is constructed. The tree grows up by randomly selecting q0 out of q covariates at each node, and when the stopping criterion is reached, the growth ends. Finally, the average results extracted from all trees are used to make an accurate prediction for a new observation.

In this study, the dichotomized DMFT index (consisting of two classes: DMFT>0 or DMFT = 0) was used as the main outcome under study, and the dmft index, gender, tooth brushing, dental flossing, sweet consumption, dental visit, and maternal education level as predictor variables were used to build the Random Forest model.

Variable importance, a key output of the Random Forest method, was assessed through the permutation importance index. In this context, the importance of variables is determined by the permutation importance index. This index determines the importance of variables by assessing the alteration in prediction error during the random permutation process of out-of-bag (OOB) data specific to each variable (7, 8).

The results derived from the Random Forest technique were contrasted with those generated by a dominant parametric methodology, specifically a logistic regression model. It should be noted that the applied Random Forest and logistic regression models were identical in terms of both independent and dependent variables. Furthermore, the strength of the relationship between indicators and the DMFT index was reported using the odds ratio [OR = exp(B)] in the logistic regression model. In assessing predictive efficacy, a comparison of model performance utilizing the Area Under the Curve (AUC) metric was undertaken between the Random Forest and logistic regression models.

The statistical analysis was conducted using R software, version 4.2.2. The Random Forest was constructed based on the default settings, including the number of trees (analyzing 1,000 trees) and the number of variables evaluated to find the best split at each node.

Results

This investigation encompassed a total of 778 first-grade children, comprising 313 (40.2%) males and 465 (59.8%) females. Within this study, 115 students (14.8%) exhibited at least one decayed permanent tooth (DMFT>0), while 663 subjects (85.2%) remained caries-free (DMFT = 0). Notably, the mean dmft index among children with permanent dental caries was significantly elevated compared to caries-free individuals (7.97 ± 3.44 vs. 6.0 ± 3.88, p-value <0.001). Table 1 shows the general characteristics of the participants by DMFT index as the primary outcome. Results indicate significant associations (p-value <0.05) between gender, mother's educational level, consumption of sweets, and dental visits, and the dichotomized DMFT index in these children.

Table 1
www.frontiersin.org

Table 1. Characteristics of the subjects according to the DMFT index.

In the next step, a multivariable logistic regression model was fitted to investigate the adjusted effect of the covariates on the DMFT index. The results, presented in Table 2, showcase the derived estimates. These estimates illuminate a significant association between mother's educational level, consumption of sweets, and the dmft index with the outcome variable. Specifically, one-unit increase in the dmft index has led to about 13% rise in the odds of dental caries (ie. DMFT>0) in these children (p < 0.001). Furthermore, the estimated odds ratio of 0.59 for sweet consumption indicates that children who consume sweets once a day have approximately 41% lower odds of experiencing dental caries compared to their counterparts (p < 0.05). Noteworthy is the finding that children with mothers possessing lower levels of education exhibit a 2.04 times higher likelihood of dental caries in permanent teeth compared to those with highly educated mothers (p < 0.05).

Table 2
www.frontiersin.org

Table 2. Logistic regression analyses of factors associated with the DMFT index.

In the final step, the random forest model was fitted to the data. According to the results of this model, the relative importance of the predictors under study was determined, as illustrated in Figure 1. These results highlight that the dmft index and sweet consumption exhibit the most substantial contributions to the prediction of the DMFT index. Subsequently, the importance of gender and maternal educational level are in the next positions. Furthermore, the probability of the zero DMFT value (being caries-free) for the first three qualitative features extracted from the Random Forest is visualized using a box plot in Figure 2. This plot shows male gender, high maternal education, and a lower tendency to consume sweets can increase the probability of being caries-free. Also, the estimated AUC values for the logistic regression and Random Forest models were 0.72 and 0.81, respectively. This shows a better fit of the random forest model in prediction of the dichotomized DMFT outcome compared to the binary logistic regression model.

Figure 1
www.frontiersin.org

Figure 1. Variable importance of random forest model for predicting the DMFT index in first-grade children.

Figure 2
www.frontiersin.org

Figure 2. Box plots for probability of being caries-free based on gender, maternal education and sweet consumption.

Discussion

Dental caries in permanent teeth, assessed by the DMFT index, constitutes a significant oral health concern across all age groups, with particular relevance to school-aged children. Although, dental caries in permanent teeth is less common in first-grade children, identifying risk factors associated with this problem is crucial for preventive measures and early intervention to reduce the likelihood of developing dental caries in permanent teeth. According to the published evidence, dental caries has detrimental impacts on the quality of life, growth, and academic performance of children. Despite our knowledge about the significant role of identifying dental caries in permanent teeth in early stages, limited studies have been conducted to examine the factors affecting dental caries in permanent teeth. Also, the published articles in this field did not give adequate consideration to capturing the intricate relationship between indicators and outcome in their statistical analysis. This study addresses this gap by employing sophisticated statistical methods to discern crucial factors associated with dental caries in permanent teeth, acknowledging the complexity and nonlinearity of the relationship between the outcome variable and potential predictors. The estimated predictive power indices for the applied models indicated that both of them had acceptable accuracy in predicting dental caries. However, the performance of the random forest model was better than the ordinary logistic regression model. According to our findings, the results of the variable importance analysis using the Random Forest model revealed that dental caries in deciduous teeth and sweet consumption were the predictors with highest relative importance among the assessed indicators. Moreover, gender and maternal educational level had a remarkable influence on the prediction of dental caries in permanent teeth (DMFT). Despite some differences in identifying important variables, the obtained results from the random forest analysis were generally aligned with those found from the logistic regression model.

The obtained results from the present study showed that more than 85% of the first-grade children were caries-free. Consistent with our findings, a study by Masood et al. in 2004 on Malaysian primary school children (aged 6 years) demonstrated a congruent pattern, where the mean (SD) DMFT index was 0.06 (0.306), and 95.4% of children were caries-free. This study underscored a robust association between early caries in permanent teeth at the age of six and the predisposition to future caries (11). In another study by Kamiab et al. in Iran, the researchers stated that 364 (17.9%) of similarly aged children had decayed permanent molars (12). In addition, our findings are similar to the study conducted by Movahed et al. in Iran. They reported a mean of 0.2 for DMFT index in 6-year-old Iranian children. However, our findings are slightly lower than the prevalence of dental caries in permanent teeth reported by two other studies conducted by Kazeminia et al. and Heinrich-Weltzien et al. In a systematic review carried out by Kazeminia et al., the global prevalence of dental decay in permanent teeth was reported to be 53.8% in children. This difference in prevalence of dental decay may be attributed to the diversity in age groups under study in the above-mentioned researches (13). Furthermore, in a study performed by Heinrich-Weltzien et al. on 1,962 Filipino children aged 6–7 year, the prevalence of caries in permanent teeth was reported as 39.7% (14). In another study conducted by Bakhshi et al. in Iran, their findings showed a mean (SD) DMFT of 2.76 (2.67) among 12–15-year-old adolescents (9). This higher DMFT index may be caused by untreated dental caries in permanent teeth in the early stages of childhood. In summary, these findings indicate a markedly higher DMFT index among teenage students compared to younger age groups in Iran, emphasizing the imperative goals outlined by the World Health Organization to reduce the DMFT index among 12- to 15-year old children (15).

Our investigation into demographic factors reveals a noteworthy association between maternal education levels and the prevalence of dental caries in permanent teeth among children. Specifically, our findings indicate that children with mothers possessing lower educational attainment exhibit a substantially higher prevalence of dental caries, with odds more than 2 times greater compared to their counterparts with literate mothers. This finding is supported by the results of various studies across all age groups and different types of teeth. For example, the obtained results from a study, carried out by Nembhwan et al. on 120 mother–child pairs (under 12 years old), revealed that children who had mothers with higher levels of education and occupation had experienced lower scores of DMFT (16). In another study conducted by Borges et al. on 10–17 years old Brazilian students of both genders, a significant correlation between maternal education and dental caries was observed, so that students with mothers who had lower levels of education had a higher prevalence of caries in permanent teeth (17). In a study by Teresa et al. on 7–12 years old students, the results demonstrated that children whose mothers had lower levels of education were more likely to have dental caries (OR = 2.27) (18). The significant association between maternal education and dental caries has also been confirmed by a longitudinal study in Australia. This study, conducted on indigenous children, showed that those with low-educated mothers (less than 10 years of education) tended to have more dental caries compared to those with highly-educated mothers (19). However, in a study by Wu in China, interestingly, multivariable logistic regression analysis showed no relationship between maternal educational level and risk of caries in 12-year-old Chinese students (20). This discrepancy may be attributed to multicollinearity among variables in the employed multivariable model.

The positive impact of maternal education, particularly in oral health education, on reducing dental caries can be effectively managed and controlled. Through targeted oral health educational interventions, there is a proactive opportunity to prevent dental caries. Empowering mothers with essential knowledge and resources enables them to make informed decisions about oral health practices, ultimately leading to a notable decrease in the prevalence and severity of dental caries in both mothers and their children. This assertion is supported by various studies, such as a trial conducted by Deghatipour et al. in 2021 involving 439 mothers observed from pregnancy to 24 months post-delivery to assess interventions aimed at improving children's oral health. Participants were divided into intervention (n = 239) and control (n = 200) groups. In the intervention group, mothers received heightened awareness through oral health messages delivered via discussions with dentists or face-to-face education by primary healthcare providers. The study concluded that children in the intervention group were more likely to be free from caries by the end of the follow-up period compared to those in the control group (21). Furthermore, in another trial study by Mohebbi et al., Iranian mothers with 1- to 2-year-old children received education on sugar consumption, nighttime feeding practices, and oral hygiene. Results indicated that this intervention significantly improved the oral health of mother-child pairs (22). Therefore, enhancing mothers’ knowledge in oral health can be achieved through various strategies, such as conducting oral health workshops, distributing printed materials, sharing educational videos on social media, integrating oral health education into parenting classes, and providing ongoing support and follow-up sessions. These initiatives play a crucial role in promoting improved oral health outcomes for both mothers and their children by empowering them with essential information and skills to prioritize oral health and implement effective oral hygiene practices at home.

Concerning oral behavior factors, our statistical analysis unveiled two primary findings aligned with existing literature. Firstly, the dmft index (caries index of primary teeth) exhibited a significant impact on the occurrence of caries in permanent teeth among first-grade school children. Specifically, our results indicate that a one-unit increase in the dmft index corresponds to an approximately 13 percent elevation in the odds of developing caries in permanent teeth. In a study conducted by Motohashi et al. on Japanese primary school students, the researchers obtained a significant relationship between the dmft score and caries in permanent teeth. They showed that the dmft score can be considered as a helpful indicator to predict caries in permanent teeth with an acceptable accuracy (6). In a prospective cohort study, which followed Brazilian children from birth to age 12 years, strong evidence was found between the pattern of caries in primary and permanent dentition (23). Additionally, the findings of an eight-year cohort study on 362 Chinese children (at the baseline, they were 3–5 years old) revealed that caries in the primary teeth can predict caries in the permanent dentition with a high degree of accuracy. Precisely, children with caries in their primary teeth were three times more likely to have caries in their permanent teeth (24). In our study, the estimated odds ratio of caries in permanent teeth based on the dmft score was remarkably lower than the studies on Chinese children. This contradiction may be due to the fact that the first-grade school children with a few caries in their permanent teeth were examined.

Second, we found a significant relationship between the occurrence of dental caries in permanent teeth and sweet consumption, so that the children who consume sweets more frequently were more likely to develop dental caries in their permanent teeth. These findings are supported by other studies in this field. For example, in a study conducted by Feldens with the aim of evaluating the association between sugar consumption patterns and permanent dentition caries in 6-year-old children, it was reported that sugar-related behaviors were significantly associated with permanent caries (25). The findings from another study performed by Bassa et al. on 761 school-aged participants showed that the consumption of sweets leads to more than 4 times increase in the risk of dental caries in permanent teeth (26). Additionally, a study by Hong et al. in the UK showed strong evidence for the significant relationship between sweet consumption and decay in permanent teeth in children (27). Furthermore, our study revealed that gender is one of the important demographic factors that affects dental caries. Specifically, our findings indicated that male children in the first grade are at a lower risk of developing dental caries in permanent teeth than females. Several studies have shown a significant association between gender and caries in permanent teeth, with their results indicating a higher risk of caries in females compared to males. For instance, Mubaraki et al. conducted a cross-sectional study in 2022 to explore the prevalence of permanent dental caries among children in Saudi Arabia. Their findings revealed that the overall prevalence of decayed first permanent teeth was 30.5%, highlighting a significant disparity between genders. Specifically, the prevalence of caries in the first permanent teeth was notably higher in females compared to males (28). Additionally, in a longitudinal study conducted in 2016, researchers examined children from Shanghai, China, at the age of 5 years and followed up with them at the age of 12 years. The study revealed that the prevalence of dental caries was 33.9% in boys and 37.9% in girls with mean DMFT scores in permanent teeth of 0.7 ± 0.0 and 0.8 ± 0.1, respectively (29). Also, in a study conducted by Papadaki in 2021, UK children aged 5, 8, 12, and 15 years were considered. Interestingly, no gender disparities in dental caries were found among 5- and 8-year-old children. However, in adolescents aged 12 and 15 years, females had higher DMFT scores than males (30).

In the present study, the advanced method of Random Forest was used alongside the logistic regression model, as a common method, to identify associated factors with early permanent teeth caries. Some unique features of Random Forest method, such as simplicity, versatility, and robust performance, make it more popular compared to other machine learning and deep learning algorithms. Unlike the majority of machine or deep learning models, which act as a “black box”, Random Forest provides a clear understanding of the importance of features in predicting the outcome of our study. This method allows us to accurately investigate all the complex relationships between the indicators and outcome variable. More importantly, Random Forest can be used as a helpful tool to handle imbalanced data by setting the sampsize parameter in building each tree in the R software. In other words, the balanced sample solution aims to induce the algorithm to build trees based on a balanced bootstrap sample. This feature was constructive in our study as the utilized data was imbalanced and included a considerable number of zero values. At the same time, there are several potential predictors associated with dental caries, including maternal oral health, socioeconomic backgrounds, and fluoride exposure. Neglecting to account for these factors represents a limitation in our study. Addressing these covariates in our models could enhance the accuracy of models to predict DMFT. For instance, a recent study by Sadegh Zadeh et al. in 2022 applied various machine learning models based on fluoride exposure and other oral factors to predict dental caries in Iranian children. By incorporating these variables, their findings indicated that random forest yielded the highest predictive performance, with an accuracy of 97.4% (31). Likewise, in a study conducted by Ogwo et al. in 2023 involving young adults, the utilization of fluoride, income status, and other oral behavioral factors as covariates in machine learning models like LASSO regression and generalized boosting machines led to improve the accuracy of models. Their results demonstrated that integrating these covariates could boost the AUC of LASSO regression to as high as 93%. Therefore, while our current study incorporated significant influential factors like dmft, recognizing additional variables could further enhance the model's accuracy (32). Despite the appealing features of the Random Forest model in identifying influential factors, it is still rarely used in the field of caries of permanent teeth. For example, in the cross-sectional study conducted by Park et al., machine learning algorithm and logistic regression model were employed to predict early childhood caries in Korean 1- to 5-year-old Korean children. Their findings showed that two methods, with the range of accuracy between 0.774 and 0.785, had an acceptable performance for predicting and identifying associated factors with dental caries (33, 34).

Conclusion

In summary, our investigation demonstrated that both the Random Forest and logistic regression models exhibit satisfactory accuracy in predicting dental caries. However, a comparative analysis of predictive power indices revealed that the Random Forest model is the preferred choice for predicting dental caries, as it marginally outperformed logistic regression on the examined dataset. Clinically, we found a strong relationship between the educational level of mothers and the occurrence of dental caries in their children. More importantly, we also found that effectively managing dental caries in primary teeth and reducing sweet consumption can significantly decrease the risk of developing caries in permanent teeth, particularly in first-grade children. Additionally, early permanent teeth caries is an important alarm for predicting future caries incidence in adulthood. In light of these findings, it can be conclusively affirmed that early detection of permanent dental caries, coupled with the implementation of preventative measures, is imperative for maintaining optimal oral health. These insights contribute to the broader understanding of the intricate relationships between predictive models, maternal education, early dental care practices, and the long-term implications for oral health.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by The present study has been approved by the Ethics Committee of Shahid Beheshti University of Medical Sciences (Ethics code: IR.SBMU.RETECH.REC.1402.136). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

FM: Conceptualization, Data curation, Formal Analysis, Methodology, Software, Writing – original draft, Writing – review & editing. ZG: Conceptualization, Data curation, Writing – review & editing. MA: Software, Writing – review & editing. MD: Data curation, Writing – review & editing. MM: Software, Writing – review & editing. MA: Data curation, Writing – review & editing. FA: Conceptualization, Writing – review & editing. FZ: Conceptualization, Formal Analysis, Software, Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Acknowledgments

We would like to express our gratitude to the study participants for their collaboration throughout the process of collecting data.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Wen PY, Chen MX, Zhong YJ, Dong QQ, Wong HM. Global burden and inequality of dental caries, 1990 to 2019. J Dent Res. (2022) 101(4):392–9. doi: 10.1177/00220345211056247

PubMed Abstract | Crossref Full Text | Google Scholar

2. Fejerskov O, Nyvad B, Kidd EA. Pathology of dental caries. In: Fejerskov O, Kidd E. editors. Dental Caries: the Disease and its Clinical Management. Vol. 3. Oxford: Blackwell Munksgaard (2003). p. 7–9.

Google Scholar

3. Elamin A, Garemo M, Mulder A. Determinants of dental caries in children in the Middle East and North Africa region: a systematic review based on literature published from 2000 to 2019. BMC Oral Health. (2021) 21(1):1–30. doi: 10.1186/s12903-021-01482-7

PubMed Abstract | Crossref Full Text | Google Scholar

4. Moradi G, Bolbanabad AM, Moinafshar A, Adabi H, Sharafi M, Zareie B. Evaluation of oral health status based on the decayed, missing and filled teeth (DMFT) index. Iran J Public Health. (2019) 48(11):2050–7. PMID: 31970104.31970104

PubMed Abstract | Google Scholar

5. Alm A. On Dental Caries and Caries-Related Factors in Children and Teenagers. Gothenburg: Inst of Odontology. Dept of Cariology (2008).

Google Scholar

6. Motohashi M, Yamada H, Genkai F, Kato H, Imai T, Sato S, et al. Employing dmft score as a risk predictor for caries development in the permanent teeth in Japanese primary school girls. J Oral Sci. (2006) 48(4):233–7. doi: 10.2334/josnusd.48.233

PubMed Abstract | Crossref Full Text | Google Scholar

7. Hastie T, Tibshirani R, Friedman JH, Friedman JH. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York: springer (2009).

Google Scholar

8. Liaw A, Wiener M. Classification and regression by randomForest. R News. (2002) 2(3):18–22.

Google Scholar

9. Bakhshi E, Yazdanipour MA, Rahgozar M, Ghorbani Z, Deghatipour M. Overall effects of risk factors associated with dental caries indices using the marginalized zero-inflated negative binomial model. Caries Res. (2019) 53(5):541–6. doi: 10.1159/000498892

PubMed Abstract | Crossref Full Text | Google Scholar

10. Alraqiq H, Eddali A, Boufis R. Prevalence of dental caries and associated factors among school-aged children in Tripoli, Libya: a cross-sectional study. BMC Oral Health. (2021) 21(1):224. doi: 10.1186/s12903-021-01545-9

PubMed Abstract | Crossref Full Text | Google Scholar

11. Masood M, Yusof N, Hassan MI, Jaafar N. Assessment of dental caries predictors in 6-year-old school children-results from 5-year retrospective cohort study. BMC public Health. (2012) 12:1–7. doi: 10.1186/1471-2458-12-989

PubMed Abstract | Crossref Full Text | Google Scholar

12. Kamiab N, Kamalabadi YM, Fathollahi MS. DMFT of the first permanent molars, dmft and related factors among all first-grade primary school students in Rafsanjan urban area. J Dent. (2021) 22(2):109. doi: 10.30476/DENTJODS.2020.85573.1136

PubMed Abstract | Crossref Full Text | Google Scholar

13. Kazeminia M, Abdi A, Shohaimi S, Jalali R, Vaisi-Raygani A, Salari N, et al. Dental caries in primary and permanent teeth in children’s worldwide, 1995 to 2019: a systematic review and meta-analysis. Head Face Med. (2020) 16(1):1–21. doi: 10.1186/s13005-020-00237-z

PubMed Abstract | Crossref Full Text | Google Scholar

14. Heinrich-Weltzien R, Monse B, Benzian H, Heinrich J, Kromeyer-Hauschild K. Association of dental caries and weight status in 6-to 7-year-old Filipino children. Clin Oral Investig. (2013) 17:1515–23. doi: 10.1007/s00784-012-0849-3

PubMed Abstract | Crossref Full Text | Google Scholar

15. Hobdell M, Clarkson J, Petersen PE, Johnson N. Global goals for oral health 2020. Int Dent J. (2003) 53(5):285–8. doi: 10.1111/j.1875-595X.2003.tb00761.x

PubMed Abstract | Crossref Full Text | Google Scholar

16. Nembhwani HV, Varkey I. Caries experience and its relationship with mother’s educational level and occupational Status: a cross-sectional survey. Int J Clin Pediatr Dent. (2022) 15(Suppl 2):S226. doi: 10.5005/jp-journals-10005-2163

PubMed Abstract | Crossref Full Text | Google Scholar

17. Borges TS, Schwanke NL, Reuter CP, Neto LK, Burgos MS. Factors associated with caries: a survey of students from Southern Brazil. Rev Paul Pediatr. (2016) 34:489–94. doi: 10.1016/j.rppede.2016.02.013

PubMed Abstract | Crossref Full Text | Google Scholar

18. Villanueva-Gutiérrez T, Irigoyen-Camacho ME, Castaño-Seiquier A, Zepeda-Zepeda MA, Sanchez-Pérez L, Frechero NM. Prevalence and severity of molar–incisor hypomineralization, maternal education, and dental caries: a cross-sectional study of Mexican schoolchildren with low socioeconomic status. J Int Soc Prev Community Dent. (2019) 9(5):513. doi: 10.4103/jispcd.JISPCD_130_19

Crossref Full Text | Google Scholar

19. Ju X, Jamieson LM, Mejia GC. Estimating the effects of maternal education on child dental caries using marginal structural models: the longitudinal study of indigenous Australian children. Community Dent Oral Epidemiol. (2016 Dec) 44(6):602–10. doi: 10.1111/cdoe.12259

PubMed Abstract | Crossref Full Text | Google Scholar

20. Wu L, Li J, Zhang Y, Zhou Y, Liang Y, Huang S. Oral health status and risk factors for caries in permanent teeth among 12-year-old students in Guangdong, Southern China: a population-based epidemiological survey. Oral Health Prev. Dent. (2020) 18:731–40. doi: 10.3290/j.ohpd.a45076

PubMed Abstract | Crossref Full Text | Google Scholar

21. Deghatipour M, Ghorbani Z, Mokhlesi AH, Ghanbari S, Namdari M. Community-based interventions to reduce dental caries among 24-month old children: a pilot study of a field trial. BMC Oral Health. (2021) 21(1):637. doi: 10.1186/s12903-021-01999-x

PubMed Abstract | Crossref Full Text | Google Scholar

22. Mohebbi SZ, Virtanen JI, Vehkalahti MM. Improvements in the behaviour of mother-child pairs following low-cost oral health education. Oral Health Prev Dent. (2014) 12(1):15–8. doi: 10.3290/j.ohpd.a31227

Crossref Full Text | Google Scholar

23. Feldens CA, Braga VS, Kramer PF, Vítolo MR, Rodrigues PH, de Barros Coelho EM, et al. Primary dentition caries patterns as predictors of permanent dentition caries: a prospective cohort study. Caries Res. (2023) 57(2):167–76. doi: 10.1159/000529620

PubMed Abstract | Crossref Full Text | Google Scholar

24. Li Y, Wang W. Predicting caries in permanent teeth from caries in primary teeth: an eight-year cohort study. J Dent Res. (2002) 81(8):561–6. doi: 10.1177/154405910208100812

PubMed Abstract | Crossref Full Text | Google Scholar

25. Feldens CA, Dos Santos IF, Kramer PF, Vítolo MR, Braga VS, Chaffee BW. Early-life patterns of sugar consumption and dental caries in the permanent teeth: a birth cohort study. Caries Res. (2021) 55(5):505–14. doi: 10.1159/000518890

PubMed Abstract | Crossref Full Text | Google Scholar

26. Bassa S, Workie SB, Kassa Y, Tegbaru DW. Prevalence of dental caries and relation with nutritional status among school-age children in resource limited setting of southern Ethiopia. BMC Oral Health. (2023) 23(1):84. doi: 10.1186/s12903-023-02786-6

PubMed Abstract | Crossref Full Text | Google Scholar

27. Hong J, Whelton H, Douglas G, Kang J. Consumption frequency of added sugars and UK children’s dental caries. Community Dent Oral Epidemiol. (2018) 46(5):457–64. doi: 10.1111/cdoe.12413

PubMed Abstract | Crossref Full Text | Google Scholar

28. Mubaraki S, AlOlyan R, AlBrekeit J, AlFouzan S, Abosharkh M, AlSaeri N, et al. Prevalence of caries in first permanent molar among children in Saudi Arabia: a retrospective study. Eur Rev Med Pharmacol Sci. (2022) 26(20):7550–4. doi: 10.26355/eurrev_202210_30026

PubMed Abstract | Crossref Full Text | Google Scholar

29. Du Q, Yu M, Li Y, Du H, Gao W, Mei H, et al. Permanent caries experience is associated with primary caries experience: a 7-year longitudinal study in China. Community Dent Oral Epidemiol. (2017) 45(1):43–8. doi: 10.1111/cdoe.12257

PubMed Abstract | Crossref Full Text | Google Scholar

30. Papadaki S, Douglas GV, HaniBani A, Kang J. Gender differences in caries and periodontal Status in UK children. medRxiv. (2021):6–8. doi: 10.1101/2021.03.24.21253842

Crossref Full Text | Google Scholar

31. Sadegh-Zadeh SA, Rahmani Qeranqayeh A, Benkhalifa E, Dyke D, Taylor L, Bagheri M. Dental caries risk assessment in children 5 years old and under via machine learning. Dent J (Basel). (2022) 10(9):164. doi: 10.3390/dj10090164

PubMed Abstract | Crossref Full Text | Google Scholar

32. Ogwo C, Grant B, Warren J, Caplan D, Levy S. Predicting Dental Caries Outcomes in Young Adults Using Machine Learning Approach. 2023.

Google Scholar

33. Park YH, Kim SH, Choi YY. Prediction models of early childhood caries based on machine learning algorithms. Int J Environ Res Public Health. (2021) 18(16):8613. doi: 10.3390/ijerph18168613

PubMed Abstract | Crossref Full Text | Google Scholar

34. Bayat Movahed S, Samadzadeh H, Ziyarati L, Memory N, Khosravi R, Sadr Eshkevari PS. Oral health of Iranian children in 2004: a national pathfinder survey of dental caries and treatment needs. EMHJ. (2011) 17(3):243–9. doi: 10.26719/2011.17.3.243

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: dental caries, random forest, permanent teeth, DMFT index, logistic regression

Citation: Masaebi F, Ghorbani Z, Azizmohammad Looha M, Deghatipour M, Mohammadzadeh M, Ahsaie MG, Asadi F and Zayeri F (2024) Identifying early permanent teeth caries factors in children using random forest algorithm. Front. Dent. Med 5:1359379. doi: 10.3389/fdmed.2024.1359379

Received: 21 December 2023; Accepted: 28 March 2024;
Published: 12 April 2024.

Edited by:

Tammy D. Duangthip, The Ohio State University, United States

Reviewed by:

Eleftherios G. Kaklamanos, School of Dentistry Aristotle University of Thessaloniki, Greece
Fatih Sengul, Atatürk University, Türkiye

© 2024 Masaebi, Ghorbani, Azizmohammad Looha, Deghatipour, Mohammadzadeh, Ahsaie, Asadi and Zayeri. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Farid Zayeri, ZnpheWVyaUBnbWFpbC5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.