Skip to main content

ORIGINAL RESEARCH article

Front. Public Health, 01 December 2022
Sec. Digital Public Health
This article is part of the Research Topic Machine Learning Used in Biomedical Computing and Intelligence Healthcare, Volume III View all 9 articles

Evaluation of a machine learning algorithms for predicting the dental age of adolescent based on different preprocessing methods

\nShihui Shen,,,,,&#x;Shihui Shen1,2,3,4,5,6Xiaoyan Yuan,,,,,&#x;Xiaoyan Yuan1,2,3,4,5,6Jian Wang,,,,,Jian Wang1,2,3,4,5,6Linfeng Fan,,,,,Linfeng Fan2,3,4,5,6,7Junjun Zhao,,,,,
Junjun Zhao1,2,3,4,5,6*Jiang Tao,,,,,
Jiang Tao1,2,3,4,5,6*
  • 1Department of General Dentistry, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
  • 2College of Stomatology, Shanghai Jiao Tong University, Shanghai, China
  • 3National Center for Stomatology, Shanghai, China
  • 4National Clinical Research Center for Oral Diseases, Shanghai, China
  • 5Shanghai Key Laboratory of Stomatology, Shanghai, China
  • 6Shanghai Research Institute of Stomatology, Shanghai, China
  • 7Department of Radiology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China

Background: Machine learning (ML) algorithms play a key role in estimating dental age. In this study, three ML models were used for dental age estimation, based on different preprocessing methods.

Aim: The seven mandibular teeth on the digital panorama were measured and evaluated according to the Cameriere and the Demirjian method, respectively. Correlation data were used for decision tree (DT), Bayesian ridge regression (BRR), k-nearest neighbors (KNN) models for dental age estimation. An accuracy comparison was made among different methods.

Subjects and methods: We analyzed 748 orthopantomographs (392 males and 356 females) from eastern China between the age of 5 and 13 years in this retrospective study. Three models, DT, BRR, and KNN, were used to estimate the dental age. The data in ML is obtained according to the Cameriere method and the Demirjian method. Coefficient of determination (R2), mean error (ME), root mean square error (RMSE), mean square error (MSE) and mean absolute error (MAE), the above five metrics were used to evaluate the accuracy of age estimation.

Results: Our experimental results showed that the prediction accuracy of dental age was affected by ML algorithms. MD, MAD, MSE, RMSE of the dental age predicted by ML were significantly decreased. Among all the methods, the KNN model based on the Cameriere method had the highest accuracy (ME = 0.015, MAE = 0.473, MSE = 0.340, RMSE = 0.583, R2 = 0.94).

Conclusion: The results show that the prediction accuracy of dental age is influenced by ML algorithms and preprocessing method. The KNN model based on the Cameriere method was able to infer dental age more accurately in a clinical setting.

Introduction

Age estimation through the application of dental morphology and radiology plays an important role in clinical medicine and forensic medicine (13). Age can now be estimated by skeletal maturation or dental development (4, 5). However, age estimation may be biased when individuals encounter chronic diseases or nutritional deficiencies during growth and development. Compared to skeletal maturity, dental development is less influenced by the environment, which may be related to the strict genetic control of tooth development (6, 7).

The Demirjian method classifies the teeth into eight stages from A to H depending on maturity and calcification (8). The sum of seven permanent teeth in the left lower jaw corresponds to different ages for boys and girls, respectively. The Demirjian method relies on the subjective judgment of the assessor involved, which may lead to a high level of error. Recently, a new method was invented by Cameriere, which has been widely used all over the world (9). It is a European formula that measures the open apices of the seven permanent teeth in the left lower jaw by means of panoramic radiographs. This method is more objective by measuring data related to the teeth to estimate the dental age.

Both of these methods have been applied in China. The study by Ye et al. (10) showed that the dental age estimated with the Demirjian method was 1.68 years higher than the chronological age for males and 1.28 years higher for females. Shen et al. showed that the dental age obtained with the European formula was underestimated by 0.690 years for males and 0.484 years for females (11). However, by using machine learning (random forest, support vector machine, and linear regression model), the difference between dental age and chronological age was reduced to < 0.01 years.

Machine learning (ML) methods are widely used for early prediction and identification of different types of diseases (1214). We are very pleased to see that ML is gradually being used in clinical medicine, and that the combination of medicine and engineering is opening up more possibilities for clinical practice. Brain age, ophthalmology and dental age have all been studied with ML (12, 13, 15). For adolescent children, the mean absolute error (MAE) for age estimation using ML was < 1 year in all cases (11, 13, 16). In 2021, Galibourg et al. (16) reported surprising results for machine learning based on the Demirjian method for estimating age without using traditional conversion tables. For adults, the MAE was 6.022 years when dental age was estimated (17).

The advent of ML has further reduced the error in age estimation and made age estimation more reliable and practical. Many studies have applied ML to the estimation of bone age (18), and the application of ML to dental age is gradually increasing (11, 16). Studies have shown that ML is more accurate than traditional radiological methods. With ML, the accuracy of both methods has been improved (11, 16). However, it has not been discussed about which is more accurate when both Demirjian and Cameriere methods use ML.

The purpose of this study is to compare (1) the dental age prediction ability of Demirjian and Cameriere methods by ML algorithm (2) the dental age prediction ability of 3 ML algorithms: decision tree (DT), Bayesian ridge regression (BRR), k-nearest neighbors (KNN) model.

Materials and methods

Samples

The research was authorized by the Independent Ethics Committee of the Shanghai Ninth Hospital affiliated with Shanghai Jiao Tong University, School of Medicine (2017-282-T212). All methods were performed in accordance with relevant guidelines and regulations. Informed consent was obtained from all subjects or their legal guardian(s) participating in the study.

This retrospective study selected digital panoramic radiographs taken by KODAK 8000C Panoramic and Cephalometric Digital Dental X-ray Machine collected during outpatient treatment between 2000 and 2013. A total of 748 panoramic images of adolescents aged 5–13, including 356 females and 392 males, were included in this study. Their age and gender distributions are shown in Table 1. Since the date of birth and the date of taking the panoramic images were known for each subject, the chronological age was calculated as the difference between these two dates, rounded to two decimal places.

TABLE 1
www.frontiersin.org

Table 1. Age groups and gender distribution.

The inclusion criteria for panoramic radiographs are as follows: complete mandibular permanent teeth (except third molars); clearly visible root development; no systemic disease; no history of root canal therapy; no related diseases affecting mandibular development, such as cysts or cancer.

Dental age estimation

Digital panoramic radiographs were stored on a computer and processed by computer-aided measuring software (Adobe Photoshop CC 2017). All panoramic slices were pre-processed according to the Cameriere and Demirjian methods, and the relevant data were recorded.

According to the Demirjian method, 7 mandibular teeth were classified into eight stages from A to H according to their development and mineralization (8) (Figure 1, left). When it comes to Demirjian method, the assessment grade of the7 permanent teeth and the gender of the sample were included into the ML algorithm as variables. In the Demirjian method, we used one-hot encoding. One-Hot encoding, also known as one-bit valid encoding, essentially encodes N states using N-bit state registers, each with its own register bits, with only one bit valid at any time.

FIGURE 1
www.frontiersin.org

Figure 1. (Left) Schematic representation of the developmental assessment of seven molars according to the Demirjian method (the letters represent the stages of the Demirjian tooth age inference method). (Right) Schematic representation of the measurement of single-rooted and double-rooted teeth according[[Inline Image]] to the Cameriere method.

The Cameriere method, in short, divides the distance (Ai, i = 1, ..., 7) between the inner sides of the open apex of each of the seven left mandibular teeth by the length of the tooth (Li, i = 1, ...,7) to obtain the normalized value (Xi = Ai/ Li, i = 1, ...,7). Xi (i = 1, …, 7) = 0 if tooth development is complete and the apical foramen is closed (Figure 1, right) (9). The standard values (Xi, i = 1, ..., 7) of the seven permanent teeth and the gender of the sample were included into the ML algorithm as variables, when it comes to Cameriere method.

The following ML supervised regression algorithms were tested, DT, BRR and KNN model. Out-of-sample performance have been assessed by means of the well-known K-fold cross-validation. More precisely, in this study, we divided the data into ten groups. The image data were divided into 10 groups, 9 groups are used as training data and one for validation (19, 20). This process was repeated 10 times until each of these 10 sets became the validation dataset. The machine learning models based on Demirjian method were trained on the information sources as follows: gender (g) and the assessment grade of the seven permanent teeth. The machine learning models based on Cameriere method were trained on the information sources as follows: gender (g), the normalized measurements of the seven permanent developing teeth on the left mandible (Xi, i = 1, …,7), the sum of the normalized open apices (s, s = X1+ X2+…+ X7), the number of teeth with complete root development (N0) and the first-order relationship between s and N0 (s·N0). The target value was the chronological age.

All three ML algorithms are supervised learning. DT is a supervised learning methodology commonly adopted for both classification and regression (21). Ridge regression is a model tuning method that is used to analyze the data that suffers from multi-collinearity (22). Ridge regression treats the model parameters as fixed and finds estimates by minimizing a penalized cost function. BRR is the Bayesian counterpart, hence, in this framework the parameters are random variables whose prior distribution, exploiting Bayes rule, is updated by the data likelihood, leading to a posterior distribution for the parameter vector. It must be pointed out that the Bayesian framework allows a more natural uncertainty quantification, since it allows for an explicit probabilistic interpretation to parameters.

The KNN algorithm is supervised learning, where each piece of data in the training sample set has a definite known label. KNN does not have an explicit learning process and is, in fact, a well-known representative of lazy learning. The KNN algorithm itself is simple, easy to understand and implement, and it is highly accurate and insensitive to outliers (23, 24). The disadvantages are long computation time, high complexity and high memory overhead, as the training data needs to be parked in memory.

To avoid overfitting, a 20% validation dataset was used during hyperparameter optimization. The tuning of the hyperparameters to obtain the best model was achieved by exploring multiple combinations using the GridSearchCV function. The hyperparameters described in Supplementary Table S1 were tuned.

The method of age inference may underestimate or overestimate age, which is known as prediction error. An accurate method has low prediction error according to some performance metric, meaning that the average difference between dental age and chronological age will be close to zero. Accuracy refers to how large the difference between the dental age (DA) and chronological age (CA). The difference between DA and CA can be expressed in many ways according to the considered performance metric, such as mean absolute error, median absolute error, etc.

The chronological age of each subject was calculated by subtracting the date of birth from the date the panoramic was taken. In this study, the mean error (ME) of dental age and chronological age was calculated to quantify the direction of error (CA-DA), where positive values indicate that dental age is underestimated and negative values conversely. The mean absolute difference (MAD) of dental age and chronological age was calculated to quantify the magnitude of the error. In addition to the above two metrics, three other indicators were used to asses accuracy, the coefficient of determination (R2), root mean square error (RMSE) and mean square error (MSE) were also used to evaluate the accuracy of age estimation.

Data analysis and related icon production were performed through SPSS 25.0 (IBM Corp. Released 2017. IBM SPSS Statistics for Windows, Version 25.0. Armonk, NY: IBM Corp.), Pycharm 2021 and Python 3.8.2. The significance level was set at 5%.

Intra and interobserver agreement

Since the possibility of reliably replication measurements is an important component of any measurement study, both intra- and interobserver error was tested. Both observers participating in the study were trained in age estimation methods. Each observer assessed each of the 748 radiographs. To assess interobserver reliability, both observers twice evaluated 50 randomly selected X-rays before starting the original study. The intraobserver correlation coefficient (ICC) for the intra-observer agreement was 0.92 for both observers, whereas it was 0.86(for Cameriere method) and 0.82(for Deimirjian method) for the inter-observer agreement. The results of the intra-class correlation coefficient show the inter-observer and test-recovery reliability.

Results

A total of 748 panoramic images, 356 females and 392 males, were included in this study. The distribution of males and females in each group was relatively even. The performance metrics are shown in Table 2.

TABLE 2
www.frontiersin.org

Table 2. Mean error (ME), mean absolute error (MAE), mean square error (MSE), root mean square error (RMSE), and R2 values assessing performance of machine learning regression methods based on Demirjian and Cameriere method, respectively.

Compared with traditional methods, whether it is Deimirjian or Cameriere method, ME, MAE, MSE and RMSE have been greatly reduced after the optimization of ML, which means that the accuracy is greatly improved. When using the traditional Deimirjian method, its ME and MAE were −0.647 and 0.982, respectively, and the MSE and RMSE were 22.254 and 4.717. That is, the dental age estimated by the traditional Deimirjian method was 0.647 year higher than the chronological age, with an error of 0.982 year. With the aid of ML, the ME of the three models (BRR, DT, and KNN models) were −0.002, 0.011 and −0.027, respectively. The MAE also dropped to 0.510, 0.523, and 0.517. ML also had obvious advantages in the Cameriere method. Before using ML, the ME and MAE of the Cameriere method were 0.592 and 0.846, respectively. After optimization with ML, the ME became −0.030 (BRR), 0.052 (DT) and −0.015 (KNN), and the MAE dropped to 0.535 (BRR), 0.584 (DT) and 0.473 (KNN).

For the Cameriere method, among the three ML models, the KNN model performed the best, with the highest R2 (0.940) and the smallest ME and MAE. The DT model had the lowest R2 (0.893), that was, the lowest fit. In the Deimirjian method, the DT model was also a ML model with the lowest fitting degree (R2 = 0.892). The best performer in the Deimirjian method was the BRR model (R2 = 0.928, ME = −0.002, MAE = 0.510), followed by the KNN model (R2 = 0.923, ME = −0.027, MAE = 0.517) (Figure 2).

FIGURE 2
www.frontiersin.org

Figure 2. Mean error (ME) and mean absolute error (MAE) of different machine learning algorithms (DT, decision tree; BRR, Bayesian ridge regression; KNN, k-nearest neighbors) based on Demirjian or Cameriere method.

Among all the methods in this study, the smallest MAE is the KNN model ML based on the Cameriere method (MAE = 0.473), the dental age is only overestimated by 0.015 years, which is close to 0. The fit was also very high, at 0.94.

The linear equation was used to estimate the absolute error of the dependence of the sample population on the dental age under different methods (Table 3). Errors were only calculated for the test set during the 10-fold cross-validation. As show in this Table 3, the accuracy of the model decreases as individuals are getting older and ranges between 0.219 and 0.73 years.

TABLE 3
www.frontiersin.org

Table 3. Typical absolute error, in dependency of dental age for different situations.

As can be seen from Figure 3, most of the samples lie whithin the DA ± δ interval. For samples aged 5–13, the model was most accurate in the middle age group. Accuracy begins to decrease as the age of individuals age grows or decreases. The points are more concentrated under the KNN model. If the interval is doubled, i.e., DA ± 2δ, more individuals will be covered. In Figure 3, the green area corresponds to the interval DA ± δ, whereas the red area corresponds to the interval DA ± 2δ. The δ was approximated as linear function of absolute error in dependency of dental age. This means that the independent variable of δ was the dental age and the dependent variable was the corresponding absolute error.

FIGURE 3
www.frontiersin.org

Figure 3. The model performance for different machine learning algorithms (DT, decision tree; BRR, Bayesian ridge regression; KNN, k-nearest neighbors) based on Demirjian or Cameriere method. Green area: dental age ±δ; red + green area: dental age ± 2δ; ideal line: chronological age = estimated dental.

In addition, correlation coefficients were calculated for the respective variables of the Cameriere and Demirjian methods. The heat map of the correlation coefficients is shown in Figure 4. As seen in Figure 4, each variable has a correlation coefficient close to 1 or −1 with respect to the target value, meaning that they all contribute considerably to the prediction. The closer the correlation coefficient is to 1 or −1, the stronger the positive/inverse linear relationship, while a value of 0 indicates that there is no linear relationship between the two variables.

FIGURE 4
www.frontiersin.org

Figure 4. Heatmap of the correlation coefficient between the variables. (Left) The variables of the Cameriere method, gender (g), the normalized measurements of the seven permanent developing teeth on the left mandible (Xi, i = 1, …,7), the sum of the normalized open apices (s, s = X1+ X2+…+ X7), the number of teeth with complete root development (N0) and the first-order relationship between s and N0 (s·N0). (Right) The variables of the Demirjian method, gender (g) and the assessment grade of the seven permanent teeth.

Discussion

In oral forensics, the need to infer the age of life becomes increasingly important as more immigrants (illegal or otherwise) come to a country without a valid legal identity document. Second, missing or uncertain birth data is also common. For these reasons, more and more studies have been conducted on the estimation of dental age.

In other studies on the Cameriere's European formula, children in Bosnia-Herzegovina overestimated the dental age of girls by 0.10 year and underestimated the dental age of boys by 0.02 year (25); the dental age in Turkey was underestimated by 0.35 year (0.24 year for girls and 0.47 year for boys) (26, 27); the mean dental age of girls in Mexico was overestimated by 0.01 year, while the age of boys was underestimated by 0.00 year (28). In addition, in Germany, the dental age was overestimated by 0.16 years for boys and 0.18 year for girls (29); Malaysian Chinese children were overestimated by 0.50 year (0.33 year for girls and 0.66 year for boys) (30). Among children in northern China, the dental age was overestimated by 0.23 year (0.43 year for boys and 0.03 year for girls) (31). This suggests that Cameriere's European formula is more accurate in inferring the age of European children (Figure 5), whereas it is less accurate in other populations, which may be related to ethnicity.

FIGURE 5
www.frontiersin.org

Figure 5. Application of Cameriere European formula in various regions.

Take a look at Demirjian in use around the world. A study by Jayaraman et al. (32) showed that the dental age was overestimated by 0.60 year in French–Canadian male and 0.65 year in females. According to Maber et al. (33) the Demirjian method was found to overestimate age in Bengalis and British Caucasians with an average accuracy of 0.25 years in males and 0.23 years in females. Previously, Wang et al. (34) investigated the applicability of the Demirjian method in adolescents in eastern China and found that the dental age inferred with these methods was underestimated. The findings were consistent even when the subjects were in different age groups (6–13 years in this study and 11–18 in Wang et al.'s study). Both Cameriere and Demirjian seem to be geographically limited in their application.

A study by Gaur et al. (35) showed that children born in families of higher socioeconomic status develop teeth earlier than children from low-income families. Hiernaux et al. (36) found that three-quarters of African-American have teeth that erupt earlier than Caucasians in Europe or the United States. According to Olze et al. (37), the average age of third molar eruption is 0.5–3 years and 0.5–2 years higher in Asians and Africans than in Caucasians. Thus, environmental and ethnic differences, as well as genetic factors, may contribute to these differences.

The inclusion of ML can substantially improve the accuracy of age inference. Among the three ML algorithms in this study, the DT model performed the worst. Perhaps this is because age estimation is a regression problem rather than a binary classification problem. Therefore, regression models (such as KNN in this study) are more suitable for age estimation. In the study of Galibourg et al. (16), KNN also has good performance (ME = 0.009, MAE = 0.738) in the ML algorithm based on Demirjian method. This is consistent with our findings.

Convolutional neural network (CNN) is part of deep learning, a new research direction in machine learning that is more complex and specialized. A study by Zaborowicz et al. (38) showed that the MAE for dental age inference using CNN in children aged 4–15 years ranged from 0.195 to 0.384 years. The MAE for the current study ranged from 0.21 to 0. 73 years. Compared to traditional manual methods, both deep learning and machine learning regression methods showed high accuracy with mean errors close to zero.

This is the first comparison of the accuracy of the Cameriere and Demirjian methods of dental age estimation using ML simultaneously. The ML demonstrated its excellent performance, with very low MAE, ME MSE and RMSE in dental age. We believe that this breakthrough will significantly increase the feasibility of clinical dental age estimation. ML algorithm helps to establish assessment criteria and improve the accuracy of dental age estimates in local populations.

The present study provides a reference for future clinical applications of dental age, such as the assessment of child development and the projection of the age of the missing population. The sample for this experiment was drawn from hospitals in eastern China, filling a gap in machine learning for dental age inference in that region. This further increases the potential for global clinical application of dental age. Of course, this study still has geographical limitations, the data from other regions were not collected, which needs to be further confirmed in subsequent experiments. Individual differences caused by developmental differences and nutritional dietary habits of different ethnic groups are inevitable in the extrapolation of dental age. Perhaps including samples from more regions in the ML could reduce such errors. In addition, further research is needed to develop models regarding fully automated dental age inference. We believe that as the sample size increases, the accuracy of machine learning will further improve.

Conclusion

In this study, ML was shown to greatly improve the accuracy of dental age inference. It was shown that the accuracy of dental age inference was higher than that of the traditional Demirjian method or Cameriere European formula, for both KNN, DT and BRR models. The highest accuracy among the machine learning dental age inference based on preprocessing of different methods was the KNN model based on the Cameriere method (ME = 0.015, MAE = 0.473, MSE = 0.340, RMSE = 0.583, R2 = 0.94).

In future studies, ML can be used for dental age estimation in a larger geographical area and over a larger age range.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving human participants were reviewed and approved by the Independent Ethics Committee of the Shanghai Ninth Hospital affiliated to Shanghai Jiao Tong University, School of Medicine (2017-282-T212). Written informed consent to participate in this study was provided by the participants' legal guardian/next of kin.

Author contributions

SS: methodology and writing original draft. XY: investigation and resources. JW and LF: methodology and formal analysis. JZ and JT: conceptualization, supervision, and project administration. All authors have read and approved the final manuscript.

Funding

This work was sponsored by the Interdisciplinary Program of Shanghai Jiao Tong University (YG2019ZDA07), Innovative Research Team of High-Level Local Universities in Shanghai (SHSMU-ZLCX20212401), and Shanghai Sailing Program (No. 21YF1424100).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2022.1068253/full#supplementary-material

References

1. Kundert AML, Nikolaidis PT, di Gangi S, Rosemann T, Knechtle B. Changes in jumping and throwing performances in age-group athletes competing in the European masters athletics championships between 1978 and 2017. Int J Environ Res Public Health. (2019) 16:1200. doi: 10.3390/ijerph16071200

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Zingg MA, Knechtle B, Rust CA, Rosemann T, Lepers R. Analysis of participation and performance in athletes by age group in ultramarathons of more than 200 km in length. Int J Gen Med. (2013) 6:209–20. doi: 10.2147/IJGM.S43454

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Reinoso M, Juffer F, Tieman W. Children's and parents' thoughts and feelings about adoption, birth culture identity and discrimination in families with internationally adopted children. Child Fam Soc Work. (2013) 18:264–74. doi: 10.1111/j.1365-2206.2012.00841.x

CrossRef Full Text | Google Scholar

4. Franklin D. Forensic age estimation in human skeletal remains: current concepts and future directions. Leg Med (Tokyo). (2010) 12:1–7. doi: 10.1016/j.legalmed.2009.09.001

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Bagherian A, Sadeghi M. Assessment of dental maturity of children aged 3.5 to 13.5 years using the Demirjian method in an Iranian population. J Oral Sci. (2011) 53:37–42. doi: 10.2334/josnusd.53.37

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Conceicao EL, Cardoso HF. Environmental effects on skeletal versus dental development II: further testing of a basic assumption in human osteological research. Am J Phys Anthropol. (2011) 144:463–70. doi: 10.1002/ajpa.21433

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Cardoso HF. Environmental effects on skeletal versus dental development: Using a documented subadult skeletal sample to test a basic assumption in human osteological research. Am J Phys Anthropol. (2007) 132:223–33. doi: 10.1002/ajpa.20482

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Demirjian A, Goldstein H, Tanner JM. A new system of dental age assessment. Hum Biol. (1973) 45:211–27.

PubMed Abstract | Google Scholar

9. Cameriere R, de Angelis D, Ferrante L, Scarpino F, Cingolani M. Age estimation in children by measurement of open apices in teeth: a European formula. Int J Legal Med. (2007) 121:449–53. doi: 10.1007/s00414-007-0179-1

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Ye X, Jiang F, Sheng X, Huang H, Shen X. Dental age assessment in 7-14-year-old Chinese children: comparison of Demirjian and Willems methods. Forensic Sci Int. (2014) 244:36–41. doi: 10.1016/j.forsciint.2014.07.027

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Shen S, Liu Z, Wang J, Fan L, Ji F, Tao J. Machine learning assisted Cameriere method for dental age estimation. BMC Oral Health. (2021) 21, 1–10. doi: 10.1186/s12903-021-01996-0

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Beheshti I, Ganaie MA, Paliwal V, Rastogi A, Razzak I, Tanveer M. Predicting brain age using machine learning algorithms: a comprehensive evaluation. IEEE J Biomed Health Inform. (2022) 26:1432–40. doi: 10.1109/JBHI.2021.3083187

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Tao J, Wang J, Wang A, Xie Z, Wang Z, Wu S, et al. Dental age estimation: a machine learning perspective. In:Hassanien AE, Azar AT, Gaber T, Bhatnagar RF, Tolba M, , editors. The International Conference on Advanced Machine Learning Technologies and Applications (AMLTA2019). Cham: Springer International Publishing (2020). p. 722–33. doi: 10.1007/978-3-030-14118-9_71

CrossRef Full Text | Google Scholar

14. Vila-Blanco N, Carreira MJ, Varas-Quintana P, Balsa-Castro C, Tomás I. Deep neural networks for chronological age estimation from OPG images. IEEE Trans Med Imaging. (2020) 39:2374–84. doi: 10.1109/TMI.2020.2968765

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Bernard A, Xia SZ, Saleh S, Ndukwe T, Meyer J, Soloway E, et al. EyeScreen. Ophthalmol Sci. (2022) 2:100158. doi: 10.1016/j.xops.2022.100158

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Galibourg A, Cussat-Blanc S, Dumoncel J, Telmon N, Monsarrat P, Maret D. Comparison of different machine learning approaches to predict dental age using Demirjian's staging approach. Int J Legal Med. (2021) 135:665–75. doi: 10.1007/s00414-020-02489-5

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Saric R, Kevric J, Hadziabdic N, Osmanovic A, Kadic M, Saracevic M, et al. Dental age assessment based on CBCT images using machine learning algorithms. Forensic Sci Int. (2022) 1:334. doi: 10.1016/j.forsciint.2022.111245

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Dallora AL, Anderberg P, Kvist O, Mendes E, Ruiz SD, Berglund JS. Bone age assessment with various machine learning techniques: a systematic literature review and meta-analysis. PLoS ONE. (2019) 14:1–22. doi: 10.1371/journal.pone.0220242

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Maslove DM, Podchiyska T, Lowe HJ. Discretization of continuous features in clinical datasets. J Am Med Inform Assoc. (2013) 20:544–53. doi: 10.1136/amiajnl-2012-000929

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Sturges HA. The choice of a class interval. J Am Stat Assoc. (1926) 21:65–6. doi: 10.1080/01621459.1926.10502161

CrossRef Full Text | Google Scholar

21. Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and regression trees (CART). Biometrics. (1984) 40:358. doi: 10.2307/2530946

CrossRef Full Text | Google Scholar

22. Hoerl AE, Kennard RW. Ridge regression: Biased estimation for non-orthogonal problems. Technometrics. (2000) 42:80–6. doi: 10.1080/00401706.2000.10485983

CrossRef Full Text | Google Scholar

23. Zhang M-L, Zhou Z-H. A k-nearest neighbor based algorithm for multi-label classification. In: 2005 IEEE international conference on granular computing. IEEE (2005). p. 718–21.

PubMed Abstract | Google Scholar

24. Peterson LE. K-nearest neighbor. Scholarpedia. (2009) 4:1883. doi: 10.4249/scholarpedia.1883

CrossRef Full Text | Google Scholar

25. Galić I, Vodanović M, Cameriere R, Nakaš E, Galić E, Selimović E, et al. Accuracy of Cameriere, Haavikko, and Willems radiographic methods on age estimation on Bosnian-Herzegovian children age groups 6-13. Int J Legal Med. (2011) 125:315–21. doi: 10.1007/s00414-010-0515-8

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Gulsahi A, Tirali RE, Cehreli SB, de Luca S, Ferrante L, Cameriere R. The reliability of Cameriere's method in Turkish children: a preliminary report. Forensic Sci Int. (2015) 249:319 e1–5. doi: 10.1016/j.forsciint.2015.01.031

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Apaydin BK, Yasar F. Accuracy of the demirjian, willems and cameriere methods of estimating dental age on Turkish children. Niger J Clin Pract. (2018) 21:257–63. doi: 10.4103/1119-3077.226966

PubMed Abstract | CrossRef Full Text | Google Scholar

28. de Luca S, de Giorgio S, Butti AC, Biagi R, Cingolani M, Cameriere R. Age estimation in children by measurement of open apices in tooth roots: study of a Mexican sample. Forensic Sci Int. (2012) 221:155.e1–155.e7. doi: 10.1016/j.forsciint.2012.04.026

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Wolf TG, Briseño-Marroquín B, Callaway A, Patyna M, Müller VT, Willershausen I, et al. Dental age assessment in 6- to 14-year old German children: comparison of Cameriere and Demirjian methods. BMC Oral Health. (2016) 16:1–8. doi: 10.1186/s12903-016-0315-8

PubMed Abstract | CrossRef Full Text | Google Scholar

30. George RP, Donald PM, Nagraj SK, Kyawe HH. Age Estimation in children by measurement of open apices in tooth roots in Chinese population in Malaysia: a radiologic study. Oral Surg Oral Med Oral Pathol Oral Radiol. (2015) 119:e168–e168. doi: 10.1016/j.oooo.2014.07.293

CrossRef Full Text | Google Scholar

31. Guo YC, Yan CX, Lin XW, Zhou H, Li JP, Pan F, et al. Age estimation in northern Chinese children by measurement of open apices in tooth roots. Int J Legal Med. (2015) 129:179–86. doi: 10.1007/s00414-014-1035-8

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Jayaraman J, King NM, Roberts GJ, Wong HM. Dental age assessment: are Demirjian's standards appropriate for southern Chinese children? J Forensic Odonto-Stomatol. (2011) 29:22. Available online at: https://search.crossref.org/?q=+&from_ui=yes

PubMed Abstract | Google Scholar

33. Maber M, Liversidge HM, Hector MP. Accuracy of age estimation of radiographic methods using developing teeth. Forensic Sci Int. (2006) 46:893–5. doi: 10.1016/j.forsciint.2006.02.019

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Wang J, Bai X, Wang M, Zhou Z, Bian X, Qiu C, et al. Applicability and accuracy of Demirjian and Willems methods in a population of Eastern Chinese subadults. Forensic Sci Int. (2018) 292:90–6. doi: 10.1016/j.forsciint.2018.09.006

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Gaur R, Boparai G, Saini K. Effect of under-nutrition on permanent tooth emergence among Rajputs of Himachal Pradesh, India. Ann Hum Biol. (2011) 38:84–92. doi: 10.3109/03014460.2010.497499

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Hiernaux J. Ethnic differences in growth and development. Eugen Q. (1968) 15:12–21. doi: 10.1080/19485565.1968.9987748

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Olze A, van Niekerk P, Ishikawa T, Zhu BL, Schulz R, Maeda H, et al. Comparative study on the effect of ethnicity on wisdom tooth eruption. Int J Legal Med. (2007) 121:445–8. doi: 10.1007/s00414-007-0171-9

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Zaborowicz M, Zaborowicz K, Biedziak B, Garbowski T. Deep learning neural modelling as a precise method in the assessment of the chronological age of children and adolescents using tooth and bone parameters. Sensors. (2022) 22:637. doi: 10.3390/s22020637

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: dental age, development, machine learning, youth, panoramic

Citation: Shen S, Yuan X, Wang J, Fan L, Zhao J and Tao J (2022) Evaluation of a machine learning algorithms for predicting the dental age of adolescent based on different preprocessing methods. Front. Public Health 10:1068253. doi: 10.3389/fpubh.2022.1068253

Received: 12 October 2022; Accepted: 14 November 2022;
Published: 01 December 2022.

Edited by:

Wenbing Zhao, Cleveland State University, United States

Reviewed by:

Yu-wen Chen, Chongqing Institute of Green and Intelligent Technology (CAS), China
Felice Festa, University of Studies G. d'Annunzio Chieti and Pescara, Italy

Copyright © 2022 Shen, Yuan, Wang, Fan, Zhao and Tao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jiang Tao, doctor_taojiang@126.com; Junjun Zhao, 13611967296@163.com

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.