A Study to Evaluate Accuracy and Validity of the EFAI Computer-Aided Bone Age Diagnosis System Compared With Qualified Physicians

Cheng, Chi-Fung; Liao, Ken Ying-Kai; Lee, Kuan-Jung; Tsai, Fuu-Jen

doi:10.3389/fped.2022.829372

ORIGINAL RESEARCH article

Front. Pediatr. , 08 April 2022

Sec. Pediatric Endocrinology

Volume 10 - 2022 | https://doi.org/10.3389/fped.2022.829372

This article is part of the Research Topic Controversies in Growth Hormone Treatment and Diagnosis View all 17 articles

A Study to Evaluate Accuracy and Validity of the EFAI Computer-Aided Bone Age Diagnosis System Compared With Qualified Physicians

$\r\nChi-Fung Cheng$ Chi-Fung Cheng¹

Ken Ying-Kai Liao²

Kuan-Jung Lee² $Fuu-Jen Tsai*\r\n$ Fuu-Jen Tsai^3*

¹Big Data Center, China Medical University Hospital, Taichung City, Taiwan
²Ever Fortune.AI Co., Ltd., Taichung City, Taiwan
³Department of Medical Genetics, China Medical University Hospital, Taichung City, Taiwan

Study Objectives: In previous research, we built a deep neural network model based on Inception-Resnet-v2 to predict bone age (EFAI-BAA). The primary objective of the study was to determine if the EFAI-BAA was substantially concordant with the qualified physicians in assessing bone ages. The secondary objective of the study was to determine if the EFAI-BAA was no different in the clinical rating (advanced, normal, or delayed) with the qualified physicians.

Method: This was a retrospective study. The left-hand X-ray images of male subjects aged 3–16 years old and female subjects aged 2–15 years old were collected from China Medical University Hospital (CMUH) and Asia University Hospital (AUH) retrospectively since the trial began until the included image amount reached 368. This was a blinded study. The qualified physicians who ran, read, and interpreted the tests were blinded to the values assessed by the other qualified physicians and the EFAI-BAA.

Results: The concordance correlation coefficient (CCC) between the EFAI-BAA (EFAI-BAA), the evaluation of bone age by physician in Kaohsiung Veterans General Hospital (KVGH), Taichung Veterans General Hospital (TVGH2), and in Taipei Tzu Chi Hospital (TZUCHI-TP) was 0.9828 (95% CI: 0.9790–0.9859, p-value = 0.6782), 0.9739 (95% CI: 0.9681–0.9786, p-value = 0.0202), and 0.9592 (95% CI: 0.9501–0.9666, p-value = 0.4855), respectively.

Conclusion: There was a consistency of bone age assessment between the EFAI-BAA and each one of the three qualified physicians (CCC = 0.9). As the significant difference in the clinical rating was only found between the EFAI-BAA and the qualified physician in TVGH2, the performance of the EFAI-BAA was considered similar to the qualified physicians.

Background

In pediatrics, the interpretation of bone age can accurately assess the maturity of an individual, and can also be used as a reference for the diagnosis of endocrine disorders in children (1). The well-known manual methods for bone age assessment are Greulich and Pyle (GP method) (2) and Tanner-Whitehouse (TW method) (3). The assessments are based on visual inspection or scoring and are characterized by intra- or extra-observer variability (4, 5). External variability is the difference in judgment standards or differences in the level of interpretation experience among physicians; internal variability is the possible difference in interpretation of the same image by the same physician at different times (6). In addition, the average interpretation time of the GP method in the past study was 1.4 min and TW method was 7.9 min. Both of these methods invisibly increase the time cost of physician visits (7).

In view of the rapid development of artificial intelligence in recent years, image recognition systems developed based on deep learning technology are becoming more and more mature in clinical applications. In the previous research, we introduced the Inception-Resnet-v2 neural network that was pre-trained on ImageNet database, from which to extract features as the basic model (8). At each bone age assessment, the radiologist compares the client’s X-ray image to the GP reference image to assess their bone age and uses this as the ground truth for the model. Using training data from children and adolescents aged 2–18 in Taiwan, the network can predict well when given only the left hand bone X-ray and gender information. The purpose of this AI model is to reduce interpretation errors and actually reduce the complexity, time and cost of the bone age assessment process. The purpose of this research is to use the previously established deep learning model to examine the consistency and effectiveness of this model when it is actually put into clinical application scenarios.

Materials and Methods

This was a blinded retrospective study. Since all recognizable information had been removed before data collection, no informed consent was required for this study. The qualified physicians who ran, read, and interpreted the tests were blinded to the values assessed by the other qualified physicians and the EFAI-BAA. This study was designed to evaluate the concordance of the EFAI-BAA in assessing bone ages, in comparison to each one of the three qualified physicians.

After the whole included images had been determined, the physicians received the data disk with all included images in and the guidance on how to use the electronic data capture (EDC) system. A physician had to fill in the bone age he/she assessed on the EDC after receiving the data disk. After the bone age corresponding to an image was filled in on the EDC, it might be changed with a rational explanation, and the process was recorded in the EDC. Only after all the physicians finished assessing all the allotted images, can the X-ray images be imported to the EFAI-BAA to get the bone ages inferred by the EFAI-BAA.

Study Design and Participants

The study subjects were selected from China Medical University Hospital (CMUH) and Asia University Hospital (AUH). Subjects were enrolled by using the following criteria. Inclusion criteria: (1) Male subjects aged 3 to 16 years old and female subjects aged 2–15 years old at the time of left-hand X-ray PA view image taking. (2) The image quality should be good enough for the physicians to evaluate the bone age. Exclusion criteria: (1) Subjects with skeletal dysplasia. (2) Subjects with congenital anomaly over the hand and wrist. (3) Any severe fracture over the hand and wrist that hindered the determination of the age. (4) Subjects with known malignancy of the left hand. The left-hand X-ray PA view images of male subjects aged 3–16 years old and female subjects aged 2–15 years old at the time when X-ray was taken were retrospectively provided by Medical record department. A total of 368 left-hand X-ray PA view images that met the inclusion/exclusion criteria from these studies were sequentially selected for the proposed study. The flowchart of the subject-selection process is presented in Figure 1.

FIGURE 1

Figure 1. Flowchart of radiographs enrolled in the study.

Three independent certified qualified (with physician license) physicians from three centers in Taiwan, who were not part of the EFAI-BAA development, validation, or clinical study read the left-hand X-ray PA view images. Each of the three qualified physicians was provided with the same set of anonymized left-hand PA X-ray images. They assessed these left-hand X-ray PA view images manually and provide the bone age assessments in the EDC. The same set of left-hand X-ray PA view images were imported to the EFAI-BAA by an independent trained technician for bone age assessment. After the assessments were complete, the results were exported for the statistical analyses.

Imaging Filtering

In this study, the images were collected retrospectively from CMUH and AUH. A total of 368 DICOM files of left-hand PA view X-ray radiographs were collected (the number of images from either site should not be less than 30%). The information of the subject, which included gender, birth date, and examination date was acquired. At the time when the left-hand X-ray images were taken, the male subjects should be aged 3–16 years old and the female subjects should be aged 2–15 years old.

The X-ray images from Sep 1st, 2017 to Aug 31st, 2020 from CMUH and AUH were queried. The researcher should be used to conduct simple random sampling and provide the order of these random numbers using R (version 3.6.2). The researcher checked the basic information of the subjects including chronological age and gender based on the order and should assign the data to the corresponding age groups.

The expected number in each age group was shown in Table 1.

TABLE 1

Table 1. Summary of baseline characteristics.

Screening

All the included images were burned into a data disk by the research assistant and provided to physicians, to examine the quality of every image. The criteria were (1) Complete left hand and wrist (the distal end of radius and ulna included). (2) The X-ray image of the left-hand PA view. (3) No shadow on the image (such as wearing a ring or a holding fist). (4) The edge of each bone including carpals and metacarpals should be seen and the size of the epiphyseal plate and the degree the epiphyseal plate merged with the bone should be distinguishable.

After the image quality was confirmed, subjects were eligible for enrollment in the study only if they met all the inclusion/exclusion criteria. Subsequently, the research assistant should log in to the EDC system with his/her account and should establish the eCRF for each subject being included after the filtering process. The following information should be entered into the corresponding column: gender, birth date, and the date X-ray taken.

Re-screening

After the screening process described above, the data amount might be insufficient since the disqualification was sifted. On that occasion, the process was repeated from checking each set of data in the order decided through simple random sampling, assigning the data to the age groups, to the image quality and data qualification screening. The process was repeated until the included amount reached the expected amount.

Bone Age Assessment

On each included X-ray image, a verification code (ckCode) was marked. Subsequently, the X-ray images along with gender were burned into the data disk, followed by providing two duplicate disks to physicians who participated in this trial. The physicians evaluated the bone age of each image according to the GP method. The physicians logged in to the EDC system with their accounts and passwords. The physicians keyed in the ckCode and corresponding bone age of the image on the eCRF. Only after confirming all the participating physicians had finished evaluating, the included images were imported into the EFAI-BAA by the research assistant to get the bone ages inferred by the medical device for the test.

Statistical Analysis

The agreement between the EFAI-BAA and each one of the three qualified physicians was assessed using the concordance correlation coefficient (CCC) statistical analysis method (9). The performance of the EFAI-BAA was validated when the concordance criterion between the EFAI-BAA and each one of the three qualified physicians was met. The clinical rating assessed by the EFAI-BAA and the qualified physicians was considered, and the Chi-square test was used to determine the difference in the clinical rating between the EFAI-BAA and each one of the three qualified physicians. The accuracy of the EFAI-BAA compared to each one of the three qualified physicians was calculated as well. The performance of the EFAI-BAA was evaluated by the Root Mean Square (RMS) and Mean Absolute Deviation (MAD) of bone age assessment between the EFAI-BAA and each one of the three qualified physicians. The paired t-test was used to compare the mean difference in bone age assessment between the EFAI-BAA and each one of the three qualified physicians. The Bland-Altman plot was created for displaying the difference in bone age assessment between the EFAI-BAA and each one of the three qualified physicians (Supplementary Figures 1–3). For general consideration, descriptive statistics for categorical variables included the number of subjects and percentage; descriptive statistics for continuous variables included the number of observations, mean, SD, median, minimum, and maximum values.

Results

In this study, the images were collected retrospectively from CMUH and AUH. A total of 368 DICOM files of left-hand PA view X-ray radiographs were collected (the number of images from either site should not be less than 30%). The information of the subject, which included gender, birth date, and date of examination, was acquired. The results of the physicians’ assessments were compared against the bone age assessments by the EFAI-BAA.

The primary endpoint for the study was the bone ages assessed by the EFAI-BAA and the qualified physicians. The analysis result of the primary endpoint was presented in Table 2. The CCC between EFAI-BAA and KVGH (#1) was 0.98 (0.98, 0.99); the CCC between EFAI-BAA and TVGH2 (#2) was 0.97 (0.97, 0.98); the CCC between EFAI-BAA and TZUCHI-TP (#3) was 0.96 (0.95, 0.97).

TABLE 2

Table 2. Differences in the CCC scores (primary endpoint) between three physicians and EFAI-BAA.

The secondary endpoint was the clinical rating assessed by the EFAI-BAA and the qualified physicians. By calculating the 95% interval of the normal bone age distribution by the mean bone age ± 2SD, the bone age assessed would fall within the normal range (normal), out of the upper side of the normal range (advanced), or out of the lower side of the normal range (delayed). The analysis result of the secondary endpoint was presented in Table 3. The number and percentage of “Advanced,” “Normal,” and “Delayed” for EFAI-BAA was 38 (10.33%), 249 (67.66%), and 81 (22.01%), respectively (p = 0.6782); for KVGH (#1) was 35 (9.51%), 260 (70.65%), and 73 (19.84%), respectively; for TVGH2 (#2) was 49 (13.32%), 266 (72.28%), and 53 (14.40%), respectively (p = 0.0202); and, for TZUCHI-TP (#3) was 41 (11.14%), 259 (70.38%), and 68 (18.48%), respectively (p = 0.4855).

TABLE 3

Table 3. Differences in the clinical rating (secondary endpoint) between three physicians and EFAI-BAA.

The accuracy of the EFAI-BAA was presented in Table 4. The accuracy of EFAI-BAA compared to KVGH (#1) in the pre-puberty, early and mid-puberty, and late puberty group, and the overall age groups was 76.02, 81.02, 93.33, and 80.71%, respectively; the accuracy of EFAI-BAA compared to TVGH2 (#2) in the pre-puberty, early and mid-puberty, and late puberty group, and the overall age groups was 70.76, 86.13, 95.00, and 80.43%, respectively; the accuracy of EFAI-BAA compared to TZUCHI-TP (#3) in the pre-puberty, early and mid-puberty, and late puberty group, and the overall age groups were 66.67, 77.37, 96.67, and 75.54%, respectively.

TABLE 4

Table 4. Accuracy of the EFAI-BAA compared with different sites physicians.

The RMS and MAD and paired t-test of bone age assessment in each age group were presented in Table 5. The RMS (MAD) between EFAI-BAA and KVGH (#1) in the pre-puberty, early and mid-puberty, and late puberty group, and the overall age groups was 0.81 (0.62), 0.75 (0.60), 1.02 (0.92), and 0.82 (0.66), respectively (p = 0.0889); the RMS (MAD) between EFAI-BAA and TVGH2 (#2) in the pre-puberty, early and mid-puberty, and late puberty group, and the overall age groups was 1.22 (0.90), 0.73 (0.56), 0.89 (0.76), and 1.01 (0.75), respectively (p < 0.0001); the RMS (MAD) between EFAI-BAA and TZUCHI-TP (#3) in the pre-puberty, early and mid-puberty, and late puberty group, and the overall age groups was 1.19 (0.94), 1.46 (0.88), 0.87 (0.74), and 1.25 (0.89), respectively (p = 0.2206).

TABLE 5

Table 5. Root mean square and mean absolute deviation of bone age assessment in each puberty group.

Discussion

This retrospective study evaluated the accuracy and efficiency of AI system developed for automatic bone age assessment of children in Taiwan. The results show that compared with EFAI-BAA in manually assessed bone age based on the Greulich-Pyle method by three physicians from different hospitals, regardless of gender, this AI model can obtain a highly consistent and accurate bone age assessment by automatically analyzing X-rays of the left wrist.

The bone age assessment of KVGH (#1) was highly consistent with EFAI-BAA in the CCC and the distribution of clinical rating (Tables 2, 3). The bone age assessment of TVGH2 (#2) was averagely higher than that of EFAI-BAA, thus the mean of bone age assessment of TVGH2 (#2) was significantly different from that of EFAI-BAA (Table 5), and the distribution of clinical rating of TVGH2 (#2) was slightly shifted to the grade of “Advanced” (Table 3). Although the divergence of bone age assessment of TZUCHI-TP (#3) was high, TZUCHI-TP (#3) was still similar to EFAI-BAA in the mean of bone age assessment and the distribution of clinical rating (Tables 3, 5), respectively.

Because each lower bound of the two-sided 95% CI of the CCC between the EFAI-BAA and each one of the three qualified physicians was greater than 0.90, the three null hypotheses were all rejected, which meant there was a consistency of bone age assessment between the EFAI-BAA and each one of the three qualified physicians. As the significant difference in the clinical rating was only found between the EFAI-BAA and the qualified physician in TVGH2 (#2), the performance of the EFAI-BAA was considered similar to the qualified physicians.

In recent years, many studies have begun to try to use deep learning methods to assess bone age on left-hand x-ray images (10–16), and a well-trained AI bone age assessment system is as accurate as clinical experts. There was significant intra-individual variability of 0.94 vs. 0.74 years for the GP and TW methods, respectively (7). This variability can be reduced to 0.31 years through EFAI-BAA (8). Clinical diagnostic tools developed by deep learning models are often criticized because they cannot be explained intuitively (black box) (17–19). However, attribute to its excellent interpretation efficiency compared with traditional GP and TW methods, it has been proven to save more interpretation time for physicians (20).

The Greulich-Pyle method is used to assess the maturity of bone age and has been widely used. However, it should be noted that this method is established on Caucasian ethnicity and is highly dependent on the experience of radiologists. It’s prone to cause bias when GP method was applied to different generations, races or specific age groups for bone age assessment (21–26). Similarly, due to this study was a retrospective design, all x-ray images were from the China Medical University Hospital and Asia University Hospital. Therefore, the accuracy of EFAI-BAA has yet to be evaluated in different races or children who were less than 2 years old or over 16 years old. Finally, although there is no statistically significant difference in the assessment between EFAI-BAA and the three clinicians, it does not substitute the doctor’s clinical decision-making, and can only provide the doctor with clinical assistance. EFAI-BAA only predicts the bone age based on the information provided by the images and lacks other clinical information and other physiological factors of the patient.

Conclusion

In our study, it was shown that there was no statistically significant difference between bone age assessment of EFAI-BAA and three physicians from different sites in Taiwan. In addition, our results show that the AI-based bone age assessment system greatly reduces the time of interpreting bone age by physician compared with the Greulich-Pyle method. It can improve the efficiency of routine clinical examinations without affecting the accuracy of the assessment.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Ethics Statement

The studies involving human participants were reviewed and approved by the China Medical University Hospital Institutional Review Board. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin.

Author Contributions

F-JT had the idea and designed the study and responsible for acquisition of data. C-FC, KY-KL, and K-JL analyzed and interpreted the data and provided administrative, technical, logistical or material support. C-FC drafted the article and submitted the manuscript for publication. F-JT and C-FC critically revised the manuscript for important intellectual contents. All authors had the final approval of the manuscript.

Conflict of Interest

KY-KL and K-JL were employed by Ever Fortune.AI Co., Ltd., Taichung, Taiwan.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fped.2022.829372/full#supplementary-material

Supplementary Figure 1 | The Bland-Altman plot for EFAI-BAA vs. KVGH (#1).

Supplementary Figure 2 | The Bland-Altman plot for EFAI-BAA vs. TVGH2 (#2).

Supplementary Figure 3 | The Bland-Altman plot for EFAI-BAA vs. TZUCHI-TP (#3).

References

1. Gilsanz V, Ratib O. Hand Bone Age: A Digital Atlas of Skeletal Maturity. Heidelberg: Springer Science & Business Media (2005).

Google Scholar

2. Greulich WW, Pyle SI. Radiographic Atlas of Skeletal Development of the Hand and Wrist. Redwood City, CA: Stanford University Press (1959).

Google Scholar

3. Tanner JM. Assessment of Skeletal Maturity and Prediction of Adult Height: TW 2 Method. San Diego, CA: Academic Press (1983).

Google Scholar

4. Cox LA. Tanner-Whitehouse method of assessing skeletal maturity: problems and common errors. Horm Res. (1996) 45:53–5. doi: 10.1159/000184848

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Bull R, Edwards P, Kemp P, Fry S, Hughes I. Bone age assessment: a large scale comparison of the Greulich and Pyle, and Tanner and Whitehouse (TW2) methods. Arch Dis Childh. (1999) 81:172–3. doi: 10.1136/adc.81.2.172

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Roche A, Rohmann CG, French NY, Daìvila GH. Effect of training on replicability of assessments of skeletal maturity (Greulich-Pyle). Am J Roentgenol. (1970) 108:511–5. doi: 10.2214/ajr.108.3.511

PubMed Abstract | CrossRef Full Text | Google Scholar

7. King D, Steventon D, O’sullivan M, Cook A, Hornsby V, Jefferson I, et al. Reproducibility of bone ages when performed by radiology registrars: an audit of Tanner and Whitehouse II versus Greulich and Pyle methods. Br J Radiol. (1994) 67:848–51. doi: 10.1259/0007-1285-67-801-848

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Cheng CF, Huang ET-C, Kuo J-T, Liao KY-K, Tsai FJ. Report of clinical bone age assessment using deep learning for an Asian population in Taiwan. Biomedicine. (2021) 11:50–8. doi: 10.37796/2211-8039.1256

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Lin LI. A concordance correlation coefficient to evaluate reproducibility. Biometrics. (1989) 45:255–68. doi: 10.2307/2532051

CrossRef Full Text | Google Scholar

10. Lee K-C, Lee K-H, Kang CH, Ahn K-S, Chung LY, Lee J-J, et al. Clinical validation of a deep learning-based hybrid (Greulich-Pyle and Modified Tanner-Whitehouse) method for bone age assessment. Korean J Radiol. (2021) 22:2017–25. doi: 10.3348/kjr.2020.1468

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Wang F, Gu X, Chen S, Liu Y, Shen Q, Pan H, et al. Artificial intelligence system can achieve comparable results to experts for bone age assessment of Chinese children with abnormal growth and development. PeerJ. (2020) 8:e8854. doi: 10.7717/peerj.8854

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Booz C, Yel I, Wichmann JL, Boettger S, Al Kamali A, Albrecht MH, et al. Artificial intelligence in bone age assessment: accuracy and efficiency of a novel fully automated algorithm compared to the Greulich-Pyle method. Eur Radiol Exp. (2020) 4:6. doi: 10.1186/s41747-019-0139-9

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Lee H, Tajmir S, Lee J, Zissen M, Yeshiwas BA, Alkasab TK, et al. Fully automated deep learning system for bone age assessment. J Digit Imaging. (2017) 30:427–41. doi: 10.1007/s10278-017-9955-8

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Han Y, Wang G. Skeletal bone age prediction based on a deep residual network with spatial transformer. Comput Methods Programs Biomed. (2020) 197:105754. doi: 10.1016/j.cmpb.2020.105754

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Tong C, Liang B, Li J, Zheng Z. A deep automated skeletal bone age assessment model with heterogeneous features learning. J Med Syst. (2018) 42:249. doi: 10.1007/s10916-018-1091-6

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Gao Y, Zhu T, Xu X. Bone age assessment based on deep convolution neural network incorporated with segmentation. Int J Comput Assist Radiol Surg. (2020) 15:1951–62. doi: 10.1007/s11548-020-02266-0

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Park SH. Artificial intelligence in medicine: beginner’s guide. J Korean Soc Radiol. (2018) 78:301–8. doi: 10.3122/jabfm.2022.01.210226

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Castelvecchi D. Can we open the black box of AI? Nat News. (2016) 538:20. doi: 10.1038/538020a

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Poon AI, Sung JJ. Opening the black box of AI-Medicine. J Gastroenterol Hepatol. (2021) 36:581–4. doi: 10.1111/jgh.15384

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Dallora AL, Anderberg P, Kvist O, Mendes E, Diaz Ruiz S, Sanmartin Berglund J. Bone age assessment with various machine learning techniques: a systematic literature review and meta-analysis. PLoS One. (2019) 14:e0220242. doi: 10.1371/journal.pone.0220242

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Maggio A, Flavel A, Hart R, Franklin D. Assessment of the accuracy of the Greulich and Pyle hand-wrist atlas for age estimation in a contemporary Australian population. Aust J Forensic Sci. (2018) 50:385–95. doi: 10.1080/00450618.2016.1251970

CrossRef Full Text | Google Scholar

22. Moradi M, Sirous M, Morovatti P. The reliability of skeletal age determination in an Iranian sample using Greulich and Pyle method. Forensic Sci Int. (2012) 223:372.e1–4. doi: 10.1016/j.forsciint.2012.08.030

PubMed Abstract | CrossRef Full Text | Google Scholar

23. udia Santos C, Ferreira M, Alves FC, nia Cunha E. Comparative study of greulich and pyle atlas and maturos 4.0 program for age estimation in a Portuguese sample. Forensic Sci Int. (2011) 212:276.e1–7. doi: 10.1016/j.forsciint.2011.05.032

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Patil ST, Parchand M, Meshram M, Kamdi N. Applicability of greulich and pyle skeletal age standards to Indian children. Forensic Sci Int. (2012) 216:200.e1–4. doi: 10.1016/j.forsciint.2011.09.022

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Büken B, Şafak A, Yazici B, Büken E, Mayda AS. Is the assessment of bone age by the Greulich-Pyle method reliable at forensic age estimation for Turkish children. Forensic Sci Int. (2007) 173:146–53. doi: 10.1016/j.forsciint.2007.02.023

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Cavallo F, Mohn A, Chiarelli F, Giannini C. Evaluation of bone age in children: a mini-review. Front Pediatr. (2021) 9:580314. doi: 10.3389/fped.2021.580314

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: bone age assessment, artificial intelligence, deep learning, concordance correlation coefficient (CCC), clinical practice

Citation: Cheng C-F, Liao KY-K, Lee K-J and Tsai F-J (2022) A Study to Evaluate Accuracy and Validity of the EFAI Computer-Aided Bone Age Diagnosis System Compared With Qualified Physicians. Front. Pediatr. 10:829372. doi: 10.3389/fped.2022.829372

Received: 05 December 2021; Accepted: 25 February 2022;
Published: 08 April 2022.

Edited by:

M. Savage, Queen Mary University of London, United Kingdom

Reviewed by:

Giorgio Radetti, Ospedale di Bolzano, Italy
Fergus Cameron, Royal Children’s Hospital, Australia

Copyright © 2022 Cheng, Liao, Lee and Tsai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Fuu-Jen Tsai, ZDA3MDRAbWFpbC5jbXVoLm9yZy50dw==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

A Study to Evaluate Accuracy and Validity of the EFAI Computer-Aided Bone Age Diagnosis System Compared With Qualified Physicians

Background

Materials and Methods

Study Design and Participants

Imaging Filtering

Screening

Re-screening

Bone Age Assessment

Statistical Analysis

Results

Discussion

Conclusion

Data Availability Statement

Ethics Statement

Author Contributions

Conflict of Interest

Publisher’s Note

Supplementary Material

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good