AUTHOR=Jiang Yang , Jiang Hanyu , Zhang Jing , Chen Tao , Li Ying , Zhou Yuehua , Chen Youxin , Li Fusheng TITLE=The development of a machine learning model to train junior ophthalmologists in diagnosing the pre-clinical keratoconus JOURNAL=Frontiers in Medicine VOLUME=11 YEAR=2024 URL=https://www.frontiersin.org/journals/medicine/articles/10.3389/fmed.2024.1458356 DOI=10.3389/fmed.2024.1458356 ISSN=2296-858X ABSTRACT=Purpose

This study aims to evaluate the diagnostic performance of a machine learning model (ML model) to train junior ophthalmologists in detecting preclinical keratoconus (PKC).

Methods

A total of 1,334 corneal topography images (The Pentacam HR system) from 413 keratoconus eyes, 32 PKC eyes and 222 normal eyes were collected. Five junior ophthalmologists were trained and annotated the images with or without the suggestions proposed by the ML model. The diagnostic performance of PKC was evaluated among three groups: junior ophthalmologist group (control group), ML model group and ML model-training junior ophthalmologist group (test group).

Results

The accuracy of the ML model between the eyes of patients with KC and NEs in all three clinics (99% accuracy, area under the receiver operating characteristic (ROC) curve AUC of 1.00, 99% sensitivity, 99% specificity) was higher than that for Belin-Ambrósio enhanced ectasia display total deviation (BAD-D) (86% accuracy, AUC of 0.97, 97% sensitivity, 69% specificity). The accuracy of the ML model between eyes with PKC and NEs in all three clinics (98% accuracy, AUC of 0.96, 98% sensitivity, 98% specificity) was higher than that of BAD-D (69% accuracy, AUC of 0.73, 67% sensitivity, 69% specificity). The diagnostic accuracy of PKC was 47.5% (95%CI, 0.5–71.6%), 100% (95%CI, 100–100%) and 94.4% (95%CI, 14.7–94.7%) in the control group, ML model group and test group. With the assistance of the proposed ML model, the diagnostic accuracy of junior ophthalmologists improved with statistical significance (p < 0.05). According to the questionnaire of all the junior ophthalmologists, the average score was 4 (total 5) regarding to the comprehensiveness that the AI model has been in their keratoconus diagnosis learning; the average score was 4.4 (total 5) regarding to the convenience that the AI model has been in their keratoconus diagnosis learning.

Conclusion

The proposed ML model provided a novel approach for the detection of PKC with high diagnostic accuracy and assisted to improve the performance of junior ophthalmologists, resulting especially in reducing the risk of missed diagnoses.