Longtime Vision Function Prediction in Childhood Cataract Patients Based on Optical Coherence Tomography Images

Xiang, Yifan; Chen, Jingjing; Xu, Fabao; Lin, Zhuoling; Xiao, Jun; Lin, Zhenzhe; Lin, Haotian

doi:10.3389/fbioe.2021.646479

ORIGINAL RESEARCH article

Front. Bioeng. Biotechnol., 05 March 2021

Sec. Bionics and Biomimetics

Volume 9 - 2021 | https://doi.org/10.3389/fbioe.2021.646479

This article is part of the Research TopicFlexible Biosensors and Intelligent Medical Devices in Health and DiseaseView all 15 articles

Longtime Vision Function Prediction in Childhood Cataract Patients Based on Optical Coherence Tomography Images

Yifan Xiang^1†

Jingjing Chen^1†

Fabao Xu^1†

Zhuoling Lin¹

Jun Xiao¹

Zhenzhe Lin^1*

Haotian Lin^1,2*

¹State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
²Center of Precision Medicine, Sun Yat-sen University, Guangzhou, China

The results of visual prediction reflect the tendency and speed of visual development during a future period, based on which ophthalmologists and guardians can know the potential visual prognosis in advance, decide on an intervention plan, and contribute to visual development. In our study, we developed an intelligent system based on the features of optical coherence tomography images for long-term prediction of best corrected visual acuity (BCVA) 3 and 5 years in advance. Two hundred eyes of 132 patients were included. Six machine learning algorithms were applied. In the BCVA predictions, small errors within two lines of the visual chart were achieved. The mean absolute errors (MAEs) between the prediction results and ground truth were 0.1482–0.2117 logMAR for 3-year predictions and 0.1198–0.1845 logMAR for 5-year predictions; the root mean square errors (RMSEs) were 0.1916–0.2942 logMAR for 3-year predictions and 0.1692–0.2537 logMAR for 5-year predictions. This is the first study to predict post-therapeutic BCVAs in young children. This work establishes a reliable method to predict prognosis 5 years in advance. The application of our research contributes to the design of visual intervention plans and visual prognosis.

Introduction

Normal visual development and visual acuity (VA) are important for young children and are the basis of infantile brain development (Stjerna et al., 2015; Danka Mohammed and Khalil, 2020) and ability development (Wu et al., 2019; Havstam Johansson et al., 2020). Consequently, the results of visual prediction are meaningful by reflecting the tendency and speed of visual development during a future period. Exact VA prediction is beneficial for young children, especially children with ophthalmopathy, based on which ophthalmologists and guardians can determine the potential visual prognosis in advance, decide on an intervention plan, and contribute to visual development. However, the nature of ocular growth and myopia drift in young children may disrupt exact visual prediction and affect result accuracy. To date, no children-applicable technology for VA prediction has been reported. Existing research (Rohm et al., 2018) has focused on short-term visual prediction for adults within a year.

Fundus imaging, especially optical coherence tomography (OCT), is recognized as a key factor for visual prediction (Guo et al., 2017; Esaka et al., 2019; Park et al., 2020). OCT images have been applied to predict prognostic visual function in age-related macular degeneration (AMD) (Rohm et al., 2018) and have achieved excellent performance in visual prediction. In our study, based on clinical data, features of OCT images, and follow-up results of children with childhood cataract (CC), we developed a machine learning system for long-term visual prediction of best corrected visual acuity (BCVA) 3 and 5 years in advance. This system can help ophthalmologists and guardians monitor patients’ visual development (Long et al., 2017) and adopt necessary visual intervention (Sniatecki et al., 2015; Wang and Xiao, 2015) in time, thereby contributing to the visual prognosis of young children.

Materials and Methods

A prospective study was conducted at the Zhongshan Ophthalmic Center (ZOC), Guangdong, China, from June 2011 to February 2019. The data were collected from a national project for CC treatment and research: the CC Program of the Chinese Ministry of Health (CCPMOH). This study followed the tenets of the Declaration of Helsinki and was approved by the Institutional Review Board of the ZOC at Sun Yat-sen University (IRB-ZOC-SYSU).

Clinical and Imaging Data

Two hundred eyes of 132 patients from the ZOC were included in the study. The inclusion criteria were as follows: (1) diagnosed with CC at the ZOC, (2) had surgical treatment, (3) had a horizontal OCT B scan with a scan quality index of good, (4) had a clear axis after treatment, and (5) had follow-up BCVA exams at 3 or 5 years after the OCT B scan. The exclusion criteria were as follows: (1) diagnosed with another ophthalmic disease or (2) diagnosed with a neurological or mental disease. The collected clinical data included sex, laterality, surgical age, surgical type, age at OCT B scan, follow-up BCVA results, and other examination and prognostic information.

Measured features of OCT images were extracted from the Optovue software. The retinal thickness of nine parts, namely the macular area and its eight surrounding regions (inner and outer sides of nasal, temporal, superior, and inferior regions to macula, see Figure 1C), was recorded. Furthermore, the thickness of the retinal nerve fiber layer (RNFL) was labeled manually based on the OCT images. The thickness of the RNFL was divided into four grades. A label of 1 indicated the complete existence of RNFL, 2 indicated the existence of most of RNFL with a thickness more than half of the normal one, 3 indicated the intermittent existence of RNFL with a thickness less than half of the normal one, and 4 indicated the complete absence of RNFL. The label work was finished by three retinal ophthalmologists and confirmed by a retinal professor.

FIGURE 1

Figure 1. Overall study pipeline. The work flow of developing machine learning models for predicting 3- and 5-year BCVA based on medical records and fundus images. (A) The participant numbers; (B) the collected medical records; (C) the collected fundus images; (D) the applied models; and (E) the evaluation method. BCVA, best corrected visual acuity; GBDT, gradient boosting decision tree.

Model Training and Evaluation

The research procedure is shown in Figure 1. To predict the BCVA of CC patients 3 and 5 years in advance, we tried six machine learning models, namely random forest, ExtraTrees, gradient boosting decision tree (GBDT), ridge regression, lasso regression, and ElasticNet. The age at OCT B scan, the label of RNFL thickness, and the retinal thickness of nine parts obtained from OCT were applied as training features.

For the prediction tasks, we used five-fold cross-validation. The proportion of the training set and the test set were 80 and 20%, respectively. To quantitatively evaluate the prediction performance, we applied two evaluation metrics, mean absolute error (MAE) and root mean square error (RMSE). The MAE is calculated as the average value of the absolute error of the prediction results, which directly reflects the deviation of the predicted values from the actual values. The formula for MAE is as follows:

MAE = \frac{1}{N} \sum_{i = 1}^{N} | {\tilde{y}}_{i} - y_{i} |

The RMSE is the square root of mean square error (MSE). The MSE is calculated as the average value of the square of the error of the prediction results. The RMSE is more interpretable considering its unit consistency with the original variables. The formula for RMSE is as follows:

RMSE = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {({\tilde{y}}_{i} - y_{i})}^{2}}

In the above two formulas, N is the number of predictions per fold, y_i is the ground truth, and ${\tilde{y}}_{i}$ is the predicted value.

Results

The training data included 200 eyes of 132 patients (46 females and 86 males), containing 197 eyes of 131 patients for 3-year prediction training and 172 eyes of 114 patients for 5-year prediction training (Table 1). The average surgical age was nearly 47 months, and the average age at OCT B scan was close to 55 months. The average endpoint BCVAs were 0.45 and 0.33 logMAR, respectively, in the two prediction groups.

TABLE 1

Table 1. Characteristics of patients regarding the 3- and 5-year predictions.

For the prediction evaluation, most of the models achieved excellent performance of errors within two lines (0.2 logMAR) of the VA chart in both 3- and 5-year predictions (Table 2). The random forest and GBDT models achieved the best 3-year prediction, and the ExtraTrees and GBDT models achieved the best 5-year prediction. In the 3-year prediction test, the MAEs ranged from 0.1482 to 0.2117, and the RMSEs ranged from 0.1916 to 0.2942. In the 5-year prediction test, the MAEs ranged from 0.1198 to 0.1845, and the RMSEs ranged from 0.1692 to 0.2537. For the same model, the prediction errors of 5-year tasks were always lower than those of 3-year tasks.

TABLE 2

Table 2. Prediction errors in 3- and 5-year BCVA predictions.

Figure 2 shows the weights of features for 3- and 5-year BCVA predictions in the random forest model. The thickness of RNFL plays a key role in BCVA prediction with a weight of nearly 0.8. The age at OCT B scan is the second most important factor with a weight of very nearly 0.1. The retinal thickness of the macula is the most important of the retinal thickness of the nine regions at and near the macula. In the 5-year prediction, the weight of the thickness of RNFL is higher than that of the 3-year prediction. The results of other models are similar to those of the random forest model. The thickness of RNFL remains most important in all the models.

FIGURE 2

Figure 2. Feature weights of random forest for 3- and 5-year BCVA predictions. The thickness of RNFL has a weight of nearly 0.7–0.8. The age at OCT B scan has a weight of nearly 0.1. The retinal thickness of the macula is the most important of the retinal thickness of the nine regions at and near the macula. BCVA, best corrected visual acuity; RNFL, retinal nerve fiber layer.

Figure 3 shows both the ground-truth and prediction values of each test example in 3- and 5-year predictions, respectively, based on the random forest model. The examples were ordered based on the ground-truth BCVA in Figures 3A,B. When the true BCVAs were lower than 0.2 logMAR, the prediction values were always higher than the true results. As the true BCVAs increased, the prediction values fluctuated around the true results. In Figures 3C,D, the examples were ordered based on the age at OCT B scan, and the prediction values mainly fluctuated around the true results.

FIGURE 3

Figure 3. The ground-truth and prediction values of each test example in 3- and 5-year BCVA predictions based on random forest. The examples were ordered based on the ground-truth BCVA in (A,B) and ordered based on the age undergoing OCT B scan in (C,D). BCVA, best corrected visual acuity.

Discussion

This is the first study to predict the post-therapeutic long-term BCVA of CC children based on OCT images through artificial intelligence (AI), and it demonstrated that the long-term visual function of children can be accurately predicted based on imageology using machine learning.

The post-therapeutic visual function of children with ocular diseases is one of the most important factors (Mndeme et al., 2020) focused on by doctors and guardians, as visual prognosis plays a key role in intelligence development (Li et al., 2017), school attendance (Negretti et al., 2015), and quality of life. However, there has been limited research contributing to the visual prediction of children with ophthalmopathy, with most published studies paying attention to the short-term visual prediction of adults within a year (Rohm et al., 2018). Our study addresses both limitations by demonstrating children-applicable prediction and long-term prediction of VA based on imageology. The machine learning models can predict 3- and 5-year BCVAs in advance with a small error within two lines of the visual chart. Based on the results predicted by our model, ophthalmologists and guardians can provide necessary assistance and individually targeted intervention (Sniatecki et al., 2015) to help children obtain better visual outcomes (Pinto et al., 2015) and quality of life, which may be of significant importance to childhood brain development.

Our research achieves precise prediction of long-term BCVA based only on features of OCT images and age, which makes it more accessible and stable than other methods. Most CC patients will take a fundus photo or undergo an OCT B scan to check fundus function after cataract surgery, and our model can be conveniently applied. The reported research (Rohm et al., 2018) applied 165 features to achieve a 6-month prediction, including 41 features from clinical records and 124 features from OCT images. Compared with previous research, our models are simpler and more convenient for general application.

The feature weights shown in Figure 2 specify that the thickness of the RNFL is closely related to long-term visual development after therapy. In the longer prediction, its importance increases. The RNFL lacks the ability to regenerate (Cen et al., 2018; Wu et al., 2020). If the OCT image indicates that the RNFL has atrophied (balducci et al., 2017) at the baseline examination, the visual function would not improve much in the post-therapeutic follow-up. On the other hand, if the OCT image indicates that the RNFL is complete at baseline, CC patients may achieve remarkable visual improvement after surgery with proper intervention. Above all, the thickness of RNFL is a dominant and stable indicator in post-therapeutic BCVA prediction.

Random forest (Breiman, 2001), ExtraTrees (Geurts et al., 2006), and GBDT (Schwenk and Bengio, 1998) all belong to ensemble learning (Kadiyala and Kumar, 2018), in which random forest and ExtraTrees apply the bagging method (Kristína et al., 2006) and GBDT uses the boosting method. Each predicted function was parallel in the bagging model and serial in the boosting model. Bagging always behaves better in preventing overfitting in small sample learning. The models of ridge regression, lasso regression, and ElasticNet (Ogutu et al., 2012) tend to apply the least squares method to predict values, which behaves better in data with multicollinearity and does not exactly fit the weights of each dimension of our data without multicollinearity.

Limitation

The limitations of our research should be considered. Larger samples of CC patients are necessary to increase the dataset and to improve the prediction stability. Additionally, a longer follow-up period contributes to extending the predicted time span. Besides, data for external validation are warranted to test the prediction model in real-world settings.

Data Availability Statement

The data is available from the corresponding author upon reasonable request.

Ethics Statement

The studies involving human participants were reviewed and approved by the institutional review board of the ZOC at Sun Yat-sen University (IRB-ZOC-SYSU). Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin.

Author Contributions

YX, JC, and HL conceived and designed the experiments. JC and ZuL collected the data. YX and FX labeled the data. ZeL and JX performed the experiments and analyzed the data. YX wrote the manuscript. JC, FX, and HL revised it. All authors read and approved the final manuscript.

Funding

This study was funded by the National Key R&D Program of China (2018YFC0116500), the National Natural Science Foundation of China (81770967 and 81822010), and the Science and Technology Planning Projects of Guangdong Province (2019B030316012). The sponsors of the study played no role in the study protocol design, data collection, analysis, or interpretation, manuscript preparation, or the decision to submit the manuscript for publication.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors are grateful to all the participants and CC patients for supporting our study.

References

Balducci, N., Ciardella, A., Gattegna, R., Zhou, Q., Cascavilla, M. L., La Morgia, C., et al. (2017). Optical coherence tomography angiography of the peripapillary retina and opti nerve head in dominant optic atrophy. Mitochondrion 36, 60–65. doi: 10.1016/j.mito.2017.03.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Breiman, L. (2001). Random forests. Mach. Learn. 45, 5–32.

Google Scholar

Cen, L. P., Ng, T. K., Liang, J. J., Zhuang, X., Yao, X., Yam, G. H., et al. (2018). Human periodontal ligament-derived stem cells promote retinal ganglion cell survival and axon regeneration after optic nerve injury. Stem Cells 36, 844–855. doi: 10.1002/stem.2812

PubMed Abstract | CrossRef Full Text | Google Scholar

Danka Mohammed, C. P., and Khalil, R. (2020). Postnatal development of visual cortical function in the mammalian brain. Front. Syst. Neurosci. 14:29. doi: 10.3389/fnsys.2020.00029

PubMed Abstract | CrossRef Full Text | Google Scholar

Esaka, Y., Kojima, T., Dogru, M., Hasegawa, A., Tamaoki, A., Uno, Y., et al. (2019). Prediction of best-corrected visual acuity with swept-source optical coherence tomography parameters in keratoconus. Cornea 38, 1154–1160. doi: 10.1097/ICO.0000000000002043

PubMed Abstract | CrossRef Full Text | Google Scholar

Geurts, P., Ernst, D., and Wehenkel, L. (2006). Extremely randomized trees. Mach. Learn. 63, 3–42.

Google Scholar

Guo, Z., Kwon, Y. H., Lee, K., Wang, K., Wahle, A., Alward, W., et al. (2017). Optical coherence tomography analysis based prediction of humphrey 24-2 visual field thresholds in patients with glaucoma. Invest. Ophthalmol. Vis. Sci. 58, 3975–3985. doi: 10.1167/iovs.17-21832

PubMed Abstract | CrossRef Full Text | Google Scholar

Havstam Johansson, L., Škiljić, D., Falk Erhag, H., Ahlner, F., Pernheim, C., Rydberg Sterner, T., et al. (2020). Vision-related quality of life and visual function in a 70-year-old Swedish population. Acta Ophthalmol. 98, 521–529. doi: 10.1111/aos.14341

PubMed Abstract | CrossRef Full Text | Google Scholar

Kadiyala, A., and Kumar, A. (2018). Applications of python to evaluate the performance of bagging methods. Environ. Prog. Sustain. Energy 37, 1555–1559.

Google Scholar

Kristína, M., František, B., and Peter, B. (2006). A bagging method using decision trees in the role of base classifiers. Acta Polytech. Hung. 3, 121–132.

Google Scholar

Li, Q., Song, M., Xu, J., Qin, W., Yu, C., and Jiang, T. (2017). Cortical thickness development of human primary visual cortex related to the age of blindness onset. Brain Imaging Behav. 11, 1029–1036. doi: 10.1007/s11682-016-9576-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Long, E., Zhang, X., Liu, Z., Wu, X., Tan, X., Lin, D., et al. (2017). Dynamic response to initial stage blindness in visual system development. Clin. Sci. 131, 1515–1527. doi: 10.1042/CS20170234

PubMed Abstract | CrossRef Full Text | Google Scholar

Mndeme, F. G., Mmbaga, B. T., Msina, M., Mwende, J., Vaitha, S. J., Kim, M. J., et al. (2020). Presentation, surgery and 1-year outcomes of childhood cataract surgery in Tanzania. Br. J. Ophthalmol. 105, 334–340. doi: 10.1136/bjophthalmol-2020-316042

PubMed Abstract | CrossRef Full Text | Google Scholar

Negretti, G. S., Ayoub, T., Ahmed, S., Deb, R., Majumder, U., Jewel, J., et al. (2015). Cataract surgery outcomes in bangladeshi children. Ophthalmology 122, 882–887. doi: 10.1016/j.ophtha.2015.01.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Ogutu, J. O., Schulz-Streeck, T., and Piepho, H. P. (2012). Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions. BMC Proc. 6:S10. doi: 10.1186/1753-6561-6-S2-S10

PubMed Abstract | CrossRef Full Text | Google Scholar

Park, K., Kim, J., Kim, S., and Shin, J. (2020). Prediction of visual field from swept-source optical coherence tomography using deep learning algorithms. Graefes Arch. Clin. Exp. Ophthalmol. 258, 2489–2499. doi: 10.1007/s00417-020-04909-z

CrossRef Full Text | Google Scholar

Pinto, J. G., Jones, D. G., Williams, C. K., and Murphy, K. M. (2015). Characterizing synaptic protein development in human visual cortex enables alignment of synaptic age with rat visual cortex. Front. Neural Circuits 9:3. doi: 10.3389/fncir.2015.00003

PubMed Abstract | CrossRef Full Text | Google Scholar

Rohm, M., Tresp, V., Müller, M., Kern, C., Manakov, I., Weiss, M., et al. (2018). Predicting visual acuity by using machine learning in patients treated for neovascular age-related macular degeneration. Ophthalmology 125, 1028–1036. doi: 10.1016/j.ophtha.2017.12.034

PubMed Abstract | CrossRef Full Text | Google Scholar

Schwenk, H., and Bengio, Y. (1998). “Training methods for adaptive boosting of neural networks,” in Advances in Neural Information Processing Systems, eds M. Jordan I M. J. Kearns and S. A. Solla (Cambridge, MA: The MIT Press).

Google Scholar

Sniatecki, J. J., Styles, C., Boyle, N., and Sanders, R. (2015). Cataract surgery: factors influencing decision to treat and implications for training (south-east Scotland 2008-2014). Clin. Ophthalmol. 9, 1821–1827. doi: 10.2147/OPTH.S92803

PubMed Abstract | CrossRef Full Text | Google Scholar

Stjerna, S., Sairanen, V., Gröhn, R., Andersson, S., Metsäranta, M., Lano, A., et al. (2015). Visual fixation in human newborns correlates with extensive white matter networks and predicts long-term neurocognitive development. J. Neurosci. 35, 4824–4829. doi: 10.1523/JNEUROSCI.5162-14.2015

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, M., and Xiao, W. (2015). Congenital cataract: progress in surgical treatment and postoperative recovery of visual function. Eye Sci. 30, 38–47.

Google Scholar

Wu, N., Kong, X., Gao, J., and Sun, X. (2019). Vision-related quality of life in glaucoma patients and its correlations with psychological disturbances and visual function indices. J. Glaucoma. 28, 207–215. doi: 10.1097/IJG.0000000000001178

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, Y., Zhan, Z., Quan, Y., Yang, Y., Chen, X., Liu, L., et al. (2020). SP1-mediated upregulation of LINGO-1 promotes degeneration of retinal ganglion cells in optic nerve injury. CNS Neurosci. Ther. 26, 1010–1020. doi: 10.1111/cns.13426

CrossRef Full Text | Google Scholar

Keywords: machine learning, optical coherence tomography, childhood cataract, visual prediction, intelligent system

Citation: Xiang Y, Chen J, Xu F, Lin Z, Xiao J, Lin Z and Lin H (2021) Longtime Vision Function Prediction in Childhood Cataract Patients Based on Optical Coherence Tomography Images. Front. Bioeng. Biotechnol. 9:646479. doi: 10.3389/fbioe.2021.646479

Received: 27 December 2020; Accepted: 08 February 2021;
Published: 05 March 2021.

Edited by:

Jian Yang, Wuyi University, China

Reviewed by:

Xin Liu, University of California, San Diego, United States
Ivan S. Babichuk, Wuyi University, China

Copyright © 2021 Xiang, Chen, Xu, Lin, Xiao, Lin and Lin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Haotian Lin, bGluaHQ1QG1haWwuc3lzdS5lZHUuY24=; aGFvdC5saW5AaG90bWFpbC5jb20=; Zhenzhe Lin, b2YwNzI3QDE2My5jb20=

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.