A deep learning approach for anterior cruciate ligament rupture localization on knee MR images

Qu, Cheng; Yang, Heng; Wang, Cong; Wang, Chongyang; Ying, Mengjie; Chen, Zheyi; Yang, Kai; Zhang, Jing; Li, Kang; Dimitriou, Dimitris; Tsai, Tsung-Yuan; Liu, Xudong

doi:10.3389/fbioe.2022.1024527

ORIGINAL RESEARCH article

Front. Bioeng. Biotechnol. , 30 September 2022

Sec. Biomechanics

Volume 10 - 2022 | https://doi.org/10.3389/fbioe.2022.1024527

This article is part of the Research Topic Advanced pre-clinical and pre-surgical assessment of musculo-skeletal medical devices View all 21 articles

A deep learning approach for anterior cruciate ligament rupture localization on knee MR images

Cheng Qu¹^†

Heng Yang²^†

Cong Wang³

Chongyang Wang¹

Mengjie Ying¹

Zheyi Chen⁴

Kai Yang⁵

Jing Zhang²

Kang Li⁶

Dimitris Dimitriou⁷

Tsung-Yuan Tsai³*

Xudong Liu¹*

¹Department of Orthopedics, Shanghai Sixth People’s Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
²College of Electrical Engineering, Sichuan University, Chengdu, China
³School of Biomedical Engineering and Med-X Research Institute, Shanghai Jiao Tong University, Shanghai, China
⁴Department of Radiology, Shanghai Municipal Eighth People’s Hospital, Shanghai, China
⁵Department of Radiology, Shanghai Sixth People’s Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
⁶West China Hospital, Sichuan University, Chengdu, China
⁷Department of Orthopedics, Balgrist University Hospital, University of Zürich, Zurich, Switzerland

Purpose: To develop and evaluate a deep learning-based method to localize and classify anterior cruciate ligament (ACL) ruptures on knee MR images by using arthroscopy as the reference standard.

Methods: We proposed a fully automated ACL rupture localization system to localize and classify ACL ruptures. The classification of ACL ruptures was based on the projection coordinates of the ACL rupture point on the line connecting the center coordinates of the femoral and tibial footprints. The line was divided into three equal parts and the position of the projection coordinates indicated the classification of the ACL ruptures (femoral side, middle and tibial side). In total, 85 patients (mean age: 27; male: 56) who underwent ACL reconstruction surgery under arthroscopy were included. Three clinical readers evaluated the datasets separately and their diagnostic performances were compared with those of the model. The performance metrics included the accuracy, error rate, sensitivity, specificity, precision, and F1-score. A one-way ANOVA was used to evaluate the performance of the convolutional neural networks (CNNs) and clinical readers. Intraclass correlation coefficients (ICC) were used to assess interobserver agreement between the clinical readers.

Results: The accuracy of ACL localization was 3.77 ± 2.74 and 4.68 ± 3.92 (mm) for three-dimensional (3D) and two-dimensional (2D) CNNs, respectively. There was no significant difference in the ACL rupture location performance between the 3D and 2D CNNs or among the clinical readers (Accuracy, p < 0.01). The 3D CNNs performed best among the five evaluators in classifying the femoral side (sensitivity of 0.86 and specificity of 0.79), middle side (sensitivity of 0.71 and specificity of 0.84) and tibial side ACL rupture (sensitivity of 0.71 and specificity of 0.99), and the overall accuracy for sides classifying of ACL rupture achieved 0.79.

Conclusion: The proposed deep learning-based model achieved high diagnostic performances in locating and classifying ACL fractures on knee MR images.

1 Introduction

Anterior cruciate ligament (ACL) injuries are common sports-related musculoskeletal diseases (Spindler and Wright, 2008) that increase the risk of developing posttraumatic osteoarthritis and require an early diagnosis and intervention (Wang et al., 2020). In current clinical practice, most orthopedic surgeons will perform ACL reconstruction in patients with ACL injuries (Reijman et al., 2021). However, reconstruction surgery has many disadvantages, such as anterior knee pain (Janani et al., 2020), muscle atrophy (Lindström et al., 2013), and loss of proprioception at the reconstructed surgical site. In addition, native gait kinematics cannot be restored and revision surgery, if necessary, can be difficult due to tunnel widening and malpositioning (Aga et al., 2017; Kraeutler et al., 2017). In the 1970s and 1980s, open primary ACL repair was commonly performed but was eventually abandoned due to poor surgical results and complications (Feagin and Curl, 1976; Taylor et al., 2009). With the development and application of arthroscopy, biotechnology, stronger internal fixation techniques, and more rational postoperative rehabilitation, ACL repair has received renewed attention from orthopedic surgeons (van der List and DiFelice, 2017; DiFelice and van der List, 2018; Mahapatra et al., 2018; Ahmad et al., 2019; Hoogeslag et al., 2019; Murray et al., 2020; Li, 2022). Isolated ACL repair has been reported using various techniques including suture anchor primary ACL repair, internal brace ligament augmentation, bridge-enhanced ACL repair (BEAR), and dynamic intraligamentary stabilization (DIS) methods (Heusdens, 2021). Sherman et al. (Sherman et al., 1991) were the first to classify ACL tears arthroscopically according to both tear location and tissue quality and named it “Sherman classifications” in 1991. More recently, Van der List et al. (van der List and DiFelice, 2016) proposed a treatment algorithm based on the modified Sherman classification and suggested that only proximal ACL tears with good to excellent tissue quality should be repaired. Note that the key question now is how to identify the location and tissue quality of the injured ACL to determine if ACL repair surgery can be performed. Currently, MRI is a non-invasive method that demonstrates excellent sensitivity and specificity for the diagnosis of ACL injuries (Odgaard et al., 2002). Several studies have suggested that MRI may help surgeons to predict the reparability of ACL tears (van der List et al., 2017; van der List and DiFelice, 2018; Mehier et al., 2022). Mehier, C. et al. (Mehier et al., 2022) proposed three classification criteria for ACL tears based on tear location and tissue quality, including MRI Sherman tear location (MSTL), MRI Sherman tissue quality (MSTQ), and simplified MRI Sherman tissue quality (S-MSTQ) classifications. The diagnostic accuracy of the three criteria was 70% (50/71), 52% (15/29), and 90% (26/29), respectively. Interobserver agreement was good for MSTL (κ = 0.78) and moderate-to-good for the MSTQ and S-MSTQ classifications (κ = 0.44 and 0.63 respectively). Based on the above studies (Sherman et al., 1991; van der List and DiFelice, 2016; van der List et al., 2017; van der List and DiFelice, 2018; Mehier et al., 2022), we focused primarily on the localization of ACL injuries, and we simplified the classification of ACL injury sites to the femoral side, middle and tibial side, with each classification accounting for one-third of the entire ACL.

Deep learning has notable advantages in helping clinicians with limited experience or time in reading MR images and increasing the accuracy of the MR imaging interpretations (Shin et al., 2016). Several previous studies have focused on the application of deep learning for disease diagnoses in medical imaging; the applications include lung adenocarcinoma (Yu et al., 2021), abnormal pulmonary nodules (Sim et al., 2020), and breast masses (Caballo et al., 2020). In the case of diagnosing ACL injuries, previous work has been limited to the use of deep learning methods to detect the presence or absence of ACL injuries (Bien et al., 2018; Liu et al., 2019) and grading the hierarchical severity staging of ACL injuries on knee MR images (Namiri et al., 2020; Awan et al., 2021; Javed Awan et al., 2021). However, deep learning methods have yet to be applied to localizing the ACL rupture.

The main purpose of our study was to develop and evaluate a deep learning-based method to localize and classify ACL ruptures (femoral side, middle and tibial side) (Sherman et al., 1991; van der List and DiFelice, 2016; van der List et al., 2017; van der List and DiFelice, 2018; Mehier et al., 2022) on knee MR images by using arthroscopy as the reference standard.

The remainder of this article is structured as follows. Some of the recent work closely related to this study will be discussed in Section 2. In Section 3, the details of the MRI datasets are presented, and the architecture, implementation details, and performance metrics of the fully automated ACL rupture localization system are presented. The experimental results are analyzed in Section 4. The advantages and limitations of the proposed method are discussed in Section 5. Finally, the conclusion of our study is given in Section 6.

2 Recent works

In recent years, various deep learning-based methods have been developed in ACL segmentation and injury assessment. In 2021, Flannery et al. (Flannery et al., 2021) developed an automated intact ACL segmentation model based on 2D U-Net. The reference standard for training the model was the results of segmentation by an experienced (>5 years) physician, and the model was evaluated for anatomical similarity and the accuracy of quantitative metrics (i.e., signal intensity and volume). The model performed well on anatomical performance metrics (Dice coefficient = 0.84, precision = 0.82, and sensitivity = 0.85). The median signal intensities and volumes of the model were not significantly different from the ground truth. Recently, the team used a transfer learning approach to segment the surgically treated ACL automatically (Flannery et al., 2022). Compared with the intact ACL segmentation model, the anatomical performance of the automated segmentation model for surgically treated ACLs was slightly decreased (repairs/grafts: Dice coefficient = 0.80/0.78, precision = 0.79/0.78, sensitivity = 0.82/0.80). There were no significant differences in quantitative metrics between the ground truth and automatic segmentation of surgically treated ACLs. In 2018, Bien et al. (Bien et al., 2018) developed MRNet for detecting ACL tears on knee MR images. Using the labels of three musculoskeletal radiologists with an average of 12 years of experience as a reference standard, researchers evaluated the performance of MRNet and compared it with the performances of nine other physicians (model/physicians: sensitivity = 0.76/0.91, specificity = 0.97/0.93). In addition, the area under the receiver operating characteristic (ROC) curve (AUC) reached 0.82 when validated directly with MRNet on a public dataset from Clinical Hospital Centre Rijeka, Croatia, and improved to 0.91 after retraining. MRNet took less than 30 min to train on and less than 2 min to evaluate the public dataset, indicating that MRNet can improve clinician performance in the interpretation of medical imaging on both internal and external datasets. In 2019, Liu et al. (Liu et al., 2019) proposed a fully automated ACL tear detection system by using two convolutional neural networks (CNNs) to isolate the ACL on knee MR images followed by a classification CNN to detect ACL injuries on the selected image sections. A retrospective study of 350 subjects was conducted to evaluate the sensitivity and specificity of the model and those of the five radiologists in detecting ACL tears using arthroscopy as the reference standard. The overall training time was 11.62 h, while the average time for the model to detect an ACL tear for one subject was 9 s. The sensitivity and specificity of the model at the optimal threshold were 0.96 and 0.96, respectively. In contrast, the sensitivity of the radiologists ranged between 0.96 and 0.98, while the specificity ranged between 0.90 and 0.98. In 2020, Namiri et al. (Namiri et al., 2020) proposed a deep learning-based pipeline to isolate the ACL region of interest (ROI), detect abnormal ACL, and stage lesion severity using three-dimensional (3D) and two-dimensional (2D) CNN, respectively. The overall accuracy of the 3D and 2D CNN in classifying ACL injuries (reconstructed, fully torn, partially torn, and intact ACLs) was 0.89 and 0.92, respectively. In a recent study, Namiri et al. (Astuto et al., 2021) developed a 3D CNN model for full-knee ROI (cartilage, bone marrow, menisci, and ACL) detection and lesion classification. Binary injury sensitivity reported for all tissues was between 0.70 and 0.88, while the specificity ranged from 0.85 to 0.89.

3 Materials and methods

3.1 MRI datasets

This retrospective study was performed with approval from our institutional internal review boards and ethical committees (Ethics Committee Northeast and Central Switzerland 2018-01410). The MRI datasets were obtained from 85 patients with ACL ruptures (Male: 56, Female: 29) with an average age of 27 (range: 10–57) years who underwent knee MRI examination and subsequent ACL reconstruction surgery under arthroscopy between January 2010 and April 2018 (Figure 1). Inclusion criteria were patients younger than 57 years, with no history of previous trauma or surgery on the injured knee, and MRIs that were performed within 1 month of injury. The patient exclusion criteria were as follows: (a) partial tear; (b) multiple ligamentous knee injuries; (c) MRI unavailable or of insufficient quality; (d) significant lacking information.

FIGURE 1

FIGURE 1. Inclusion and exclusion criteria. ACL, anterior cruciate ligament.

All the patients were scanned using a 3.0-T MR Scanner (Achieva; Philips Healthcare, Netherlands). The MRI datasets consisted of sagittal T2-weighted turbo spin-echo and coronal T1–weighted high spatial resolution turbo spin-echo sequences. The detailed imaging parameters of the sequences are summarized in Table 1.

TABLE 1

TABLE 1. Parameters for the knee MRI sequences used to locate ACL rupture.

3.2 Fully automated anterior cruciate ligament rupture localization system

In this study, we propose a two-step, coarse-to-fine deep learning-based pipeline to isolate the specific areas that contain ACL in the knee and we locate the ACL rupture site using 2D and 3D convolutions with MR images.

Our deep learning framework consists of the segmentation network that categorizes the knee into 4 distinct anatomic components and the landmark detection network to localize the centroid of an ACL rupture (Figure 2). The first segmentation network was implemented to approximately narrow the specific areas that contain ACLs; this network was based on a 3D U-Net architecture (Cicek et al., 2016). Based on the position of the femoral footprint and tibial footprint, we cropped the patches containing the ACL from the MR images to eliminate the unnecessary details and used them as input images to the localization CNNs. In the second stage, we compared the localization performance of the CNNs on 2D slices with 3D cropped images. All CNNs were developed through a cascaded approach to create a fully automated processing pipeline. The detailed network structure for the CNNs is summarized in Supplementary Table S1 (Litjens et al., 2017).

FIGURE 2

FIGURE 2. The convolutional neural network (CNN) pipeline for the deep learning-based fully automated ACL rupture localization system. The proposed methods including 2D and 3D CNNs consisted of segmentation and landmark detection network connected in a cascaded fashion to create a fully automated image processing pipeline. ACL, anterior cruciate ligament; BN, batch normalization; Conv, convolution; Norm, normalization; LReLU, Leaky-ReLU; ReLU, rectified-linear activation; 2D, two-dimensional; 3D, three-dimensional.

3.2.1 2D

This scheme consists of two stages, a slice selection and landmark localization. The slice selection network was constructed by a 3D full CNN (Figure 3) with an input size of 6 × 256 × 256, and it had nine sets of convolutional layers and eight pooling layers. The first eight sets of convolutional layers were used to extract features with two 1 × 3 × 3 convolution operations in each layer, while the last convolutional layer was used for reducing the feature dimension to one channel. The image size became 6 × 1 × 1 through eight max pooling layers, which only implemented downsampling on the slice size, followed by a softmax layer, and the network was trained by the cross-entropy loss values between the output vector and standard vector.

FIGURE 3

FIGURE 3. Flow chart for the slice detection network of 2D CNNs.

The landmark detection stage of our method is mainly based on YOLOF, which is formulated to predict keypoint coordinates by the bounding box center. YOLOF is one of the latest single-level detectors, which only uses the final low resolution feature map C5 to detect objects. We used the ResNet-101 network (Ranjan et al., 2019) as the backbone network in our training phase. Based on the real pixel coordinates (usually decimal) of the rupture point on the axis of the high-resolution slice in the cropped MRI images, we select the integers on both sides of the decimal as the slices (2 slices) where the real rupture point is located. We maintained the slice resolution as 0.25 mm * 0.25 mm in order to use high-precision images that would ensure the accuracy of the results. We utilized random rotation, flipping and elastic transformation to enhance the data and expand our training dataset.

In sum, the selected slice was adopted as the z value of the final coordinate, and the coordinates (x, y) were obtained by the predicted bounding box center. After mapping the obtained pixel coordinates into the physical coordinates of the original MRI image, the automatic localization of the ACL rupture point was completed.

3.2.2 3D

The 3D localization scheme was based on the heatmap regression network, which was adapted from a 3D full resolution nnU-Net (Isensee et al., 2021). The network has an encoder-decoder structure. The encoder is comprised of a sequence of convolution layers with strided convolution downsampling, which compresses the original input volume into low-resolution and highly abstracted feature maps. The decoder has the same structure with a transpose convolution upsampling, which processes the downsampling abstracted feature maps into outputs with the same resolution as the input, in a way that is symmetrical to what is done in the compression layers. The feature maps of the same level are concatenated by a skip connection. All batch normalizations were replaced by group normalizations, and we used the combination of Dice loss and focal loss (Lin et al., 2020) as the loss function to train the model.

In particular, in the training phase, we use a 3D Gaussian function, centered at the manually labeled rupture position, as a probability heatmap. The probability values are multiplied by a constant to scale the maximum to 1 (the groundtruth of the landmark). In the landmark mask, the probability value gradually decreases from 1 at the center position in the voxel range of the Gaussian heatmap distribution (the landmark voxels), and the value of the background voxels is set to zero. Note that we incorporate a false-positive suppression strategy during the training phase to make our model more robust. Specifically, we force the values that are very close to the landmark voxels (e.g., < 2 mm) to be negative rather than zero, so they are regarded as invalid voxels to avoid being calculated in the loss function. Finally, we mark the rupture voxels by using the standard probability threshold of 0.5 and calculate the centroid of the whole region as the output coordinate. The heatmaps were generated using the Matplotlib library (https://matplotlib.org/).

3.3 Definition of simplified classification of anterior cruciate ligament injury sites on our deep learning-based model

The ACL rupture was approximately described based on the line connecting the center coordinates of the femoral and tibial footprints. The line was divided into three equal parts to indicate which section (femoral side, middle, and tibial side) the rupture area was located on, while the rupture area was interpreted as the coordinate of the perpendicular foot between the rupture point and the ACL line.

The entire ACL measures approximately 38 mm in length and 11 mm in width (Girgis et al., 1975). According to our simplified classification of ACL injury sites, each section accounts for one-third of the entire ACL, which is approximately 12 mm. Based on the anatomy of ACL (Girgis et al., 1975) and the study of Payer, C. et al. (Payer et al., 2016) on medical image landmark localization, for both 2D and 3D CNNs, a localization failure case occurred when the distance between the ground-truth location and the predicted location was larger than 10 mm.

3.4 Implementation

The training and evaluation of our pipeline was done on a desktop computer running a 64-bit Linux operating system with 8 V100 SXM3-32GB GPUs and CUDA version 10.2. All machine learning algorithms were implemented in PyTorch with Python 3.7, and each CNN was trained individually. The model was validated by a fivefold cross-validation. The data were randomly divided up into 5 non-overlapping groups known as folds and each fold consisted of 17 MRI images. One of those folds was retained as the validation set, and the remaining images were used for training. The average accuracy of all the folds was the overall accuracy of our system.

3.5 Training and evaluation of the fully automated anterior cruciate ligament rupture localization system

The reference standard for training the segmentation network was the image patch segmentation bounded by a manually labeled femoral footprint and tibial footprint performed on the sagittal T2-weighted sequences of all 85 subjects. The labeling of the femoral footprint and tibial footprint areas was performed by an orthopedic fellow (D.D., with 8 years of labeling experience) using the ITK-SNAP program (https://www.itksnap.org/pmwiki/pmwiki.php). The reference standard for training the localization network was the centroid physical coordinate of the rupture region marked on the MRI of the corresponding patient by an orthopedic fellow (D.D.) using the location of the arthroscopic ACL injury as the reference standard.

3.6 Evaluation by clinical readers

To compare the localization accuracy of our pipeline with that of clinical readers, a 3rd-year musculoskeletal clinician [MY (Resident 1)], a 6th-year musculoskeletal clinician [CYW (Resident 2)], and a 6th-year radiologist [ZC (Fellow)] independently reviewed the MR images of all 85 patients. The clinical readers identified the site of the ACL rupture by placing image patches where they believed the ACL rupture occurred on the sagittal T2-weighted MR images using the ITK-SNAP program. Then, the centroid physical coordinates of the manually labeled image patches are calculated and compared with the coordinates predicted by the deep learning-based model to evaluate the localization accuracy and classification performance of the model. All the clinical reviewers had no formal training or calibration courses prior to evaluating the ACL rupture site.

3.7 Statistical analysis

All the statistical analyses were calculated using SPSS (Version 26; IBM Corporation, Armonk, NY, United States). The p values less than 0.05 were considered statistically significant. Euclidean distances (mean value ±standard deviation, millimeters) between the ground-truth locations and the predicted locations of the landmarks were used to evaluate the algorithm localization accuracy. The localization error rate was defined as the ratio between the number of failure cases and the total samples. Interobserver agreement between two of the three independent blinded clinical readers was assessed using single-measure intraclass correlation coefficients (ICC) with a two-way random-effects model for absolute agreement. The performance statistics for the classification of ACL rupture were reported for sensitivity, specificity, precision, F1-score, and overall accuracy.

\begin{array}{c} S e n s i t i v i t y = T P / (T P + F N) \end{array} (1)

\begin{array}{c} S p e c i f i c i t y = T N / (T N + F P) \end{array} (2)

\begin{array}{c} P r e c i s i o n = T P / (T P + F P) \end{array} (3)

\begin{array}{c} F 1 - s c o r e = T P / (T P + 0.5 * (F P + F N)) \end{array} (4)

\begin{array}{c} O v e r a l l a c c u r a c y = \frac{c o r r e c t c l a s s i f i c a t i o n s}{a l l c l a s s i f i c a t i o n s} \end{array} (5)

where TP, TN, FP, and FN are true positive, true negative, false positive, and false negative, respectively. Also, '*' and '/' represent multiplication and division, respectively.

4 Results

Compared with models proposed by Bien et al. (Bien et al., 2018) and Liu et al. (Liu et al., 2019), the training time for our pipeline was 60 min, and the average time for the ACL rupture localization system to locate and classify the rupture site for one subject was 1.6 s using the trained networks.

Table 2 compares the accuracy and error rates of the proposed pipeline (both the 2D and 3D methods) with those of the clinical readers. The mean localization accuracies were 4.68 ± 3.92 [standard deviation] (mm) for the 2D method, 3.77 ± 2.74 (mm) for the 3D method, 8.27 ± 4.47 (mm) for Resident 1, 8.34 ± 3.36 (mm) for Resident 2, and 8.00 ± 5.74 (mm) for Fellow. There was no significant difference in ACL rupture location performance between the 3D and 2D CNNs or among the clinical readers (Accuracy, p < 0.01). The error rates of the 2D and 3D CNNs were 11% (9/85) and 3.5% (3/85), respectively. In comparison, the error rates of the clinical readers ranged between 31% (28/85) and 40% (34/85). Table 3 shows the ICC values for interobserver agreement between the clinical readers in the localization of ACL ruptures on the same image patches. There was poor to moderate interobserver agreement between the clinical readers, with ICC values between 0.19 and 0.54.

TABLE 2

TABLE 2. Accuracy and error rate of clinical residents, musculoskeletal radiology fellow, 2D CNNs, and 3D CNNs in localization of ACL ruptures.

TABLE 3

TABLE 3. Intraclass correlation coefficients (ICC) for Interobserver Agreement between the Clinical Readers in Localization of ACL Ruptures.

Tables 4, 5 show the confusion matrices and sensitivity and also the specificity, precision, F1-score, and overall system accuracy values for the clinical readers, and also the 2D and 3D CNNs for evaluating the side classification performance on ACL ruptures on the image patches in all 85 MR datasets. The confusion matrix results for the ACL injury classification corresponding to each evaluator in Table 4 show that the 3D CNNs had the highest performance on ACL rupture classifications. As shown in Table 5, both models performed better than the clinical readers in describing the location of ACL ruptures. The 3D CNNs performed best among the five evaluators in classifying the femoral side (sensitivity of 0.86 and specificity of 0.79), middle side (sensitivity of 0.71 and specificity of 0.84), and tibial side ACL rupture (sensitivity of 0.71 and specificity of 0.99). While the overall accuracy of clinical readers ranged between 0.42 and 0.56, the overall accuracy of the ACL localization system for the 3D and 2D CNNs was 0.79 and 0.61, respectively.

TABLE 4

TABLE 4. Confusion matrices for the clinical residents, musculoskeletal radiology fellow, 2D CNNs, and 3D CNNs for performance in sides classifying of ACL rupture on the image patches.

TABLE 5

TABLE 5. Sensitivity, specificity, precision, F1-score, and overall accuracy for clinical residents, musculoskeletal radiology fellow, 2D CNNs, and 3D CNNs for performance in sides classifying of ACL rupture on the image patches.

Figure 4 displays sagittal views of the cropped knee MR image, which were processed by the deep learning model for mislocalization and false classification. The true part of the rupture is on the middle side but the model outputs a classification result on the femoral side. The deep learning pipeline outputs incorrect localization results due to the Euclidean distance between the true and predicted rupture point locations being greater than 10 mm, which exceeds the maximum error threshold we set. Based on our model, the results of ACL rupture classification are directly related to the accuracy of its rupture localization, and incorrect localization leads to incorrect classification.

FIGURE 4

FIGURE 4. Sagittal views of the cropped MR image, mislocalization and false classification. The predicted rupture point is marked by red circle, while the true rupture point is green. The deep learning pipeline outputs incorrect localization results due to the Euclidean distance between the true and predicted rupture point locations being greater than 10 mm, which exceeds the maximum error threshold we set. A mislocalization resulted in a false classification. The true part of the rupture is the middle side, but the prediction is femoral side.

Figure 5 shows that the predicted rupture point location is very close to the true rupture point location and the Euclidean distance between them is within the set error range. The deep learning model is able to correctly locate the ACL rupture point and therefore outputs the correct classification.

FIGURE 5

FIGURE 5. Sagittal views of the cropped MR image, correct localization and classification. The predicted rupture point is marked by red circle, while the true rupture point is green. The model predicted a correct localization, and the system shows a correct classification (the middle side).

5 Discussion

Our study describes a fully automated ACL rupture localization system utilizing a segmentation network adapted from 3D U-Net (Cicek et al., 2016) for approximately narrowing the specific areas that contain an ACL. This is followed by a second landmark detection network based on the YOLOF (for the 2D model) and 3D full resolution nnU-Net (Isensee et al., 2021) (for the 3D model) with several modifications to localize the ACL rupture within the cropping patches that contain the ACL rupture region of interest according to the coordinate. The 3D CNNs achieved the highest performance among all the models and clinicians, with a localization accuracy reaching 3.77 ± 2.74 (mm). The error rate and the overall system accuracy were 3.5% (3/85) and 79%, respectively. In addition, the 3D CNNs performed best among the five evaluators in classifying the femoral side (F1-score: 0.83), middle side (F1-score: 0.74), and tibial side ACL rupture (F1-score: 0.77).

Previous work using deep learning methods has been limited to detecting the presence or absence of ACL ruptures or triaging the lesion severity of ACL injuries on knee MR images. Bien et al. (Bien et al., 2018) made predictions from three series types of knee MRIs to train different MRNets with a pretrained AlexNet, and the experimental results showed a 0.911 AUC, 0.968 specificity, and 0.759 sensitivity for ACL tears. Namiri et al. (Namiri et al., 2020) created a deep learning model to predict four lesion severities for the ACL, used V-Net to segment the knee and determined the ACL boundaries of the original input MRI. Then, the cropped images were tested on the 2D and 3D CNNs, which detected reconstructed, fully torn, partially torn and intact ACLs. The 2D and 3D CNNs achieved high overall accuracies of 92% and 89%, respectively. Most recently, Awan et al. (Javed Awan et al., 2021) trained a customized ResNet-14 architecture utlilizing class balancing and data augmentation, which performed at an average accuracy of 92% for three classes. The results showed that the AUC was 0.980 for healthy ACLs, 0.970 for partially torn ACLs and 0.999 for fully torn ACLs. In contrast to the work described above, our pipeline has many advantages. Our pipeline localizes and classifies ACL ruptures on knee MR images, which can help clinicians roughly determine whether a patient has a potential for ACL repair based on the results of ACL injury classification. Based on the ACL injury treatment algorithm proposed by Van der List et al. (van der List and DiFelice, 2016; van der List et al., 2017), we believe that proximal ACL injuries (femoral side) have the potential for ACL repair surgery whereas ACL reconstruction surgery is recommended for injuries near the middle and tibial sides. In addition, our localization system is not influenced by human factors. The interobserver agreement between clinicians in our study did not perform very well (ICC range between 0.19 and 0.54), which may be due to inexperience, distraction, and different interpretations of MRI by clinicians with different specialties. Our pipeline avoids these problems by using arthroscopy as a reference standard and labeling the location of the ACL injury on the corresponding MR images. Furthermore, both CNNs and clinical readers localized ACL rupture within a set threshold (10 mm), but CNNs performed better than clinicians in localization (CNNs/clinicians: 3.77–4.68 mm/8.00–8.34 mm). With accurate localization of ACL injuries, our system also allows the surgeon to adjust the range of ACL injury classification to suit the actual situation.

Our study had several limitations. First, our dataset has a small sample size which only allows for the process of data cross-validation, and more data are needed to verify the reliability of our system. Second, proton density-weighted MR sequences are considered to be commonly used to evaluate knee injuries. MR data in our study are sagittal T2-weighted and coronal T1–weighted MR sequences, and more sequences need to be added to train the localization system to make the results more reliable. In addition, given the fair interobserver agreement among clinicians, we need more experienced clinicians to join the evaluators to calculate ACL injury localization accuracy and classification reliability. Finally, we can include the negative control group in which there is no ACL rupture in the development of the deep learning pipeline, which may be of greater translational and applied value to clinical scenarios.

6 Conclusion

In conclusion, our pipeline was found to be more accurate in locating and classifying ACL ruptures (femoral side, middle, and tibial side) than clinicians with varying levels of experience, which may help clinicians determine whether an ACL injured patient has the potential for ACL repair based on the classification results.

Data availability statement

The raw data supporting the conclusion of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee Northeast and Central Switzerland 2018-01410. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin. Written informed consent was obtained from the individual(s), and minor(s)' legal guardian/next of kin, for the publication of any potentially identifiable images or data included in this article.

Author contributions

Guarantors of integrity of entire study, CQ, HY, and XL; study concepts/study design or data acquisition or data analysis/interpretation, all authors; manuscript drafting or manuscript revision for important intellectual content, all authors; manuscript final version approval, all authors; agrees to ensure any questions related to the work are appropriately resolved, all authors; literature research, CQ, HY, CW, JZ, and XL; clinical studies, CQ, HY, CyW, MY, ZC, DD, and XL; experimental studies, CQ, HY, KY, and JZ; statistical analysis, CQ, HY, CW, and XL; and manuscript editing, CQ, HY, CyW, KL, T-YT, and XL.

Funding

This study has received funding by the National Natural Science Foundation of China (Grant Nos. 62176157, 31972924), National Key R&D Program of China (Grant No. 2019YFC0120600), the Science and Technology Commission of Shanghai Municipality (Grant Nos. 21DZ2208200, 22S31906000), the Pudong Science Technology and Economy Commission (Grant No. 210H1147900).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbioe.2022.1024527/full#supplementary-material

References

Aga, C., Wilson, K. J., Johansen, S., Dornan, G., La Prade, R. F., and Engebretsen, L. (2017). Tunnel widening in single- versus double-bundle anterior cruciate ligament reconstructed knees. Knee Surg. Sports Traumatol. Arthrosc. 25 (4), 1316–1327. doi:10.1007/s00167-016-4204-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Ahmad, S. S., Schreiner, A. J., Hirschmann, M. T., Schröter, S., Döbele, S., Ahrend, M. D., et al. (2019). Dynamic intraligamentary stabilization for ACL repair: A systematic review. Knee Surg. Sports Traumatol. Arthrosc. 27 (1), 13–20. doi:10.1007/s00167-018-5301-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Astuto, B., Flament, I., K Namiri, N., Shah, R., Bharadwaj, U., M Link, T., et al. (2021). Automatic deep learning-assisted detection and grading of abnormalities in knee MRI studies. Radiol. Artif. Intell. 3 (3), e200165. doi:10.1148/ryai.2021200165

PubMed Abstract | CrossRef Full Text | Google Scholar

Awan, M. J., Rahim, M. S. M., Salim, N., Rehman, A., Nobanee, H., and Shabir, H. (2021). Improved deep convolutional neural network to classify osteoarthritis from anterior cruciate ligament tear using magnetic resonance imaging. J. Pers. Med. 11 (11), 1163. doi:10.3390/jpm11111163

PubMed Abstract | CrossRef Full Text | Google Scholar

Bien, N., Rajpurkar, P., Ball, R. L., Irvin, J., Park, A., Jones, E., et al. (2018). Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet. PLoS Med. 15 (11), e1002699. doi:10.1371/journal.pmed.1002699

PubMed Abstract | CrossRef Full Text | Google Scholar

Caballo, M., Pangallo, D. R., Mann, R. M., and Sechopoulos, I. (2020). Deep learning-based segmentation of breast masses in dedicated breast CT imaging: Radiomic feature stability between radiologists and artificial intelligence. Comput. Biol. Med. 118, 103629. doi:10.1016/j.compbiomed.2020.103629

PubMed Abstract | CrossRef Full Text | Google Scholar

Çiçek, Ö., Abdulkadir, A., Lienkamp, S. S., Brox, T., and Ronneberger, O. (2016). 3D U-net: Learning dense volumetric segmentation from sparse annotation. Med Image Comput Comput Assist Interv. 424–432. Springer.

Google Scholar

DiFelice, G. S., and van der List, J. P. (2018). Clinical outcomes of arthroscopic primary repair of proximal anterior cruciate ligament tears are maintained at mid-term follow-up. Arthrosc. J. Arthrosc. Relat. Surg. 34 (4), 1085–1093. doi:10.1016/j.arthro.2017.10.028

CrossRef Full Text | Google Scholar

Feagin, J. A., and Curl, W. W. (1976). Isolated tear of the anterior cruciate ligament: 5-year follow-up study. Am. J. Sports Med. 4 (3), 95–100. doi:10.1177/036354657600400301

PubMed Abstract | CrossRef Full Text | Google Scholar

Flannery, S. W., Kiapour, A. M., Edgar, D. J., Murray, M. M., Beveridge, J. E., and Fleming, B. C. (2022). A transfer learning approach for automatic segmentation of the surgically treated anterior cruciate ligament. J. Orthop. Res. 40 (1), 277–284. doi:10.1002/jor.24984

PubMed Abstract | CrossRef Full Text | Google Scholar

Flannery, S. W., Kiapour, A. M., Edgar, D. J., Murray, M. M., and Fleming, B. C. (2021). Automated magnetic resonance image segmentation of the anterior cruciate ligament. J. Orthop. Res. 39 (4), 831–840. doi:10.1002/jor.24926

PubMed Abstract | CrossRef Full Text | Google Scholar

Girgis, F. G., Marshall, J. L., and Monajem, A. (1975). The cruciate ligaments of the knee joint. Anatomical, functional and experimental analysis. Clin. Orthop. Relat. Res. 106, 216–231. doi:10.1097/00003086-197501000-00033

PubMed Abstract | CrossRef Full Text | Google Scholar

Heusdens, C. H. W. (2021). ACL repair: A game changer or will history repeat itself? A critical appraisal. J. Clin. Med. 10 (5), 912. doi:10.3390/jcm10050912

PubMed Abstract | CrossRef Full Text | Google Scholar

Hoogeslag, R. A. G., Brouwer, R. W., Boer, B. C., de Vries, A. J., and Huis In 't Veld, R. (2019). Acute anterior cruciate ligament rupture: Repair or reconstruction? Two-year results of a randomized controlled clinical trial. Am. J. Sports Med. 47 (3), 567–577. doi:10.1177/0363546519825878

PubMed Abstract | CrossRef Full Text | Google Scholar

Isensee, F., Jaeger, P. F., Kohl, S. A. A., Petersen, J., and Maier-Hein, K. H. (2021). nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 18 (2), 203–211. doi:10.1038/s41592-020-01008-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Janani, G., Suresh, P., Prakash, A., Parthiban, J., Anand, K., and Arumugam, S. (2020). Anterior knee pain in ACL reconstruction with BPTB graft - is it a myth? Comparative outcome analysis with hamstring graft in 1, 250 patients. J. Orthop. 22, 408–413. doi:10.1016/j.jor.2020.09.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Javed Awan, M., Mohd Rahim, M. S., Salim, N., Mohammed, M. A., Garcia-Zapirain, B., and Abdulkareem, K. H. (2021). Efficient detection of knee anterior cruciate ligament from magnetic resonance imaging using deep learning approach. Diagnostics 11 (1), 105. doi:10.3390/diagnostics11010105

PubMed Abstract | CrossRef Full Text | Google Scholar

Kraeutler, M. J., Welton, K. L., McCarty, E. C., and Bravman, J. T. (2017). Revision anterior cruciate ligament reconstruction. J. Bone Jt. Surg. 99 (19), 1689–1696. doi:10.2106/jbjs.17.00412

CrossRef Full Text | Google Scholar

Li, Z. (2022). Efficacy of repair for ACL injury: A meta-analysis of randomized controlled trials. Int. J. Sports Med. [online ahead of print] doi:10.1055/a-1755-4925

CrossRef Full Text | Google Scholar

Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Dollar, P. (2020). Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 42 (2), 318–327. doi:10.1109/tpami.2018.2858826

PubMed Abstract | CrossRef Full Text | Google Scholar

Lindström, M., Strandberg, S., Wredmark, T., Felländer-Tsai, L., and Henriksson, M. (2013). Functional and muscle morphometric effects of ACL reconstruction. A prospective CT study with 1 year follow-up. Scand. J. Med. Sci. Sports 23 (4), 431–442. doi:10.1111/j.1600-0838.2011.01417.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Litjens, G., Kooi, T., Bejnordi, B. E., Setio, A. A. A., Ciompi, F., Ghafoorian, M., et al. (2017). A survey on deep learning in medical image analysis. Med. Image Anal. 42, 60–88. doi:10.1016/j.media.2017.07.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, F., Guan, B., Zhou, Z., Samsonov, A., Rosas, H., Lian, K., et al. (2019). Fully automated diagnosis of anterior cruciate ligament tears on knee mr images by using deep learning. Radiol. Artif. Intell. 1 (3), 180091. doi:10.1148/ryai.2019180091

PubMed Abstract | CrossRef Full Text | Google Scholar

Mahapatra, P., Horriat, S., and Anand, B. S. (2018). Anterior cruciate ligament repair - past, present and future. J. Exp. Orthop. 5 (1), 20. doi:10.1186/s40634-018-0136-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Mehier, C., Ract, I., Metten, M. A., Najihi, N., and Guillin, R. (2022). Primary anterior cruciate ligament repair: Magnetic resonance imaging characterisation of reparable lesions and correlation with arthroscopy. Eur. Radiol. 32 (1), 582–592. doi:10.1007/s00330-021-08155-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Murray, M. M., Fleming, B. C., Badger, G. J., Freiberger, C., Henderson, R., Barnett, S., et al. (2020). Bridge-enhanced anterior cruciate ligament repair is not inferior to autograft anterior cruciate ligament reconstruction at 2 Years: Results of a prospective randomized clinical trial. Am. J. Sports Med. 48 (6), 1305–1315. doi:10.1177/0363546520913532

PubMed Abstract | CrossRef Full Text | Google Scholar

Namiri, N. K., Flament, I., Astuto, B., Shah, R., Tibrewala, R., Caliva, F., et al. (2020). Deep learning for hierarchical severity staging of anterior cruciate ligament injuries from mri. Radiol. Artif. Intell. 1 (4), e190207–e190208. doi:10.1148/ryai.2020190207

PubMed Abstract | CrossRef Full Text | Google Scholar

Odgaard, F., Tuxoe, J., Joergensen, U., Lange, B., Lausten, G., Brettlau, T., et al. (2002). Clinical decision making in the acutely injured knee based on repeat clinical examination and MRI. Scand. J. Med. Sci. Sports 12 (3), 154–162. doi:10.1034/j.1600-0838.2002.00246.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Payer, C., Stern, D., Bischof, H., and Urschler, M. (2016). “Regressing heatmaps for multiple landmark localization using CNNs,” in Proceeding of the International Conference on Medical Image Computing and Computer-Assisted Intervention, 02 October 2016 (New York City: Springer Link), 230–238. doi:10.1007/978-3-319-46723-8_27

CrossRef Full Text | Google Scholar

Ranjan, R., Patel, V. M., and Chellappa, R. (2019). HyperFace: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans. Pattern Anal. Mach. Intell. 41 (1), 121–135. doi:10.1109/tpami.2017.2781233

PubMed Abstract | CrossRef Full Text | Google Scholar

Reijman, M., Eggerding, V., van Es, E., van Arkel, E., van den Brand, I., van Linge, J., et al. (2021). Early surgical reconstruction versus rehabilitation with elective delayed reconstruction for patients with anterior cruciate ligament rupture: COMPARE randomised controlled trial. Bmj 372, n375. doi:10.1136/bmj.n375

PubMed Abstract | CrossRef Full Text | Google Scholar

Sherman, M. F., Lieber, L., Bonamo, J. R., Podesta, L., and Reiter, I. (1991). The long-term followup of primary anterior cruciate ligament repair. Defining a rationale for augmentation. Am. J. Sports Med. 19 (3), 243–255. doi:10.1177/036354659101900307

PubMed Abstract | CrossRef Full Text | Google Scholar

Shin, H. C., Roth, H. R., Gao, M., Lu, L., Xu, Z., Nogues, I., et al. (2016). Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans. Med. Imaging 35 (5), 1285–1298. doi:10.1109/TMI.2016.2528162

PubMed Abstract | CrossRef Full Text | Google Scholar

Sim, Y., Chung, M. J., Kotter, E., Yune, S., Kim, M., Do, S., et al. (2020). Deep convolutional neural network-based software improves radiologist detection of malignant lung nodules on chest radiographs. Radiology 294 (1), 199–209. doi:10.1148/radiol.2019182465

PubMed Abstract | CrossRef Full Text | Google Scholar

Spindler, K. P., and Wright, R. W. (2008). Anterior cruciate ligament tear. N. Engl. J. Med. Overseas. Ed. 359 (20), 2135–2142. doi:10.1056/NEJMcp0804745

PubMed Abstract | CrossRef Full Text | Google Scholar

Taylor, D. C., Posner, M., Curl, W. W., and Feagin, J. A. (2009). Isolated tears of the anterior cruciate ligament: Over 30-year follow-up of patients treated with arthrotomy and primary repair. Am. J. Sports Med. 37 (1), 65–71. doi:10.1177/0363546508325660

PubMed Abstract | CrossRef Full Text | Google Scholar

van der List, J. P., and DiFelice, G. S. (2018). Preoperative magnetic resonance imaging predicts eligibility for arthroscopic primary anterior cruciate ligament repair. Knee Surg. Sports Traumatol. Arthrosc. 26 (2), 660–671. doi:10.1007/s00167-017-4646-z

PubMed Abstract | CrossRef Full Text | Google Scholar

van der List, J. P., and DiFelice, G. S. (2016). Preservation of the anterior cruciate ligament: A treatment algorithm based on tear location and tissue quality. Am. J. Orthop. 45 (7), E393–E405.

PubMed Abstract | Google Scholar

van der List, J. P., and DiFelice, G. S. (2017). Primary repair of the anterior cruciate ligament: A paradigm shift. Surg. 15 (3), 161–168. doi:10.1016/j.surge.2016.09.006

CrossRef Full Text | Google Scholar

van der List, J. P., Mintz, D. N., and DiFelice, G. S. (2017). The location of anterior cruciate ligament tears: A prevalence study using magnetic resonance imaging. Orthop. J. Sports Med. 5 (6), 232596711770996. doi:10.1177/2325967117709966

CrossRef Full Text | Google Scholar

Wang, L. J., Zeng, N., Yan, Z. P., Li, J. T., and Ni, G. X. (2020). Post-traumatic osteoarthritis following ACL injury. Arthritis Res. Ther. 22 (1), 57. doi:10.1186/s13075-020-02156-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, Y., Wang, N., Huang, N., Liu, X., Zheng, Y., Fu, Y., et al. (2021). Determining the invasiveness of ground-glass nodules using a 3D multi-task network. Eur. Radiol. 31 (9), 7162–7171. doi:10.1007/s00330-021-07794-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: artificial intelligence, deep learning, computer-assisted diagnosis, anterior cruciate ligament, localization, primary ACL repair, ACL reconstruction

Citation: Qu C, Yang H, Wang C, Wang C, Ying M, Chen Z, Yang K, Zhang J, Li K, Dimitriou D, Tsai T-Y and Liu X (2022) A deep learning approach for anterior cruciate ligament rupture localization on knee MR images. Front. Bioeng. Biotechnol. 10:1024527. doi: 10.3389/fbioe.2022.1024527

Received: 21 August 2022; Accepted: 14 September 2022;
Published: 30 September 2022.

Edited by:

Stephen Ferguson, ETH Zürich, Switzerland

Reviewed by:

Jan Kubicek, VSB-Technical University of Ostrava, Czechia
Yun Peng, NuVasive, United States

Copyright © 2022 Qu, Yang, Wang, Wang, Ying, Chen, Yang, Zhang, Li, Dimitriou, Tsai and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tsung-Yuan Tsai, dHl0c2FpQHNqdHUuZWR1LmNu; Xudong Liu, eGRsaXVAc2p0dS5lZHUuY24=

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

A deep learning approach for anterior cruciate ligament rupture localization on knee MR images

1 Introduction

2 Recent works

3 Materials and methods

3.1 MRI datasets

3.2 Fully automated anterior cruciate ligament rupture localization system

3.2.1 2D

3.2.2 3D

3.3 Definition of simplified classification of anterior cruciate ligament injury sites on our deep learning-based model

3.4 Implementation

3.5 Training and evaluation of the fully automated anterior cruciate ligament rupture localization system

3.6 Evaluation by clinical readers

3.7 Statistical analysis

4 Results

5 Discussion

6 Conclusion

Data availability statement

Ethics statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Supplementary material

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good