Compare three deep learning-based artificial intelligence models for classification of calcified lumbar disc herniation: a multicenter diagnostic study

Liu, Zhiming; Zhang, Hao; Zhang, Min; Qu, Changpeng; Li, Lei; Sun, Yihao; Ma, Xuexiao

doi:10.3389/fsurg.2024.1458569

STUDY PROTOCOL article

Front. Surg., 06 November 2024

Sec. Orthopedic Surgery

Volume 11 - 2024 | https://doi.org/10.3389/fsurg.2024.1458569

This article is part of the Research Topic Harnessing Artificial Intelligence for Multimodal Predictive Modeling in Orthopedic Surgery View all 6 articles

Compare three deep learning-based artificial intelligence models for classification of calcified lumbar disc herniation: a multicenter diagnostic study

$\r\nZhiming Liu$ Zhiming Liu¹

Hao Zhang¹

Min Zhang²

Changpeng Qu¹

Lei Li¹

Yihao Sun¹ $Xuexiao Ma \r\n$ Xuexiao Ma^1*

¹Department of Spine Surgery, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China
²Department of Neonatology, The Second Affiliated Hospital and Yuying Children’s Hospital of Wenzhou Medical University, Wenzhou, Zhejiang, China

Objective: To develop and validate an artificial intelligence diagnostic model for identifying calcified lumbar disc herniation based on lateral lumbar magnetic resonance imaging(MRI).

Methods: During the period from January 2019 to March 2024, patients meeting the inclusion criteria were collected. All patients had undergone both lumbar spine MRI and computed tomography(CT) examinations, with regions of interest (ROI) clearly marked on the lumbar sagittal MRI images. The participants were then divided into separate sets for training, testing, and external validation. Ultimately, we developed a deep learning model using the ResNet-34 algorithm model and evaluated its diagnostic efficacy.

Results: A total of 1,224 eligible patients were included in this study, consisting of 610 males and 614 females, with an average age of 53.34 ± 10.61 years. Notably, the test datasets displayed an impressive classification accuracy rate of 91.67%, whereas the external validation datasets achieved a classification accuracy rate of 88.76%. Among the test datasets, the ResNet34 model outperformed other models, yielding the highest area under the curve (AUC) of 0.96 (95% CI: 0.93, 0.99). Additionally, the ResNet34 model also exhibited superior performance in the external validation datasets, exhibiting an AUC of 0.88 (95% CI: 0.80, 0.93).

Conclusion: In this study, we established a deep learning model with excellent performance in identifying calcified intervertebral discs, thereby offering a valuable and efficient diagnostic tool for clinical surgeons.

1 Introduction

Calcified lumbar disc herniation (CLDH) is a specific subtype of lumbar disc herniation. This condition exhibits a relatively low prevalence rate and its underlying causes remain uncertain (1, 2). Long-term lumbar disc herniation exceeding six months can result in the calcification of the protruded nucleus pulposus (3). Consequently, the calcified nucleus pulposus tissue forms extensive adhesions with the dura mater and nerve roots, thereby posing a risk of these tissue tearing (4). Consequently, a significant number of CLDH patients present with pronounced neurological manifestations, encompassing symptoms such as lumbar and leg numbness, pain, and lower limb weakness. Despite attempts at conservative management involving pharmacological, physical, and restorative interventions, the observed effectiveness is often minimal, prompting surgical intervention (5).

Conventional management of CLDH often involves open surgical procedures (6). Nevertheless, advancements in spinal endoscopy equipment and ultrasonic osteotomy techniques have introduced the possibility of utilizing spinal endoscopic surgery for CLDH treatment. Precise determination of intervertebral disc calcification plays a vital role in devising effective treatment strategies. Presently, diagnosis primarily relies on CT values [ranging from 120.1 to 383.7 HU (7)] and histopathological examination involving the identification of calcification foci through microscopic analysis. However, the use of CT scanning poses potential risks of radiation-induced harm to individuals, including pregnant women (8–10), teenagers and children (11–13), and patients with thyroid diseases (14–16). Therefore, the presence of calcification in intervertebral discs significantly affects the use of surgical instruments, the management of intraoperative risks, the duration of surgery, the postoperative recovery, the management of postoperative pain, and the occurrence of postoperative complications. Consequently, the development of a precise, rapid, and non-invasive tool for identifying calcified intervertebral discs is of paramount significance.

In recent years, significant progress has been made in image recognition technology, thanks to the revival of large-scale annotated datasets (i.e., ImageNet) (17) and deep convolutional neural networks (CNNs) (18). ImageNet is a large-scale visual database that contains millions of annotated images, and the CNN models trained on this database are the cornerstone for significantly improving medical image classification problems (19).

Deep learning, a branch of machine learning, has made significant breakthroughs in recent years, particularly in image, language, and speech understanding. Unlike traditional machine learning methods, deep learning can automatically learn data features without the need for manual feature extraction. Deep learning models can handle various types of data and continue to improve with increasing data volume (20).

Deep learning is a type of machine learning method that uses neural network structures similar to those found in the human brain to learn complex patterns in data. CNN is a specific type of deep learning that is particularly suitable for processing data with grid-like structures, such as images (2D grids) and videos (3D grids) (21, 22).

Deep learning has demonstrated remarkable progress in the diagnosis of thyroid cancer (23), esophageal cancer, gastric cancer (24), and skin cancer (25), rivaling the expertise of experienced radiologists (26, 27). Deep learning, with its ability to analyze complex data, has made significant strides in genomics, offering solutions for predicting genetic risks (28, 29), identifying pathogenic mutations (30, 31), and utilizing biomarkers for early disease diagnosis and monitoring (32–34).

Among the fundamental models extensively employed in CNNs, the Residual Network (ResNet) has exhibited commendable performance in both object detection and image classification tasks (35).

The primary aim of this research is to employ deep learning techniques to develop an artificial intelligence-based model that utilizes lumbar spine sagittal MRI images for the precise identification and diagnosis of calcified intervertebral discs, thereby providing assistance to medical practitioners. The performance evaluation of the model will involve utilizing internal test datasets and external validation datasets, where lumbar spine CT scans and intervertebral disc pathology results will serve as the benchmark criteria. The ultimate objective of this study is to furnish clinicians with a rapid and accurate auxiliary diagnostic tool, thereby improving the diagnostic accuracy of calcified intervertebral discs and offering substantial support for disease treatment and patient recovery.

2 Materials and methods

2.1 Datasets

This study is a retrospective analysis conducted with the approval of the Ethics Committee of our hospital and the informed consent of the patients. The study cohort consisted of 1,224 individuals diagnosed with lumbar disc herniation at our hospital (n = 613), Qingdao Municipal Hospital (n = 376), and the Second Affiliated Hospital of Wenzhou Medical University (n = 235) from January 2019 to March 2024. All enrolled patients underwent routine lumbar MRI and CT scans.

The inclusion criteria were as follows: (1) Diagnosed patients with lumbar disc herniation (including calcified and non-calcified). (2) Lumbar intervertebral disc herniation at the L1-S1 segment, with symptoms lasting at least 3 months, ineffective after conservative treatment or recurrent episodes, and other criteria that meet the indications for surgery. (3) The MRI and CT scans for all patients’ lumbar regions are performed within a 4-week interval. Additionally, patients with calcified lumbar disc herniation must also meet the following criteria: (1) The range of CT value of calcified intervertebral discs in patients with lumbar disc herniation was measured to be between 120.1 and 383.7 Hounsfield units (HU) (7). (2) Histopathological analysis of the excised intervertebral disc tissues confirmed the presence of granular or patchy calcifications when observed under a microscope. Patients who meet all the above three criteria at the same time are finally included in the study.

The exclusion criteria were as follows: (1) Presence of primary or secondary bone tumors, lumbar spine infections, lumbar spine tuberculosis, and other related conditions. (2) History of previous lumbar spine surgeries. (3) suffering from lumbar scoliosis or severe deformity. (4) Poor image quality or low signal-to-noise ratio. The selection process for patients meeting the inclusion criteria is presented in Figure 1.

Figure 1

Figure 1. Workflow diagram for developing and evaluating deep learning models for intervertebral disc calcification classification.

In order to verify the accuracy of the deep learning model in clinical practice, 611 patients with age-, sex- and body mass index (BMI)-matched were collected. We employed Propensity Score Matching (PSM) to match 611 patients from the external validation set with 613 patients from the training set, constructing a logistic regression model to predict whether a patient belongs to the training or the external validation set. Using this model, we calculated the probability of each patient belonging to the external validation set, which is the propensity score. The propensity score was then used to match patients in the training dataset with those in the external validation dataset.

2.2 Radiological examination

Lumbar spine MRI images were obtained from all patients using a 3.0 T magnetic resonance system, while lumbar spine CT images were acquired using a 128-slice spiral CT scanner.

2.3 Image Reading and annotation

Two radiology professors with over 10 years of work experience participated in the identification of ROI. They used 3D Slicer (version 5.6.0) to identify the calcified intervertebral disc segments in the mid-sagittal CT images of the patients’ lumbar spine. The presence of intervertebral disc calcification in these segments was determined by a comprehensive assessment that combined the identification results of the radiology professors and the pathological results. Following this, the two radiology professors outlined a rectangular area as the ROI on the mid-sagittal MRI images of the lumbar spine for the corresponding segments of patients with intervertebral disc calcification. This area was centered on the intervertebral disc and extended 0.8–1.2 centimeters above and below. Any discrepancies were resolved through consensus-based discussions. Subsequently, the manually selected images were converted to PNG format with dimensions of 224*224 pixels in order to facilitate subsequent deep learning analysis (Figure 2).

Figure 2

Figure 2. The overall framework of deep learning. To begin with, the protruding intervertebral discs should be accurately delineated on the lateral lumbar MRI images, and the images should be converted to PNG format. Next, the intervertebral disc images need to be resized to 224*224 pixels to prepare them for input into the deep learning model. Subsequently, a classification model for lumbar intervertebral disc calcification will be established based on ResNet-34, DenseNet-121, and Mobilevit_s algorithms. The performance of the model will be evaluated using external validation datasets and the ROC curve. Ultimately, this deep learning model can serve as an assisting tool in the identification of lumbar inervertebral disc calcification.

For the non-calcification control group, the specific method for determining the ROI is as follows: In this study, a total of 422 patients were diagnosed with intervertebral disc calcification, with the number and proportion of each segment as follows: L1-2 (n = 8, 1.90%), L2-3 (n = 7, 1.66%), L3-4 (n = 10, 2.37%), L4-5 (n = 175, 41.47%), L5-S1 (n = 222, 52.60%). Therefore, based on the proportion of each segment in the calcification group, the number and proportion of each segment in the 802 non-calcification patients are as follows: L1-2 (n = 15, 1.87%), L2-3 (n = 13, 1.62%), L3-4 (n = 19, 2.37%), L4-5 (n = 333, 41.52%), L5-S1 (n = 422, 52.62%). The ROI determination method for the non-calcification group refers to the ROI determination method used for calcification, where on the mid-sagittal MRI image, a rectangular area is drawn with the intervertebral disc at the center, extending 0.8–1.2 cm above and below the disc.

The MR acquisition sequence in this study has a resolution of 0.2 millimeters, an Echo Time (TE) of 20 milliseconds, a Repetition Time (TR) of 1,000 milliseconds, and a Field of View (FOV) of 200 millimeters. The CT acquisition sequence in this study employs a tube voltage of 120 kilovolts (kVp), a milliampere-seconds (mAs) value of 200, a slice thickness of 5 millimeters, a field of view (FOV) of 500 millimeters, and a pitch of 1.0.

2.4 Model training

The patients within the development datasets were randomly allocated into training and validation sets at a ratio of 8:2. The training set was utilized to construct the deep learning model for calcified lumbar intervertebral discs, whereas the validation set served to assess the model's performance. The number of patients in each dataset is presented in Table 1.

Table 1

Table 1. Patient characteristics.

For this study, we utilized the ResNet-34 architecture model. The input images were adjusted to a resolution of 224*224 pixels and normalized using Mean = [0.485, 0.456, 0.406] and STD = [0.229, 0.224, 0.225]. We conducted hyperparameter tuning on the optimizer, learning rate, initial weights, image size, and batch size. For the optimizer, we evaluated Stochastic Gradient Descent (SGD) and Adam. For initial weights, we assessed the impact of normal distribution initial weights and selected the best weights from the training epochs (epochs = 150); for the learning rate, the search range for SGD was 0.0001, and default parameters were used for the Adam optimizer; for image size, the search range was 128, 256, and 512 pixels; for batch size, the search range was 2–32. The best performing model was ResNet-34, with the optimal optimizer being Adam (learning rate of 0.0001), using initial weights, and a batch size of 6. Furthermore, to enhance model accuracy, prevent overfitting, and improve training efficiency, we utilized a pre-trained ResNet-34 model on ImageNet as the foundation and fine-tuned it based on this. This strategy can reduce the amount of training data and accelerate the training process. Initializing the weights of the convolutional layers with a normal distribution can help the model escape local optima and improve its generalization capability.

Horizontal flipping is employed to augment and amplify original images, minimizing overfitting. This normalization ensures smooth convergence during training, boosting model stability and efficacy.

2.5 Validation of the model

The test datasets served to validate the model's performance. Additionally, external validation datasets from Qingdao Municipal Hospital, and the Second Affiliated Hospital of Wenzhou Medical University (all meeting the same inclusion and exclusion criteria as the training set) was used to assess the robustness of the model. Additionally, the DenseNet-121 and Mobilevit_s models, as well as the human recognition results for images, are compared with the ResNet-34 architecture model to determine the best model. For the manual identification process, the other two radiology experts analyzed the ROI on the previously obtained MRI images to determine whether the segment was calcified, and they were not aware of the CT images or pathological results beforehand.

2.6 Statistical analysis

Data analysis and model evaluation were performed using Python (version 3.7.0). The performance of the model was evaluated using ROC curves, AUC, confusion matrices, accuracy, sensitivity, specificity, precision, F1 scores, positive predictive values (PPVs), and negative predictive values (NPVs). The AUC values were calculated using the trapezoidal rule, and the optimal threshold was determined using the maximum Youden index method. The confusion matrices were calculated based on the true positives, false positives, true negatives, and false negatives. Continuous variables were presented as means ± standard deviations and were tested for normality using the Shapiro-Wilk test. Statistical significance was determined using the independent samples t-test for continuous variables and the chi-squared test for categorical variables. P < 0.05 was considered statistically significant.

3 Results

3.1 Population characteristics

A total of 1,224 patients were enrolled in this study, comprising 610 males and 614 females, with an average age of 53.34 ± 10.61 years. Out of the 1,224 patients, 802 had non-calcified intervertebral discs, whereas 422 exhibited calcified intervertebral discs (Table 1).

3.2 Establishment of deep learning model

After 87 training iterations, no further improvement in the accuracy of calcified intervertebral disc prediction was observed, indicating the completion of the training process.

3.3 Deep learning model performance on the different datasets

The ResNet-34 model showed the best performance in the binary classification of distinguishing whether the intervertebral disc is calcified, with an AUC of 0.96 (95% CI: 0.93, 0.99). The DenseNet-121 model achieved an AUC of 0.87 (95% CI: 0.81, 0.93), while the Mobilevit_s model yielded an AUC of 0.82 (95% CI: 0.75, 0.90). The AUC for human recognition is 0.652 (95% confidence interval: 0.587, 0.733).

Similar to the testing datasets, the ResNet-34 model also exhibited the highest performance in the external validation datasets, with an AUC of 0.88 (95% CI: 0.80, 0.93). The DenseNet-121 model obtained an AUC of 0.66 (95% CI: 0.57, 0.69), whereas the Mobilevit_s model achieved an AUC of 0.70 (95% CI: 0.66, 0.79) The AUC of human recognition is 0.41 (95% CI: 0.35, 0.58). (Table 2 and Figure 3).

Table 2

Table 2. The performance of deep learning on different datasets and different models.

Figure 3

Figure 3. The performance of deep learning models was assessed using internal and external datasets. ROC (A) and normalized confusion matrix (B) of the ResNet-34 model in the internal test data set. ROC (C) and normalized confusion matrix (D) of the DenseNet-121 model in the internal test data set. ROC (E) and normalized confusion matrix (F) of the Mobilevit_S model in the internal test data set. ROC (G) and normalized confusion matrix (H) of the ResNet-34 model in the external validation data set. ROC (I) and normalized confusion matrix (J) of the DenseNet-121 model in the external validation data set. ROC (K) and normalized confusion matrix (L) of the Mobilevit_S model in the external validation data set. Class1 = calcification datasets; Class0 = Non-calcification datasets.

4 Discussion

In this research, we have successfully devised a deep learning model capable of effectively discriminating intervertebral disc calcification. Our model exhibits commendable accuracy and specificity when tested on both internal and external datasets. Its implementation renders formidable assistance to surgeons in precisely diagnosing cases of calcified intervertebral discs and devising appropriate surgical strategies.

Recently, many experts and scholars have attempted to utilize the rapidly developing and widely used artificial intelligence (AI) methods to detect Lumbar disc herniation. Hou et al. have developed a deep learning framework to establish a classifier for diagnosing LDH, which is trained using semi-supervised learning approaches (36). Sustersic et al. have proposed a deep learning model based on convolutional neural networks (CNNs) for the automatic detection and classification of LDH through MRI images (37). Tsai et al. have used the YOLOv3 algorithm to achieve automatic detection of LDH by enhancing and feature-processing MRI images (38). Similar to these articles, this paper also adopts the ResNet architecture and CNN model, and the performance evaluation of the model employs the confusion matrix, accuracy, sensitivity, and ROC curve.

It is undeniable that MRI is less sensitive than CT in determining whether a disc is calcified, because MRI has lower contrast for calcified tissue, while CT scanning can directly display the calcified areas. Despite this, MRI has unique advantages in assessing the disc and its surrounding soft tissues: unlike CT, MRI does not use ionizing radiation, posing a smaller long-term health risk to patients; MRI can provide imaging in any plane, including sagittal, coronal, and axial views, which helps in a more comprehensive assessment of spinal structure; MRI provides better soft tissue contrast, aiding in the differentiation of structures such as the disc, vertebrae, ligaments, muscles, and nerves. The use of this model in conjunction with CT allows doctors to fully understand the patient's condition, thus enabling the creation of more precise and personalized surgical plans.

Alomari et al. conducted a clinical trial demonstrating the favorable effectiveness of T2-weighted lumbar spine sagittal MRI images in evaluating lumbar disc herniation (39). Consequently, we employed T2-weighted lumbar spine sagittal MRI images to construct our deep learning model, aiming to attain enhanced contrast and clearer anatomical features. Previous research has established deep learning models for diagnosing lumbar disc herniation, with their selected ROI encompassing the intervertebral disc as well as the adjacent superior and inferior vertebrae (40). Similarly, in this study, we adopted a similar approach, defining the ROI as the intervertebral disc's central portion along with a 0.8–1.2 cm segment of the surrounding superior and inferior vertebrae.

ResNet-34 (Residual Network-34) is a deep learning model that performs particularly well in image recognition and classification, containing 33 convolutional layers and 1 fully connected layer. A key feature of this network architecture is residual learning, which enables the network to learn the difference between input and output through skip connections, allowing it to be effectively trained even with a deep network. When using ResNet-34, it is typically pre-trained and then fine-tuned for specific tasks such as object classification, target detection, or semantic segmentation.The deep learning model established using ResNet-34 also demonstrated good image recognition performance, capable of distinguishing calcified and non-calcified discs effectively (41–43).

DenseNet-121 is a dense connection network with 121 convolutional layers, each of which is connected to all the previous layers. Due to the dense connections between layers, DenseNet can more efficiently utilize computational resources, reduce the number of parameters, and improve training speed. In addition, DenseNet can also enhance the accuracy of the model, especially in image recognition tasks (44).

MobileViT_s is a lightweight visual transformation model for mobile devices, serving as a variant of the MobileViT model. MobileViT is a model that combines the advantages of convolutional neural networks (CNNs) with the architecture of Transformers. It reduces the number of model parameters and computational complexity by employing depth-wise separable convolutions and mobile windowing mechanisms, while still maintaining the performance benefits of Transformers. Furthermore, the model reduces the size and computational demands, enabling efficient operation on mobile devices (45).

Many studies have demonstrated the effectiveness of deep learning algorithms in the diagnosis of various medical diseases, including breast cancer (46–48), brain tumors (49–51), etc. The results of these studies indicate that deep learning algorithms are promising tools for future medical diagnosis. However, deep learning algorithms also face some challenges in medical image analysis. Firstly, medical image data are typically high-dimensional and complex, requiring substantial computational resources and storage space. Secondly, the annotation process of medical image data often necessitates expertise and technology, consuming a significant amount of time and manpower. Therefore, how to optimize algorithms and improve annotation efficiency is an important research direction in the application of deep learning to medical image analysis. To address these issues, researchers have proposed several methods. For example, transfer learning techniques can be employed to initialize new models with already trained models, thus reducing the amount of data that needs to be annotated (52–54). Additionally, the use of multimodal learning techniques can combine different types of medical image data to enhance the diagnostic performance of models (55–57).

The future development in the field of this study can consider integrating other imaging characteristics for multimodal feature fusion. For instance, the inclusion of lumbar MRI coronal images and x-ray data into the model can be explored to investigate the correlations and complementarity between these different examination modalities, aiming to enhance the performance and accuracy of the model. Additionally, the application of weakly supervised learning methods can be pursued, such as utilizing unlabeled data for self-learning and techniques like transfer learning, to reduce the manual annotation workload and enhance the model's generalization capability. Furthermore, the formation and evolution of calcified intervertebral discs are temporal processes, thus it is worth considering incorporating time-series information into the model. By analyzing the trends and dynamic features of image sequences at different time points, a better understanding of the developmental patterns of calcified intervertebral discs can be gained, thereby improving the predictive ability of the model.

This article inevitably has some shortcomings. First, the number of included patients was limited, with only 1,224 cases. Second, the severity of the disease varied among the enrolled patients, regardless of whether there was intervertebral disc calcification or not, and the degree of lumbar disc protrusion also differed. Third, there was only one external validation datasets in this study, highlighting the importance of additional external validation datasets to verify the established model. The model may need further optimization and adjustment for adaptation to various clinical requirements in actual clinical applications.

5 Conclusion

We developed a CNN-based artificial intelligence model for high-accuracy MRI analysis of intervertebral discs, trained on a dataset of calcified and non-calcified scans. It offers surgeons a fast, reliable diagnostic tool, aiding early prediction and description of disc calcification for optimized treatment and outcomes.

Ethics statement

The studies involving humans were approved by the Ethics Committees of the Affiliated Hospital of Qingdao University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

ZL: Conceptualization, Data curation, Formal Analysis, Methodology, Software, Writing – original draft, Writing – review & editing. HZ: Formal Analysis, Methodology, Resources, Software, Visualization, Writing – original draft. MZ: Data curation, Formal Analysis, Investigation, Methodology, Writing – original draft. CQ: Investigation, Software, Writing – original draft. LL: Methodology, Software, Writing – original draft. YS: Methodology, Software, Writing – original draft. XM: Funding acquisition, Project administration, Supervision, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Qingdao Science and Technology Benefit the People Demonstration Project (No. 23-2-8-smjk-7-nsh).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Karamouzian S, Eskandary H, Faramarzee M, Saba M, Safizade H, Ghadipasha M, et al. Frequency of lumbar intervertebral disc calcification and angiogenesis, and their correlation with clinical, surgical, and magnetic resonance imaging findings. Spine (Phila Pa 1976). (2010) 8:881–86. doi: 10.1097/BRS.0b013e3181b9c986

PubMed Abstract | Crossref Full Text | Google Scholar

2. Cheng Y, Zhang Q, Li Y, Chen X, Wu H. Percutaneous endoscopic interlaminar discectomy for l5-s1 calcified lumbar disc herniation: a retrospective study. Front Surg. (2022) 9:998231. doi: 10.3389/fsurg.2022.998231

PubMed Abstract | Crossref Full Text | Google Scholar

3. Jasper GP, Francisco GM, Telfeian AE. Clinical success of transforaminal endoscopic discectomy with foraminotomy: a retrospective evaluation. Clin Neurol Neurosurg. (2013) 115:1961–65. doi: 10.1016/j.clineuro.2013.05.033

PubMed Abstract | Crossref Full Text | Google Scholar

4. Bostelmann R, Eicker S, Steiger HJ, Cornelius JF. Spontaneous disruption of dura mater and fascicular continuity of the l5 nerve root by a calcified disc herniation. Acta Neurochir (Wien). (2011) 153:1447–48. doi: 10.1007/s00701-011-1027-0

PubMed Abstract | Crossref Full Text | Google Scholar

5. Hahne AJ, Ford JJ, McMeeken JM. Conservative management of lumbar disc herniation with associated radiculopathy: a systematic review. Spine (Phila Pa 1976). (2010) 35:E488–504. doi: 10.1097/BRS.0b013e3181cc3f56

PubMed Abstract | Crossref Full Text | Google Scholar

6. Liu N, Chen Z, Qi Q, Li W, Guo Z. Circumspinal decompression and fusion through a posterior midline incision to treat central calcified thoracolumbar disc herniation: a minimal 2-year follow-up study with reconstruction ct. Eur Spine J. (2014) 23:373–81. doi: 10.1007/s00586-013-3054-4

PubMed Abstract | Crossref Full Text | Google Scholar

7. Dabo X, Ziqiang C, Yinchuan Z, Haijian N, Kai C, Yanbin L, et al. The clinical results of percutaneous endoscopic interlaminar discectomy (peid) in the treatment of calcified lumbar disc herniation: a case-control study. Pain Physician. (2016) 2:69–76.

Google Scholar

8. Markou P. Fetus radiation doses from nuclear medicine and radiology diagnostic procedures. Potential risks and radiation protection instructions. Hell J Nucl Med. (2007) 1:48–55.

Google Scholar

9. Mattsson S, Leide-Svegborn S, Andersson M. X-ray and molecular imaging during pregnancy and breastfeeding-when should we be worried? Radiat Prot Dosimetry. (2021) 195:339–48. doi: 10.1093/rpd/ncab041

PubMed Abstract | Crossref Full Text | Google Scholar

10. Fiebich M, Block A, Borowski M, Geworski L, Happel C, Kamp A, et al. Prenatal radiation exposure in diagnostic and interventional radiology. Rofo. (2020) 7:778–86. doi: 10.1055/a-1313-7527

PubMed Abstract | Crossref Full Text | Google Scholar

11. Pauwels EK, Bourguignon MH. Radiation dose features and solid cancer induction in pediatric computed tomography. Med Princ Pract. (2012) 21:508–15. doi: 10.1159/000337404

PubMed Abstract | Crossref Full Text | Google Scholar

12. Bosch DBGM, Thierry-Chef I, Harbron R, Hauptmann M, Byrnes G, Bernier MO, et al. Risk of hematological malignancies from ct radiation exposure in children, adolescents and young adults. Nat Med. (2023) 29:3111–19. doi: 10.1038/s41591-023-02620-0

PubMed Abstract | Crossref Full Text | Google Scholar

13. Pearce MS, Salotti JA, Little MP, McHugh K, Lee C, Kim KP, et al. Radiation exposure from ct scans in childhood and subsequent risk of leukaemia and brain tumours: a retrospective cohort study. Lancet. (2012) 380:499–505. doi: 10.1016/S0140-6736(12)60815-0

PubMed Abstract | Crossref Full Text | Google Scholar

14. Su YP, Niu HW, Chen JB, Fu YH, Xiao GB, Sun QF. Radiation dose in the thyroid and the thyroid cancer risk attributable to ct scans for pediatric patients in one general hospital of China. Int J Environ Res Public Health. (2014) 11:2793–803. doi: 10.3390/ijerph110302793

PubMed Abstract | Crossref Full Text | Google Scholar

15. Huda W, Spampinato MV, Tipnis SV, Magill D. Computation of thyroid doses and carcinogenic radiation risks to patients undergoing neck ct examinations. Radiat Prot Dosimetry. (2013) 156:436–44. doi: 10.1093/rpd/nct090

PubMed Abstract | Crossref Full Text | Google Scholar

16. Printz C. Radiation from computed tomography scans is associated with increased risk for thyroid cancer and leukemia. Cancer. (2020) 126:1601. doi: 10.1002/cncr.32863

PubMed Abstract | Crossref Full Text | Google Scholar

17. Deng J, Dong W, Socher R, Li L, Li K, Fei-Fei L. Imagenet: A Large-scale Hierarchical Image Database. IEEE CVPR (2009).

Google Scholar

18. Krizhevsky A, Sutskever I, Hinton GE. Imagenet Classification with Deep Convolutional Neural Networks. NIPS (2012).

Google Scholar

19. Shin HC, Roth HR, Gao M, Lu L, Xu Z, Nogues I, et al. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans Med Imaging. (2016) 35:1285–98. doi: 10.1109/TMI.2016.2528162

PubMed Abstract | Crossref Full Text | Google Scholar

20. Esteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K, et al. A guide to deep learning in healthcare. Nat Med. (2019) 25:24–9. doi: 10.1038/s41591-018-0316-z

PubMed Abstract | Crossref Full Text | Google Scholar

21. Simonyan K, Zisserman A. Two-stream convolutional networks for action recognition in videos. Proceedings of the Advances in Neural Information Processing Systems (2014). p. 567–75.

Google Scholar

22. Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE (1998).

Google Scholar

23. Li X, Zhang S, Zhang Q, Wei X, Pan Y, Zhao J, et al. Diagnosis of thyroid cancer using deep convolutional neural network models applied to sonographic images: a retrospective, multicohort, diagnostic study. Lancet Oncol. (2019) 20:193–201. doi: 10.1016/S1470-2045(18)30762-9

PubMed Abstract | Crossref Full Text | Google Scholar

24. Luo H, Xu G, Li C, He L, Luo L, Wang Z, et al. Real-time artificial intelligence for detection of upper gastrointestinal cancer by endoscopy: a multicentre, case-control, diagnostic study. Lancet Oncol. (2019) 20:1645–54. doi: 10.1016/S1470-2045(19)30637-0

PubMed Abstract | Crossref Full Text | Google Scholar

25. Razzak I, Naz S. Unit-vise: deep shallow unit-vise residual neural networks with transition layer for expert level skin cancer classification. IEEE/Acm Trans Comput Biol Bioinform. (2022) 19:1225–34. doi: 10.1109/TCBB.2020.3039358

PubMed Abstract | Crossref Full Text | Google Scholar

26. Gulshan V, Peng L, Coram M, Stumpe MC, Wu D, Narayanaswamy A, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. (2016) 316:2402–10. doi: 10.1001/jama.2016.17216

PubMed Abstract | Crossref Full Text | Google Scholar

27. Chen W, Liu X, Li K, Luo Y, Bai S, Wu J, et al. A deep-learning model for identifying fresh vertebral compression fractures on digital radiography. Eur Radiol. (2022) 32:1496–505. doi: 10.1007/s00330-021-08247-4

PubMed Abstract | Crossref Full Text | Google Scholar

28. Dudley JT, Listgarten J, Stegle O, Brenner SE, Parts L. Personalized medicine: from genotypes, molecular phenotypes and the quantified self, towards improved medicine. Pac Symp Biocomput. (2015):342–46.25592594

PubMed Abstract | Google Scholar

29. Xiong HY, Alipanahi B, Lee LJ, Bretschneider H, Merico D, Yuen RK, et al. Rna splicing. The human splicing code reveals new insights into the genetic determinants of disease. Science. (2015) 347:1254806. doi: 10.1126/science.1254806

PubMed Abstract | Crossref Full Text | Google Scholar

30. Kircher M, Witten DM, Jain P, O'Roak BJ, Cooper GM, Shendure J. A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet. (2014) 46:310–15. doi: 10.1038/ng.2892

PubMed Abstract | Crossref Full Text | Google Scholar

31. Quang D, Chen Y, Xie X. Dann: a deep learning approach for annotating the pathogenicity of genetic variants. Bioinformatics. (2015) 31:761–63. doi: 10.1093/bioinformatics/btu703

PubMed Abstract | Crossref Full Text | Google Scholar

32. Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, et al. Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface. (2018) 15(141):20170387. doi: 10.1098/rsif.2017.0387

PubMed Abstract | Crossref Full Text | Google Scholar

33. Angermueller C, Lee HJ, Reik W, Stegle O. Deepcpg: accurate prediction of single-cell dna methylation states using deep learning. Genome Biol. (2017) 18:67. doi: 10.1186/s13059-017-1189-z

PubMed Abstract | Crossref Full Text | Google Scholar

34. Chen Y, Li Y, Narayan R, Subramanian A, Xie X. Gene expression inference with deep learning. Bioinformatics. (2016) 32:1832–39. doi: 10.1093/bioinformatics/btw074

PubMed Abstract | Crossref Full Text | Google Scholar

35. Xu F, Xiong Y, Ye G, Liang Y, Guo W, Deng Q, et al. Deep learning-based artificial intelligence model for classification of vertebral compression fractures: a multicenter diagnostic study. Front Endocrinol (Lausanne). (2023) 14:1025749. doi: 10.3389/fendo.2023.1025749

PubMed Abstract | Crossref Full Text | Google Scholar

36. Ghosh S, Chaudhary V. Supervised methods for detection and segmentation of tissues in clinical lumbar MRI. Comput Med Imaging Graph. (2014) 38(7):639–49. doi: 10.1016/j.compmedimag.2014.03.005

PubMed Abstract | Crossref Full Text | Google Scholar

37. Sustersic T, Rankovic V, Milovanovic V, Kovacevic V, Rasulic L, Filipovic N. A deep learning model for automatic detection and classification of disc herniation in magnetic resonance images. IEEE J Biomed Health Inform. (2022) 26:6036–46. doi: 10.1109/JBHI.2022.3209585

PubMed Abstract | Crossref Full Text | Google Scholar

38. Tsai JY, Hung IY, Guo YL, Jan YK, Lin CY, Shih TT, et al. Lumbar disc herniation automatic detection in magnetic resonance imaging based on deep learning. Front Bioeng Biotechnol. (2021) 9:708137. doi: 10.3389/fbioe.2021.708137

PubMed Abstract | Crossref Full Text | Google Scholar

39. Alomari RS, Corso JJ, Chaudhary V, Dhillon G. Lumbar spine disc herniation diagnosis with a joint shape model. Lect Notes Comput Vis Biomech (2014) 17:87–98. doi: 10.1007/978-3-319-07269-2_8

Crossref Full Text | Google Scholar

40. Prisilla AA, Guo YL, Jan YK, Lin CY, Lin FY, Liau BY, et al. An approach to the diagnosis of lumbar disc herniation using deep learning models. Front Bioeng Biotechnol. (2023) 11:1247112. doi: 10.3389/fbioe.2023.1247112

PubMed Abstract | Crossref Full Text | Google Scholar

41. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016).

Google Scholar

42. He K, Zhang X, Ren S, Sun J. Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Transactions on Pattern Analysis and Machine Intelligence (2018).

Google Scholar

43. He K, Zhang H, Ren S, Sun J. Identity mappings in deep residual networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016).

Google Scholar

44. Zhang X, Liu H, Zhu Z, Xu Z. Learning to search efficient densenet with layer-wise pruning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020).

Google Scholar

45. Mehta S, Rastegari M. Mobilevit: light-weight, general-purpose, and mobile-friendly vision transformer. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2021).

Google Scholar

46. Din NMU, Dar RA, Rasool M, Assad A. Breast cancer detection using deep learning: datasets, methods, and challenges ahead. Comput Biol Med. (2022) 149:106073. doi: 10.1016/j.compbiomed.2022.106073

PubMed Abstract | Crossref Full Text | Google Scholar

47. Wang Y, Acs B, Robertson S, Liu B, Solorzano L, Wahlby C, et al. Improved breast cancer histological grading using deep learning. Ann Oncol. (2022) 33:89–98. doi: 10.1016/j.annonc.2021.09.007

PubMed Abstract | Crossref Full Text | Google Scholar

48. Witowski J, Heacock L, Reig B, Kang SK, Lewin A, Pysarenko K, et al. Improving breast cancer diagnostics with deep learning for mri. Sci Transl Med. (2022) 664:o4802. doi: 10.1126/scitranslmed.abo4802

PubMed Abstract | Crossref Full Text | Google Scholar

49. Badrigilan S, Nabavi S, Abin AA, Rostampour N, Abedi I, Shirvani A, et al. Deep learning approaches for automated classification and segmentation of head and neck cancers and brain tumors in magnetic resonance images: a meta-analysis study. Int J Comput Assist Radiol Surg. (2021) 16:529–42. doi: 10.1007/s11548-021-02326-z

PubMed Abstract | Crossref Full Text | Google Scholar

50. Munir K, Frezza F, Rizzi A. Deep learning hybrid techniques for brain tumor segmentation. Sensors (Basel). (2022) 2(21):8201. doi: 10.3390/s22218201

PubMed Abstract | Crossref Full Text | Google Scholar

51. Islam KT, Wijewickrema S, O'Leary S. A deep learning framework for segmenting brain tumors using MRI and synthetically generated CT images. Sensors (Basel). (2022) 22(2):523. doi: 10.3390/s22020523

PubMed Abstract | Crossref Full Text | Google Scholar

52. Zoetmulder R, Gavves E, Caan M, Marquering H. Domain- and task-specific transfer learning for medical segmentation tasks. Comput Methods Programs Biomed. (2022) 214:106539. doi: 10.1016/j.cmpb.2021.106539

PubMed Abstract | Crossref Full Text | Google Scholar

53. Karimi D, Warfield SK, Gholipour A. Transfer learning in medical image segmentation: new insights from analysis of the dynamics of model parameters and learned representations. Artif Intell Med. (2021) 116:102078. doi: 10.1016/j.artmed.2021.102078

PubMed Abstract | Crossref Full Text | Google Scholar

54. Aswiga RV, Shanthi AP. A multilevel transfer learning technique and lstm framework for generating medical captions for limited ct and dbt images. J Digit Imaging. (2022) 35:564–80. doi: 10.1007/s10278-021-00567-7

PubMed Abstract | Crossref Full Text | Google Scholar

55. Stahlschmidt SR, Ulfenborg B, Synnergren J. Multimodal deep learning for biomedical data fusion: a review. Brief Bioinform. (2022) 23(2):bbab569. doi: 10.1093/bib/bbab569

PubMed Abstract | Crossref Full Text | Google Scholar

56. Ming Y, Dong X, Zhao J, Chen Z, Wang H, Wu N. Deep learning-based multimodal image analysis for cervical cancer detection. Methods. (2022) 205:46–52. doi: 10.1016/j.ymeth.2022.05.004

PubMed Abstract | Crossref Full Text | Google Scholar

57. Pfau M, Kunzel SH, Pfau K, Schmitz-Valckenberg S, Fleckenstein M, Holz FG. Multimodal imaging and deep learning in geographic atrophy secondary to age-related macular degeneration. Acta Ophthalmol. (2023) 101:881–90. doi: 10.1111/aos.15796

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: calcifed lumbar disc herniation, deep learning, artificial intelligence, MRI, ResNet34

Citation: Liu Z, Zhang H, Zhang M, Qu C, Li L, Sun Y and Ma X (2024) Compare three deep learning-based artificial intelligence models for classification of calcified lumbar disc herniation: a multicenter diagnostic study. Front. Surg. 11:1458569. doi: 10.3389/fsurg.2024.1458569

Received: 2 July 2024; Accepted: 21 October 2024;
Published: 6 November 2024.

Edited by:

Babak Saravi, University of Freiburg Medical Center, Germany

Reviewed by:

Yuanpei Cheng, Jilin University, China
Sherwan Hamawandi, Hawler Medical University, Iraq

Copyright: © 2024 Liu, Zhang, Zhang, Qu, Li, Sun and Ma. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xuexiao Ma, bWF4dWV4aWFvc3BpbmFsQDE2My5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.