- 1Department of Orthopedics, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
- 2School of Life Sciences, Tsinghua University, Beijing, China
- 3Department of Biomedical Engineering, School of Medicine, Tsinghua University, Beijing, China
- 4Longwood Valley Medical Technology Co., Ltd., Beijing, China
Background: Currently, manual measurement of lumbosacral radiological parameters is time-consuming and laborious, and inevitably produces considerable variability. This study aimed to develop and evaluate a deep learning-based model for automatically measuring lumbosacral radiographic parameters on lateral lumbar radiographs.
Methods: We retrospectively collected 1,240 lateral lumbar radiographs to train the model. The included images were randomly divided into training, validation, and test sets in a ratio of approximately 8:1:1 for model training, fine-tuning, and performance evaluation, respectively. The parameters measured in this study were lumbar lordosis (LL), sacral horizontal angle (SHA), intervertebral space angle (ISA) at L4–L5 and L5–S1 segments, and the percentage of lumbar spondylolisthesis (PLS) at L4–L5 and L5–S1 segments. The model identified key points using image segmentation results and calculated measurements. The average results of key points annotated by the three spine surgeons were used as the reference standard. The model’s performance was evaluated using the percentage of correct key points (PCK), intra-class correlation coefficient (ICC), Pearson correlation coefficient (r), mean absolute error (MAE), root mean square error (RMSE), and box plots.
Results: The model’s mean differences from the reference standard for LL, SHA, ISA (L4–L5), ISA (L5–S1), PLS (L4–L5), and PLS (L5–S1) were 1.69°, 1.36°, 1.55°, 1.90°, 1.60%, and 2.43%, respectively. When compared with the reference standard, the measurements of the model had better correlation and consistency (LL, SHA, and ISA: ICC = 0.91–0.97, r = 0.91–0.96, MAE = 1.89–2.47, RMSE = 2.32–3.12; PLS: ICC = 0.90–0.92, r = 0.90–0.91, MAE = 1.95–2.93, RMSE = 2.52–3.70), and the differences between them were not statistically significant (p > 0.05).
Conclusion: The model developed in this study could correctly identify key vertebral points on lateral lumbar radiographs and automatically calculate lumbosacral radiographic parameters. The measurement results of the model had good consistency and reliability compared to manual measurements. With additional training and optimization, this technology holds promise for future measurements in clinical practice and analysis of large datasets.
1 Introduction
Low back pain is a common clinical symptom of lumbar diseases, affecting the quality of life and health of patients of all ages while imposing significant economic burdens on individuals, families, and governments (Hoy et al., 2012; Hong et al., 2013; Maher et al., 2017; Kim et al., 2019). According to research, low back pain is associated with lumbosacral instability (Panjabi, 2003). While many studies have focused on the treatment and prevention of lumbar diseases, there has been a shift toward researching the role of the lumbosacral sagittal alignment in lower back pain and lumbar diseases, emphasizing the importance of accurately measuring lumbosacral radiographic parameters (Kalidindi et al., 2022; Tartara et al., 2023).
The presence of various spinal disorders is linked to abnormalities in the spine’s sagittal alignment, making proper spinal sagittal alignment critical for quantitatively assessing spinal health (Scheer et al., 2013; Schwab et al., 2013; Liu et al., 2015; Brink et al., 2017). Spinal instability and the resulting compensatory reactions can put additional strain on important spinal structures, resulting in pain (Diebo et al., 2015; Lafage et al., 2017). Therefore, achieving proper sagittal plane alignment is a crucial treatment goal for spine surgeons. Accurately assessing and quantifying changes in lumbosacral radiographic parameters is critical for clinical diagnosis, treatment, surgical planning, and postoperative analysis of spinal diseases (Kumar et al., 2001; Schwab et al., 2010). However, current manual measurement method is time-consuming, rely on physician experience, and are susceptible to inter-observer and intra-observer variability, resulting in significant measurement errors that have an impact on clinical diagnosis and decision-making (Loder et al., 2004; Dang et al., 2005; Chan et al., 2014). Obviously, the traditional manual measurement method has failed to keep up with the advancement of imaging technology and the increase in the number of imaging examinations, making it difficult to meet the demand for accurate clinical diagnosis and treatment.
With the recent rapid development of artificial intelligence and its increasing integration into the field of orthopedics, using artificial intelligence technology to accurately process complex X-ray image data has emerged as a research trend (Wang et al., 2022). Recently, many studies have used models based on deep learning algorithms to measure various spinal parameters, improving the accuracy and speed of medical images analysis (McBee et al., 2018).
The goal of this study was to develop a deep learning-based model for automatically measuring lumbosacral radiographic parameters on lateral lumbar radiographs. Furthermore, this study will evaluate the performance of the model, which is expected to be an effective tool for replacing manual measurements if it achieves high accuracy and efficiency in measuring lumbosacral radiographic parameters.
2 Materials and methods
2.1 Date preparation
We retrospectively collected data from 2,853 patients who received standing lateral lumbar radiographs at the orthopedics outpatient department of Beijing Chaoyang Hospital between October 2022 and October 2023. The inclusion criteria were adult patients with vertebral endplate closure. The following exclusion criteria were used (Maher et al., 2017): a history of spinal surgery (Kim et al., 2019); severe spinal deformity (Hong et al., 2013); patients with metabolic bone disease, spinal fracture, tuberculosis, and tumors; and (Hoy et al., 2012) patients with poor X-ray image quality, severe osteophyte formation, or other factors affecting measurements. The hospital’s institutional review board and ethics committee approved this study. Furthermore, all aspects of this study conformed to the principles outlined in the Declaration of Helsinki.
A total of 1,240 lateral lumbar radiographs were included after screening based on the inclusion and exclusion criteria. These images were randomly divided into the training set, validation set and test sets in the ratio of 8:1:1. The test set (n = 124) was used to evaluate the final prediction performance of the model; the training set (n = 992) was used to train the model and optimize the model parameters; and the validation set (n = 124) was used to adjust the model hyperparameters and conduct a preliminary evaluation of the model performance. Figure 1 shows the flowchart of image screening.
2.2 Key point annotations
Three spine surgeons (S1, S2, and S3) received standardized professional training and annotated 1,240 lateral lumbar radiographs using the Labelme software for model training and validation. Figure 2 depicts the detailed process of annotating key points, naming vertebral key points, and measuring parameter values (Koslosky and Gendelberg, 2020; Morita et al., 2020; Zhou et al., 2022). All images were annotated independently by each spine surgeon, with no knowledge of the annotations of the others. The test set was re-annotated by the spine surgeon (S1) after 1 month to evaluate intra-observer reliability.
Figure 2. Annotating key points, naming vertebral key points, and measuring parameter values. (A) A typical example image annotated with Labelme software. (B) L1SA: Vertex of the anterior superior edge of L1 vertebra; L1SP: Vertex of the posterior superior edge of L1 vertebra; L4IA: Vertex of the anterior inferior edge of L4 vertebra; L4IP: Vertex of the posterior inferior edge of L4 vertebra; L5SA: Vertex of the anterior superior edge of L5 vertebra; L5SP: Vertex of the posterior superior edge of L5 vertebra; L5IA: Vertex of the anterior inferior edge of L5 vertebra; L5IP: Vertex of the posterior inferior edge of L5 vertebra; S1SA: Vertex of the anterior superior edge of S1 vertebra; S1SP: Vertex of the posterior superior edge of S1 vertebra. (C) Lumbar lordosis (LL): The angle between the tangent line of the upper endplate of the L1 vertebra and the tangent line of the upper endplate of the S1 vertebra. (D) Sacral horizontal angle (SHA): The angle between the tangent line of the upper endplate of the S1 vertebra and the horizontal line. (E) Intervertebral space angle (ISA): The angle between the tangent line of the lower endplate of the upper vertebra and the tangent line of the upper endplate of the lower vertebra. (F) Percentage of lumbar spondylolisthesis (PLS): Measure the distance (N) between the extension line of the posterior edge of the upper vertebra and the extension line of the posterior edge of the lower vertebra, and then measure the distance (M) between two points on the anterior and posterior edges of the upper endplate of the lower vertebra. Percentage of lumbar spondylolisthesis = N/M × 100%. Because lumbar instability is most common at the L4–L5 and L5–S1 segments, this study only measures ISA and PLS at the L4–L5 and L5–S1 segments.
2.3 Training model
First, lateral lumbar radiographs were annotated with the Labelme software. The model was then trained with the segmentation network model based on RADMFNet algorithm (described in greater detail later in Section 2.4 of the article). Histogram enhancement, random Gamma transformation, and random rotation of the original and annotated images were used to increase the number of samples and improve the robustness of the model (Shin et al., 2020). Finally, the corner detection algorithm was used to process the segmentation results and identify the corresponding key points. The model then computed LL, SHA, ISA(L4–L5), ISA(L5–S1), PLS(L4–L5), and PLS(L4–L5) using the positions of these key points. The training process of the model is shown in Figure 3.
2.4 Segmentation network
To create a fast and accurate segmentation network model, this study used dilated convolution (Zhang et al., 2020), ResNet (Tian et al., 2022), attention mechanism (Rondinella et al., 2023), multi-scale feature fusion (Gao et al., 2023), and other technologies. Figure 4A illustrates the structure of the segmentation network model.
Figure 4. The segmentation network model. (A) Structure of the segmentation network. (B) Structure of the backbone network based on the multi-scale attention mechanism. (C) Network structure of the fusion module.
First, the images were fed into the backbone network, which used the multi-scale attention mechanism to extract image features. Figure 4B shows the structure of the backbone network based on the multi-scale attention mechanism. The backbone network consisted of three distinct scale branches. Among them, the 3 × 3 Convolution + BN + SiLU module was utilized for feature extraction. The Res–Attention Block was primarily improved based on the SE–Resnet structure. The two consecutive FC layers (fully connected layers) of SE–Resnet were converted to 1 × 1 convolutions, significantly increasing the training efficiency of the network. Meanwhile, the activation function was changed from ReLU to SiLU, increasing the sensitivity of the model to edge information. The fusion module fused the feature maps from multiple scales, reducing the risk of overfitting and improving the generalization ability of the model. Figure 4C depicts the network structure of the fusion module. First, the multi-scale feature maps were unified to the same scale using the sampling operation. The feature maps were then concatenated, and 1 × 1 convolution was used to perform a preliminary fusion of inter-channel information. To ensure the integrity of detailed information, the preliminary fusion result was added to the feature map of the minimum scale and subjected to 3 × 3 convolution operations, effectively completing the feature fusion.
Next, the feature maps of the backbone network were processed by two branches: the Residual-based Dilated Convolutional Module and the Convolutional Feature Extraction Module. The Residual-based Dilated Convolutional Module must perform three dilated convolution operations on the feature maps: 1 × 1, 3 × 3, and 5 × 5. Dilated convolution broadened the receptive field, allowing for a more comprehensive perception of image features. To achieve a more powerful feature expression, the feature maps processed by each dilated convolution operation were concatenated and then subjected to a 1 × 1 convolution operation to fuse the features. Meanwhile, the information on the feature maps of the backbone network was better preserved by referencing the residuals, which avoided problems like gradient vanishing and improved the robustness of the model. The Convolutional Feature Extraction Module primarily re-extracted the image features obtained by the backbone network. It then combined the re-extracted image features with the feature maps from the Residual-based Dilated Convolutional Module to generate the segmentation result.
2.5 Identifying key points
The identification of key points was based on the segmentation results of the images. Figure 5 illustrates a flowchart for identifying key points. First, the lumbar vertebra and sacrum were extracted from the segmentation results by connecting regions. The Harris corner detection method was then applied to identify key points (Harris and Stephens, 1988). Harris corner detection was used to detect corners in images. First, this algorithm calculated the gradient at each pixel in images and the degree of gradient change in the neighborhood around each pixel. Then, it evaluated the direction and intensity of grayscale changes at each pixel by computing the covariance matrix. Based on the eigenvalues of the covariance matrix, the algorithm calculated the corner response function to determine whether the pixel was a corner. Finally, the pixel with the largest response value was selected by the method of non-maximum suppression to obtain the final corner point detection result.
2.6 Statistical analysis
SPSS (version 27.00; Chicago, Illinois, United States) was used for statistical analysis. The distribution of the demographic information of the patients across the training set, validation set, and test sets was represented with descriptive statistics. A p < 0.05 was considered statistically significant.
2.6.1 Reliability of the key point annotations
To evaluate inter-observer and intra-observer reliability of key point annotations, percentages of key points within distance thresholds of 1, 2, 3, 4, and 5 mm were calculated.
2.6.2 Segmentation performance
The accuracy and Dice coefficient were used to evaluate the segmentation performance of the model. The segmentation results for the lumbar and sacral regions were evaluated separately. We further compared the segmentation performance of our model with four other existing models, including UNet, Att-UNet, UNet 3+, and TransUNet. All models were trained on the same dataset.
2.6.3 Performance of key point prediction
The performance of the model in predicting key points was evaluated using the PCK. The average of the results annotated by three spine surgeons was the reference standard. PCK denoted the percentage of predicted key points that were within a radius r of the reference standard.
2.6.4 Measurement performance of the model
We compared and evaluated the measured values of the model with the average measured values of the three spine surgeons. We calculated the intra-class correlation coefficient (ICC), Pearson correlation coefficient (r), mean difference (MD), standard deviation (SD), mean absolute error (MAE), and root mean square error (RMSE) between the reference standard and model estimates to evaluate the performance of the model. ICC is an indicator of consistency, and ICC ≥ 0.75 is deemed sufficiently reliable. |r| ≥ 0.7 indicates a strong correlation. Additionally, box plots were used to show the distribution of error values between the measured values of the model and reference standard.
3 Results
3.1 Demographic data
We obtained 1,240 lateral lumbar radiographs (male-to-female ratio of 1:1). They were divided into three sets: 80% for training, 10% for validation, and 10% for test. The included data sets showed no significant differences in gender composition or age. The Demographic data distribution is shown in Table 1.
3.2 Reliability of the key point annotations
The percentage of intra-observer annotation distance within the 3 mm threshold was 96%, while the inter-observer annotation distance within the 3 mm threshold was 83% (S1 and S2), 81% (S1 and S3), and 80% (S2 and S3), respectively (Table 2).
3.3 Segmentation performance
For lumbar segmentation, the Dice coefficient and accuracy were 0.962 and 0.947, respectively. For sacrum segmentation, the Dice coefficient and accuracy were 0.954 and 0.939, respectively. The segmentation results for the lumbar region were better than those for the sacrum. Table 3 and Figure 6 showed that the segmentation performance of our model was better than the other four existing models.
Figure 6. The comparison of the segmentation results of our model with UNet, Att-UNet, UNet 3+, and TransUNet.
3.4 Performance of key point prediction
Table 4 shows that the PCK within the 3-mm distance threshold ranged from 88% to 98%.
3.5 Measurement performance of the model
Comparing the measured values of the model with the reference standard, the result indicated that the reference standards for LL, SHA, ISA(L4–L5), ISA(L5–S1), PLS(L4–L5), PLS(L5–S1) were 49.77° ± 7.82°, 38.09° ± 6.85°, 14.86° ± 4.07°, 19.15° ± 8.07°, 12.27% ± 10.31%, and 13.36% ± 9.25%, and the measured values of the model were 49.20° ± 7.03°, 37.79° ± 6.53°, 14.15° ± 3.78°, 18.86° ± 7.63°, 12.45% ± 10.38%, and 13.58% ± 10.75%, respectively. The differences between them were not statistically significant (p > 0.05), as shown in Table 5.
The results of our study found that the measured values of the model for lumbosacral radiographic parameters were consistent and reliable (LL, SHA, and ISA: ICC = 0.91–0.97, r = 0.91–0.96, MAE = 1.89–2.47, RMSE = 2.32–3.12; PLS: ICC = 0.90–0.92, r = 0.90–0.91, MAE = 1.95–2.93, RMSE = 2.52–3.70), as shown in Table 6. In addition, to visually demonstrate the robustness of the algorithm, statistical analyses were performed on the maximum, upper quartile, median, lower quartile, and minimum errors between the measured value of the model and the reference standard for the lumbar angular parameters and PLS, as illustrated in Figure 7.
Figure 7. The box plots illustrate the distribution of error values between the measured values of the model and reference standard for the parameters of lumbar spine angle (A) and PLS (B).
4 Discussion
Accurate measurement of lumbosacral radiographic parameters of the lumbar spine is critical for biomechanics evaluation, clinical diagnosis, surgical planning, and prognosis prediction of lumbar diseases (Sparrey et al., 2014; Azimi et al., 2021; Schlösser et al., 2021). However, manual measurement of these parameters is time-consuming and laborious, and inevitably produces considerable variability. This study aimed to develop an accurate artificial intelligence automated measurement technique that could recognize and segment the lumbar and sacrum on lateral lumbar radiographs, as well as automatically measure lumbosacral radiographic parameters.
Our model had excellent segmentation quality and precise measurement of lumbosacral radiographic parameters. The Dice coefficient and accuracy of lumbar segmentation were 0.962 and 0.947, respectively. For sacrum segmentation, the Dice coefficient and accuracy were 0.954 and 0.939, respectively. Our study found that (Maher et al., 2017): The model accurately and automatically identified the key points, with the PCK ranging from 88% to 98% within the 3-mm distance thresholds (Kim et al., 2019); The ICC (MAE) for LL, SHA, and ISA ranged from 0.91 to 0.97 (1.89–2.47), and the ICC (MAE) for PLS ranged from 0.90 to 0.92 (1.95–2.93), which was comparable to or better than spine surgeons. The excellent performance of the model in measuring PLS is noteworthy. Due to the small volume of vertebrae, slight errors can cause significant changes in PLS values, leading to incorrect staging of lumbar spondylolisthesis. It is challenging to accurately quantify the severity of lumbar spondylolisthesis.
The traditional manual measurement method is based on the experience and judgment of the measurer, which results in inter-observer and intra-observer variability (Loder et al., 2004; Dang et al., 2005; Chan et al., 2014). Studies have shown that measurement errors can range from 3° to 10° (Loder et al., 2004; Mok et al., 2008). Chen et al. (2010) found that it was clinically significant to perform biomechanical or clinical analysis when the average distance between key points annotated by different observers was less than 3 mm. In this study, the percentage of key points annotated by different spine surgeons within the 3 mm threshold ranged from 81% to 83%, while the PCK predicted by the model within a 3 mm distance threshold ranged from 88% to 98%. This indicated that the measurement results of the model exceeded those of our spinal surgeons.
The segmentation network model developed in this study combined dilated convolution, RestNet, attention mechanism, multi-scale feature fusion, and other methods that offered high speed and high accuracy. We found that this model obtained these measurements much faster than spine surgeons, with the model obtaining measurements in 0.5 s instead of several minutes for spine surgeons. In addition, we used histogram enhancement, random Gamma transform, and random rotation on the images to increase data volume and improve the performance of the model. However, in the L5–S1 region, PCK predicted by the model within the 1–2 mm threshold was relatively poor due to the overlap of the iliac, lumbar, and sacrum on the lateral lumbar radiographs. This also explained why the segmentation results for the lumbar region were superior those for the sacrum. For the image overlap problem, we intend to manually adjust the results based on the prediction of the model and incorporate them into the training set to continuously optimize the algorithm.
Many studies had used deep learning-based models to automatically measure spinal parameters (Pang et al., 2019; Wang et al., 2019; Huang et al., 2020; Korez et al., 2020; Schwartz et al., 2021; Vrtovec and Ibragimov, 2022). Schwartz et al. (2021) used MultiResUNet for image segmentation and spinopelvic parameter calculations from lateral lumbar radiographs. The algorithm developed in their study worked well in segmenting images, with an overall Dice coefficient and an accuracy of 0.951 and 0.936, respectively. Referring to a systematic review published by Vrtovec and Ibragimov (2022), it was found that our model based on the RADMFNet algorithm outperformed models based on the UNet, Mask R-CNN and MultiResUNet algorithms in terms of segmentation performance. Korez et al. (2020) conducted a study using RetinaNet and U-Net algorithms to collect measurements in the sagittal plane. Their study included patients with internal spinal fixation devices; however, the proportion of images with internal fixation devices was not reported, and no subgroup analyses were performed to determine the effect of internal fixation devices on model performance. It is worth noting that most studies used annotations from a single or two observers (Pang et al., 2019; Wang et al., 2019; Huang et al., 2020; Korez et al., 2020; Vrtovec and Ibragimov, 2022), whereas our study used annotations from three observers, thus constructing a more reliable reference standard. To achieve high accuracy, it is essential to train the model on a large dataset. However, collecting and annotating images can be a time-consuming and expensive process (Willemink et al., 2020). In the future, while expanding the dataset, the training set can be enriched by using data enhancement (applying image flipping, panning, rotating, cropping, and intensity transformations), ensemble learning (training multiple models and then combining their results), or synthetic case generation (for example, using generative adversarial networks) (Shin et al., 2020; Vrtovec and Ibragimov, 2022).
Although this study has made advancements in automatically measuring lumbosacral radiographic parameters, some limitations remain. First, the training set for this study consisted of 992 lateral lumbar radiographs. For complex spinal diseases and clinical settings, this amount of data is insufficient. Furthermore, we included patients with an uneven age distribution and all from the same hospital, making it impossible to determine whether the performance of the model was influenced by age, X-ray machines from different hospitals, or variations in imaging acquisition techniques. In the future, we plan to use a larger, more diverse, and multicenter cohort to further train the model, increasing its clinical utility. Third, our model is currently unable to identify lumbosacral transitional vertebrae. One reason for this is that spine surgeons do not accurately identify and annotate lumbosacral transitional vertebrae. Another reason is that our algorithm has not yet incorporated the function to identify anatomical variations and abnormalities. Future studies could further include data from lumbosacral transitional vertebrae to determine whether the performance of the model is affected by both anatomical variations and abnormalities, allowing the model to be improved even further. Finally, a limitation of this algorithm is the error handling. The box plots demonstrated that the overall extreme error values of ISA and PLS were larger in the L5–S1 segment than those in the L4–L5 segment. Because in the fifth lumbar vertebra and sacral regions, segmentation defects may occur and the segmentation quality of the model was poor, leading to inaccurate measurements. We should incorporate error handling functionality into the algorithm to prevent the generation of highly erroneous measurement results when key points are not correctly identified. So far, such error handling capability has only been proposed in the study by Schwartz et al. (2021).
5 Conclusion
In summary, we had developed a model that could accurately identify vertebral key points and automatically calculate lumbosacral radiographic parameters. This model measured LL, SHA, ISA, and PLS on lumbar lateral radiographs with high accuracy and speed. Furthermore, compared to manual measurements, the measurements of the model were more consistent and reliable. The automatic measurement of lumbosacral radiographic parameters is anticipated to have a significant impact on spinal surgery in the coming years. In clinical practice, our model can help spinal surgeons save time and effort when measuring radiographic parameters. Besides, the model can be applied in research settings to conduct large-scale studies on lumbar anatomical parameters. In future research, we intend to include additional datasets and disease categories to improve the accuracy and stability of the model, as well as to continuously explore and improve the model’s algorithms.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the ethics committees of the Beijing Chaoyang Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.
Author contributions
SY: Conceptualization, Methodology, Writing–original draft. RC: Conceptualization, Methodology, Writing–original draft. XL: Conceptualization, Methodology, Writing–original draft. TW: Investigation, Writing–review and editing. AW: Investigation, Writing–review and editing. NF: Investigation, Writing–review and editing. PD: Investigation, Writing–review and editing. YX: Investigation, Writing–review and editing. ZG: Data curation, Formal Analysis, Writing–review and editing. YZ: Conceptualization, Project administration, Supervision, Writing–review and editing. LZ: Conceptualization, Funding acquisition, Project administration, Supervision, Writing–review and editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This study was supported by Beijing Natural Science Foundation (7242059), Beijing Hospitals Authority Clinical medicine Development of special funding support (YGLX202305), and key medical disciplines of Shijingshan district (2023006).
Acknowledgments
The authors would like to thank the reviewers for their helpful remarks.
Conflict of interest
Authors ZG and YZ were employed by Longwood Valley Medical Technology Co., Ltd.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Azimi, P., Yazdanian, T., Benzel, E. C., Hai, Y., and Montazeri, A. (2021). Sagittal balance of the cervical spine: a systematic review and meta-analysis. Eur. Spine J. 30 (6), 1411–1439. doi:10.1007/s00586-021-06825-0
Brink, R. C., Colo, D., Schlösser, T. P. C., Vincken, K. L., van Stralen, M., Hui, S. C. N., et al. (2017). Upright, prone, and supine spinal morphology and alignment in adolescent idiopathic scoliosis. Scoliosis Spinal Disord. 12, 6. doi:10.1186/s13013-017-0111-5
Chan, A. C., Morrison, D. G., Nguyen, D. V., Hill, D. L., Parent, E., and Lou, E. H. (2014). Intra- and interobserver reliability of the cobb angle-vertebral rotation angle-spinous process angle for adolescent idiopathic scoliosis. Spine Deform. 2 (3), 168–175. doi:10.1016/j.jspd.2014.02.006
Chen, H. C., Lin, C. J., Wu, C. H., Wang, C. K., and Sun, Y. N. (2010). Automatic Insall-Salvati ratio measurement on lateral knee x-ray images using model-guided landmark localization. Phys. Med. Biol. 55 (22), 6785–6800. doi:10.1088/0031-9155/55/22/012
Dang, N. R., Moreau, M. J., Hill, D. L., Mahood, J. K., and Raso, J. (2005). Intra-observer reproducibility and interobserver reliability of the radiographic parameters in the spinal deformity study group’s AIS radiographic measurement manual. Spine (Phila Pa 1976) 30 (9), 1064–1069. doi:10.1097/01.brs.0000160840.51621.6b
Diebo, B. G., Varghese, J. J., Lafage, R., Schwab, F. J., and Lafage, V. (2015). Sagittal alignment of the spine: what do you need to know? Clin. Neurol. Neurosurg. 139, 295–301. doi:10.1016/j.clineuro.2015.10.024
Gao, Y., Zhang, S., Zuo, D., Yan, W., and Pan, X. (2023). TMNet: a two-branch multi-scale semantic segmentation network for remote sensing images. Sensors (Basel) 23 (13), 5909. doi:10.3390/s23135909
Harris, C., and Stephens, M. (1988). A combined corner and edge detector. Proc. AVC 23.1–23.6. doi:10.5244/C.2.23
Hong, J., Reed, C., Novick, D., and Happich, M. (2013). Costs associated with treatment of chronic low back pain: an analysis of the UK General Practice Research Database. Spine (Phila Pa 1976) 38 (1), 75–82. doi:10.1097/BRS.0b013e318276450f
Hoy, D., Bain, C., Williams, G., March, L., Brooks, P., Blyth, F., et al. (2012). A systematic review of the global prevalence of low back pain. Arthritis Rheum. 64 (6), 2028–2037. doi:10.1002/art.34347
Huang, J., Shen, H., Wu, J., Hu, X., Zhu, Z., Lv, X., et al. (2020). Spine Explorer: a deep learning based fully automated program for efficient and reliable quantifications of the vertebrae and discs on sagittal lumbar spine MR images. Spine J. 20 (4), 590–599. doi:10.1016/j.spinee.2019.11.010
Kalidindi, K. K. V., Sangondimath, G., Bansal, K., Vishwakarma, G., and Chhabra, H. S. (2022). Introduction of a novel "segmentation line" to analyze the variations in segmental lordosis, location of the lumbar apex, and their correlation with spinopelvic parameters in asymptomatic adults. Asian Spine J. 16 (4), 502–509. doi:10.31616/asj.2021.0006
Kim, L. H., Vail, D., Azad, T. D., Bentley, J. P., Zhang, Y., Ho, A. L., et al. (2019). Expenditures and health care utilization among adults with newly diagnosed low back and lower extremity pain. JAMA Netw. Open 2 (5), e193676. doi:10.1001/jamanetworkopen.2019.3676
Korez, R., Putzier, M., and Vrtovec, T. (2020). A deep learning tool for fully automated measurements of sagittal spinopelvic balance from X-ray images: performance evaluation. Eur. Spine J. 29 (9), 2295–2305. doi:10.1007/s00586-020-06406-7
Koslosky, E., and Gendelberg, D. (2020). Classification in brief: the meyerding classification system of spondylolisthesis. Clin. Orthop. Relat. Res. 478 (5), 1125–1130. doi:10.1097/corr.0000000000001153
Kumar, M. N., Baklanov, A., and Chopin, D. (2001). Correlation between sagittal plane changes and adjacent segment degeneration following lumbar spine fusion. Eur. Spine J. 10 (4), 314–319. doi:10.1007/s005860000239
Lafage, R., Liabaud, B., Diebo, B. G., Oren, J. H., Vira, S., Pesenti, S., et al. (2017). Defining the role of the lower limbs in compensating for sagittal malalignment. Spine (Phila Pa 1976) 42 (22), E1282–E1288. doi:10.1097/brs.0000000000002157
Liu, H., Li, S., Zheng, Z., Wang, J., Wang, H., and Li, X. (2015). Pelvic retroversion is the key protective mechanism of L4-5 degenerative spondylolisthesis. Eur. Spine J. 24 (6), 1204–1211. doi:10.1007/s00586-014-3395-7
Loder, R. T., Spiegel, D., Gutknecht, S., Kleist, K., Ly, T., and Mehbod, A. (2004). The assessment of intraobserver and interobserver error in the measurement of noncongenital scoliosis in children ≤ 10 Years of age. Spine (Phila Pa 1976) 29 (22), 2548–2553. doi:10.1097/01.brs.0000144828.72721.d8
Maher, C., Underwood, M., and Buchbinder, R. (2017). Non-specific low back pain. Lancet. 389 (10070), 736–747. doi:10.1016/s0140-6736(16)30970-9
McBee, M. P., Awan, O. A., Colucci, A. T., Ghobadi, C. W., Kadom, N., Kansagra, A. P., et al. (2018). Deep learning in radiology. Acad. Radiol. 25 (11), 1472–1480. doi:10.1016/j.acra.2018.02.018
Mok, J. M., Berven, S. H., Diab, M., Hackbarth, M., Hu, S. S., and Deviren, V. (2008). Comparison of observer variation in conventional and three digital radiographic methods used in the evaluation of patients with adolescent idiopathic scoliosis. Spine (Phila Pa 1976) 33 (6), 681–686. doi:10.1097/BRS.0b013e318166aa8d
Morita, T., Yoshimoto, M., Terashima, Y., Tanimoto, K., Iesato, N., Ogon, I., et al. (2020). Do we have adequate flexion-extension radiographs for evaluating instability in patients with lumbar spondylolisthesis? Spine (Phila Pa 1976) 45 (1), 48–54. doi:10.1097/brs.0000000000003203
Pang, S., Su, Z., Leung, S., Nachum, I. B., Chen, B., Feng, Q., et al. (2019). Direct automated quantitative measurement of spine by cascade amplifier regression network with manifold regularization. Med. Image Anal. 55, 103–115. doi:10.1016/j.media.2019.04.012
Panjabi, M. M. (2003). Clinical spinal instability and low back pain. J. Electromyogr. Kinesiol 13 (4), 371–379. doi:10.1016/s1050-6411(03)00044-0
Rondinella, A., Crispino, E., Guarnera, F., Giudice, O., Ortis, A., Russo, G., et al. (2023). Boosting multiple sclerosis lesion segmentation through attention mechanism. Comput. Biol. Med. 161, 107021. doi:10.1016/j.compbiomed.2023.107021
Scheer, J. K., Tang, J. A., Smith, J. S., Acosta, F. L., Protopsaltis, T. S., Blondel, B., et al. (2013). Cervical spine alignment, sagittal deformity, and clinical implications: a review. J. Neurosurg. Spine 19 (2), 141–159. doi:10.3171/2013.4.Spine12838
Schlösser, T. P. C., Castelein, R. M., Grobost, P., Shah, S. A., and Abelin-Genevois, K. (2021). Specific sagittal alignment patterns are already present in mild adolescent idiopathic scoliosis. Eur. Spine J. 30 (7), 1881–1887. doi:10.1007/s00586-021-06772-w
Schwab, F., Patel, A., Ungar, B., Farcy, J. P., and Lafage, V. (2010). Adult spinal deformity-postoperative standing imbalance: how much can you tolerate? An overview of key parameters in assessing alignment and planning corrective surgery. Spine (Phila Pa 1976) 35 (25), 2224–2231. doi:10.1097/BRS.0b013e3181ee6bd4
Schwab, F. J., Blondel, B., Bess, S., Hostin, R., Shaffrey, C. I., Smith, J. S., et al. (2013). Radiographical spinopelvic parameters and disability in the setting of adult spinal deformity: a prospective multicenter analysis. Spine (Phila Pa 1976) 38 (13), E803–E812. doi:10.1097/BRS.0b013e318292b7b9
Schwartz, J. T., Cho, B. H., Tang, P., Schefflein, J., Arvind, V., Kim, J. S., et al. (2021). Deep learning automates measurement of spinopelvic parameters on lateral lumbar radiographs. Spine (Phila Pa 1976) 46 (12), E671–E678. doi:10.1097/brs.0000000000003830
Shin, Y., Han, K., and Lee, Y. H. (2020). Temporal trends in cervical spine curvature of south Korean adults assessed by deep learning system segmentation, 2006-2018. JAMA Netw. Open 3 (10), e2020961. doi:10.1001/jamanetworkopen.2020.20961
Sparrey, C. J., Bailey, J. F., Safaee, M., Clark, A. J., Lafage, V., Schwab, F., et al. (2014). Etiology of lumbar lordosis and its pathophysiology: a review of the evolution of lumbar lordosis, and the mechanics and biology of lumbar degeneration. Neurosurg. Focus 36 (5), E1. doi:10.3171/2014.1.Focus13551
Tartara, F., Garbossa, D., Armocida, D., Di Perna, G., Ajello, M., Marengo, N., et al. (2023). Relationship between lumbar lordosis, pelvic parameters, PI-LL mismatch and outcome after short fusion surgery for lumbar degenerative disease. Literature review, rational and presentation of public study protocol: RELApSE study (registry for evaluation of lumbar arthrodesis sagittal alignEment). World Neurosurg. X 18, 100162. doi:10.1016/j.wnsx.2023.100162
Tian, T., Wang, L., Luo, M., Sun, Y., and Liu, X. (2022). ResNet-50 based technique for EEG image characterization due to varying environmental stimuli. Comput. Methods Programs Biomed. 225, 107092. doi:10.1016/j.cmpb.2022.107092
Vrtovec, T., and Ibragimov, B. (2022). Spinopelvic measurements of sagittal balance with deep learning: systematic review and critical evaluation. Eur. Spine J. 31 (8), 2031–2045. doi:10.1007/s00586-022-07155-5
Wang, L., Wang, H., Huang, Y., Yan, B., Chang, Z., Liu, Z., et al. (2022). Trends in the application of deep learning networks in medical image analysis: evolution between 2012 and 2020. Eur. J. Radiol. 146, 110069. doi:10.1016/j.ejrad.2021.110069
Wang, L., Xu, Q., Leung, S., Chung, J., Chen, B., and Li, S. (2019). Accurate automated Cobb angles estimation using multi-view extrapolation net. Med. Image Anal. 58, 101542. doi:10.1016/j.media.2019.101542
Willemink, M. J., Koszek, W. A., Hardell, C., Wu, J., Fleischmann, D., Harvey, H., et al. (2020). Preparing medical imaging data for machine learning. Radiology 295 (1), 4–15. doi:10.1148/radiol.2020192224
Zhang, R., Xiao, X., Liu, Z., Li, Y., and Li, S. (2020). MRLN: multi-task relational learning network for MRI vertebral localization, identification, and segmentation. IEEE J. Biomed. Health Inf. 24 (10), 2902–2911. doi:10.1109/jbhi.2020.2969084
Keywords: artificial intelligence, deep learning, automatic measurement, lateral lumbar radiograph, lumbosacral radiographic parameters
Citation: Yuan S, Chen R, Liu X, Wang T, Wang A, Fan N, Du P, Xi Y, Gu Z, Zhang Y and Zang L (2024) Artificial intelligence automatic measurement technology of lumbosacral radiographic parameters. Front. Bioeng. Biotechnol. 12:1404058. doi: 10.3389/fbioe.2024.1404058
Received: 22 March 2024; Accepted: 17 June 2024;
Published: 01 July 2024.
Edited by:
Ron Noah Alkalay, Beth Israel Deaconess Medical Center and Harvard Medical School, United StatesReviewed by:
Peikai Chen, The University of Hong Kong, ChinaNanfang Xu, Peking University Third Hospital, China
Copyright © 2024 Yuan, Chen, Liu, Wang, Wang, Fan, Du, Xi, Gu, Zhang and Zang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lei Zang, zanglei@ccmu.edu.cn; Yiling Zhang, ylzhang@changmugu.com
†These authors have contributed equally to this work and share first authorship