Skip to main content

ORIGINAL RESEARCH article

Front. Med., 21 July 2023
Sec. Precision Medicine

Development and validation of a risk prediction model for cage subsidence after instrumented posterior lumbar fusion based on machine learning: a retrospective observational cohort study

Tuotuo Xiong&#x;Tuotuo Xiong1Ben Wang,&#x;Ben Wang1,2Wanyuan QinWanyuan Qin1Ling YangLing Yang3Yunsheng Ou
Yunsheng Ou1*
  • 1Department of Orthopedics, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
  • 2Plastic Surgery Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
  • 3Department of Breast and Thyroid Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China

Background: Interbody cage subsidence is a common complication after instrumented posterior lumbar fusion surgery, several previous studies have shown that cage subsidence is related to multiple factors. But the current research has not combined these factors to predict the subsidence, there is a lack of an individualized and comprehensive evaluation of the risk of cage subsidence following the surgery. So we attempt to identify potential risk factors and develop a risk prediction model that can predict the possibility of subsidence by providing a Cage Subsidence Score (CSS) after surgery, and evaluate whether machine learning-related techniques can effectively predict the subsidence.

Methods: This study reviewed 59 patients who underwent posterior lumbar fusion in our hospital from 2014 to 2019. They were divided into a subsidence group and a non-subsidence group according to whether the interbody fusion cage subsidence occurred during follow-up. Data were collected on the patient, including age, sex, cage segment, number of fusion segments, preoperative space height, postoperative space height, preoperative L4 lordosis Angle, postoperative L4 lordosis Angle, preoperative L5 lordosis Angle, postoperative PT, postoperative SS, postoperative PI. The conventional statistical analysis method was used to find potential risk factors that can lead to subsidence, then the results were incorporated into stepwise regression and machine learning algorithms, respectively, to build a model that could predict the subsidence. Finally the diagnostic efficiency of prediction is verified.

Results: Univariate analysis showed significant differences in pre−/postoperative intervertebral disc height, postoperative L4 segment lordosis, postoperative PT, and postoperative SS between the subsidence group and the non-subsidence group (p < 0.05). The CSS was trained by stepwise regression: 2 points for postoperative disc height > 14.68 mm, 3 points for postoperative L4 segment lordosis angle >16.91°, and 4 points for postoperative PT > 22.69°. If the total score is larger than 0.5, it is the high-risk subsidence group, while less than 0.5 is low-risk. The score obtains the area under the curve (AUC) of 0.857 and 0.806 in the development and validation set, respectively. The AUC of the GBM model based on the machine learning algorithm to predict the risk in the training set is 0.971 and the validation set is 0.889. The AUC of the avNNet model reached 0.931 in the training set and 0.868 in the validation set, respectively.

Conclusion: The machine learning algorithm has advantages in some indicators, and we have preliminarily established a CSS that can predict the risk of postoperative subsidence after lumbar fusion and confirmed the important application prospect of machine learning in solving practical clinical problems.

1. Introduction

Posterior lumbar interbody fusion is the treatment of lumbar disc herniation, lumbar spinal stenosis, and lumbar spondylolisthesis, which is the most commonly used surgical procedure (1). Inserting a cage to maintain the intervertebral space height is an important step in lumbar fusion. But long-term follow-up after surgery shows that the intervertebral height may be lost. This can lead to instrumentation failure, pseudoarthrosis, kyphotic deformity, adjacent-segment disease, and loss of foraminal height, any of which can lead to recurrent nerve root impingement and radicular pain (24). And studies have shown that cage subsidence is associated with postoperative revision surgery, and the rate of revision is significantly higher in patients with high-grade cage subsidence after fusion, there are some studies showed the rate of cage subsidence can reach 38% (57). Previous clinical studies have shown risk factors of cage subsidence (25, 8). But currently, it is rare to comprehensively use multiple relevant risk factors to predict and analyze the risk of subsidence. Also, it is difficult to predict it accurately affected by complex risk factors (913). The most known used method for prediction in clinical practice is the scoring system based on logistic regression analysis (1416). But as a classic statistic model, logistic regression-based methods may be insufficient in making full use of risk factor information (1720). The emergence of ML (machine learning) has brought a turning point for improving this problem. It can detect the details of data features that human cannot, also reflected in its ability to efficiently handle complex nonlinear data (21, 22), and enable more efficient diagnosis (23).

Machine learning is one kind of computer algorithm, but its high-speed development recently makes it considered to bring about the fourth industrial revolution (24). Such as the widely used face recognition technology, which based on the convolutional neural network (CNN) algorithm (25, 26). The reality of self-driving cars is also highly dependent on deep learning algorithms (27). The most exciting thing is that Alpha Go based on the DL algorithm successfully beat master in Go chess (28, 29). And this is unimaginable without machine learning algorithms, because of the huge search space of Go, far beyond the capabilities of traditional methods such as the exhaustive method (30, 31). Different from the Deep Blue computer, which is also famous for beating humans, the underlying technology involved in Alpha Go, namely machine learning technology, is extremely versatile and can be extended to many application fields (32). Following the breakthrough of the DL in the field of Go, the alphafold also based on DL, significantly improved the accuracy of protein structure predictions that have been slowly thriving for decades (33, 34). So, machine learning has revolutionary value in the development of ‘Future Medicine’ (35).

Recently several papers focusing on the application prospects of ML technology in the future medical field had published (36, 37). Machine learning-related technologies have achieved remarkable results in several medical fields. In terms of risk prediction of complications, studies by Tomasev (38, 39) show that ML can effectively predict the risk of chronic kidney disease with diabetes, which facilitates early intervention for patients. It has also shown significantly better diagnostic performance than humans in image diagnosis: Hannun et al. (40) applied deep networks to ECG data and showed that the machine learning-based model can effectively tell ECG. But in general, the current attempts to combine medical practice with machine learning technology are still insufficient.

Therefore, the purpose of this study is to synthesize multiple risk factors to the traditional stepwise regression model and the machine learning model, respectively, to establish a scoring system, so as to predict the probability of cage subsidence after surgery, and further evaluate and compare whether they can effectively predict the risk after instrumented posterior lumbar fusion surgery.

2. Materials and methods

2.1. Case data

This study retrospectively collected the medical records and imaging data of 59 patients (73 surgical segments) who were admitted to the hospital from October 10, 2014, to December 29, 2019. Inclusion criteria: (1) Performed posterior lumbar fusion and implanted a cage. (2) The follow-up lasted for more than half a year. (3) Clinical diagnosis of lumbar disc herniation or lumbar spondylolisthesis. Exclusion criteria: (1) Patients with tumors, infections, and spinal fractures. (2) Patients with incomplete imaging data.

2.2. Imaging measurement methods

To measure the distance and angle of follow-up imaging data, respectively, by the Carestream imaging software (Figure 1). The height of the disc is defined as the mean of the front and middle and posterior intervertebral space. The height at the last follow-up less than 2 mm from 7 days after surgery was defined as cage subsidence (41, 42). The spinal and pelvic parameters were also collected: (1) Lumbar Segmental Lordosis: the cobb between the lower and upper end plate of the surgical segment, (2) Pelvic Incidence (PI): the angle between the line drawn from the femoral head axis to the midpoint of the sacral plate and the line perpendicular to the sacral plate, (3) Pelvic Tilt (PT): the angle between the vertical and the line drawn from the femoral head axis to the midpoint of the sacral plate, and (4) Sacral Slope: the angle between the sacral plate and the horizontal (43).

FIGURE 1
www.frontiersin.org

Figure 1. Diagram of the measurement of imaging data. PI, Pelvic incidence; SS, sacral slope; PT, pelvic tilt.

2.3. Principles of GBM model construction

Gradient boosting machine (GBM), which belongs to boosting algorithm, is a machine learning algorithm that can be used to deal with regression and classification problems (44). Promote. The simple principle of the algorithm can be understood as the establishment of many classification models by combining a series of weak, low-cost, and complex classifiers (weak learner), which are continuously adjusted through each classification, and finally the model is optimized (45). Model Averaged Neural Network (avNNet), the neural network is a machine learning algorithm that contains three layers (input layer, hidden layer, output layer), the hidden layer contains hidden neuron nodes, these neurons accept the input layer and output layer information at the same time. Initially, the neuron functions of the neural network was randomly distributed, but by continuously learning the sample features, the neurons constantly modify their parameters to achieve an accurate classification performance (46).

2.4. Surgical methods

The patient is placed in a prone position under general anesthesia, the surgical segment is positioned by C-arm before the operation, the anatomical structures at all levels are exposed in turn, and the pedicle screws are drilled and implanted. After the C-arm confirms the stability of the screws, choose an appropriate length of titanium rod, bend it into a physiological curvature, and lock the nut. Decompress and release the nerve root, the intervertebral disc tissue was removed, scrape the cartilage terminal plate to the vertebral end plate with a curette, and the intervertebral space was flushed. The allogeneic bone is taken and wrapped in a cage of suitable size and inserted into the intervertebral space. Drainage tubes should be placed, the close the wound after confirming that the cage is in a good position under fluoroscopy.

2.5. Modeling methods

Firstly, SPSS 21.0 software was used to conduct univariate analysis on the suspected cage subsidence-related factors selected in this study. The variables with p < 0.05 in the univariate analysis were subjected to stepwise regression, and the variables were screened based on the AIC principle (47). The selected variables were incorporated into the stepwise logistic regression and machine learning models. The code implementation involved in this study was done under R version 4.1.0.1 First, use the install. Packages (“*”) command to complete the installation of the integrated machine learning package caret (48), the ROC curve drawing package pROC (49). And use the createDataPartition command and follow 0. 75: 0.25 scale the original data set into a training set and a validation set. Then use the training command in the caret package (48) to train the model for statistically different variables (as described in the Results section) in the training set, use the glm function to build a logistic regression model, and finally use the trained model to train the training set. And the validation set is predicted (predict command), and the confusion matrix command is used to obtain the confusion matrix of the predicted results, as well as the specificity, sensitivity, F1 value and other indicators. The receiver operating curve (ROC) is drawn by the pROC package (49), and the core modeling code is shown in Supplementary material.

2.6. Statistical methods

It is statistically significant to use the t-test to test the difference between the sedimentation group and the non-sedimentation group, and if the variance was uneven, the Mann–Whitney U test was used. The results were expressed as mean ± standard deviation, and the counting data were expressed as frequency. The Chi-square test or Fisher’s precision test was used to test whether the frequency difference between groups was statistically significant, and p < 0.05 was used as the threshold for statistical significance.

The work has been reported in line with the STARD criteria (50).

3. Results

3.1. Basic information

The research process is shown in Figure 2. The basic information about the patients is shown in Table 1. There are 30 surgical segments from men and 43 from women, of which 39 are in the subsidence group, and 34 are in the non-subsidence group. The incidence rate was 53.42%, and there was no significant statistical difference between the subsidence group and the non-subsidence group in the basic characteristics (age, gender) of the patients (age: p = 0.982, gender: p = 0.643).

FIGURE 2
www.frontiersin.org

Figure 2. Research design.

TABLE 1
www.frontiersin.org

Table 1. Single factor analysis results (x¯±s).

In terms of surgical segments, a total of 37 L5-S1 and 36 L4-L5 fusion segments were collected, of which the fusion subsidence rate of the L5-S1 segment was 62.16%, and the subsidence rate of the L4-L5 segment was 44.44%. There was no significant difference in the sedimentation rate between the two segments (p = 0.129). Among the collected cases, 39 cases were single-segment fusion, and 34 were double-segment fusion. The difference was not statistically significant (61.54% vs. 44.12%, p = 0.137).

As shown in Table 1, compared with the non-subsidence group, the subsidence group had statistically significant differences in preoperative/postoperative intervertebral space height, postoperative L4 segment lordosis angle, and postoperative PT and SS variables. Compared with the non-subsidence group, the subsidence group had higher preoperative/postoperative intervertebral space height (11.68 ± 2.71 vs. 9.28 ± 3.42 preoperatively, 15.92 ± 2.92 vs. 13.22 ± 3.07 postoperatively, p < 0.001), indicating that patients with higher preoperative intervertebral space height, excessive intraoperative distraction, and inappropriate cage size may be risk factors for postoperative cage subsidence.

In terms of postoperative lumbar spine angle, the postoperative L4 segment lordosis of the subsidence group was significantly higher than that of the non-subsidence group (12.97 ± 5.4 vs. 10.47 ± 5.03, p = 0.045), and this indicator did not achieve a statistically significant difference in preoperative measurements (10.88 ± 5.2 8 vs. 9.19 ± 5.23, p = 0.175), suggesting that the effect of surgery on the patient’s lumbar curvature (such as the angle of bar bending) may be a risk factor for postoperative sedimentation.

In terms of pelvic parameters, patients in the subsidence group have a smaller Pelvic Tilt (PT) compared to the non-subsidence group (21.74 ± 8.28 and 26.36 ± 8.82, p = 0.024) and larger Sacral Slope (SS) (38.16 ± 8.53 and 33.99 ± 8.47, p = 0.04), but there was no statistical difference in Pelvic Incidence angle (Pelvic Incidence, PI) (59.89 ± 11.69 and 60.36 ± 12.29, p = 0.87).

3.2. Sedimentation risk score based on stepwise logistic regression

Combining clinical experience and univariate analysis results, we included preoperative/postoperative intervertebral space height, postoperative L4 lordosis angle, postoperative PT and postoperative SS variables, using the step (model, direction = ‘both’) command, based on The Akaike information criterion (AIC) (47) and the forward-backward stepwise regression method (51) are used to initially screen the above variables. The AIC criterion was founded by Akaike Hiroji, a variable screening method based on the concept of information entropy to balance model complexity and goodness of fit (47). The results are shown in Table 2, since the p-value of the preoperative intervertebral space height and the postoperative SS factor is greater than 0.05 under AIC optimization, we ruled out these factors and carried out secondary modeling. As shown in Table 3, postoperative PT is an independent protective factor for cage subsidence, while postoperative L4 lordosis angle and postoperative intervertebral space height are independent risk factors for cage subsidence. Patients with lower postoperative PT, higher postoperative L4 lordosis angle, and higher postoperative intervertebral space height had a higher risk of cage subsidence, and the forest map of influencing factors of cage subsidence is shown in Figure 3.

TABLE 2
www.frontiersin.org

Table 2. First stepwise regression results.

TABLE 3
www.frontiersin.org

Table 3. Regression results after excluding nonsignificant factors.

FIGURE 3
www.frontiersin.org

Figure 3. The forest map of influencing factors.

To further construct a cage subsidence score (CSS) that is convenient for clinical use, as shown in Table 4, based on the OR value of the above regression analysis, we assigned 2 points for the postoperative intervertebral space height, 3 points for postoperative L4 lordosis angle and −4 points for postoperative PT. The total score ranges from −4 to 5.

TABLE 4
www.frontiersin.org

Table 4. Cage subsidence risk scoring table.

To determine the reasonable cut-off value for each risk factor, we performed ROC curve analysis on postoperative PT, postoperative L4 lordosis angle, and postoperative intervertebral space height, respectively. The results are shown in Figure 4. The AUC of postoperative PT is 0.640 (0.490–0.791) and postoperative L4 lordosis angle AUC is 0.617 (0.467–0.766), and postoperative intervertebral space height AUC is 0.751 (0.621–0.862). These results indicate that these factors can independently predict the risk of cage subsidence in patients to a certain extent, but their predictive accuracy is not ideal. Finally, the optimal cut-off value for risk factors is determined according to the ROC curve shown in Figure 4 and Table 4, namely, 2 points for postoperative intervertebral space height > 14.68, and 3 points for postoperative L4 lordosis angle >16.91 and postoperative PT > 22.69 was assigned −4 points.

FIGURE 4
www.frontiersin.org

Figure 4. ROC curve of subsidence-related risk factors. (A: Postoperative PT; B: Postoperative L4 lordosis; C: Postoperative dis height).

Further predictive performance tests show that the scoring model could more accurately predict the risk of postoperative cage subsidence in patients. As shown in Figure 5 and Table 5, the AUC of the CSS score in the training set is 0.857, the sensitivity is 0.923, the specificity is 0.633, the positive predictive value was 0.686, the negative predictive value was 0.905, and the F1 value is 0.787. In the validation set, its AUC is 0.806, the sensitivity is 0.875, the specificity is 0.556, the positive predictive value is 0.636, the negative predictive value is 0.833, and the F1 value is 0.737. According to Figure 5A, when the score cutoff value is set to 0.5, it has the best prediction performance, so the subsidence risk is considered to be high (overall rate: 88.89%) when the total score is greater than 0.5, and is considered to be low (32.61%) when it is less than 0.5. And the calibration curves of the CSS illustrated consistency between the observed and predicted results (Figure 5C). As shown in Figure 5D, decision curve analysis (DCA) curve indicated that the CSS can serve as a precise tool for subsidence assessment.

FIGURE 5
www.frontiersin.org

Figure 5. CSS (A: training set, B: validation set, C: calibration curve, D: decision curve analysis).

TABLE 5
www.frontiersin.org

Table 5. Prediction efficiency of settlement risk score CSS in the training set and verification set.

3.3. Comparison of machine learning model and CSS scoring performance

Further, we use the caret package to build the risk prediction model of the Gradient Boosting Machine (GBM) and the Model Averaged Neural Network (avNNet). Our results show that both GBM-based and avNNet-based risk assessment models can achieve accurate subsidence risk prediction. As shown in Tables 6, 7 and Figures 6, 7, the AUC of the GBM model in the training set and the validation set are 0.971 (0.937–1.000) and 0.889 (0.733–1.000) respectively, and 0.931 (0.861–1.000), 0.868 (0.687–1.000) in the avNNet model which is all larger than the performance of CSS scores (Table 5, training set: 0.857, validation set: 0.806), where the Delong test indicates that the AUC increment of the GBM model in the training set is statistically significant compared with the CSS score (p = 0.004).

TABLE 6
www.frontiersin.org

Table 6. Comparison of parameters of a prediction model in the training set (n = 56).

TABLE 7
www.frontiersin.org

Table 7. Comparison of various parameters of the prediction model in the validation set (n = 17).

FIGURE 6
www.frontiersin.org

Figure 6. GBM model (A: training set, B: validation set, C: calibration curve, D: decision curve analysis).

FIGURE 7
www.frontiersin.org

Figure 7. avNNet model (A: training set, B: validation set, C: calibration curve, D: decision curve analysis).

As shown in the analysis of the ROC curve in Figures 6A, 7A, the prediction score of the GBM model >0.497 and c > 0.650 are considered to have a higher risk of subsidence, which belongs to the high-risk subsidence group. At this optimal cutoff value (Tables 5, 6), the sensitivity of the GBM model training set is 0.885, specificity 0.933, positive predictive value 0.920, negative predictive value 0.903, and F1 value 0.902. The sensitivity of the validation set is 0.750, specificity 0.778, positive predictive value 0.750, negative predictive value 0.778, and F1 value 0.750. The training set sensitivity of the avNNet model is 0.885, specificity 0.900, positive predictive value 0.885, negative predictive value 0.900, and F1 value 0.885. The sensitivity of the validation set is 0.875, specificity 0.778, positive predictive value 0.778, negative predictive value 0.875, and F1 value 0.824. The calibration curves of the GBM and the avNNet both illustrated consistency between the observed and predicted results (Figures 6C, 7C). As shown in Figures 6D, 7D, DCAs were performed and the results proved that they both can serve as an effective tool for prediction. And the Delong test indicates that the AUC of the GBM model is no statistically significant compared with the avNNet model (P > 0.05).

These results suggest that, under certain circumstances, machine learning-based prediction methods are expected to be more accurate in predicting the risk of subsidence in post-fusion patients than scoring tables based on logistic regression analysis.

4. Discussion

Cage subsidence is a common clinical complication after spinal fusion surgery, Tempel et al. believe (52) that cage subsidence can be used as a predictor of postoperative revision, and patients with high-grade cage subsidence had significantly higher rates of postoperative revision compared with patients with low-grade cage subsidence. This suggests that cage subsidence is closely related to the postoperative prognosis. Therefore, it is necessary to identify the risk factors related to the occurrence of cage subsidence in patients after interbody fusion, to conduct a more accurate assessment of the risk of postoperative complications in patients before surgery, and to take effective plans to prevent the occurrence of postoperative cage subsidence.

However, even though multiple previous studies have identified lots of risk factors associated with postoperative cage subsidence, such as postoperative intervertebral space height (53), the number of surgical segments (54), and sagittal spine-pelvic balance (55), suggesting that that the subsidence of the cage is determined by multiple risk factors. However, due to the lack of means for joint assessment of multiple comprehensive factors, previous risk factor studies could not accurately determine the risk of subsidence in patients, which hindered the identification of patients with a high risk of subsidence and reduced the practical clinical application value of relevant research. In this study, machine learning and a logistic regression model are used to predict the postoperative subsidence risk of patients by combining multiple risk factors. It helps to identify patients with a high risk of subsidence, to realize the early detection and prevention of interbody cage subsidence, and reduce the incidence of postoperative complications. Further research and verification will promote “intelligent medicine” and “precision medicine” in spinal surgery at home and abroad.

In this study, univariate analysis showed that preoperative/postoperative vertebral space height, postoperative L4 lordosis angle, postoperative PT and postoperative SS had significant statistical differences. These 5 significant variables were included in stepwise regression analysis, and based on AIC criterion, it was concluded that postoperative PT, postoperative L4 lordosis angle and postoperative vertebral space height were independent predictors, with OR values of 1.147, 0.8, and 0.593, respectively. As is known to all, when the OR value is greater than 1, this variable can be regarded as a contributing factor to the outcome, and the outcome index we set is no subsidence. In other words, the risk of cage subsidence will decrease when PT increases within a certain range. However, when the OR value was in the range of 0 to 1, this variable could be regarded as an obstacle to the occurrence of outcome, that is, larger postoperative L4 lordosis angle and postoperative vertebral space height would increase the probability of subsidence.

Postoperative PT, postoperative L4 lordosis angle and postoperative vertebral space height were analyzed by ROC curve, and the AUC of postoperative vertebral space height was the largest, which was 0.751, indicating that it had the most predictive effect in this study. With regard to the height of the postoperative intervertebral space, Yang et al. (56) pointed out that the excessive extension of the intervertebral space could lead to the subsidence of the fusion cage, Le et al. and Malham et al. recommended the use of a cage with a height of 8 ~ 12 mm in lumbar intervertebral fusion (57, 58). In our study, the cage height used was also 8 ~ 12 mm, and the average height of the postoperative intervertebral space in the subsidence group was (15.92 ± 2.92) mm, higher than the maximum height of the fusion cage used 12 mm, indicating the existence of postoperative overextension of the intervertebral space. Due to the biomechanical imbalance during compression during surgery, the L4 lordosis angle in the subsidence group was larger than that in the non-subsidence group, and the difference was statistically significant, which further indicated that there was overextension or insufficient anterior compression in the vertebral space. We further hypothesized that it was the injury to the endplate during intraoperative cage placement. On the one hand, based on the anatomical basis that the lumbar anterior intervertebral disc height is higher than the posterior intervertebral disc height, it is difficult to implant a cage with a forward angle from the rear. Therefore, more intervertebral space is extended, and the posterior endplate will always bear greater stress during the implantation of the intervertebral fusion device, which is easy to cause posterior endplate injury and increase the risk of the fusion device subsidence. On the other hand, during intraoperative intervertebral compression, the posterior intervertebral space is subjected to more compression force than the anterior intervertebral space, which may lead to insufficient compression force in the overstretched anterior intervertebral space, resulting in reduced contact area and stress concentration between the cage and the endplate, increasing the risk of fusion cage subsidence. Therefore, we suggest that the height of the intervertebral space be measured before surgery to avoid the subsidence caused by blind over-extension during surgery. Intraoperative fusion device implantation should be performed on the decompression side first, and then on the non-decompression side to facilitate implantation. Choose a cage with appropriate specifications. Do not forcibly place the fusion device to avoid damaging the upper and lower bony end plates of the rear space. After implantation of the cage, the decompression side should be pressurized first and the non-decompression side should be pressurized again to ensure that the fusion device is not loose or displaced. However, excessive pressure should not be applied to avoid the concentration of compressive stress in the rear due to insufficient pressure in the front clearance.

With increasing attention to patient-related outcomes, there is increasing evidence that appropriate sagittal alignment during lumbar fusion improves outcomes. In our research results, postoperative PT in the sedimentation group was significantly smaller than that in the non-sedimentation group, and postoperative L4 lordosis in the subsidence group was significantly larger than that in the non- subsidence group. In the normal lumbar spine, where the pelvic morphology is associated with varying degrees of lordosis. Excessive intraoperative pressure can result in a larger lordosis angle, and the larger the lordosis Angle, due to the need for sagittal balance, the body tilts the pelvis backward to reduce the PT to maintain balance. As mentioned above, excessive pressure is often associated with endplate injury, which increases the risk of postoperative subsidence. According to our ROC analysis of postoperative L4 lordosis Angle, the probability of subsidence in this study will increase significantly when the postoperative L4 lordosis Angle is >16.91°. Therefore, we suggest that clinicians should pay more attention to the rod bending angle during surgery, and should not over-correct the lordosis to increase the risk of subsidence.

The main innovation of this study is reflected in the synthesis of previous studies on the risk factors related to cage subsidence and a method for comprehensively evaluating the risk of cage subsidence was established based on machine learning and a logistic regression algorithm.

There are certain limitations in this article: this is a retrospective study and the evidence is not as strong as in prospective cohort studies. Cage subsidence after lumbar interbody fusion is related to many factors. BMI, body function, osteoporosis, etc., due to the long case history and lack of clinical data of patients, the above indexes could not be obtained. Moreover, the sample size of this study was small. We will include those additional indicators in future studies to obtain more reliable results. This study only proves that both machine learning and CSS score can effectively predict the risk of cage settlement after surgery, and preliminarily suggests that the prediction model based on machine learning may have advantages in some indicators. However, it is only a single center study and need an external validation in the future.

5. Conclusion

Overall, the results of this paper show that both machine learning and CSS scores can accurately predict the risk of postoperative cage subsidence. Under specific circumstances, the machine learning algorithm is able to achieve a more accurate postoperative risk assessment. These results preliminarily confirm the application value of machine learning in solving practical clinical problems and provide a convenient CSS risk score for the risk assessment of cage subsidence after interbody fusion as well. These results will provide preliminary application experience for the future organic combination of precision medicine, intelligent medicine, and surgical practice.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by this study was conducted in accordance with the Declaration of Helsinki (as revised in 2013) and was approved by the Institutional Ethics Board of the First Affiliated Hospital of Chongqing Medical University. The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

TX and BW: writing, original draft preparation, methodology, and formal analysis. TX, BW, WQ, and LY: data curation and editing. TX and YO: supervision and critical review. All authors had read and agreed to the final manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2023.1196384/full#supplementary-material

Footnotes

References

1. Pu, H, Chen, Q, Huang, K, Zeng, R, and Wei, P. Forearm T-score as a predictor of cage subsidence in patients with degenerative lumbar spine disease following posterior single-segment lumbar interbody fusion. BMC Musculoskelet Disord. (2022) 23:1–9. doi: 10.1186/s12891-022-05930-5

CrossRef Full Text | Google Scholar

2. Abe, K, Orita, S, Mannoji, C, Motegi, H, Aramomi, M, Ishikawa, T, et al. Perioperative complications in 155 patients who underwent oblique lateral interbody fusion surgery: perspectives and indications from a retrospective, Multicenter survey. Spine. (2017) 42:55–62. doi: 10.1097/BRS.0000000000001650

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Zavras, AG, Federico, V, Nolte, MT, Butler, AJ, Dandu, N, Munim, M, et al. Risk factors for subsidence following anterior lumbar interbody fusion. Global. Spine J. (2022):219256822211035. doi: 10.1177/21925682221103588

CrossRef Full Text | Google Scholar

4. Davis, TT, Hynes, RA, Fung, DA, Spann, SW, MacMillan, M, Kwon, B, et al. Retroperitoneal oblique corridor to the L2-S1 intervertebral discs in the lateral position: an anatomic study. J Neurosurg Spine. (2014) 21:785–93. doi: 10.3171/2014.7.SPINE13564

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Tempel, ZJ, McDowell, MM, Panczykowski, DM, Gandhoke, GS, Hamilton, DK, Okonkwo, DO, et al. Graft subsidence as a predictor of revision surgery following stand-alone lateral lumbar interbody fusion. J Neurosurg Spine. (2018) 28:50–6. doi: 10.3171/2017.5.SPINE16427

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Potter, BK, Freedman, BA, Verwiebe, EG, Hall, JM, Polly, DW, and Kuklo, TR. Transforaminal lumbar interbody fusion: clinical and radiographic results and complications in 100 consecutive patients. J Spinal Disord Tech. (2005) 18:337–46. doi: 10.1097/01.bsd.0000166642.69189.45

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Massaad, E, Fatima, N, Kiapour, A, Hadzipasic, M, Shankar, GM, and Shin, JH. Polyetheretherketone versus titanium cages for posterior lumbar interbody fusion: meta-analysis and review of the literature. Neurospine. (2020) 17:125–35. doi: 10.14245/ns.2040058.029

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Zhang, Z, Fogel, GR, Liao, Z, Sun, Y, and Liu, W. Biomechanical analysis of lumbar interbody fusion cages with various lordotic angles: a finite element study. Comput Methods Biomech Biomed Engin. (2018) 21:247–54. doi: 10.1080/10255842.2018.1442443

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Soliman, MAR, Aguirre, AO, Kuo, CC, Ruggiero, N, Azmy, S, Khan, A, et al. Vertebral bone quality score independently predicts cage subsidence following transforaminal lumbar interbody fusion. Spine J. (2022) 22:2017–23. doi: 10.1016/j.spinee.2022.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Hu, Y-H, Yeh, Y-C, Niu, C-C, Hsieh, MK, Tsai, TT, Chen, WJ, et al. Novel MRI-based vertebral bone quality score as a predictor of cage subsidence following transforaminal lumbar interbody fusion. J Neurosurg Spine. (2022) 37:654–62. doi: 10.3171/2022.3.SPINE211489

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Pisano, AJ, Fredericks, DR, Steelman, T, Riccio, C, Helgeson, MD, and Wagner, SC. Lumbar disc height and vertebral Hounsfield units: association with interbody cage subsidence. Neurosurg Focus. (2020) 49:E9. doi: 10.3171/2020.4.FOCUS20286

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Jones, C, Okano, I, Arzani, A, Dodo, Y, Moser, M, Reisener, M–J, et al. The predictive value of a novel site-specific MRI-based bone quality assessment, endplate bone quality (EBQ), for severe cage subsidence among patients undergoing standalone lateral lumbar interbody fusion. Spine J. (2023) 22:1875–83. doi: 10.1016/j.spinee.2022.07.085

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Moser, M, Adl Amini, D, Jones, C, Zhu, J, Okano, I, Oezel, L, et al. The predictive value of psoas and paraspinal muscle parameters measured on MRI for severe cage subsidence after standalone lateral lumbar interbody fusion. Spine J. (2022) 23:42–53. doi: 10.1016/j.spinee.2022.03.009

CrossRef Full Text | Google Scholar

14. Tang, X-R, Li, Y-Q, Liang, S-B, Jiang, W, Liu, F, Ge, WX, et al. Development and validation of a gene expression-based signature to predict distant metastasis in locoregionally advanced nasopharyngeal carcinoma: a retrospective, multicentre, cohort study. Lancet Oncol. (2018) 19:382–93. doi: 10.1016/S1470-2045(18)30080-9

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Maurichi, A, Miceli, R, Eriksson, H, Newton-Bishop, J, Nsengimana, J, Chan, M, et al. Factors affecting sentinel node metastasis in thin (T1) cutaneous melanomas: development and external validation of a predictive nomogram. J Clin Oncol. (2020) 38:1591–601. doi: 10.1200/JCO.19.01902

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Jiang, W, Wang, J, Shen, X, Lu, W, Wang, Y, Li, W, et al. Establishment and validation of a risk prediction model for early diabetic kidney disease based on a systematic review and Meta-analysis of 20 cohorts. Diabetes Care. (2020) 43:925–33. doi: 10.2337/dc19-1897

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Shabani, S, Varamesh, S, Moayedi, H, and Le Van, B. Modeling the susceptibility of an uneven-aged broad-leaved forest to snowstorm damage using spatially explicit machine learning. Environ Sci Pollut Res Int. (2022) 30:34203–13. doi: 10.1007/s11356-022-24660-8

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Ushida, T, Kotani, T, Baba, J, Imai, K, Moriyama, Y, Nakano-Kobayashi, T, et al. Antenatal prediction models for outcomes of extremely and very preterm infants based on machine learning. Arch Gynecol Obstet. (2022). doi: 10.1007/s00404-022-06865-x

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Elsaid, AF, Fahmi, RM, Shehta, N, and Ramadan, BM. Machine learning approach for hemorrhagic transformation prediction: capturing predictors’ interaction. Front Neurol. (2022) 13:951401. doi: 10.3389/fneur.2022.951401

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Beunza, J-J, Puertas, E, García-Ovejero, E, Villalba, G, Condes, E, Koleva, G, et al. Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease). J Biomed Inform. (2019) 97:103257. doi: 10.1016/j.jbi.2019.103257

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Lai, J-P, Lin, Y-L, Lin, H-C, Shih, C-Y, Wang, Y-P, and Pai, P-F. RLC circuit forecast in Analog IC packaging and testing by machine learning techniques. Micromachines (Basel). (2022) 13:1305. doi: 10.3390/mi13081305

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Liu, S, Yu, Y, Liu, K, Wang, F, Wen, W, and Qiao, H. Hierarchical neighbors embedding. IEEE Trans Neural Netw Learn Syst. (2022):1–14. doi: 10.1109/TNNLS.2022.3221103

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Esteva, A, Robicquet, A, Ramsundar, B, Kuleshov, V, DePristo, M, Chou, K, et al. A guide to deep learning in healthcare. NatMed. (2019) 25:24–9. doi: 10.1038/s41591-018-0316-z

CrossRef Full Text | Google Scholar

24. Sarker, IH. Machine learning: algorithms, real-world applications and research directions. SN Comput Sci. (2021) 2:160. doi: 10.1007/s42979-021-00592-x

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Ge, H, Dai, Y, Zhu, Z, and Wang, B. Robust face recognition based on multi-task convolutional neural network. Math Biosci Eng. (2021) 18:6638–51. doi: 10.3934/mbe.2021329

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Liu, H, Mo, Z-H, Yang, H, Zhang, ZF, Hong, D, Wen, L, et al. Automatic facial recognition of Williams-Beuren syndrome based on deep convolutional neural networks. Front Pediatr. (2021) 9:648255. doi: 10.3389/fped.2021.648255

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Russell, RL, and Reale, C. Multivariate uncertainty in deep learning. IEEE Trans Neural Netw Learn Syst. (2022) 33:7937–43. doi: 10.1109/TNNLS.2021.3086757

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Shao, K, Zhao, D, Tang, Z, and Zhu, Y. Move prediction in Gomoku using deep learning. In Proceeding of the 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC). (2016) 292–297.

Google Scholar

29. Li, F, and Du, Y. From AlphaGo to power system AI: what engineers can learn from solving the most complex board game. IEEE Power Energy Magazine. (2018) 16:76–84. doi: 10.1109/MPE.2017.2779554

CrossRef Full Text | Google Scholar

30. Gomes, Carla P. (2020). AI for advancing scientific discovery for a sustainable future. In Proceedings of the 19th international conference on autonomous agents and MultiAgent systems (AAMAS '20).

Google Scholar

31. Baraniuk, R, Donoho, D, and Gavish, M. The science of deep learning. Proc Natl Acad Sci. (2020) 117:30029–32. doi: 10.1073/pnas.2020596117

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Tomayko, JE. Behind deep blue: building the computer that defeated the world chess champion (review). Technol Cult. (2003) 44:634–5. doi: 10.1353/tech.2003.0140

CrossRef Full Text | Google Scholar

33. Sean, O'N. Artificial intelligence cracks a major challenge in biology in 50 years. Engineering. (2021) 7:11–6.

Google Scholar

34. Jumper, J, Evans, R, Pritzel, A, Green, T, Figurnov, M, Ronneberger, O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. (2021) 596:583–9. doi: 10.1038/s41586-021-03819-2

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Krittanawong, C, Zhang, H, Wang, Z, Aydar, M, and Kitai, T. Artificial Intelligence in Precision Cardiovascular Medicine. J Am Coll Cardiol. (2017) 69:2657–64. doi: 10.1016/j.jacc.2017.03.571

CrossRef Full Text | Google Scholar

36. Topol, EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. (2019) 25:44–56. doi: 10.1038/s41591-018-0300-7

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Price, WN, and Cohen, IG. Privacy in the age of medical big data. Nat Med. (2019) 25:37–43. doi: 10.1038/s41591-018-0272-7

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Tomašev, N, Glorot, X, Rae, JW, Zielinski, M, Askham, H, Saraiva, A, et al. A clinically applicable approach to continuous prediction of future acute kidney injury. Nature. (2019) 572:116–9. doi: 10.1038/s41586-019-1390-1

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Sun, M, Baron, J, Dighe, A, Szolovits, P, Wunderink, RG, Isakova, T, et al. Early prediction of acute kidney injury in critical care setting using clinical notes and structured multivariate physiological measurements. Stud Health Technol Inform. (2019) 264:368–72. doi: 10.3233/SHTI190245

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Hannun, AY, Rajpurkar, P, Haghpanahi, M, Tison, GH, Bourn, C, Turakhia, MP, et al. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network[J]. Nat Med. (2019) 25:65–9. doi: 10.1038/s41591-018-0268-3

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Stickley, C, Philipp, T, Wang, E, Zhong, J, Balouch, E, O'Malley, N, et al. Expandable cages increase the risk of intraoperative subsidence but do not improve perioperative outcomes in single level transforaminal lumbar interbody fusion. Spine J. (2021) 21:37–44. doi: 10.1016/j.spinee.2020.08.019

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Pinter, ZW, Reed, R, Townsley, SE, Mikula, AL, Dittman, L, Xiong, A, et al. Titanium cervical cage subsidence: postoperative computed tomography analysis defining incidence and associated risk factors. Global. Spine J. (2021):219256822110468. doi: 10.1177/21925682211046897

CrossRef Full Text | Google Scholar

43. Obeid, I, Boniello, A, Boissiere, L, Bourghli, A, Pointillart, V, Gille, O, et al. Cervical spine alignment following lumbar pedicle subtraction osteotomy for sagittal imbalance. Eur Spine J. (2015) 24:1191–8. doi: 10.1007/s00586-014-3738-4

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Shojaie, M, Cabrerizo, M, DeKosky, ST, Vaillancourt, DE, Loewenstein, D, Duara, R, et al. A transfer learning approach based on gradient boosting machine for diagnosis of Alzheimer’s disease. Front. Aging. Neurosci. (2022) 14:966883.

Google Scholar

45. Friedman, JH. Stochastic gradient boosting[J]. Comput Stat Data Analysis. (2002) 38:367–78. doi: 10.1016/S0167-9473(01)00065-2

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Ellmann, S, Seyler, L, Evers, J, Heinen, H, Bozec, A, Prante, O, et al. Prediction of early metastatic disease in experimental breast cancer bone metastasis by combining PET/CT and MRI parameters to a model-averaged neural network[J]. Bone. (2019) 120:254–61. doi: 10.1016/j.bone.2018.11.008

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Sugiura, N. Further analysis of the data by Akaike's information criterion and the finite corrections. Commun Stat. (1978) 7:13–26. doi: 10.1080/03610927808827599

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Kuhn, M. Caret: classification and regression training. R package version 60–90. Available at: https://CRAN.R-project.org/package=caret (2021).

Google Scholar

49. Robin, X, Turck, N, Hainard, A, Tiberti, N, Lisacek, F, Sanchez, JC, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinform. (2011) 12:77. doi: 10.1186/1471-2105-12-77

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Bossuyt, PM, Reitsma, JB, Bruns, DE, Gatsonis, CA, Glasziou, PP, Irwig, L, et al. STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. BMJ. (2015) 351:h5527. doi: 10.1136/bmj.h5527

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Deelen, J, Kettunen, J, Fischer, K, van der Spek, A, Trompet, S, Kastenmüller, G, et al. A metabolic profile of all-cause mortality risk identified in an observational study of 44,168 individuals. Nat Commun. (2019) 10:3346. doi: 10.1038/s41467-019-11311-9

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Tempel, ZJ, McDowell, MM, Panczykowski, DM, Gandhoke, GS, Hamilton, DK, Okonkwo, DO, et al. Graft subsidence as a predictor of revision surgery following stand-alone lateral lumbar interbody fusion[J]. J Neurosurg Spine. (2018) 28:50–6. doi: 10.3171/2017.5.SPINE16427

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Zhang, K, Zhang, F, Zhao, CQ, Tian, JP, Sun, XJ, Li, H, et al. Analysis of cage migration after lumbar interbody fusion and Revision strategies[J]. Chin. J. Ortho. (2016) 36:1093–1098. doi: 10.3760/cma.j.issn.02532352.2016.17.004

CrossRef Full Text | Google Scholar

54. Kao, T, Wu, C, Chou, Y, Chen, HT, Chen, WH, and Tsou, HK. Risk factors for subsidence in anterior cervical fusion with stand-alone polyetheretherketone (PEEK) cages: a review of 82 cases and 182 levels. Arch Orthop Trauma Surg. (2014) 134:1343–51. doi: 10.1007/s00402-014-2047-z

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Bocahut, N, Audureau, E, Poignard, A, Delambre, J, Queinnec, S, Flouzat Lachaniette, CH, et al. Incidence and impact of implant subsidence after stand-alone lateral lumbar interbody fusion. Orthop Traumatol Surg Res. (2018) 104:405–10. doi: 10.1016/j.otsr.2017.11.018

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Yang, JJ, Yu, CH, Chang, BS, Yeom, JS, Lee, JH, and Lee, CK. Subsidence and nonunion after anterior cervical interbody fusion using a stand-alone polyetheretherketone (PEEK) cage. Clin Orthop Surg. (2011) 3:16–23. doi: 10.4055/cios.2011.3.1.16

PubMed Abstract | CrossRef Full Text | Google Scholar

57. le, TV, Baaj, AA, Dakwar, E, Burkett, CJ, Murray, G, Smith, DA, et al. Subsidence of polyetheretherketone intervertebral cages in minimally invasive lateral retroperitoneal transpsoas lumbar interbody fusion. Spine. (2012) 37:1268–73. doi: 10.1097/BRS.0b013e3182458b2f

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Malham, GM, Parker, RM, Blecher, CM, and Seex, KA. Assessment and classification of subsidence after lateral interbody fusion using serial computed tomography. J Neurosurg Spine. (2015) 23:589–97. doi: 10.3171/2015.1.SPINE14566

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: spine surgery, fusion, risk factors, prediction model, degenerative disease, cage subsidence

Citation: Xiong T, Wang B, Qin W, Yang L and Ou Y (2023) Development and validation of a risk prediction model for cage subsidence after instrumented posterior lumbar fusion based on machine learning: a retrospective observational cohort study. Front. Med. 10:1196384. doi: 10.3389/fmed.2023.1196384

Received: 29 March 2023; Accepted: 03 July 2023;
Published: 21 July 2023.

Edited by:

Yujiang Fan, Sichuan University, China

Reviewed by:

Seifollah Gholampour, The University of Chicago, United States
ZhiJian Song, Fudan University, China

Copyright © 2023 Xiong, Wang, Qin, Yang and Ou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yunsheng Ou, ouyunsheng2001@163.com

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.