![Man ultramarathon runner in the mountains he trains at sunset](https://d2csxpduxe849s.cloudfront.net/media/E32629C6-9347-4F84-81FEAEF7BFA342B3/0B4B1380-42EB-4FD5-9D7E2DBC603E79F8/webimage-C4875379-1478-416F-B03DF68FE3D8DBB5.png)
94% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
ORIGINAL RESEARCH article
Front. Med. Technol. , 05 February 2025
Sec. Diagnostic and Therapeutic Devices
Volume 7 - 2025 | https://doi.org/10.3389/fmedt.2025.1485244
Objective: This study aims to develop and validate a nomogram that combines traditional ultrasound radiomics features with clinical parameters to assess early intracranial hypertension (IH) following primary decompressive craniectomy (DC) in patients with severe traumatic brain injury (TBI). The study incorporates the Shapley Additive Explanations (SHAP) method to interpret the radiomics model.
Methods: This study included 199 patients with severe TBI (training cohort: n = 159; testing cohort: n = 40). Postoperative ultrasound images of the optic nerve sheath (ONS) were obtained at 6 and 18 h after DC. Based on invasive intracranial pressure (ICPi) measurements, patients were grouped according to threshold values of 15 mmHg and 20 mmHg. Radiomics features were extracted from ONS images, and feature selection methods were applied to construct predictive models using logistic regression (LR), support vector machine (SVM), random forest (RF), and K-Nearest Neighbors (KNN). Clinical-ultrasound variables were incorporated into the model through univariate and multivariate logistic regression. A combined nomogram was developed by integrating radiomics features with clinical-ultrasound variables, and its diagnostic performance was evaluated using Receiver Operating Characteristic (ROC) curve analysis and decision curve analysis (DCA). The SHAP method was adopted to explain the prediction models.
Results: Among the machine learning models, the LR model demonstrated superior predictive efficiency and robustness at threshold values of 15 mmHg and 20 mmHg. At a threshold of 20 mmHg, the AUC values for the training and testing cohorts were 0.803 and 0.735 for the clinical model, 0.908 and 0.891 for the radiomics model, and 0.918 and 0.902 for the nomogram model, respectively. Similarly, at a threshold of 15 mmHg, the AUC values were consistent across models: 0.803 and 0.735 for the clinical model, 0.908 and 0.891 for the radiomics model, and 0.918 and 0.902 for the nomogram model. Notably, the nomogram model outperformed the clinical model. Decision curve analysis (DCA) further confirmed a higher net benefit for predicting intracranial hypertension across all models.
Conclusion: The nomogram model, which integrates both clinical-semantic and radiomics features, demonstrated strong performance in predicting intracranial hypertension across different threshold values. It shows promise for enhancing non-invasive ICP monitoring and supporting individualized therapeutic strategies.
Monitoring and regulation of intracranial pressure (ICP) are essential in the comprehensive management of various neurological disorders, especially traumatic brain injuries (TBI) (1). Research indicates that up to 72% of TBI patients experience elevated ICP within the first 24 h post-injury (2). Although invasive ICP (ICPi) monitoring is widely used, it is constrained by the need for specialized resources, expertise, and the potential risks it entails, such as infection and hemorrhage (3). These limitations highlight the ongoing need for noninvasive, accurate, and accessible methods of ICP assessment in TBI patients.
It is acknowledged that the role of ICPi monitoring in TBI patients undergoing primary decompressive craniotomy (DC) remains a topic of debate. Proponents argue that maintaining adequate cerebral perfusion pressure (CPP) is essential for TBI management, and that without ICPi monitoring, it is challenging to guide and implement CPP-directed treatments. Studies indicate that many TBI patients continue to experience elevated ICP post-DC, which can result in dangerously low CPP (<60 mmHg), thus making ICPi monitoring critical for informed treatment decisions (4, 5). Conversely, some experts oppose routine ICPi monitoring, suggesting that experienced clinicians can estimate ICP by evaluating pressure at the decompression window and using CT or MRI to detect complications such as recurrent hematomas or changes in ventricular size. They also point to the increased costs and potential risks associated with invasive monitoring. Furthermore, there is no consensus on critical ICP thresholds after primary DC, and for severe TBI, management strategies that focus on maintaining ICP below 20 mmHg have not demonstrated superiority over imaging and clinical examination-based approaches (6, 7). Despite these differing opinions, both sides emphasize the importance of correctly interpreting monitoring data and acknowledge non-invasive ICP monitoring as a valuable alternative.
Among various noninvasive approaches, measuring the optic nerve sheath diameter (ONSD) via ultrasound has emerged as a promising technique due to its simplicity, bedside applicability, and cost-effectiveness (8). The rationale behind this approach is straightforward: elevated ICP causes distension of the optic nerve sheath (ONS), leading to an increase in ONSD. Several studies have shown a strong correlation between ONSD and ICP, establishing it as a valuable parameter for noninvasive ICP assessment (9–11). However, the clinical implementation of ultrasonic ONSD measurement faces challenges due to methodological inconsistencies, including variations in transducer selection, measurement planes, and patient positioning, leading to inaccuracies and limit the generalizability of results (9, 10). Given these limitations, is there an alternative method that could enhance the diagnostic capability for early prediction of increased ICP based on ONSD images? One potential solution is the application of ultrasound radiomics.
Ultrasound radiomics, an emerging and rapidly evolving field, has transformed medical imaging by enabling comprehensive feature extraction and in-depth analysis of tumor phenotypes (12, 13). The confluence of advanced algorithms and artificial intelligence has expanded the scope of radiomic analysis beyond conventional boundaries, particularly in tumor classification and prognosis, as evidenced by a growing body of research that delineates its utility in lesion characterization (13–15). Despite these advancements, there is a paucity of literature quantifying the radiomic features of the ONS on sonographic imaging, an area ripe for exploration in terms of its clinical applicability. In light of this, investigating the comparative efficacy of current machine learning models utilizing ultrasound radiomics for the early diagnosis of intracranial hypertension (IH) is both timely and highly significant.
In this study, we employed different machine learning methodologies to develop and validate an integrated model that combines radiomics features with clinical-ultrasound parameters to predict the onset of early IH under different thresholds (15 mmHg, 20 mmHg) in primary DC patients after TBI. Moreover, the Shapley Additive Explanations (SHAP) technique was utilized to interpret our models. This method helps clarify the significance of individual features within the prediction model and illustrates their contributions to specific predictions (16).
This retrospective observational study's materials were collected at the Neuro-intensive Care Unit (NICU) of the second hospital affiliated with Shandong First Medical University, located in Taian, Shandong Province, China. It was conducted from May 2018 to August 2024 and was approved by the hospital's Ethics Committee (Nos: 2021-A-037). A total of 199 patients with TBI who underwent primary DC were included after a series of inclusion and exclusion criteria (Figure 1. Flowchart for selecting the study population). All cases had complete two-dimensional ultrasound images of ONS. The cases were randomly divided into a training cohort and a testing cohort in an 8:2 ratio.
Figure 1. Flowchart for selecting the study population. DC, decompressive craniectomy; ICP, intracranial pressure; ICPi, intracranial pressure invasive; MCA, middle cerebral artery; NICU, neuro-intensive care unit; ONSD, optic nerve sheath diameter; TBI, traumatic brain injury; TCCD, transcranial color doppler; IH, intracranial hypertension.
In TBI patients, the decision to perform primary DC is based on injury mechanism, clinical presentation, Glasgow Coma Scale (GCS) scores, and CT findings. Unilateral DC is performed for hematomas confined to one hemisphere, while bilateral or frontotemporal decompression is used for bilateral hematomas or frontal lobe injuries. The surgical incision typically involves a 9 × 9 cm bone window, extended arcuately to the dura mater. Hematomas, necrotic tissue, and blood clots are removed to ensure hemostasis, and the dura mater is loosely closed or covered with an artificial substitute. ICP is measured using either an intraparenchymal probe or a ventricular catheter connected to an external pressure transducer. The placement of the ICP probe depends on lesion location, ipsilateral for unilateral lesions and on the right side for diffuse injuries. ICP management follows a protocol that includes sedation, optimizing CPP, administering hyperosmolar fluids, and hypothermia, as per institutional guidelines. Given the absence of definitive guidelines for IH following primary DC, we established cutoff values of 15 mmHg (mild elevation) and 20 mmHg (moderate to severe elevation) to assess the model's predictive performance across these thresholds.
ONS imagings were performed within 24 h post-DC using the Mindary M9 ultrasound system with a L12-4s probe (1–5 MHz) (Shenzhen, Guangzhou, China). Patients were in a supine position and the probe was applied to the closed upper eyelid with standard gel, angled to capture the ONS in the transverse plane. Adhering to the as low as reasonably achievable (ALARA) principle, the ultrasound output was minimized for safety. Measurements for ONSD were taken 3 mm behind the optic disk, with duplicates in transverse section per eye to reduce variability.
Additionally, bedside middle cerebral artery (MCA) assessments were also recorded by the Mindary M9 ultrasound system equipped with an SP5-1s probe (1–5 MHz). bilateral evaluation of MCA, including pulse index (PI), peak Systolic Velocity (PSV), end-dystolic velocity (EDV), mean velocity (MV). The definitive PI value was ascertained by averaging the two side measurements, or by selecting the higher value in cases of substantial discrepancy.
Clinical records and imaging data were obtained from our hospital's routine documentation system and picture archiving and communication system (PACS). A retrospective analysis was conducted on clinical variables, including age, weight, height, body mass index (BMI), gender, ejection fraction (EF), mean arterial pressure (MAP), partial pressures of oxygen (PO2) and carbon dioxide (PCO2) in arterial blood, heart rate, and respiratory rate. Two experienced ultrasound specialists (QC and WQQ), blinded to the ICPi value and clinical data, independently evaluated the ONS images. One specialist had 4 years of experience, and the other had 15 years of expertise in ultrasound examinations. Both had undergone standardized training for ONSD measurements and reached consensus on the image assessments.
All US images were stored in Digital Imaging and Communications in Medicine (DICOM) format and were independently reviewed by two experienced ultrasound specialists above. They manually segmented the region of interest (ROI) by using ITK-SNAP software (version3.8.0, http://www.itksnap.org/). A complete schematic diagram is depicted in Figure 2.
The reliability of intra- and inter-observer reproducibility was assessed using the intraclass correlation coefficient (ICC). A random sample of 60 patients was selected, including 18 with IH and 42 with normal intracranial pressure (ICP <20 mmHg), for ROI segmentation. This segmentation was performed by the specialists one month before the study began. An ICC value greater than 0.9 was considered to indicate strong agreement.
Radiomic features were extracted from the segmented ROI regions using PyRadiomics (http://pyradiomics.readthedocs.io). The extracted features were classified into three main categories: (I) geometry, (II) intensity, and (III) texture. Geometric features characterize the two-dimensional shape of ONS, while intensity features represent the statistical distribution of voxel intensities within the ONS. These intensity features reflect the first-order statistical distribution of voxel intensities, providing insight into the ONS's overall brightness and contrast. Texture features capture the patterns and spatial distribution of intensities at higher orders, offering additional information on the ONS's internal structure. Specifically, the extraction of textural features employs a variety of methods such as the gray-level co-occurrence matrix (GLCM), gray-level run length matrix (GLRLM), gray level size zone matrix (GLSZM), Gray level dependence matrix (GLDM) and neighborhood gray-tone difference matrix (NGTDM).
For feature Selection, Z-score normalization was applied to standardize all features. Statistical analysis was performed using the Mann–Whitney U-test, and features were selected based on their p-values, with only those having a p-value <0.05 retained. To evaluate feature repeatability, Spearman's rank correlation coefficient was computed. Features with a correlation coefficient greater than 0.9 were considered highly correlated, and one of the correlated features was excluded. The signature construction from the discovery dataset was carried out using Least Absolute Shrinkage and Selection Operator (LASSO) regression. LASSO regression applies the regularization parameter λ to shrink regression coefficients toward zero, eliminating irrelevant features by assigning them a coefficient of zero. The optimal value of λ was determined through 10-fold cross-validation, selecting the value that minimized cross-validation error. LASSO regression modeling was performed using the scikit-learn package in Python.
After LASSO feature selection, the refined set of features was used to develop radiomics models. To identify the classifier with the highest performance in recognizing IH, we evaluated four widely used machine learning algorithms: logistic regression (LR), support vector machine (SVM), random forest (RF), and K-Nearest Neighbors (KNN). The diagnostic efficacy of these models was compared based on metrics such as the area under the receiver operating characteristic curve (AUC), specificity, positive predictive value (PPV), and negative predictive value (NPV). The optimal radiomics model was determined based on these comparisons.
For clinical-ultrasonic variables, univariate logistic regression was first performed to identify significant predictors. A subsequent multivariable analysis refined the final variables in the training group to build the model. Odds ratios (ORs) and 95% confidence intervals (CIs) were calculated for each factor. The same machine learning models used for the radiomic signature were applied, with 5-fold cross-validation and a fixed test cohort for consistent evaluation.
The radiomics nomogram was created by integrating both radiomic and clinical data. Its diagnostic performance was validated in the test cohort through receiver operating characteristic (ROC) analysis. Calibration accuracy was assessed using calibration plots and the Hosmer–Lemeshow test. The clinical applicability of the predictive models was evaluated through decision curve analysis (DCA).
We utilized MedCalc (version 20.015.0) for standard statistical analysis, including Student's t-test or Mann-Whitney U-test for continuous variables (mean ± SD) and the chi-square or Fisher's exact test for categorical variables (ratios). For the extraction of ultrasound radiomic features, Python (version 3.8.2) with PyRadiomics was employed. The Python scikit-learn package was used for LASSO regression modeling. We considered P-values <0.05 as statistically significant.
A total of 199 patients, comprising 130 males and 69 females, were enrolled in our research, with 159 randomly allocated to the training set and 40 to the test set. According to different ICP thresholds, at 20 mmHg, the proportion of hypertension IH was 30.65%, and at 15 mmHg, the proportion of IH was 38.19%. The Reasons for DC, including: Cerebral contusion/lacer-ation (n = 34, 17.09%), Acute Intracerebral hematoma (n = 58, 29.15%), Acute subdural hematoma (n = 107, 53.77%).
No significant disparities were observed between these cohorts in (age, weight, height, BMI, gender, EF, MAP, PO2, PCO2, heart rate, and respiratory rate), as detailed in Table 1 and Supplementary S1. Notable distinctions (PI, ONSD, MV, EDV) were found in the training set between normal and elevated ICP groups, as well as in the validation set (PI, ONSD). Univariate and multivariable logistic regression analyses also identified PI and ONSD, as significant predictors (Table 2 and Supplementary S2). These variables were integrated into a clinical predictive model. Patients in the elevated ICP group (≥20 mmHg) had a higher likelihood of elevated PI (OR, 4.965; 95% CI, 1.739–18.247), increased ONSD (OR, 3.609; 95% CI, 1.73–7.523), and reduced MV (OR, 0.723; 95% CI, 0.685–0.886), as well as decreased EDV (OR, 0.914; 95% CI, 0.735–0.993). At the 15 mmHg threshold, a similar trend was observed, with elevated PI (OR, 1.047; 95% CI, 1.002–1.073), increased ONSD (OR, 10.912; 95% CI, 4.332–27.716), and reduced MV (OR, 0.955; 95% CI, 0.925–0.987), along with decreased EDV (OR, 0.947; 95% CI, 0.912–0.983).
Table 2. Univariate and multivariable logistic regression analyses for selecting clinical features of model development (ICP ≥ 20 mmHg).
A total of 107 handcrafted features are extracted, including 18 first order features, 14 shape features, and the last are texture features. The Mann-Whitney U-test identified 87 radiomic features with statistical significance. Following the Spearman rank correlation test, 49 features were selected for further evaluation. The Rad score was calculated using a LASSO regression model, which identified 14 features at 20 mmHg and 18 features at 15 mmHg with nonzero coefficients (depicted in Supplementary Table S3). The coefficients and mean standard errors (MSE) obtained from 10-fold cross-validation are presented in Figures 3A,B, while Figure 3C shows the coefficient values for the final selected features at 20 mmHg. For the 15 mmHg data, refer to Supplementary Figure S1.
Figure 3. Radiomics feature selection using the LASSO algorithm and performance of the radiomics signature model. (A) LASSO coefficient profiles for each feature. Different colored lines represent the corresponding coefficients of each feature. (B) Selection of the tuning parameter (λ) in the LASSO model. (C) Feature weight coefficients after selection. (D) ROC curves for the radiomics signature LR model in both the training and testing cohorts. LASSO, least absolute shrinkage and selection operator; ROC, receiver operating characteristic.
Figure 4 illustrates the ROC curves for four distinct radiomic models: LR, SVM, RF, and KNN, evaluated in both the training and validation cohorts (Figures 4A,B for 20 mmHg, Figures 4C,D for 15 mmHg). In the 20 mmHg training cohort, the RF model demonstrated the highest efficacy, achieving an AUC of 0.941, with an accuracy of 0.893, sensitivity of 0.694, specificity of 0.982, PPV of 0.944, and NPV of 0.878. In contrast, the LR model outperformed the others in the 20 mmHg validation cohort, with an AUC of 0.840, accuracy of 0.853, sensitivity of 0.696, specificity of 0.892, PPV of 0.800, and NPV of 0.858 (see Table 3 for further details). The RF model's performance in both cohorts indicated potential overfitting. Consequently, to ensure robustness and generalizability, the LR model was considered the most suitable for clinical application (Figure 3D). The same situation was also observed in both cohorts at 15 mmHg (Supplementary Figure S1).
Figure 4. ROC analysis of different models on rad signature. (A) ROC curves for the training cohort at 20 mmHg. (B) ROC curves for the testing cohort at 20 mmHg. (C) ROC curves for the training cohort at 15 mmHg. (D) ROC curves for the testing cohort at 15 mmHg. LR, logistic regression; SVM, support vector machine; RF, random forest; AUC, area under the curve; ROC, receiver operating characteristic.
Table 3. Diagnostic performance of different models for predicting IH in training and test cohorts (≥20 mmHg).
At the threshold of 20 mmHg, the performance of the clinical, radiomics, and nomogram models is also summarized in Table 3. In the training cohort (Figure 5A), the clinical model achieved an AUC of 0.803, the radiomics model achieved an AUC of 0.908, and the nomogram model achieved an AUC of 0.918. In the testing cohort (Figure 5D), the clinical model attained an AUC of 0.735, the radiomics model reached an AUC of 0.891, and the nomogram model achieved an AUC of 0.902. The decision curve analysis (DCA) of all three models is shown in Figures 5B,E, demonstrating that, within the training cohort, the nomogram model and radiomics model provided a higher net benefit compared to the clinical model in predicting IH. Calibration curves, presented in Figures 5C,F, illustrate the concordance between the predicted and observed IH in both cohorts. The Hosmer–Lemeshow test indicated no significant difference between the predicted and observed lines in both the training (P = 0.319) and testing cohorts (P = 0.125). Additionally, the DeLong test confirmed that the nomogram and radiomics models outperformed the clinical model in predictive accuracy in the training cohort (P = 0.001), while no significant advantage was observed in the testing cohort (P = 0.115). Figure 6 illustrates the nomogram designed for clinical application, where the total score represents the probability of IH (20 mmHg) in post-DC TBI patients. For the 15 mmHg data, refer to Supplementary Figures S2, S3.
Figure 5. Performance evaluation of clinical, radiomics, and nomogram models using AUC, DCA, and calibration curves. (A–C) Results for the training cohort. (D–F) Results for the testing cohort. AUC, area under the curve; DCA, decision curve analysis.
Figure 6. Clinical application of the nomogram in the prediction of IH (≥20 mmHg) in post-DC TBI patients.
The relationships between features and IH post-DC, which were used to construct the optimal predictive model (clinical-radiomic models), were analyzed using the SHAP algorithm. In the IH prediction model (≥20 mmHg), we identified that features such as original_firstorder_Maximum, original_shape_Voxel Volume, and original_gldm_Dependence NonUniformity Normalized were positively correlated with the occurrence of IH (Figure 7A).
Figure 7. SHAP summary plots for the clinical-radiomic model in predicting intracranial hypertension. (A) Feature relevance and the contribution of combined features to the model's predictive performance. (B) SHAP force plots illustrating how the clinical-radiomic model distinguishes treatment responses in two patients. Patient 1 has intracranial hypertension, while Patient 2 does not. The data presented in this figure are shown with two decimal places following Z-score normalization. SHAP, Shapley Additive Explanations.
Furthermore, we demonstrated how to interpret the assessment of a single patient within the model. For patient 1, the SHAP value was higher than the baseline value, indicating that this patient exhibited IH. The arrows representing the features highlighted their quantitative contribution to the assessment of elevated intracranial pressure (ICP) (Figure 7B). For instance, the value of the feature original_glcm_Inverse Variance was −1.263 for this patient, which positively contributed to the SHAP value. In contrast, for patient 2, who had normal ICP, the SHAP value was noticeably lower than the baseline value. The feature original_glrm_Gray Level NonUniformity Normalized had a value of 2.3798, which decreased the SHAP value, indicating a lower likelihood of IH.
Machine learning provides objective, reliable, and accurate models to support clinical decision-making (17). Several studies (18–21) have highlighted its effectiveness in assisting medical practitioners in identifying IH. However, radiomic studies leveraging ONS ultrasound images through machine learning remain limited. Our study demonstrates that ultrasound-based radiomic models not only predict the early onset of increased ICP following DC but also improve diagnostic performance when combined with clinical features in a nomogram. These findings suggest a promising non-invasive assessment method with practical value and reliability for the early detection of IH in post-DC patients.
As is well known, patients undergoing DC do not conform to the Monroe-Kelly doctrine. This is because, after DC, the cranial contents can bulge towards the decompressive bone window. Such changes not only disrupt the original equilibrium state but also affect cerebrovascular autoregulation. Consequently, the traditional pressure-regulating mechanism, which relies on a closed cranial cavity, becomes compromised. In this scenario, the standard invasive ICP thresholds (e.g., 20 mmHg) are no longer applicable to DC patients. Furthermore, the clinical applicability of indices such as the PI, EDV and related non-invasive ICP estimation formulas is also called into question. Although some studies (22, 23) have attempted to assess ICP by observing changes in the ONSD post-DC, these approaches are limited by small sample sizes, poor reproducibility, and high operator dependence, making large-scale clinical application difficult.
Scholars hold differing views on the threshold for defining IH and its management after primary DC (24, 25). Some suggest that when ICP reaches 15 mmHg, brain tissue in post-DC patients may already be suffering from IH-related damage, necessitating proactive clinical interventions. In our study, we utilized two ICP thresholds (15 mmHg and 20 mmHg) to distinguish post-DC IH. The results revealed that radiomic features based on ONS achieved stable AUC values around 0.9 in both the training (0.908 vs. 0.950) and testing cohorts (0.891 vs. 0.889) for both thresholds. In contrast, clinical features performed worse at the 15 mmHg threshold compared to 20 mmHg in both training (0.795 vs. 0.808) and testing (0.681 vs. 0.735) cohorts. This suggests that ONS radiomic features are more advantageous in detecting early-stage ICP elevation.The observed results may be attributed to radiomics extracted from ONS imaging, which uncover subtle imaging features imperceptible to the human eye, providing additional insights for IH prediction. Meanwhile, the limited ultrasonic ONSD expansion from 15 to 20 mmHg, combined with operator measurement errors, contributes to the instability of the clinical model.
Through SHAP analysis, we observed that key radiomic features primarily originate from the GLCM and the GLSZM. GLCM features, such as contrast and correlation, capture the spatial distribution relationships of pixel intensities within the ONS. Early IH leads to ONS enlargement, causing blurred and uneven reflective interfaces within the ROI. These changes alter GLCM feature values, providing critical diagnostic information. GLSZM features, particularly those representing large low-gray regions, highlight changes in the proportion of hypoechoic or anechoic areas within the ONS. These alterations are associated with increased fluid volume and tissue acoustic property changes due to edema, further aiding in the diagnosis of IH. In terms of shape features, parameters such as roundness and aspect ratio are influenced by variations in the shape of the ONS caused by changes in ONSD. Alterations in roundness can effectively quantify these changes, providing valuable diagnostic insights. After rigorous selection and validation, these features collectively characterize the pathophysiological changes of IH from multiple dimensions, offering key information to support model predictions and providing a foundation for clinical interpretation.
Additionally, our clinical data indicate that post-DC patients with IH often exhibit ONS dilation alongside increased PI and reduced EDV and MV, consistent with most studies (26–29). However, minimal variations in respiratory rate, heart rate, and MAP between groups can be attributed to stringent postoperative ICP management, ensuring PaO2 and PCO2 are maintained within optimal ranges. Although the clinical signature model achieved an AUC of 0.803 in the training set and 0.735 in the test set, these results fall short compared to previous studies (25) reporting AUCs of 0.88–0.99. Discrepancies are largely due to factors such as race, age, and measurement standards significantly influencing ONSD measurements, leading to varying cutoff values across studies (8). Summarizing these findings, we conclude that ONSD and its related clinical models, at the current stage, lack the accuracy and reliability required to replace invasive ICP monitoring, especially in post-DC TBI patients.
Finally, in our comparative analysis of predictive models, LR demonstrated superior predictive performance compared to SVM, RF, and KNN models, consistent with findings reported by Kim (27). This advantage likely stems from the intrinsic linearity of early IH data, which aligns with the linear algorithm of LR, enabling it to outperform the nonlinear approaches of RF and SVM. Although the RF model achieves high accuracy through ensemble learning across multiple decision trees, its generalizability is limited in smaller sample sizes, making it susceptible to overfitting. In contrast, the LR model exhibits greater stability and efficiency, achieving comparable predictive performance with smaller datasets, even when compared to larger training cohorts.
This study has several notable limitations. First, its retrospective and single-center design may have introduced selection bias. Second, there remain ongoing debates regarding the subjective nature of manual segmentation, particularly in establishing boundaries. Future efforts aim to achieve full automation through deep learning techniques. Third, unlike typical TBI cases, post-DC TBI patients do not conform to the Monroe-Kelly doctrine, and their ICP thresholds are less clearly defined. Additional parameters, such as surgical incision size, operation duration, and blood loss, may be necessary for more accurate classification and evaluation. Fourth, the study focused solely on ONSD imaging, overlooking the potential advantages of a multimodal approach, which has shown significant promise in similar contexts. Further prospective studies are needed to explore whether integrating multimodal radiomic data could improve the model's predictive accuracy. Fifth, the small sample size of the training and testing cohorts underscores the need for larger studies to improve the predictive model. Expanding the sample size, using stratified sampling, and applying regularization techniques like LASSO or ridge regression can reduce bias, prevent overfitting, and enhance model stability through further validation.
Our study introduced and validated an integrated model based on ONS ultrasonic imaging that combines clinical-ultrasonic parameters with radiomics signatures for the early detection of IH within the post-DC TBI patients at different thresholds. The model, optimized with logistic regression (LR), demonstrated superior diagnostic efficacy in our analysis. It offers a novel avenue for precision medicine and has the potential to improve clinical treatment strategies, ultimately aiming to enhance patient outcomes in the critical period following TBI.
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
The studies involving humans were approved by the Ethics Committee of the second affiliated hospital of the First Medical University of Shandong (2021-A-037). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.
ZF: Conceptualization, Methodology, Writing – original draft, Writing – review & editing. LP: Conceptualization, Methodology, Writing – original draft, Writing – review & editing. LG: Methodology, Writing – original draft. CQ: Formal Analysis, Software, Writing – original draft. YY: Software, Writing – original draft. JZ: Investigation, Supervision, Writing – original draft. YL: Conceptualization, Funding acquisition, Project administration, Writing – original draft, Writing – review & editing.
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was supported by Tai'an Innovation Development Program for Science and Technology (Nos. 2021NS433).
We would like to express our gratitude to all TBI patients who participated in this study.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmedt.2025.1485244/full#supplementary-material
ALARA, as low as reasonably achievable; BMI, body mass index; Cis, confidence intervals; DCA, decision curve analysis; DICOM, digital imaging and communications in medicine; EDV, end-diastolic velocity (MCA); EF, ejection fraction; GLCM, gray-level co-occurrence matrix; GLRLM, gray-level run-length matrix; GLSZM, gray-level size zone matrix; GLDM, gray-level dependence matrix; ICC, intraclass correlation coefficient; ICP, intracranial pressure; KNN, K-nearest neighbors; LASSO, least absolute shrinkage and selection operator; LR, logistic regression; MAP, mean arterial pressure; MCA, middle cerebral artery; MV, mean velocity (MCA); NPV, negative predictive value; ONS, optic nerve sheath; ONSD, optic nerve sheath diameter; ORs, odds ratios; PaO2, oxygen partial pressure (Artery); PaCO2, carbon dioxide partial pressure (Artery); PACS, picture archiving and communication systems; PI, pulse index (middle cerebral artery); PPV, positive predictive value; PSV, peak systolic velocity (MCA); RF, random forest; ROC, receiver operating characteristic; ROI, region of interest; SVM, support vector machine; TBI, traumatic brain injury; TCCD, transcranial color doppler; DC, decompressive craniectomy; GCS, glasgow coma scale; ICPi, invasive ICP; CPP, cerebral perfusion pressure.
1. Hawryluk GWJ, Citerio G, Hutchinson P, Kolias A, Meyfroidt G, Robba C, et al. Intracranial pressure: current perspectives on physiology and monitoring. Intensive Care Med. (2022) 48(10):1471–81. doi: 10.1007/s00134-022-06786-y
2. Murray NM, Wolman DN, Mlynash M, Threlkeld ZD, Christensen S, Heit JJ, et al. Early head computed tomography abnormalities associated with elevated intracranial pressure in severe traumatic brain injury. J Neuroimaging. (2021) 31(1):199–208. doi: 10.1111/jon.12799
3. Netteland DF, Aarhus M, Sandset EC, Padayachy L, Helseth E, Brekken R. Noninvasive assessment of intracranial pressure: deformability index as an adjunct to optic nerve sheath diameter to increase diagnostic ability. Neurocrit Care. (2024) 41(2):479–88. doi: 10.1007/s12028-024-01955-x
4. Demetriades AK. Intracranial pressure monitoring after primary decompressive craniectomy: is it useful? Acta Neurochir (Wien). (2017) 159(4):623–4. doi: 10.1007/s00701-017-3119-y
5. Picetti E, Caspani ML, Iaccarino C, Pastorello G, Salsi P, Viaroli E, et al. Intracranial pressure monitoring after primary decompressive craniectomy in traumatic brain injury: a clinical study. Acta Neurochir (Wien). (2017) 159(4):615–22. doi: 10.1007/s00701-017-3118-z
6. Chesnut RM, Temkin N, Carney N, Dikmen S, Rondina C, Videtta W, et al. A trial of intracranial-pressure monitoring in traumatic brain injury. N Engl J Med. (2012) 367(26):2471–81. doi: 10.1056/NEJMoa1207363
7. Melhem S, Shutter L, Kaynar A. A trial of intracranial pressure monitoring in traumatic brain injury. Crit Care. (2014) 18(1):302. doi: 10.1186/cc13713
8. Robba C. Measuring optic nerve sheath diameter using ultrasonography for the detection of non invasive intracranial pressure: what it is and what it is not. Arq Neuropsiquiatr. (2022) 80(6):547–9. doi: 10.1590/0004-282x-anp-2022-e006
9. Bhargava V, Tawfik D, Tan YJ, Dunbar T, Haileselassie B, Su E. Ultrasonographic optic nerve sheath diameter measurement to detect intracranial hypertension in children with neurological injury: a systematic review. Pediatr Crit Care Med. (2020) 21(9):e858–e68. doi: 10.1097/PCC.0000000000002453
10. Dubourg J, Javouhey E, Geeraerts T, Messerer M, Kassai B. Ultrasonography of optic nerve sheath diameter for detection of raised intracranial pressure: a systematic review and meta-analysis. Intensive Care Med. (2011) 37(7):1059–68. doi: 10.1007/s00134-011-2224-2
11. Helmke K, Hansen HC. Fundamentals of transorbital sonographic evaluation of optic nerve sheath expansion under intracranial hypertension II. Patient study. Pediatr Radiol. (1996) 26(10):706–10. doi: 10.1007/BF01383384
12. Yao S, Yao D, Huang Y, Qin S, Chen Q. A machine learning model based on clinical features and ultrasound radiomics features for pancreatic tumor classification. Front Endocrinol (Lausanne). (2024) 15:1381822. doi: 10.3389/fendo.2024.1381822
13. Zhang M, Li X, Zhou P, Zhang P, Wang G, Lin X. Prediction value study of breast cancer tumor infiltrating lymphocyte levels based on ultrasound imaging radiomics. Front Oncol. (2024) 14:1411261. doi: 10.3389/fonc.2024.1411261
14. Chen W, Liu F, Wang R, Qi M, Zhang J, Liu X, et al. End-to-end deep learning radiomics: development and validation of a novel attention-based aggregate convolutional neural network to distinguish breast diffuse large B-cell lymphoma from breast invasive ductal carcinoma. Quant Imaging Med Surg. (2023) 13(10):6598–614. doi: 10.21037/qims-22-1333
15. Zhang S, Liu R, Wang Y, Zhang Y, Li M, Wang Y, et al. Ultrasound-base radiomics for discerning lymph node metastasis in thyroid cancer: a systematic review and meta-analysis. Acad Radiol. (2024) 31(8):3118–30. doi: 10.1016/j.acra.2024.03.012
16. Rodriguez-Perez R, Bajorath J. Interpretation of compound activity predictions from complex machine learning models using local approximations and shapley values. J Med Chem. (2020) 63(16):8761–77. doi: 10.1021/acs.jmedchem.9b01101
17. Parmar C, Grossmann P, Bussink J, Lambin P, Aerts H. Machine learning methods for quantitative radiomic biomarkers. Sci Rep. (2015) 5:13087. doi: 10.1038/srep13087
18. Jung MK, Ahn D, Park CM, Ha EJ, Roh TH, You NK, et al. Prediction of serious intracranial hypertension from low-resolution neuromonitoring in traumatic brain injury: an explainable machine learning approach. IEEE J Biomed Health Inform. (2023). doi: 10.1109/JBHI.2023.3240460
19. Schweingruber N, Mader MMD, Wiehe A, Roder F, Gottsche J, Kluge S, et al. A recurrent machine learning model predicts intracranial hypertension in neurointensive care patients. Brain. (2022) 145(8):2910–9. doi: 10.1093/brain/awab453
20. Doshi H, Solli E, Elze T, Pasquale LR, Wall M, Kupersmith MJ. Unsupervised machine learning shows change in visual field loss in the idiopathic intracranial hypertension treatment trial. Ophthalmology. (2022) 129(8):903–11. doi: 10.1016/j.ophtha.2022.03.027
21. Doshi H, Solli E, Elze T, Pasquale LR, Wall M, Kupersmith MJ. Unsupervised machine learning identifies quantifiable patterns of visual field loss in idiopathic intracranial hypertension. Transl Vis Sci Technol. (2021) 10(9):37. doi: 10.1167/tvst.10.9.37
22. Wang J, Li K, Li H, Ji C, Wu Z, Chen H, et al. Ultrasonographic optic nerve sheath diameter correlation with ICP and accuracy as a tool for noninvasive surrogate ICP measurement in patients with decompressive craniotomy. J Neurosurg. (2020) 133(2):514–20. doi: 10.3171/2019.4.JNS183297
23. Xu X, Lu Y, Liu J, Xu R, Zhao K, Tao A. Diagnostic value of the combination of ultrasonographic optic nerve sheath diameter and width of crural cistern with respect to the intracranial pressure in patients treated with decompressive craniotomy. Neurocrit Care. (2023) 39(2):436–44. doi: 10.1007/s12028-023-01711-7
24. Rubiano AM, Figaji A, Hawryluk GW. Intracranial pressure management: moving beyond guidelines. Curr Opin Crit Care. (2022) 28(2):101–10. doi: 10.1097/MCC.0000000000000920
25. Faria BCD, Sacramento LGG, Queiroz AVR, Leite FAD, Oliveira H, Kimura TY, et al. The use of noninvasive measurements of intracranial pressure in patients with traumatic brain injury: a narrative review. Arq Neuropsiquiatr. (2023) 81(6):551–63. doi: 10.1055/s-0043-1764411
26. Okpara SE, Uche EO, Iloanusi NI, Iloabachie IC, Mezue WC, Onyia EE, et al. Correlation of the transorbital ultrasonographic optic nerve sheath diameter with intracranial pressure measured intraoperatively in infants with hydrocephalus. J Neurosurg Pediatr. (2024) 33(4):334–42. doi: 10.3171/2023.12.PEDS23273
27. Kim KH, Kang HK, Koo HW. Prediction of intracranial pressure in patients with an aneurysmal subarachnoid hemorrhage using optic nerve sheath diameter via explainable predictive modeling. J Clin Med. (2024) 13(7):2107. doi: 10.3390/jcm13072107
28. Sitanaya SN, Kamayanti F, Nugroho HA, Prabowo B. Comparing ultrasonographic optic nerve sheath diameter to head computed tomography scan to predict intracranial pressure elevation. SAGE Open Med. (2022) 10:20503121221077834. doi: 10.1177/20503121221077834
Keywords: ultrasound imaging, severe traumatic brain injury, intracranial pressure, ultrasound radiomics, machine learning, optic nerve sheath diameter, transcranial color doppler
Citation: Fu Z, Peng L, Guo L, Qin C, Yu Y, Zhang J and Liu Y (2025) Ultrasound-based radiomics and clinical factors-based nomogram for early intracranial hypertension detection in patients with decompressive craniotomy. Front. Med. Technol. 7:1485244. doi: 10.3389/fmedt.2025.1485244
Received: 23 August 2024; Accepted: 13 January 2025;
Published: 5 February 2025.
Edited by:
Jeeun Kang, Johns Hopkins University, United StatesReviewed by:
Santiago Cepeda, Hospital Universitario Río Hortega, SpainCopyright: © 2025 Fu, Peng, Guo, Qin, Yu, Zhang and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yan Liu, MjI3NTMxNzUzM0BxcS5jb20=
†These authors have contributed equally to this work and share first authorship
‡ORCID:
Yan Liu
orcid.org/0009-0001-6749-1225
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.