
95% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
ORIGINAL RESEARCH article
Front. Med. , 24 March 2025
Sec. Translational Medicine
Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1547588
Background: Early diagnosis of non-union fractures is vital for treatment planning, yet studies using bone morphometric parameters for this purpose are scarce. This study aims to create a fracture micro-CT image dataset, design a deep learning algorithm for fracture segmentation, and develop an early diagnosis model for fracture non-union.
Methods: Using fracture animal models, micro-CT images from 12 rats at various healing stages (days 1, 7, 14, 21, 28, and 35) were analyzed. Fracture lesion frames were annotated to create a high-resolution dataset. We proposed the Vision Mamba Triplet Attention and Edge Feature Decoupling Module UNet (VM-TE-UNet) for fracture area segmentation. And we extracted bone morphometric parameters to establish an early diagnostic evaluation system for the non-union of fractures.
Results: A dataset comprising 2,448 micro-CT images of the rat fracture lesions with fracture Region of Interest (ROI), bone callus and healing characteristics was established and used to train and test the proposed VM-TE-UNet which achieved a Dice Similarity Coefficient of 0.809, an improvement over the baseline's 0.765, and reduced the 95th Hausdorff Distance to 13.1. Through ablation studies, comparative experiments, and result analysis, the algorithm's effectiveness and superiority were validated. Significant differences (p < 0.05) were observed between the fracture and fracture non-union groups during the inflammatory and repair phases. Key indices, such as the average CT values of hematoma and cartilage tissues, BS/TS and BS/TV of mineralized cartilage, BS/TV of osteogenic tissue, and BV/TV of osteogenic tissue, align with clinical methods for diagnosing fracture non-union by assessing callus presence and local soft tissue swelling. On day 14, the early diagnosis model achieved an AUC of 0.995, demonstrating its ability to diagnose fracture non-union during the soft-callus phase.
Conclusion: This study proposed the VM-TE-UNet for fracture areas segmentation, extracted micro-CT indices, and established an early diagnostic model for fracture non-union. We believe that the prediction model can effectively screen out samples of poor fracture rehabilitation caused by blood supply limitations in rats 14 days after fracture, rather than the widely accepted 35 or 40 days. This provides important reference for the clinical prediction of fracture non-union and early intervention treatment.
Fractures are a global public health issue. The Global Burden of Disease Study 2019 (1) indicated that there were 178 million new fracture cases worldwide in 2019, with a total of 455 million cases. Fracture healing can be divided into four stages: the inflammatory phase, the repair phase, which include soft-callus formation phase and hard-callus formation phase, and the remodeling phase. Immediately after a fracture occurs, a hematoma forms (2). In the following weeks, the cartilage callus forms (3), which then undergoes mineralization a few weeks or months post-fracture (4), and finally, the original bone is remodeled over several months to years (5). However, among all fracture cases, ~5%−10% develop into non-union or delayed union, particularly in instances of compromised vascular supply (6), with the clinical diagnosis of non-union requiring 9 months (7, 8). Non-union of fractures is defined as a fracture that has not healed for at least 9 months and shows no signs of healing for 3 consecutive months (9). Delayed or non-union fractures can lead to long-term pain, disability, and repeated surgical interventions, consequently posing a significant economic burden and productivity loss to both the patient and society (10–12). Thus, early diagnosis of non-union fractures is crucial in terms of the formulation of treatment plans. Despite a plethora of research on the assessment of the progression of fracture healing (13–15), there is currently a lack of publicly available studies that directly utilize bone morphometric parameters for modeling the early diagnosis of non-union fractures.
The specific biological mechanisms behind non-union in avascular fractures remain unclear, and early radiological diagnostic characteristics and targets are not well-defined (13). Ethical and technical constraints have led to a lag in clinical prevention and early diagnosis of non-union, and existing imaging diagnostic methods like X-rays (14, 15) and Computed Tomography (CT) scans (16) often fail to capture the detailed features of fracture sites. The research on fracture healing is further hampered by the scarcity of high-quality public imaging datasets, which inhibits more in-depth studies. Consequently, the establishment of an imaging dataset that facilitates continuous observation of the healing process in ischemic fractures is crucial. Such a dataset would be invaluable for early diagnosis of non-union and for gaining a comprehensive understanding of the healing process.
Micro-Computed Tomography (micro-CT) technology, characterized by its high-resolution capabilities, has become an essential tool in non-invasive small animal imaging research (17). However, repeated scans throughout the healing process can be harmful to humans (18). Rats, with their strong vitality, exhibit minimal impact on healing processes due to weekly micro-CT imaging (19). Rats typically confirm non-union post-fracture in about 40 days, markedly shorter than the 270 days in humans, making them a suitable model for medical research (20). Therefore, this study chose rats as the experimental subjects, utilizing the research team's previous studies on animal models (21) and employing micro-CT imaging as the primary observation tool.
In recent years, with the rapid development of deep learning, models based on Convolutional Neural Networks (CNNs), such as U-Net (22), have made achieved significant progress in medical image processing (23). However, these models struggle to utilize global context effectively in complex 2D medical images (24). The incorporation of Transformer (25), including Swin Transformer (26) and Vision Transformer (ViT) (27), has improved segmentation accuracy but faces challenges with small medical image datasets, which are prone to overfitting. Recently, the Mamba model-based segmentation algorithms have drawn interest in medical imaging. Ruan and Xiang (28) introduced Vision Mamba UNet (VM-UNet), which combines Mamba blocks with the U-Net structure to enhance the model's ability to capture long-range dependencies, replacing traditional convolutional layers. This is particularly relevant for tasks like fracture area segmentation that involve dynamic changes, where there is a lack of specialized automated segmentation algorithms.
The aim of this study is to develop a high-resolution CT image dataset, identify the most indicative parameters for fracture non-union, and construct a predictive model for early diagnosis of non-union. To achieve the aforementioned goals, our main contributions are as follows:
(i) Based on our fracture animal models (21), a total of 45,275 micro-CT images were obtained from 12 rats across various stages of fracture healing (days 1, 7, 14, 21, 28, and 35 post-fracture). Among these, 2,448 frames containing the fracture lesion were annotated, including fracture Region of Interest (ROI), bone callus and healing characteristics, and a high-resolution fracture CT image dataset was established which can not only be used on fracture area segmentation and early diagnostic models for fracture non-union, but also for the study of other fracture-related algorithms and applications in medical education.
(ii) Based on the recently prominent high-performance semantic segmentation algorithm—VM-UNet (28), we have made adaptive improvements for the task of fracture area edge segmentation. Consequently, we proposed the Vision Mamba Triplet Attention and Edge Feature Decoupling Module UNet (VM-TE-UNet), which includes the Edge Feature Decoupling Module (EFDM) and Vision State Space Triplet Attention (VSS-TA) blocks, and also improved the loss functions.
(iii) The fracture site was quantitatively analyzed by extracting bone morphometric parameters based on data derived from micro-CT images, enabling a detailed assessment of the structural characteristics. Four micro-CT indicators were extracted and screened to characterize key changes in inflammatory hematoma tissue, cartilage tissue, mineralized cartilage tissue, and osteogenic tissue, including average CT value, Bone Area Fraction (BS/TS), Bone Volume Fraction (BV/TV), and Bone Area to Volume Ratio (BS/TV). On this basis, and taking into account the existing clinical diagnostic experience, we have constructed an early diagnosis model for non-union fractures. On the 14th day, the Area Under the Curve (AUC) (29), Accuracy (30), and Precision (31) of the early diagnosis model reached 0.995, 0.996, and 0.994, respectively, achieving the diagnosis of fracture non-union during soft-callus formation phase.
Figure 1 illustrates the technical roadmap of this study. Through fracture modeling experiments, micro-CT images were obtained from 12 SD rats at days 1, 7, 14, 21, 28, and 35 post-fracture, with the frames containing the fracture area being annotated to establish a dataset. To address the issue of insufficient image segmentation accuracy in the fracture area, this study proposed VM-TE-UNet and conducted comparative and ablation experiments to verify the effectiveness and advantages of the algorithm. Four micro-CT indicators (the average CT value, BS/TS, BV/TV, and BS/TV) were extracted and screened to characterize key changes in inflammatory hematoma tissue, cartilage tissue, mineralized cartilage tissue, and osteogenic tissue. Based on this, and considering existing clinical diagnostic experience, we have constructed an early diagnosis model for non-union fractures.
All animal experiments have been approved by the experimental animal ethics committee of Tianjin University of China (TJUE-2021-137) and were conducted in strict accordance with relevant regulations. The experiment used a total of 12 male Sprague Dawley (SD) rats (aging 25–35 days, weighting 180–210 g), which were randomly divided into two groups: the Fracture group (F group) and the Fracture + Saphenous Artery Cut-off group (F+S group), with 6 rats in each group. All rats were housed in the experimental animal center of the Fourth Central Hospital of Tianjin, China.
In the surgical procedures, first, the 12 rats were intraperitoneally injected with 0.3% pentobarbital sodium anesthetic at a ratio of 40–50 mg/kg. Once their breathing stabilized, the surface of the right medial malleolus was shaved, and the skin was incised along the long axis. Then, using a molding instrument developed by the research team, a 1 mm wide incision was created from the distal medial side of the ankle joint to the central medial malleolus, reaching part of the cortical bone and trabeculae without causing bone fragments to fall off. For the F+S group, the saphenous artery was cut off. Finally, the skin of both groups was sutured and the wounds were disinfected postoperatively.
Within 35 days post-fracture at six time points (day 1, 7, 14, 21, 28, and 35), a total of 12 rats were scanned (SkyScan-1276, Bruker, Berlin, Germany), with an average of ~650 micro-CT images obtained per scan. The scans used isotropic voxel sizes of 10 μm, an exposure time of 486 ms, a tube voltage of 85 kV, and a tube current of 200 μA. The eligible experimental samples were selected from imaging workstations by two experienced radiologists based on stringent inclusion criteria, which included complete data, absence of significant artifacts or noise, appropriate grayscale levels, and clear tissue differentiation. High-quality, lossless TIFF data meeting these criteria were exported for further image processing and analysis.
Two sets of manual segmentations for all fracture lesions were performed independently by two different experienced radiologists. Dice Similarity Coefficient (DSC) (32) was used to assess the agreement between the two annotators on classifying whether a pixel from a Micro-CT image belonged to the fracture lesion or non-fracture lesion class.
The DSC is defined as:
where A and B are the annotated ROI of the two physicians.
When the DSC exceeds 0.95, it indicates a high level of consistency, and the annotation from either doctor can be selected as the segmentation ROI. If the DSC is below 0.95, the two doctors will discuss and reach a consensus to redefine a unified ROI.
In this study, we proposed the VM-TE-UNet model (Figure 2A) for medical image segmentation, which is an adaption of the recently prominent high-performance semantic segmentation algorithm—VM-UNet (28). This novel model is designed to enhance segmentation performance through the integration of cutting-edge modules.
Figure 2. Overview of the deep learning model. (A) EFDM and VSS-TA block are applied to the overall network structure of VM-UNet. (B) In the specific implementation of EFDM, each rectangular box of the NSM module and TPM module represents a convolution layer, normalization layer and ReLU. (C) In the module structure diagram of the EGM, the numbers in each rectangular box represent the corresponding spatial convolution rate. (D) VSS-TA block, the improved attention mechanism network structure.
The network primarily consists of an encoder, a decoder, and the Edge Feature Decoupling Module (EFDM, Figure 2B). The EFDM is further divided into the Noise Suppression Module (NSM), the Texture Preservation Module (TPM), and the Edge-Guided Module (EGM, Figure 2C). The EGM enhances edge features through a series of atrous convolutions, batch normalization, and activation functions, thereby improving segmentation performance. The NSM learns the characteristics of noise by minimizing the difference between the predicted noise feature map and the ground truth noise mask, achieving noise suppression. The TPM learns and retains texture information in the fracture area by minimizing the difference between the predicted texture feature map and the ground truth texture mask.
The encoder consists of multiple Vision State Space Triplet Attention (VSS-TA) blocks (Figure 2D), each block followed by a patch merging operation, reducing spatial dimensions and increasing feature depth. The VSS-TA block is an integration of the Triplet Attention mechanism (33) on the basis of the original Vision State Space (VSS) block (34), aiming to strengthen the capability of feature extraction. The decoder recovers the spatial dimensions through the patch expanding operation and integrates the EFDM blocks after each hidden layer, thereby bolstering the segmentation accuracy.
Through this modular design, VM-TE-UNet can effectively capture and handle complex fracture area characteristics, significantly improving the accuracy and robustness of segmentation. The introduction of the Triplet Attention mechanism enhances the ability to focus on important features, while the EFDM block further optimizes the segmentation process through feature decoupling, ensuring accurate detection of fracture area boundaries and effective suppression of noise.
The VM-TE-UNet network was trained to minimize the total loss that consists of four parts: loss functions for the three branches of EFDM and the loss function for the VM-TE-UNet network. The total loss is calculated as:
where the LNSM, LTPM, and LEGM are the loss functions for NSM, TPM, and EGM, LSEG represents the segmentation loss of the VM-TE-UNet network, and δ is used to balance the weights of the different loss functions.
LEGM is composed of weighted dice loss LDice and focal loss LFocal, which can be written as:
where α is the weight factor.
LNSM is defined as:
where Yn is the predicted noise feature map and Mn is the ground truth noise mask.
LTPM is defined as:
where Yt is the predicted texture feature map and Mt is the ground truth texture mask.
LSEG is composed of binary cross-entropy loss LBce and dice loss LDice, which can be written as:
where the weight factors of λ1 and λ2 are initially set to 1.
In our experiments, the proposed VM-TE-UNet network is trained with Adam optimizer with a learning rate of 1e−5 and a batch size of 16. All the models are built using PyTorch platform and trained on a NVIDIA RTX 3090 GPU with a memory of 24 GB.
We employed three metrics to evaluate the performance of our model, which are as follows:
(i) The DSC is defined as the Equation 1, where A and B are the manual and predicted segmented masks, respectively. It is a statistical tool for comparing the similarity and consistency of sample sets.
(ii) The Hausdorff Distance (HD) (35) is the greatest of all the distances from a point in one set to the closest point in the other set, measuring the accuracy of boundary segmentation. It's defined as:
where A and B are the two sample sets and d (a, b) is the Euclidean distance between A and B. The 95th Hausdorff Distance (HD95) mitigates the impact of outliers by calculating the distance at the 95th percentile, making it more suitable for measuring boundary differences in medical imaging.
The following indices of inflammatory hematoma tissue, cartilage tissue, mineralized cartilage tissue, and osteogenic tissue structure and composition were evaluated from the micro-CT images: the average CT value, BS/TS, BV/TV, and BS/TV (36). The segmentation of tissues within the fracture lesion was performed using specific Hounsfield Unit (HU) thresholds: inflammatory hematoma tissue was delineated at 0–224 HU, cartilage tissue at 225–330 HU, mineralized cartilage tissue at 331–700 HU, and osteogenic tissue at 701–1,000 HU (37).
The bone morphometric parameters were analyzed in this study to evaluate the structural characteristics of tissues, with all measurements conducted using CTvox (https://www.microphotonics.com/micro-ct-systems/visualization-software/) to ensure precision and consistency. The average CT value of a specific tissue was determined by averaging the CT values of all pixels within that tissue. For the four segmented tissues, the pixel count in each tissue was calculated for every image, yielding the Bone Area (BS). By summing the BS of all tissues, the Total Area (TS) was obtained. Similarly, micro-CT images of each sample were analyzed to calculate the Bone Volume (BV) and Total Volume (TV). These parameters were further used to derive ratios such as BS/TS, BV/TV, and BS/TV for each tissue, providing a comprehensive assessment of their structural properties.
Based on the extracted data, we performed statistical analyses and selected relevant indices to construct an early diagnostic model for fracture non-union by integrating clinical diagnostic approaches. This study employed a variety of machine learning classification methods to enhance the performance and robustness of the model, including Support Vector Machine (SVM) (38), Multilayer Perceptron (MLP) (39), Decision Tree (40), and Random Forest (41). The evaluation metrics include Accuracy (30), Precision (31) and F1 Score (42), which are defined as follows:
where TP is true positives, TN is true negatives, FP is false positives, and FN is false negatives (43).
Additionally, the AUC (29), which stands for the area under the Receiver Operating Characteristic (ROC) curve, is also used to evaluate the performance of the model. The five-fold cross-validation was adopted.
Within 35 days after the fracture, at six time points (day 1, 7, 14, 21, 28, and 35), a total of 12 rats were scanned, collecting 45,275 micro-CT images, of which 2,448 frames contained the fracture lesion. Figure 3 shows the scan results of two rats in two groups. We made an incision in the medial malleolus of the rats (Figures 3A, C). After 5 weeks of healing, the medial malleolus of the group F rat finally healed at week 5 (Figure 3B), but there was no sign of healing in the F+S group (Figure 3D).
Figure 3. Micro-CT images of two rats in two groups. (A, B) Are micro-CT images of a group F rat on days 1 and 35 post-fracture. (C, D) Are micro-CT images of a group F+S rat on days 1 and 35 post-fracture.
In the fracture segmentation part of this study, 9 rats were used as the segmentation training set, 1 rat as the validation set, and 2 rats as the test set. For the early fracture non-union prediction, all 12 rats were utilized in a five-fold cross-validation approach for machine learning, ensuring robust model evaluation and performance assessment.
The comparison of the proposed VM-TE-UNet with previous state-of-the-art methods on the avascular fracture non-union micro-CT dataset is presented in Table 1. In the domain of medical image segmentation, UNet (22) and Vision Transformer (ViT) (27) are established models, yet they exhibit suboptimal performance in this particular task. Specifically, ViT underperforms relative to UNet across all metrics, showing its limitations in segmenting fracture lesions. Conversely, Swin-UNet (44) and SegResNet (45) have demonstrated some performance improvements, albeit not markedly. EfficientNet (46) has achieved mixed results, with a DSC of 0.7313, indicating a marginal advancement in boundary treatment.
In comparison to the conventional UNet, VM-UNet (28) has substantially enhanced segmentation accuracy by incorporating the VSS block, achieving a DSC of 0.765. Furthermore, VM-TA-UNet, which introduces a Triplet Attention mechanism (33), has bolstered feature extraction capabilities, increasing the DSC to 0.798 and demonstrating superior performance in managing complex boundaries. Notably, the proposed VM-TE-UNet outperforms all other models across evaluation metrics, with a DSC and HD95 of 0.809, and 13.1, respectively. The experimental outcomes indicate that the proposed methodology effectively mitigates issues related to boundary noise and texture information interference.
Figure 4 shows a more precise and fine segmentation output of the proposed network than the existing baselines. It is apparent that traditional UNet and ViT models exhibit subpar performance when segmenting complex boundaries, often resulting in notable false positive issues. Although Swin-UNet and SegResNet have marginally improved boundary segmentation and reduced some false positive occurrences, the segmentation results remain inaccurate due to the models' inability to fully capture the complex fracture boundaries.
In contrast, the VM-UNet model significantly enhances the recognition of complex boundary areas. However, due to the Mamba architecture's approach to image sequence processing, the model pays less attention to boundary information during segmentation, especially when dealing with irregular or blurred boundaries, leading to local information loss and boundary blurring. The VM-TA-UNet, which significantly improves segmentation performance, still experiences missegmentation in certain areas, particularly with more complex structures, indicating that despite the reduction in false positive phenomena, the model has not yet fully eliminated errors. Finally, the VM-TE-UNet, which integrates the EFDM, further enhances the model's segmentation capabilities. As can be seen from the figures, the segmentation results after EFDM improvement are smoother, with significantly enhanced boundary accuracy and effective control over false positive issues. Compared to other segmentation algorithms, VM-TE-UNet excels in processing noise information and capturing subtle boundary features, resulting in more precise and complete segmentation of fracture areas, providing more valuable technical support for the early diagnosis of fracture non-union.
Table 2 presents the performance of the different attention mechanisms in the fracture lesion segmentation task. A comparison was made among multiple spatial, channel, and hybrid attention mechanisms to evaluate their impact on segmentation accuracy (DSC and HD95). It is observed that single-type attention mechanisms, such as Non-local and SE, do not significantly enhance segmentation performance. In contrast, hybrid attention mechanisms like CBAM (47) and DANet (48) integrate feature attention across different dimensions, allowing for more accurate capture of complex structures in fracture areas. Notably, the Triplet Attention mechanism (33) demonstrates superior segmentation outcomes, achieving an encouraging DSC of 0.798 and significantly reducing HD95 to 15.4. By processing input tensors through rotations in height, width, and channel dimensions, Triplet Attention efficiently captures three-dimensional feature interactions, enhances local detail focus, improves global context understanding, and is computationally suitable for high-resolution medical imaging.
Enhancing VM-UNet's performance, these improvements also pave the way for integrating Mamba architecture with attention mechanisms, potentially increasing segmentation accuracy and fostering innovative, precise solutions in medical image segmentation.
To verify the effectiveness of each branch within the EFDM, we conducted an ablation study. The results are shown in Table 3.
Compared to the basic VM-UNet without the introduction of any EFDM branches and with only the VSS block improved, the addition of the EGM branch alone increased the DSC to 0.804, and reduced HD95 to 14.9. Further inclusion of the NSM branch enhanced the DSC to 0.806, with HD95 lowering to 14.4. Ultimately, the incorporation of the TPM branch achieved the highest DSC of 0.809, with HD95 reducing to 13.1. The results indicate that EGM, NSM, and TPM branches, respectively, refine and enhance the segmentation results by strengthening boundary features, eliminating noise, and preserving texture information.
The average CT value, BS/TS, BS/TV, and BV/TV for both groups of rats are shown in Table 4.
Table 4. The average CT values, BS/TS, BS/TV, and BV/TV of the F group and F+S group on days 1, 7, 14, 21, 28, and 35 post-fracture.
On day 1 postoperatively, the average CT value of the inflammatory hematoma tissue in the F group was 88.5 HU, and in the F+S group, it was 76.0 HU, with no significant difference. However, on days 7, 14, 21, and 28 post-fracture, the values in the F group (79.0, 103, 85.9, and 105 HU) were significantly higher than in the F+S group (59.2, 50.4, 64.8, and 71.3 HU). Similarly, the average CT value of the cartilage tissue in the F group (211, 174, 203, 204, and 196 HU) were significantly higher than in the F+S group (155, 149, 144, 145, and 156 HU) on days 1, 7, 14, 21, and 28 post-fracture.
On days 1, 7, 14, 21, and 28 post-fracture, the BS/TS of the inflammatory hematoma tissue and cartilage tissue in the F group (4.83%, 5.54%, 3.97%, 2.31%, and 6.89% for inflammatory hematoma tissue, respectively; 5.27%, 5.26%, 4.16%, 3.14%, and 6.25% for cartilage tissue, respectively) were significantly lower than in the F+S group (13.0%, 8.35%, 5.29%, 4.13%, and 10.5% for inflammatory hematoma tissue; respectively, 6.11%, 5.74%, 5.00%, and 7.46% for cartilage tissue, respectively). On day 21 post-fracture, the BS/TS of the mineralized cartilage tissue in the F group (30.4%) was lower than in the F+S group (35.4%). However, on days 28 and 35 post-fracture, the values in the F group (37.2% and 33.2%) were significantly higher than in the F+S group (33.9% and 30.8%).
On day 1 postoperatively, the BS/TV of the inflammatory hematoma tissue and cartilage tissue in the F group (0.0661 and 0.0612 mm−1) were significantly higher than in the F+S group (0.182 and 0.139 mm−1). On day 1 postoperatively, the BS/TV of the mineralized cartilage tissue in the F group (0.300 mm−1) was lower than in the F+S group (0.570 mm−1). However, on days 21, 28, and 35 post-fracture, the values in the F group (0.344, 0.346, and 0.346 mm−1) were significantly higher than in the F+S group (0.266, 0.290, and 0.224 mm−1). The BS/TV of the osteogenic tissue in the F group (0.336 mm−1) was lower than in the F+S group (0.648 mm−1) on day 1 postoperatively. However, the values in the F group (0.490, 0.598, 0.674, 0.439, and 0.519 mm−1) were significantly higher than in the F+S group (0.400, 0.410, 0.311, 0.310, and 0.370 mm−1) on days 7, 14, 21, 28, and 35 post-fracture.
On day 21 post-fracture, the BV/TV of the osteogenic tissue in the F group was 61.7%, while in the F+S group, it was 47.6%, with a significant difference.
According to the analysis presented in Section 3.3.1, we selected the following 10 indices for constructing the early diagnosis model of fracture non-union: (i) the average CT value of inflammatory hematoma and cartilage tissues, (ii) the BS/TS for all four types of tissues, (iii) the BS/TV of mineralized cartilage and osteogenic tissues, and (iv) the BV/TV of osteogenic tissue. Table 5 lists the AUC, Accuracy, Precision, and F1 Score of SVM, MLP, Decision Tree, and Random Forest models at different time points post-fracture.
Table 5. The performance of SVM, MLP, decision tree, and random forest models at different time points post-fracture.
On the first day post-fracture, due to inter-individual variations among rats, the micro-CT indices exhibited significant discrepancies, which impacted the early diagnostic model. Therefore, we excluded the data of the first day when establishing the model. The SVM, MLP, decision tree, and random forest models demonstrated optimal performance on days 28, 14, 28, and 28, respectively. Moreover, among all models and time points, the MLP model on day 14 post-fracture achieved the highest evaluation metrics, with AUC, Accuracy, Precision, and F1 Score reaching 0.995, 0.996, 0.994, and 0.997, respectively. In summary, the MLP model is capable of early diagnosis of fracture non-union on day 14 post-fracture in rats, which is the soft-callus formation phase and equivalent to a month post-fracture in humans.
These analyses indicate that our early diagnosis model has the potential to achieve the diagnosis during the soft-callus formation phase of fracture healing. Relative to the current diagnosis of non-union after 9 months post-fracture, the diagnosis was advanced by 8 months, thereby providing a golden time window for early intervention and treatment.
In this study, we established a high-resolution fracture CT image dataset, proposed the VM-TE-UNet for fracture areas segmentation, extracted micro-CT indices, and established an early diagnostic model for fracture non-union, achieving the diagnosis of fracture non-union during soft-callus formation phase.
In this study, experiments were conducted based on our animal model (21), and a total of 12 rats' micro-CT imaging data at various stages of healing (on days 1, 7, 14, 21, 28, and 35 post-fracture) were collected, ultimately accumulating 2,448 high-resolution images of the rat ankle healing area. Subsequently, after preprocessing and annotation of the images, a micro-CT dataset was formed.
VM-TE-UNet, an advanced algorithm for segmenting rat ankle fracture areas, has been successfully developed. It builds on the VM-UNet, incorporating the Triple Attention mechanism into the VSS-TA block to boost feature capture. The VM-TA-UNet with this module achieved a DSC of 0.798, an improvement over the baseline's 0.765. To enhance performance, the EFDM was integrated into the final VM-TE-UNet model, which effectively separates boundary, noise, and texture from feature maps. Ablation studies showed that VM-TE-UNet with EFDM achieved a higher DSC of 0.809 and lowered the HD95 to 13.1, outperforming the model without EFDM and confirming EFDM's role in improving segmentation precision and minimizing false positives.
Currently, the clinical diagnosis of fracture non-union primarily relies on physical examination and radiographic evaluation. The physical examination includes assessing the presence or not of pain and restricted movement in the affected limb and its adjacent joints, which is aimed at confirming the blood supply to the distal part of the affected limb and determining the presence of pseudoarthrosis (49, 50). Localized soft tissue swelling that compresses blood vessels and fractures that damage blood vessels can both lead to poor blood supply. The poor blood may impair callus formation, therefore delay fracture healing.
Among imaging methods, X-ray is the basic method for diagnosing non-union. The scoring systems such as the Radiographic Union Score for Tibial Fractures (RUST) (14) and the Radiographic Union Score for Hip (RUSH) (51) can greatly improve the accuracy of non-union diagnosis. In the RUST system, the focus is on the presence of callus, fracture line, and bridging callus; whereas the RUSH system further scores cortical bridging, cortical disappearance, trabecular consolidation, and trabecular disappearance. The advantages of radiographic union scores are low cost, rapid, ease of implementation, suitability for use across multiple anatomic sites, and high diagnostic accuracy. However, they are semi-quantitative and subjective, which may lead to wide scoring discrepancies in the early diagnosis of non-union. Additionally, a significant drawback of X-rays is the inability to perform 3D imaging, with diagnostic accuracy being greatly influenced by imaging angles. In cases where X-ray film reveals no apparent non-union despite the presence of clinical symptoms, CT is employed for further diagnosis.
Rats reach the soft-callus formation phase 2–3 weeks after fracture, which is equivalent to 1 month after a human fracture. The average CT value can reflect the mineral density of various tissues.
On days 1, 7, 14, 21, and 28 post-fracture (the inflammatory phase and repair phase) the average CT values of the inflammatory hematoma and cartilage tissues in the F+S group are significantly lower than those in the F group, indicating that callus formation and mineralization are hindered in the F+S group. This is consistent with the theory that the soft-callus formation phase is prolonged and mineralization is affected under ischemic conditions in rats. The changes in the BS/TS of the mineralized cartilage tissue during the early hard-callus formation phase can reflect the negative effects of ischemia on cartilage callus mineralization, consistent with the theory. The BS/TV of osteogenic and mineralized cartilage tissue during the repair phase indicates that mineralization is hindered in the F+S group. The differences in BV/TV between the two groups during the hard-callus formation phase also support this conclusion. The differences in these indices correspond to the clinical method of diagnose the fracture non-union by assessing the presence of callus with X-rays.
The BS/TV of inflammatory hematoma and cartilage tissues in the inflammatory phase reflects the high proportion of soft tissue in the lesion area of the F+S group during this period. In the inflammatory phase and repair phase, the BS/TS of the inflammatory hematoma and cartilage tissues also indicates that the proportion of soft tissue in the lesion area is high in the non-union group of rats, corresponding to the clinical assessment of local soft tissue swelling causing vascular compression to diagnose non-union.
Based on the extracted micro-CT indices, a machine learning model for the early diagnosis of fracture non-union was constructed by comparing multiple models at multiple time points. The study revealed that on the 14th day, the MLP achieved an AUC of 0.995, an Accuracy of 0.996, and a Precision of 0.994, achieving the diagnosis of fracture non-union during soft-callus formation phase. The SVM, Decision Tree, and Random Forest models all exhibited optimal performance on the 28th day, achieving the diagnosis during hard-callus formation phase. The superior performance of the MLP in the study may be attributed to the dataset's complex non-linear boundaries, which the MLP, with its multi-layered structure and non-linear activations, can better capture, thereby enhancing classification performance. In contrast, SVM requires an appropriate kernel function to address non-linear issues, and Decision Trees and Random Forests may necessitate more sophisticated feature engineering to improve performance. Furthermore, the dataset comprised 2,448 samples, which may be a suitable quantity for the MLP. Meanwhile, SVM's computational complexity significantly increases with larger sample sizes. The input parameters consisted of 10 features, a moderate feature set for the MLP, which can effectively manage such a quantity. Decision Trees and Random Forests, on the other hand, may require more data to prevent overfitting.
It can be observed that early diagnostic models perform better on day 14 or 28 than on day 35 post-fracture, which contradicts common understanding. We speculate that there are two possible reasons for this phenomenon: on one hand, the limited sample size may lead to statistical errors, and on the other hand, there might be a muscular compensation mechanism that obscures the radiological features, thereby making the 14–28 days a more favorable time for diagnosing non-union fractures.
We believe that our prediction model based on micro-CT bone morphometric parameters can effectively screen out samples of poor fracture rehabilitation caused by blood supply limitations in rats 14 days after fracture, rather than the widely accepted 35 or 40 days. And we plan to further develop and validate the model proposed in this paper in human trails for clinical prediction for fracture non-union, providing a new approach for early intervention and treatment.
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
The animal study was approved by Tianjin University, Tianjin, China. The study was conducted in accordance with the local legislation and institutional requirements.
HY: Conceptualization, Formal analysis, Methodology, Resources, Writing – original draft, Writing – review & editing. QM: Formal analysis, Writing – original draft, Writing – review & editing. ZW: Data curation, Supervision, Writing – review & editing. YG: Supervision, Writing – review & editing. JZ: Data curation, Formal analysis, Writing – original draft. GW: Formal analysis, Validation, Writing – original draft. QW: Formal analysis, Visualization, Writing – original draft. XM: Data curation, Writing – review & editing. XD: Data curation, Writing – review & editing. SW: Conceptualization, Formal analysis, Methodology, Writing – original draft, Writing – review & editing. JS: Conceptualization, Formal analysis, Methodology, Resources, Writing – review & editing.
The author(s) declare financial support was received for the research and/or publication of this article. This work was supported by Tianjin Key Research and Development Program of China (grant number: 23YFZCSN00070) and National Key Clinical Specialty Projects of Tianjin, China (grant number: TJWJ2024ZK003).
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declare that no Gen AI was used in the creation of this manuscript.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
1. Wu A-M, Bisignano C, James SL, Abady GG, Abedi A, Abu-Gharbieh E, et al. Global, Regional, and National Burden of Bone Fractures in 204 Countries and Territories, 1990–2019: a systematic analysis from the Global Burden of Disease Study 2019. Lancet Healthy Longev. (2021) 2:e580–92. doi: 10.1016/S2666-7568(21)00172-0
2. Ho-Shui-Ling A, Bolander J, Rustom LE, Johnson AW, Luyten FP, Picart C. Bone regeneration strategies: engineered scaffolds, bioactive molecules and stem cells current stage and future perspectives. Biomaterials. (2018) 180:143–62. doi: 10.1016/j.biomaterials.2018.07.017
3. Loi F, Córdova LA, Pajarinen J, Lin T-h, Yao Z, Goodman SB. Inflammation, fracture and bone repair. Bone. (2016) 86:119–30. doi: 10.1016/j.bone.2016.02.020
4. Kenkre JS, Bassett J. The bone remodelling cycle. Ann Clin Biochem. (2018) 55:308–27. doi: 10.1177/0004563218759371
5. Maruyama M, Rhee C, Utsunomiya T, Zhang N, Ueno M, Yao Z, et al. Modulation of the inflammatory response and bone healing. Front Endocrinol. (2020) 11:386. doi: 10.3389/fendo.2020.00386
6. ElHawary H, Baradaran A, Abi-Rafeh J, Vorstenbosch J, Xu L, Efanov JI. Bone healing and inflammation: principles of fracture and repair. Semin Plast Surg. (2021) 35:198–203. doi: 10.1055/s-0041-1732334
7. Borrelli Jr J, Prickett WD, Ricci WM. Treatment of nonunions and osseous defects with bone graft and calcium sulfate. Clin Orthop Relat Res. (2003) (411):245–54. doi: 10.1097/01.blo.0000069893.31220.6f
8. Vaishya R, Gupta BM, Kappi MM, Mamdapur GMN, Ali KS, Vaish A. Scientometric analysis of global research on delayed and nonunion of fractures (2004–2023): insights from the web of science core collections. Injury. (2024) 55:111882. doi: 10.1016/j.injury.2024.111882
9. Andersen C, Wragg NM, Shariatzadeh M, Wilson SL. The use of platelet-rich plasma (prp) for the management of non-union fractures. Curr Osteoporos Rep. (2021) 19:1–14. doi: 10.1007/s11914-020-00643-x
10. Walter N, Hierl K, Brochhausen C, Alt V, Rupp M. The epidemiology and direct healthcare costs of aseptic nonunions in germany—a descriptive report. Bone Joint Res. (2022) 11:541–7. doi: 10.1302/2046-3758.118.BJR-2021-0238.R3
11. Wildemann B, Ignatius A, Leung F, Taitsman LA, Smith RM, Pesántez R, et al. Non-union bone fractures. Nat Rev Dis Primers. (2021) 7:57. doi: 10.1038/s41572-021-00289-8
12. Bowers KM, Anderson DE. Delayed union and nonunion: current concepts, prevention, and correction: a review. Bioengineering. (2024) 11:525. doi: 10.3390/bioengineering11060525
13. Makaram NS, Leow JM, Clement ND, Oliver WM, Ng ZH, Simpson C, et al. Risk factors associated with delayed and aseptic nonunion following tibial diaphyseal fractures managed with intramedullary nailing. Bone Jt Open. (2021) 2:227–35. doi: 10.1302/2633-1462.24.BJO-2021-0012.R1
14. Whelan DB, Bhandari M, Stephen D, Kreder H, McKee MD, Zdero R, et al. Development of the radiographic union score for tibial fractures for the assessment of tibial fracture healing after intramedullary fixation. J Trauma. (2010) 68:629–32. doi: 10.1097/TA.0b013e3181a7c16d
15. Leow JM, Clement ND, Tawonsawatruk T, Simpson CJ, Simpson AH. The radiographic union scale in tibial (RUST) fractures: reliability of the outcome measure at an independent centre. Bone Joint Res. (2016) 5:116–21. doi: 10.1302/2046-3758.54.2000628
16. Beck BR, Bergman AG, Miner M, Arendt EA, Klevansky AB, Matheson GO, et al. Tibial stress injury: relationship of radiographic, nuclear medicine bone scanning, MR imaging, and CT severity grades to clinical severity and time to healing. Radiology. (2012) 263:811–8. doi: 10.1148/radiol.12102426
17. Clark DP, Badea CT. Advances in micro-ct imaging of small animals. Physica Medica. (2021) 88:175–92. doi: 10.1016/j.ejmp.2021.07.005
18. Wehrle E, Tourolle né Betts DC, Kuhn G, Floreani ED, Nambiar MH, Schroeder BJ, et al. Longitudinal in vivo micro-ct-based approach allows spatio-temporal characterization of fracture healing patterns and assessment of biomaterials in mouse femur defect models. bioRxiv. (2020) 2020–10. doi: 10.1101/2020.10.02.324061
19. Wehrle E. Tourolle né Betts DC, Kuhn GA, Scheuren AC, Hofmann S, Müller R. evaluation of longitudinal time-lapsed in vivo micro-ct for monitoring fracture healing in mouse femur defect models. Sci Rep. (2019) 9:17445. doi: 10.1038/s41598-019-53822-x
20. Dülgeroglu TC, Metineren H. Evaluation of the effect of platelet-rich fibrin on long bone healing: an experimental rat model. Orthopedics. (2017) 40:e479–84. doi: 10.3928/01477447-20170308-02
21. Sun J, Li Q, Wang S, Wang G, Zhao J, Li H, et al. Establishment and evaluation of a rat model of medial malleolar fracture with vascular injury. Orthop Surg. (2022) 14:2701–10. doi: 10.1111/os.13455
22. Ronneberger O, Fischer P, Brox T. U-Net: convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015. Cham: Springer International Publishing (2015).
23. Litjens G, Kooi T, Bejnordi BE, Setio AAA, Ciompi F, Ghafoorian M, et al. A survey on deep learning in medical image analysis. Med Image Anal. (2017) 42:60–88. doi: 10.1016/j.media.2017.07.005
24. Xu Y, He X, Xu G, Qi G, Yu K, Yin L, et al. A medical image segmentation method based on multi-dimensional statistical features. Front Neurosci. (2022) 16:1009581. doi: 10.3389/fnins.2022.1009581
25. Vaswani A, Shazeer NM, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need. In: NIPS' 17: Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc. (2017). p. 30.
26. Liu Z, Lin Y, Cao Y, Hu H, Wei Y, Zhang Z, et al. Swin transformer: hierarchical vision transformer using shifted windows. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2021). p. 9992–10002.
27. Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, et al. An image is worth 16x16 words: transformers for image recognition at scale. arXiv [Preprint]. arXiv:2010.11929. (2020). Available online at: https://arxiv.org/abs/2402.02491 (accessed October 22, 2020).
28. Ruan J, Xiang S. Vm-Unet: vision mamba Unet for medical image segmentation. arXiv [Preprint]. arXiv:2402.02491. (2024). Available online at: https://arxiv.org/abs/2402.02491 (accessed February 04, 2024).
29. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. (1988) 44:837–45. doi: 10.2307/2531595
30. Rainio O, Teuho J, Klén R. Evaluation metrics and statistical tests for machine learning. Sci Rep. (2024) 14:6086. doi: 10.1038/s41598-024-56706-x
31. Cox D, Vladescu J. Statistics for Applied Behavior Analysis Practitioners and Researchers. New York: Academic Press (2023).
32. Zhou J, Damasceno PF, Chachad R, Cheung JR, Ballatori A, Lotz JC, et al. Automatic vertebral body segmentation based on deep learning of dixon images for bone marrow fat fraction quantification. Front Endocrinol. (2020) 11:612. doi: 10.3389/fendo.2020.00612
33. Misra D, Nalamada T, Arasanipalai AU, Hou Q. Rotate to attend: convolutional triplet attention module. In: in 2021 IEEE Winter Conference on Applications of Computer Vision (WACV). New York, NY: IEEE (2020). p. 3138–47.
34. Liu Y, Tian Y, Zhao Y, Yu H, Xie L, Wang Y, et al. Vmamba: visual state space model. arXiv [Preprint]. arXiv:2401.10166 (2024). Available online at: https://arxiv.org/abs/2401.10166 (accessed December 18, 2024).
35. Karimi D, Salcudean SE. Reducing the hausdorff distance in medical image segmentation with convolutional neural networks. IEEE Trans Med Imaging. (2020) 39:499–513. doi: 10.1109/TMI.2019.2930068
36. Bouxsein ML, Boyd SK, Christiansen BA, Guldberg RE, Jepsen KJ, Müller R. Guidelines for assessment of bone microstructure in rodents using micro–computed tomography. J Bone Min Res. (2010) 25:1468–86. doi: 10.1002/jbmr.141
37. Freeman TA, Patel P, Parvizi J, Antoci V Jr, Shapiro IM. Micro-ct analysis with multiple thresholds allows detection of bone formation and resorption during ultrasound-treated fracture healing. J Orthop Res. (2009) 27:673–9. doi: 10.1002/jor.20771
38. Cortes C, Vapnik V. Support-vector networks. Mach Learn. (1995) 20:273–97. doi: 10.1007/BF00994018
39. Rumelhart DE, Hinton GE, Williams RJ. Learning representations by back-propagating errors. Nature. (1986) 323:533–6. doi: 10.1038/323533a0
40. Hunt EB, Marin J, Stone PJ. Experiments in Induction. New York: New York Academic of Science (1966).
42. Owusu-Adjei M, Ben Hayfron-Acquah J, Frimpong T, Abdul-Salaam G. Imbalanced class distribution and performance evaluation metrics: a systematic review of prediction accuracy for determining model performance in healthcare systems. PLoS Digital Health. (2023) 2:e0000290. doi: 10.1371/journal.pdig.0000290
43. Zhu W, Zeng N, Wang N. Sensitivity, specificity, accuracy, associated confidence interval and Roc analysis with practical SAS ® implementations. In: Health Care and Life Sciences. Baltimore, MD: NorthEast SAS Users Group (2010).
44. Cao H, Wang Y, Chen J, Jiang D, Zhang X, Tian Q, et al. Swin-Unet: unet-like pure transformer for medical image segmentation. In: Computer Vision—ECCV 2022 Workshops. Cham: Springer Nature (2023).
45. Myronenko A. 3d Mri Brain Tumor Segmentation Using Autoencoder Regularization. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries. Cham: Springer International Publishing (2019).
46. Tan M, Le Q. Efficientnet: rethinking model scaling for convolutional neural networks. arXiv [Preprint]. arXiv:1905.11946 (2019). Available online at: https://arxiv.org/abs/1905.11946 (accessed December 18, 2024).
47. Woo S, Park J, Lee J-Y, Kweon IS. Cbam: convolutional block attention module. In: Computer Vision—ECCV 2018. Cham: Springer International Publishing (2018).
48. Fu J, Liu J, Tian H, Li Y, Bao Y, Fang Z, et al. Dual attention network for scene segmentation. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2019). Available online at: https://doi.org/10.48550/arXiv.1809.02983 (accessed June 15–20, 2019).
49. Fisher JS, Kazam JJ, Fufa D, Bartolotta RJ. Radiologic Evaluation of Fracture Healing. Skeletal Radiol. (2019) 48:349–61. doi: 10.1007/s00256-018-3051-0
50. Gelalis ID, Politis AN, Arnaoutoglou CM, Korompilias AV, Pakos EE, Vekris MD, et al. Diagnostic and treatment modalities in nonunions of the femoral shaft: a review. Injury. (2012) 43:980–8. doi: 10.1016/j.injury.2011.06.030
51. Bhandari M, Chiavaras MM, Parasu N, Choudur H, Ayeni O, Chakravertty R, et al. Radiographic union score for hip substantially improves agreement between surgeons and radiologists. BMC Musculoskelet Disord. (2013) 14:70. doi: 10.1186/1471-2474-14-70
52. Isensee F, Jaeger PF, Kohl SAA, Petersen J, Maier-Hein KH. Nnu-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods. (2021) 18:203–11. doi: 10.1038/s41592-020-01008-z
53. Wang X, Girshick R, Gupta A, He K. Non-local neural networks. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, UT: IEEE (2018).
54. Li Y, Hou Q, Zheng Z, Cheng MM, Yang J, Li X. Large selective kernel network for remote sensing object detection. In: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) (2023). Available online at: https://api.semanticscholar.org/CorpusID:257557163 (accessed October 1–6, 2023).
55. Hu J, Shen L, Sun G. Squeeze-and-excitation networks. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York City: IEEE (2018).
56. Wang Q, Wu B, Zhu P, Li P, Zuo W, Hu Q. Eca-net: efficient channel attention for deep convolutional neural networks. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2020).
57. Dai Y, Gieseke F, Oehmcke S, Wu Y, Barnard K. Attentional feature fusion. In: 2021 IEEE Winter Conference on Applications of Computer Vision (WACV). Waikoloa, HI: IEEE (2021).
Keywords: micro-CT, fracture non-union prediction, bone morphometric parameters, deep learning, Mamba network
Citation: Yu H, Mu Q, Wang Z, Guo Y, Zhao J, Wang G, Wang Q, Meng X, Dong X, Wang S and Sun J (2025) A study on early diagnosis for fracture non-union prediction using deep learning and bone morphometric parameters. Front. Med. 12:1547588. doi: 10.3389/fmed.2025.1547588
Received: 18 December 2024; Accepted: 07 March 2025;
Published: 24 March 2025.
Edited by:
Nico Sollmann, Ulm University Medical Center, GermanyReviewed by:
Michael Götz, University of Ulm, GermanyCopyright © 2025 Yu, Mu, Wang, Guo, Zhao, Wang, Wang, Meng, Dong, Wang and Sun. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shuo Wang, d3MxMTFAdGp1LmVkdS5jbg==; Jinglai Sun, c3VuamluZ2xhaUB0anUuZWR1LmNu
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.