Survival prediction for patients with glioblastoma multiforme using a Cox proportional hazards denoising autoencoder network

Yan, Ting; Yan, Zhenpeng; Liu, Lili; Zhang, Xiaoyu; Chen, Guohui; Xu, Feng; Li, Ying; Zhang, Lijuan; Peng, Meilan; Wang, Lu; Li, Dandan; Zhao, Dong

doi:10.3389/fncom.2022.916511

ORIGINAL RESEARCH article

Front. Comput. Neurosci., 10 January 2023

Volume 16 - 2022 | https://doi.org/10.3389/fncom.2022.916511

Ting Yan¹

Zhenpeng Yan¹

Lili Liu¹

Xiaoyu Zhang²

Guohui Chen¹

Feng Xu²

Ying Li²

Lijuan Zhang³

Meilan Peng¹

Lu Wang¹

Dandan Li²*

Dong Zhao⁴*

¹Key Laboratory of Cellular Physiology of the Ministry of Education, Department of Pathology, Shanxi Medical University, Taiyuan, Shanxi, China
²College of Information and Computer, Taiyuan University of Technology, Taiyuan, China
³Shanxi Provincial People's Hospital, Taiyuan, China
⁴Department of Stomatology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China

Objectives: This study aimed to establish and validate a prognostic model based on magnetic resonance imaging and clinical features to predict the survival time of patients with glioblastoma multiforme (GBM).

Methods: In this study, a convolutional denoising autoencoder (DAE) network combined with the loss function of the Cox proportional hazard regression model was used to extract features for survival prediction. In addition, the Kaplan–Meier curve, Schoenfeld residual analysis, the time-dependent receiver operating characteristic curve, the nomogram, and the calibration curve were used to assess the survival prediction ability.

Results: The concordance index (C-index) of the survival prediction model, which combines the DAE and the Cox proportional hazard regression model, reached 0.78 in the training set, 0.75 in the validation set, and 0.74 in the test set. Patients were divided into high- and low-risk groups based on the median prognostic index (PI). Kaplan–Meier curves were used for survival analysis (p < 2e-16 in the training set, p = 3e-04 in the validation set, and p = 0.007 in the test set), which showed that the survival probability of the two groups was significantly different and that the PI of the network played an influential role in the prediction of survival probability. In the residual verification of the PI, the fitting curve of the scatter plot was roughly parallel to the x-axis, and the p-value of the test was 0.11, indicating that the PI and survival time were independent of each other and that the survival prediction ability of the PI was little affected by time. The areas under the curve at the four evaluated time points were 0.843, 0.871, 0.903, and 0.941 in the training set; 0.687, 0.895, 1.000, and 0.967 in the validation set; and 0.757, 0.852, 0.683, and 0.898 in the test set.

Conclusion: The survival prediction model, which combines the DAE and the Cox proportional hazard regression model, can effectively predict the prognosis of patients with GBM.

Introduction

Glioblastoma multiforme (GBM) is a tumor caused by the carcinogenesis of glial cells in the brain and spinal cord through the interaction of genetic and environmental factors (Weller et al., 2015; Xu et al., 2020). The potential risk factors include specific gene polymorphism, ionizing radiation, and virus infection. GBM may also cause increased intracranial pressure, edema, brain herniation, epilepsy, and mental symptoms as complications. Therefore, GBM is difficult to treat and frequently recurs, which results in high mortality (Poff et al., 2019; Peng et al., 2020). Due to individual differences and personalized therapy, the survival time of different patients with GBM shows high heterogeneity (Badve and Gökmen-Polar, 2015; Li et al., 2021). Studies have shown that the factors that affect the survival time of GBM include the morphology and degree of edema around the tumor, the degree of tumor necrosis, the overall morphology and size of the tumor, and whether the tumor has cystic changes.

Magnetic resonance imaging (MRI) is an important diagnostic tool for brain tumors (Takaya et al., 2021). It has been extensively applied in the diagnosis of brain tumors and some neurodegenerative diseases (such as Alzheimer's disease) (Wang et al., 2017; Yan et al., 2018) and has become a recognized imaging mode in the clinical treatment of GBM. Currently, conventional MRI assists in diagnosis, planning operative protocol, and monitoring disease progression and treatment response (Jenkinson et al., 2007). Radiomics is a technique that applies high-throughput computational approaches to extract quantitative features from images, such as MRI and PET. These features can be used to differentiate emerging cerebral lesions or predict the effect of different treatment options on patients with GBM. Radiomics also has the potential to non-invasively assess important prognostic and survival models (Lohmann et al., 2018; Conti et al., 2021; Schniering et al., 2022). By building the survival prediction model, clinicians can provide a reliable reference for each patient, formulate a therapeutic regimen, and evaluate the potency to alleviate patients' symptoms and improve the survival time.

Recently, deep learning-based models can learn more abstract features, reflecting more potential biological information. However, due to the complex patterns and sharpness, it is challenging to build a survival prediction model from medical images, especially using three-dimensional (3D) medical images. As the calculated amount arises, it becomes more challenging to construct an effective survival prediction model, while the advantages of deep learning are more prominent. Nie et al. (2019) used a multi-channel architecture of 3D convolutional neural networks (CNNs) for deep learning to extract high-level predictive features. Those deeply learned features and the limited demographic and tumor-related features are inputted into a support vector machine (SVM) to generate the final prediction result. The obtained multi-channel deep survival prediction framework can predict the survival time of patients with great accuracy. Lao et al. (2017) used the CNN_S model to extract deep features, then the least absolute shrinkage and selection operator (LASSO) Cox regression model was applied to construct a six-deep-feature signature. This study indicated the potential of deep imaging feature-based biomarkers in the preoperative care of patients with GBM. Zhu et al. (2016) used a deep CNN for survival analysis (DeepConvSurv) based on medical image data to prove that compared with the radiomics approach, this method has a significant performance improvement. Mobadersany et al. (2018) proposed the survival convolutional neural networks (SCNN) model, which performed deep learning to combine the functions of an adaptive machine-learning algorithm with a traditional survival model. This integrated model has high accuracy in predicting the survival rate of patients with GBM. Liu et al. (2019) proposed a 3D-deep CNN based on the attention mechanism to use multimodal MRI of GBM to predict survival. 
In this network, the attention module was incorporated into the deep-learning network to enhance the ability to express meaningful features while suppressing insignificant features. It involved a 3D volume of interest (VOI) from four modal MRIs as input, and the output was the risk value of each patient. The addition of the attention mechanism improved the predictive efficiency. Denoising autoencoder (DAE), an unsupervised deep-learning algorithm, is a random version of autoencoder formulation. It is designed to force the hidden layer of the autoencoder to capture more robust features. Wang et al. (2020) trained the autoencoder from a partially damaged (corrupted) input to rebuild a clean (repaired) input based on the basic principle. Thus, a good representation could be gained steadily from a damaged input, and the corresponding clean input could be restored. Tang et al. (2020) performed a multi-task CNN to select characteristics related to tumor genotype from preoperative multimodal MRI data to develop a tumor genotype and survival prediction model. However, whether the DAE methods can improve the extraction efficiency of survival features remains unknown.

In the present study, we built a convolutional DAE network with the loss function of the Cox proportional hazard regression model to extract features for survival prediction. The DAE was applied to extract the features of the tumor area from four MRI modalities, and a risk prediction branch was attached at the smallest (bottleneck) feature matrix. The prognostic index (PI) was calculated from the risk scores of the Cox model and used for constructing a survival prediction model. This network is designed to fully extract the features of the tumor region and build a more accurate prognostic model for patients with GBM.

Materials and methods

Data set and preprocessing

This article uses the MRI scans of 205 patients from the public dataset of the 2019 Multimodal Brain Tumor Segmentation (BraTS) Challenge, each containing four modalities. Clinical data for 100 of these patients were obtained from The Cancer Genome Atlas (TCGA) GBM project. All multimodal MRI scans are provided as NIfTI files (.nii.gz), and the four MRI modalities are T1, T2, T1ce, and Flair. All imaging data sets were manually segmented by one to four evaluators following the same annotation protocol, and their annotations were approved by experienced neuroradiologists (Bakas et al., 2017a). The published dataset can be used freely on the condition that the specified publications are cited. All patients have survival time and survival status, and 100 of them have clinical data [age, sex, race, Karnofsky performance score (KPS), radiotherapy, and chemotherapy].

As shown in Figure 1, each case has four modes, namely T1, T2, T1ce, and Flair, and the specifications are 4 × 240 mm × 240 mm × 150 mm. At the same time, the data set also provides GBM labels manually segmented by multiple experts. The labels are three nested regions, which are the whole tumor region (the green, yellow, and red sets in Figure 1), the area of the tumor core (the yellow and red areas in Figure 1), and the enhancing tumor (ET) (the red area in Figure 1).

Figure 1. MRI and segmented labeling of the four modes of glioma. (A) T1, (B) T2, (C) T1ce, (D) Flair, and (E) Ground Truth. The labels in (E) are three nested regions, which are the whole tumor region (the green, yellow and red sets), the area of the tumor core (the yellow and red areas), and the Enhancing Tumor (ET) (the red area).

The public dataset provided by the challenge has been partially preprocessed: the multimodal MRI data were registered to the same spatial template, the images were resampled to 1 mm × 1 mm × 1 mm, and the brain MRI scans were skull-stripped. The data were acquired with different scanner configurations at different institutions (Bakas et al., 2017b,c). Factors such as the scanner itself and many unknown issues can cause differences in brightness on MRI images, so the intensity value can vary within the same tissue; this variation is called a bias field. The bias field is a smooth, low-frequency, undesirable signal that causes intensity non-uniformity in the MRI image. If images with an uncorrected bias field are used directly for deep learning, the results of survival prediction will be affected. Therefore, before training the survival prediction model of GBM, it is necessary to perform bias field correction to minimize the influence of the bias field on survival prediction. This article uses N4 bias field correction from ANTs (Avants et al., 2009). Then, the data of each modality are normalized separately. All the preprocessed data were saved as compressed H5 files. The training, validation, and test sets were randomly divided according to the case index. In addition, the data can be augmented online (mirroring or rotating about different axes) as they are fed into the network.
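The per-modality normalization and mirroring steps can be illustrated with a minimal sketch. The text does not state which normalization was used, so the z-score over nonzero voxels below is an assumption, as are the function names and the toy intensities; bias field correction itself is not reproduced here.

```python
import math

def zscore_normalize(voxels):
    """Z-score a flat list of intensities using only nonzero voxels.

    Zero voxels are treated as background (an assumption that suits
    skull-stripped images) and are left at zero.
    """
    brain = [v for v in voxels if v != 0]
    if not brain:
        return list(voxels)
    mean = sum(brain) / len(brain)
    var = sum((v - mean) ** 2 for v in brain) / len(brain)
    std = math.sqrt(var) or 1.0  # guard against constant images
    return [(v - mean) / std if v != 0 else 0.0 for v in voxels]

def normalize_case(case):
    """Normalize each modality of one case independently."""
    return {name: zscore_normalize(vox) for name, vox in case.items()}

def mirror(row):
    """Mirror a 1D row of voxels, the simplest form of flip augmentation."""
    return row[::-1]

# Toy case with two modalities and four voxels each (hypothetical values).
case = {"T1": [0, 10, 20, 30], "T2": [0, 100, 300, 200]}
normalized = normalize_case(case)
print(normalized["T1"])
print(mirror([1, 2, 3]))
```

Normalizing each modality on its own keeps the very different intensity ranges of T1, T2, T1ce, and Flair from dominating one another.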

Network structure

In this study, the survival prediction model for GBM uses a convolutional DAE. An autoencoder (AE) is a deep neural network used for semi-supervised or unsupervised learning. It learns deep features from which the input can be reconstructed, so that after training its output replicates the input as closely as possible. The network in this study is based on the DAE, a variant of the AE that adds robustness to noise while reducing dimensionality. The network structure is shown in Figure 2. The autoencoder contains two parts: an encoder and a decoder. The encoder is expressed as φ(x), and the input is the four-modality MRI images of the GBM lesion area. After the input, a batch normalization layer and a dropout layer are added; the dropout layer randomly discards image pixels to enhance the noise resistance and robustness of the network. The encoder includes three convolution and downsampling steps to obtain the hidden feature matrix of the middle layer. The three convolutions generate 16, 32, and 64 feature maps, respectively, each followed by a LeakyReLU activation. The decoder is expressed as ψ[φ(x)] and includes three convolution and upsampling steps. The three convolutions generate 64, 32, and 16 feature maps, respectively, each followed by a LeakyReLU activation. The output layer of the decoder generates four feature maps through one convolution, followed by a sigmoid activation, which restores the features of the middle layer to the MRI images of the four modalities corresponding to the input. After the middle layer, the survival prediction branch was added, and the network was trained with the observed outcome data (survival/follow-up time). The hidden-layer feature matrix was first flattened by a Flatten layer. After flattening, there are a total of 65,536 features. A dense (fully connected) layer reduces them to 1,024 features, followed by dropout, and a second dense layer reduces them to 128 features, again followed by dropout, to prevent overfitting of the branch network. The final output of the branch is the product of the 128-dimensional feature vector and the corresponding neural network weights. It corresponds to the product of the independent variables and the partial regression coefficients in the Cox proportional hazard regression model.

Figure 2. Network structure combining DAE and Cox. Conv, convolutional layer; Ds, downsampling layer; FC, fully connected layer; Up, upsampling layer.
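The sizes quoted for the prediction branch can be checked with a little arithmetic. The sketch below is only illustrative: the article does not give the input patch size or the downsampling factor, so the 64 × 64 × 128 input and the factor of 2 are hypothetical values chosen because they reproduce the stated 65,536 flattened features.

```python
def encoder_output_shape(input_shape, n_down=3, factor=2, channels=64):
    """Spatial shape after n_down downsampling steps, plus channel count."""
    spatial = tuple(s // factor ** n_down for s in input_shape)
    return spatial + (channels,)

def flattened_size(shape):
    """Number of features after flattening a feature matrix."""
    size = 1
    for s in shape:
        size *= s
    return size

def dense_params(n_in, n_out):
    """Weights plus biases of one fully connected layer."""
    return n_in * n_out + n_out

# Hypothetical input size, chosen only because it yields 65,536 features.
shape = encoder_output_shape((64, 64, 128))
n_flat = flattened_size(shape)
print(shape, n_flat)
print(dense_params(n_flat, 1024), dense_params(1024, 128))
```

Under these assumptions the first dense layer holds almost all of the branch parameters, which is one reason dropout after each dense layer is a sensible guard against overfitting.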

Loss function

A fusion of the reconstruction loss function and the Cox proportional hazard regression loss function is used in this network. In the reconstruction loss function of the convolution noise reduction autoencoder, the mean square error loss function is selected. It is defined as follows:

\begin{array}{l} L_{r} = \frac{1}{n} \sum_{i=1}^{n} \| x_{i} - ψ(ϕ(x_{i})) \|^{2} \end{array}

where n represents the size of the sample. Minimizing the reconstruction loss function ensures that the hidden layer in the DAE can effectively learn the potentially valuable features in GBM MRI.

The Cox proportional hazards regression model, with survival outcome and survival time as dependent variables, can simultaneously analyze the impact of multiple factors on survival time. To ensure that the hidden layer can effectively process censored data and extract features that are highly relevant to survival and robust, the Cox proportional hazard regression model is used to define the loss function after the dense layers attached to the middle layer. At the same time, the Cox proportional hazard regression model can better correct for the influence of multiple confounding factors on the results. The Cox proportional hazard regression model is defined as follows:

\begin{array}{l} \log \frac{h_{i}(t)}{h_{0}(t)} = β_{1} z_{i1} + β_{2} z_{i2} + \dots + β_{p} z_{ip}, \end{array}

where h_i(t) represents the hazard function of patient i, that is, the instantaneous risk of death at time t given survival up to t. h₀(t) is the baseline hazard, against which the hazard functions h_i(t) (i = 1,..., n) of all patients at different times are compared. The critical assumption of the Cox prognostic model is that the hazard ratio h_i(t)/h₀(t) is constant over time. The natural logarithm of the ratio is the weighted sum of multiple predictors (here represented by z_i1,..., z_ip), and the weight coefficients are represented by β₁,..., β_p. These coefficients are estimated by maximizing the partial likelihood function of Cox's proportional hazards model:

\begin{array}{l} \log ζ(β) = \sum_{i=1}^{n} δ_{i} \left\{ β^{'} z_{i} - \log \sum_{j \in R(t_{i})} e^{β^{'} z_{j}} \right\} \end{array}

Here, z_i is the vector of predictors for patient i, δ_i is an indicator of patient i's survival status (0 = survival or censorship, 1 = death), and R(t_i) represents the risk set at time t_i, that is, the set of patients still at risk at t_i. This study applies this likelihood as the loss function of the survival prediction model, which is defined as follows:

\begin{array}{l} L_{s} = - \sum_{i=1}^{n} δ_{i} \left\{ W^{'} ϕ(x_{i}) - \log \sum_{j \in R(t_{i})} e^{W^{'} ϕ(x_{j})} \right\} \end{array}

Here, W represents the weight vector of the final output of the survival prediction branch, and ϕ(x_i) represents the feature vector of the prediction branch. The product of the two is the patient's risk prediction, which is the natural logarithm of the hazard ratio.
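The survival loss L_s can be written out directly from its definition. The following is a minimal sketch in plain Python, not the training code of this study: the function name is hypothetical, the risk set R(t_i) is assumed to contain every patient whose time is at least t_i, and tied death times are handled in the simplest way.

```python
import math

def cox_neg_log_partial_likelihood(risk, time, event):
    """L_s = -sum_i delta_i * (risk_i - log sum_{j in R(t_i)} exp(risk_j)).

    risk  : predicted log hazard ratios, W' phi(x_i)
    time  : observed survival or follow-up times
    event : 1 for death, 0 for survival or censoring
    """
    loss = 0.0
    for i in range(len(risk)):
        if event[i] != 1:
            continue  # censored patients only contribute through risk sets
        at_risk = [risk[j] for j in range(len(risk)) if time[j] >= time[i]]
        m = max(at_risk)  # shift for a numerically stable log-sum-exp
        log_sum = m + math.log(sum(math.exp(r - m) for r in at_risk))
        loss -= risk[i] - log_sum
    return loss

# Risks that agree with the survival order give a smaller loss.
good = cox_neg_log_partial_likelihood([2.0, 1.0, 0.0], [1, 2, 3], [1, 1, 1])
bad = cox_neg_log_partial_likelihood([0.0, 1.0, 2.0], [1, 2, 3], [1, 1, 1])
print(good, bad)
```

Because a censored patient still appears in the risk sets of earlier deaths, the loss uses the information that the patient survived at least until the censoring time.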

In this study, the loss function of the prognostic model is composed of the reconstruction loss function and the loss function of the Cox proportional hazard regression model, which is defined as follows:

\begin{array}{l} L_{hybrid} = α L_{r} + β L_{s} = α \left[ \frac{1}{n} \sum_{i=1}^{n} \| x_{i} - ψ(ϕ(x_{i})) \|^{2} \right] \\ + β \left[ - \sum_{i=1}^{n} δ_{i} \left\{ W^{'} ϕ(x_{i}) - \log \sum_{j \in R(t_{i})} e^{W^{'} ϕ(x_{j})} \right\} \right] \end{array}

Here, α and β are the weight coefficients of the reconstruction loss function and the loss function of the Cox proportional hazard regression model, respectively (this β is a scalar weight, distinct from the regression coefficients β₁,..., β_p above). For convenience, the sum of α and β is set to 1.
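As a small illustration of how the two terms combine, the sketch below computes the reconstruction loss for toy vectors and mixes it with a given survival loss. The function names and toy numbers are hypothetical; the default α = 0.7 follows the 7:3 ratio reported in the training section, with β = 1 − α.

```python
def mse_reconstruction(inputs, outputs):
    """L_r: mean over samples of the squared L2 reconstruction error."""
    total = 0.0
    for x, x_hat in zip(inputs, outputs):
        total += sum((a - b) ** 2 for a, b in zip(x, x_hat))
    return total / len(inputs)

def hybrid_loss(l_r, l_s, alpha=0.7):
    """L_hybrid = alpha * L_r + beta * L_s with beta = 1 - alpha."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    beta = 1.0 - alpha
    return alpha * l_r + beta * l_s

# Two toy samples with two features each, and an arbitrary survival loss.
l_r = mse_reconstruction([[1.0, 2.0], [0.0, 0.0]], [[1.0, 1.0], [0.0, 2.0]])
print(l_r)
print(hybrid_loss(l_r, l_s=1.5))
```

Tying β to 1 − α leaves a single hyperparameter to tune, which is presumably the convenience the text refers to.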

Survival prediction model training

A total of 205 cases from BraTS2019 with four-modality MRI and survival data are used in this study. These cases are divided into a training set of 143 cases, a validation set of 31 cases, and a test set of 31 cases. The segmented lesion area is used as the input to the model. The DAE and the Cox proportional hazard regression loss functions are used to extract features from the multimodal image data of GBM. The resulting PI of each individual does not change with time. The data are randomly shuffled before entering the network. The batch size is 26, and each batch is sorted from short to long survival time. The ratio of the reconstruction loss function to the loss function of the Cox proportional hazard model was set to 7:3 after several experiments. The model is optimized with Adam. The initial learning rate is 5e-4, and the number of epochs is 200. The learning rate decays by a factor of 0.5 if the validation loss has not improved for 10 epochs. The maximum number of training iterations is set to 300.
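The batching and learning-rate rules described above can be sketched without any deep-learning framework. This is an illustrative reading of the text, not the authors' code: the names `make_batches` and `PlateauHalver` are hypothetical, and "not improving" is interpreted as the validation loss failing to fall below its best value so far.

```python
import random

def make_batches(cases, batch_size=26, seed=0):
    """Shuffle cases, split into batches, sort each batch by survival time."""
    rng = random.Random(seed)
    shuffled = list(cases)
    rng.shuffle(shuffled)
    batches = []
    for start in range(0, len(shuffled), batch_size):
        batch = shuffled[start:start + batch_size]
        batches.append(sorted(batch, key=lambda c: c["time"]))
    return batches

class PlateauHalver:
    """Halve the learning rate after `patience` epochs without improvement."""

    def __init__(self, lr=5e-4, factor=0.5, patience=10):
        self.lr, self.factor, self.patience = lr, factor, patience
        self.best = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss):
        """Record one epoch's validation loss and return the current rate."""
        if val_loss < self.best:
            self.best, self.bad_epochs = val_loss, 0
        else:
            self.bad_epochs += 1
            if self.bad_epochs >= self.patience:
                self.lr *= self.factor
                self.bad_epochs = 0
        return self.lr

# 143 hypothetical training cases with made-up survival times.
cases = [{"id": i, "time": (37 * i) % 143} for i in range(143)]
batches = make_batches(cases)
print(len(batches), [len(b) for b in batches])
```

Sorting each batch by survival time means that the risk set of a patient is simply everyone at or after that patient's position in the batch, which is one plausible reason for this ordering.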

Survival prediction model evaluation

In this study, we used the concordance index (C-index) and the accuracy to evaluate the model's performance. The C-index was obtained by combining the PI with the individual's survival time and status and calculated by the R “Hmisc” package. The accuracy is calculated based on a three-category classification. We define the survival days < 300 as a high-risk group, from 300 to 450 as a mid-risk group, and more than 450 as a low-risk group (Bakas et al., 2018), and divided the PI evenly into three parts in the training set, which was defined as low-, medium-, and high-risk groups. To assess the performance of the survival prediction model proposed in this study, its accuracy is compared with three methods: post hoc (Hermoza et al., 2021), random forest regressor (RFR) (Agravat and Raval, 2019), and a survival prediction model using neural networks (Wang et al., 2019).
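For readers unfamiliar with the C-index, the following minimal sketch computes Harrell's version for right-censored data. It is an illustration only and not the "Hmisc" routine used in this study; it assumes that a higher score means a higher risk, and the sign convention of a given library may differ.

```python
def concordance_index(risk, time, event):
    """Harrell's C-index for right-censored data.

    A pair (i, j) is comparable when the patient with the shorter time
    had an observed death. It is concordant when that patient also has
    the higher predicted risk; ties in risk count as 0.5.
    """
    concordant, comparable = 0.0, 0
    n = len(risk)
    for i in range(n):
        for j in range(n):
            if time[i] < time[j] and event[i] == 1:
                comparable += 1
                if risk[i] > risk[j]:
                    concordant += 1.0
                elif risk[i] == risk[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable

# Perfectly ordered risks give 1.0; reversed risks give 0.0.
print(concordance_index([3, 2, 1], [1, 2, 3], [1, 1, 1]))
print(concordance_index([1, 2, 3], [1, 2, 3], [1, 1, 1]))
```

A value of 0.5 corresponds to random ordering, so the reported values of 0.74 to 0.78 indicate that the PI ranks most comparable pairs correctly.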

The survival prediction model was constructed by combining the PI (produced by the network), the survival time, and the survival status of each individual. All patients were divided into high- and low-risk groups according to the median of the PI. Then, we applied Kaplan–Meier survival analysis and the log-rank test to evaluate the model's stratification ability. This study used the Schoenfeld residual method to verify whether the PI predicted by the network is time-dependent (Zhang et al., 2018). A Lowess (locally weighted scatterplot smoothing) function was fitted to obtain a smooth curve of the Schoenfeld residuals against time. The correlation between the Schoenfeld residuals and the time rank was tested to investigate the independence between residuals and time. If the p-value of the test is more than 0.05, the linear relationship between residuals and time is not significant, which suggests that the PI does not depend on time. To evaluate the predictive ability of the PI and whether it decreases over time, we constructed time-dependent receiver operating characteristic (ROC) curves. Time points were set every 200 days. Because few patients survived beyond 800 days, four stages were evaluated in this study to reduce the impact of the small number of patients in the later stages.
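The median split, Kaplan–Meier estimate, and log-rank test described above can be sketched as follows. This is a plain-Python illustration with hypothetical function names, not the software used in this study; the p-value uses the chi-square distribution with one degree of freedom.

```python
import math

def median_split(pi):
    """Label each patient 1 (high risk) if PI > median, else 0."""
    s = sorted(pi)
    n = len(s)
    median = s[n // 2] if n % 2 else 0.5 * (s[n // 2 - 1] + s[n // 2])
    return [1 if p > median else 0 for p in pi]

def kaplan_meier(time, event):
    """Return [(t, S(t))] at each distinct death time."""
    curve, surv = [], 1.0
    for t in sorted({ti for ti, e in zip(time, event) if e == 1}):
        at_risk = sum(1 for ti in time if ti >= t)
        deaths = sum(1 for ti, e in zip(time, event) if ti == t and e == 1)
        surv *= 1.0 - deaths / at_risk
        curve.append((t, surv))
    return curve

def logrank_test(time, event, group):
    """Two-group log-rank test; returns (chi-square, p-value) with 1 df."""
    observed = expected = variance = 0.0
    for t in sorted({ti for ti, e in zip(time, event) if e == 1}):
        n = sum(1 for ti in time if ti >= t)
        n1 = sum(1 for ti, g in zip(time, group) if ti >= t and g == 1)
        d = sum(1 for ti, e in zip(time, event) if ti == t and e == 1)
        d1 = sum(1 for ti, e, g in zip(time, event, group)
                 if ti == t and e == 1 and g == 1)
        observed += d1
        expected += d * n1 / n
        if n > 1:
            variance += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    if variance == 0.0:
        raise ValueError("log-rank variance is zero")
    chi2 = (observed - expected) ** 2 / variance
    # Survival function of a chi-square variable with 1 degree of freedom.
    return chi2, math.erfc(math.sqrt(chi2 / 2.0))

# Toy data: the three high-risk patients (group 1) die first.
chi2, p = logrank_test([1, 2, 3, 4, 5, 6], [1, 1, 1, 1, 1, 1],
                       [1, 1, 1, 0, 0, 0])
print(chi2, p)
```

A small p-value here means that the observed number of deaths in the high-risk group differs from what would be expected if both groups shared one survival curve.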

Nomogram construction and evaluation

In addition, combined with clinical risk factors such as age, gender, race, KPS, radiotherapy, and chemotherapy, a nomogram was built on the predictive model for the training set to predict overall survival (OS). The calibration curve was constructed to analyze the diagnostic performance of the nomogram in the training set, validation set, and test set.

Results

Performance of survival prediction models

In this study, the consistency index of the survival prediction model that combines the DAE and the Cox model reached 0.78 in the training set, 0.75 in the validation set, and 0.74 in the test set. As shown in Table 1, the accuracy was used to evaluate the performance of different prognostic models. In the test set, the accuracy of our proposed model reaches 0.548, which is 0.3% lower than the model proposed by Wang et al. (2019). However, in the training set and validation set, our proposed method achieves a higher accuracy value than the other three survival prediction models in Table 1. Therefore, our proposed method has a slight advantage. The model proposed in this study could extract the robust features related to survival prediction from the multimodal MRI of GBM lesions and could process censored data.

Table 1. Comparison of the results of different prognostic models.

Confirm the validity of the PI

The Kaplan–Meier survival analysis with log-rank test results is shown in Figure 3, where p < 2e-16 in the training set, p = 3e-04 in the validation set, and p = 0.007 in the test set. The p-value of the log-rank test was < 0.05 in all three sets, which indicated that the survival probability of the groups divided by PI was significantly different. It also suggested that the PI predicted by the network had an influential role in predicting survival probability.

Figure 3. Kaplan–Meier curve of risk grouping of the training set (A), validation set (B), and test set (C).

PI independence with time

Figure 4 shows the Schoenfeld residual plot of the PI in the training set, validation set, and test set. In the residual check of the PI, the fitting curve of the scatter plot was roughly parallel to the x-axis. In addition, the p-value of the test was 0.11, more than 0.05, proving that the PI and survival time were independent of each other and indicating that the survival prediction ability of the PI was less affected by time.

Figure 4. Schoenfeld residual plot of the prognostic index in the training set (A), validation set (B), and test set (C).
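The idea behind the Schoenfeld residual check in this section can be sketched for a single covariate such as the PI. The code below is a simplified illustration with hypothetical names: it computes unscaled residuals and a plain Pearson correlation with the time rank, whereas statistical packages typically use scaled residuals and a formal test.

```python
import math

def schoenfeld_residuals(z, time, event, beta):
    """Residual at each death: covariate minus its risk-set expectation.

    For patient i who dies at t_i, the expected covariate value is the
    average of z_j over the risk set R(t_i), weighted by exp(beta * z_j).
    Returns a list of (t_i, residual) sorted by time.
    """
    out = []
    for i in range(len(z)):
        if event[i] != 1:
            continue
        risk_set = [j for j in range(len(z)) if time[j] >= time[i]]
        weights = [math.exp(beta * z[j]) for j in risk_set]
        expected = sum(w * z[j] for w, j in zip(weights, risk_set)) / sum(weights)
        out.append((time[i], z[i] - expected))
    return sorted(out)

def pearson(x, y):
    """Plain Pearson correlation, used here on time ranks vs residuals."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Toy data with a hypothetical binary covariate and beta = 0.
res = schoenfeld_residuals([1.0, 0.0, 1.0, 0.0], [1, 2, 3, 4], [1, 1, 1, 1], 0.0)
ranks = list(range(1, len(res) + 1))
print(res)
print(pearson(ranks, [r for _, r in res]))
```

A correlation near zero, like a smoothed residual curve that stays roughly parallel to the x-axis, is consistent with an effect of the covariate that does not change over time.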

Time-dependent accuracy analysis of the model

Figure 5 shows the time-dependent ROC of the training set, validation set, and test set. The area under the curve (AUC) value of each period of the three sets was relatively high. The AUC in the last stage was relatively high because almost all high-risk patients had died. On the three sets, the attenuation of AUC was minimal, which proved that the predictive ability of the PI attenuates to a small extent with time.

Figure 5. Time-dependent ROC of the training set (A), validation set (B), and test set (C).
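A time-dependent AUC at a given horizon can be approximated with a short sketch. The version below is a naive cumulative/dynamic estimate with a hypothetical function name and toy data: patients censored at or before the horizon are simply dropped, which is a simplification compared with the weighted estimators found in statistical packages.

```python
def auc_at_time(risk, time, event, t):
    """Naive cumulative/dynamic AUC at horizon t.

    Cases    : patients with an observed death at or before t.
    Controls : patients still under observation after t.
    Patients censored at or before t are dropped (a simplification).
    """
    cases = [r for r, ti, e in zip(risk, time, event) if ti <= t and e == 1]
    controls = [r for r, ti in zip(risk, time) if ti > t]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for rc in cases:
        for rn in controls:
            if rc > rn:
                wins += 1.0
            elif rc == rn:
                wins += 0.5
    return wins / (len(cases) * len(controls))

# Toy cohort (hypothetical values); the horizons follow the 200-day stages.
risk = [0.9, 0.2, 0.3, 0.1, 0.5]
time = [100, 300, 500, 900, 150]
event = [1, 1, 1, 0, 0]
for horizon in (200, 400, 600):
    print(horizon, auc_at_time(risk, time, event, horizon))
```

At late horizons the control group can shrink to a handful of patients, so a single pair can move the AUC considerably; this is worth keeping in mind when reading values such as 1.000 in a small validation set.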

Development and validation of the nomogram

The radiomics nomogram incorporating the PI and seven clinical factors was constructed based on the multivariate Cox regression (Figure 6). The figure represents the survival prediction model for patients with GBM. The consistency index of the nomogram model reached 0.79 in the training set, 0.74 in the validation set, and 0.75 in the test set. Calibration curves (Figure 7) showed that the predicted OS of the nomogram was closely aligned with the observed OS in the training set, validation set, and test set.

Figure 6. Radiomics nomogram for overall survival of patients with GBM. The shaded part indicates the distribution status and probability density of the patients.

Figure 7. The calibration curves of the radiomics nomogram in the training set (A), validation set (B), and test set (C) of the overall survival of patients with GBM. The calibration curves depict the calibration of the nomogram in terms of the agreement between the predicted risk and the observed survival.

Discussion

The structure of brain GBM is complex, and tumor tissue mainly includes edema, necrosis, and tumor core. In addition, the traditional extraction of imaging features is limited, and many deeper features of brain GBM cannot be effectively extracted. The autoencoder is an unsupervised learning technique that applies neural networks to representation learning, which helps to overcome the heterogeneity of individual tumors and contributes to the noise reduction of images. These are its advantages over traditional CNN networks.

Prognostic models are designed to assess the impact of specific prognostic factors on events of interest over time and to predict the risk of future events for new patients. The cure rate of patients with GBM is very low, and survival is poor. Accurate prediction of survival probability is essential for the treatment of patients. Therefore, it is urgent to develop a prognostic model to assess prognostic variables (Cheng et al., 2013; Royston and Altman, 2013; Yeh et al., 2016). The most popular prognostic model is the Cox proportional hazards regression model proposed by Cox (1972), a semi-parametric regression model. Cox regression provides the direction of DAE learning and can capture more features related to survival. It has unique advantages in constructing a survival prediction model: it can analyze the impact of multiple factors on survival and process censored data. It also yields a risk level, which distinguishes it from the traditional loss functions used for survival prediction. However, the assumption of a linear log-risk function may be too simple for clinical survival data. As a result, many researchers have proposed non-linear risk models to fit survival data as closely as possible, such as Cox regression based on neural networks (Cox, 1972) and multi-task survival analysis learning (Li et al., 2016).

However, it is challenging to build a prediction model with all the features, and it is prone to overfitting, which would lead to inaccurate prediction results. Hence, it is necessary to screen out the crucial features and eliminate the insignificant ones, which led to the later development of Lasso-Cox (Zhang and Lu, 2007), Ridge-Cox (Vinzamuri and Reddy, 2013), and EN-Cox (Simon et al., 2011) models. In this study, we proposed a hypothesis of a survival prediction model combining the DAE and Cox proportional hazards regression model. The empirical results show that the model can better predict the survival of patients with GBM. Through this model, we could obtain not only the compelling image features but also the features that can represent survival.

To evaluate the predictive power of the PI derived from network prediction, we applied time-dependent ROC curve analysis, which was constructed at different survival time points. The result showed that the prediction ability of the PI was practical with the increase in time. On the training set, validation set, and test set, the attenuation of AUC is minimal, which proves that the predictive ability of the PI attenuates to a small extent with time. In the Kaplan–Meier survival and log-rank test, the p-value was < 0.05 in the training set, validation set, and test set, proving that higher and lower PI values were significantly different. It also confirmed that the PI of the network played an influential role in predicting survival probability (Cui et al., 2020). In the residual verification of the PI, the fitting curve of the scatter plot was roughly parallel to the x-axis. The p-value of the test was 0.11, more than 0.05, proving that the PI and survival time were independent of each other and indicating that the survival prediction ability of the PI was less affected by time (Kwon et al., 2015). Previous studies (Sun et al., 2015; Chen et al., 2021) have confirmed that clinical factors, such as age, gender, and KPS, are important variables related to the prognosis of GBM. We constructed a nomogram based on the PI and clinical factors. The calibration plots for the probabilities of OS showed good agreement between the predicted OS by nomogram and the actual OS of patients with GBM. The results suggested the accuracy of the nomogram and further indicated that the nomogram can accurately predict the prognosis of patients with GBM (Liu et al., 2020). Compared with PI, the predictive performance of the nomogram is improved.

Despite the promising results, this study still has several limitations. This study is retrospective, and only The Cancer Imaging Archive (TCIA) database was used. In the future, a multicenter study is needed to thoroughly assess the generalization ability of the survival prediction model (Lao et al., 2017). In addition, TNM stage classification tasks could be added to the survival prediction model in the future, which might contribute to survival prediction and further improve its concordance index.

This study proposes a GBM survival prediction model based on DAE and Cox proportional hazard regression loss function. The survival prediction branch is added, and the Cox proportional hazard regression model is used as the auxiliary loss function based on DAE. The results indicate that the model can predict the prognosis of patients with GBM well.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material; further inquiries can be directed to the corresponding author/s.

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the patients/participants or patients/participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author contributions

TY conceived the study, designed the experiments, analyzed the data, and wrote the manuscript. DL and DZ edited the manuscript. ZY, LL, XZ, GC, FX, YL, and LZ supervised data analysis. MP and LW performed the statistical analyses. All authors accessed the study data and reviewed and approved the final manuscript version.

Funding

This work was supported by the Fundamental Research Program of Shanxi Province (20210302123292) and the Central Guidance on Local Science and Technology Development Fund of Shanxi Province (YDZJSX2021A018).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

AUC, area under the curve; C-index, concordance index; DAE, denoising autoencoder; MRI, magnetic resonance imaging; DWT, denoise wavelet transform; CT, computed tomography; DeepConvSurv, deep convolutional neural network for survival analysis; VOI, volume of interest; BraTS, brain tumor segmentation; TCIA, The Cancer Imaging Archive; ET, enhancing tumor; AE, autoencoder; PI, prognostic index; Lowess, locally weighted scatterplot smoothing; ROC, receiver operating characteristic; LASSO, least absolute shrinkage and selection operator; CNNs, convolutional neural networks; SVM, support vector machine; TCGA, The Cancer Genome Atlas; OS, overall survival; GBM, glioblastoma multiforme; KPS, Karnofsky performance score.

References

Agravat, R. R., and Raval, M. S. (2019). “Brain tumor segmentation and survival prediction,” in International MICCAI Brainlesion Workshop. Springer. doi: 10.1007/978-3-030-46640-4_32

Avants, B. B., Tustison, N., and Song, G. (2009). Advanced normalization tools (ANTS). Insight J. 2, 1–35. doi: 10.54294/uvnhin

Badve, S., and Gökmen-Polar, Y. (2015). Tumor heterogeneity in breast cancer. Adv. Anat. Pathol. 22, 294–302. doi: 10.1097/PAP.0000000000000074

Bakas, S., Akbari, H., Sotiras, A., Bilello, M., and Davatzikos, C. (2017b). Segmentation labels and radiomic features for the pre-operative scans of the TCGA-GBM collection. Cancer Imag. Arch. doi: 10.7937/k9/tcia.2017.klxwjj1q

Bakas, S., Akbari, H., Sotiras, A., Bilello, M., Rozycki, M., Kirby, J., et al. (2017c). Segmentation labels and radiomic features for the pre-operative scans of the TCGA-LGG collection. Cancer Imag. Arch. doi: 10.7937/k9/tcia.2017.gjq7r0ef

Bakas, S., Akbari, H., Sotiras, A., Bilello, M., Rozycki, M., Kirby, J. S., et al. (2017a). Advancing the cancer genome atlas glioma MRI collections with expert segmentation labels and radiomic features. Sci. Data 4:170117. doi: 10.1038/sdata.2017.117

Bakas, S., Reyes, M., Jakab, A., Bauer, S., Rempfler, M., Crimi, A., et al. (2018). Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the BRATS challenge. arXiv preprint arXiv:1811.02629.

Chen, H., Li, C., Zheng, L., Lu, W., Li, Y., Wei, Q., et al. (2021). A machine learning-based survival prediction model of high grade glioma by integration of clinical and dose-volume histogram parameters. Cancer Med. 10, 2774–2786. doi: 10.1002/cam4.3838

Cheng, W. Y., Ou Yang, T. H., and Anastassiou, D. (2013). Development of a prognostic model for breast cancer survival in an open challenge environment. Sci. Transl. Med. 5:181ra50. doi: 10.1126/scitranslmed.3005974

Conti, A., Duggento, A., Indovina, I., Guerrisi, M., and Toschi, N. (2021). Radiomics in breast cancer classification and prediction. Semin. Cancer Biol. 72, 238–250. doi: 10.1016/j.semcancer.2020.04.002

Cox, D. R. (1972). Regression models and life-tables. J. R. Statist. Soc. 34, 187–202. doi: 10.1111/j.2517-6161.1972.tb00899.x

Cui, L., Li, H., Hui, W., Chen, S., Yang, L., Kang, Y., et al. (2020). A deep learning-based framework for lung cancer survival analysis with biomarker interpretation. BMC Bioinform. 21:112. doi: 10.1186/s12859-020-3431-z

Hermoza, R., Maicas, G., Nascimento, J. C., and Carneiro, G. (2021). “Post-hoc overall survival time prediction from brain MRI,” in 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI). IEEE. doi: 10.1109/ISBI48211.2021.9433877

Jenkinson, M. D., Du Plessis, D. G., Walker, C., and Smith, T. S. (2007). Advanced MRI in the management of adult gliomas. Br. J. Neurosurg. 21, 550–561. doi: 10.1080/02688690701642020

Kwon, J. O., Jin, S. H., Min, J. S., Kim, M. S., Lee, H. W., Park, S., et al. (2015). Time-dependent effects of prognostic factors in advanced gastric cancer patients. J. Gastric Cancer 15, 238–245. doi: 10.5230/jgc.2015.15.4.238

Lao, J., Chen, Y., Li, Z. C., Li, Q., Zhang, J., Liu, J., et al. (2017). A deep learning-based radiomics model for prediction of survival in glioblastoma multiforme. Sci. Rep. 7:10353. doi: 10.1038/s41598-017-10649-8

Li, Y., Wang, J., Ye, J., and Reddy, C. K. (2016). “A multi-task learning formulation for survival analysis,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. doi: 10.1145/2939672.2939857

Li, Z., Cai, S., Li, H., Gu, J., Tian, Y., Cao, J., et al. (2021). Developing a lncRNA signature to predict the radiotherapy response of lower-grade gliomas using co-expression and ceRNA network analysis. Front. Oncol. 11:622880. doi: 10.3389/fonc.2021.622880

Liu, Q., Li, J., Liu, F., Yang, W., Ding, J., Chen, W., et al. (2020). A radiomics nomogram for the prediction of overall survival in patients with hepatocellular carcinoma after hepatectomy. Cancer Imaging 20:82. doi: 10.1186/s40644-020-00360-9

Liu, Z., Sun, Q., Bai, H., Liang, C., Chen, Y., and Li, Z. C. (2019). “3D deep attention network for survival prediction from magnetic resonance images in glioblastoma,” in 2019 IEEE International Conference on Image Processing (ICIP). IEEE. doi: 10.1109/ICIP.2019.8803077

Lohmann, P., Kocher, M., Steger, J., and Galldiks, N. (2018). Radiomics derived from amino-acid PET and conventional MRI in patients with high-grade gliomas. Q. J. Nucl. Med. Mol. Imaging 62, 272–280. doi: 10.23736/S1824-4785.18.03095-9

Mobadersany, P., Yousefi, S., Amgad, M., Gutman, D. A., Barnholtz-Sloan, J. S., Velázquez Vega, J. E., et al. (2018). Predicting cancer outcomes from histology and genomics using convolutional networks. Proc. Natl. Acad. Sci. U.S.A. 115, E2970–E2979. doi: 10.1073/pnas.1717139115

Nie, D., Lu, J., Zhang, H., Adeli, E., Wang, J., Yu, Z., et al. (2019). Multi-channel 3D deep feature learning for survival time prediction of brain tumor patients using multi-modal neuroimages. Sci. Rep. 9:1103. doi: 10.1038/s41598-018-37387-9

Peng, Y., Chen, F., Li, S., Liu, X., Wang, C., Yu, C., et al. (2020). Tumor-associated macrophages as treatment targets in glioma. Brain Sci. Adv. 6, 306–323. doi: 10.26599/BSA.2020.9050015

Poff, A., Koutnik, A. P., Egan, K. M., Sahebjam, S., D'Agostino, D., Kumar, N. B., et al. (2019). Targeting the Warburg effect for cancer treatment: Ketogenic diets for management of glioma. Semin. Cancer Biol. 56, 135–148. doi: 10.1016/j.semcancer.2017.12.011

Royston, P., and Altman, D. G. (2013). External validation of a Cox prognostic model: principles and methods. BMC Med. Res. Methodol. 13:33. doi: 10.1186/1471-2288-13-33

Schniering, J., Maciukiewicz, M., Gabrys, H. S., Brunner, M., Blüthgen, C., Meier, C., et al. (2022). Computed tomography-based radiomics decodes prognostic and molecular differences in interstitial lung disease related to systemic sclerosis. Eur. Respir. J. 59:e04503. doi: 10.1183/13993003.04503-2020

Simon, N., Friedman, J., Hastie, T., and Tibshirani, R. (2011). Regularization paths for Cox's proportional hazards model via coordinate descent. J. Stat. Softw. 39, 1–13. doi: 10.18637/jss.v039.i05

Sun, T., Plutynski, A., Ward, S., and Rubin, J. B. (2015). An integrative view on sex differences in brain tumors. Cell. Mol. Life Sci. 72, 3323–3342. doi: 10.1007/s00018-015-1930-2

Takaya, Y., Chen, H., Hoya, T., Jiang, C., and Oyama, K. (2021). Evaluation of the malignant potential of gliomas using diffusion-weighted and gadolinium-enhanced magnetic resonance imaging. Brain Sci. Adv. 7, 248–256. doi: 10.26599/BSA.2021.9050023

Tang, Z., Xu, Y., Jin, L., Aibaidula, A., Lu, J., Jiao, Z., et al. (2020). Deep learning of imaging phenotype and genotype for predicting overall survival time of glioblastoma patients. IEEE Trans. Med. Imaging 39, 2100–2109. doi: 10.1109/TMI.2020.2964310

Vinzamuri, B., and Reddy, C. K. (2013). “Cox regression with correlation based regularization for electronic health records,” in 2013 IEEE 13th International Conference on Data Mining. IEEE. doi: 10.1109/ICDM.2013.89

Wang, B., Niu, Y., Miao, L., Cao, R., Yan, P., Guo, H., et al. (2017). Decreased complexity in Alzheimer's disease: resting-state fMRI evidence of brain entropy mapping. Front. Aging Neurosci. 9:378. doi: 10.3389/fnagi.2017.00378

Wang, F., Jiang, R., Zheng, L., Meng, C., and Biswal, B. (2019). “3D U-net based brain tumor segmentation and survival days prediction,” in International MICCAI Brainlesion Workshop. Springer. doi: 10.1007/978-3-030-46640-4_13

Wang, J., Xie, X., Shi, J., He, W., Chen, Q., Chen, L., et al. (2020). Denoising autoencoder, a deep learning algorithm, aids the identification of a novel molecular signature of lung adenocarcinoma. Genomics Proteomics Bioinform. 18, 468–480. doi: 10.1016/j.gpb.2019.02.003

Weller, M., Wick, W., Aldape, K., Brada, M., Berger, M., Pfister, S. M., et al. (2015). Glioma. Nat. Rev. Dis. Primers 1:15017. doi: 10.1038/nrdp.2015.17

Xu, S., Tang, L., Li, X., Fan, F., and Liu, Z. (2020). Immunotherapy for glioma: Current management and future application. Cancer Lett. 476, 1–12. doi: 10.1016/j.canlet.2020.02.002

Yan, T., Wang, W., Yang, L., Chen, K., Chen, R., Han, Y., et al. (2018). Rich club disturbances of the human connectome from subjective cognitive decline to Alzheimer's disease. Theranostics 8, 3237–3255. doi: 10.7150/thno.23772

Yeh, R. W., Secemsky, E. A., Kereiakes, D. J., Normand, S. L., Gershlick, A. H., Cohen, D. J., et al. (2016). Development and validation of a prediction rule for benefit and harm of dual antiplatelet therapy beyond 1 year after percutaneous coronary intervention. JAMA 315, 1735–1749. doi: 10.1001/jama.2016.3775

Zhang, H. H., and Lu, W. (2007). Adaptive Lasso for Cox's proportional hazards model. Biometrika 94, 691–703. doi: 10.1093/biomet/asm037

Zhang, Z., Reinikainen, J., Adeleke, K. A., Pieterse, M. E., and Groothuis-Oudshoorn, C. G. M. (2018). Time-varying covariates and coefficients in Cox regression models. Ann. Transl. Med. 6:121. doi: 10.21037/atm.2018.02.12

Zhu, X., Yao, J., and Huang, J. (2016). “Deep convolutional neural network for survival analysis with pathological images,” in 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE. doi: 10.1109/BIBM.2016.7822579

Keywords: glioblastoma multiforme, deep learning, survival prediction, denoising autoencoder, time-dependent ROC curve, Cox proportional hazard regression

Citation: Yan T, Yan Z, Liu L, Zhang X, Chen G, Xu F, Li Y, Zhang L, Peng M, Wang L, Li D and Zhao D (2023) Survival prediction for patients with glioblastoma multiforme using a Cox proportional hazards denoising autoencoder network. Front. Comput. Neurosci. 16:916511. doi: 10.3389/fncom.2022.916511

Received: 09 April 2022; Accepted: 13 December 2022;
Published: 10 January 2023.

Edited by:

Carlos Alberto Silva, University of Minho, Portugal

Reviewed by:

Mehul S. Raval, Ahmedabad University, India
Zhi-Cheng Li, Shenzhen Institutes of Advanced Technology (CAS), China

Copyright © 2023 Yan, Yan, Liu, Zhang, Chen, Xu, Li, Zhang, Peng, Wang, Li and Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dandan Li; Dong Zhao
