Multi-task learning-based feature selection and classification models for glioblastoma and solitary brain metastases

Huang, Ya; Huang, Shan; Liu, Zhiyong

doi:10.3389/fonc.2022.1000471

ORIGINAL RESEARCH article

Front. Oncol., 21 September 2022

Sec. Cancer Imaging and Image-directed Interventions

Volume 12 - 2022 | https://doi.org/10.3389/fonc.2022.1000471

This article is part of the Research TopicApplication of Radiomics in Understanding Tumor Biological Behaviors and Treatment ResponseView all 17 articles

Multi-task learning-based feature selection and classification models for glioblastoma and solitary brain metastases

Ya Huang¹

Shan Huang¹

Zhiyong Liu^2,3*

¹Key Laboratory of Computing and Stochastic Mathematics, School of Mathematics and Statistics, Hunan Normal University, Changsha, China
²Department of Intensive Care, Xiangya Hospital, Central South University, Changsha, China
³National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China

Purpose: To investigate the diagnostic performance of feature selection via a multi-task learning model in distinguishing primary glioblastoma from solitary brain metastases.

Method: The study involved 187 patients diagnosed at Xiangya Hospital, Yunnan Provincial Cancer Hospital, and Southern Cancer Hospital between January 2010 and December 2018. Radiomic features were extracted from conventional magnetic resonance imaging including T1-weighted, T2-weighted, and contrast-enhanced T1-weighted sequences. We proposed a new multi-task learning model using these three sequences as three tasks. Multi-series fusion was performed to complement the information from different dimensions in order to enhance model robustness. Logical loss was used in the model as the data-fitting item, and the feature weights were expressed in the logical loss space as the sum of shared weights and private weights to select the common features of each task and the characteristics having an essential impact on a single task. A diagnostic model was constructed as a feature selection method as well as a classification method. We calculated accuracy, recall, precision, and area under the curve (AUC) and compared the performance of our new multi-task model with traditional diagnostic model performance.

Results: A diagnostic model combining the support vector machine algorithm as a classification algorithm and our model as a feature selection method had an average AUC of 0.993 in the training set, with AUC, accuracy, precision, and recall rates respectively of 0.992, 0.920, 0.969, and 0.871 in the test set. The diagnostic model built on our multi-task model alone, in the training set, had an average AUC of 0.987, and in the test set, the AUC, accuracy, precision, and recall rates were 0.984, 0.895, 0.954, and 0.838.

Conclusion: It is feasible to implement the multi-task learning model developed in our study using logistic regression to differentiate between glioblastoma and solitary brain metastases.

1 Introduction

Brain tumors, also known as intracranial tumors, are a growth or mass of abnormal cells or tissue in the brain (1). Brain tumors are generally subdivided into two main types: primary or secondary (metastatic) (2). Glioblastoma (GBM) is a typical malignant primary brain tumor that affects an average of 3 out of 100,000 people (3). Solitary brain metastases (SBM) are secondary malignant brain tumors, which are more common than GBM. The incidence rate of SBM is approximately 7 to 14 per 100,000 people (3, 4). As the standard treatment course for GBM is aggressive trimodality therapy compared to surgery or radiosurgery for SBM, it is of great clinical importance to accurately and rapidly distinguish between these two types of brain malignancies as rapidly as possible. As the main diagnostic tool for brain tumors (5), magnetic resonance imaging (MRI) methods create clear and detailed three-dimensional images of brain and tumor anatomy. However, for patients with SBM and GBM, their MR images both show ring enhancement, intra-tumor necrosis, and per femoral T2 high signal (6, 7), which poses a challenge for the accurate differentiation between GBM and SBM.

Radiomics (8–12), the application of advanced image feature analysis algorithms, can be used to capture intra-tumoral heterogeneity in a non-invasive manner. Numerous studies have applied radiomics to tumor classification. Austin et al. used a filtered histogram texture analysis-based imaging historic approach to identify high-grade and low-grade gliomas (13), where the AUC on the test set reached 0.90. However, the data of their study were highly unbalance in the number of high-grade and low-grade gliomas. Among the three feature selection methods, packing, filtering, and embedding, the embedding method can obtain a higher computational efficiency and classification performance than the filtering (14, 15) and packing (16) methods (17). Therefore, the embedding method has received increasing attention recently. Qian et al. used the feature selection method of least absolute shrinkage and selection operator (Lasso) combined with a support vector machine(SVM) classifier, to obtain an AUC of 0.90 in their test set (18). Cho et al. used a machine learning approach to classify gliomas based on radiomics (19), which ultimately selected five significant features with an average AUC of 0.903 on their test set. Artzi et al. found that the SVM approach had the best results for classifying between GBM and SBM subtypes (20), with an AUC of 0.96.

Liu et al. combined handcrafted radiomics and deep learning-based radiomics and used a random forest algorithm for feature selection and classification (21), the AUC reached 0.97 for single contrast-enhanced T1-weighted(T1C) MRI sequence. Because tumor sites behave differently under different sequences of MRI, patients usually have multiple series of imaging data acquired to accurately determine the tumor location, size, and additional information during the treatment. Different sequences provide different information, and multiple sequence fusion can complement information from different dimensions, thus enhancing the robustness of a model. As such, we introduced a multi-task learning model (22, 23) to fuse T1, T1C, and T2 sequence information to develop a robust prediction model to aid in clinical diagnosis.

Nowadays, most studies on multi-task-based feature selection focus on different canonical terms. The features are selected by constraints of different paradigms, such as the commonly used ℓ_1,1 (24), ℓ_2,1 (25) paradigms, etc. (17, 26). In the present report, we have improved the data-fitting term in the multi-task learning model so that the model can be used not only for feature selection but also for classification functions to ultimately achieve a higher accuracy than traditional diagnostic models.

The present work proposes a new feature selection and classification model based on the multi-task learning model, which treats the 1106 features extracted from each sample in each sequence (T1, T2, T1C) as a task. It uses the logical loss function as a data-fitting term to ensure the feasibility of classifying GBM and SBM. Taking ℓ_1,1, ℓ₁ as regular terms to ensure the sparseness of feature selection. At the same time, private weights are introduced based on a common weight to make full use of the relevant similarities and differences between each task. The result is a 3-task feature selection classification model. In this study, we mainly utilized the alternating iterative method and the fast-iterative shrinkage threshold algorithm based on the backtracking method to solve equations. The experimental results demonstrate that our model, whether as a feature selection model combined with SVM classification methods to form a diagnostic model or as a standalone diagnostic model, successfully integrated multiple sequence information to provide a robust predictive model for clinical diagnosis while also the diagnostic model consisting of one model improves the efficiency of our tumor classification.

2 Materials and methods

2.1 Data acquisition

The data used in this study were obtained from 120 patients with SBM and 67 patients with GBM admitted to the Xiangya Hospital, Yunnan Cancer Hospital and Southern Cancer Hospital between January 1, 2010 and December 31, 2018. All patients were histologically diagnosed according to the tumor grading guidelines published by the World Health Organization in 2021. This retrospective analysis of data from MR images was approved by the institutional review board, and the requirement of informed consent was waived.

MRI on all patients was performed by the hospital radiology department using 3.0-T systems. Each patient had T1, T2, and T1C MR image series performed. High-quality MR images were obtained using the following protocols:

● Axial T1: layer thickness =5 mm, layer spacing =1.5 mm, matrix =320×256, and field of view (FOV)=24×24 cm.

● Axial T2: layer thickness =5 mm, layer spacing =1.5 mm, matrix =384×384, and FOV =24×24 cm.

● Axial T1C: layer thickness =5 mm, layer spacing =1.5 mm, matrix =320×256, and FOV =24×24 cm.

MR image data of the patient for the present study can be found in the reference (21).

2.2 Data preprocessing

The overall workflow of the current study is shown in Figure 1 with a description of each involved step.

FIGURE 1

Figure 1 A generic framework for creating classification models using radiological features. Steps include ROI(region of interest) delineation, feature extraction, feature selection, and modeling.

2.2.1 Delineation of the region of interest (ROI)

We preprocessed each image by noise reduction, offset field correction, and strict intra-target alignment using the public software package FSL v6.0.4. All images were evaluated independently by two neuroradiologists who have between 5-10 years of experience. ROIs of the entire tumor on T1, T2, and T1C images were created manually using the ITK-SNAP software layer by layer around the enhanced tumor layer (27); areas of macroscopic necrosis, cystic degeneration, and edema were avoided. A third senior neuroradiologist with 15 years of experience reexamined the images and made a final diagnosis when there was a conflict between the two original neuroradiologists (21).

2.2.2 Data normalization

Differences in instrumentation and imaging parameters, tumor sites of patients, and other factors can lead to significant differences in MR images. These differences will result in significant issues for imaging histology analysis. Therefore, we performed MIL(modality mismatch, intensity distribution variance, and layer-spacing differences) normalization for all MR images (28). First, we used B-sample interpolation for body mode matching for all patient MR images to obtain a total of 120 SBM samples and 67 GBM samples, Second, we set the interlayer gap of all MR images to 1 mm. Third, we applied MIL data normalization to make the intensity of MR image distribution consistent.

2.2.3 Feature extraction

We used the platform PyRadiomics (http://www.radiomics.io/pyradiomics.html) to perform feature extraction on all MIL normalized data. 1106 features were extracted from each MRI series. With each patient undergoing three different sequences of MRI, we extracted a total of 3318 features per patient. The extracted features are shown in Table 1.

TABLE 1

Table 1 Extract the specific content of radiomics features.

To account for the large difference in the number of samples between the two tumor groups, we used the Synthetic Minority Over-sampling Technique (SMOTE) (29) to oversample the GBM group in order to generate the same number of samples as the SBM group. In total, we obtained 120 SBM samples and 120 GBM samples. Finally, we randomly selected 24 SBM samples and 24 GBM samples as the test set, and the remaining 96 SBM samples and 96 GBM samples as the training set.

2.3 Feature selection and classification

2.3.1 Proposed model

We introduce the logical loss function into the multi-task learning model to obtain the data fitting term as

\begin{array}{l} L (X, y, W) = \sum_{m = 1}^{M} \sum_{j = 1}^{N} (- y_{i}^{m} X_{i}^{m} w^{m} + \ln (1 + \exp (X_{i}^{m} w^{m})) & (1) \end{array}

where the number of samples for each task is N and $y_{j}^{m} \in {+ 1, - 1}$ denotes the class of samples. When $y = - 1$ , the patient has GBM. $y = + 1$ means the patient has SBM.

To obtain the characteristics unique to a single task, we introduced a personal value of b based on the expected weight of s and finally derived our new model

\begin{array}{l} \min_{s, B} \sum_{m = 1}^{M} \sum_{i = 1}^{N} \frac{1}{N} (- y_{i} X_{i}^{m} (s + b^{m}) + \ln (1 + \exp (X_{i}^{m} (s + b^{m})))) + λ_{s} {║ s ║}_{1} + λ_{b} {║ B ║}_{1, 1} & (2) \end{array}

Where $y_{i}$ is the label of the $i^{t h}$ sample. M is the number of tasks where M=3 as described in the text. We assume that the number of samples for all tasks is N. $X_{i}^{m} \in ℝ^{1 \times (d + 1)}$ is the $i^{t h}$ sample of the $m^{t h}$ task, i.e., the $i^{t h}$ row of the matrix $X^{m}$ . Depending on the nature of logistic regression, we populate the last column of $X^{m}$ with an N-dimensional 1 vector. $B = [b^{1}, b^{2}, \dots, b^{M}] \in ℝ^{(d + 1) \times M}$ , then $b^{m}$ is the $m^{t h}$ column of matrix B, ${| | s | |}_{1} = \sum_{i = 1}^{d + 1} | s_{i} |$ and ${| | B | |}_{1, 1} = \sum_{i}^{(d + 1)} \sum_{m}^{M} | b_{i}^{m} |$ are the regularization terms where s and bmare are the shared and private weights, respectively. $λ_{s}, λ_{b}$ are the two regularization parameters, and let $λ_{s} < M λ_{b}$ .

2.3.2 Model solving

We rewrote (2) in the following form

\begin{array}{l} \min_{s, B} \sum_{m = 1}^{M} \sum_{i = 1}^{N} \frac{1}{N} (- y_{i} X_{i}^{m} (s + b^{m}) + \ln (1 + \exp (X_{i}^{m} (s + b^{m})))) + λ_{s} \sum_{i}^{p + 1} | s_{i} | + λ_{b} \sum_{i}^{d + 1} \sum_{m}^{M} | b_{i}^{m} | & (3) \end{array}

We considered the matrix B as a constant matrix, with a minimization of the variable s

\begin{array}{l} \min_{s} \sum_{m = 1}^{M} \sum_{i = 1}^{N} \frac{1}{N} (- y_{i} X_{i}^{m} (s + b^{m}) + \ln (1 + \exp (X_{i}^{m} (s + b^{m})))) + λ_{s} \sum_{i}^{p + 1} | s_{i} | & (4) \end{array}

Similarly, fixing the vector s and considering it as a constant vector, then minimizing B is equivalent to minimizing the following problem for any m

\begin{array}{l} \min_{b^{m}} \sum_{i = 1}^{N} \frac{1}{N} (- y_{i} X_{i}^{m} (s + b^{m}) + \ln (1 + \exp (X_{i}^{m} (s + b^{m})))) + λ_{b} \sum_{i}^{d + 1} | b_{i}^{m} | & (5) \end{array}

It is further shown that the computation of $b^{m}$ is only relevant for a single task.

To select the final feature, equation (4) and (5) must be solved. Because of the non-differentiation of the $ℓ_{1}$ norm, we used a fast-iterative shrinkage threshold algorithm to solve the above two subproblems in the course of our study (30). Both equations were solved in a similar manner, but the calculation in the model (5) involved only the $m^{t h}$ task, independent of the other tasks, while solving model (4) required the participation of all tasks.

2.3.3 Model analysis

We represent the weights solved by the model in the form of a sum of shared weights s and private weights $b^{m}$ . For the computation of s, all tasks need to be involved, and when $s_{i} \neq 0$ , it is assumed that all tasks pick the $i^{t h}$ feature. However, the computation of $b^{m}$ requires data from only the $m^{t h}$ task, and $b^{m} \neq 0$ indicates the $i^{t h}$ feature is important for the $m^{t h}$ task but may not be important for other tasks. Finally, we denoted the features selected by the mth task by $(s + b^{m})$ . The $ℓ_{1}$ and $ℓ_{1, 1}$ regularization forced both weights to be sparse, thus satisfying the “feature selection” requirement. The feature selection methods with $ℓ_{1, 1}$ norm as the regular term or $ℓ_{2, 1}$ norm as the regular term are two classical methods in the sparse embedding method (31). The multi-task Lasso model based on $ℓ_{1, 1}$ regularization had an entirely separable form. Each task can separately compute its own weight without being influenced by other tasks, so the sparse multi-task Lasso model is equivalent to the single-task Lasso model. The single difference is that all tasks of the sparse multi-task Lasso method use the same regularization parameter. In contrast, the regularization parameter of the single-task Lasso model can be determined by each task individually. The multi-task Lasso model based on the $ℓ_{2, 1}$ regularization makes the features selected by different tasks almost the same, which reasonably exploits the correlation between different tasks but loses the specificity between different tasks. In contrast, our model not only makes full use of the correlation between different tasks but also highlights the specificity between different tasks and fuses the information of multiple sequences, thus providing a more robust prediction model for clinical diagnosis. Their differences are given visually in Figure 2.

FIGURE 2

Figure 2 The difference between LASSO_1,1 Model, LASSO_2,1 Model, and Our Proposed Model. The left panel represents the input datasets; the right panel represents the learned weight matrix. (A) Schematic of our method for feature selection. (B) Schematic diagram of feature selection based on multi-task Lasso model under ℓ_1,1 regularization. (C) Schematic diagram of feature selection based on multi-task Lasso model under ℓ_2,1 regularization.

Accurate preoperative diagnosis can be effective in formulating accurate and personalized treatment for patients, especially when MR images of SBM and GBM are extremely similar. In the current study, the proposed multi-task learning based on the logistic loss function can be used not only for feature selection but also for tumor classification tasks.

In contrast, the Lasso model based on mean square loss can only be used as a feature selection algorithm and cannot perform the subsequent classification task independently. In contrast, our model can use probabilities to account for the classification, for example, for any sample $X_{i}^{m} \in ℝ^{1 \times (d + 1)}$ , the probability that the sample is classified as label 1 is

\begin{array}{l} p (y_{i} = 1 | X_{i}^{m}) = \frac{1}{1 + e^{- X_{i}^{m} (s + b^{m})}} & (6) \end{array}

Next, the model can perform the tumor classification task independently by simply picking the appropriate threshold value. Its calculation process will be described in the section of experimental procedure.

2.3.4 Experimental procedure

This experiment was performed using MATLAB 9.11. The features of each sample and the labels are used as input, and the optimal penalty parameters $λ_{s}$ and $λ_{b}$ are obtained on the training set using 5-fold cross-validation. The ratio $\frac{λ_{s}}{λ_{b}}$ of the two penalty parameters is always kept as the following six values: 1.25, 1.5, 1.75, 2, 2.25, 2.5. Moreover, the mean AUC values under the optimal penalty parameters are recorded. Then, the data were computed and trained using the alternating miniaturization algorithm and the fast-iterative shrinkage threshold algorithm. Note that during the fast-iterative shrinkage threshold algorithm solution, our parameters $L_{0} = 1, η = 1.1$ . To ensure the confidence of the results, we repeatedly performed 5-fold cross-validation 10 times. Finally, the model"s performance was evaluated by average accuracy, average recall, average precision, and average AUC(The flow chart of our model solution is given in Figure 3).

FIGURE 3

Figure 3 Experimental flowchart.

3 Results

Additional analyses were conducted to compare our model with four diagnostic models based on single-task and multi-task learning. The single-task-based models are the diagnostic model with Lasso model for feature selection and SVM for classification (Lasso+SVM); the diagnostic model with Lasso model for feature selection and logistic regression (LR) algorithm for classification (Lasso+LR); the logistic regression as loss function and $ℓ_{1}$ norm constraint of “LR₁” model for feature selection and SVM for classification (LR₁+SVM); and a diagnostic model with LR₁ model for feature selection and classification (LR₁).

The models based on multi-task learning are the diagnostic model with the Lasso multi-task model with $ℓ_{1, 1}$ norm as the regular term for feature selection and the SVM method for classification(Lasso_1,1+SVM); the diagnostic model using the Lasso_1,1 multi-task model for feature selection and logistic regression(LR) for classification(Lasso_1,1+LR); diagnostic model with feature selection using the Lasso multi-task model with $ℓ_{2, 1}$ norm as the regular term and classification by SVM method(Lasso_2,1+SVM); diagnostic model with feature selection using Lasso_2,1 multi-task model and classification by LR method(Lasso_2,1+LR); our model as a diagnostic model with feature selection method and SVM method for classification(Ours+SVM) and our model as a diagnostic model(Ours).

Table 2 shows the confusion matrices of 4 single-task models based on T1, T1C, and T2 sequences and 6 diagnostic models based on multi-task learning. The values in the table are taken as an average of the results of the 5-fold cross-validation 10 times. In the table, TP (True Positive) represents the number of GBM predicted as GBM, FN (False Negative) means the number of GBM predicted as SBM, FP (False Positive) means the number of SBM predicted as GBM, and TN (True Negative) represents the number of SBM predicted as SBM. From the experimental results, it can be seen that the value of TP or TN is significantly higher than the value of FN or FP. This shows that our model is meaningful and feasible.

TABLE 2

Table 2 The mean of the confusion matrix of each model after 5-fold cross-validation 10 times.

To further verify the feasibility of our model, the ROC (receiver operating characteristics) curves of our model are compared with those of four other multi-task models, and the values of the AUC and the standard deviation are also given. Usually, the closer to a value of 1 the AUC is, the better the model performance is; the smaller the standard deviation is, the more stable the model is. For the test dataset, our model combined with SVM classification, multi-task: Ours+SVM, had the most significant AUC of 0.992, however, our multi-task model alone had the second highest AUC of 0.984, with an AUC difference of 0.008 from the optimal model. The standard deviation (std) of the multi-task: Ours+SVM model is also the smallest among the six models, at 0.008, indicating that the model is the most stable. The std of our multi-task model alone was second only to the multi-task: Ours+SVM model. The results are shown in Figure 4.

FIGURE 4

Figure 4 ROC curves of multi-task model in six. The horizontal coordinate indicates the false positive rate and the vertical coordinate indicates the true positive rate. (A) ROC curves for the Lasso_1,1+SVM model, the blue solid line indicates the average ROC curve and the black dashed line indicates the mean ± std. (B) ROC curves for the Lasso_1,1+LR model, the blue solid line indicates the average ROC curve and the black dashed line indicates the mean ± std. (C) ROC curves for the Lasso_2,1+SVM model, the blue solid line indicates the average ROC curve and the black dashed line indicates the mean ± std. (D) ROC curves for the Lasso_2,1+LR model, the blue solid line indicates the average ROC curve and the black dashed line indicates the mean ± std. (E) ROC curves for the Ours+SVM model, the blue solid line indicates the average ROC curve and the black dashed line indicates the mean ± std. (F) ROC curves for the Ours model, the blue solid line indicates the average ROC curve and the black dashed line indicates the mean ± std.

Feature selection can effectively reduce the dimensionality of the data, thereby reducing the amount of computation and improving the efficiency of problem-solving. Figure 5 shows the distribution of features when selecting 20 features for six multitasking models. Our method not only selected the same features for the three sequences of T1, T1C, and T2, but also selected the characteristics that are unique to each sequence, which may have a large impact on only one of the sequences and not very much on the other sequences.

FIGURE 5

Figure 5 Learned weight matrix. The color bar in the right side indicates the values of matrix.The horizontal coordinates indicate the different tasks T1, T1C, and T2. The vertical coordinates indicate the number of features screened.

In brain tumor classification experiments, we compared the classification performance of the single-task-based and multi-task learning-based diagnostic models. In the single-task model experiments, we used four single-task feature selection models to classify the data of T1, T1C, and T2 sequences. Finally, we used 5-fold cross-validation method 10 times to obtain the average AUC on the training set and the average AUC, accuracy, precision, and recall on the test set. In the classification experiment based on multi-task learning, we treated the three sequences of T1, T1C, and T2 as 3 tasks, trained and tested them through 6 multi-task models, and obtained the above 4 evaluation indicators. The results were shown in Table 3.

TABLE 3

Table 3 The values of various metrics for each method on the training and test sets.

In the single-task experiment, the LR₁+ SVM model based on the T2 sequence achieved the highest average AUC of 0.973 on the training set, and the average AUC also reached the highest 0.969 on the test set, and accuracy and recall also reached the highest in the single-task model, accuracy = 0.876, recall = 0.886. In all single-task experiments, the maximum value of precision is 0.922.

The multi-task learning model was introduced into the classification of brain tumors, and the classification performance of each model was significantly improved. As can be seen from the data in the table, the values of the indicators of the multi-task model are significantly better than those of the corresponding single-task model. Using our model, the multi-task: Ours+SVM model, its average AUC, accuracy, and precision are all at their highest, and the AUC on the test set reaches 0.992. Our model is not only used as a feature selection method, but also as a classification method. Although its metrics are not optimal, it outperforms the traditional diagnostic models (Lasso_1,1+SVM, Lasso_1,1+LR, Lasso_2,1+SVM, Lasso_2,1+LR), and we use only one model, thus improving the efficiency of diagnosis.

4 Discussion

Accurate classification of GBM and SBM is a challenging clinical problem. Different sequences of MRI provide unique information, and the rational fusion of multiple sequences can complement information from different dimensions (32). Thus, we proposed a new multi-task learning model to enable an accurate and fast diagnosis method for clinical usage.

This study introduced T1, T1C, and T2 sequences into the multi-task learning model. The feature weights were represented as the sum of shared and private weights. In turn, when filtering radiomic features, we can fully use the correlation between MR images of different sequences while still retaining the differences between the sequences and selecting features that have an essential impact on a specific task. Based on the above multi-task model, we also replaced the data matching term with a logistic regression function, which resulted in efficient model feature selection and classification of brain tumors.

We used an alternating minimization algorithm and a fast-iterative shrinkage threshold algorithm to train the data in model solving. We used the 5-fold cross-validation method to select optimal parameters for the selection of parameters. To ensure the accuracy and credibility of the data results, we conducted a 5-fold cross-validation 10 times in training and testing, and the final metrics were selected as the average value. The optimum model with an average AUC of 0.992 on the test set was found when our model performed feature selection and the SVM method performed classification. As a feature selection and classification method, our method alone reached the second highest average AUC of 0.984 on the test set. Multi-task learning enhances the robustness of our model, thus providing a stable predictive model for clinical diagnosis while ensuring accuracy and making diagnoses possible with just one model, improving the efficiency of diagnostics.

Our model still has a comparative advantage over a single sequence classification task. For example, in a previous study (21), we used the same dataset with the random forest method as the feature selection method. Then six machine learning models were used for classification. The final result is that the random forest did the best classification job, obtaining an AUC of 0.97 on the test set but was limited to the T1C sequences. The single-task model, which does not consider the relationship between different sequences, does not take advantage of complementarity of information, which leads to the final classification effect being relatively not very good. On the other hand, the model proposed in this paper can make full use of the complementary information between different sequences and improve the accuracy and robustness of the prediction model. Our model is comparable with the classical multi-task model based on $ℓ_{2, 1}$ regularization (24), but it can extract not only the same features for each sequence but also features that are important for a specific task. Moreover, compared with the general feature selection model, our model also integrates feature selection and classification to improve efficiency and convenience for diagnosis.

The present study does have some limitations. First, this study used a manual method to segment ROI, which is time consuming. Additionally, although two to three researchers have been involved in the segmentation process, it is very challenging to eliminate all bias in the results. Second, the data sample is small and cannot be extrapolated from this particular population to the general population. Third, the multitask learning model proposed in the present study requires the same features among tasks and does not apply to all multitask problems. Lastly, in the medical imaging part, ROI segmentation requires two neurologists with 5 to 10 years of experience. In our future study, we plan to use deep learning algorithms or image segmentation methods to automatically delineate the ROI to improve the efficiency of our work.

5 Conclusion

In this work, we proposed a feature selection model based on the multi-task learning model for SBM and GBM classification. The feature selection model uses a logistic regression function as a loss function, which makes the classification function of the model possible. Most of the current brain tumor classification studies have been performed using single-task models, which do not take advantage of the correlation between different sequences of MR images and therefore, the performance is not optimized to utilize all available information. Furthermore, the traditional multi-task Lasso model does not fully consider the correlation between different tasks. In contrast, our model makes full use of the correlation between MR images of different sequences while selecting the features that have an essential impact on a specific task. It is possible to select different combinations of features for different tasks, thus improving the classification performance of the model to some extent. In conclusion, our model generally outperforms the traditional multi-task Lasso model.

Our model as a feature selection method and paired with an SVM classification method has a great advantage over other methods of the same type. Our proposed model is also a good choice as a classification method. Although it has inferior performance to that of using our method with other classification algorithms, it improves the convenience of tumor classification. Thus, our model is advantageous in classifying SBM and GBM using MR images with multiple sequences.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

Studies involving human subjects were reviewed and approved by the ethics committees of Xiangya Hospital, Yunnan Cancer Hospital, and Southern Cancer Hospital. The requirement for informed consent was waived.

Author contributions

YH: Writing and Data Analysis. SH: Data Analysis. ZL: Founding acquisition; Writing review and editing. All authors contributed to the article and approved the submitted version.

Funding

The work is supported by Natural Science Foundation of Hunan Province of China with the grant NO.2022JJ30944.

Acknowledgments

We thank the High-Performance Computing Center of Central South University for providing the computation resource.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Pei L, Jones KA, Shboul ZA, Chen JY, Iftekharuddin KM. Deep neural network analysis of pathology images with integrated molecular data for enhanced glioma classification and grading. Front Oncol (2021) 11:668694. doi: 10.3389/fonc.2021.668694

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Alex B, Eva GV, Garth C, Tom R. Primary and metastatic brain tumours in adults: summary of nice guidance. BMJ (2018) 362:k2924. doi: 10.1136/bmj.k2924

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Chen C, Zheng A, Ou X, Wang J, Ma X. Comparison of radiomics-based machine-learning classifiers in diagnosis of glioblastoma from primary central nervous system lymphoma. Front Oncol (2020) 10:1151. doi: 10.3389/fonc.2020.01151

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Nayak L, Lee EQ, Wen PY. Epidemiology of brain metastases. Curr Oncol Rep (2012) 14:48–54. doi: 10.1007/s11912-011-0203-y

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Chen C, Ou X, Wang J, Guo W, Ma X. Radiomics-based machine learning in differentiation between glioblastoma and metastatic brain tumors. Front Oncol (2019) 9:806. doi: 10.3389/fonc.2019.00806

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Cha S, Lupo J, Chen MH, Lamborn K, McDermott M, Berger M, et al. Differentiation of glioblastoma multiforme and single brain metastasis by peak height and percentage of signal intensity recovery derived from dynamic susceptibility-weighted contrast-enhanced perfusion mr imaging. Am J Neuroradiol (2007) 28:1078–84. doi: 10.3174/ajnr.a0484

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Bae S, An C, Ahn SS, Kim H, Han K, Kim SW, et al. Robust performance of deep learning for distinguishing glioblastoma from single brain metastasis using radiomic features: model development and validation. Sci Rep (2020) 10:1–10. doi: 10.1038/s41598-020-68980-6

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, Van Stiphout RG, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Lambin P, Leijenaar RT, Deist TM, Peerlings J, De Jong EE, Van Timmeren J, et al. Radiomics: The bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Kocher M, Ruge MI, Galldiks N, Lohmann P. Applications of radiomics and machine learning for radiotherapy of malignant brain tumors. Strahlentherapie und Onkol (2020) 196:856–67. doi: 10.1007/s00066-020-01626-8

CrossRef Full Text | Google Scholar

11. Louis DN, Perry A, Reifenberger G, Deimling AV, Figarella-Branger D, Cavenee WK, et al. The 2016 world health organization classification of tumors of the central nervous system: A summary. Acta Neuropathol (2016) 131:803–20. doi: 10.1007/s00401-016-1545-1

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Rathore S, Akbari H, Doshi J, Shukla G, Rozycki M, Bilello M, et al. Radiomic signature of infiltration in peritumoral edema predicts subsequent recurrence in glioblastoma: Implications for personalized radiotherapy planning. J Med Imaging (2018) 12:021219. doi: 10.1117/1.JMI.5.2.021219

CrossRef Full Text | Google Scholar

13. Austin D, Zhang B, Taimur S, Andrew P, Nicholas L, Mary GS, et al. Diagnostic accuracy of mri texture analysis for grading gliomas. J Neuro-Oncol (2018) 140:583–9. doi: 10.1007/s11060-018-2984-4

CrossRef Full Text | Google Scholar

14. Zhao Z, Liu H. Spectral feature selection for supervised and unsupervised learning. Proc 24th Int Conf Mach Learn (2007) 227:1151–7. doi: 10.1145/1273496.1273641

CrossRef Full Text | Google Scholar

15. Peng H, Long F, Ding C. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell (2005) 27:1226–38. doi: 10.1109/TPAMI.2005.159

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Constantinopoulos C, Titsias MK, Likas A. Bayesian Feature and model selection for gaussian mixture models. IEEE Trans Pattern Anal Mach Intell (2006) 28:1013–8. doi: 10.1109/TPAMI.2006.111

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Sun Z, Yu Y. Robust multi-class feature selection via ℓ_2,0-norm regularization minimization. Intelligent Data Anal (2022) 26:57–73. doi: 10.3233/IDA-205724

CrossRef Full Text | Google Scholar

18. Qian Z, Li Y, Wang Y, Li L, Li R, Wang K, et al. Differentiation of glioblastoma from solitary brain metastases using radiomic machine-learning classifiers. Cancer Lett (2019) 451:128–35. doi: 10.1016/j.canlet.2019.02.054

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Cho HH, Lee SH, Kim J, Park H. Classification of the glioma grading using radiomics analysis. PeerJ (2018) 6:e5982. doi: 10.7717/peerj.5982

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Artzi M, Bressler I, Ben Bashat D. Differentiation between glioblastoma, brain metastasis and subtypes using radiomics analysis. J Magnetic Resonance Imaging (2019) 50:519–28. doi: 10.1002/jmri.26643

CrossRef Full Text | Google Scholar

21. Liu Z, Jiang Z, Meng L, Yang J, Zhou R. Handcrafted and deep learning-based radiomic models can distinguish gbm from brain metastasis. J Oncol (2021) 2021:1–10. doi: 10.1155/2021/5518717

CrossRef Full Text | Google Scholar

22. Bi J, Xiong T, Yu S, Dundar M, Rao RB. An improved multi-task learning approach with applications in medical diagnosis. In: Joint European conference on machine learning and knowledge discovery in databases. Berlin, Heidelberg: Springer (2008). p. 117–32. doi: 10.1007/978-3-540-87479-926

CrossRef Full Text | Google Scholar

23. Fourure D, Emonet R, Fromont E, Muselet D, Neverova N, Tremeau A, et al. Multi-task, multi-domain learning: Application to semantic segmentation and pose regression. Neurocomputing (2017) 251:68–80. doi: 10.1016/j.neucom.2017.04.014

CrossRef Full Text | Google Scholar

24. Obozinski G, Taskar B, Jordan M. Multi-task feature selection. Statistics Department, UC Berkeley: Citeseer (2006) 2:2.

Google Scholar

25. Obozinski G, Taskar B, Jordan M. Joint covariate selection for grouped classification. Dordrecht, Netherlands: Dept. of Statistics, University of California Berkeley (2007).

Google Scholar

26. Yin P, Lou Y, He Q, Xin J. Minimization of ℓ_1-2 for compressed sensing. SIAM J Sci Computing (2015) 37:A536–63. doi: 10.1137/140952363

CrossRef Full Text | Google Scholar

27. Liang C, Li Y, Luo J. A novel method to detect functional microrna regulatory modules by bicliques merging. IEEE/ACM Trans Comput Biol Bioinf (2016) 13:1–1. doi: 10.1109/TCBB.2015.2462370

CrossRef Full Text | Google Scholar

28. Hu Z, Zhuang Q, Xiao Y, Wu G, Shi Z, Chen L, et al. Mil normalization-prerequisites for accurate mri radiomics analysis. Comput Biol Med (2021) 104403. doi: 10.1016/j.compbiomed.2021.104403

CrossRef Full Text | Google Scholar

29. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. Smote: Synthetic minority over-sampling technique. J Artif Int Res (2002) 16:321–57. doi: 10.1613/jair.953

CrossRef Full Text | Google Scholar

30. Beck A, Teboulle M. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J Imaging Sci (2009) 2:183–202. doi: 10.1137/080716542

CrossRef Full Text | Google Scholar

31. Obozinski G, Taskar B, Jordan MI. Joint covariate selection and joint subspace selection for multiple classification problems. Stat Computing (2010) 20:231–52. doi: 10.1007/s11222-008-9111-x

CrossRef Full Text | Google Scholar

32. Bathla G, Priya S, Liu Y, Ward C, Le N, Soni N, et al. Radiomics-based differentiation between glioblastoma and primary central nervous system lymphoma: A comparison of diagnostic performance across different mri sequences and machine learning techniques. Eur Radiol (2021) 31:1–11. doi: 10.1007/s00330-021-07845-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: solitary brain metastases, glioblastoma, multi-task learning, feature selection, classification, logistic regression

Citation: Huang Y, Huang S and Liu Z (2022) Multi-task learning-based feature selection and classification models for glioblastoma and solitary brain metastases. Front. Oncol. 12:1000471. doi: 10.3389/fonc.2022.1000471

Received: 22 July 2022; Accepted: 16 August 2022;
Published: 21 September 2022.

Edited by:

Xue Meng, Shandong Cancer Hospital, China

Reviewed by:

Fada Guan, Yale University, United States
Qilei Chen, University of Massachusetts Lowell, United States

Copyright © 2022 Huang, Huang and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhiyong Liu, emhpeW9uZ2xpdWljdUBjc3UuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.