- 1Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, China
- 2Anhui Provincial Engineering Laboratory of Beidou Precision Agricultural Information, Anhui Agricultural University, Hefei, China
- 3School of Information and Computer, Anhui Agricultural University, Hefei, China
- 4Key Laboratory of High Magnetic Field and Ion Beam Physical Biology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China
- 5Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, China
Head and neck squamous cell carcinoma (HNSCC) is a malignant tumor. Radiotherapy (RT) is an important treatment for HNSCC, but not all patients derive survival benefit from RT due to the individual differences on radiosensitivity. A prediction model of radiosensitivity based on multiple omics data might solve this problem. Compared with single omics data, multiple omics data can illuminate more systematical associations between complex molecular characteristics and cancer phenotypes. In this study, we obtained 122 differential expression genes by analyzing the gene expression data of HNSCC patients with RT (N = 287) and without RT (N = 189) downloaded from The Cancer Genome Atlas. Then, HNSCC patients with RT were randomly divided into a training set (N = 149) and a test set (N = 138). Finally, we combined multiple omics data of 122 differential genes with clinical outcomes on the training set to establish a 12-gene signature by two-stage regularization and multivariable Cox regression models. Using the median score of the 12-gene signature on the training set as the cutoff value, the patients were divided into the high- and low-score groups. The analysis revealed that patients in the low-score group had higher radiosensitivity and would benefit from RT. Furthermore, we developed a nomogram to predict the overall survival of HNSCC patients with RT. We compared the prognostic value of 12-gene signature with those of the gene signatures based on single omics data. It suggested that the 12-gene signature based on multiple omics data achieved the best ability for predicting radiosensitivity. In conclusion, the proposed 12-gene signature is a promising biomarker for estimating the RT options in HNSCC patients.
Introduction
Head and neck squamous cell carcinoma (HNSCC) is the sixth most common malignancy in the world, and nearly 60% of newly diagnosed HNSCC is locally advanced disease (Alsahafi et al., 2019; van der Heijden et al., 2019; Wang et al., 2019). Radiotherapy (RT) is a commonly used adjuvant therapy for HNSCC in addition to surgical treatment (Jemal et al., 2011; Suh et al., 2015). But each of HNSCC patients receiving the same dose of RT has different responses due to the complexity and heterogeneity of tumor, and some patients even have RT injury and secondary cancer (Scaife et al., 2015). Globally, the prognosis of HNSCC patients receiving RT remains a challenge. Therefore, prognostic biomarkers of radiosensitivity prediction for HNSCC are needed to improve RT options and predict treatment response.
In recent years, several studies have suggested that miRNAs, lncRNAs, and some of their target genes were correlated with the RT outcomes in HNSCC patients (Leucci et al., 2016; Weng et al., 2016; Chen et al., 2018; Han et al., 2018). For instance, the upregulation of miR-494-3p expression can enhance the radiosensitivity of HNSCC (Weng et al., 2016), and the upregulation of LINC00473 promotes the radioresistance of HNSCC cells (Han et al., 2018). Furthermore, some gene expression-based signatures have been constructed to predict the survival rate of HNSCC patients with RT. For example, Eschrich et al. (2009) developed the radiation sensitivity index, which was used to predict survival probability in HNSCC patients receiving concurrent chemoradiotherapy. Ma et al. (2019) identified a 4-gene methylation signature to predict the survival rate of HNSCC patients with RT. However, these studies only used single omics data which could not draw more comprehensive associations between complex molecular characteristics and cancer phenotypes. By contrast, multiple omics data involve multidimensional studies of cancer cells, potentially revealing the molecular mechanisms behind different phenotypes of cancer, such as metastasis and recurrence (Chakraborty et al., 2018; Xi et al., 2018; Wang et al., 2020). Therefore, a model based on multiple omics data could be an effective method for radiosensitivity prediction of HNSCC patients.
In this work, in order to construct reliable biomarkers for predicting RT response in HNSCC, we used the gene expression data and copy number variation (CNV)/single nucleotide variation (SNV) data from The Cancer Genome Atlas (TCGA) (Tomczak et al., 2015) to develop a gene signature by the two-stage regularization (2SR) (Lin et al., 2015; Hu et al., 2019) and multivariable Cox regression (Gui and Li, 2005; Benner et al., 2010) models. Then, we evaluated the ability of the gene signature for predicting radiosensitivity by Kaplan–Meier survival analysis (Tripepi and Catalano, 2004). Furthermore, we constructed a nomogram based on the gene signature and clinical variables to facilitate a more intuitive prediction of 3-year and 5-year survival rates for HNSCC patients receiving RT.
Materials and Methods
Figure 1 illustrated the workflow of the proposed signature for predicting RT response in HNSCC. Firstly, R package DESeq2 (Love et al., 2014) was used to identify differential expression genes (DEGs) between HNSCC patients receiving RT and without RT. Secondly, the 2SR and multivariable Cox regression models were used to construct a gene signature associated with the radiosensitivity prediction of HNSCC patients. Finally, Kaplan–Meier survival analysis and time-dependent receiver operating characteristic (ROC) curves (Heagerty et al., 2000; Kamarudin et al., 2017) were used to evaluate the performance of the gene signature. And a nomogram based on the gene signature and clinical variables was constructed to predict the 3-year and 5-year survival rates. The major procedures were described in the following sections.
Data Processing
We downloaded the transcriptomic gene expression data and the clinical follow-up data of HNSCC patients from TCGA (Tomczak et al., 2015). Meanwhile, we collected the genomic CNV/SNV data from UCSC Xena platform (Goldman et al., 2019). Firstly, we abandoned samples with overall survival (OS) less than 30 days to avoid the impact of deaths with unrelated causes (Chen et al., 2018), and a total of 476 HNSCC patients were analyzed. Furthermore, the available clinical variables included gender, age, OS, vital status, T stage, N stage, clinical stage, tumor grade, and RT options. Secondly, we obtained 20,557 common genes from the gene expression data and the CNV/SNV data, which were used to screen the gene signature associated with the radiosensitivity prediction of HNSCC in subsequent analysis. Thirdly, DESeq2 was used in the normalization of gene expression data and the detection of DEGs between 287 HNSCC patients with RT and 189 patients without RT. As a consequence, genes with |log2 fold change| ≥ 1 and false discovery rate (FDR) < 0.01 were defined as DEGs for the further analysis. Finally, the CNV/SNV data of DEGs were converted into a sample-by-gene matrix. If there were one or more mutations within the gene for each sample, the gene-level mutation status value in sample-by-gene matrix was defined as 1; otherwise, it was defined as 0.
Establishment of the Gene Signature
Based on DEGs, we used the 2SR and multivariable Cox regression models to construct a gene signature that can predict the radiosensitivity of HNSCC patients. The 2SR model could integrate multiple layer omics data to identify signature genes. Specifically, on the first layer, we predicted gene expression values using the CNV/SNV data of DEGs. On the second layer, we used a regularization methodology to regress the predicted gene expression on the first layer and performed signature genes selection and estimation. Multivariable Cox regression model was used to estimate regression coefficients for the identified signature genes. From this model, gene score of HNSCC patients was described as the sum of the products of individual gene expression levels and the estimated regression coefficients. The detailed processes were elaborated in the following.
Firstly, the 287 patients with RT were randomly divided into a training set (N = 149) and a test set (N = 138) (Table 1). Secondly, on the training set, we input gene expression data, clinical data, and the sample-by-gene matrix of DEGs into the 2SR model to select the OS-related signature genes. The 2SR model was used with default parameters, and the output of this model was a list of genes and their correlated coefficients with OS in patients with RT. When the correlation between genes and OS reached 80% and above, these genes were identified as signature genes. Finally, we calculated the coefficients of these signature genes using multivariable Cox regression model and constructed a gene signature according to the expression levels of these genes, which can stratify HNSCC patients into the high- and low-score groups with the median score of the gene signature on the training set as cutoff value.
Statistical Analysis
In the work, the ROC curve was performed via the R package survivalROC (Heagerty et al., 2013), and the area under the ROC curve (AUC) was used to assess the overall performance of radiosensitivity prediction. Kaplan–Meier survival curves were used to further evaluate the significance difference of OS between different groups. A two-tailed P-value (P) < 0.05 was considered statistically significant in all analyses. The nomogram and the calibration plot were established using R package rms (Harrell et al., 2019) and were used to predict OS of HNSCC patients with RT.
Results and Discussion
Identification of a 12-Gene Signature
Based on the gene expression data from 287 patients with RT and 189 patients without RT, 122 DEGs with |log2 fold change| ≥ 1 and FDR < 0.01 were obtained. The gene expression, clinical, and CNV/SNV data of these DEGs on the training set were imported into 2SR model. With the probability related to OS should be >80% of HNSCC patients receiving RT, 12 genes were picked out as the signature genes. Next, we calculated the coefficients of these signature genes using multivariable Cox regression model. Finally, in order to predict the radiosensitivity of HNSCC patients, we constructed a gene signature on the training set according to the expression levels of these 12 genes as follows: gene score = TDRD9 × 6.950E-6 + CELF3 × 1.106E-2 + FGF19 × 1.937E-5 + KCNB2 × 5.388E-3 + CLDN6 × 1.334E-4 − BEST2 × 6.053E-4 − DDX25 × 1.802E-3 − TMPRSS15 × 4.378E-4 − ALPI × 2.134E-3 − FABP7 × 1.754E-3 − IL17REL × 2.132E-3 − RORB × 1.182E-3. The details of these 12 signature genes were shown in Supplementary Table S1. Then, based on the gene scores, the patients were divided into the high- and low-score groups, where the cutoff value of -0.06338 was derived from the median score of gene scores on the training set samples. Specifically, on the training (149 HNSCC patients receiving RT) and test (139 patients receiving RT) sets, HNSCC patients with a gene score ≥-0.06338 were divided into the high-score group, while those with a gene score <-0.06338 were divided into the low-score group.
Radiosensitivity Prediction by the 12-Gene Signature
To assess the radiosensitivity prediction ability of the 12-gene signature on the HNSCC patients, Kaplan–Meier survival and ROC curves were performed on the training and test sets, respectively. On the training set, there was a significant difference on radiosensitivity between the high- and low-score groups (P = 0.0011, Figure 2A). As we can see, patients in the high-score group were associated with poor radiosensitivity, while patients in the low-score group showed good radiosensitivity. In the light of the time-dependent ROC curves of 3-year and 5-year survival (Figure 2B), the survival time prediction accuracy of the 12-gene signature for HNSCC patients had AUC of 0.705 at 3 years and 0.697 at 5 years. Furthermore, the prediction performance of the 12-gene signature was evaluated on the test set. As seen in Figure 2C, the survival rate was significantly higher in the low-score group than that in the high-score group (P = 0.00031), which was similar with the result of the training set. And the 3-year and 5-year prediction accuracy achieved 0.661 and 0.584, respectively (Figure 2D). The performance on the test set was similar with that on the training set. It indicated the generalization ability of the 12-gene signature was good, and this gene signature could provide a method to predict the radiosensitivity for HNSCC patients. However, the radiosensitivity prediction accuracy of 5-year survival on the test set was lower than that on the training set (Figures 2B,D). There were two possible reasons to explain the causes of the difference. First, the follow-up times were relatively short for HNSCC cohort in TCGA, and the OS of most patients was also less than 5 years. Second, the intrinsic genetic heterogeneity of the tumor could lead to different OS in HNSCC patients with same therapeutic method. The longer the OS, the larger the effect of the intrinsic genetic heterogeneity, and the more difficultly we evaluated this effect with RT.
Figure 2. Kaplan–Meier survival and time-dependent ROC curves on the training (A,B) and test (C,D) sets according to the 12-gene signature.
Assessment of the 12-Gene Signature in All HNSCC Patients
Because of the complexity and heterogeneity of tumors, some HNSCC patients are treated by RT with good outcomes, and some patients may have a resistance to RT, even get worse. Therefore, the 12-gene signature is important for the radiosensitivity prediction of HNSCC patients to improve RT options. In addition, we also assessed the prognostic value of the 12-gene signature in a total of 476 HNSCC patients. As seen in Figure 3A, it indicated that patients in the low-score group generally had a higher 5-year survival rate than that in the high-score group (P < 0.0001). And there was a significant difference between patients with RT and without RT in the low-score groups (P = 0.0033, Figure 3B); however, in the high-score group, the difference was insignificant (P = 0.95, Figure 3C). It suggested that patients in the high-score group might have the radioresistance and did not benefit from RT.
Figure 3. The prognostic values of the 12-gene signature in all HNSCC patients. (A) Kaplan–Meier analysis of overall survival in 476 patients according to the 12-gene signature. (B) Kaplan–Meier survival curves of patients with/without RT in the low-score group. (C) Kaplan–Meier survival curves of patients with/without RT in the high-score group.
In addition, based on different clinical variables such as age, gender, T stage, N stage, clinical stage and tumor grade, HNSCC patients were further stratified into different subgroups (Table 1). Then we evaluated the prognostic value of the 12-gene signature on these different subgroups between HNSCC patients in the high- and low-score groups. On the subgroups of clinic T (Figures 4A,B), clinic N (Figures 4C,D), stage 3-4 (Figure 4F), and grade (Figures 4G,H), the survival rates of patients in the low-score group were significantly higher than those in the high-score group. While there was no statistically significant difference of survival rate between patients in the high- and low-score groups in the subgroup of stage 1-2 (Figure 4E). It was mainly due to the small number of patients in this clinical phase, which accounted for only 11% of all HNSCC patients. Furthermore, on the subgroups of age (Supplementary Figures S1A,B) and gender (Supplementary Figure S1C), the survival rates of patients in the low-score group also were significantly longer than those patients in the high-score group. However, the survival rate between female patients (Supplementary Figure S1D) in the high- and low-score groups did not have statistically significant difference, which was similar with that on the stage 1-2 subgroup. Taken together, these results suggested that this 12-gene signature could serve as a novel and reliable biomarker for the radiosensitivity prediction of HNSCC patients.
Figure 4. The Kaplan–Meier survival analysis of the 12-gene signature in all HNSCC patients in the high- and low-score groups on clinical subgroups of T 1-2 (A), T 3-4 (B), N 0-1 (C), N 2-3 (D), Stage 1-2 (E), Stage 3-4 (F), Grade 1-2 (G) and Grade 3-4 (H).
Prediction of OS in HNSCC With RT by Nomogram
As a visual tool, nomogram has been widely used in predicting the prognosis of cancers (Lubsen et al., 1978; Gorlia et al., 2008). In this work, the nomogram was used to construct the OS prediction model for HNSCC patients with RT. As shown in Figure 5A, age, T stage, N stage, clinical stage, tumor grade, and gene score were considered as relevant variables for the nomogram construction. Since the points of male and female in the nomogram were similar and closed to zero, gender was not shown here. In addition, the contribution of gene score was very important, and it played a crucial role in survival estimation of HNSCC patients with RT. In order to evaluate the predicted outcomes of nomogram, the calibration plots on the training and test sets were exhibited in Figure 5B. It was found that the predicted results of the nomogram showed good agreements with the actual situations, especially on the test set. Furthermore, according to the time-dependent ROC curves (Figure 5C), the nomogram achieved 0.701 and 0.641 of AUC for 3-year OS on the training and test sets, respectively. It revealed that the nomogram could be used as a promising tool to predict the OS of HNSCC patients with RT.
Figure 5. Evaluation of the nomogram on predicting the OS of HNSCC patients with RT. (A) Nomogram for predicting the 3-year and 5-year OS in HNSCC with RT. (B) Calibration plots of the nomogram on the training and test sets. The 45-degree line represents the real outcomes. (C) Time-dependent ROC curves of 3-year OS prediction using the nomogram on the training and test sets.
Comparison With the Gene Signatures Based on Single Omics Data
Besides the 12-gene signature established using multiple omics data, we also assessed the radiosensitivity prediction ability of the gene signatures based on single omics data, such as gene expression data or CNV/SNV data. Since the 2SR model requires multiple omics data as the input files, we used differential expression analysis, univariable Cox proportional hazards regression analysis (David, 1972), classical LASSO regression model (Tibshirani, 1996, 1997; Segal, 2006), and multivariate Cox regression model to select the most important biomarkers and take their linear combination as a predictor of radiosensitivity based on single omics data. Firstly, differential expression analysis was used to identify DEGs by analyzing gene expression profiles of HNSCC patients with RT and without RT. Of note, the construction of the gene signature based on the CNV/SNV data did not use differential expression analysis. Secondly, univariate Cox proportional hazards regression analysis and LASSO logistic regression model were used to screen out the characteristic genes associated with survival. Thirdly, multivariate Cox regression model was used to establish a gene signature for radiosensitivity prediction. Then, HNSCC patients receiving RT were divided into the high- and low-score groups according to the median score on the training set patients. Finally, Kaplan–Meier survival analysis and ROC curves were conducted to observe the difference of survival rate between the patients in the high- and low-score groups.
Based on gene expression data, a 7-gene signature was constructed for radiosensitivity prediction, and the gene score = CHGB × 1.121E-4 + ODAM × 3.603E-5 + RP11-169K17.3 × 8.518E-2 − ZNF541 × 8.229E-5 − CLGN × 2.952E-3 − AC011747.3 × 1.407E-1 − RP11-203B7.2 × 1.357E-1. The details of these 7 signature genes were shown in Supplementary Table S2. The survival analysis results on the test set were shown in Figure 6A. Obviously, the difference between the high- and low-score groups was significant (P = 0.029), which was similar with that on the training set (P = 0.011, Supplementary Figure S2A). The prognostic accuracy of the 7-gene signature for HNSCC patients receiving RT was 0.62 at 3 years and 0.575 at 5 years (Figure 6B), which were both lower than the performances of the 12-gene signature based on multiple omics data.
Figure 6. Kaplan–Meier survival and time-dependent ROC curves on the test set according to the 7-gene signature (A,B) and the 3-gene signature (C,D).
Based on CNV/SNV data, a 3-gene signature was developed for radiosensitivity prediction, and the gene score = − BCLAF1 × 8.084E-1 − ABCB9 × 3.330E-1 − MIS18BP1 × 3.697E-1. The details of the 3-gene signature were shown in Supplementary Table S3. The survival analysis on the test set showed that there was a significant difference between the high- and low-score groups (P = 0.018, Figure 6C), but it was inconsistent with the result on the training set (P = 0.0068, Supplementary Figure S2C). The 3-year and 5-year survival prognostic accuracy of the 3-gene signature were 0.387 and 0.345 on the test set, respectively (Figure 6D), which were far less than the performances of the 12-gene signature using multiple omics data. As single omics data, the sample-by-gene matrix based on CNV/SNV data was sparse, and the genetic information extracted from the sparse matrix was very limited. So the radiosensitivity prediction accuracy of the gene signature based on CNV/SNV data was not good.
At first, we compared the 12-gene signature with those gene signatures (7-gene and 3-gene signatures) based on single omics data. The results showed that the 12-gene signature achieved the highest separation ability, and it significantly stratified patients into the low- and high-score groups on the test set (P = 0.00031 vs. P = 0.029 vs. P = 0.018). It also had the highest accuracy of survival estimation among these gene signatures (3-year survival: 0.661 vs. 0.620 vs. 0.387; 5-year survival: 0.584 vs. 0.575 vs. 0.345) on the test set. In addition, we performed GO and KEGG enrichment analyses, and the result showed there were no significant enrichments. Moreover, on the GO and KEGG terms (Jiao et al., 2012; Xi et al., 2020), there are also no correlations between these signature genes and radiation. Maybe these genes were novel candidate targets and biomarkers correlated with radiation. However, given the performance of 12-gene signature was better than those of 7-gene and 3-gene signatures, the genes in 12-gene signature were more important on radiation response and radiosensitivity prediction. Nevertheless, their roles need to be proved by biological and clinical experiments. Furthermore, we evaluated the performance of the 5-miRNA signature (Chen et al., 2018) on the test set. As shown in Supplementary Figure S3, there is no significant different between the high- and low-score groups based on 5-miRNA signature, and the items of AUC (0.493 and 0.450) on 3-year and 5-year survivals were lower than those (0.661 and 0.584) based on 12-gene signature. In a word, the 12-gene signature based on multiple omics data was a relatively reliable biomarker to predict whether the HNSCC patient benefit from the treatment of RT.
Conclusion
In this study, we used the gene expression, clinical, and CNV/SNV data to develop and validate the 12-gene signature, which may serve as a promising prognostic biomarker for the radiosensitivity prediction of HNSCC patients. Furthermore, we constructed a nomogram based on gene score and clinical variables, which might be a useful tool on the survival estimation of HNSCC patients receiving RT. Finally, we systemically compared the prognosis ability of the gene signatures based on multiple and single omics data, and the results showed that the 12-gene signature based on multiple omics data was more accurate in predicting radiotherapy response and survival rate of HNSCC patients. However, this study also has some limitations. First, these signature genes as biomarkers for radiosensitivity in HNSCC deserve further biological and clinical verification. Second, gene expression signatures are subject to sampling bias caused by the complexity and heterogeneity of tumors, so we will consider the subtypes of tumor in the future study.
Data Availability Statement
All datasets presented in this study are included in the article/Supplementary Material.
Author Contributions
JX, LW, GZ, MH, JL, and ZY conceived and designed the study. MH and JL performed the experiments. MH, JL, and CD wrote the manuscript. JX, YB, and PW reviewed the manuscript. All authors read and approved the final version of this manuscript.
Funding
This work was supported by the National Natural Science Foundation of China (11835014, 61672037, and 21601001) and the Recruitment Program for Leading Talent Team of Anhui Province (2019-16).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors acknowledge the High-performance Computing Platform of Anhui University for providing computing resources.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2020.00960/full#supplementary-material
FIGURE S1 | The Kaplan-Meier survival analysis of the 12-gene signature in all HNSCC patients in the high and low score groups on age ≤ 60, age > 60, male, female subgroups.
FIGURE S2 | Kaplan-Meier survival and time-dependent ROC curves on the training set according to the 7-gene signature and the 3-gene signature.
TABLE S1 | The information of genes in the 12-gene signature.
TABLE S2 | The information of genes in the 7-gene signature.
TABLE S3 | The information of genes in the 3-gene signature.
References
Alsahafi, E., Begg, K., Amelio, I., Raulf, N., Lucarelli, P., Sauter, T., et al. (2019). Clinical update on head and neck cancer: molecular biology and ongoing challenges. Cell Death Dis. 10:540. doi: 10.1038/s41419-019-1769-1769
Benner, A., Zucknick, M., Hielscher, T., Ittrich, C., and Mansmann, U. (2010). High-dimensional Cox models: the choice of penalty as part of the model building process. Biomed. J. 52, 50–69. doi: 10.1002/bimj.200900064
Chakraborty, S., Hosen, M. I., Ahmed, M., and Shekhar, H. U. (2018). Onco-multi-OMICS approach: a new frontier in cancer research. Biomed Res. Int. 2018:9836256. doi: 10.1155/2018/9836256
Chen, L., Wen, Y., Zhang, J., Sun, W., Lui, V. W. Y., Wei, Y., et al. (2018). Prediction of radiotherapy response with a 5-microRNA signature-based nomogram in head and neck squamous cell carcinoma. Cancer Med. 7, 726–735. doi: 10.1002/cam4.1369
Eschrich, S., Zhang, H. L., Zhao, H. Y., Boulware, D., Lee, J. H., Bloom, G., et al. (2009). Systems biology modeling of the radiation sensitivity network: a biomarker discovery platform. Int. J. Radiat. Oncol. Biol. Phys. 75, 497–505. doi: 10.1016/j.ijrobp.2009.05.056
Goldman, M., Craft, B., Hastie, M., Repeèka, K., McDade, F., Kamath, A., et al. (2019). The UCSC Xena platform for public and private cancer genomics data visualization and interpretation. bioRxiv [Preprint] Available online at: https://www.biorxiv.org/content/10.1101/326470v6 (accessed September 26, 2019).
Gorlia, T., van den Bent, M. J., Hegi, M. E., Mirimanoff, R. O., Weller, M., Cairncross, J. G., et al. (2008). Nomograms for predicting survival of patients with newly diagnosed glioblastoma: prognostic factor analysis of EORTC and NCIC trial 26981-22981/CE.3. Lancet Oncol. 9, 29–38. doi: 10.1016/S1470-2045(07)70384-70384
Gui, J., and Li, H. (2005). Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data. Bioinformatics 21, 3001–3008. doi: 10.1093/bioinformatics/bti422
Han, P., Ji, X., Zhang, M., and Gao, L. J. E. R. M. P. S. (2018). Upregulation of lncRNA LINC00473 promotes radioresistance of HNSCC cells through activating Wnt/beta-catenin signaling pathway. Eur. Rev. Med. Pharmacol. 22, 7305–7313. doi: 10.26355/eurrev_201811_16267
Harrell, F. E. Jr., Harrell, M. F. E. Jr., and Hmisc, D. (2019). Package ‘rms’. Nashville: Vanderbilt University, 229.
Heagerty, P. J., Lumley, T., and Pepe, M. S. (2000). Time-dependent ROC curves for censored survival data and a diagnostic marker. Biometrics 56, 337–344.
Hu, X. H., Xie, W. B., Wu, C. C., and Xu, S. Z. (2019). A directed learning strategy integrating multiple omic data improves genomic prediction. Plant Biotechnol. J. 17, 2011–2020.
Jemal, A., Bray, F., Center, M. M., Ferlay, J., Ward, E., and Forman, D. (2011). Global cancer statistics. CA Cancer J. Clin. 61, 69–90. doi: 10.3322/caac.20107
Jiao, X., Sherman, B. T., Huang, D. W., Stephens, R., Baseler, M. W., Lane, H. C., et al. (2012). DAVID-WS: a stateful web service to facilitate gene/protein list analysis. Bioinformatics 28, 1805–1806.
Kamarudin, A. N., Cox, T., and Kolamunnage-Dona, R. (2017). Time-dependent ROC curve analysis in medical research: current methods and applications. BMC Med. Res. Methodol. 17:53. doi: 10.1186/s12874-017-0332-6
Leucci, E., Vendramin, R., Spinazzi, M., Laurette, P., Fiers, M., Wouters, J., et al. (2016). Melanoma addiction to the long non-coding RNA SAMMSON. Nature 531, 518–522.
Lin, W., Feng, R., and Li, H. Z. (2015). Regularization methods for high-dimensional instrumental variables regression with an application to genetical genomics. J. Am. Stat. Assoc. 110, 270–288. doi: 10.1080/01621459.2014.908125
Love, M. I., Huber, W., and Anders, S. (2014). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15:550. doi: 10.1186/s13059-014-0550-558
Lubsen, J., Pool, J., and Der Does, E. V. (1978). A practical device for the application of a diagnostic or prognostic function. Methods Inf. Med. 17, 127–129. doi: 10.1055/s-0038-1636613
Ma, J. B., Li, R., and Wang, J. (2019). Characterization of a prognostic four-gene methylation signature associated with radiotherapy for head and neck squamous cell carcinoma. Mol. Med. Report. 20, 622–632. doi: 10.3892/mmr.2019.10294
Scaife, J. E., Barnett, G. C., Noble, D. J., Jena, R., Thomas, S. J., West, C. M. L., et al. (2015). Exploiting biological and physical determinants of radiotherapy toxicity to individualize treatment. Brit. J. Radiol. 88:20150172. doi: 10.1259/bjr.20150172
Segal, M. R. (2006). Microarray gene expression data with linked survival phenotypes: diffuse large-B-cell lymphoma revisited. Biostatistics 7, 268–285. doi: 10.1093/biostatistics/kxj006
Suh, Y. E., Raulf, N., Gaken, J., Lawler, K., Urbano, T. G., Bullenkamp, J., et al. (2015). MicroRNA-196a promotes an oncogenic effect in head and neck cancer cells by suppressing annexin A1 and enhancing radioresistance. Int. J. Cancer 137, 1021–1034. doi: 10.1002/ijc.29397
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. R. Stat. Soc. B. 58, 267–288. doi: 10.1111/j.2517-6161.1996.tb02080.x
Tibshirani, R. (1997). The lasso method for variable selection in the Cox model. Stat. Med. 16, 385–395. doi: 10.1002/(sici)1097-0258(19970228)16:4<385::aid-sim380<3.0.co;2-3
Tomczak, K., Czerwinska, P., and Wiznerowicz, M. (2015). The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge. Contemp. Oncol. 19, A68–A77. doi: 10.5114/wo.2014
van der Heijden, M., Essers, P. B. M., de Jong, M. C., de Roest, R. H., Sanduleanu, S., Verhagen, C. V. M., et al. (2019). Biological determinants of chemo-radiotherapy response in HPV-negative head and neck cancer: a multicentric external validation. Front. Oncol. 9:1470. doi: 10.3389/fonc.2019.01470
Wang, H. C., Chan, L. P., and Cho, S. F. (2019). Targeting the immune microenvironment in the treatment of head and neck squamous cell carcinoma. Front. Oncol. 9:1084. doi: 10.3389/fonc.2019.01084
Wang, W., Zhou, Y., Cheng, M.-T., Wang, Y., Zheng, C.-H., Xiong, Y., et al. (2020). Potential pathogenic genes prioritization based on protein domain interaction network analysis. IEEE/ACM Trans. Comput. Biol. Bioinform. 99, 1–1.
Weng, J. H., Yu, C. C., Lee, Y. C., Lin, C. W., Chang, W. W., and Kuo, Y. L. (2016). miR-494-3p induces cellular senescence and enhances radiosensitivity in human oral squamous carcinoma cells. Int. J. Mol. Sci. 17:1092. doi: 10.3390/ijms17071092
Xi, J., Li, A., and Wang, M. (2018). HetRCNA: a novel method to identify recurrent copy number alternations from heterogeneous tumor samples based on matrix decomposition framework. IEEE/ACM Trans. Comput. Biol. Bioinform. 17, 422–434.
Keywords: head and neck squamous cell carcinoma, radiotherapy, multiple omics data, radiosensitivity, gene signature
Citation: Liu J, Han M, Yue Z, Dong C, Wen P, Zhao G, Wu L, Xia J and Bin Y (2020) Prediction of Radiosensitivity in Head and Neck Squamous Cell Carcinoma Based on Multiple Omics Data. Front. Genet. 11:960. doi: 10.3389/fgene.2020.00960
Received: 15 June 2020; Accepted: 31 July 2020;
Published: 18 August 2020.
Edited by:
Bing Wang, Anhui University of Technology, ChinaCopyright © 2020 Liu, Han, Yue, Dong, Wen, Zhao, Wu, Xia and Bin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yannan Bin, ynbin@ahu.edu.cn
†These authors have contributed equally to this work