- 1Department of Urology, Danyang People’s Hospital, Danyang, China
- 2Department of Clinical Medicine, Medical School of Southeast University, Nanjing, China
- 3Department of Urology, Zhongda Hospital Affiliated to Southeast University, Nanjing, China
- 4Department of Urology, The Second People's Hospital of Taizhou, Taizhou, China
- 5Department of Urology, Yancheng Third People’s Hospital, Yancheng, China
- 6Department of Urology, Jinhu County People’s Hospital, Huaian, China
- 7Department of Urology, Binhai County People’s Hospital, Yancheng, China
Purpose: Renal clear cell carcinoma (ccRCC) is the most lethal of all pathological subtypes of renal cell carcinoma (RCC). Genomic instability was recently reported to be related to the occurrence and development of kidney cancer. The biological roles of long non-coding RNAs (lncRNAs) in tumorigenesis have been increasingly valued, and various lncRNAs were found to be oncogenes or cancer suppressors. Herein, we identified a novel genomic instability-associated lncRNA (GILncs) model for ccRCC patients to predict the overall survival (OS).
Methods: The Cancer Genome Atlas (TCGA) database was utilized to obtain full transcriptome data, somatic mutation profiles, and clinical characteristics. The differentially expressed lncRNAs between the genome-unstable-like group (GU) and the genome-stable-like group (GS) were defined as GILncs, with |logFC| > 1 and an adjusted p-value< 0.05 for a false discovery rate. All samples were allocated into GU-like or GS-like types based on the expression of GILncs observed using hierarchical cluster analyses. A genomic instability-associated lncRNA signature (GILncSig) was constructed using parameters of the included lncRNAs. Quantitative real-time PCR analysis was used to detect the in vitro expression of the included lncRNAs. Validation of the risk model was performed by the log-rank test, time-dependent receiver operating characteristic (ROC) curves analysis, and multivariate Cox regression analysis.
Results: Forty-six lncRNAs were identified as GILncs. LINC00460, AL139351.1, and AC156455.1 were employed for GILncSig calculation based on the results of Cox analysis. GILncSig was confirmed as an independent predictor for OS of ccRCC patients. Additionally, it presented a higher efficiency and accuracy than other RCC prognostic models reported before.
Conclusion: GILncSig score was qualified as a critical indicator, independent of other clinical factors, for prognostic prediction of ccRCC patients.
Background
Renal cell carcinoma (RCC) accounts for approximately 85% of urinary cancers generated from the kidneys, and the associated morbidity is growing continuously over recent years (1). Nearly 100,000 patients die from RCC annually all over the world, and more than 170,000 deaths were observed globally in 2018 according to recent statistics (2, 3). Histologically, RCC can be classified into five subtypes with unique characteristics, including clear cell (ccRCC), papillary (pRCC), chromophobe (cRCC), collecting duct (cdRCC), and unclassified RCC. About 70~80% of patients are diagnosed with ccRCC after tumor biopsy or nephrectomy (4). Generally, ccRCC with metastases is associated with high mortality, and over 25% of patients first diagnosed with ccRCC were reported to be distant metastatic with a 5-year survival rate of 0~10% (5). Besides, the number of deaths from ccRCC accounts for the most among all subtypes of kidney cancers.
Genomic instability has been widely acknowledged as a trigger to carcinogenesis and requires therapeutic intervention. Usually, cancer genomic instability promotes the development of carcinomatous characteristics (6, 7). The more frequently the genetic alterations arise, the more likely the genomic instability occurs during the cell cycle (8). A high level of genomic instability with numerous somatic mutations could lead to malignant progression, distant metastasis, and poor prognosis in multiple cancers (9–11). It was currently reported that genomic instability of various critical genes in RCC cells can affect metabolic features of the tumor and disturb the process of cell division, thus resulting in malignant progression (8, 12).
Long non-coding RNAs (lncRNAs) are defined as transcripts comprising over 200 nucleotides that are unable to encode proteins. They function as regulators equipped with diverse biological functions in tumor-associated signaling pathways, including epigenetic regulation, transcriptional regulation, and post-transcriptional regulation (13, 14). In multiple cancers, such as breast cancer, prostate cancer, colorectal cancer as well as RCC, aberrant lncRNAs were often detected at different processes of tumorigenesis. For example, dysregulated lipid-associated lncRNAs could be regarded as predictors of poor prognosis of cancer (15). Moreover, lncRNA-URRCC, which is overexpressed in RCC samples, was found to be related to poor prognosis and acceleration of cell proliferation, and invasion in the ccRCC (16).
Nowadays, although aberrantly expressed lncRNAs and genomic instability are both considered to play core roles in the carcinogenesis of renal cells, it is still unclear whether there is a clinical association between them (9–14). Notably, evidence unraveling the critical roles of lncRNAs in genomic instability maintenance and the prognostic significance of the genomic instability-associated lncRNAs in cancer patients remained to be identified. Therefore, in this study, we attempted to establish a risk-score model for predicting the prognosis of ccRCC patients based on statistics from the TCGA database.
Methods
Data retrieval and sample classification
We obtained full transcriptome data, somatic mutation profiles, and clinical characteristics of 539 patients from the TCGA database (https://cancergenome.nih.gov/). To identify the possible relations between lncRNAs and genomic instability, lncRNA expression profiles and somatic mutation profiles were analyzed. After the somatic mutation frequency calculation of every sample, all the samples were ranked in descending order. The top 25% and the bottom 25% of these samples were allocated to the genome-unstable-like group (GU) and the genome-stable-like group (GS), respectively. The differences in lncRNA expression levels between the two groups were evaluated with a significance analysis of microarrays (SAM). Differentially expressed lncRNAs with |logFC| > 1 and adjusted p-value< 0.05 in false discovery rate (FDR) were defined as genomic instability-associated lncRNAs (GILncs). According to expression levels of specific lncRNAs, all samples were separately assigned to GU-like type and GS-like type using hierarchical cluster analyses.
Co-expression net and gene functional exploration
Co-expression regulatory net model was constructed to analyze the correlation between 46 lncRNAs and the corresponding susceptible mRNAs. The relevancy degrees were measured by Pearson correlation coefficients per cluster. Furthermore, enrichment analyses of most related mRNAs were conducted for function prediction, in terms of Gene Ontology (GO) terms and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway (17). Statistics were processed and visualized with the ‘clusterProfiler’ R package (18).
Quantitative real-time PCR analysis
RNA extraction kits (OMEGA, China) were used to extract RNA from the kidney tissue. The specific primers used were as follows: 5’ ACGCAGTGGATGAGAACGAA (LINC00460 forward) and 5’ GGGGTGACTTCAGAATGCGT (LINC00460 reverse); 5’CTTCACATTCTACACAGCCTCTCCT (AL139351.1 forward) and 5’GGTGTGGGTGAAGTAAAG AAAGC (AL139351.1 reverse); 5’ CTCACTGGAGCCG CCTAACTT (AC156455.1 forward) and 5’ CGTGTTGA GGACTACAGAAGAGGA (AC156455.1 reverse). mRNA expressions were normalized to GAPDH mRNA expression. Every experiment was repeated at least three times.
Prognosis-related statistical analysis
To identify prognosis-related GILncs independently from other features of ccRCC patients, univariate and multivariate Cox proportional hazard regression analyses were performed. The qualified prognosis-related lncRNAs were considered for prognostic model construction, of which p-values< 0.05 was considered statistically significant.
Combining the expression of prognosis-related GILncs and coefficients from multivariate Cox regression, a genomic instability-associated lncRNA signature (GILncSig) for predicting prognosis was derived, and the computational formula for the same is as follows:
Where, GILncSig (patient) represents a prognostic risk score for ccRCC patients, lncRNAi represents the ith prognosis-related lncRNA, expr (lncRNAi) is the lncRNAi expression level, and coef (lncRNAi) is the contribution of lncRNAi to the risk score derived from multivariate Cox regression coefficients.
Classification between the low-risk group with low GILncSig and the high-risk group with GILncSig relying on the risk cutoff was computed by the median score of the patients in the training set. The accuracy of the predictive model for each group was evaluated by the Kaplan–Meier (KM) log-rank test, time-dependent receiver operating characteristic (ROC) curve analysis, and multivariate Cox regression analysis. KM survival curves were analyzed to determine correlations among all parameters, including clinical characteristics and GILncSig. Hazard ratios (HR), 95% confidence interval (CI), and p-value were standards for identifying independent prognostic indicators. ROC curves were utilized to evaluate the predictive effectiveness of the genome unstable lncRNA-based risk scores for the prognosis of ccRCC patients. A two-sided p-value< 0.05 threshold was considered statistically significant. All statistical analyses were conducted with R version 4.0.3.
Results
Sample cluster dependent on GILncs
The research process is shown in Figure 1. Detailed clinical characteristics of ccRCC patients including age, gender, grade, clinical stage, and TNM are described in Table 1, and no difference was detected between the subgroups. First, 170 samples were divided into the GU-like group and GS-like group, based on the cumulative number of somatic mutations. The expressions of 46 genes were significantly different between the two groups (Figure 2A). Using hierarchical cluster analyses, all samples were clustered based on the expression levels of qualified lncRNAs. Fifty-three and 486 samples were classified into GS-like type and GU-like type, respectively (Figure 2B). Consistently, the frequency of somatic mutations was significantly higher in GU-like type than in GS-like type (Figure 2C). For reverse verification, the expression of the novel GILnc AC021744.1 was compared between the two groups. As was expected, the expression of AC021744.1 was significantly higher in the GU-like type compared with the GS-like type (Figure 2D).
Figure 2 Identification of GILncs and whole sample cluster. (A) The heatmap plot of lncRNAs based on mutation frequency. (B) Hierarchical clustering analysis of all 539 samples. GU-like type is colored in red and GS-like type is colored in blue. (C) The boxplot of somatic mutation count comparison between GU-like type and GS-like type. (D) The boxplots of AC021744.1 expression level in the GU-like type and GS-like type.
LncRNA-mRNA co-expression net and biological function prediction
Interactions between lnRNAs and mRNAs were visualized in a network consisting of nodes and lines (Figure 3A). Green nodes represented critical GILncs, and red nodes represented the 10 most related mRNAs regulated by lncRNAs. The lines connecting them showed the degree of relevancy. To further predict the potential biological roles of mRNAs in ccRCC, the functional enrichment analysis was employed for GO terms and KEGG pathways research. As is displayed in Figure 3B, the ‘monovalent inorganic cation homeostasis’ (GO:0055067, p-value = 1.58×10-6) and ‘ear development’ (GO:0043583, p-value = 6.98×10-5) were GO terms of biological process (BP) that most genes are involved in. In terms of cellular component (CC), the number of genes that enriched in the ‘apical part of cell’ (GO:0045177, p-value = 5.70×10-6) was the greatest, reaching 20. As far as molecular function (MF) was concerned, most genes were likely to participate in the ‘monovalent inorganic cation transmembrane transporter activity’ (GO:0016324, p-value = 0.0001). Further, KEGG enrichment analysis was performed to find the biological signal pathways in which GILncs might be involved. In accordance with the bubble diagram (Figure 3C), included genes were mostly inclined to concentrate on KEGG pathways of ‘Human papillomavirus infection’ (n = 11, p-value = 0.026) and ‘PI3K-Akt signaling pathway’ (n = 11, p-value = 0.039).
Figure 3 Regulatory network and functional analyses. (A) Co-expression network of GILnc and targeted mRNAs. (B) Bar plots of GO terms enrichment analysis. (C) The bubble plot of KEGG pathway enrichment analysis.
Establishment of GILncSig and prognostic risk model
Using multivariate Cox regression analysis, 3 lncRNAs namely LINC00460, AL139351.1, and AC156455.1, that were closely related to OS were selected for the model establishment. To validate the result, human renal cells were used to confirm the high expression of the included lncRNAs.
Next, all ccRCC samples were randomly allocated to the training set (n = 257) and the testing set (n = 256) for further validation of the prognostic risk model. The GILncSig was utilized as the index for risk group classification. To figure out the association between risk score and prognosis, survival analyses of both the training set and testing set were performed. KM curves of OS were drafted to compare the survival outcomes (Figures 4A, F) in the two sets, and both showed significantly better OS in the low-risk group than in the high-risk group (p-value< 0.001). Areas under the curve (AUC) values of ROC curves were determined to assess the reliability of our model (Figures 4B, G). AUC values of the training set (AUC = 0.691) and the testing set (AUC = 0.689) were simultaneously close to 0.7, indicating relatively high effectiveness for prognosis prediction. The correlation between prognostic risk score and AC021744.1 expression was illustrated in expplots (Figures 4C, H) and heatmaps (Figures 4D, I). The results suggested that as the risk score increases, the expression level of lncRNA AC021744.1 would enhance consistently in both the training set and the testing set. In addition, lncRNA LINC00460, AL139351.1, and AC156455.1 were found to be upregulated when the risk score was higher, which further clarified these three lncRNAs as high-risk genes. The mutplots (Figures 4E, J) showed that for both the training and testing set, higher somatic mutation counts were observed for higher risk scores.
Figure 4 Validation of the GILncSig model. (A) The survival analysis in training set; (B) The ROC curve in training set; (C) The expplot analysis in training set; (D) The heatmap in training set; (E) The mutplot analysis in training set; (F) The survival analysis in testing set; (G) The ROC curve in testing set; (H) The expplot analysis in testing set; (I) The heatmap in testing set; (J) The mutplot analysis in testing set.
Relevance between risk score and somatic mutation count/AC021744.1 expression
As demonstrated in the two sets of box plots in Figure 5, the association between prognostic risk score and mutation counts or AC021744.1 expression level was further assessed in pairs. A significantly positive trend could be illustrated between mutation frequency and risk of prognosis in the training set (p-value = 0.0067, Figure 5A), while no significant association was detected in the testing set (Figure 5C). No association between risk score and the AC021744.1 expression level was found in the training set, nor in the testing set (Figures 5B, D).
Figure 5 Risk correlation analysis. (A, C) Boxplots of correlation between risk levels and somatic mutation count in training group and testing group. (B, D) Boxplots of correlation between risk levels and AC021744.1 expression in training group and testing group.
Validation of GILncSig as an independent prognostic factor from clinical characteristics
To further investigate whether GILncSig could be identified as an independent predictor of OS, several potential prognostic indicators were comprehensively researched with the integration of univariate and multivariate Cox regression analysis. Factors of interest included age, gender, pathological grade, clinical stage, and GILncSig (Table 2). The Cox regression analysis revealed that the GILncSig Score could further be identified as an independent prognostic factor affecting OS from other potential factors in the testing set (HR = 1.110, 95% CI 0.987-1.248, p-value = 0.019) and TCGA set (HR = 1.015, 95% CI 1.007-1.023, p-value< 0.000). Similarly, pathological grade showed a significant association with survival outcomes in the testing set (HR = 1.633, 95% CI 1.146-2.327, p-value = 0.007) and TCGA set (HR = 1.428, 95% CI 1.131-1.802, p-value = 0.003), as well. Meanwhile, age and the clinical stage could significantly influence the survival of ccRCC patients in all sets.
Table 2 Univariate and Multivariate Cox regression analysis of the GILncSig and OS in different sets.
In addition, a series of KM log-rank analyses were performed to further validate GILncSig as an independent prognostic factor in different subgroup samples. In each pair of subgroups, survival time was positively correlated with a risk score, under the effects of age (Figures 6A, B), gender (Figures 6C, D), pathological grade (Figures 6E, F), and clinical stage (Figures 6G, H).
Figure 6 Predictive consistency analyses of GILncSig in populations with different clinical characteristics. (A, B) KM curves: Comparisons of OS between ccRCC patients with high and low risk, in old and young groups. (C, D) KM curves: Comparisons of OS between ccRCC patients with high and low risk, in male and female groups. (E, F) KM curves: Comparisons of OS between ccRCC patients with high and low risk, in pathologically early and advanced groups. (G, H) KM curves: Comparisons of OS between ccRCC patients with high and low risk, in clinically early and advanced groups.
To conclude the results above, the GILncSig score could be deemed as a prognostic predictor with consistent independence, which was negatively correlated with the OS of ccRCC patients.
Relationship between risk and oncogene mutation
PBRM1 has been recognized as a classic oncogene taking part in genomic instability and a high mutation frequency in PBRM1 was related to the occurrence of ccRCC (19). To figure out whether the new signature of risk estimation in ccRCC can predict prognosis, the single genetic mutation count was calculated in the training and testing sets. Although the proportion of mutation appeared to be larger in the high-risk group in both the two sets (Figures 7A, B), no statistical significance was probed (training set p-value = 0.235, testing set p-value = 0.276).
Figure 7 (A, B) Carcinogenic function analysis of GILncSig based on single geng mutation of PBRM1: Proportional bar plots of mutation type and wild type of PBRM1 in samples of training set and testing set. (C) mRNA expression of LINC00460, AL139351.1 and AC156455.1 in HK2, 786-O and Caki-2 cells according to qRT-PCR analysis. (D) Comparison among three lncRNA-based prognostic models based on AUCs: ROCs of GILncSig, SunLncSig and ZengLncSig. * means p<0.05, and ** means p<0.01.
Expression level of GILnc in different cell lines
As shown in Figure 7C, the expressions of LINC00460, AL139351.1, and AC156455.1 determined by the qPCR analysis were higher in RCC cells 786-O and Caki-2 as compared with the normal renal cells.
Comparison of lncRNA-Related prognostic prediction models
To evaluate the efficiency of our prognostic model of GILncSig, it was compared with the other two published lncRNA-related prognostic prediction models for ccRCC. AUCs of the three ROC curves corresponding to the three models represented relative predictive accuracy. As illustrated in Figure 7D, the AUC of ZengLncSig (20), a six-lncRNA-based risk model, was 0.500, while that for another five immune-related lncRNA signatures of Sun (21), was 0.679. Accordingly, our GILncSig of 3 lncRNAs with the AUC of 0.688 exhibited the most effective prediction of prognosis for patients with ccRCC.
Discussion
ccRCC is the most prevalent histological subtype of kidney cancer, contributing to a major part of yearly mortality related to cancers (4, 5). While lncRNAs have become increasingly recognized for their multiple biological roles in the tumorigenesis process in ccRCC, studies on their role in prognostic risk prediction are still insufficient (22–24). The occurrence of somatic mutation in cancer-related genes plays a critical role in the induction of RCC, and there is an increasing number of etiological studies on genomic instability. For instance, Wang W et al. reported in 2012 that the genomic instability existing in the DNA repair gene Ku70 contributed to causing RCC (25). Moreover, genomic DNA hypomethylation, which promoted the genomic instability of the global genome was proved as a hallmark of RCC risk by Mendoza-Pérez J et al., offering further hypotheses on the etiology of the RCC tumorigenesis (26). In 2019, Renzo G DiNatale et al. demonstrated that the mutation on TCEB1 could diminish the suppressive effects of the Von Hippel-Lindau (VHL) gene in ccRCC. Therefore, molecular events contributing to high genomic instability were proved to enforce aggressiveness and adverse clinical outcomes in ccRCC patients (27–29). In recent years, several studies investigated the association between survival outcomes and different clusters of lncRNAs in ccRCC samples and established risk models to validate the predictive ability of lncRNAs. In an earlier study by Sun Z et al., an immune-related signature, synthesized from 5 lncRNAs (AC008105.3, LINC02084, AC243960.1, AC093278.2, and AC108449.2) extracted from the TCGA database, could successfully predict the clinical outcomes (21). Zeng JH et al. proposed a practical six lncRNA-based prognostic risk model (CTA−384D8.35, CTD−2263F21.1, LINC01510, RP11−352G9.1, RP11−395B7.2, and RP11−426C22.4) based on the expression levels of involved non-coding genes in ccRCC samples (20). However, prognostic models with the theme of GILncs are rarely reported. Therefore, when constructing predictive models for prognostic risk of ccRCC patients, we were encouraged to derive an original index, GILncSig score, and carry out a comprehensive analysis to validate relations among GILncs and clinical outcomes (30–32).
We eventually screened out 46 GILncs equipped with differential mutation frequency in 539 samples from the TCGA database with outright open access. After seriatim referring to correlative literature, it was observed that very few studies discussed non-coding genes AC016405.3, AC114803.1, AC156455.1, AL139351.1, OSTM1-AS1, and AC015977.2 in GILncSig computation. Nevertheless, the highly mutant lncRNA AC021744.1 was reported to be an indicator of poor prognosis among patients with hepatocellular carcinoma (HCC). The overexpression of AC021744.1 could directly lead to shorter recurrence-free survival (RFS) time because of severe liver fibrosis (33). As the only known oncogene relevant to genomic instability in our model, AC021744.1 was selected for the current study. In accordance with the individual distribution of 46 genome-unstable lncRNAs, all 539 samples were clustered as GS-like type (n = 53) and GU-like type (n = 486). In addition, Bao S reported in 2019 (34) that AC021744.1 was a gene instability-related lncRNA significantly correlated with the gene instability-driving gene UBQLN4 and played an important role in the occurrence and development of breast cancer. Hence, AC021744.1 was also considered a representative genomic instability-associated lncRNA for further validation. It was observed that with the presence or absence of genomic instability, levels of somatic mutation count would appear low and high, respectively, thereby verifying the positive association among the included parameters.
The detailed construction of the co-expression network among genomic instability-related lncRNAs and regulated mRNAs included quantitative synthesis and analysis of the relevance between lncRNAs and mRNAs. Unfortunately, further co-regulation predictions from any other databases were not obtained to add to the predictions of co-expression among genes and the lncRNAs of interest. We assumed it was because the functions and signaling pathways of genomic instability-associated lncRNAs were rarely studied or reported in the literature. Due to these limitations, more original sequencing statistics are anticipated for further co-expression or ceRNA analyses.
In the meantime, GO and KEGG enrichment analyses of susceptible mRNAs were conducted for biological function forecast. Noticeably, mRNAs targeted downstream were highly enriched at the sites of the ‘PI3K-Akt signaling pathway’, in line with the result of KEGG analysis. PI3K/Akt signaling pathway has been mentioned in numerous studies on carcinomas and is reported to participate in the malignant progression and poor prognosis of various cancers, including RCC (35). Hence, the impacts of genomic instability-related lncRNAs on the prognosis of ccRCC patients could be viewed from another aspect. When the upregulated and downregulated gene was analyzed separately in GO analyses, it was found that among the 46 genomic instability-associated lncRNAs, most upregulated lncRNAs significantly enriched GO terms while the enrichments due to a few downregulated lncRNAs candidates were insignificant. Therefore, we synthesized the enrichment to summarize the enrichment of GILncs on GO terms, no matter whether it was significant or not (Figure 3B).
After the establishment of the GILncSig model, samples could be accordingly divided into a high-risk group and a low-risk group. Evidently, the somatic mutation frequency and expression of high-risk genes, LINC00460, AL139351.1, and AC156455.1, were enhanced when the prognostic risk was higher. Meanwhile, ccRCC manifested more malignant attributes leading to poorer OS concluded by the incremental expression level of the cancer-associated gene, AC021744.1. As no biological functions of AL139351.1 and AC156455.1 have been found, attention should be paid to LINC00460, a dysregulated lncRNA reported in RCC in 2018 (36). It has been validated that LINC00460 functions as a competing endogenous RNA (ceRNA) in co-expression and promotes the malignant development of multiple cancers, including prostate cancer (37), skin cancer (38), hepatocellular cancer (39), colorectal cancer (40) and so on, except for RCC. Coincidentally, Zhang D et al. researched LINC00460 as well and when identifying a three-lncRNA prognostic signature (41), they speculated LINC00460’s competing endogenous feature for its overexpression in ccRCC but did not mention its potential genomic instability. Controversially, the mutation frequency and expression were not found to be consistently differential when quantitative analyses were performed (Figure 5), probably due to the uncertain carcinogenic mechanism in ccRCC. In addition, our GILncSig Score model possessed great independence as a prognostic predictor from other significant clinical factors, such as age, gender, pathological grade, and clinical stage, showing its compatibility for all kinds of ccRCC patients with different clinical characteristics. Besides, when comparing our model with former lncRNA-related signatures, it is worth noting that our GILncSig model showed better prognosis efficiency with a higher AUC of 0.688 (Figure 7C).
It is widely acknowledged that RCC is insensitive to radiotherapy and chemotherapy, and hence, remedies targeted to specific genes or immune checkpoint inhibitors (ICIs) are now being explored (42, 43). Since Braun DA et al. (43) clinically validated the alternations of PBRM1 as a biomarker of ICI response in RCC in 2019, and Carril-Ajuria L et al. (19) demonstrated the prognostic and predictive value of PBRM1 in ccRCC in the same year, more and more researchers are exploring relevant genes, which could be hallmarks of immune targeted therapy of PBRM1-mutant ccRCC. For example, in the recent 2021 conference of European Urology, Hagiwara M et al. (44) proposed that poly ADP-ribose polymerase-1 (PARP1) could be a marker of the efficacy of immunotherapy for patients with PBRM1-mutant ccRCC, the higher expression of which suggested poorer prognosis and higher drug resistance. In our study, 46 differentially expressed GILncs between genome-unstable and genome-stable-like groups were found, among which LINC00460, AL139351.1, and AC156455.1 were validated as significant independent prognostic factors and considered for the construction of the risk model. Furthermore, a partially positive correlation was observed between the risk score and the mutation of PBRM1 in the training and the testing set. The mutation of PBRM1 was more likely to appear in the high-risk group for both the training and testing sets, while no statistical significance was discovered. Though the difference was insignificant, this still led us to the potential of the risk model to predict response to ICI treatment for patients with PBRM1-mutated ccRCC. We assumed that it owed to the limits of sample size, and it remains to be confirmed in the future under the condition of a larger sample size or experiments in vivo/vitro.
However, some limitations in the current study could not be neglected. Owing to the lack of other independent cohorts to perform validation, the same datasets were applied to set the training group and test group for internal validation. Admittedly, by changing the distribution of samples, the distribution density of repeated samples would increase, and thus we anticipate more data from independent cohorts for further validation.
In addition, due to the lack of an extra independent dataset for external validation, the qPCR analysis comparing the mRNA expression in ccRCC cell lines, and the normal renal tubular epithelial cells was carried out, showing that GILncs of the risk model were expressed significantly higher in the tumor cell lines. As the three prognostic GILncs were overexpressed in ccRCC, we did perform clone formation and Transwell experiments to explore the effects of these lncRNAs on proliferation, migration and invasion of RCC cells at the beginning. However, no significant differences were observed in cells’ proliferation, migration and invasion compared to controls, after knocking down the three prognosis-associated GILncs in either Caki-2 or 786-o cell lines, respectively. Therefore, we hypothesized that these GILncs did not affect the prognosis of patients by directly promoting the aggressiveness of RCC cells and further experimental exploration are expected.
Finally, since the data were extracted retrospectively from databases, the outcomes may be not accurate enough, and prospective validations based on experimental results would make the results more persuasive. For example, lncRNA AC021744.1 was identified as an oncogene related to the poor prognosis of HCC patients (33), but its carcinogenesis in RCC still lacks further proof from experimental studies. Consequently, the arguments using AC021744.1 could not completely support the correlation between the risk score and OS. Further, functions of AL139351.1 and AC156455.1 included in GILncSig remain to be discovered.
To sum up, the GILncSig Score, an original index calculated with the coefficients and expression levels of GILncs, is qualified to be a critical indicator, independent of other clinical factors, for predicting the prognostic risk of ccRCC patients.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found here: TCGA Database ID: TCGA-KIRC.
Author contributions
JL, HL, and MC designed the study; DJ, TW, NS, and HJ conducted the study and maintained the data; YW analyzed the data and made the figures; MW helped draft the paper; JW and YS helped correct a major of the grammatical mistakes and made a great contribution to this revision. All authors contributed to the article and approved the submitted version.
Funding
Medical Research Foundation of Jiangsu Province (Z2019024).
Acknowledgments
We thank Bullet Edits Limited for linguistic editing and proofreading of the manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Rebecca LSM, Kimberly DMM, Ahmedin JD. PhD: Cancer statistics, 2020. CA: A Cancer J Clin (2020) 70(1):7–30. doi: 10.3322/caac.21590
2. Barata PC, Rini BI. Treatment of renal cell carcinoma: Current status and future directions. CA Cancer J Clin (2017) 67(6):507–24. doi: 10.3322/caac.21411
3. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin (2018) 68(6):394–424. doi: 10.3322/caac.21492
4. Moch H, Cubilla AL, Humphrey PA, Reuter VE, Ulbright TM. The 2016 WHO classification of tumours of the urinary system and Male genital organs-part a: Renal, penile, and testicular tumours. Eur UROL (2016) 70(1):93–105. doi: 10.1016/j.eururo.2016.02.029
5. Motzer RJ, Bacik J, Mazumdar M. Prognostic factors for survival of patients with stage IV renal cell carcinoma: Memorial sloan-kettering cancer center experience. Clin Cancer Res (2004) 10(18 Pt 2):6302S–3S. doi: 10.1158/1078-0432.CCR-040031
6. Hanahan D, Weinberg RA. The hallmarks of cancer. CELL (2000) 100(1):57– 70. doi: 10.1016/S0092-8674(00)81683-9
7. Hanahan D, Weinberg RA. Hallmarks of cancer: The next generation. CELL (2011) 144(5):646–74. doi: 10.1016/j.cell.2011.02.013
8. Shen Z. Genomic instability and cancer: An introduction. J Mol Cell Biol (2011) 3(1):1–3. doi: 10.1093/jmcb/mjq057
9. Negrini S, Gorgoulis VG, Halazonetis TD. Genomic instability–an evolving hallmark of cancer. Nat Rev Mol Cell Biol (2010) 11(3):220–8. doi: 10.1038/nrm2858
10. Seton-Rogers S. Genomic instability: The sting of metastasis. Nat Rev Cancer (2018) 18(3):137. doi: 10.1038/nrc.2018.16
11. Sonugür FG, Akbulut H. The role of tumor microenvironment in genomic instability of malignant tumors. Front Genet (2019) 10:1063. doi: 10.3389/fgene.2019.01063
12. Linehan WM, Schmidt LS, Crooks DR, Wei D, Srinivasan R, Lang M, et al. The metabolic basis of kidney cancer. Cancer Discovery (2019) 9(8):1006–21. doi: 10.1158/2159-8290.CD-18-1354
13. Veneziano D, Di Bella S, Nigita G, Laganà A, Ferro A, Croce CM. Noncoding RNA: Current deep sequencing data analysis approaches and challenges. Hum Mutat (2016) 37(12):1283–98. doi: 10.1002/humu.23066
14. Theis M, Paszkowski-Rogacz M, Weisswange I, Chakraborty D, Buchholz F. Targeting human long noncoding transcripts by endoribonuclease-prepared siRNAs. J BIOMOL SCREEN (2015) 20(8):1018–26. doi: 10.1177/1087057115583448
15. Ma Y, Zhang J, Wen L, Lin A. Membrane-lipid associated lncRNA: A new regulator in cancer signaling. Cancer Lett (2018) 419:27–9. doi: 10.1016/j.canlet.2018.01.008
16. Zhai W, Sun Y, Guo C, Hu G, Wang M, Zheng J, et al. LncRNA-SARCC suppresses renal cell carcinoma (RCC) progression via altering the androgen receptor(AR)/miRNA-143-3p signals. Cell Death DIFFER (2017) 24(9):1502–17. doi: 10.1038/cdd.2017.74
17. Powers RK, Goodspeed A, Pielke-Lombardo H, Tan AC, Costello JC. GSEA-InContext: Identifying novel and common patterns in expression experiments. BIOINFORMATICS (2018) 34(13):i555–64. doi: 10.1093/bioinformatics/bty271
18. Yu G, Wang LG, Han Y, He QY. ClusterProfiler: An r package for comparing biological themes among gene clusters. OMICS (2012) 16(5):284–7. doi: 10.1089/omi.2011.0118
19. Carril-Ajuria L, Santos M, Roldán-Romero JM, Rodriguez-Antona C, de Velasco G. Prognostic and predictive value of PBRM1 in clear cell renal cell carcinoma. Cancers (Basel) (2019) 12(1):16. doi: 10.3390/cancers12010016
20. Zeng JH, Lu W, Liang L, Chen G, Lan HH, Liang XY, et al. Prognosis of clear cell renal cell carcinoma (ccRCC) based on a six-lncRNA-based risk score: An investigation based on RNA-sequencing data. J Transl Med (2019) 17(1):281. doi: 10.1186/s12967-019-2032-y
21. Sun Z, Jing C, Xiao C, Li T. Long non-coding RNA profile study identifies an immune-related lncRNA prognostic signature for kidney renal clear cell carcinoma. Front Oncol (2020) 10:1430. doi: 10.3389/fonc.2020.01430
22. Hombach S, Kretz M. Non-coding RNAs: Classification, biology and functioning. Adv Exp Med Biol (2016) 937:3–17. doi: 10.1007/978-3-319-42059-2_1
23. Wei Z, Batagov AO, Carter DR, Krichevsky AM. Fetal bovine serum RNA interferes with the cell culture derived extracellular RNA. Sci Rep (2016) 6:31175. doi: 10.1038/srep31175
24. Kuthi L, Jenei A, Hajdu A, Németh I, Varga Z, Bajory Z, et al. Prognostic factors for renal cell carcinoma subtypes diagnosed according to the 2016 WHO renal tumor classification: a study involving 928 patients. Pathol Oncol Res (2017) 23(3):689–98. doi: 10.1007/s12253-016-0179-x
25. Wang W, Gao Y, Yan F, Wang M, Hu F, Wang D, et al. Association of Ku70 a-31G polymorphism and risk of renal cell carcinoma in a Chinese population. DNA Cell Biol (2012) 31(7):1314–20. doi: 10.1089/dna.2011.1540
26. Mendoza-Pérez J, Gu J, Herrera LA, Tannir NM, Matin SF, Karam JA, et al. Genomic DNA hypomethylation and risk of renal cell carcinoma: A case-control study. Clin Cancer Res (2016) 22(8):2074–82. doi: 10.1158/1078-0432.CCR-15-0977
27. DiNatale RG, Gorelick AN, Makarov V, Blum KA, Silagy AW, Freeman B, et al. Putative drivers of aggressiveness in TCEB1-mutant renal cell carcinoma: An emerging entity with variable clinical course. Eur Urol Focus (2021) 7(2):381–389. doi: 10.1016/j.euf.2019.11.013.
28. The Cancer Genome Atlas Research Network. Comprehensive molecular characterization of clear cell renal cell carcinoma. NATURE (2013) 499(7456):43–9. doi: 10.1038/nature12222
29. Stebbins CE, Kaelin WJ, Pavletich NP. Structure of the VHL-ElonginC-ElonginB complex: Implications for VHL tumor suppressor function. SCIENCE (1999) 284(5413):455–61. doi: 10.1126/science.284.5413.455
30. Mettu RK, Wan YW, Habermann JK, Ried T, Guo NL. A 12-gene genomic instability signature predicts clinical outcomes in multiple cancer types. Int J Biol Markers (2010) 25(4):219–28. doi: 10.5301/JBM.2010.6079
31. van de Vijver MJ, He YD, Van'T VL, Dai H, Hart AA, Voskuil DW, et al. A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med (2002) 347(25):1999–2009. doi: 10.1056/NEJMoa021967
32. Habermann JK, Doering J, Hautaniemi S, Roblick UJ, Bündgen NK, Nicorici D, et al. The gene expression signature of genomic instability in breast cancer is an independent predictor of clinical outcome. Int J Cancer (2009) 124(7):1552–64. doi: 10.1002/ijc.24017
33. Ye J, Wu S, Pan S, Huang J, Ge L. Risk scoring based on expression of long non−coding RNAs can effectively predict survival in hepatocellular carcinoma patients with or without fibrosis. Oncol Rep (2020) 43(5):1451–66. doi: 10.3892/or.2020.7528
34. Bao S, Zhao H, Yuan J, Fan D, Zhang Z, Su J, et al. Computational identification of mutator-derived lncRNA signatures of genome instability for improving the clinical outcome of cancers: a case study in breast cancer. Brief Bioinform (2019) 21(5):1742–55. doi: 10.1093/bib/bbz118
35. Guo H, German P, Bai S, Barnes S, Guo W, Qi X, et al. The PI3K/AKT pathway and renal cell carcinoma. J Genet Genomics (2015) 42(7):343–53. doi: 10.1016/j.jgg.2015.03.003
36. Wang J, Zhang C, He W, Gou X. Construction and comprehensive analysis of dysregulated long non-coding RNA-associated competing endogenous RNA network in clear cell renal cell carcinoma. J Cell Biochem (2019) 120(2):2576–93. doi: 10.1002/jcb.27557
37. Dong Y, Quan HY. Downregulated LINC00460 inhibits cell proliferation and promotes cell apoptosis in prostate cancer. Eur Rev Med Pharmacol Sci (2019) 23(14):6070–8. doi: 10.26355/eurrev_201907_18420
38. Jiang Y, Cao W, Wu K, Qin X, Wang X, Li Y, et al. LncRNA LINC00460 promotes EMT in head and neck squamous cell carcinoma by facilitating peroxiredoxin-1 into the nucleus. J Exp Clin Cancer Res (2019) 38(1):365. doi: 10.1186/s13046-019-1364-z
39. Tu J, Zhao Z, Xu M, Chen M, Weng Q, Ji J. LINC00460 promotes hepatocellular carcinoma development through sponging miR-485-5p to up-regulate PAK1. BioMed Pharmacother (2019) 118:109213. doi: 10.1016/j.biopha.2019.109213
40. Zhang H, Lu Y, Wu J, Feng J. LINC00460 hypomethylation promotes metastasis in colorectal carcinoma. Front Genet (2019) 10:880. doi: 10.3389/fgene.2019.00880
41. Zhang D, Zeng S, Hu X. Identification of a three-long noncoding RNA prognostic model involved competitive endogenous RNA in kidney renal clear cell carcinoma. Cancer Cell Int (2020) 20:319. doi: 10.1186/s12935-020-01423-4
42. Mao W, Wang K, Xu B, Zhang H, Sun S, Hu Q, et al. ciRS-7 is a prognostic biomarker and potential gene therapy target for renal cell carcinoma. Mol Cancer (2021) 20(1):142. doi: 10.1186/s12943-021-01443-2
43. Braun DA, Ishii Y, Walsh AM, Van Allen EM, Wu CJ, Shukla SA, et al. Clinical validation of PBRM1 alterations as a marker of immune checkpoint inhibitor response in renal cell carcinoma. JAMA Oncol (2019) 5(11):1631–3. doi: 10.1001/jamaoncol.2019.3158
Keywords: cell renal cell carcinoma, TCGA, long non-coding RNA, prognosis, bioinformatics
Citation: Jiang D, Wu T, Shi N, Shan Y, Wang J, Jiang H, Wu Y, Wang M, Li J, Liu H and Chen M (2022) Development of genomic instability-associated long non-coding RNA signature: A prognostic risk model of clear cell renal cell carcinoma. Front. Oncol. 12:1019011. doi: 10.3389/fonc.2022.1019011
Received: 14 August 2022; Accepted: 23 September 2022;
Published: 19 October 2022.
Edited by:
Hai Hu, Sun Yat-sen Memorial Hospital, ChinaReviewed by:
Xiongbing Zu, Xiangya Hospital, Central South University, ChinaNing Zhang, Shanghai Jiao Tong University, China
Copyright © 2022 Jiang, Wu, Shi, Shan, Wang, Jiang, Wu, Wang, Li, Liu and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jian Li, MjIwMjA0MDU2QHNldS5lZHUuY24=; Hui Liu, MzA3NTgxODQwMUBxcS5jb20=; Ming Chen, bWluZ2NoZW5zZXVAMTI2LmNvbQ==
†These authors have contributed equally to this work