Skip to main content

ORIGINAL RESEARCH article

Front. Med., 24 May 2022
Sec. Precision Medicine
This article is part of the Research Topic Precision Medical Approach-Driven Multi-Dimensional Diagnosis and Treatment Strategies View all 7 articles

Identification of an Epithelial-Mesenchymal Transition-Related Long Non-coding RNA Prognostic Signature to Determine the Prognosis and Drug Treatment of Hepatocellular Carcinoma Patients

\r\nShenglan Huang,Shenglan Huang1,2Dan Li,Dan Li1,2Lingling Zhuang,Lingling Zhuang2,3Jian Zhang,Jian Zhang1,2Jianbing Wu,*Jianbing Wu1,2*
  • 1Department of Oncology, The Second Affiliated Hospital of Nanchang University, Nanchang, China
  • 2Jiangxi Key Laboratory of Clinical and Translational Cancer Research, Nanchang, China
  • 3Department of Gynaecology, The Second Affiliated Hospital of Nanchang University, Nanchang, China

Introduction: Hepatocellular carcinoma (HCC) is one of the most common malignant tumors with poor prognosis. Epithelial–mesenchymal transition (EMT) is crucial for cancer progression and metastasis. Thus, we aimed to construct an EMT-related lncRNA signature for predicting the prognosis of HCC patients.

Methods: Cox regression analysis and LASSO regression method were used to build an EMT-related lncRNAs risk signature based on TCGA database. Kaplan-Meier survival analysis was conducted to compare the overall survival (OS) in different risk groups. ROC curves and Cox proportional-hazards analysis were performed to evaluate the performance of the risk signature. RT-qPCR was conducted in HCC cell lines and tissue samples to detect the expression of some lncRNAs in this risk model. Furthermore, a nomogram involving the risk score and clinicopathological features was built and validated with calibration curves and ROC curves. In addition, we explored the association between risk signature and tumor immunity, somatic mutations status, and drugs sensitivity.

Results: Twelve EMT-related lncRNAs were obtained to construct the prognostic risk signature for patients with HCC. The Kaplan-Meier curve analysis revealed that patients in the high-risk group had worse overall survival (OS) than those in low-risk group. ROC curves and Cox regression analysis suggested the risk signature could predict HCC survival exactly and independently. The prognostic value of the risk model was confirmed in the testing and entire groups. We also found AC099850.3 and AC092171.2 were highly expressed in HCC cells and HCC tissues. The nomogram could accurately predict survival probability of HCC patients. Gene set enrichment analysis (GSEA) and gene ontology (GO) analysis showed that cancer-related pathways and cell division activity were enriched in high-risk group. The SNPs showed that the prevalence of TP53 mutations was significantly different between high- and low-risk groups; the TP53 mutations and the high TMB were both associated with a worse prognosis in patients with HCC. We also observed widely associations between risk signature and drugs sensitivity in HCC.

Conclusion: A novel EMT-related lncRNAs risk signature, including 12 lncRNAs, was established and identified in patients with HCC, which can accurately predict the prognosis of HCC patients and may be used to guide individualized treatment in the clinical practice.

Introduction

Hepatocellular carcinoma (HCC) is one of the most common aggressive tumors that ranks as the fourth leading cause of cancer-related mortality worldwide (1). Despite the proven efficacies of several treatment methods, such as surgical resection, liver transplantation, ablation therapy, radiotherapy, targeted therapy, systemic chemotherapy, and immunotherapy, the prognosis remains unsatisfactory. According to estimates from the World Health Organization, more than 1 million patients will die from HCC by 2030, with the 5-year survival rate being no more than 18% (2, 3). Such poor survival rates are mainly due to lack of reliable predictive markers. Therefore, there is an urgent need to identify highly sensitive and specific prognostic biomarkers for predicting the clinical outcomes of HCC patients.

Epithelial-mesenchymal transition (EMT) is a process that leads to cell morphology and function changes, which is characterized by the dissolution of cell-cell junctions, synthesis of extracellular matrix, increasing cell motility that consequently result in the cells gaining invasive and metastatic properties (4, 5). Numerous studies have shown that EMT plays a vital role in the development and progression of HCC (6, 7). For example, transforming growth factor-β (TGF-β), one of the most potent inducers of EMT, is correlated with carcinogenesis, invasion and metastasis in HCC (8). The EMT-transformed HCC cells stimulated with TGF-β were resistant to sorafenib-induced apoptosis (9). Moreover, mounting studies have shown that EMT contributes to immunosuppression, and EMT process is associated with chemotherapy and immunotherapy resistance (10). Therefore, EMT-related genes may be used as novel and ideal prognostic predictors and therapeutic targets for the patients with HCC.

Long non-coding RNAs (lncRNAs), which account for 80–90% of all ncRNAs, structurally contain more than 200 nucleotides. It has been wildly applied in early diagnosis and prognosis biomarkers of various tumors (11, 12). Accumulating evidences suggested that lncRNAs are involved in regulating EMT processes in a variety of tumors, including HCC (13); such as lncRNA-POIR (14), lncRNA-AB209371 (15), lncRNA CRNDE (16), lncRNA SNHG3 (17), and lncRNA LOC554202 (18), have been reported to promote the epithelial-mesenchymal transition, accelerate the tumor metastasis, and suppress the drug sensitivity of HCC cells. Conversely, lncRNA CASC2 and lncRNA miR503HG inhibit EMT and exert anti-metastatic effects in HCC cells (19, 20). In addition, many EMT-related lncRNAs signatures exhibit predictive potency for the survival of patients, which have been reported in bladder and kidney cancer, Colorectal Cancer, Melanoma, glioblastoma (2125). However, the potential value of EMT-related lncRNAs as prognostic indicators is poorly explored in the HCC.

Therefore, the objective of this study was to construct and validate an EMT-related lncRNAs signature to determine the prognosis and drugs response of patients with HCC, we first constructed and identified an EMT-related lncRNAs prognostic model based on the data extracted from The Cancer Genome Atlas (TCGA) and then explored the association of the risk model with biological functions, immune cells infiltration, somatic mutations, and sensitivity of clinical drugs, in the patients with HCC. The flowchart of this study was shown in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1. Flowchart of this study. ETM, epithelial-mesenchymal transition; LncRNA, long non-coding RNA; LASSO, the least absolute shrinkage and selection operator; GSEA, gene set enrichment analysis; GO, Gene Ontology; TCGA, The Cancer Genome Atlas; ROC, receiver operating characteristic; SNPs, single nucleotide polymorphisms; TMB, tumor mutation burden.

Materials and Methods

Data Acquisition and Processing

The RNA sequencing (RNA-Seq) data in FPKM (fragments per kilobase of transcript per million mapped reads) format of HCC project (including 374 tumor and 50 normal samples), single nucleotide polymorphism (SNP) data (n = 362), and the corresponding clinical data (including survival status, survival time, age, gender, histological grade, and TNM stage) were downloaded from the TCGA database.1 The genes were annotated and sorted as 19,659 protein-coding genes and 14,142 lncRNAs according to the ENSEMBL database.2 The exclusion criteria of this study were set as follows: (1) Samples without intact genomics for analysis; (2) The genes with an average value <1; (3) The patients whose clinical follow-up time less than 30 days. The remaining data of 343 HCC samples with 710 lncRNAs and 11,282 protein-coding genes were included in our study. The patients were stochastically assigned into the training (n = 240) and testing group (n = 103) at the rate of 7:3 using “caret” package of R v.4.0.4 software3 (26, 27). Both training and testing sets conformed to the following requirements: (1) samples were randomly assigned to training and testing group; (2) patients in two groups had similar clinicopathological features. A prognostic EMT-related lncRNAs risk model was established based on the training cohort, the testing and entire cohorts were used to verify the predictive performance of this model.

Tissue Sample Collection and Ethics Approval

We collected 30 paired HCC tissues and non-cancerous normal tissues following informed consent from patients who underwent tumor surgical resection at the Second Affiliated Hospital of Nanchang University from October 2020 and November 2021. This study met the standards set by terms of the Declaration of Helsinki and was approved by the Second Affiliated Hospital of Nanchang University Medical Research Ethics Committee. Tissue samples were maintained at –80°C until RNA extraction.

Screening for the Epithelial–Mesenchymal Transition-Related LncRNAs in Hepatocellular Carcinoma

A total of 200 EMT-related genes were downloaded from MSigDB v.7.4 (hallmark_ epithelial_ mesenchymal_ transition M5930).4 Then, 159 EMT-related genes with average value >1 were extracted from LIHC project of TCGA database by utilizing the “limma” R package. Pearson correlation coefficient analysis was performed to identify the EMT-related lncRNAs according to the criteria: | correlation coefficient | > 0.3 and p < 0.001. A total of 449 EMT-related lncRNAs were screened for further analysis.

Construction and Verification of an Epithelial–Mesenchymal Transition-Related LncRNAs Prognostic Signature for Hepatocellular Carcinoma Patients

To estimate the prognostic value of EMT-related lncRNAs, we applied multiple analyses to build the prognostic risk signature in the training group. First, we performed univariate Cox regression and Kaplan-Meier survival analysis to identify the survival associated lncRNAs, the significant lncRNAs were defined as p < 0.05. Then, we incorporated these lncRNAs into the least absolute shrinkage and selection operator (LASSO) and multivariate Cox regression to construct a prognostic signature for HCC patients. Eventually, 12 lncRNAs were enrolled into the risk model. According to the multivariate Cox regression coefficient and expression value of each lncRNA, the risk score of each patient was calculated using following equation:

Riskscore=i=1n(coefi×βi)

where coef represents the regression coefficient and βis the expressive value of each EMT related lncRNA. The HCC patients in the training cohort were divided into high- or low-risk groups based on the median risk score. Similarly, the patients in testing and entire cohorts were allocated to either high-risk or low-risk groups. Thereafter, we conducted the Kaplan-Meier survival analysis to compare the overall survival (OS) between the high- and low-risk groups. Time-dependent receiver operating characteristic (time-ROC) curves were sketched to evaluate the predictive accuracy of the risk signature for 1-, 3-, and 5-year survival via “survivalROC” R package. Meanwhile, the stratification analyses were performed in entire cohort to assess the prognostic value of the risk signature in different populations, which based on the following clinicopathological features: age (≤65 and >65 years), gender (Male and Female), grade (Grade 1–2 and Grade 3–4), and clinical stage (Stage I–II and Stage III–IV); p < 0.05 was considered to be statistically significant.

Thereafter, Cox proportional-hazards analyses were performed to determine the independent factors for the prognosis of HCC patients, including the following clinicopathological factors: age, gender, grade, clinical stage, and risk signature; p < 0.05 was considered to be statistically significant. A multi-indicator receiver operating characteristic (multi-ROC) curves were drawn and the area under the curve (AUC) were calculated to compare the predictive ability between the risk signature and other clinicopathological features.

Additionally, the relevancy between the 12 prognostic EMT-related lncRNAs and EMT-related protein-coding genes was analyzed by using above Pearson correlation coefficient analysis and presented with co-expression network. The Cytoscape 3.8.2 software5 and “ggalluvial” R package were employed to visualize the co-expression correlation.

Construction and Validation of the Predictive Nomogram

Based on the risk score and conventional clinical characteristics (including age, gender, histological grade, TNM stage), we established a nomogram in training, testing, and entire cohort to quantitatively assess the 1- and 3-year survival probability of HCC patients using the “survival” and “rms” R packages. Moreover, we assessed the predictive performance of the nomogram with calibration curves and ROC curves using “rms” and “foreign” packages of R.

Enrichment Analysis

The transcriptome data of the entire group were selected for gene set enrichment analysis (GSEA) using the GSEA v.4.1.0 software.6 Based on the risk signature, 343 samples were divided into high- and low-risk groups, the enriched pathways were identified by GSEA analysis and Gene Ontology (GO) analysis in the two risk groups. In GSEA analysis, the c2.cp.kegg.v7.4.symbols.gmt downloaded from MSigDB7 was used as the reference and a false discovery rate (FDR) of q-value <0.05 was used to identify the significantly enriched pathways. The significant cancer-related pathways were presented in the result. Thereafter, differential expressed genes (DEGs) were extracted from TCGA database depending on the risk grouping and the statistical significance was defined as FDR < 0.05 and | log2FC| ≥ 1. According to the DEGs, GO analysis was conducted to evaluate the related biological pathways in the two risk subgroups. Corrected p-values <0.05 were considered significant, and the top 10 most significant biological pathways were displayed with bar charts and bubble plots.

Immune Cells Infiltration and Immune-Related Pathways

According to the EMT-related lncRNAs prognostic model, single-sample gene set enrichment analysis (ssGSEA) was conducted to explore the correlation between the risk signature and immune cells infiltrating and the immune-related pathways. First, 16 immune infiltration cells and 13 immune-related pathways scores were calculated in each LIHC sample based on ssGSEA method using “GSVA,” “limma,” and “GSEABase” R package. Then, we compared the difference of immune cells infiltration and immune-related pathways in different risk groups. The result was visualized with boxplot using the “ggpubr” R package, and p < 0.05 was considered as statistical significance.

Exploring the Correlation of the Risk Signature With Single Nucleotide Polymorphisms

The format (MAF) files of HCC SNPs mutation dataset, possessed by the method of “varscan,” were obtained from TCGA database. The genes mutation status, variant classification, and the tumor mutation burden (TMB) of each patient were acquired using the Perl 5.30.0 software.8 Then, we integrated the SNPs data and risk grouping, and explored the correlation between SNPs and risk signature. The top 20 driver genes with highest mutational frequencies and its mutation types in the high-risk and low-risk groups were displayed with waterfall plots using “GenVisR” R package. Next, chi-square test was performed to compared the gene mutational frequency between two risk groups. Kaplan–Meier survival analysis was applied to evaluate the prognostic role of TP53 mutation status, TMB, as well as TMB combined with risk signature in patients with HCC via “survival” package of R. p < 0.05 indicated a statistical significance.

The Role of the Risk Signature in Predicting Treatment Response to Chemotherapy, Targeted Therapy, and Immunotherapy

To investigate the role of EMT-related lncRNAs risk signature in predicting the sensitivity of chemotherapy and targeted therapy of HCC patients, we applied the pRRophetic algorithm to calculate the drugs half-maximal inhibitory concentration (IC50) based on the Cancer Cell Line Encyclopedia (CCLE) by using “pRRophetic” R package (28). According to the National Comprehensive Cancer Network (NCCN) guidelines, antitumor drugs such as doxorubicin, camptothecin, cisplatin, mitomycin C, gemcitabine, and sorafenib were selected for this analysis. The IC50 of these drugs were compared between the high- and low-risk groups with Wilcoxon rank-sum test, and p-value <0.05 was considered as significant difference.

It was reported that the immune checkpoint related genes can effectively reflect the response rate of immune checkpoint inhibitors (ICIs), and immunophenoscore (IPS) associated with the responds to ICIs (29, 30). Thus, we further investigated the association of risk signature with immune-related genes and IPS. We first carried out differential expression analysis of 46 immune checkpoint related genes in high- and low-risk groups and focused on the correlation of risk signature with six key genes: CTLA4, PD-1, IDO1,TIM-3 PD-L1, and PD-L2 (29). Then, IPS [four subtypes: IPS-CTLA4(+)/PD-1(+), IPS-CTLA4(+)/PD-1(–),IPS-CTLA4(–)/PD-1(+), IPS-CTLA4(–)/PD-1(–)] were compared in high- and low-risk groups with Wilcoxon rank-sum test, which were obtained from the Cancer Immunome Atlas database.9 p-value <0.05 was deeded statistically significant.

Cell Culture and Real-Time Reverse Transcription-Quantitative Polymerase Chain Reaction

Human HCC cell lines (HCC-LM3, SMMC-7721) were purchased from the Cell Bank of Typical Culture Preservation Committee of Chinese Academy of Sciences (Shanghai, China). The other three HCC cell lines (SMMC97-H, HepG2, Huh7) and one normal liver cell line (L-02) were purchased from Procell Life Science and Technology Co., Ltd. (Wuhan, China). Cells were cultured in high glucose Dulbecco’s modified Eagle medium (DMEM) with 10% FBS (Gibco, Grand Island, NY, United States), 100 μg/mL streptomycin and 100 U/mL penicillin sodium (Biotechnology, Beijing, China). The cells were incubated at 37°C with 5% CO2 in a humidified incubator.

Total RNA was isolated from HCC cells and tissue samples using Trizol Reagent (Invitrogen, Carlsbad, CA, United States) according to the product manual. Subsequently, the RNA was reversely transcribed to complementary DNA (cDNA) using the PrimeScript™ RT reagent kit with gDNA Eraser (RR047A, TaKaRa, China) according to the product instructions. The qPCR was conducted with TB Green® Premix Ex Taq™ II (RR820A, TaKaRa, China) on a CFX96 Real-Time PCR Detection System. The reaction conditions were 94.0°C for 30 s, followed by 94.0°C for 4 s, 58.0°C for 15 s, and 72°C for 15 s, for a total of 40 cycles. Relative mRNA expression of HCC cells was calculated by the 2–ΔΔCt method, while 2–ΔCT was used to reckon the mRNA expression in HCC tissues, and the data was normalized by the endogenous control of Glyceraldehyde 3-phosphate dehydrogenase (GAPDH). qPCR assays were performed in triplicates. The lncRNA Primers were listed in Supplementary Table 1.

Statistical Analysis

Statistical analyses were performed using the R software v.4.0.4,10 Perl 5.30.0 software (see text footnote 8) and GraphPad Prism 8.0 software.11 Chi-square test was used to analyze the baseline characteristics of patients with HCC. Student’s t-test or Wilcoxon rank-sum test was applied to compare differences between the two groups. LASSO and Cox regression analyses were used to build a prognostic model. Kaplan-Meier analysis with log-rank test was used to evaluate the survival differences of the two groups. The ROC curve was used to estimate the predictive ability of the risk models. The predictive values of the nomogram for 1- and 3-year OS were assessed using calibration and ROC curves. All statistical tests were two-sided and p < 0.05 was considered statistically significant.

Results

Identification of Epithelial–Mesenchymal Transition-Related LncRNAs in Hepatocellular Carcinoma

A total of 424 HCC samples with transcriptome data (374 tumor samples and 50 normal samples) and 377 HCC specimens containing clinical information were acquired from TCGA database. After removing the repeated samples, and patients with incomplete clinicopathological data and less than 30 days follow-up time, the remaining 343 patients with complete follow-up information and 319 patients with complete clinicopathological data were included in this study for subsequent analysis. Of the 343 samples, 240 patients were randomly assigned to the training set, 103 patients were distributed to testing set. The clinicopathological features of the HCC patients included in this study are shown in Table 1. Then, 200 EMT-related genes were downloaded from MSigDB. After excluding low expression genes, the left 159 EMT-related genes were identified in TCGA dataset (Supplementary Table 2). According to the Pearson correlation coefficient analysis, 449 EMT-related lncRNAs were screened based on the filtering criteria of correlation coefficient < 0.3 and p < 0.001.

TABLE 1
www.frontiersin.org

Table 1. Characteristics of hepatocellular carcinoma patients from TCGA dataset.

Establishment the Epithelial–Mesenchymal Transition-Related LncRNAs Prognostic Model in the Training Cohort

We identified 449 EMT-related lncRNAs in above analysis, of which 41 were significantly correlated with the survival of HCC patients based on univariate Cox regression analysis and Kaplan-Meier analysis in the training set (Supplementary Table 3). Thereafter, LASSO-penalized Cox regression and multivariate Cox regression analysis were performed to identify the prognostic lncRNAs, the most relevant prognostic parameters were calculated and cross-validated to establish the best prognostic model in the training set (Figures 2A,B). Eventually, 12 lncRNAs were selected to construct a prognostic signature with multivariate Cox regression coefficients, including AC103760.1, AC015908.3, LINC02362, LINC02499, F11-AS1, LINC00942, PRRT3-AS1, AC012146.1, AC092171.2, AC099850.3, CASC19, and AL158206.1 (Table 2). Besides, we constructed co-expression network of the 12 EMT-related lncRNAs and EMT-related protein-coding genes by using Cytoscape to visually present the correlation between EMT-related genes and EMT-related lncRNAs (Supplementary Figure 1).

FIGURE 2
www.frontiersin.org

Figure 2. Construction of the EMT-related lncRNAs risk signature in training cohort. (A) LASSO coefficient profiles of the EMT-related lncRNAs. (B) Cross validation for tuning parameter selection in the proportional hazards model. (C) Kaplan-Meier survival analysis of the high- and low-risk groups. (D) The precision of the prognostic model was evaluated with Multi-indicator receiver operating characteristic (ROC) curves. (E) Risk score curves based on the risk score of each patient. (F) Scatter plots based on the survival status of each patient. (G) Heatmap displaying the expression levels of the 12 EMT-related lncRNAs in high- and low-risk groups.

TABLE 2
www.frontiersin.org

Table 2. Multivariate Cox regression analysis of EMT-related lncRNA.

Based on the expression value of each lncRNA and corresponding regression coefficient, the individualized risk score was calculated as the following formula: Risk score = (–0.124 × AC103760.1 expression) + (–0.113 × AC015908.3 expression) + (–0.027 × LINC02362 expression) + (–0.021 × LINC02499 expression) + (–0.009 × F11.AS1 expression) + (0.02 × LINC00942 expression) + (0.03 × PRRT3.AS1 expression) + (0.033 × AC012146.1 expression) + (0.122 × AC092171.2 expression) + (0.142 × AC099850.3 expression) + (0.163 × CASC19 expression) + (0.313 × AL158206.1 expression). HCC patients in the training set were divided into low-risk subgroup (n = 120) and high-risk subgroup (n = 120) basing on the median risk score (median value = 0.8488). Kaplan-Meier survival curve indicated that the HCC patients in the high-risk group had worse OS than those in the low-risk group (p = 5.304e-06), as shown in Figure 2C. Time-ROC curve analysis was performed to evaluate the precision of the prognostic model, and the results demonstrated that the risk model provided a precise predictive effect for 1-, 3-, and 5-year OS, the AUC values were 0.828, 0.811, and 0.758, respectively (Supplementary Figure 2A). Risk score curves and scatter plots revealed that the higher the score, the higher the mortality rate observed in the patients (Figures 2E,F). The heatmap showed that seven lncRNAs (LINC00942, PRRT3-AS1, AC012146.1, AC092171.2, AC099850.3, CASC19, and AL158206.1) were upregulated in the high-risk group, while the other five lncRNAs (AC103760.1, AC015908.3, LINC02362, LINC02499, and F11-AS1) were upregulated in the low-risk group (Figure 2G).

Furthermore, univariate and multivariate cox proportional hazards regression analyses were performed to evaluate the independent prognostic factors for patients with HCC. The results showed that the risk signature [HR = 1.080, 95% CI (1.051–1.110), p < 0.001] and clinical stage [HR = 1.744, 95% CI (1.348–2.258), p < 0.001] were independent predictors for OS in patients with HCC (Table 3). Moreover, multi-indicator receiver operating characteristic (multi-ROC) curve analysis showed that the AUC value of the risk model was the highest at 0.825 in the training group (Figure 2D), which demonstrated that the EMT-related lncRNAs risk model provided a more precise prediction for OS than other clinical parameters in the patients with HCC.

TABLE 3
www.frontiersin.org

Table 3. Univariate and multivariate Cox regression analyses of the risk score model and clinical factors.

Identification the Epithelial–Mesenchymal Transition-Related LncRNAs Risk Model in the Validation Cohorts

To evaluate the applicability of the risk model, we conducted Kaplan-Meier analysis and Cox proportional hazards regression analysis in the testing and the entire cohorts. Firstly, we calculated the risk score of HCC patients in the validation cohorts according to the above-mentioned formula. The median risk score in the training cohort was applied to divide the patients (validation cohorts) into low-risk group or high-risk group. Kaplan–Meier analysis of the validation groups revealed that HCC patients with low risk had longer OS than those with high risk in testing set (p = 1.87e-02, Figure 3A) and entire set (p = 2.972e-06, Figure 3D). The AUC value of the risk model for predicting OS were 0.736 in the testing group and 0.794 in the entire group, indicating that it is more accurate than other clinical predictors in survival prediction (Figures 3B,E). The AUC values of the risk signature for 1-, 3-, and 5-year OS were 0.757, 0.675, and 0.682, respectively, in the testing group; and 0.809, 0.710, and 0.694, respectively, in the entire group (Supplementary Figures 2B,C). The results of the risk score curves, scatter plots, and heatmaps in the validation groups showed similar trends to those observed in the training group (Figures 3C,F). Cox proportional hazards regression analysis showed that EMT-related lncRNA risk model could act as an independent predictor for the survival of HCC patients in validation cohorts (Table 3).

FIGURE 3
www.frontiersin.org

Figure 3. Validation of the EMT-related lncRNAs risk signature in the testing and entire cohorts. (A) Kaplan–Meier survival analysis of the high- and low-risk groups in testing cohort. (B) The precision of the prognostic model was assessed with multi-indicator receiver operating characteristic (ROC) curves in testing cohort. (C) Risk score curves, survival status, and heatmap of lncRNAs in the testing group. (D) Kaplan–Meier survival analysis of the high- and low-risk groups in entire cohort. (E) Multi-indicator receiver operating characteristic (ROC) curves in entire cohort. (F) Risk score curves, survival status, and heatmap of lncRNAs in the entire group.

To further explore the prognostic value of the EMT-related lncRNAs risk signature in HCC patients, stratification analysis was performed in different subgroups. First, the patients in the entire cohort were divided into different subgroups based on age (≤65 and >65 years), gender (male and female), histological grade (grade 1–2 and grade 3–4), and clinical stage (stage I-II and stage III-IV). Then, Kaplan–Meier survival curve analysis was conducted in different subgroups. The results showed that patients with high risk in different age and clinical stage subgroups had worse survival than those with low risk, and the same results were observed in the male and Grade 1–2 subgroup. However, there was no significant difference in the female subgroup and patients with grade 3–4, which suggested that the prognostic signature was applicable in most subgroups of HCC (Figure 4).

FIGURE 4
www.frontiersin.org

Figure 4. Kaplan–Meier survival analysis in different clinicopathological characteristic subgroups. (A) Age ≤ 60; (B) Age > 60; (C) Male; (D) Female; (E) Grade 1–2; (F) Grade 3–4; (G) Clinical stage I–II; (H) Clinical stage III–IV.

Differential Expression of the Epithelial–Mesenchymal Transition-Related LncRNAs in Hepatocellular Carcinoma Cells and Tissue Samples

The four lncRNAs (AC099850.3, AC092171.2, AL158206.1, and CASC19) with the greatest influence on the survival, which had confirmed in Cox proportional hazards regression analysis (Table 3), were selected for differential expression analysis in HCC cell lines and tissues samples. We first compared the four lncRNAs expression levels between normal liver cell line L-02 and five different HCC cell lines (HCC-LM3, SMMC97-H, SMMC-7721, HepG2, and Huh7) using real-time reverse transcription-quantitative polymerase chain reaction (RT-qPCR). The results showed that mRNA expression of AC099850.3 and AC092171.2 were higher in HCC cell lines compared with hepatic cell line (Figures 5A,B). Elevated mRNA expression of AL158206.1 and CASC19 was observed in part of human HCC cell lines (Figures 5C,D). Then, the differential expression analysis of the four lncRNAs was carried out in 30 paired HCC and para-cancer tissues by using RT-qPCR. The clinical information was displayed in Supplementary Table 4. The results of RT-qPCR revealed that AC099850.3 (p = 0.0023) and AC092171.2 (p = 0.0196) were significantly highly expressed in HCC tissues; whereas, no significant difference was observed in AL158206.1, and CASC19 between tumor tissues and para-carcinoma tissues (Figures 5E–H). These results were consistent with results from TCGA database, which conducted in 374 HCC tissues and 50 normal tissues with Wilcoxon rank-sum test (Figures 5I–L).

FIGURE 5
www.frontiersin.org

Figure 5. Expression level of lncRNAs in HCC cell lines, clinical tissue samples, and TCGA database. (A–D) The RT-qPCR analysis of the expression levels of AC099850.3, AC092171.2, AL158206.1, and CASC19 in five HCC cells and normal liver cell line (L-02). Error bars represent the SD of triplicate experiments. (E–H) The expression levels of AC099850.3, AC092171.2, AL158206.1, and CASC19 in 30 paired carcinoma tissues and adjacent tissues in HCC patients. (I–L) Expression analysis of AC099850.3, AC092171.2, AL158206.1, and CASC19 in HCC tissues and normal tissues derived from TCGA dataset. ns: not statistically significant, *p < 0.05 and ***p < 0.001.

A Nomogram Combining Risk Score With Other Clinical Features Predicting Overall Survival in Hepatocellular Carcinoma

We incorporated age, gender, grade, clinical stage, and risk score to establish a nomogram both in training cohort and validation cohorts to predict the 1- and 3-year OS for patients with HCC. The Comprehensive scores were obtained by combining each clinical factor; the higher the total score of patients, the worse the prognosis (Figures 6A,D,G). The predictive consistency and accuracy of the nomogram for 1- and 3-year OS were assessed using calibration curves and ROC curves. In training cohort, the calibration curves showed the nomogram model nearly in accordance with reality, and ROC curves demonstrated that the nomogram provided an accurate survival prediction with AUC = 0.795 for 1-year OS and 0.773 for 3-years OS as shown in Figures 6B–C. Similar results were showed in testing cohort (Figures 6E,F) and entire cohort (Figures 6H,I). These results demonstrate that the nomogram exhibited preferable clinical practicality in predicting the prognosis of HCC patients.

FIGURE 6
www.frontiersin.org

Figure 6. Construction of the predictive nomogram for HCC patients. (A) Nomogram predicting the survival probability of 1- and 3-year OS in training group; (B) multi-indicator receiver operating characteristic (ROC) curves of the nomogram for 1- and 3-year OS in training group; (C) calibration curve of the nomogram for predicting 1- and 3-year OS in training group; (D) nomogram predicting the survival probability of 1- and 3-year OS in testing group; (E) multi-indicator receiver operating characteristic (ROC) curves of the nomogram for 1- and 3-year OS in testing group; (F) calibration curve of the nomogram for predicting 1- and 3-year OS in testing group; (G) nomogram predicting the survival probability of 1- and 3-year OS in entire group; (H) multi-indicator receiver operating characteristic (ROC) curves of the nomogram for 1- and 3-year OS in entire group; (I) calibration curve of the nomogram for predicting 1- and 3-year OS in entire group.

Comparison of the Risk Signature and Other Epithelial–Mesenchymal Transition-Related Models in Hepatocellular Carcinoma

Next, we compared the predictive performance of the risk signature built in this study with other five EMT-related prognostic models for HCC, including the EMT-related five-lncRNA signature (31), EMT-related teen-gene signature (32), EMT-related five-gene signature (33), the EMT-related six-gene signature (34). We mainly compared the AUCs of ROC curves to evaluate the accuracy of these risk model. As shown in Table 4, the results indicated the risk signature built in our study exhibited a higher predictive efficacy with AUC values greater 0.7 in training group when compared with the previous EMT-related lncRNA models. Furthermore, an independent validation set was used to verify the predictive ability of the risk model, and a similar predictive performance was observed in test cohort, indicating this risk signature has good practicability and generalizability.

TABLE 4
www.frontiersin.org

Table 4. Comparison of the risk model and other models.

Function Analysis of the Epithelial–Mesenchymal Transition-Related LncRNAs Prognostic Signature

Gene set enrichment analysis was conducted in the entire group to probe the potential biological mechanism of the EMT-related lncRNAs signature. The results suggested tumor-related pathways, including bladder cancer, thyroid cancer, pathways in cancer, VEGF signaling pathway, cell cycle, and DNA replication were highly enriched in the high-risk group with FDR (q-value) < 0.05, as shown in Figure 7A.

FIGURE 7
www.frontiersin.org

Figure 7. Functional analysis between the low- and high-risk group in the entire cohort. (A) GSEA analysis based on the risk grouping. (B,C) GO analysis between high- and low-risk groups. GSEA, gene set enrichment analysis; GO, Gene Ontology.

To further determine the biofunctions associated with the risk signature, differentially expressed genes were extracted from the TCGA database based on risk grouping. A total of 969 DEGs (upregulated 832 genes, downregulated 146 genes, fold change > 1, p < 0.05) were identified in the high-risk group compared to the low-risk group. We then selected the DEGs for GO enrichment analysis, and the results showed that these DEGs were mainly enriched in terms associated with cell division process (Figures 7B,C), suggested that the proliferation ability of HCC cells in the high-risk group was notably more rapid than that in the low-risk group.

The Relationship Between the Epithelial–Mesenchymal Transition-Related LncRNAs Prognostic Model and Tumor Immune Features

To further investigate the potential correlation between the risk signature and the tumor immunity in HCC, we employed ssGSEA analysis to compare the difference of 16 immune infiltrating cells and 13 immune function-related pathways between high- and low-risk groups in entire cohort. The results revealed that the infiltration level of immune cells, including aDCs, macrophages, Th2 cells, and Treg cells, were significantly upregulated in the high-risk groups, but natural killer (NK) cells were upregulated in the low-risk group (P < 0.05; Figure 8A). And we found antigen-presenting cell (APC) co-stimulation, checkpoint, and major histocompatibility complex-I (MHC-I) were significantly activated in the high-risk group; nevertheless, the pathways of INF-I and INF-II responses were evidently activated in the low-risk group (P < 0.05) (Figure 8B). These findings suggested that the risk signature could influence the tumor immune statues in patients with HCC.

FIGURE 8
www.frontiersin.org

Figure 8. The relationship between the risk score, immune infiltration cells, and immune functions based on ssGSEA. (A) Comparison of the infiltration levels of 16 immune cells between the low- and high-risk groups. (B) Comparison of the 13 immune functions between the low- and high-risk groups. ns: not statistically significant, *p < 0.05; **p < 0.01; ***p < 0.001. ssGSEA: single-sample gene set enrichment analysis.

The Association of Risk Model With Single Nucleotide Polymorphisms

It has been confirmed that SNPs contributes to tumorigenesis and tumor progression. Tumor mutation burden (TMB) is defined as the total number of somatic mutations detected per million bases, including gene coding errors, base substitution, gene insertion or deletion errors. TMB has been reported to be used as a valid prognostic biomarkers and immune therapy response indicators in many kinds of tumor (35). Thus, we explored the connection among the SNPs, TMB, and the EMT-related lncRNAs signature. The top 20 mutation genes in high- and low-risk groups were shown in Figures 9A,B, the cumulative incidence plots showed that somatic mutations occurred more frequently in the high-risk group compared to the low-risk group (87.65 vs. 81.93%), and the prevalence of TP53 mutations was significantly different between high- and low-risk groups (p < 0.001) (Figure 9C). A significant association was observed between TP53 mutational status and prognosis from Kaplan–Meier analysis (p = 0.0163), indicating that the HCC patients with TP53 mutation had worse prognosis than those without TP53 mutation (Figure 9D). Meanwhile, we found the high TMB was also associated with a worse prognosis (p < 0.001) (Figure 9E). The patients with the low TMB and low-risk had the best overall survival compared with other groups, while the high TMB and high-risk group had the worst prognosis (Figure 9F). Collectively, it can be concluded that the risk signature may relevant to the somatic mutations, thus contribute to tumor progression.

FIGURE 9
www.frontiersin.org

Figure 9. The association between risk signature and single-nucleotide variant in HCC. (A,B) The top 20 genes mutations in high- and low-risk groups based on the TCGA entire cohort; (C) the occurrence probability of TP53 mutations in high- and low-risk groups. (D) Kaplan–Meier survival analysis according to the mutation state of TP53. (E,F) Kaplan–Meier survival analysis based on the TMB and risk signature.

Differential Response to Chemotherapy, Targeted Therapy, and Immunotherapy Based on Risk Signature

To explore the different response to drugs treatment between high- and low-risk groups, we applied the pRRophetic algorithm to estimate the IC50 of chemotherapeutic agents and targeted drugs, and compared the IC50 of different drugs in two risk groups with Wilcoxon rank-sum test. Doxorubicin, camptothecin, mitomycin C, cisplatin, gemcitabine, and sorafenib were included in this analysis. As shown in Figures 10A–F, we found higher IC50 of Doxorubicin, camptothecin, mitomycin C, cisplatin, gemcitabine was presented in low-risk group, whereas low-risk group was associated with a higher sensitivity to sorafenib, and no significant differences was observed in camptothecin. These results might guide individualized chemotherapy or targeted therapy for different risk populations.

FIGURE 10
www.frontiersin.org

Figure 10. Comparison of treatment drugs sensitivity between high- and low-risk groups. (A–F) IC50 of doxorubicin, cisplatin, gemcitabine, mitomycin C, camptothecin, and sorafenib in high- and low-risk groups. (G–L) The difference of the expression level of CTLA4, PD-1, TIM-3, IDO-1, PD-L1, and PD-L2 between high- and low-risk groups. (M–P) Comparison of the treatment response of ICIs in two risk score groups. ICIs, immune checkpoint inhibitors; IC50, half-maximal inhibitory concentration.

Subsequently, we investigated the potential roles of the risk signature in predicting ICIs response in HCC patients. First, we compared the expression level of 46 immune related genes between high- and low-risk groups. We found 34 of the 46 immune genes significantly differentially expressed in different risk subgroups (P < 0.05) (Supplementary Figure 3). Among the six key genes (CTLA4, PD-1, IDO1, TIM-3, PD-L1, and PD-L2), the expression levels of CTLA4, PD-1, IDO1, and TIM-3 were significantly increased in the high-risk group, whereas there was no distinct difference in the expression of PD-L1 and PD-L2 between high- and low-risk groups (Figures 10G–L). This result might provide better guidance for drugs selection of ICIs in HCC patients. Moreover, we further explored the relationship between the risk signature and the IPS in HCC samples. The results indicated that patients in the low-risk group showed higher IPS in two subtypes [IPS-CTLA4(–)/PD-1(–), IPS-CTLA4(+)/PD-1(–)]; however, there no significant association between risk signature and the other two IPS subtypes [IPS-CTLA4(+)/PD-1(+), IPS-CTLA4(–)/PD-1(+)] (Figures 10M–P). The results indicated that the risk signature might contribute to the evaluation of response to ICBs in HCC patients.

Discussion

Hepatocellular carcinoma is one of the most common malignant tumors worldwide, characterized by high recurrence rates, high mortality rates, and poor prognosis in patients (36). Growing evidence suggests that EMT plays a crucial role in the progression, invasion, and metastasis of various tumors (37). Thus, we focused on the EMT as the cut-in point to discover novel prognosis markers for HCC. In this study, an EMT-related 12-lncRNAs risk model was constructed and validated for HCC patients based on TCGA dataset, we found that the risk model could independently and accurately predict the OS of HCC patients in training cohort and testing cohort. In addition, we discovered that the 12-lncRNAs risk signature is connected to the immune infiltration level, key gene mutations, and drugs sensitivity in patients with HCC.

Previous researches have revealed that lncRNAs participate in the regulation of EMT and ultimately affect tumor progression and metastasis. Moreover, EMT-related lncRNAs could serve as novel and effective biomarkers and therapeutic targets for various tumors (38). However, the prognostic value of the EMT-related lncRNAs has rarely been investigated in HCC. Hence, in this study, we focused on constructing and validating a new EMT-related lncRNAs prognostic signature for HCC. Based on RNA-Seq and clinical information from the TCGA dataset, we screened out 12 EMT-related lncRNAs to construct a prognostic risk signature using Cox regression and LASSO regression in training cohort. Kaplan–Meier analysis indicated that the survival of HCC patients in the high-risk group was worse than those in the low-risk group, and the risk model achieved well prediction performance with AUC = 0.828 for 1-year OS, AUC = 0.811 for 3-year OS, and AUC = 0.758 for 5-years OS. The lncRNAs signature exhibited more sensitivity and specificity, and acted as independent indicators for HCC patients when compared with other clinical parameters. More importantly, the prognostic availability of the risk model was validated in the testing and entire groups. Besides, a nomogram, established basing on age, gender, grade, clinical stage, and risk score, provided better clinical practicality than the traditional tumor grade or clinical stage system. In comparison with the recently published studies that explored the EMT-related lncRNAs or other prognostic predictors in HCC (31), Our study demonstrates certain strengths. First, 343 HCC samples obtained from TCGA database were randomly assigned to training and testing group, the 12 EMT-related lncRNAs signature was built based on training cohort, and we tested the performance of the model with independent testing cohort. Next, the risk signature showed a high predictive value with AUC values between 0.7 and 0.9 both in training group and validation groups. Furthermore, the differential expression of some lncRNAs in this risk model were firstly identified in HCC cell lines and HCC tissues. In addition, we comprehensively explored the correlation of risk signature with immune cell infiltration feature, chemosensitivity, immunotherapy response, and somatic mutations status.

Some lncRANs in this risk model have been reported to be associated with the malignant phenotypes of HCC or other cancers. For instance, LINC02499 was significantly downregulated in HCC tissues compared to para-cancerous tissues, and its low expression was also remarkably correlated with poorer overall survival (39). lncRNA F11-AS1 was low expression in HCC tissues and attenuated tumor growth and metastasis via the F11-AS1/miR-3146/PTEN axis (40). In addition, the low expression of lncRNA F11-AS1 was correlated with poor prognosis in patients with HBV-related HCC via regulation of the miR-211-5p/NR1I3 pathway (41). LINC00942 acts as an oncogene in breast cancer that promotes cell proliferation and colony formation and inhibits cell apoptosis (42). The lncRNA CASC19 is markedly upregulated and is associated with tumor progression and poorer prognoses in various cancers, including colorectal cancer, advanced gastric cancer, non-small lung cancer cells, and pancreatic cancer (4346). However, the other lncRNA in this risk model, including AC103760.1, AC015908.3, LINC02362, AC012146.1, AC092171.2, AC099850.3, and AL158206.1, have been rarely reported. In this study, we verified the expression of the four lncRNAs (AC099850.3, AC092171.2, AL158206.1, and CASC19) in HCC, which exhibited the greatest impact on the risk score and prognosis. We found that the mRNA expression levels of AC099850.3 and AC092171.2 was significantly higher in HCC cells and tissues, while the AL158206.1 and CASC19 is upregulated in part HCC cell lines, and no significant difference was observed in CASC19 and AL158206.1 expression between HCC tissues and para-cancerous tissues. Thus, the risk signature could discover novel biomarkers for further research.

Epithelial–mesenchymal transition promote tumor progression and metastasis not only through reprogramming of cancer cells but also through reshaping tumor microenvironment (TME) (47). Numerous studies have indicated that tumor cells undergoing EMT acquire the ability to secrete some of cytokines, chemokines and growth factors that could regulate immune infiltrating cells in TME, thus promoting cancer immune escape (10). In turn, the occurrence of EMT is closely associated with the TME. Zhang et al. (48) revealed that cancer-associated fibroblasts in TME are implicated in modulating enhancing EMT and suppressing the anti-tumor immunity of HCC cells, which ultimately benefit for HCC initiation with malignant phenotypes. The exosomes in the TME derived from distinct cell types mediate the drug resistance by regulating epithelial-mesenchymal transition (EMT) and immune response (49). Our previous study reported that some TME-related lncRNAs could be applied as an effective prognostic biomarker for HCC patients (50). In this study, we intensively explored the relationship of EMT-related lncRNAs signature with immune infiltrating cells and immune function-related pathways in TME. Thus, we provided insights into the EMT-mediated TME regulatory mechanisms by constructing a EMT-lncRNAs-TME interactive network. ssGSEA analysis revealed that immune cells, including aDCs, macrophages, Th2 cells, and Treg cells, were enriched in high-risk group, but NK cells were upregulated in the low-risk group. Besides, APC co-stimulation, check-point, and MHC-I were significantly upregulated in the high-risk group; while the INF-I and INF-II responses were upregulated in the low-risk group. It is generally known that Macrophages play a prominent role in tumorigenesis and cancer progression by presenting M1 and M2 polarization, and M2 polarization induces cellular proliferation, angiogenesis, and EMT in HCC (51). It has been reported that the infiltration of macrophages is proportional to Tregs and inversely proportional to CD8+ T cells in HCC, and the absence of NK cells may trend toward M2 macrophages and accelerate liver fibrogenesis (51). Given the above results, we speculated that the tumor immune response in the high-risk group was lower than that in low-risk group, which mainly owning to immunosuppressive cells, such as macrophages and Tregs. Conversely, the anti-tumor immune response pathway was enriched in the low-risk group, which may contribute to a favorable prognosis.

Increasing evidence has demonstrated that EMT is also involved in other malignant behaviors, such as irradiation resistance, chemoresistance, and targeted drug resistance in HCC (5, 5255). According the risk signature constructed by EMT-related lncRNAs, the IC50 of common chemotherapeutic agents and targeted drugs were calculated to assess the drugs sensitivity in different risk groups. The results indicated doxorubicin, mitomycin C, cisplatin, gemcitabine seem to be more suitable for the patients in the high-risk group. However, sorafenib shows significant superiority in the low-risk group. These results may provide potential guidance for individualized chemotherapy and targeted therapy. In addition, the relationship among the immune checkpoint-related genes, IPS, and the TME-related lncRNA signature was further investigated, we found that 34 out of the 46 immune checkpoint -related genes were significantly different between the high- and low-risk groups, and the expression levels of CTLA4, PD-1, IDO, and TIM-3 were significantly upregulated in the high-risk group. Furthermore, HCC patients in the low-risk group exhibited higher IPS in two subtypes [IPS-CTLA4(–)/PD-1(–), IPS-CTLA4(+)/PD-1(–)] and tended to have a better response to ICIs. In short, our findings might guide personalized therapeutic strategies for patients with HCC.

There are a few limitations in this study. First, the 12 EMT-related lncRNAs risk signature was constructed and evaluated using limited data and clinical information from TCGA database, but without external validation from other database or clinical samples, which may limit the generalizability of our model. In addition, some of the lncRNAs reported in this study have not been previously studied; therefore, it is critical to verify the functional features and molecular mechanisms of the risk signature model. Next, we will continue to pay attention to the role of the signature in clinical practice and further conduct related experiments to explore the functions and mechanisms of the EMT-related lncRNAs. Besides, in recent years, single-cell multi-omics approaches and spatial transcriptomics are becoming available and wildly applied in discovering distinctive biomarkers and exploring tissue cytoarchitectures and functions (56, 57). Thus, we will further investigate the potential molecular and functional mechanisms of the 12 EMT-related lncRNA signatures in HCC basing on single-cell multi-omics and spatial transcriptomics.

In conclusion, our study established a novel EMT-related prognostic risk signature including 12 lncRNAs and constructed a nomogram to predict the overall survival of HCC patients, which may improve the prognostic predictive efficacy and guide the individualized treatment for the patients with HCC.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession numbers can be found in the article/Supplementary Material. Raw data are available from the corresponding author upon reasonable request.

Ethics Statement

The studies involving human participants were reviewed and approved by the Second Affiliated Hospital of Nanchang University Medical Research Ethics Committee. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

SH and JW: conceptualization. SH: investigation. LZ, DL, and JZ: methodology. LZ and DL: supervision. LZ and JZ: visualization. SH: writing-original draft. JW: writing-review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant No. 82060435).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2022.850343/full#supplementary-material

Supplementary Figure 1 | Co-expression network of the prognostic EMT-related lncRNAs and EMT-related genes.

Supplementary Figure 2 | Time-dependent receiver operating characteristic (time-ROC) curve evaluated the precision of the prognostic model. (A) In training group; (B) in testing group; (C) in entire group.

Supplementary Figure 3 | The correlation of TME-related lncRNA signature with the expression of 46 ICIs-related genes. ICIs, immune checkpoint inhibitors. ns, not statistically significant. *p < 0.05; **p < 0.01; ***p < 0.001.

Abbreviations

APC, antigen-presenting cell; AUC, area under the curve; HCC, hepatocellular carcinoma; EMT, epithelial-mesenchymal transition; LncRNA, long non-coding RNA; FPKM, fragments per kilobase of transcript per million mapped reads; GO, Gene Ontology; GSEA, gene set enrichment analysis; HR, hazard ratio; IC 50, half-maximal inhibitory concentration; ICIs, immune checkpoint inhibitors; IPS, immunophenoscore; LASSO, the least absolute shrinkage and selection operator; MHC-I, major histocompatibility complex-I; multi-ROC, multi-indicator receiver operating characteristic; OS, overall survival; RT-qPCR, real-time reverse transcription-quantitative polymerase chain reaction; ROC, receiver operating characteristic; ssGSEA, Single-sample gene set enrichment analysis; SNPs, single nucleotide polymorphisms; TCGA, The Cancer Genome Atlas; TGF-β, Transforming growth factor-β; time-ROC, time-dependent receiver operating characteristic; TMB, tumor mutation burden.

Footnotes

  1. ^ https://cancergenome.nih.gov/
  2. ^ http://asia.ensembl.org/index.html
  3. ^ http://www.r-project.org/
  4. ^ http://www.broadinstitute.org/gsea/msigdb/index.jsp
  5. ^ http://www.cytoscape.org/
  6. ^ http://www.gsea-msigdb.org/gsea/index.jsp
  7. ^ http://www.gsea-msigdb.org/gsea/msigdb/collections.jsp
  8. ^ https://www.perl.org/
  9. ^ https://tcia.at/home
  10. ^ https://www.r-project.org/
  11. ^ http://www.graphpad-prism.cn

References

1. Yoon JS, Sinn DH, Lee JH, Kim HY, Lee CH, Kim SW, et al. Tumor marker-based definition of the transarterial chemoembolization-refractoriness in intermediate-stage hepatocellular carcinoma: a multi-cohort study. Cancers (Basel). (2019) 11:1721. doi: 10.3390/cancers11111721

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Villanueva A, Longo DL. Hepatocellular carcinoma. N Engl J Med. (2019) 380:1450–62. doi: 10.1056/NEJMra1713263

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Jemal A, Ward EM, Johnson CJ, Cronin KA, Ma J, Ryerson B, et al. Annual report to the nation on the status of cancer, 1975-2014, featuring survival. J Natl Cancer Inst. (2017) 109:djx030. doi: 10.1093/jnci/djx030

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Zeisberg EM, Tarnavski O, Zeisberg M, Dorfman AL, McMullen JR, Gustafsson E, et al. Endothelial-to-mesenchymal transition contributes to cardiac fibrosis. Nat Med. (2007) 13:952–61. doi: 10.1038/nm1613

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Iwahashi S, Shimada M, Utsunomiya T, Imura S, Morine Y, Ikemoto T, et al. Epithelial-mesenchymal transition-related genes are linked to aggressive local recurrence of hepatocellular carcinoma after radiofrequency ablation. Cancer Lett. (2016) 375:47–50. doi: 10.1016/j.canlet.2016.02.041

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Diepenbruck M, Christofori G. Epithelial-mesenchymal transition (EMT) and metastasis: yes, no, maybe? Curr Opin Cell Biol. (2016) 43:7–13. doi: 10.1016/j.ceb.2016.06.002

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Simeone P, Trerotola M, Franck J, Cardon T, Marchisio M, Fournier I, et al. The multiverse nature of epithelial to mesenchymal transition. Semin Cancer Biol. (2019) 58:1–10. doi: 10.1016/j.semcancer.2018.11.004

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Coulouarn C, Factor VM, Thorgeirsson SS. Transforming growth factor-beta gene expression signature in mouse hepatocytes predicts clinical outcome in human cancer. Hepatology. (2008) 47:2059–67. doi: 10.1002/hep.22283

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Fernando J, Malfettone A, Cepeda EB, Vilarrasa-Blasi R, Bertran E, Raimondi G, et al. A mesenchymal-like phenotype and expression of CD44 predict lack of apoptotic response to sorafenib in liver tumor cells. Int J Cancer. (2015) 136:E161–72. doi: 10.1002/ijc.29097

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Dongre A, Weinberg RA. New insights into the mechanisms of epithelial-mesenchymal transition and implications for cancer. Nat Rev Mol Cell Biol. (2019) 20:69–84. doi: 10.1038/s41580-018-0080-4

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Yuan L, Xu ZY, Ruan SM, Mo S, Qin JJ, Cheng XD. Long non-coding RNAs towards precision medicine in gastric cancer: early diagnosis, treatment, and drug resistance. Mol Cancer. (2020) 19:96. doi: 10.1186/s12943-020-01219-0

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Fan Y, Wang YF, Su HF, Fang N, Zou C, Li WF, et al. Decreased expression of the long noncoding RNA LINC00261 indicate poor prognosis in gastric cancer and suppress gastric cancer metastasis by affecting the epithelial-mesenchymal transition. J Hematol Oncol. (2016) 9:57. doi: 10.1186/s13045-016-0288-8

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Han L, Jia L, Zan Y. Long intergenic noncoding RNA smad7 (Linc-smad7) promotes the epithelial-mesenchymal transition of HCC by targeting the miR-125b/SIRT6 axis. Cancer Med. (2020) 9:9123–37. doi: 10.1002/cam4.3515

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Chen BW, Zhou Y, Wei T, Wen L, Zhang YB, Shen SC, et al. lncRNA-POIR promotes epithelial-mesenchymal transition and suppresses sorafenib sensitivity simultaneously in hepatocellular carcinoma by sponging miR-182-5p. J Cell Biochem. (2021) 122:130–42. doi: 10.1002/jcb.29844

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Xiao C, Wan X, Yu H, Chen X, Shan X, Miao Y, et al. LncRNA-AB209371 promotes the epithelial-mesenchymal transition of hepatocellular carcinoma cells. Oncol Rep. (2019) 41:2957–66. doi: 10.3892/or.2019.7045

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Zhu L, Yang N, Du G, Li C, Liu G, Liu S, et al. LncRNA CRNDE promotes the epithelial-mesenchymal transition of hepatocellular carcinoma cells via enhancing the Wnt/β-catenin signaling pathway. J Cell Biochem. (2018) 120:1156–64. doi: 10.1002/jcb.26762

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Zhang PF, Wang F, Wu J, Wu Y, Huang W, Liu D, et al. LncRNA SNHG3 induces EMT and sorafenib resistance by modulating the miR-128/CD151 pathway in hepatocellular carcinoma. J Cell Physiol. (2019) 234:2788–94. doi: 10.1002/jcp.27095

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Yang L, Deng WL, Zhao BG, Xu Y, Wang XW, Fang Y, et al. FOXO3-induced lncRNA LOC554202 contributes to hepatocellular carcinoma progression via the miR-485-5p/BSG axis. Cancer Gene Ther. (2021) 29:326–40. doi: 10.1038/s41417-021-00312-w

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Wang Y, Liu Z, Yao B, Li Q, Wang L, Wang C, et al. Long non-coding RNA CASC2 suppresses epithelial-mesenchymal transition of hepatocellular carcinoma cells through CASC2/miR-367/FBXW7 axis. Mol Cancer. (2017) 16:123. doi: 10.1186/s12943-017-0702-z

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Song S, Qiu X. LncRNA miR503HG inhibits epithelial-mesenchymal transition and angiogenesis in hepatocellular carcinoma by enhancing PDCD4 via regulation of miR-15b. Dig Liver Dis. (2021) 53:107–16. doi: 10.1016/j.dld.2020.09.008

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Du Y, Wang B, Jiang X, Cao J, Yu J, Wang Y, et al. Identification and validation of a stromal EMT related LncRNA signature as a potential marker to predict bladder cancer outcome. Front Oncol. (2021) 11:620674. doi: 10.3389/fonc.2021.620674

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Zhong W, Zhang F, Huang C, Lin Y, Huang J. Identification of epithelial-mesenchymal transition-related LncRNA with prognosis and molecular subtypes in clear cell renal cell carcinoma. Front Oncol. (2020) 10:591254. doi: 10.3389/fonc.2020.591254

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Li D, Lin X, Chen B, Ma Z, Zeng Y, Wang H. Identification and validation of EMT-related lncRNA prognostic signature for colorectal cancer. Front Genet. (2021) 12:723802. doi: 10.3389/fgene.2021.723802

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Xiao B, Liu L, Chen Z, Li A, Wang P, Xiang C, et al. Identification of epithelial-mesenchymal transition-related prognostic lncRNAs biomarkers associated with melanoma microenvironment. Front Cell Dev Biol. (2021) 9:679133. doi: 10.3389/fcell.2021.679133

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Yang X, Niu S, Liu J, Fang J, Wu Z, Ling S, et al. Identification of an epithelial-mesenchymal transition-related lncRNA prognostic signature for patients with glioblastoma. Sci Rep. (2021) 11:23694. doi: 10.1038/s41598-021-03213-y

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Jiang HY, Ning G, Wang YS, Lv WB. 14-CpG-based signature improves the prognosis prediction of hepatocellular carcinoma patients. Biomed Res Int. (2020) 2020:9762067. doi: 10.1155/2020/9762067

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Zhu Y, Yang L, Shen H. Value of the application of CE-MRI radiomics and machine learning in preoperative prediction of sentinel lymph node metastasis in breast cancer. Front Oncol. (2021) 11:757111. doi: 10.3389/fonc.2021.757111

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Geeleher P, Cox N, Huang RS. Prrophetic: an R package for prediction of clinical chemotherapeutic response from tumor gene expression levels. PLoS One. (2014) 9:e107468. doi: 10.1371/journal.pone.0107468

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Nishino M, Ramaiya NH, Hatabu H, Hodi FS. Monitoring immune-checkpoint blockade: response evaluation and biomarker development. Nat Rev Clin Oncol. (2017) 14:655–68. doi: 10.1038/nrclinonc.2017.88

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Nie Y, Li J, Wu W, Guo D, Lei X, Zhang T, et al. A novel nine-lncRNA risk signature correlates with immunotherapy in hepatocellular carcinoma. Front Oncol. (2021) 11:706915. doi: 10.3389/fonc.2021.706915

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Xu BH, Jiang JH, Luo T, Jiang ZJ, Liu XY, Li LQ. Signature of prognostic epithelial-mesenchymal transition related long noncoding RNAs (ERLs) in hepatocellular carcinoma. Medicine (Baltimore). (2021) 100:e26762. doi: 10.1097/md.0000000000026762

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Wang X, Xing Z, Xu H, Yang H, Xing T. Development and validation of epithelial mesenchymal transition-related prognostic model for hepatocellular carcinoma. Aging (Albany NY). (2021) 13:13822–45. doi: 10.18632/aging.202976

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Zhu G, Xia H, Tang Q, Bi F. An epithelial-mesenchymal transition-related 5-gene signature predicting the prognosis of hepatocellular carcinoma patients. Cancer Cell Int. (2021) 21:166. doi: 10.1186/s12935-021-01864-5

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Wu G, Yang Y, Zhu Y, Li Y, Zhai Z, An L, et al. Comprehensive analysis to identify the epithelial-mesenchymal transition-related immune signatures as a prognostic and therapeutic biomarkers in hepatocellular carcinoma. Front Surg. (2021) 8:742443. doi: 10.3389/fsurg.2021.742443

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Zhao Y, Zhang M, Pu H, Guo S, Zhang S, Wang Y. Prognostic implications of pan-cancer CMTM6 expression and its relationship with the immune microenvironment. Front Oncol. (2020) 10:585961. doi: 10.3389/fonc.2020.585961

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Srivastava S, Wong KF, Ong CW, Huak CY, Yeoh KG, Teh M, et al. A morpho-molecular prognostic model for hepatocellular carcinoma. Br J Cancer. (2012) 107:334–9. doi: 10.1038/bjc.2012.230

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Yeung KT, Yang J. Epithelial-mesenchymal transition in tumor metastasis. Mol Oncol. (2017) 11:28–39. doi: 10.1002/1878-0261.12017

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Statello L, Guo CJ, Chen LL, Huarte M. Gene regulation by long non-coding RNAs and its biological functions. Nat Rev Mol Cell Biol. (2021) 22:96–118. doi: 10.1038/s41580-020-00315-9

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Ma X, Mo M, Tan HJJ, Tan C, Zeng X, Zhang G, et al. LINC02499, a novel liver-specific long non-coding RNA with potential diagnostic and prognostic value, inhibits hepatocellular carcinoma cell proliferation, migration, and invasion. Hepatol Res. (2020) 50:726–40. doi: 10.1111/hepr.13491

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Du J, Chen M, Liu J, Hu P, Guan H, Jiao X. LncRNA F11-AS1 suppresses liver hepatocellular carcinoma progression by competitively binding with miR-3146 to regulate PTEN expression. J Cell Biochem. (2019) 120:18457–64. doi: 10.1002/jcb.29163

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Deng Y, Wei Z, Huang M, Xu G, Wei W, Peng B, et al. Long non-coding RNA F11-As1 inhibits HBV-related hepatocellular carcinoma progression by regulating NR1I3 via binding to microRNA-211-5p. J Cell Mol Med. (2020) 24:1848–65. doi: 10.1111/jcmm.14881

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Sun T, Wu Z, Wang X, Wang Y, Hu X, Qin W, et al. LNC942 promoting METTL14-mediated m6A methylation in breast cancer cell proliferation and progression. Oncogene. (2020) 39:5358–72. doi: 10.1038/s41388-020-1338-9

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Lu T, Wei GH, Wang J, Shen J. LncRNA CASC19 contributed to the progression of pancreatic cancer through modulating miR-148b/E2F7 axis. Eur Rev Med Pharmacol Sci. (2020) 24:10462–71. doi: 10.26355/eurrev_202010_23399

CrossRef Full Text | Google Scholar

44. Wang L, Lin C, Sun N, Wang Q, Ding X, Sun Y. Long non-coding RNA CASC19 facilitates non-small cell lung cancer cell proliferation and metastasis by targeting the miR-301b-3p/LDLR axis. J Gene Med. (2020) 22:e3254. doi: 10.1002/jgm.3254

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Wang WJ, Guo CA, Li R, Xu ZP, Yu JP, Ye Y, et al. Long non-coding RNA CASC19 is associated with the progression and prognosis of advanced gastric cancer. Aging (Albany NY). (2019) 11:5829–47. doi: 10.18632/aging.102190

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Wang XD, Lu J, Lin YS, Gao C, Qi F. Functional role of long non-coding RNA CASC19/miR-140-5p/CEMIP axis in colorectal cancer progression in vitro. World J Gastroenterol. (2019) 25:1697–714. doi: 10.3748/wjg.v25.i14.1697

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Lou Y, Diao L, Cuentas ER, Denning WL, Chen L, Fan YH, et al. Epithelial-mesenchymal transition is associated with a distinct tumor microenvironment including elevation of inflammatory signals and multiple immune checkpoints in lung adenocarcinoma. Clin Cancer Res. (2016) 22:3630–42. doi: 10.1158/1078-0432.Ccr-15-1434

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Zhang J, Gu C, Song Q, Zhu M, Xu Y, Xiao M, et al. Identifying cancer-associated fibroblasts as emerging targets for hepatocellular carcinoma. Cell Biosci. (2020) 10:127. doi: 10.1186/s13578-020-00488-y

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Zhang J, Song Q, Wu M, Zheng W. The emerging roles of exosomes in the chemoresistance of hepatocellular carcinoma. Curr Med Chem. (2021) 28:93–109. doi: 10.2174/0929867327666200130103206

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Huang S, Zhang J, Lai X, Zhuang L, Wu J. Identification of novel tumor microenvironment-related long noncoding RNAs to determine the prognosis and response to immunotherapy of hepatocellular carcinoma patients. Front Mol Biosci. (2021) 8:781307. doi: 10.3389/fmolb.2021.781307

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Li Z, Wu T, Zheng B, Chen L. Individualized precision treatment: targeting TAM in HCC. Cancer Lett. (2019) 458:86–91. doi: 10.1016/j.canlet.2019.05.019

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Liang R, Zhang J, Liu Z, Liu Z, Li Q, Luo X, et al. Mechanism and molecular network of RBM8A-mediated regulation of oxaliplatin resistance in hepatocellular carcinoma. Front Oncol. (2020) 10:585452. doi: 10.3389/fonc.2020.585452

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Shrestha R, Prithviraj P, Bridle KR, Crawford DHG, Jayachandran A. Combined inhibition of TGF-β1-induced EMT and PD-L1 silencing re-sensitizes hepatocellular carcinoma to sorafenib treatment. J Clin Med. (2021) 10:1889. doi: 10.3390/jcm10091889

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Zhang Y, Wu W, Sun Q, Ye L, Zhou D, Wang W. linc-ROR facilitates hepatocellular carcinoma resistance to doxorubicin by regulating TWIST1-mediated epithelial-mesenchymal transition. Mol Med Rep. (2021) 23:340. doi: 10.3892/mmr.2021.11979

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Zhao H, Cheng X, Yu J, Li Y. Stabilization of snail maintains the sorafenib resistance of hepatocellular carcinoma cells. Arch Biochem Biophys. (2021) 699:108754. doi: 10.1016/j.abb.2021.108754

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Su J, Song Q, Qasem S, O’Neill S, Lee J, Furdui CM, et al. Multi-omics analysis of brain metastasis outcomes following craniotomy. Front Oncol. (2020) 10:615472. doi: 10.3389/fonc.2020.615472

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Song Q, Su J. DSTG: deconvoluting spatial transcriptomics data through graph-based artificial intelligence. Brief Bioinform. (2021) 22:bbaa414. doi: 10.1093/bib/bbaa414

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: hepatocellular carcinoma, epithelial-mesenchymal transition, long non-coding RNA, prognostic signature, nomogram

Citation: Huang S, Li D, Zhuang L, Zhang J and Wu J (2022) Identification of an Epithelial-Mesenchymal Transition-Related Long Non-coding RNA Prognostic Signature to Determine the Prognosis and Drug Treatment of Hepatocellular Carcinoma Patients. Front. Med. 9:850343. doi: 10.3389/fmed.2022.850343

Received: 07 January 2022; Accepted: 02 May 2022;
Published: 24 May 2022.

Edited by:

Beatson Richard, University College London, United Kingdom

Reviewed by:

Chen Li, The First Affiliated Hospital of Nanchang University, China
Qiu Jinrong, Eastern Hepatobiliary Surgery Hospital, China

Copyright © 2022 Huang, Li, Zhuang, Zhang and Wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jianbing Wu, TmRlZnk5MzAwOEBuY3UuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.