- 1Department of Pancreatic and Gastric Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
- 2Department of General Surgery, Xuanwu Hospital, Capital Medical University, Beijing, China
- 3Department of General Surgery, The First People’s Hospital of Dongcheng District, Beijing, China
- 4Department of Cardiovascular Surgery, China-Japan Friendship Hospital, Beijing, China
- 5Department of Pathology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Weighted correlation network analysis (WGCNA) is a statistical method that has been widely used in recent years to explore gene co-expression modules. Competing endogenous RNA (ceRNA) is commonly involved in the cancer gene expression regulation mechanism. Some ceRNA networks are recognized in gastric cancer; however, the prognosis-associated ceRNA network has not been fully identified using WGCNA. We performed WGCNA using datasets from The Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression (GTEx) to identify cancer-associated modules. The criteria of differentially expressed RNAs between normal stomach samples and gastric cancer samples were set at the false discovery rate (FDR) < 0.01 and |fold change (FC)| > 1.3. The ceRNA relationships obtained from the RNAinter database were examined by both the Pearson correlation test and hypergeometric test to confirm the mRNA–lncRNA regulation. Overlapped genes were recognized at the intersections of genes predicted by ceRNA relationships, differentially expressed genes, and genes in cancer-specific modules. These were then used for univariate and multivariate Cox analyses to construct a risk score model. The ceRNA network was constructed based on the genes in this model. WGCNA-uncovered genes in the green and turquoise modules are those most associated with gastric cancer. Eighty differentially expressed genes were observed to have potential prognostic value, which led to the identification of 12 prognosis-related mRNAs (KIF15, FEN1, ZFP69B, SP6, SPARC, TTF2, MSI2, KYNU, ACLY, KIF21B, SLC12A7, and ZNF823) to construct a risk score model. The risk genes were validated using the GSE62254 and GSE84433 datasets, with 0.82 as the universal cutoff value. 12 genes, 12 lncRNAs, and 35 miRNAs were used to build a ceRNA network with 86 dysregulated lncRNA–mRNA ceRNA pairs. Finally, we developed a 12-gene signature from both prognosis-related and tumor-specific genes, and then constructed a ceRNA network in gastric cancer. Our findings may provide novel insights into the treatment of gastric cancer.
Introduction
Gastric cancer is a major cause of cancer-related mortality worldwide (Van Cutsem et al., 2016). It is a serious form of cancer characterized by limited chemotherapy regimens and complex patterns of tumorigenesis and progression in different subtypes (Erdem et al., 2018; Kubota et al., 2020). There have been exceptional advancements in the interpretation of the molecular pattern of gastric cancer through research projects including the Cancer Genome Atlas (TCGA) (Cancer Genome Atlas Research N, 2014) and the Asian Cancer Research Group (ACRG) (Cristescu et al., 2015) in recent years; however, current classifications are not sufficient to describe the vast differences in prognoses and summarize overall genomic characteristics, even for patients who are recognized as belonging to the same molecular subtypes.
Integrated analysis of transcriptomes is believed to provide peculiar insights into diseases; in this respect, weighted gene co-expression network analysis (WGCNA) may be the most popular approach for detecting co-expressed RNAs from RNA-seq data to microarray data (van Dam et al., 2018; Kakati et al., 2019). WGCNA can identify select groups of significant genes with similar biological functions and with strong correlations to specific traits. Many recent surveys have used WGCNA for both non-neoplastic and neoplastic diseases, including gastric cancer.
Salmena et al. (2011) speculated that the expression of many RNA transcripts is regulated by competing endogenous RNAs (ceRNAs) competing for the same sequences in miRNAs. They established the groundwork for a significant discovery of a communication network between coding and non-coding RNAs. Their theory has been supported by studies on the pathological processes of many malignancies, including breast, colon, and gastric cancers (Qi et al., 2015; Shuwen et al., 2018; Abdollahzadeh et al., 2019).
Thus, finding key genes that will serve as drug targets is crucial to the treatment of gastric cancer. In this study, we used WGCNA to construct a solid cancer-associated ceRNA network in gastric cancer for the first time. We hypothesized that identifying gene co-expression patterns would provide additional insight into disease-associated biological pathways. Finally, we explored a lncRNA–miRNA–mRNA network based on the survival-related hallmark genes of gastric cancer, providing candidate targets for its management and surveillance.
Materials and Methods
RNA-Sequencing and Microarray Data Collection
RNA-seq data were obtained from TCGA repository (https://portal.gdc.cancer.gov/) and the Genotype-Tissue Expression (GTEx) portal (https://www.gtexportal.org/) and sequenced on the Illumina HiSeq 2000 RNA Sequencing platform. TCGA offers a comprehensive database of cancer genomic profiles of specific cancer types. GTEx is another project that recruits postmortem donors without diseases, which has made genetic traits of healthy people open to the public. We performed a combined analysis of the stomach data from these two projects in the present study. The transcript per million (TPM) expression values for 625 stomach samples and the RNA-seq by Expectation Maximization (RSEM) expected counts for 624 stomach samples were preserved for further analysis (raw counts data of a patient did not exist).
The microarray data were downloaded from the Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/). The datasets were obtained from human gene expression microarray profiles of gastric cancer using fresh-frozen specimens. Finally, we included GSE62254 and GSE84433 for further analysis, which are the two largest datasets of microarray-based gene expression profiling. GSE62254 was profiled on the Affymetrix Human Genome U133 Plus 2.0 Array platform (Affymetrix, Inc., Santa Clara, CA, USA), including 300 gastric cancer tumor samples and 100 normal samples, which is the largest number of normal gastric samples among datasets (Cristescu et al., 2015).
We downloaded raw data (CEL files) generated by the Affymetrix platform. The R package oligo was utilized for format conversion, missing data filling, background correction, and data normalization (Parrish and Spencer , 2004). GSE84433 was profiled on the Illumina HumanHT-12 V3.0 Expression Beadchip (Illumina, Inc., San Diego, CA, USA); it includes 357 gastric cancer samples, and thus has the highest number of gastric cancer patients with survival information (Cheong et al., 2018). We downloaded data in the format of raw counts, performed quantile normalization, followed by log transformation.
Annotation of the above datasets was performed according to the different platforms using the official R package downloaded from Bioconductor (http://www.bioconductor.org/packages/). When the same RNA name appeared, the probe with the highest signal value was stored. To facilitate the analysis, only the overlapped RNAs qualified for survival-related ceRNA network construction. To keep the data updated, clinical and survival information of patients were obtained from the websites on March 5th, 2020. All original data were retrieved from the open database; thus, the documents of medical ethics were exempted as all had been approved when first published.
Differentially Expressed RNAs
The Limma package was used for screening differentially expressed RNAs (Ritchie et al., 2015). The RSEM expected counts retrieved from the GTEx database and TCGA database were utilized to discern differentially expressed RNAs between gastric cancer and control groups, which comprised normal stomach tissue both from dead non-cancerous subjects and adjacent stomach tissue in gastric cancer patients. For microarray-based profiling, expression values retrieved from GSE62254 were utilized to distinguish differentially expressed RNAs between 300 gastric cancer samples and 100 normal tissue samples. The R package Limma was used to process the data with the standard of false discovery rate (FDR) < 0.01, and |fold change (FC)| > 1.3. After filtering out RNAs with low expression values, the overlapping, differentially expressed RNAs were considered suitable for further analysis.
WGCNA
WGCNA is a bioinformatics method for dealing with high-throughput gene expression data, which can be used for the construction of a co-expression network (Langfelder and Horvath, 2008). The expression values of genes were preprocessed in the form of log2 (TPM + 0.001). The genes were then chosen in order of descending variance of their expression in the datasets. Finally, 13000 genes were put through WGCNA. Pairwise Pearson coefficients were used to assess the weighted co-expression relationships between all genes to produce an adjacency matrix. The least value for which the scale-free topology fit R^2 index > 0.75 was chosen as the soft-threshold power. Pearson coefficients were produced for all paired genes; thus, the co-expression matrix was rendered into an adjacency matrix using soft-threshold power. The soft-threshold power was selected according to the standard scale-free distribution. Scale-free co-expression networks were created with 30 RNAs as the minimal module size and 0.25 as the dendrogram cut height for module merging. The soft threshold was used to ensure a scale-free network. Genes with high correlations were clustered into the same module after forming a co-expression network.
Gene Function Analysis
Upregulated and downregulated differentially expressed genes were put into Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG), respectively. For the gene set enrichment analysis (GSEA) and KEGG–GSEA, all genes were incorporated into the analysis. A pathway (term) with an adjusted p-value < 0.05 was considered a functional enriched pathway (term) using the R package ClusterProfiler (Yu et al., 2012).
ceRNA Network
RNAinter was employed to observe the relationships among mRNA, lncRNA, and miRNA. RNAinter contains 24 databases demonstrated by experiment and 14 databases forecasted by calculation, including miRTarBase and starBase. We used lncRNA–miRNA relationships and mRNA–miRNA relationships with the least confidence score limit of 0.5 for further analysis (Lin et al., 2020). To ensure mRNA–lncRNA competing relationship pairs that have the shared miRNAs in gastric cancer, the Pearson correlation test and the hypergeometric test were utilized using the expression profiles. Quantile normalization and correction of batch effects between TCGA dataset and the GTEx dataset were performed for the expression profiles before further analysis (Leek et al., 2012). If the P-values of both tests were less than 0.05, the mRNA–lncRNA competing interaction pairs were saved to create the ceRNA network (Paci et al., 2014). The co-expression network with the overlapped lncRNA–miRNA–mRNA relationship was visualized with Cytoscape software (Su et al., 2014).
Construction and Validation of the Survival Model
Training datasets for the prognostic score system were performed according to the sequencing data from TCGA. Validation of the prognostic score system was performed on the two microarray datasets (GSE62254 and GSE84433). Patients who died within 30 postoperative days and without 30-day follow-up were excluded. Expression profiles of eligible gastric cancer patients were normalized and non-overlapping genes were removed from the analysis. Before survival analysis, the gene expression values in the datasets were processed with a standardization with 0 mean value and standard deviation of 1. Based on the mRNAs obtained in the clinical cancer-associated modules, univariate Cox regression analysis was performed to identify prognosis-related mRNAs; then, all the survival-related genes were used to perform multivariate Cox regression. The backward selection method was used to select the most suitable survival gene group to construct a risk score system (Donithan et al., 1992). The samples in the datasets were divided into high-risk and low-risk groups according to the universal cutoff value of their risk scores. Kaplan–Meier survival curves were used to evaluate correlations between the overall survival of the two groups in datasets using the survival package (Therneau, 2020).
Results
Differential Gene Expression Analysis
Sequencing data (TCGA + GTEx) and microarray data (GSE62254) were downloaded and processed as previously described. A total of 13007 genes and 292 lncRNAs were identified after screening out less-expressed ones. Thirty pathways were generated after the genes were applied to the KEGG–GSEA (Figure 1A). A total of 3895 differentially expressed genes and 79 lncRNAs were determined by comparing each gastric cancer group to both control groups. In total, 2347 upregulated and 1548 downregulated genes were subjected to GO term and KEGG pathway enrichment analyses. In the GO analysis for upregulated genes, neutrophil activation, neutrophil mediated immunity, and T cell activation were rendered as the top three terms of biological processes (BP); the chromosomal region, condensed chromosome, and centromeric region on chromosome were rendered as the top three terms of cellular components (CC); cell adhesion molecule binding, cadherin binding, and DNA helicase activity were rendered as the top three terms of molecular functions (MF) (Figure 1B). In the GO analysis for downregulated genes, regulation of neuron projection development, axonogenesis, and regulation of cell morphogenesis were rendered as the top three terms of the BP; the collagen-containing extracellular matrix, nuclear speck, and axon part were rendered as the top three terms of CC; actin binding, coenzyme binding, and extracellular matrix structural constituent were rendered as the top three terms of the MF (Figure 1C). In the KEGG analysis, human papillomavirus infection, human T−cell leukemia virus 1 infection, and Epstein−Barr virus infection were rendered as the top three enriched pathways for upregulated genes (Figure 1D); herpes simplex virus 1 infection, MAPK signaling, and oxytocin signaling were rendered as the top three pathways enriched for downregulated genes (Figure 1E).
Figure 1 Enrichment analysis of KEGG–GSEA, GO, and KEGG. (A) KEGG–GSEA after filtering out low-expressed genes. (B) GO terms enriched from upregulated genes. (C) GO terms enriched from downregulated genes. (D) KEGG pathways enriched from upregulated genes. (E) KEGG pathways enriched from downregulated genes.
WGCNA
WGCNA was performed to ascertain the most strongly cancer-associated genes. When the soft-power β was set to 4, the scale-free topology fit index was over 0.75 (Figure 2A). The created network included nine modules (Figure 2B). Figure 2C shows that the green module was recognized as the most specific module with a coefficient of correlation of 0.89 (p = 2 × 10−217); the turquoise module came in second with a coefficient of correlation of 0.61 (p = 8 × 10−65). Genes in both modules showed a high correlation with each other according to the heatmap of the topological overlap plot (Figure 2D).
Figure 2 WGCNA identification of cancer-associated RNA modules. (A) Graphs of soft-threshold power versus scale-free topology model Fit index and mean connectivity. Four was chosen as the appropriate soft-power. (B) Cluster dendrogram of the co-expression network modules created according to the dissimilarity of the topological overlap in the selected mRNAs. (C) Analysis of relationships between genes in modules between gastric cancer and normal samples. The green and turquoise modules were the most tumor-specific modules. (D) Heatmap plot of topological overlap in the mRNA network. Selected genes in the green and turquoise modules showed higher topological overlap. The gene dendrogram and the corresponding module are shown along the left and top.
Functional Analysis of the Green and Turquoise Modules
A total of 805 genes in the green module and 5213 genes in the turquoise module were subjected to KEGG–GSEA, generating 12 pathways (Figure 3A). The GO and KEGG enrichment results of 3312 upregulated differentially expressed genes in both modules are plotted in Figures 3B, C, in which enrichment results of the downregulated differentially expressed genes were not found. In total, 817 upregulated and eight downregulated differentially expressed genes were found to be qualified, with appropriate ceRNA relationships in the stomach, and appeared in the two cancer-associated modules (Figure 3D).
Figure 3 Functional analysis of genes in green and turquoise modules. (A) KEGG–GSEA for genes in the green and turquoise modules for signaling pathway analysis. (B) GO terms enriched from upregulated genes in the green and turquoise modules. (C) KEGG pathways enriched from upregulated genes in the green and turquoise modules. (D) 825 mRNAs recognized at the intersection of genes predicted, differentially expressed genes, and genes in the green and turquoise modules.
Survival Model Construction and Validation
The clinical and pathological data from the construction and validation cohorts are shown in Table 1. We used Cox univariate regression analysis for the 825 genes for identifying survival-related genes in 332 TCGA gastric cancer samples. The univariate analysis screened 83 predictors based on prognosis. Because three genes did not appear in GSE84433, 80 genes were included in the multivariate analysis, which further led to the identification of 12 upregulated mRNAs for constructing a risk score model, containing KIF15, FEN1, ZFP69B, SP6, SPARC, TTF2, MSI2, KYNU, ACLY, KIF21B, SLC12A7, and ZNF823 (Table 2). The risk scores for individual samples was calculated using the formula:
Risk score = (0.56321) × Expression Value (KIF15) + (−0.24228) × Expression Value (FEN1) + (−0.25944) × Expression Value (ZFP69B) + (−0.23267) × Expression Value (SP6) + (0.18278) × Expression Value (SPARC) + (−0.3753) × Expression Value (TTF2) + (−0.29728) × Expression Value (MSI2) + (0.29177) × Expression Value (KYNU) + (0.45439) × Expression Value (ACLY) + (−0.30168) × Expression Value (KIF21B) + (−0.23641) × Expression Value (SLC12A7) + (−0.22233) × Expression Value (ZNF823).
The forest plot of hazard ratios (HRs) is shown in Figure 4 using TCGA datasets. The HRs of KIF15 (HR = 1.756), SPARC (HR = 1.201), KYNU (HR = 1.339), and ACLY (HR = 1.575) were greater than 1; however, the HRs of FEN1 (HR=0.784), ZFP69B (HR = 0.772), SP6 (HR = 0.792), TTF2 (HR = 0.687), MSI2 (HR = 0.743), KIF21B (HR = 0.740), SLC12A7 (HR = 0.790), and ZNF82 (HR = 0.801) were less than 1. We further discovered that the best cutoff risk score to differentiate low-risk from high-risk groups was 0.82. The risk score for each individual were calculated and was categorized into two groups according to the cutoff value. Kaplan-Meier analysis showed that there was a significant difference between high-risk (n=51) and low-risk patients (n=281) in the training dataset (log-rank test, p < 0.0001, Figure 5A). Risk stratification, survival information, and expression values of 12 genes of 332 patients were shown in the risk score panel. We observed that both the survival information and the 12-gene expression of patients in the high-risk group varied from those in the low-risk group (Figure 5D). Using GSE62254 and GSE84433 as validation datasets, the survival curves (Figures 5B, C) and the risk score panels (Figures 5E, F) between the high-risk groups and the low-risk groups were distinctively different. The results in the validation datasets were similar to those in the training dataset, therefore robustness of the 12-gene signature risk score system in predicting sample risk was supported.
Figure 5 Validation of the 12 prognosis-related gene risk score system. (A) Kaplan–Meier survival curves of the training group using 0.82 as the cutoff value. (B) Kaplan–Meier survival curves of the GSE25544 dataset using 0.82 as the cutoff value. (C) Kaplan–Meier survival curves of the GSE83344 based on the 12 prognosis-related genes using 0.82 as the cutoff value. (D) 12-gene signature risk score panel for the gene signature in the training group. (E) 12-gene signature risk score panel for the gene signature in GSE25544. (F) 12-gene signature risk score panel for the gene signature in GSE83344.
Cerna Network Construction
We narrowed interesting lncRNAs to an intersection of lncRNAs predicted by the 12-mRNA model and differentially expressed lncRNAs, with eight upregulated lncRNAs (OIP5-AS1, MCF2L-AS1, TMPO-AS1, HCP5, DLEU1, PPP1R26-AS1, DLEU2, and ZFAS1) and four downregulated lncRNAs (SH3BP5-AS1, CCDC18-AS1, TTC28-AS1, and TRG-AS1) identified. In total, 35 miRNAs and 12 mRNAs in the risk score system, and 12 differently expressed lncRNAs were identified to create the ceRNA network. The correlation of 12 mRNAs and 12 lncRNAs was confirmed by both the Pearson correlation test and the hypergeometric test (Table 3). A ceRNA network was constructed using 86 dysregulated lncRNA–mRNA–ceRNA pairs (Figure 6).
Table 3 Pearson correlation tests and hypergeometric tests of the candidate lncRNA-miRNA-mRNA competing endogenous RNA pairs.
Discussion
Currently, although there have been some investigations concentrating on non-coding RNA-involving gene expression regulation networks in gastric cancer, our research is the first to employ WGCNA to produce a co-expression network of lncRNAs–miRNAs–mRNAs in gastric cancer. Compared with the previous risk score-based system in gastric cancer (Cho et al., 2011; Cheong et al., 2018; Duan et al., 2019; Hu et al., 2019), the advantages of our ceRNA network are that it provided 12 both cancer-associated and prognosis-related genes and was constructed using rigorous calculation of the ceRNA regulation relationships.
KIF15 is a gene involved in immune diseases and cancer progression. Whether it is mutated is related to the sensitivity of immune checkpoint inhibitors. Győrffy et al. reported that in both wild-type PIK3CA patient groups, individuals with KIF15 mutations displayed substantially increased expression of PD-L1, whereas those without mutations displayed decreased expression (Menyhart et al., 2018). Zhang et al. found that knockdown of KIF15 resulted in mitochondrial damage and ROS-JNK-p53 axis activation, thus promoting apoptosis and inhibiting cell proliferation in gastric cancer cells (Tao et al., 2020). The DNA replication and repair pathway is an important mechanism in gastric cancer. FEN1 plays an important role in apoptotic fragmentation of DNA, maintenance of telomere stability, and rescue of stalled replication forks (Shen et al., 2005; Zheng et al., 2011). Unsurprisingly, genetic variants and changes in the expression of FEN1 will alter patients’ sensitivity to chemotherapy and prognosis (Liu et al., 2012; Wang et al., 2014; Xie et al., 2016). The role of SPARC in gastric cancer has not been fully elucidated, although its diagnostic and prognostic value has been confirmed by multiple studies (Zhao et al., 2010; Liao et al., 2018). SPARC can act as both an inhibitor and promoter in cancer (Tai and Tang, 2008; Said, 2016; Li et al., 2019); the real effect of SPARC can be altered by other genes owing to its epistatic effects (Chen et al., 2018). However, some oncologists have shown that SPARC is mostly produced by gastric cancer-associated fibroblasts rather than gastric cancer cells (Ma et al., 2018; Ma et al., 2019). Thus, the concrete mechanism of SPARC in gastric cancer needs further investigation. MSI2 is an oncogene associated with differentiation, resulting in the preservation of cancer stem cells. According to a previous study based on microarray and RT-PCR, MSI2 expression increased slightly relative to normal tissue, but it is still used as a biomarker for gastric cancer (Emadi-Baygi et al., 2013; Yang et al., 2019). SLC12A7 (Solute Carrier Family 12 Member 7) acts as a potassium/chloride co-transporter for maintaining a stable osmotic pressure, which can be activated by insulin-like growth factor (IGF) resulting in cell invasion and progression in breast cancer, adrenocortical cancer, cervical cancer, and ovarian cancer (Shen et al., 2004; Hsu et al., 2007; Chen et al., 2009; Brown et al., 2019). It has been reported that amplification of SLC12A7 is mainly within HER2− patient samples in gastric cancer, while the precise mechanism of SLC12A7 is still unknown (Zhou et al., 2018). ACLY is the integral enzyme responsible for the formation of cytosolic acetyl-CoA, and high expression of ACLY was shown to be associated with a poor prognosis (Qian et al., 2015). Citrate and inhibitors of ACLY can reduce its expression, thus protecting against gastric cancer cell progression (Guo et al., 2016; Icard et al., 2020). In addition, ACLY can be downregulated by miR-133b via PPARγ (Cheng et al., 2017) and lncRNA FLJ22763 (Zhang et al., 2019) in gastric cancer.
The functions of ZNF823, ZFP69B, SP6, KYNU, KIF21B, and TTF2 have not been reported in gastric cancer. KYNU is a pyridoxal-5´-phosphate dependent enzyme that catalyzes the cleavage of kynurenine into anthranilic acid (AA) and the cleavage of 3-hydroxykynurenine (3-HK) into 3-hydroxyanthranilic acids (3-HAA) (Badawy, 2017). Drugs have been developed to regulate the expression of KYNU to suppress tumor growth through the Kynurenine pathways, such as in breast cancer (Liu et al., 2019; Liu et al., 2020) and melanoma (Rad et al., 2019). It has been demonstrated that TTF2 is able to terminate RNA polymerase II transcription, which has an important function in promoting chromosome segregation and altering protein-DNA interactions (Jiang and Price, 2004; Jiang et al., 2004; Cheng et al., 2012). KIF21B is an ATP-dependent microtubule-based motor protein that participates in the intracellular transfer of membranous organelles. KIF21B is a potential oncogene that resists the induction of apoptosis and facilitates malignant tumorigenesis, tumor development, intrusion, and metastasis. Patients with high expression of KIF21B have been demonstrated to have a poorer prognosis in hepatocellular carcinoma (Zhao et al., 2020) and non-small cell lung cancer (Sun et al., 2020). SP6 is an important gene that regulates odontogenesis, belonging to a family of transcription factors that contain 3 classical zinc finger DNA-binding domains(Aurrekoetxea et al., 2016; Smith et al., 2020; Nakamura et al., 2020).
In summary, we identified a survival-related gene-based ceRNA network using the WGCNA algorithm, and the constructed lncRNA–miRNA–mRNA ceRNA interactive network will probably provide a basis for additional inspection of the regulatory mechanisms of gastric cancer. Gastric cancer has high heterogeneity among its different histological and molecular subtypes. Thus, while we have selected the expression profiles with the largest sample size that we could obtain currently, we must admit that further experimental works and large cohorts are needed to verify our results and elucidate the prognostic value of ceRNA networks in gastric cancer.
Data Availability Statement
The datasets analyzed for this study can be found in The Cancer Genome Atlas (TCGA) data portal (https://portal.gdc.cancer.gov/), the Genotype-Tissue Expression (GTEx) portal. (https://www.gtexportal.org/home/index.html), and the Gene Expression Omnibus (GEO) repository (https://www.ncbi.nlm.nih.gov/geo/).
Author Contributions
XZ and YX contributed to study conception. XZ, XW, LZ, and HZ contributed to data collection and analysis. XZ, WL, BW, LX, and YT contributed to manuscript writing. All authors contributed to the article and approved the submitted version.
Funding
The study was supported by the CAMS Initiative for Innovative Medicine (2016-I2M-1-007) and China International Medical Foundation (CIMF-F-H001-314).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
Abdollahzadeh, R., Daraei, A., Mansoori, Y., Sepahvand, M., Amoli, M. M., Tavakkoly-Bazzaz, J., et al. (2019). (ceRNA) cross talk and language in ceRNA regulatory networks: a new look at hallmarks of breast cancer. J. Cell. Physiol. 234 (7), 10080–10100. doi: 10.1002/jcp.27941
Aurrekoetxea, M., Irastorza, I., Garcia-Gallastegui, P., Jimenez-Rojo, L., Nakamura, T., Yamada, Y., et al. (2016). Wnt/beta-Catenin Regulates the Activity of Epiprofin/Sp6, SHH, FGF, and BMP to Coordinate the Stages of Odontogenesis. Front. Cell Dev. Biol. 4, 25. doi: 10.3389/fcell.2016.00025
Badawy, A. A. (2017). Kynurenine Pathway of Tryptophan Metabolism: Regulatory and Functional Aspects. Int. J. Tryptophan. Res. 10, 1178646917691938. doi: 10.1177/1178646917691938
Brown, T. C., Nicolson, N. G., Stenman, A., Juhlin, C. C., Gibson, C. E., Callender, G. G., et al. (2019). Insulin-Like Growth Factor and SLC12A7 Dysregulation: A Novel Signaling Hallmark of Non-Functional Adrenocortical Carcinoma. J. Am. Coll. Surg. 229 (3), 305–315. doi: 10.1016/j.jamcollsurg.2019.04.018
Cancer Genome Atlas Research N (2014). Comprehensive molecular characterization of gastric adenocarcinoma. Nature 513 (7517), 202–209. doi: 10.1038/nature13480
Chen, Y. F., Chou, C. Y., Wilkins, R. J., Ellory, J. C., Mount, D. B., Shen, M. R. (2009). Motor protein-dependent membrane trafficking of KCl cotransporter-4 is important for cancer cell invasion. Cancer Res. 69 (22), 8585–8593. doi: 10.1158/0008-5472.CAN-09-2284
Chen, L. Z., He, C. Y., Su, X., Peng, J. L., Chen, D. L., Ye, Z., et al. (2018). SPP1 rs4754 and its epistatic interactions with SPARC polymorphisms in gastric cancer susceptibility. Gene 640, 43–50. doi: 10.1016/j.gene.2017.09.053
Cheng, B., Li, T., Rahl, P. B., Adamson, T. E., Loudas, N. B., Guo, J., et al. (2012). Functional association of Gdown1 with RNA polymerase II poised on human genes. Mol. Cell 45 (1), 38–50. doi: 10.1016/j.molcel.2011.10.022
Cheng, Y., Jia, B., Wang, Y., Wan, S. (2017). miR-133b acts as a tumor suppressor and negatively regulates ATP citrate lyase via PPARγ in gastric cancer. Oncol. Rep. 38 (5), 3220–3226. doi: 10.3892/or.2017.5944
Cheong, J. H., Yang, H. K., Kim, H., Kim, W. H., Kim, Y. W., Kook, M. C., et al. (2018). Predictive test for chemotherapy response in resectable gastric cancer: a multi-cohort, retrospective analysis. Lancet Oncol. 19 (5), 629–638. doi: 10.1016/S1470-2045(18)30108-6
Cho, J. Y., Lim, J. Y., Cheong, J. H., Park, Y. Y., Yoon, S. L., Kim, S. M., et al. (2011). Gene expression signature-based prognostic risk score in gastric cancer. Clin. Cancer Res. 17 (7), 1850–1857. doi: 10.1158/1078-0432.CCR-10-2180
Cristescu, R., Lee, J., Nebozhyn, M., Kim, K. M., Ting, J. C., Wong, S. S., et al. (2015). Molecular analysis of gastric cancer identifies subtypes associated with distinct clinical outcomes. Nat. Med. 21 (5), 449–456. doi: 10.1038/nm.3850
Donithan, M., Van Natta, M., Tonascia, J. (1992). Cox regression analyses using SAS/PHREG. Controlled Clin. Trials 13 (5), 420. doi: 10.1016/0197-2456(92)90146-Q
Duan, S., Wang, P., Liu, F., Huang, H., An, W., Pan, S., et al. (2019). Novel immune-risk score of gastric cancer: A molecular prediction model combining the value of immune-risk status and chemosensitivity. Cancer Med. 8 (5), 2675–2685. doi: 10.1002/cam4.2077
Emadi-Baygi, M., Nikpour, P., Mohammad-Hashem, F., Maracy, M. R., Haghjooy-Javanmard, S. (2013). MSI2 expression is decreased in grade II of gastric carcinoma. Pathol. Res. Pract. 209 (11), 689–691. doi: 10.1016/j.prp.2013.07.008
Erdem, G. U., Bozkaya, Y., Ozdemir, N. Y., Demirci, N. S., Yazici, O., Zengin, N. (2018). 5-fluorouracil, leucovorin, and irinotecan (FOLFIRI) as a third-line chemotherapy treatment in metastatic gastric cancer, after failure of fluoropyrimidine, platinum, anthracycline, and taxane. Bosn. J. Basic Med. Sci. 18 (2), 170–177. doi: 10.17305/bjbms.2017.2258
Guo, X., Zhang, X., Wang, T., Xian, S., Lu, Y. (2016). 3-Bromopyruvate and sodium citrate induce apoptosis in human gastric cancer cell line MGC-803 by inhibiting glycolysis and promoting mitochondria-regulated apoptosis pathway. Biochem. Biophys. Res. Commun. 475 (1), 37–43. doi: 10.1016/j.bbrc.2016.04.151
Hsu, Y. M., Chou, C. Y., Chen, H. H., Lee, W. Y., Chen, Y. F., Lin, P. W., et al. (2007). IGF-1 upregulates electroneutral K-Cl cotransporter KCC3 and KCC4 which are differentially required for breast cancer cell proliferation and invasiveness. J. Cell Physiol. 210 (3), 626–636. doi: 10.1002/jcp.20859
Hu, Z., Yang, D., Tang, Y., Zhang, X., Wei, Z., Fu, H., et al. (2019). Five−long non−coding RNA risk score system for the effective prediction of gastric cancer patient survival. Oncol. Lett. 17 (5), 4474–4486. doi: 10.3892/ol.2019.10124
Icard, P., Wu, Z., Fournel, L., Coquerel, A., Lincet, H., Alifano, M. (2020). ATP citrate lyase: A central metabolic enzyme in cancer. Cancer Lett. 471, 125–134. doi: 10.1016/j.canlet.2019.12.010
Jiang, Y., Price, D. H. (2004). Rescue of the TTF2 knockdown phenotype with an siRNA-resistant replacement vector. Cell Cycle. 3 (9), 1151–1153. doi: 10.4161/cc.3.9.1151
Jiang, Y., Liu, M., Spencer, C. A., Price, D. H. (2004). Involvement of transcription termination factor 2 in mitotic repression of transcription elongation. Mol. Cell. 14 (3), 375–385. doi: 10.1016/S1097-2765(04)00234-5
Kakati, T., Bhattacharyya, D. K., Barah, P., Kalita, J. K. (2019). Comparison of Methods for Differential Co-expression Analysis for Disease Biomarker Prediction. Comput. Biol. Med. 113, 103380. doi: 10.1016/j.compbiomed.2019.103380
Kubota, Y., Kawazoe, A., Sasaki, A., Mishima, S., Sawada, K., Nakamura, Y., et al. (2020). The Impact of Molecular Subtype on Efficacy of Chemotherapy and Checkpoint Inhibition in Advanced Gastric Cancer. Clin. Cancer Res. doi: 10.1158/1078-0432.CCR-20-0075
Langfelder, P., Horvath, S. (2008). WGCNA: an R package for weighted correlation network analysis. BMC Bioinf. 9 (1), 559. doi: 10.1186/1471-2105-9-559
Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E., Storey, J. D. (2012). The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28 (6), 882–883. doi: 10.1093/bioinformatics/bts034
Li, L., Zhu, Z., Zhao, Y., Zhang, Q., Wu, X., Miao, B., et al. (2019). FN1, SPARC, and SERPINE1 are highly expressed and significantly related to a poor prognosis of gastric adenocarcinoma revealed by microarray and bioinformatics. Sci. Rep. 9 (1), 7827. doi: 10.1038/s41598-019-43924-x
Liao, P., Li, W., Liu, R., Teer, J. K., Xu, B., Zhang, W., et al. (2018). Genome-scale analysis identifies SERPINE1 and SPARC as diagnostic and prognostic biomarkers in gastric cancer. Onco. Targets Ther. 11, 6969–6980. doi: 10.2147/OTT.S173934
Lin, Y., Liu, T., Cui, T., Wang, Z., Zhang, Y., Tan, P., et al. (2020). RNAInter in 2020: RNA interactome repository with increased coverage and annotation. Nucleic Acids Res. 48 (D1), D189–DD97. doi: 10.1093/nar/gkz804
Liu, L., Zhou, C., Zhou, L., Peng, L., Li, D., Zhang, X., et al. (2012). Functional FEN1 genetic variants contribute to risk of hepatocellular carcinoma, esophageal cancer, gastric cancer and colorectal cancer. Carcinogenesis 33 (1), 119–123. doi: 10.1093/carcin/bgr250
Liu, Y., Feng, X., Lai, J., Yi, W., Yang, J., Du, T., et al. (2019). A novel role of kynureninase in the growth control of breast cancer cells and its relationships with breast cancer. J. Cell Mol. Med. 23 (10), 6700–6707. doi: 10.1111/jcmm.14547
Liu, Q., Zhai, J., Kong, X., Wang, X., Wang, Z., Fang, Y., et al. (2020). Comprehensive Analysis of the Expression and Prognosis for TDO2 in Breast Cancer. Mol. Ther. Oncolytics 17, 153–168. doi: 10.1016/j.omto.2020.03.013
Ma, Y., Zhu, J., Chen, S., Li, T., Ma, J., Guo, S., et al. (2018). Activated gastric cancer-associated fibroblasts contribute to the malignant phenotype and 5-FU resistance via paracrine action in gastric cancer. Cancer Cell Int. 18, 104. doi: 10.1186/s12935-018-0599-7
Ma, Y., Zhu, J., Chen, S., Ma, J., Zhang, X., Huang, S., et al. (2019). Low expression of SPARC in gastric cancer-associated fibroblasts leads to stemness transformation and 5-fluorouracil resistance in gastric cancer. Cancer Cell Int. 19 (1), 137. doi: 10.1186/s12935-019-0844-8
Menyhart, O., Pongor, L. S., Gyorffy, B. (2018). Mutations Defining Patient Cohorts With Elevated PD-L1 Expression in Gastric Cancer. Front. Pharmacol. 9 (1522), 1522. doi: 10.3389/fphar.2018.01522
Nakamura, T., Iwamoto, T., Nakamura, H. M., Shindo, Y., Saito, K., Yamada, A., et al. (2020). Regulation of miR-1-Mediated Connexin 43 Expression and Cell Proliferation in Dental Epithelial Cells. Front. Cell Dev. Biol. 8 (156):156. doi: 10.3389/fcell.2020.00156
Paci, P., Colombo, T., Farina, L. (2014). Computational analysis identifies a sponge interaction network between long non-coding RNAs and messenger RNAs in human breast cancer. BMC Syst. Biol. 8, 83. doi: 10.1186/1752-0509-8-83
Parrish, R. S., Spencer, H. J., III. (2004). Effect of normalization on significance testing for oligonucleotide microarrays. J. Biopharm. Stat. 14 (3), 575–589. doi: 10.1081/BIP-200025650
Qi, X., Zhang, D. H., Wu, N., Xiao, J. H., Wang, X., Ma, W. (2015). ceRNA in cancer: possible functions and clinical implications. J. Med. Genet. 52 (10), 710–718. doi: 10.1136/jmedgenet-2015-103334
Qian, X., Hu, J., Zhao, J., Chen, H. (2015). ATP citrate lyase expression is associated with advanced stage and prognosis in gastric adenocarcinoma. Int. J. Clin. Exp. Med. 8 (5), 7855–7860.
Rad, P. S., Morikawa, H., Kiani, N. A., Yang, M., Azimi, A., Shafi, G., et al. (2019). Exhaustion of CD4+ T-cells mediated by the Kynurenine Pathway in Melanoma. Sci. Rep. 9 (1), 12150. doi: 10.1038/s41598-019-48635-x
Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., et al. (2015). limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43 (7), e47. doi: 10.1093/nar/gkv007
Salmena, L., Poliseno, L., Tay, Y., Kats, L., Pandolfi, P. P. (2011). A ceRNA hypothesis: the Rosetta Stone of a hidden RNA language? Cell 146 (3), 353–358. doi: 10.1016/j.cell.2011.07.014
Shen, M. R., Lin, A. C., Hsu, Y. M., Chang, T. J., Tang, M. J., Alper, S. L., et al. (2004). Insulin-like growth factor 1 stimulates KCl cotransport, which is necessary for invasion and proliferation of cervical cancer and ovarian cancer cells. J. Biol. Chem. 279 (38), 40017–40025. doi: 10.1074/jbc.M406706200
Shen, B., Singh, P., Liu, R., Qiu, J., Zheng, L., Finger, L. D., et al. (2005). Multiple but dissectible functions of FEN-1 nucleases in nucleic acid processing, genome stability and diseases. Bioessays 27 (7), 717–729. doi: 10.1002/bies.20255
Shuwen, H., Qing, Z., Yan, Z., Xi, Y. (2018). Competitive endogenous RNA in colorectal cancer: A systematic review. Gene 645, 157–162. doi: 10.1016/j.gene.2017.12.036
Smith, C. E., Whitehouse, L. L., Poulter, J. A., Wilkinson Hewitt, L., Nadat, F., Jackson, B. R., et al. (2020). A missense variant in specificity protein 6 (SP6) is associated with amelogenesis imperfecta. Hum. Mol. Genet. 29 (9), 1417–1425. doi: 10.1093/hmg/ddaa041
Su, G., Morris, J. H., Demchak, B., Bader, G. D. (2014). Biological network exploration with Cytoscape 3. Curr. Protoc. Bioinf. 47 (1), 8. doi: 10.1002/0471250953.bi0813s47
Sun, Z. G., Pan, F., Shao, J. B., Yan, Q. Q., Lu, L., Zhang, N. (2020). Kinesin superfamily protein 21B acts as an oncogene in non-small cell lung cancer. Cancer Cell Int. 20, 233. doi: 10.1186/s12935-020-01323-7
Tai, I. T., Tang, M. J. (2008). SPARC in cancer biology: its role in cancer progression and potential for therapy. Drug Resist. Updat. 11 (6), 231–246. doi: 10.1016/j.drup.2008.08.005
Tao, J., Sun, G., Li, Q., Zhi, X., Li, Z., He, Z., et al. (2020). KIF15 promotes the evolution of gastric cancer cells through inhibition of reactive oxygen species-mediated apoptosis. J. Cell. Physiol. doi: 10.1002/jcp.29743
Therneau, T. M. (2020). A package for survival analysis in S. version 3.2-3. Available at: https://CRAN.R-project.org/package=survival
Van Cutsem, E., Sagaert, X., Topal, B., Haustermans, K., Prenen, H. (2016). Gastric cancer. Lancet 388 (10060), 2654–2664. doi: 10.1016/S0140-6736(16)30354-3
van Dam, S., Vosa, U., van der Graaf, A., Franke, L., de Magalhaes, J. P. (2018). Gene co-expression analysis for functional classification and gene-disease predictions. Brief Bioinform. 19 (4), 575–592. doi: 10.1093/bib/bbw139
Wang, K., Xie, C., Chen, D. (2014). Flap endonuclease 1 is a promising candidate biomarker in gastric cancer and is involved in cell proliferation and apoptosis. Int. J. Mol. Med. 33 (5), 1268–1274. doi: 10.3892/ijmm.2014.1682
Xie, C., Wang, K., Chen, D. (2016). Flap endonuclease 1 silencing is associated with increasing the cisplatin sensitivity of SGC−7901 gastric cancer cells. Mol. Med. Rep. 13 (1), 386–392. doi: 10.3892/mmr.2015.4567
Yang, Z., Li, J., Shi, Y., Li, L., Guo, X. (2019). Increased musashi 2 expression indicates a poor prognosis and promotes malignant phenotypes in gastric cancer. Oncol. Lett. 17 (3), 2599–2606. doi: 10.3892/ol.2019.9889
Yu, G., Wang, L.-G., Han, Y., He, Q.-Y. (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. Omics: J. Integr. Biol. 16 (5), 284–287. doi: 10.1089/omi.2011.0118
Zhang, G., Wang, Q., Lu, J., Ma, G., Ge, Y., Chu, H., et al. (2019). Long non-coding RNA FLJ22763 is involved in the progression and prognosis of gastric cancer. Gene 693, 84–91. doi: 10.1016/j.gene.2019.01.028
Zhao, Z.-S., Wang, Y.-Y., Chu, Y.-Q., Ye, Z.-Y., Tao, H.-Q. (2010). SPARC Is Associated with Gastric Cancer Progression and Poor Survival of Patients. Clin. Cancer Res. 16 (1), 260–268. doi: 10.1158/1078-0432.CCR-09-1247
Zhao, H. Q., Dong, B. L., Zhang, M., Dong, X. H., He, Y., Chen, S. Y., et al. (2020). Increased KIF21B expression is a potential prognostic biomarker in hepatocellular carcinoma. World J. Gastrointest. Oncol. 12 (3), 276–288. doi: 10.4251/wjgo.v12.i3.276
Zheng, L., Jia, J., Finger, L. D., Guo, Z., Zer, C., Shen, B. (2011). Functional regulation of FEN1 nuclease and its link to cancer. Nucleic Acids Res. 39 (3), 781–794. doi: 10.1093/nar/gkq884
Keywords: gastric cancer, weighted correlation network analysis, competing endogenous RNA, risk score, The Cancer Genome Atlas, Genotype-Tissue Expression, Gene Expression Omnibus
Citation: Zheng X, Wang X, Zheng L, Zhao H, Li W, Wang B, Xue L, Tian Y and Xie Y (2020) Construction and Analysis of the Tumor-Specific mRNA–miRNA–lncRNA Network in Gastric Cancer. Front. Pharmacol. 11:1112. doi: 10.3389/fphar.2020.01112
Received: 03 June 2020; Accepted: 08 July 2020;
Published: 21 July 2020.
Edited by:
Dong-Hua Yang, St. John‘s University, United StatesReviewed by:
Hua Zhu, The Ohio State University, United StatesShanzhi Wang, University of Arkansas at Little Rock, United States
Copyright © 2020 Zheng, Wang, Zheng, Zhao, Li, Wang, Xue, Tian and Xie. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yibin Xie, eWliaW54aWVfMjAwM0AxNjMuY29t
†These authors share first authorship