Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 23 July 2021
Sec. Cancer Genetics
This article is part of the Research Topic Non-coding RNA as Prognostic and Diagnostic Biomarkers in Thoracic Oncology View all 16 articles

Blood Circulating miRNA Pairs as a Robust Signature for Early Detection of Esophageal Cancer

Yang SongYang SongSuzhu ZhuSuzhu ZhuNing ZhangNing ZhangLixin Cheng*Lixin Cheng*
  • Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China

Esophageal cancer (EC) is a common malignant tumor in the digestive system which is often diagnosed at the middle and late stages. Noninvasive diagnosis using circulating miRNA as biomarkers enables accurate detection of early-stage EC to reduce mortality. We built a diagnostic signature consisting of four miRNA pairs for the early detection of EC using individualized Pairwise Analysis of Gene Expression (iPAGE). Profiling of miRNA expression identified 496 miRNA pairs with significant relative expression change. Four miRNA pairs consistently selected from LASSO were used to construct the final diagnostic model. The performance of the signature was validated using two independent datasets, yielding both AUCs and PRCs over 0.99. Furthermore, precision, recall, and F-score were also evaluated for clinical application, when a fixed threshold is given, resulting in all the scores are larger than 0.92 in the training set, test set, and two validation sets. Our results suggested that the 4-miRNA signature is a new biomarker for the early diagnosis of patients with EC. The clinical use of this signature would have improved the detection of EC for earlier therapy and more favorite prognosis.

Introduction

Epidemiological data have indicated that esophageal cancer (EC), a common malignant tumor in the digestive system, is the sixth cause of tumor-related death, and is also accompanied by an increasing incidence and mortality worldwide (1). Every year, over 300,000 people die from EC and the number is up to 150,000 and in China (1). Despite the advances in surgical techniques and chemoradiotherapy strategies have extensively improved the prognosis of EC patients, EC remains a deadly cancer of the gastrointestinal tract. Because of its insidious onset, the diagnosis of EC is usually at an advanced stage. Therefore, finding effective biomarkers for the early diagnosis of EC has great significance.

High-throughput technologies have revolutionized non-invasive diagnosis in medical research by the parallel analysis of thousands of molecules in cells or body fluids, including proteins, microbes, coding and non-coding RNAs, etc. (26). Non-coding RNAs (ncRNAs) regulate gene transcription and recently are emerging as a novel therapeutic targets and promising biomarkers for disease diagnosis and prognosis (712). MicroRNAs (miRNAs) are a type of small and highly conserved non-coding RNAs with 18–25 nucleotides in length (13, 14). miRNAs could broadly inhibit the expression of target messenger RNAs (mRNAs) and affect the fundamental cellular and physiological functions in humans (15). In recent years, numerous studies have indicated that miRNAs play as pivotal regulators in the tumorigenesis, progression, proliferation, and metastasis of various cancers, including EC (16, 17). Despite many miRNAs potentially important to cancers are yet to be characterized, their expression patterns have shown their non-invasive diagnosis ability in detecting and monitoring cancer progression (18).

Several studies have investigated the value of circulating miRNAs as potential biomarkers for the early screening of EC (16). Notably, for the gene transcriptome data, it is usually preprocessed using a series of steps, including background correction, signal normalization, and gene summarization (1921). For each step, several candidate algorithms are available based on different assumptions of data distribution. For instance, the quantile normalization assumes all samples have identical distribution regardless of the sample heterogeneity and conditions, such as cancer and normal (20). However, this most commonly used assumption only holds true when a small fraction of genes are dysregulated. In fact, a considerable fraction of genes are differently expressed in cancer samples due to the very different expression distribution of genes between the cancer and non-cancer samples (20, 21).

Previously we proposed a feature selection method, individualized Pairwise Analysis of Gene Expression (iPAGE) (22), to reduce the mRNA and lncRNA dimension, which is more suitable for the high dimensional miRNA data due to its high simplicity and efficiency. The relative expression change of a pair of genes are considered and only the gene pairs with significant alterations between the detecting groups are remained for further analysis, instead of the single genes with differential expression. Based on a stringent selection criterion, only a few gene pairs are refined and it benefits a lot for the subsequent step of model construction. Currently, we are using the iPAGE strategy for several directions on the forefront of genetic science to come up with more sophisticated results in terms of methylome and single-cell RNA-seq.

The iPAGE strategy fits miRNA expressions well and it is useful in machine learning where complex number systems determine what the computer “learns” or “knows” (2, 23, 24). In this study, we identified a four-miRNA pair signature for the early diagnosis of EC using iPAGE. The performance of the signature was validated using two independent datasets, and it outperformed the other state-of-art biomarkers in both ROC and PRC.

Materials and Methods

miRNA Expression Data

The miRNA expression datasets used in this study were downloaded from the Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) database. Using the keywords “esophageal cancer” and “serum” for human miRNA dataset searching, we obtained three datasets GSE122497, GSE106817, and GSE112264 (16, 2527). All these three datasets were detected using the 3D-Gene Human miRNA V21_1.0.0 platform (GPL21263). More detailed description for each dataset was listed in Table 1. No normalization was carried out and only the raw data were used for miRNA pair selection. For the 6-miRNA signature built by Sudo et al. using miRNA expression values (16), the data were normalized using the Robust Multichip Average (RMA) algorithm (28).

TABLE 1
www.frontiersin.org

Table 1 miRNA microarray data sets used in this study.

Detection of miRNA Pairs

The dataset GSE122497 contained 566 samples with esophageal squamous cell carcinoma and 4,965 non-cancer samples as controls. 70% of these samples were assigned as the training set and the other 30% samples were set as the test set (Figure 1A). Then, the individualized Pairwise Analysis of Gene Expression (iPAGE) strategy was used for feature selection. All possible miRNA pairs were constructed and the reverse pairs with significant relative expression changes were kept for subsequent analysis. The reverse pairs were defined as the expression abundance of the first miRNA consistently larger than the second one in at least 90% of the cancer samples and the first miRNA smaller than the second one in more than 90% of the control samples. In addition to 0.9 defined as the reverse rate, another threshold of 95% was also used for comparison in this study.

FIGURE 1
www.frontiersin.org

Figure 1 Identification of miRNA pair signature. (A) Workflow of this study. (B) Binary matrix with rows represent miRNA pairs and columns represent the result of LASSO. The black grid corresponds to the selected pairs. iPAGE, individualized Pair Analysis of Gene Expression. LASSO, Least absolute shrinkage and selection operator.

Model Construction

The reverse miRNA pairs selected in the previous step served as candidate markers for the diagnostic signature. Next, these pairs were further refined using least absolute shrinkage and selection operator (LASSO), resulting in a penal of miRNA pairs with assigned coefficients or contribution weights. Hereafter, we named the penal as miRNA pair signature. Considering the results of LASSO are different when set different seeds, we performed LASSO 100 times and utilized the common miRNA pairs to construct the final diagnostic model (Figure 1A).

Performance Evaluation

We evaluated the performance of the miRNA pair signature using both Receiver Operating Characteristic (ROC) curve and Precision-Recall Curves (PRC) on the test set and two independent validation sets, GSE106817 and GSE112264. Measurements of precision, recall, and F-score were also used for evaluation, which were calculated as follows,

Pression=TP/(TP+FP),Recall=TP/(TP+FN),Fscore=(2RecallPrecision)/(Recall+Pression),

where TP, TN, FP, and FN denote the number of true positives, true negatives, false positives, and false negatives, respectively. All the above calculations were conducted using R 4.0.3.

Results

Data Collection

Circulating miRNAs can be stably detected in serum and serve as potential biomarkers in the non-invasive diagnosis of cancers. To build an effective diagnostic model, we systematically collected the datasets containing miRNA serum samples of Esophageal Cancer (EC) from the GEO database []. Three datasets GSE122497, GSE106817, and GSE112264 were selected using the keywords “esophageal cancer” and “serum”. Using the platform of GPL21263 3D-Gene Human miRNA V21_1.0.0, these three datasets detected 2,565 miRNAs among 8,469 samples, including both EC and control normal samples. GSE122497, containing the highest number of samples (n=5531), was randomly divided into a training set (70%) and a test set (30%). The other two datasets were used as external sets for independent validation, where the larger one GSE106817 with a sample size of 2,847 was defined as validation set 1 and the smaller one GSE112264 (91 samples) was defined as validation set 2.

Identification of miRNA-Pair Signatures

For the training set, a total of 3,288,330 miRNA pairs composed of 2,565 miRNAs were constructed. We identified 496 miRNA pairs with significant relative expression change, namely, in a pair, the expression values of one miRNA are consistently larger than the other miRNA in at least 90% of the control samples and smaller than the other one in more than 90% of the cancer samples. Then, we selected the miRNA pairs contributing most to the classification using LASSO. Since the resulting pairs were different using the random computation seeds, we carried out LASSO 100 times and determined the miRNA pairs that were consistently selected (Figure 1B). Interestingly, a majority of the miRNA pairs were randomly picked up and only four pairs (red boxed) were selected in all the 100 rounds, indicating the importance of these pairs in classification.

Next, we calculated the coefficients of the four miRNA pairs using LASSO to build a risk score, miRPS, reflecting the probability of a patient having EC. The miRPS was calculated as follows: 3.903316 * (hsa-miR-6781-5p, hsa-miR-6789-5p) + 3.613282 * (hsa-miR-6893-5p, hsa-miR-1290) + 3.138672 * (hsa-miR-6784-5p, hsa-miR-5100) + 2.603476 * (hsa-miR-125a-3p, hsa-miR-221-3p) - 8.312100. For each pair, the value is assigned 1 if the expression value of the first miRNA is larger than the second one. Otherwise, it is assigned 0. No coefficient was dominated and the largest one is 3.903316 for the pair of hsa-miR-6781-5p and hsa-miR-6789-5p. The expression value of each miRNA pair was reverse between distinct states (Figure 2A). The heatmap illustrates the significant differences of the miRNAs in each pair between cancer and non-cancer samples (Figure 2B). We also provided the chromosome and sequence information of the four miRNA pairs for potential further analysis (Figures 2C, D).

FIGURE 2
www.frontiersin.org

Figure 2 Summarization of the four miRNA pairs. (A) Expression values of the four identified miRNA pairs. Line represents the average expression abundance in EC and normal states for a miRNA in the training set. Two lines are intersecting when a pair of miRNAs are reversed in expression between the EC and normal state. (B) Heatmap showing the expression value of the four miRNA pairs between EC and normal samples in the training set. (C) A circos plot showing the location of the four miRNA pairs in chromosome. Curves in the circle represent the miRNA pairs. (D) The genetic information of the miRNA pairs.

Performance Evaluation

The performance of the 4-miRNA pair signature was evaluated using the internal test set and two external validation sets. The 4-miRNA pair signature in these datasets yielded extremely high AUCs, all of them are close to 1 (Figure 3). Similar results also obtained for the PRCs, with scores higher than 0.99 in all datasets. The EC samples were clearly discriminated from the normal samples when the risk score threshold was 0.5 (Figure 3, lower panel). More importantly, iPAGE facilitated the decision of the classification threshold and only a few samples were uncorrected predicted.

FIGURE 3
www.frontiersin.org

Figure 3 Performance evaluation of the 4-miRNA pair signature. The first row and the second row show the ROC and PRC curves for the training set, test set, and the two validation sets. The third row illustrates the prediction probability of the EC and normal samples in the four data sets.

Recently, Sudo et al. built an EC index using 6 serum miRNAs, i.e., miR-8073, miR-6820-5p, miR-6794-5p, miR-3196, miR-744-5p, and miR-6799-5p, to accurately detect early-stage EC. Our results demonstrated that the 4-miRNA pair signature overall outperforms the 6-miRNA signature, especially in the validation set 1 (Figure 4). The AUCs of the 4-miRNA pair signature were over 0.9900, while the scores were around 0.9970 for the 6-miRNA signature in the four sets. Moreover, the PRCs of the 4-miRNA pair signature were more than 0.9990, whereas the scores were 0.9773, 0.9845, and 0.9580 for the 6-miRNA signature in the training set, test set, and validation set 1 (Figures 3, 4).

FIGURE 4
www.frontiersin.org

Figure 4 Performance evaluation of the 6-miRNA signature. The first row and the second row show the ROC and PRC curves for the training set, test set, and the two validation sets. The third row illustrates the prediction probability of the EC and normal samples in the four data sets.

More importantly, it is hard to determine a consistent threshold to predict whether a sample is EC or normal for the 6-miRNA signature, resulting in a low measurement of precision and recall. When the threshold was set 0, the 6-miRNA signature demonstrated the precision of 0.9422, 0.9620, and 0.8137 in the training set, test set, and validation set 1 (Figure 5 and Table 2), respectively, while the scores were much higher for the 4-miRNA pair signature (0.9822, 0.9822, and 0.9239, respectively). The 6-miRNA signature yielded the recall of 0.9167, 0.8941, 0.9432, and 0.8200 in the training set, test set, validation set 1, and validation set 2, respectively, whereas the scores were improved to 0.9747, 0.9765, 0.9659, and 0.9200 for the 4-miRNA pair signature. The F-score of the 4-miRNA pair signature ranged from 0.9444 to 0.9794 in the four sets, which is consistently higher than that of the 6-miRNA signature (between 0.8737 and 0.9296).

FIGURE 5
www.frontiersin.org

Figure 5 Comparison of the performance of the 4-miRNA pair signature and the 6-miRNA signature. Precision, recall, and F-score are used for evaluation.

TABLE 2
www.frontiersin.org

Table 2 Evaluation of the performance of three miRNA signatures.

The miRNA Pairs Are Associated With EC

In previous studies, miR-125b was reported to participate in tumor proliferation and cell cycle regulation as a suppressor regulator. Ma et al. identified a miRNA cluster including three miRNAs, i.e., miR-99b, let-7e, and miR-125a, and observed the overexpression of the miRNAs in this cluster enhanced esophageal squamous cell carcinoma cell migration and invasion in vitro and induced an experimental metastasis in vivo (29). Wang et al. found that inhibition of miR-221 in 5-FU resistant cells resulted in reduced cell proliferation, increased apoptosis, restored chemosensitivity, and led to inactivation of the Wnt/β-catenin pathway mediated by regulating DKK2 expression in esophageal adenocarcinoma (30). Mao et al. demonstrated that miR-1290 functions as a tumor oncogene by targeting NFIX to degrade its expression, which can promote proliferation, migration, and invasion during EC progression (31). The biological consequences that miR-1290 mediated by binding NFIX were also experimentally verified in vitro.

Other miRNAs such as miR-5100 and miR-6893 were also important regulators that are dysregulated in several types of cancers. The expression abundance of miR-5100 is associated with the prognosis of gastric cancer (32) and miR-6893 could restore circMTO1-regulated migration, invasion, and chemoresistance of cervical cancer cells (33). Therefore, the miRNAs in the miRNA pairs not characterized may serve as candidate regulators and therapy targets in the future clinical applications of EC.

Discussion

We identified a 4-miRNA pair signature with the ability to diagnose patients with EC and validated its efficacy in two independent datasets. In total 8,378 samples were used to build and validate the diagnostic model. The signature demonstrated both AUCs and PRCs over 0.99 in all of the training set, test set, and two validation sets, which outperformed other state-of-art single miRNA signature. We also found literature supported evidences showing that the four miRNA pairs are highly associated with EC. Our results revealed that miRNAs pairs may serve as potential biomarkers for EC diagnosis.

Previously, we observed that using the expression value of lncRNAs or coding genes directly may lead to deviation, because high-throughput platforms are sensitive to various forms of technical variations (22, 34). Moreover, the generated continuous measurements were not measurable and comparable between different states due to the global biological alteration, even though they were preprocessed by plausible normalization methods (20, 21). iPAGE quantifies the relative expression of a pair of genes instead of the expression abundance of a single gene, which is an appropriate and sophisticated strategy to address the data preprocessing problem (22). Our results revealed that the relative expression is more reliable than the absolute expression value in the EC miRNA high-throughput data, which is an extension and approval of our previous discoveries. Recently, Liu et al. used 1,231 high-throughput miRNA-profiled serum samples to develop a diagnostic model for prostate cancer based on circulating miRNAs pairs and obtained approximate 0.99 for most of the measurements in a test and a validation set (35). This study also supported that circulating miRNA pairs are able to generate a robust diagnostic model in early diagnosis of cancers.

During the step of miRNA pair selection, we defined the reverse rate of 0.9 to filter miRNA pairs with a high ability to discriminate EC from the control samples. To assess the performance of iPAGE objectively, another threshold of 0.95 was also used to identify the reverse miRNA pairs. Using this threshold, 5-miRNA pairs were determined and it demonstrated AUCs and PRCs over 0.99 except the validation set 1, which yielded a PRC of 0.9813 (Table 2). Our findings revealed that iPAGE is a powerful tool for feature selection to reduce the data dimension and obtain relevant features for the machine learning models. In addition to the four miRNA pairs in miRPS consistently identified by running LASSO multiple times, the pairs selected by a majority of simulated calculations may also contribute to the classification. The four miRNA pairs were sufficient for diagnosis with a high accuracy, so it is not necessary to add the miRNA pairs less important into the penal. However, we may also consider these important pairs to improve the signature when it is not powerful enough.

In this study, the miRNA datasets used were all from the same platform of 3D-Gene Human miRNA V21_1.0.0, which limited the generalization of iPAGE and miRPS across different platforms. With the development of high-throughput technologies, an increasing number of miRNA datasets detected using different platforms will be available, more comprehensive cross-platform studies are warranted.

Our results revealed that circulating miRNAs pairs could serve as potential biomarkers for EC early diagnosis. iPAGE facilitates the steps of data preprocessing and feature selection, which is not only for lncRNA and mRNA data, but also for the miRNA expression data.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: GSE122497, GSE106817, GSE112264.

Author Contributions

LC and YS conceived of the idea. SZ and NZ prepared the data and analyzed the results. LC and YS supervised this work and wrote the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the Guangdong Basic and Applied Basic Research Foundation (2019A1515110097).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global Cancer Statistics 2018: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin (2018) 68:394–424. doi: 10.3322/caac.21492

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Liu S, Zhao W, Liu X, Cheng L. Metagenomic Analysis of the Gut Microbiome in Atherosclerosis Patients Identify Cross-Cohort Microbial Signatures and Potential Therapeutic Target. FASEB J (2020) 34:14166–81. doi: 10.1096/fj.202000622R

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Cheng L, Zeng Y, Hu S, Zhang N, Cheung KCP, Li B, et al. Systematic Prediction of Autophagy-Related Proteins Using Arabidopsis Thaliana Interactome Data. Plant J (2021) 105:708–20. doi: 10.1111/tpj.15065

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Li L, Liu M, Yue L, Wang R, Zhang N, Liang Y, et al. Host-Guest Protein Assembly for Affinity Purification of Methyllysine Proteomes. Anal Chem (2020) 92:9322–9. doi: 10.1021/acs.analchem.0c01643

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Cheng L, Liu P, Leung KS. SMILE: A Novel Procedure for Subcellular Module Identification With Localisation Expansion. IET Syst Biol (2018) 12:55–61. doi: 10.1049/iet-syb.2017.0085

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Cheng L, Fan K, Huang Y, Wang D, Leung K-S. Full Characterization of Localization Diversity in the Human Protein Interactome. J Proteome Res (2017) 16:3019–29. doi: 10.1021/acs.jproteome.7b00306

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Cheng L, Leung KS. Quantification of non-Coding RNA Target Localization Diversity and Its Application in Cancers. J Mol Cell Biol (2018) 10:130–8. doi: 10.1093/jmcb/mjy006

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Cheng L, Leung KS. Identification and Characterization of Moonlighting Long non-Coding RNAs Based on RNA and Protein Interactome. Bioinformatics (2018) 34:3519–28. doi: 10.1093/bioinformatics/bty399

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Cheng L, Nan C, Kang L, Zhang N, Liu S, Chen H, et al. Whole Blood Transcriptomic Investigation Identifies Long Non-Coding RNAs as Regulators in Sepsis. J Transl Med (2020) 18:217. doi: 10.1186/s12967-020-02372-2

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Liu X, Xu Y, Wang R, Liu S, Wang J, Luo Y, et al. A Network-Based Algorithm for the Identification of Moonlighting Noncoding RNAs and Its Application in Sepsis. Briefings Bioinf (2021) 22(1):581–8. doi: 10.1093/bib/bbz154

CrossRef Full Text | Google Scholar

11. Liu X, Zheng X, Wang J, Zhang N, Leung K-S, Ye X, et al. A Long non-Coding RNA Signature for Diagnostic Prediction of Sepsis Upon ICU Admission. Clin Transl Med (2020) 10:e123. doi: 10.1002/ctm2.123

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Nan CC, Zhang N, Cheung KCP, Zhang HD, Li W, Hong CY, et al. Knockdown of Lncrna MALAT1 Alleviates LPS-Induced Acute Lung Injury Via Inhibiting Apoptosis Through the miR-194-5p/FOXP2 Axis. Front Cell Dev Biol (2020) 8:586869. doi: 10.3389/fcell.2020.586869

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Bartel DP. MicroRNAs: Genomics, Biogenesis, Mechanism, and Function. Cell (2004) 116:281–97. doi: 10.1016/S0092-8674(04)00045-5

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Schwarzenbach H, Nishida N, Calin GA, Pantel K. Clinical Relevance of Circulating Cell-Free microRNAs in Cancer. Nat Rev Clin Oncol (2014) 11:145–56. doi: 10.1038/nrclinonc.2014.5

PubMed Abstract | CrossRef Full Text | Google Scholar

15. He L, Thomson JM, Hemann MT, Hernando-Monge E, Mu D, Goodson S, et al. A microRNA Polycistron as a Potential Human Oncogene. Nature (2005) 435:828–33. doi: 10.1038/nature03552

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Sudo K, Kato K, Matsuzaki J, Boku N, Abe S, Saito Y, et al. Development and Validation of an Esophageal Squamous Cell Carcinoma Detection Model by Large-Scale MicroRNA Profiling. JAMA Netw Open (2019) 2:e194573. doi: 10.1001/jamanetworkopen.2019.4573

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Nakamura K, Sawada K, Yoshimura A, Kinose Y, Nakatsuka E, Kimura T. Clinical Relevance of Circulating Cell-Free microRNAs in Ovarian Cancer. Mol Cancer (2016) 15:48. doi: 10.1186/s12943-016-0536-0

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Calin GA, Croce CM. MicroRNA Signatures in Human Cancers. Nat Rev Cancer (2006) 6:857–66. doi: 10.1038/nrc1997

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Liu X, Li N, Liu S, Wang J, Zhang N, Zheng X, et al. Normalization Methods for the Analysis of Unbalanced Transcriptome Data: A Review. Front Bioeng Biotechnol (2019) 7:358. doi: 10.3389/fbioe.2019.00358

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Cheng L, Lo LY, Tang NL, Wang D, Leung KS. CrossNorm: A Novel Normalization Strategy for Microarray Data in Cancers. Sci Rep (2016) 6:18898. doi: 10.1038/srep18898

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Cheng L, Wang X, Wong PK, Lee KY, Li L, Xu B, et al. ICN: A Normalization Method for Gene Expression Data Considering the Over-Expression of Informative Genes. Mol Biosyst (2016) 12:3057–66. doi: 10.1039/C6MB00386A

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Zheng X, Leung KS, Wong MH, Cheng L. Long Non-Coding RNA Pairs to Assist in Diagnosing Sepsis. BMC Genomics (2021) 22:275. doi: 10.1186/s12864-021-07576-4

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Wang J, Xiang X, Bolund L, Zhang X, Cheng L, Luo Y. Gnl-Scorer: A Generalized Model for Predicting CRISPR on-Target Activity by Machine Learning and Featurization. J Mol Cell Biol (2020) 12(11):909–11. doi: 10.1101/605790

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Wang J, Zhang X, Cheng L, Luo Y. An Overview and Metanalysis of Machine and Deep Learning-Based CRISPR gRNA Design Tools. RNA Biol (2020) 17:13–22. doi: 10.1080/15476286.2019.1669406

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, et al. Archive for Functional Genomics Data Sets–Update. Nucleic Acids Res (2013) 41:D991–5. doi: 10.1093/nar/gks1193

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Yokoi A, Matsuzaki J, Yamamoto Y, Yoneoka Y, Takahashi K, Shimizu H, et al. Integrated Extracellular microRNA Profiling for Ovarian Cancer Screening. Nat Commun (2018) 9:4319. doi: 10.1038/s41467-018-06434-4

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Urabe F, Matsuzaki J, Yamamoto Y, Kimura T, Hara T, Ichikawa M, et al. Large-Scale Circulating Microrna Profiling for the Liquid Biopsy of Prostate Cancer. Clin Cancer Res (2019) 25:3016–25. doi: 10.1158/1078-0432.CCR-18-2849

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Bolstad BM, Irizarry RA, Astrand M, Speed TP. A Comparison of Normalization Methods for High Density Oligonucleotide Array Data Based on Variance and Bias. Bioinformatics (2003) 19:185–93. doi: 10.1093/bioinformatics/19.2.185

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Ma J, Zhan Y, Xu Z, Li Y, Luo A, Ding F, et al. ZEB1 Induced miR-99b/let-7e/miR-125a Cluster Promotes Invasion and Metastasis in Esophageal Squamous Cell Carcinoma. Cancer Lett (2017) 398:37–45. doi: 10.1016/j.canlet.2017.04.006

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Wang Y, Zhao Y, Herbst A, Kalinski T, Qin J, Wang X, et al. Mir-221 Mediates Chemoresistance of Esophageal Adenocarcinoma by Direct Targeting of DKK2 Expression. Ann Surg (2016) 264:804–14. doi: 10.1097/SLA.0000000000001928

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Mao Y, Liu J, Zhang D, Li B. MiR-1290 Promotes Cancer Progression by Targeting Nuclear Factor I/X(NFIX) in Esophageal Squamous Cell Carcinoma (ESCC). BioMed Pharmacother (2015) 76:82–93. doi: 10.1016/j.biopha.2015.10.005

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Zhang HM, Li H, Wang GX, Wang J, Xiang Y, Huang Y, et al. Mkl1/miR-5100/CAAP1 Loop Regulates Autophagy and Apoptosis in Gastric Cancer Cells. Neoplasia (2020) 22:220–30. doi: 10.1016/j.neo.2020.03.001

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Chen M, Ai G, Zhou J, Mao W, Li H, Guo J. circMTO1 Promotes Tumorigenesis and Chemoresistance of Cervical Cancer Via Regulating Mir-6893. BioMed Pharmacother (2019) 117:109064. doi: 10.1016/j.biopha.2019.109064

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Zheng X, Wu Q, Wu H, Leung KS, Wong MH, Liu X, et al. Evaluating the Consistency of Gene Methylation in Liver Cancer Using Bisulfite Sequencing Data. Front Cell Dev Biol (2021) 9:671302. doi: 10.3389/fcell.2021.671302

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Liu HP, Lai HM, Guo Z. Prostate Cancer Early Diagnosis: Circulating microRNA Pairs Potentially Beyond Single microRNAs Upon 1231 Serum Samples. Brief Bioinform (2021) 22(3):bbaa111. doi: 10.1093/bib/bbaa111

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: microRNA, biomarker, esophageal cancer (EC), gene pair, diagnosis

Citation: Song Y, Zhu S, Zhang N and Cheng L (2021) Blood Circulating miRNA Pairs as a Robust Signature for Early Detection of Esophageal Cancer. Front. Oncol. 11:723779. doi: 10.3389/fonc.2021.723779

Received: 11 June 2021; Accepted: 08 July 2021;
Published: 23 July 2021.

Edited by:

Desi Shang, Harbin Medical University, China

Reviewed by:

Yang Chen, Shantou University, China
Fuyan Hu, Wuhan University of Technology, China

Copyright © 2021 Song, Zhu, Zhang and Cheng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Lixin Cheng, ZWFzb25sY2hlbmdAZ21haWwuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.