Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 21 January 2019
Sec. Cancer Epidemiology and Prevention

Identification of Serum MicroRNAs as Novel Biomarkers in Esophageal Squamous Cell Carcinoma Using Feature Selection Algorithms

  • 1Department of Epidemiology and Health Statistics, School of Public Health, Beijing Municipal Key Laboratory of Clinical Epidemiology, Capital Medical University, Beijing, China
  • 2National Cancer Center/Cancer Hospital, Chinese Academy of Medical Science & Peking Union Medical College, Beijing, China
  • 3Department of Molecular Physiology and Biophysics, Holden Comprehensive Cancer Center, University of Iowa Carver College of Medicine, Iowa City, IA, United States

Introduction: Circulating microRNAs (miRNAs) are promising molecular biomarkers for the early detection of esophageal squamous cell carcinoma (ESCC). We investigated the serum miRNA expression profiles from microarray-based technologies and evaluated the diagnostic value of serum miRNAs as potential biomarkers for ESCC by using feature selection algorithms.

Methods: Serum miRNA expression profiles were obtained from 52 ESCC patients and 52 age- and sex-matched controls via performing a high-throughput microarray assay. Five representative feature selection algorithms including the false discovery rate procedure, family-wise error rate procedure, Lasso logistic regression, hybrid huberized support vector machine (SVM), and SVM using the squared-error loss with the elastic-net penalty were jointly carried out to select the significantly differentially expressed miRNAs based on the miRNA profiles.

Results: Three miRNAs including miR-16-5p, miR-451a, and miR-574-5p were identified as the powerful biomarkers for the diagnosis of ESCC. The diagnostic accuracy of the combination of these three miRNAs was evaluated by using logistic regression and the SVM. The averages of the area under the receiver operating curve and classification accuracies based on different classifiers were more than 0.80 and 0.79, respectively. The cross-validation results suggested that the three-miRNA-based classifiers could clearly distinguish ESCC patients from healthy controls. Moreover, the classifying performance of the miRNA panel persisted in discriminating the healthy group from patients with ESCC stage I-II (AUC > 0.76) and patients with ESCC stage III-IV (AUC > 0.80).

Conclusions: These results in this study have moved forward the identification of novel biomarkers for the diagnosis of ESCC.

Introduction

Esophageal cancer (EC) is the seventh most common malignancy and the sixth leading cause of cancer-related death worldwide in 2018 (1). Esophageal squamous cell carcinoma (ESCC) accounts for over 90% of all EC cases in lower income countries, especially in parts of Asia (1). Despite the improvements in surgical techniques and perioperative management have extensively prolonged the survival of ESCC patients, ESCC still remains one of the most deadly carcinomas of the gastrointestinal tract. The 5-year survival rate of late-stage ESCC is <15% (2). Therefore, early detection represents an essential way of reducing the morbidity and mortality of ESCC. Currently, diagnosis of ESCC mainly depends on endoscopic examination with biopsy, meaning that accurate detection of ESCC can only be obtained when patients have obvious symptoms and lesions. However, most early-stage ESCC patients are asymptomatic and their lesions are confined to the mucosa or submucosa, thus they usually lose the opportunity to be early diagnosed. Moreover, the invasiveness and the potential for sample error limit the effectiveness of the endoscopic biopsy (3). Thus, the discovery of novel non-invasive biomarkers with high efficiency for early detection of ESCC is urgently needed.

MicroRNAs (miRNAs) are a class of small non-coding RNAs of about 18–25 nucleotides in length. MiRNAs regulate gene expression by direct binding to the 3′ untranslated region (3′UTR) of target messenger RNAs (mRNAs) according to base pair complementarity to promote their degradation and/or translational inhibition (4). The altered expression of specific miRNAs has been associated with various diseases, including cancers (58). In recent years, several studies have analyzed the serum miRNA profiles in human ESCC patients by microarray-based techniques, and have examined their potential clinical relevance (911). However, a comprehensive evaluation of the value of circulating miRNAs as potential biomarkers for screening ESCC has yet to be investigated. Moreover, for the microarray data, the number of samples is usually much smaller than that of measured miRNAs, and this limitation is known as high-dimension, low-sample-size (HDLSS) problem that may lead to over-fitting and negatively influence the diagnostic performance in traditional statistical models. Therefore, the identification of miRNAs as biomarkers is tightly linked with the curse of dimensionality. Feature selection algorithms including multiple hypotheses testing and Lasso-type variable selection methods that have been proposed to reduce the dimension due to their simplicity and efficiency are more suitable for high-dimensional microarray data of miRNAs (12).

In this case-control study, we profiled serum miRNA expression of a relatively large number of samples using the Agilent miRNA array. The combined application of the false discovery rate (FDR) procedure (13), the family-wise error rate (FWER) procedure (14), Lasso Logistic regression (15), hybrid huberized support vector machine (HHSVM) (16)and support vector machine using the squared-error loss (SESVM) with the elastic-net penalty (17) was then carried out, and identified three serum miRNAs, including miR-16-5p, miR-451a, and miR-574-5p, as candidate biomarkers for diagnosing ESCC. We then developed three-miRNA-based classifiers based on these three miRNAs by carrying out Logistic regression and SVM model to predict ESCC. We assessed the predictive accuracies of these classifiers and discovered that they were highly efficient in discriminating ESCC patients from healthy controls, suggesting these miRNAs are potential diagnostic biomarkers for ESCC.

Materials and Methods

Patients and Healthy Controls

The participants were composed of 104 Chinese Han people, including 52 unrelated ESCC patients and 52 healthy controls. All ESCC cases were pathologically diagnosed for the first time and were recruited consecutively from the Endoscopy Center of Cancer Hospital, Linzhou City, Henan province, from October 2014 to October 2015. During the same period, healthy controls were randomly selected from an early EC screening program, which were frequency-matched to the ESCC cases by age (±5 years) and gender. All enrolled ESCC case and control subjects were residents in Linzhou city. This study was approved by the Institutional Review Board of Capital Medical University and was in accordance with the principle of the Declaration of Helsinki. All subjects recruited for this study provided written informed consent prior to participation. Detailed characteristics of ESCC patients and the healthy volunteers were summarized in Table 1.

TABLE 1
www.frontiersin.org

Table 1. Characteristics of the study subjects.

Total RNA Extraction

Whole blood samples were loaded into the serum collection tubes and stood for 1 h at room temperature, followed by being centrifuged at 820 g for 10 min at 4°C. The resulting serum was transferred into new tubes, followed by further centrifugation at 16, 000 g for 10 min at 4°C. Total RNA was extracted from 400 μl serum and then purified by using the mirVanaTM PARISTM kit (Ambion, Austin, TX, USA) according to the manufacturer's instructions. The quantity and purity of RNA were measured with the NanoDrop 1000 Spectrophotometer (NanoDrop Technologies, Waltham, MA, USA), and the concentration of extracted RNA ranged from 4.5 to 15.7 ng/μl. The integrity of RNA was assayed by capillary electrophoresis with an Agilent Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, USA).

Agilent MiRNA Microarray

The serum RNA samples from the 52 ESCC patients and 52 healthy controls were assayed for miRNA expression profiles by using the Agilent Human microRNA microarray, which contains the probes for 2006 human miRNAs. Briefly, 100 ng total RNA was labeled with Cy3 by using the miRNA Complete Labeling and Hyb Kit (Agilent Technologies, Santa Clara, CA, USA) according to the manufacturer's instructions. Each slide was hybridized with 100 ng Cy3-labeled RNA in a hybridization oven (Agilent Technologies, Santa Clara, CA, USA) at 55°C, 20 rpm for 20 h according to the manufacturer's instructions. After hybridization, slides were washed with the Gene Expression Wash Buffer Kit (Agilent Technologies, Santa Clara, CA, USA) in staining dishes (Thermo Shandon, Waltham, MA, USA). The washed slides were scanned and the microarray images were then converted into spot intensity values by the Agilent Microarray Scanner (Agilent Technologies, Santa Clara, CA, USA). The signals undergone background subtraction were exported directly into the Feature Extraction software 10.7 (Agilent Technologies, Santa Clara, CA, USA) with default settings. Raw data were normalized by the quartile algorithm with the Gene Spring Software 12.6 (Agilent Technologies, Santa Clara, CA, USA).

Statistical Analysis

The intersection among the significant miRNAs selected by different statistical methods is identified as the candidate biomarkers for ESCC (Figure 1). For each miRNA, the non-parametric Wilcoxon rank-sum test was used to compare expression levels between the ESCC and the control group. The single p-value of each test is then calculated. The adjusted p-values are computed by the FDR method (13) and the FWER method of Holm's procedure (14), and miRNAs with an adjusted p-value less than or equal to 0.05 are defined significant and can be utilized as candidate biomarkers of ESCC. Simultaneously, the Lasso logistic regression (15), HHSVM (16) and SESVM (17) were performed to select the most useful diagnostic biomarkers from all ESCC associated miRNAs based on the microarray data. The intersection among significant miRNAs selected by different statistical methods is identified as a panel of biomarkers for the diagnosis of ESCC. Integrating multiple biomarkers in logistic regression and SVM, multi-miRNA classifiers are constructed to predict ESCC. The performance of the multi-miRNA classifiers is measured by classification accuracy and the area under the receiver operating characteristic (ROC) curve (AUC). Sensitivities and specificities of multi-miRNA classifiers were determined by the highest Youden index. The cross-validation was applied to evaluate the diagnostic performance. Each split of data set in cross-validation is 3/2 for training and 1/3 for testing.

FIGURE 1
www.frontiersin.org

Figure 1. Flowchart of identifying predictive miRNAs by performing representative feature selection algorithms.

The heatmap of significant miRNAs identified by the FDR method was plotted with MultiExperiment Viewer (MeV, version 4.9, TM4, Boston, MA, USA). Each significant miRNA was standardized independently by performing Z-score transformation to scale the log base 2 of the expression levels into having a mean of zero and standard deviation of one. All statistical analysis was performed with R software (version 3.4.1, R Foundation for Statistical Computing, Vienna, Austria). The R packages “glmnet” and “gcdnet” were used to implement the biomarker screening by Lasso logistic regression, HHSVM, and SESVM. The package “e1071” was used to perform SVM, and the package “pROC” was employed to plot the ROC curve and to determine the AUC.

Results

Identification of Significant MiRNAs by Controlling FDR and FWER

For each one among 842 human miRNAs encoded within three digits, we compared the miRNA expression levels of the ESCC patients and the normal controls. By controlling the FDR and FWER at 0.05, we identified seven significantly differentially expressed miRNAs, including miR-16-5a, miR-92-3a, miR-107, miR-320C, miR-451a, miR-486, and miR-574 (Table 2). The heatmap of the seven differential miRNAs indicated that their expression levels were consistent within each group but obviously different between the two groups (Figure 2). In addition, as shown in Figure 3, the serum levels of these identified miRNAs were all significantly up-regulated in ESCC patients than in healthy controls.

TABLE 2
www.frontiersin.org

Table 2. The seven selected miRNAs by controlling FDR and FWER.

FIGURE 2
www.frontiersin.org

Figure 2. The heat map of the seven candidate miRNAs identified by the FDR method. Pseudocolors indicated expression levels on a log-2 scale from −3 to 3 standard deviations independently by Z-score transformation. Green: negative, under-expressed; red: positive, over-expressed.

FIGURE 3
www.frontiersin.org

Figure 3. Serum levels of the seven selected miRNAs in ESCC cases and healthy controls. The miRNAs expression levels were log2-transformed.

Selection of Significant MiRNAs by Lasso Logistic Regression

We also applied Lasso logistic regression to screen these 842 human miRNAs encoded within three digits. The regularization parameter in Lasso logistic regression was selected by performing the 10-fold cross-validation, in which the cross-validated binomial deviance and the misclassification error were used as the criteria of predictive performance (Supplementary Figure S1). In the cross-validation process, the array data were randomly split into 10 subsets. Because the selected miRNAs were sometimes slightly different among each analysis, we carried out the cross-validation for 100 times and then identified candidate miRNAs that were selected for more than 30 times out of the 100 cycles of cross-validation. To obtain a more manageable set of miRNAs, we sacrificed one more cross-validation error and finally identified six miRNAs, including miR-7b-5p, miR-107, miR-16-5p, miR-191-3p, miR-451a, and miR-574-5p, which all had a selection frequency higher than 90%.

Identification of Significant MiRNAs With Penalized SVM

The SVM method is one of the most powerful classification techniques, which is widely used for analyzing microarray data. Since the standard SVM cannot automatically select significant genes, to identify candidate miRNA biomarkers we performed the HHSVM and the SESVM. In these two methods, the tuning parameters that control the elastic-net penalty were selected by performing multiple rounds of 5-fold cross-validation. The margin-based loss function and misclassification error (ME) were employed for controlling cross-validation errors (Supplementary Figures S2, S3) for the HHSVM and the SESVM. We iterated the cross-validation 100 times for the high-dimensional microarray datasets and computed the selection frequency of each miRNA. MiRNAs with selection frequency of more than 30% were selected as significant ones. The HHSVM and the SESVM identified the same set of six miRNAs as Lasso logistic regression, including miR-7b-5p, miR-107, miR-16-5p, miR-191-3p, miR-451a, and miR-574-5p. The selection frequencies of these six miRNAs by HHSVM and SESVM methods using two types of cross-validation errors were summarized in Table 3. As shown in Table 3, five miRNAs were selected for more than 80 times out of 100 random splits by HHSVM and SESVM.

TABLE 3
www.frontiersin.org

Table 3. The selection frequencies (%) of candidate miRNAs.

The Diagnostic Values of Selected Serum MiRNAs in ESCC

Notably, four miRNAs, miR-107, miR-16-5p, miR-451a, and miR-574-5p were simultaneously identified to be differentially expressed between ESCC and controls with all feature selection methods performed in this study. Although miR-107 was significant when the FDR was controlled at a level of 0.05, it was not significant if the FWER was also controlled at the same level. Furthermore, miR-107 had two outliers with extremely high levels, resulting in very large standard deviances in both ESCC and control groups. Therefore, miR-107 was excluded from further analysis and only the remaining three miRNAs, miR-16-5p, miR-451a, and miR-574-5p, were defined as potential biomarkers for the diagnosis of ESCC.

To evaluate the efficiency of this novel panel of three biomarkers in the detection of ESCC, logistic regression and SVM were applied to develop three-miRNA-based classifiers for predicting ESCC. We also used the panel of the three significant miRNAs and the main clinical characteristics including age, sex, smoking status, and drinking status for the classifiers. We randomly split the miRNA profile data into the training and the testing sets 100 times. In each split, 35 ESCC patients and 35 healthy controls were randomly classified into the training group and the rest 17 ESCC and 17 controls were correspondingly defined as the testing group; the ROC curves were plotted and the AUCs were calculated with logistic regression, linear SVM and SVM with the Radial Basis Function kernel (Figure 4 and Supplementary Figure S4). As shown in Table 4, the average AUCs and average accuracies of the three classifiers with the significant three miRNAs all exceeded 0.80 and 0.79, respectively; the average sensitivities and specificities of the two methods both were larger than 0.74. Furthermore, the diagnostic performance of the three-miRNA-based classifiers was very similar to (a little worse than in some settings) that of the classifiers with the panel of three significant miRNAs and the main clinical characteristics. These results indicated that the panel of three miRNAs had high predictive accuracy for distinguishing ESCC from the healthy group with relatively high sensitivity and specificity.

FIGURE 4
www.frontiersin.org

Figure 4. The ROC curves for the three serum miRNA-based panel as a diagnostic biomarker by using three classifiers. (A) Logistic regression. (B) Linear SVM. (C) SVM with the Radial Basis Function kernel.

TABLE 4
www.frontiersin.org

Table 4. Diagnostic performance in discriminating ESCC from the healthy.

Multiple comparisons of three miRNA levels between patients with ESCC stage I-II (ESCC I-II), ESCC stage III-IV (ESCC III-IV) and the healthy groups were further performed (Figure 5A and Supplementary Table S1). In both patient subgroups, the levels of miR-16-5p, miR-451a, and miR-574-5p were significantly increased compared with the healthy group (p < 0.005 for all three miRNAs). The diagnostic performance of the miRNA panel in different ESCC stages was further evaluated by using 100 times cross-validation (Table 5, Figures 5B,C, and Supplementary Figure S5). The analysis demonstrated that the miRNA panel had high accuracy in discriminating the healthy group from patients with ESCC stage I-II (AUC > 0.76, sensitivity > 0.73, and specificity > 0.82) and patients with ESCC stage III-IV (AUC > 0.80, sensitivity > 0.75, and specificity > 0.81). The similar (slightly better in some settings) results of diagnostic accuracy were obtained by adding the main characteristics in the three-miRNA-based classifiers (Supplementary Table S2). This indicated that the diagnostic performance of the miRNA panel was independent of ESCC status. Thus, the expression levels of miR-16-5p, miR-451a, and miR-574-5p panel can potentially serve as an efficient diagnostic biomarker for ESCC.

FIGURE 5
www.frontiersin.org

Figure 5. Quantification of serum miR-16-5p, miR-451a, and miR-574-5p in the healthy controls and two different ESCC groups. (A) The box plot comparing the expression level of serum miR-16-5p, miR-451a, and miR-574-5p in the healthy control with ESCC patients with stage I-II (ESCC I-II) and with stage III-IV (ESCC III-IV). (B) The ROC curve for the three serum miRNA-based panel in differentiating ESCC I-II from the healthy by logistic regression. (C) The ROC curve for the three serum miRNA-based panel in differentiating ESCC III-IV from the healthy by logistic regression.

TABLE 5
www.frontiersin.org

Table 5. Diagnostic performance in differentiating ESCC subgroups from the healthy for the three-miRNA-based classifiers.

Discussion

In this study, we systematically investigated the serum miRNA profiles of a relatively large cohort of 52 ESCC patients and 52 well-matched healthy controls using high-throughput microarray-based assay. Using the FDR method, the FWER method, Lasso Logistic regression, HHSVM and SESVM together, this study has identified three miRNAs, miR-16-5p, miR-451a, and miR-574-5p, as potential biomarkers for diagnosis of ESCC. The expression levels of these three miRNAs were significantly up-regulated in the serum of ESCC patients compared to normal controls. We developed classifiers based on these three miRNAs with the logistic regression and the SVM model to predict ESCC. The classification accuracy and AUC were used to assess the performance of the three miRNAs as a diagnostic tool in detecting ESCC. The average AUCs and classification accuracies based on different classifiers were higher than 0.80 and 0.79, respectively, indicating that the three-miRNA-based panel had a high potential to distinguish ESCC patients from healthy controls.

This study has major strengths. To the best of our knowledge, this is the first study in the identification of potential serum miRNAs as diagnostic biomarkers in ESCC based on microarray data of exceeding 100 sample cases, which increased the statistical power of each test and had a stronger sensitivity for quantitative detection of the serum miRNAs. Furthermore, five representative feature selection algorithms, which are appropriate for high dimension and low sample data analysis, were together performed to select significant miRNAs with a high credibility. Lastly, reliable validation employing 100 times random splits for the samples has shown that miRNA-based classifiers were powerful for the diagnosis of ESCC.

The biological functions of the three miRNAs identified in our study have been investigated in previous studies. MiR-16 was one of the earliest miRNAs found to be involved in cancers; it could suppress apoptosis and promote cell growth by down-regulating reversion-inducing-cysteine-rich protein with Kazal motifs (RECK) and sex-determining region Y-box (Sox) 6, two genes that play important roles in the pathogenesis of ESCC (18). MiR-451 is a key factor in regulating erythroid differentiation and in maintaining homeostasis of erythroid cells (19). It has also been associated with cell proliferation, migration and apoptosis via regulating different target genes (20), such as calcium-binding protein 39 (CAB39) (21), Ras-related protein 14 (RAB14) (22), and macrophage migration inhibitory factor (MIF) (23). MiR-574-5p has been reported to act as an oncogene in various types of cancers, including ESCC (24, 25).

There are some limitations of this study that need to be declared. First, the microarray experiment in this study was not designed specifically to identify biomarkers distinguishing between ESCC and normal esophagus, but rigorous dimensionality reduction techniques were employed to strengthen its effect of screening miRNAs. Second, although our study design is a population-based case-control study, potential drawbacks such as selection bias may still occur, because the ESCC cases were selected from the Cancer Hospital of Linzhou City, which accounted for above 60% of the esophageal cancer patients in the study population. However, the healthy controls were recruited from the general population in Linzhou City. Moreover, this case-control study design is not warranted to infer causal relationships between the expression levels of miRNAs and ESCC. Third, the levels of the three candidate miRNAs selected from the microarray were not further quantified by quantitative real-time PCR (qRT-PCR). However, miRNA microarray expression has shown to be highly concordant when re-analyzed with qRT-PCR (26), with correlation coefficients measuring from r = 0.986 to 0.994, depending on normalization method (27). Further validation of the newly identified miRNAs in larger independent population is necessary before they can be put in a clinical application.

Conclusion

In summary, five useful feature selection algorithms were jointly adopted to select the common serum miRNA signatures for the diagnosis of ESCC based on microarray data. Three serum miRNAs were identified, and classifiers based on the combination of them were constructed by the Logistic regression and the SVM. The cross-validation results showed that the three-miRNA-based classifiers can accurately distinguish ESCC patients from healthy controls. This study provides a resource and an impetus for further investigating these novel serum miRNAs as biomarkers in ESCC.

Data Availability

The miRNA datasets analyzed for this study can be found in the Gene Expression Omnibus (GEO) repository (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE112840).

Author Contributions

DZ, YH, WW, and FL conceived the study. YD and QM performed and evaluated the experiments. YS helped to perform the experiments. XG helped to collect the samples. DZ, YD, and LZ analyzed the data. DZ, LZ, and FL wrote the manuscript. All authors contributed to the preparation of the manuscript. All authors read and approved the final manuscript.

Funding

This work was supported by grants from the National Natural Science Foundation of China (81473056, 81573224, 81703317, and 81874277), Natural Science Foundation of Beijing Municipal (7172025), and Scientific Research Common Program of Beijing Municipal Commission of Education (KM201710025006).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

Thank all the individuals who participated in this study.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2018.00674/full#supplementary-material

References

1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, and Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2018) 68:394–424. doi: 10.3322/caac.21492

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Pennathur A, Farkas A, Krasinskas AM, Ferson PF, Gooding WE, Gibson MK, et al. Esophagectomy for T1 esophageal cancer: outcomes in 100 patients and implications for endoscopic therapy. Ann Thorac Surg. (2009) 87:1048–54. discussion: 54–5. doi: 10.1016/j.athoracsur.2008.12.060

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Zhao L, Wei WQ, Zhao DL, Hao CQ, Lin DM, Pan QJ, et al. Population-based study of dna image cytometry as screening method for esophageal cancer. World J Gastroenterol. (2012) 18:375–82. doi: 10.3748/wjg.v18.i4.375

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell (2004) 116:281–97. doi: 10.1016/S0092-8674(04)00045-5

PubMed Abstract | CrossRef Full Text | Google Scholar

5. He L, Thomson JM, Hemann MT, Hernando-Monge E, Mu D, Goodson S, et al. A microRNA polycistron as a potential human oncogene. Nature (2005) 435:828–33. doi: 10.1038/nature03552

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Lu J, Getz G, Miska EA, Alvarez-Saavedra E, Lamb J, Peck D, et al. MicroRNA expression profiles classify human cancers. Nature (2005) 435:834–8. doi: 10.1038/nature03702

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Calin GA, and Croce CM. MicroRNA signatures in human cancers. Nat Rev Cancer. (2006) 6:857–66. doi: 10.1038/nrc1997

PubMed Abstract | CrossRef Full Text | Google Scholar

8. He L, He X, Lim LP, de Stanchina E, Xuan Z, Liang Y, et al. A microRNA component of the p53 tumour suppressor network. Nature (2007) 447:1130–4. doi: 10.1038/nature05939

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Takeshita N, Hoshino I, Mori M, Akutsu Y, Hanari N, Yoneyama Y, et al. Serum microRNA expression profile: mir-1246 as a novel diagnostic and prognostic biomarker for oesophageal squamous cell carcinoma. Br J Cancer. (2013) 108:644–52. doi: 10.1038/bjc.2013.8

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Wu C, Wang C, Guan X, Liu Y, Li D, Zhou X, et al. Diagnostic and prognostic implications of a serum miRNA panel in oesophageal squamous cell carcinoma. PLoS ONE (2014) 9:e92292. doi: 10.1371/journal.pone.0092292

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Hui B, Chen X, Hui L, Xi R, and Zhang X. Serum miRNA expression in patients with esophageal squamous cell carcinoma. Oncol Lett. (2015) 10:3008–12. doi: 10.3892/ol.2015.3642

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Roy P, Truntzer C, Maucort-Boulch D, Jouve T, and Molinari N. Protein mass spectra data analysis for clinical biomarker discovery: a global review. Brief Bioinform. (2011) 12:176–86. doi: 10.1093/bib/bbq019

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Benjamini Y, and Hochberg Y. Controlling the false discovery rate - a practical and powerful approach to multiple testing. J R Stat Soc B Methodol. (1995) 57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x

CrossRef Full Text | Google Scholar

14. Holm S. A simple sequentially rejective multiple test procedure. Scand J Stat. (1979) 6:65–70.

Google Scholar

15. Wu TT, Chen YF, Hastie T, Sobel E, and Lange K. Genome-wide association analysis by lasso penalized logistic regression. Bioinformatics (2009) 25:714–21. doi: 10.1093/bioinformatics/btp041

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Wang L, Zhu J, and Zou H. Hybrid huberized support vector machines for microarray classification and gene selection. Bioinformatics (2008) 24:412–9. doi: 10.1093/bioinformatics/btm579

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Yang Y, and Zou H. An efficient algorithm for computing the HHSVM and its generalizations. J Comput Graph Stat. (2013) 22:396–415. doi: 10.1080/10618600.2012.680324

CrossRef Full Text | Google Scholar

18. Zhu Y, Xia Y, Niu H, and Chen Y. MiR-16 induced the suppression of cell apoptosis while promote proliferation in esophageal squamous cell carcinoma. Cell Physiol Biochem. (2014) 33:1340–8. doi: 10.1159/000358701

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Rasmussen KD, Simmini S, Abreu-Goodger C, Bartonicek N, Di Giacomo M, Bilbao-Cortes D, et al. The miR-144/451 locus is required for erythroid homeostasis. J Exp Med. (2010) 207:1351–8. doi: 10.1084/jem.20100458

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Su Z, Ni L, Yu W, Yu Z, Chen D, Zhang E, et al. MicroRNA-451a is associated with cell proliferation, migration and apoptosis in renal cell carcinoma. Mol Med Rep. (2015) 11:2248–54. doi: 10.3892/mmr.2014.2957

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Tian Y, Nan Y, Han L, Zhang A, Wang G, Jia Z, et al. MicroRNA miR-451 downregulates the PI3K/AKT pathway through CAB39 in human glioma. Int J Oncol. (2012) 40:1105–12. doi: 10.3892/ijo.2011.1306

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Wang R, Wang ZX, Yang JS, Pan X, De W, and Chen L. B. MicroRNA-451 functions as a tumor suppressor in human non-small cell lung cancer by targeting ras-related protein 14 (RAB14). Oncogene (2011) 30:2644–58. doi: 10.1038/onc.2010.642

CrossRef Full Text | Google Scholar

23. Liu N, Jiang N, Guo R, Jiang W, He QM, Xu YF, et al. MiR-451 inhibits cell growth and invasion by targeting MIF and is associated with survival in nasopharyngeal carcinoma. Mol Cancer (2013) 12:123. doi: 10.1186/1476-4598-12-123

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Ji S, Ye G, Zhang J, Wang L, Wang T, Wang Z, et al. miR-574-5p negatively regulates Qki6/7 to impact beta-catenin/wnt signalling and the development of colorectal cancer. Gut (2013) 62:716–26. doi: 10.1136/gutjnl-2011-301083

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Yang M, Liu R, Sheng J, Liao J, Wang Y, Pan E, et al. Differential expression profiles of microRNAs as potential biomarkers for the early diagnosis of esophageal squamous cell carcinoma Oncol Rep. (2013) 29:169–76. doi: 10.3892/or.2012.2105

CrossRef Full Text | Google Scholar

26. Pritchard CC, Cheng HH, and Tewari M. MicroRNA profiling: approaches and considerations. Nat Rev Genet. (2012) 13:358–69. doi: 10.1038/nrg3198

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Pradervand S, Weber J, Thomas J, Bueno M, Wirapati P, Lefort K, et al. Impact of normalization on miRNA microarray expression profiling. RNA (2009) 15:493–501. doi: 10.1261/rna.1295509

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: esophageal squamous cell carcinoma, serum microRNA, multiple-testing criterion, Lasso logistic regression, penalized support vector machine

Citation: Zheng D, Ding Y, Ma Q, Zhao L, Guo X, Shen Y, He Y, Wei W and Liu F (2019) Identification of Serum MicroRNAs as Novel Biomarkers in Esophageal Squamous Cell Carcinoma Using Feature Selection Algorithms. Front. Oncol. 8:674. doi: 10.3389/fonc.2018.00674

Received: 16 August 2018; Accepted: 21 December 2018;
Published: 21 January 2019.

Edited by:

Jianguang Ji, Lund University, Sweden

Reviewed by:

Shaohua Xie, Karolinska Institute (KI), Sweden
Kun Chen, Zhejiang University, China

Copyright © 2019 Zheng, Ding, Ma, Zhao, Guo, Shen, He, Wei and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wenqiang Wei, weiwq@cicams.ac.cn
Fen Liu, liufen05@ccmu.edu.cn

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.