- 1Institutes for Systems Genetics, West China Hospital, Sichuan University, Chengdu, China
- 2Center for Systems Biology, University, Suzhou, China
- 3School of Medicine, Institute of Medical Sciences, Örebro University, Örebro, Sweden
- 4Department of Ophthalmology, Guangdong Academy of Medical Sciences, Guangdong Provincial People's Hospital, Guangzhou, China
- 5School of Science, Kangda College of Nanjing Medical University, Lianyungang, China
- 6Department of Oncology and Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
Introduction
Colorectal cancer (CRC) is one of the most common forms of cancer and a major cause of cancer-related death in both men and women worldwide (Lao and Grady, 2011; Siegel et al., 2017). Over 881,000 people globally have died from CRC, while 1.8 million were newly diagnosed with CRC in 2018 (Bray et al., 2018). The death rate of CRC has been steadily declining since 1990 (Siegel et al., 2017), but precise diagnosing and treating CRC remains challenging. Many patients exhibit few symptoms until the tumor has metastasized, making biomarkers for early diagnosis essential. Liquid biopsy is an easy and non-invasive method to detect ctDNA (circulating tumor DNA) in plasma or serum samples for early diagnosis, prognosis, or treatment (Tarazona and Cervantes, 2018). But ctDNA from a liquid biopsy is difficult to process and the lack of accuracy is still a problem (Kolencik et al., 2020); as a result, most previous studies focused on tumor tissue samples. For CRC patients, ctDNA is used to detect not only RAS mutations, but also DNA methylation, such as SEPTIN9 methylation (Song et al., 2018).
Epigenetic modifications play an important role in CRC genesis and progression (Danese and Montagnana, 2017). Epigenetics investigates heritable phenotype changes without alterations in the DNA sequence (Dupont et al., 2009). Epigenetic modifications include DNA methylation, histone modification, and genomic imprinting. DNA methylation is one of the best-characterized epigenetic mechanisms (Li and Zhang, 2014), which adds methyl groups to DNA, often at CpG sequences (Ehrlich et al., 1982). Emerging evidence suggests that some epigenetic modifications, DNA methylation in particular, can be important biomarkers for CRC (Ahmed, 2007). Aberrant DNA methylation is tissue-specific and often appears at early stages of cancer development (Jahn et al., 2011), making it a potentially ideal biomarker for early diagnosis of CRC.
We have previously constructed a biomarker database for colorectal cancer (CBD) (Zhang et al., 2018). Despite numerous reports on this subject so far, to our best knowledge, no database for cancer epigenetic biomarkers has been built yet. To enable the systematic study of epigenetics in CRC, we hereby established the first cancer epigenetic biomarker database, which was named CRC-EBD (Epigenetic Biomarker Database for Colorectal Cancer). CRC-EBD stores the epigenetic biomarkers information on CRC from PubMed literature. As precision medicine is becoming the new scientific paradigm (Morere, 2012), our database is built with more focus on collecting information regarding clinical samples in order to promote future translational researches on CRC.
Materials and Methods
Data in CRC-EBD was manually collected from PubMed. We used “(colon[ti] OR rectosigmoid junction[ti] OR rectal[ti] OR anus[ti] OR bowel[ti] OR colorectum[ti] OR colorectal[ti]) AND (biomarker*[tiab] OR marker*[tiab] OR indicator*[tiab] OR predicator*[tiab] OR (drug target*[tiab]) OR (therapeutic target*[tiab])” as the term to search the PubMed for the CRC biomarkers. In addition, we used the keyword “AND methylat*[tiab]” for methylation biomarker, “AND histone*[tiab]” for histone modification, and “AND epigenetics*[tiab] NOT methylation[tiab] NOT histone*[tiab]” for other epigenetic biomarkers. In total, 1,444 articles were screened for these biomarkers in PubMed citations until December, 2019.
The following rules were applied to screen articles about CRC epigenetic biomarkers.
1) The article should contain clear statements like “Epigenetic modification (such as DNA methylation, histone modification, or other epigenetics modifications) is a biomarker/marker/indicator of CRC.” If the statement includes expressions like “can/may/has potential,” the corresponding data is included. This key statement can be searched in our database under “Description”.
2) Reviews or meta-analyses are excluded in the screening of CRC biomarkers.
3) If the article includes information about AUC/sensitivity/specificity or other assessment of the accuracy of the biomarker for prediction or classification of CRC, the value should be statistically significant.
4) Biomarkers from different articles have different IDs in our database, even if they share the same name, but with different clinical conditions for CRC, such as biomarker for diagnosis, prognosis, or treatment of CRC.
5) If both single and combinatorial biomarkers are included in one article, all the reported markers are given different IDs in our database.
We eventually selected 355 biomarkers, along with 694 records of sample information and 420 records of epigenetics information from the articles. The various cancer names in the original articles were uniformly changed to colorectal/colon/rectal cancer. A common format, as in “methylation of APC,” was adopted for all the biomarker names in CRC-EBD. All gene symbols and miRNA names were annotated as the official gene symbols from NCBI and miRBase. The biomarkers were also labeled with sample resources (blood, stool, and tissue) and clinical applications (diagnosis, prognosis, and treatment). Moreover, sample information of the patients (e.g., nationality, age, and TNM stage,) was collected for further analysis in personalized medicine. The pipeline of data collection, database construction, and functions of CRC-EBD is shown in Figure 1.
B/S (Browser/Server) structure and WAMP (Windows Server 2016 + Apache 2.4.39 + MySQL (10.4.6-MariaDB) + PHP 7.3.8) were used to construct the database. Users can access our database using their own browsers without installing other components. HTML and CSS were used to create the web pages and display the information. PHP and JavaScript were applied to connect the database and realize the search function. The data is stored in the MySQL database, which can be easily and quickly accessed. The charts in the statistics page were generated dynamically using ECharts (Li et al., 2018).
Discussion
The epigenetic biomarkers in our online database can be searched by epigenetics name, epigenetics type, CRC subtype, biomarker type, and application. Epigenetics name searching mode allows users to enter the name of a gene, miRNA, or histone in a text box. Similarly, under CRC subtype searching mode, users can type in a text box the CRC subtypes or cancer names. Epigenetics types can be searched by DNA methylation, RNA methylation, histone modification, or others. Furthermore, users can select the biomarker type (diagnostic, prognostic, or therapeutic) and the application mode (blood, stool, tissue, or bowel lavage fluid) for their searches. The search result will be shown in a new webpage containing the list of biomarkers, and users can click each item for more detailed information.
Among the 355 epigenetics biomarkers in our CRC-EBD, 81.69% (290) of them are single DNA methylation biomarkers, whereas 11.52% are combinatorial (Figure 2A). Based on the clinical applications, 59.72% of the biomarkers are diagnostic, among which 9.86% are combinatorial for diagnosis, prognosis, or treatment (Figure 2B). 225 (63.38%) biomarkers are applied for tissue samples, 39 (10.99%) for stool, 77 (21.69%) for blood, and 13 (3.66%) for multiple sample types. A combined biomarker (miR-124-3, ZNF582-AS1, and SFRP1 methylation) is the only one reported for bowel lavage fluid detection (Figure 2C). 92.98% of the biomarkers in our database are applied for colorectal cancer research, demonstrating its prominence in the current field of studies (Figure 2D).
Figure 2. Distributions of the biomarkers in CRC-EBD. (A) Epigenetic types. (B) Biomarker types. (C) Sample types. (D) Cancer types.
Six hundred and ninety four groups of samples in total are collected in the CRC-EBD: 457 are tissue samples (tumor samples and healthy samples), 73 are stool samples, 131 are serum/plasma samples or others. Though stool or blood samples are easier and more convenient to acquire, most of the previous studies are based on tissue samples directly connected to cancer genesis and progress.
CRC-EBD is the first online resource for epigenetic biomarkers of cancer. We will expand the database to other cancers in the future. This database will offer the users a systematic perspective on the heterogeneous cancer and promote epigenetics research on cancers.
Data Availability Statement
Publicly available datasets were analyzed in this study. This data can be found at: http://www.sysbio.org.cn/EBD/.
Author Contributions
XL, XZ, HZ, X-FS, and BS conducted and designed this study. XL, XZ, JC, BY, SR, and YL collected data and implemented the database. XL and SR wrote the manuscript. BS supervised the project. All authors reviewed and approved the paper for publication.
Funding
This work was supported by the National Natural Science Foundation of China (Grant no. 31670851).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
Ahmed, F. E. (2007). Colorectal cancer epigenetics: the role of environmental factors and the search for molecular biomarkers. J. Environ. Sci. Health C Environ. Carcinog. Ecotoxicol. Rev. 25, 101–154. doi: 10.1080/10590500701399184
Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R. L., Torre, L. A., and Jemal, A. (2018). Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68, 394–424. doi: 10.3322/caac.21492
Danese, E., and Montagnana, M. (2017). Epigenetics of colorectal cancer: emerging circulating diagnostic and prognostic biomarkers. Ann. Transl. Med. 5:279. doi: 10.21037/atm.2017.04.45
Dupont, C., Armant, D. R., and Brenner, C. A. (2009). Epigenetics: definition, mechanisms and clinical perspective. Semin. Reprod. Med. 27, 351–357. doi: 10.1055/s-0029-1237423
Ehrlich, M., Gama-Sosa, M. A., Huang, L. H., Midgett, R. M., Kuo, K. C., McCune, R. A., et al. (1982). Amount and distribution of 5-methylcytosine in human DNA from different types of tissues of cells. Nucleic Acids Res. 10, 2709–2721. doi: 10.1093/nar/10.8.2709
Jahn, K. A., Su, Y., and Braet, F. (2011). Multifaceted nature of membrane microdomains in colorectal cancer. World J. Gastroenterol. 17, 681–690. doi: 10.3748/wjg.v17.i6.681
Kolencik, D., Shishido, S. N., Pitule, P., Mason, J., Hicks, J., and Kuhn, P. (2020). Liquid biopsy in colorectal carcinoma: clinical applications and challenges. Cancers 12:1376. doi: 10.3390/cancers12061376
Lao, V. V., and Grady, W. M. (2011). Epigenetics and colorectal cancer. Nat. Rev. Gastroenterol. Hepatol. 8, 686–700. doi: 10.1038/nrgastro.2011.173
Li, D., Mei, H., Shen, Y., Su, S., Zhang, W., Wang, J., et al. (2018). ECharts: a declarative framework for rapid construction of web-based visualization. Vis. Inform. 2, 136–146. doi: 10.1016/j.visinf.2018.04.011
Li, E., and Zhang, Y. (2014). DNA methylation in mammals. Cold Spring Harb. Perspect. Biol. 6:a019133. doi: 10.1101/cshperspect.a019133
Morere, J. F. (2012). Oncology in 2012: from personalized medicine to precision medicine. Target Oncol. 7, 211–212. doi: 10.1007/s11523-012-0238-5
Siegel, R. L., Miller, K. D., Fedewa, S. A., Ahnen, D. J., Meester, R. G. S., Barzi, A., et al. (2017). Colorectal cancer statistics, 2017. CA Cancer J. Clin. 67, 177–193. doi: 10.3322/caac.21395
Song, L., Guo, S., Wang, J., Peng, X., Jia, J., Gong, Y., et al. (2018). The blood mSEPT9 is capable of assessing the surgical therapeutic effect and the prognosis of colorectal cancer. Biomark Med. 12, 961–973. doi: 10.2217/bmm-2018-0012
Tarazona, N., and Cervantes, A. (2018). Liquid biopsy: another tool towards tailored therapy in colorectal cancer. Ann. Oncol. 29, 7–8. doi: 10.1093/annonc/mdx641
Keywords: colorectal cancer, database, epigenetics, DNA methylation, histone modification
Citation: Liu X, Zhang X, Chen J, Ye B, Ren S, Lin Y, Sun X-F, Zhang H and Shen B (2020) CRC-EBD: Epigenetic Biomarker Database for Colorectal Cancer. Front. Genet. 11:907. doi: 10.3389/fgene.2020.00907
Received: 14 May 2020; Accepted: 22 July 2020;
Published: 06 October 2020.
Edited by:
Xiaogang Wu, University of Texas MD Anderson Cancer Center, United StatesReviewed by:
Lorena Aguilar Arnal, National Autonomous University of Mexico, MexicoGeorges Nemer, American University of Beirut, Lebanon
Copyright © 2020 Liu, Zhang, Chen, Ye, Ren, Lin, Sun, Zhang and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Bairong Shen, YmFpcm9uZy5zaGVuJiN4MDAwNDA7c2N1LmVkdS5jbg==; Hong Zhang, aG9uZy56aGFuZyYjeDAwMDQwO29ydS5zZQ==
†These authors have contributed equally to this work