- 1Department of Molecular Biology and Genetics, Bilkent University, Ankara, Turkey
- 2Department of Multidisciplinary Studies, National University of Medical Sciences, Rawalpindi, Pakistan
Zebrafish is a small fresh-water species widely used as a vertebrate model in human physiology and pathologies. In addition, the zebrafish genome has recently been sequenced providing a plethora of opportunities for comparison with the genomes of other animals, such as rodents and humans (Howe et al., 2013). However, there is an emerging need for a concise synthesis of the literature findings and approaches used in comparing transcriptomes of mammals with that of zebrafish. In this opinion article, we first focus on features which make zebrafish genome and transcriptome unique and invaluable for comparative studies. Exploring research findings under multiple topics such as comparative physiology, toxicology, and cancer reveal important aspects of functional conservation among vertebrates and effective research pipelines for comparative transcriptomics studies.
Features of Zebrafish Genome/transcriptome
The zebrafish genome is similar to the human genome with respect to the number of chromosomes (25 vs. 23 pairs, respectively) (Postlethwait et al., 2000) and of protein coding genes (Howe et al., 2013). On the other hand, a distinguishing aspect of the zebrafish genome, as that of most fish genomes, is the extra whole-genome duplication (WGD) event that has been estimated to occur at least 320 million years ago (Hoegg et al., 2007). As a result, the zebrafish genome contains paralogous pairs of genes, exhibiting divergent or complementary functions, for a set of mammalian orthologous genes (Laprairie et al., 2016). The divergence of zebrafish paralogous genes with respect to their methylation pattern in correspondence to expression levels in gametes and early embryos has been studied (Zhong et al., 2016). Two thousand four hundred and forty paralogous pairs of genes have been identified and more than 75% of these genes are found to be under purifying selection. In addition, around 600 duplicate pairs in zebrafish exhibit divergent promoter methylation as well as a negative correlation between levels of promoter DNA methylation and mRNA expression (Zhong et al., 2016). Future studies are still needed to better understand the expression patterns of zebrafish paralogs in comparison to human orthologous genes. In this context, we are developing a web-based tool called CompariZome for comparative statistical examination of public human and zebrafish expression datasets extracted from GEO repository (Edgar et al., 2002; Lopes et al., 2016).
Identification of Conserved Functional Processes by Comparative Transcriptomics
Based on previous studies, two effective and complementary approaches exist for comparative transcriptomics between different species: comparison of significantly modulated (a) genes; and/or (b) pathways (Figure 1). We briefly explain them below with selected examples from the literature.
For any comparative study dealing with zebrafish and mammalian expression, multiple data sources (Edgar et al., 2002; Kolesnikov et al., 2015) need to be searched for a specified biological question (Figure 1). Upon obtaining differentially expressed genes (DEGs), the next crucial step is to identify the pathways enriched in the gene lists of each species, separately. A commonly used web-based software is DAVID (Huang da et al., 2009a,b; Schartl et al., 2012; Sucularli et al., 2016), used to identify the enriched functional terms, e.g., KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways or gene ontology (GO) (Figure 1). Gene Set Enrichment Analysis (GSEA), which was developed to rank significantly enriched biological/cellular processes based on differentially expressed genes (Subramanian et al., 2005), is also widely used (Zheng et al., 2011; Yildiz et al., 2013). One of the most important steps in comparative transcriptomics is orthology mapping (Figure 1). This can be performed using different resources that include Ensembl Biomart (Saraiva et al., 2015; Shih et al., 2015); Unigene clusters (Lam et al., 2006; Zheng et al., 2011, 2014); and Homologene (Driessen et al., 2015). Statistics based on Fisher's Exact test (Sucularli et al., 2016) or GSEA (Zheng et al., 2011) are essential elements to reveal significant associations between different species in terms of the selected genes, pathways or enrichment terms (Figure 1). Moreover, the visualization of such associations between different species requires the use of multivariate exploratory techniques among which Principal Component Analysis (PCA) and heatmaps (Zheng et al., 2014; Saraiva et al., 2015; Tarifeño-Saldivia et al., 2017), and/or coinertia (COA) plots (Kaya et al., 2011) are commonly preferred (Figure 1). Representation by Venn diagrams is another way to show the intersection of gene lists/pathways between species (Lam et al., 2006; Tarifeño-Saldivia et al., 2017). The abundance of one-to-many orthology relationships between zebrafish and mammals, however, complicates orthology mapping and hence may restrict downstream analyses to the use of only one-to-one orthologous pairs that exhibit moderate-high sequence similarity (Saraiva et al., 2015; Davison et al., 2017; Tarifeño-Saldivia et al., 2017). Finally, the development of online tools and use of meta-analysis methodologies can lead to easier access to data/analyses and more robust conclusions on the degree of functional conservation between zebrafish and mammals (Kaya et al., 2011; Sucularli et al., 2016; Davison et al., 2017).
To demonstrate different aspects of the pipeline described in Figure 1, we provide the below examples of comparative transcriptomics studies focusing on physiology, toxicology, and cancer.
Physiological Processes and Disease Relevance
Inter-species transcriptome analysis is highly useful to discover common, as well as unique, signatures in a diverse set of physiological and relevant pathological conditions. For example, RNA-sequencing of zebrafish heart has led to the identification of 96 chamber-specific genes, 68 of which possess orthologs in humans. An OMIM database search reveals the 25 of these 68 genes are disease-associated, and five of them are specifically involved in cardiac diseases (Singh et al., 2016). Similarly, 49 orthologs of 51 known human dilated cardiomyopathy-associated genes are found to be conserved in zebrafish, further promoting the use of the zebrafish model for human cardiac disease research (Shih et al., 2015). However, 19 of these genes have two or more paralogs in zebrafish, pointing to one of the complexities that needs to be resolved for studying disease association between zebrafish and humans. Currently, the selection of a gene that is enriched within the specified tissue emerges as a potential strategy to prioritize one or more of the paralogous genes for future studies (Shih et al., 2015).
Maternal to zygotic transition (MZT), during which maternally provided RNA and proteins are replaced by those produced by the zygote alone, is a shared phenomenon among animals (Bensaude et al., 1983; Harvey et al., 2013). The comparison of expression changes in MZT of Lymnaea stagnalis with the published data from several arthropod, nematode, urochordate, and chordate species including zebrafish, has indeed led to the extraction of a shared expression signature. An analysis of orthologs common to all species (identified using tBLASTx) reveals that maternal transcripts are often enriched with housekeeping functions and are involved in nucleotide binding, cell cycle progression, and protein degradation (Liu et al., 2014). This extensive comparative transcriptomics study of embryogenesis shows the power of such an approach and contributes to our understanding of the patterns of embryonic conservation among vertebrates and invertebrates. Future comparative transcriptomics studies between zebrafish and mammals can help reveal the functional divergence of paralogous genes in zebrafish MZT.
Other areas receiving attention in comparative transcriptomics include regulation of the circadian clock between zebrafish and mouse (Boyle et al., 2017) and the degree of conservation of ovulation in humans, mice, and zebrafish (Liu et al., 2017). Zebrafish has also become a useful model for revealing the resemblance between zebrafish swim bladder and mammalian lung (Zheng et al., 2011), renal distal convoluted tubules of zebrafish and mouse (Sugano et al., 2017), pancreatic cell populations of zebrafish, mouse, and human (Tarifeño-Saldivia et al., 2017), and olfactory systems of mouse and zebrafish (Saraiva et al., 2015). In addition, the conservation of immune response between zebrafish and mammals has been shown using meta-genomics of gut microbiota (Davison et al., 2017) and with respect to IFN gamma signaling (Filiano et al., 2016).
Cancer
Cancer, one of the complex human diseases threatening large numbers of people across different age groups involves the deregulation of expression of genes relevant to cell proliferation, apoptosis, and senescence (Jones and Baylin, 2007). Hence, understanding how similarly genes and signaling pathways are modulated between zebrafish and humans can help identify novel therapeutic gene/protein targets and signaling pathways in cancer.
For example, comparisons between medaka, zebrafish, and human melanoma transcriptomes using RNA sequencing have identified more downregulated than upregulated genes that are commonly modulated between fish and human samples (Schartl et al., 2012). Another study focusing on the comparison of genetic alterations in zebrafish and human melanoma (exhibiting congruent BRAF mutation and P53 deletions) points to lower rates of UV-induced mutations in zebrafish while providing novel candidate genes to pursue in drug-resistant human melanomas (Kansler et al., 2017). Similarly, a study performed with an inducible zebrafish embryonal rhabdomyosarcoma model has identified the shared upregulation of MYF5 mRNA in zebrafish and human expression datasets based on GSEA comparisons (Langenau et al., 2007). Further transcriptomics and mechanistic studies in zebrafish and human rhabdomyosarcomas have demonstrated the potent role of the transcription factors MYF5 and MYOD in the development and growth of muscle tumors (Tenente et al., 2017).
Using statistical approaches, such as binomial tests and GSEA has shown that zebrafish liver cancer most significantly resembles human liver cancer in terms of changes in gene expression (Lam et al., 2006). Further studies demonstrate that deregulations in several cancer-related transcription factors, MYC, E2F1, YY1, and STAT, are commonly observed in zebrafish and human liver cancers (Ung et al., 2009). It has also been shown that different human liver cancer subtypes can be paired successfully with xmrk-, kras-, and myc-induced zebrafish tumors using comparative transcriptomics at the level of gene as well as at the level of cellular pathways (Zheng et al., 2014). The authors have found that the human orthologs of the genes upregulated in zebrafish liver tumors are also upregulated in different stages of hepatocellular carcinomas (HCC); and altogether, three types of zebrafish models represent almost half of the human HCC samples (Zheng et al., 2014).
Toxicology and Pharmacology
Toxicology and pharmacology are the two relevant fields in which the comparison of zebrafish and mammalian transcriptomes provide invaluable information for translational biology. Liver cells, mainly due to their ability for detoxification, are frequently expression-profiled in human toxicology studies where comparisons with zebrafish liver or embryos/larvae exposed to drugs are made (Driessen et al., 2015). For example, mercury can induce early and late response genes in vivo in zebrafish liver and in vitro in human liver carcinoma cells. In addition to the identification of concordant upregulated (proteasome and DNA damage response) and downregulated (mitochondrial fatty acid beta oxidation, electron transport chain, nuclear receptor signaling) pathways, this study also reveals discrepancies between transcriptomes of zebrafish and humans in response to mercury, signifying caution in such comparative studies (Ung et al., 2010). Gene-centric analyses in toxicology also exist; for example, H2O2 exposure in zebrafish larvae and in human keratinocytes shows that the expressions of 41 genes are modulated in common between the two species (Lisse et al., 2016). Scatterplots of logarithmically transformed fold expression changes (LogFC) from different expression datasets help test similarity in expression profiles between experiments and have been used in recent comparative transcriptomics literature (Yildiz et al., 2013; Lisse et al., 2016).
Zebrafish is a highly preferred model for testing drugs commonly used in treating human diseases. For example, a study performed with the immunosuppressant drug, cyclosporine, at the levels of gene, pathway and transcription factors, strengthen the confidence on the effective use of zebrafish embryos for comparative toxicogenomic studies (Driessen et al., 2013). Pathway-focused comparative transcriptomics analysis of rapamycin exposure, in vitro, between a zebrafish embryonic fibroblast cell line and five mouse cell lines also demonstrate the presence of a strong and positive inter-species association between expression profiles (Sucularli et al., 2016). Furthermore, the comparison of pathways/gene ontology terms via meta-analysis pipelines, as well as the stringent use of association tests, can be listed among the strategies to increase the robustness of findings in comparative transcriptomics (Sucularli et al., 2016). Using in vivo models in addition to in vitro comparisons may further lead to more comprehensive findings since a larger number of shared genes and stronger enrichment scores can be uncovered with in vivo studies (Driessen et al., 2015).
Conclusion
Herein, we have provided a brief pipeline from hypothesis testing to the visualization of results for comparative transcriptomics studies between zebrafish and other animals based on the existing literature (Figure 1). We also reiterate the need to focus on the employment of effective strategies to deal with paralogous gene expression in comparative studies. Moreover, the use of meta-analysis in comparative transcriptomics is likely to increase the robustness of conclusions in regard to the functional conservation of genes and pathways between zebrafish and mammals. Similarly, the development of web-based analysis tools will enhance accessibility to comparative transcriptomics data and will allow researchers to make better decisions while selecting genes/pathways to target for further mechanistic studies or translational research.
Author Contributions
HS and OK contributed to the ideas and the literature search of the manuscript, wrote the manuscript, and revised it.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We thank Bilkent University, Department of Molecular Biology and Genetics Zebrafish Facility, The Scientific and Technological Research Council of Turkey, (TUBITAK grant nos.103T038 and 105S365), and Higher Education Commission (HEC), Pakistan (Ph.D. Scholarship, HS).
References
Bensaude, O., Babinet, C., Morange, M., and Jacob, F. (1983). Heat shock proteins, first major products of zygotic gene activity in mouse embryo. Nature 305, 331–333. doi: 10.1038/305331a0
Boyle, G., Richter, K., Priest, H. D., Traver, D., Mockler, T. C., Chang, J. T., et al. (2017). Comparative analysis of vertebrate diurnal/circadian transcriptomes. PLoS ONE 12:e0169923. doi: 10.1371/journal.pone.0169923
Davison, J. M., Lickwar, C. R., Song, L., Breton, G., Crawford, G. E., and Rawls, J. F. (2017). Microbiota regulate intestinal epithelial gene expression by suppressing the transcription factor Hepatocyte nuclear factor 4 alpha. Genome Res. 27, 1195–1206. doi: 10.1101/gr.220111.116
Driessen, M., Kienhuis, A. S., Pennings, J. L., Pronk, T. E., van de Brandhof, E. J., Roodbergen, M., et al. (2013). Exploring the zebrafish embryo as an alternative model for the evaluation of liver toxicity by histopathology and expression profiling. Arch. Toxicol. 87, 807–823. doi: 10.1007/s00204-013-1039-z
Driessen, M., Vitins, A. P., Pennings, J. L., Kienhuis, A. S., Water, B., and van der Ven, L. T. (2015). A transcriptomics-based hepatotoxicity comparison between the zebrafish embryo and established human and rodent in vitro and in vivo models using cyclosporine A, amiodarone and acetaminophen. Toxicol. Lett. 232, 403–412. doi: 10.1016/j.toxlet.2014.11.020
Edgar, R., Domrachev, M., and Lash, A. E. (2002). Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 30, 207–210. doi: 10.1093/nar/30.1.207
Filiano, A. J., Xu, Y., Tustison, N. J., Marsh, R. L., Baker, W., Smirnov, I., et al. (2016). Unexpected role of interferon-gamma in regulating neuronal connectivity and social behaviour. Nature 535, 425–429. doi: 10.1038/nature18626
Harvey, S. A., Sealy, I., Kettleborough, R., Fenyes, F., White, R., Stemple, D., et al. (2013). Identification of the zebrafish maternal and paternal transcriptomes. Development 140, 2703–2710. doi: 10.1242/dev.095091
Hoegg, S., Boore, J. L., Kuehl, J. V., and Meyer, A. (2007). Comparative phylogenomic analyses of teleost fish Hox gene clusters: lessons from the cichlid fish Astatotilapia burtoni. BMC Genomics 8:317. doi: 10.1186/1471-2164-8-317
Howe, K., Clark, M. D., Torroja, C. F., Torrance, J., Berthelot, C., Muffato, M., et al. (2013). The zebrafish reference genome sequence and its relationship to the human genome. Nature 496, 498–503. doi: 10.1038/nature12111
Huang da, W., Sherman, B. T., and Lempicki, R. A. (2009a). Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37, 1–13. doi: 10.1093/nar/gkn923
Huang da, W., Sherman, B. T., and Lempicki, R. A. (2009b). Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57. doi: 10.1038/nprot.2008.211
Jones, P. A., and Baylin, S. B. (2007). The epigenomics of cancer. Cell 128, 683–692. doi: 10.1016/j.cell.2007.01.029
Kansler, E. R., Verma, A., Langdon, E. M., Simon-Vermot, T., Yin, A., Lee, W., et al. (2017). Melanoma genome evolution across species. BMC Genomics 18:136. doi: 10.1186/s12864-017-3518-8
Kaya, K. D., Karakülah, G., Yakicier, C. M., Acar, A. C., and Konu, O. (2011). mESAdb: microRNA expression and sequence analysis database. Nucleic Acids Res. 39(Database issue), D170–D180. doi: 10.1093/nar/gkq1256
Kolesnikov, N., Hastings, E., Keays, M., Melnichuk, O., Tang, Y. A., Williams, E., et al. (2015). ArrayExpress update–simplifying data submissions. Nucleic Acids Res. 43(Database issue), D1113–D1116. doi: 10.1093/nar/gku1057
Lam, S. H., Wu, Y. L., Vega, V. B., Miller, L. D., Spitsbergen, J., Tong, Y., et al. (2006). Conservation of gene expression signatures between zebrafish and human liver tumors and tumor progression. Nat. Biotechnol. 24, 73–75. doi: 10.1038/nbt1169
Langenau, D. M., Keefe, M. D., Storer, N. Y., Guyon, J. R., Kutok, J. L., Le, X., et al. (2007). Effects of RAS on the genesis of embryonal rhabdomyosarcoma. Genes Dev. 21, 1382–1395. doi: 10.1101/gad.1545007
Laprairie, R. B., Denovan-Wright, E. M., and Wright, J. M. (2016). Subfunctionalization of peroxisome proliferator response elements accounts for retention of duplicated fabp1 genes in zebrafish. BMC Evol. Biol. 16:147. doi: 10.1186/s12862-016-0717-x
Lisse, T. S., King, B. L., and Rieger, S. (2016). Comparative transcriptomic profiling of hydrogen peroxide signaling networks in zebrafish and human keratinocytes: implications toward conservation, migration and wound healing. Sci. Rep. 6:20328. doi: 10.1038/srep20328
Liu, D. T., Brewer, M. S., Chen, S., Hong, W., and Zhu, Y. (2017). Transcriptomic signatures for ovulation in vertebrates. Gen. Comp. Endocrinol. 247, 74–86. doi: 10.1016/j.ygcen.2017.01.019
Liu, M. M., Davey, J. W., Jackson, D. J., Blaxter, M. L., and Davison, A. (2014). A conserved set of maternal genes? Insights from a molluscan transcriptome. Int. J. Dev. Biol. 58, 501–511. doi: 10.1387/ijdb.140121ad
Lopes, S. S., Distel, M., Linker, C., Fior, R., Monteiro, R., Bianco, I. H., et al. (2016). Report of the 4th European zebrafish principal investigator meeting. Zebrafish 13, 590–595. doi: 10.1089/zeb.2016.1364
Postlethwait, J. H., Woods, I. G., Ngo-Hazelett, P., Yan, Y. L., Kelly, P. D., Chu, F., et al. (2000). Zebrafish comparative genomics and the origins of vertebrate chromosomes. Genome Res. 10, 1890–1902. doi: 10.1101/gr.164800
Saraiva, L. R., Ahuja, G., Ivandic, I., Syed, A. S., Marioni, J. C., Korsching, S. I., et al. (2015). Molecular and neuronal homology between the olfactory systems of zebrafish and mouse. Sci. Rep. 5:11487. doi: 10.1038/srep11487
Schartl, M., Kneitz, S., Wilde, B., Wagner, T., Henkel, C. V., Spaink, H. P., et al. (2012). Conserved expression signatures between medaka and human pigment cell tumors. PLoS ONE 7:e37880. doi: 10.1371/journal.pone.0037880
Shih, Y. H., Zhang, Y., Ding, Y., Ross, C. A., Li, H., Olson, T. M., et al. (2015). Cardiac transcriptome and dilated cardiomyopathy genes in zebrafish. Circ. Cardiovasc. Genet. 8, 261–269. doi: 10.1161/CIRCGENETICS.114.000702
Singh, A. R., Sivadas, A., Sabharwal, A., Vellarikal, S. K., Jayarajan, R., Verma, A., et al. (2016). Chamber specific gene expression landscape of the zebrafish heart. PLoS ONE 11:e0147823. doi: 10.1371/journal.pone.0147823
Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., et al. (2005). Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U.S.A. 102, 15545–15550. doi: 10.1073/pnas.0506580102
Sucularli, C., Shehwana, H., Kuscu, C., Dungul, D. C., Ozdag, H., and Konu, O. (2016). Functionally conserved effects of rapamycin exposure on zebrafish. Mol. Med. Rep. 13, 4421–4430. doi: 10.3892/mmr.2016.5059
Sugano, Y., Cianciolo Cosentino, C., Loffing-Cueni, D., Neuhauss, S.C. F., and Loffing, J. (2017). Comparative transcriptomic analysis identifies evolutionarily conserved gene products in the vertebrate renal distal convoluted tubule. Pflugers Arch. 469, 859–867. doi: 10.1007/s00424-017-2009-8
Tarifeño-Saldivia, E., Lavergne, A., Bernard, A., Padamata, K., Bergemann, D., Voz, M. L., et al. (2017). Transcriptome analysis of pancreatic cells across distant species highlights novel important regulator genes. BMC Biol. 15:21. doi: 10.1186/s12915-017-0362-x
Tenente, I. M., Hayes, M. N., Ignatius, M. S., McCarthy, K., Yohe, M., Sindiri, S., et al. (2017). Myogenic regulatory transcription factors regulate growth in rhabdomyosarcoma. Elife 6:e19214. doi: 10.7554/eLife.19214
Ung, C. Y., Lam, S. H., and Gong, Z. (2009). Comparative transcriptome analyses revealed conserved biological and transcription factor target modules between the zebrafish and human tumors. Zebrafish 6, 425–431. doi: 10.1089/zeb.2009.0608
Ung, C. Y., Lam, S. H., Hlaing, M. M., Winata, C. L., Korzh, S., Mathavan, S., et al. (2010). Mercury-induced hepatotoxicity in zebrafish: in vivo mechanistic insights from transcriptome analysis, phenotype anchoring and targeted gene expression validation. BMC Genomics 11:212. doi: 10.1186/1471-2164-11-212
Yildiz, G., Arslan-Ergul, A., Bagislar, S., Konu, O., Yuzugullu, H., Gursoy-Yuzugullu, O., et al. (2013). Genome-wide transcriptional reorganization associated with senescence-to-immortality switch during human hepatocellular carcinogenesis. PLoS ONE 8:e64016. doi: 10.1371/journal.pone.0064016
Zheng, W., Li, Z., Nguyen, A. T., Li, C., Emelyanov, A., and Gong, Z. (2014). Xmrk, kras and myc transgenic zebrafish liver cancer models share molecular signatures with subsets of human hepatocellular carcinoma. PLoS ONE 9:e91179. doi: 10.1371/journal.pone.0091179
Zheng, W., Wang, Z., Collins, J. E., Andrews, R. M., Stemple, D., and Gong, Z. (2011). Comparative transcriptome analyses indicate molecular homology of zebrafish swimbladder and mammalian lung. PLoS ONE 6:e24019. doi: 10.1371/journal.pone.0024019
Keywords: zebrafish, transcriptomics, mammals, cancer, toxicology, development, physiology, comparative
Citation: Shehwana H and Konu O (2019) Comparative Transcriptomics Between Zebrafish and Mammals: A Roadmap for Discovery of Conserved and Unique Signaling Pathways in Physiology and Disease. Front. Cell Dev. Biol. 7:5. doi: 10.3389/fcell.2019.00005
Received: 24 November 2018; Accepted: 10 January 2019;
Published: 01 February 2019.
Edited by:
Isaac Henry Bianco, University College London, United KingdomReviewed by:
Masazumi Tada, University College London, United KingdomCopyright © 2019 Shehwana and Konu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ozlen Konu, a29udUBmZW4uYmlsa2VudC5lZHUudHI=