- 1Laboratory of Genetics and Genomics of Rare Diseases, Giannina Gaslini Institute (IRCCS), Genova, Italy
- 2Department of Health Sciences, School of Medical and Pharmaceutical Sciences, University of Genoa, Genoa, Italy
Editorial on the Research Topic
Elucidation of the causes of human disease by multi-omics integration
This Research Topic collected papers aimed at integrating different omics data to investigate the causes of human diseases. The multi-omics analysis, in fact, can leverage the advantages of each set of data, collecting more complete biological and clinical information about patients and thus providing a more holistic point of view. Integrating analysis allows us to model the correlations or associations likely present at the different omics layers of related data leading to a better modeling of the complexity of data and the possibility of applying multivariate methods. Ultimately, these approaches can provide hints on the complex biological and molecular patterns underlying human traits and diseases. Several multi-omics approaches have been published in order to improve knowledge about human diseases, whether they are Mendelian diseases (Almeida et al., 2022), complex disorders (Chong et al., 2023; Diray-Arce et al., 2023) or tumors (Park et al., 2023; Poos et al., 2023).
One of the greatest advantages of integrating multi-omics data is the possibility to exploit different datasets already available. Thanks to the improvement in technologies and bioinformatic tools, as well as the more and more affordable costs, omics data of many kinds have been generated at a remarkable hastening. Alongside, many databases that include clinical data together with genomic data, expression profiles, and other omics data are progressively becoming available, on both specific diseases and general populations. Just think of widely used databasessuch as TCGA (https://www.cancer.gov/tcga) and GTEx database (https://gtexportal.org/home), and the newly released proteomic dataset by UK Biobank (https://www.ukbiobank.ac.uk/), just to name a few. Among the papers collected in this Research Topic, Chen et al. investigated the SHC-adaptor protein 1 (SHC1) role in cancer through a pan-cancer analysis taking advantage of data from public databases (GTEx database, TCGA database, Oncomine and CPTAC database). The SHC1 expression levels were examined with regard to several crucial issues related to cancer, such as clinical outcomes, genetic alteration, DNA methylation, or protein phosphorylation. Altogether, their results support the SHC1 role in tumor immunity and its possible prognostic and diagnostic value in multiple cancers. Also Xie et al. focused on cancer, taking advantage of the TCGA database. They used mRNA expression and clinical data of Pancreatic ductal adenocarcinoma (PDAC) patients from the TCGA database as training cohort to establish the prognostic model and from GEO database as test cohort to validate the model focusing on necroptosis-related genes (NRGs). They were thus able to construct a genetic risk prognostic model based on four NRGs, independent of other risk factors such as age and tumor grade and with a fair predictive diagnostic efficacy that might be used in PDAC patients.
Besides the articles described above, many studies based on multi-omics analysis are focused on the field of tumor and have the goal of identifying biomarkers with a diagnostic and prognostic effect. Kaya et al. combined a published dataset of whole genome gene expression from patients with colorectal cancer (CRC) with already published copy number alterations data of different CRC samples, in order to identify a gene signature with a potential diagnostic and predictive value. The authors evaluated this gene signature expression levels in blood samples from CRC patients using data from GTEx portal, to test blood as a biological sample from which to extract information in a non-invasive way. Also Chen et al. focused on CRC datasets. Through the development of a new method, they identified gene networks significantly correlated with CRC survival that included genes with crucial roles in cancer progression and that could serve as potential prognostic biomarkers. With the same purpose, Needhamnsen et al. published a study on patients affected by Multiple Sclerosis (MS) in which they combined small RNA data and methylome data by applying the multiple co-inertia analysis (MCIA) multivariate method. This unsupervised approach was applied to multi-omics data derived from multiple compartments, cell-free cerebrospinal fluid (CSF), CSF, plasma and peripheral blood mononuclear cells from the same 42 individuals. Their integrative approach allowed to distinguish between the relapsing-remitting form and the secondary progressive stage of MS patients.
An additional advantage of omics data is that they can be re-analyzed as information is added and new methods of analysis evolve. It is therefore compelling the development and improvement of analysis methods, their standardization and guidelines on both analysis and data control steps based on interdisciplinary approaches. Several articles provide an overview on the multi-omics analysis (Correa-Aguila et al., 2022) with particular regard to the machine learning approach (Reel et al., 2021). Athieniti and Spyrou also define the main objectives investigated through multi-omics and describe the different methods applied as well as the main challenges (Athieniti and Spyrou, 2023). In the present editorial, Chen et al., designed a framework, Correlation-based Local Approximation of Membership (CLAM), which combines multi-omics data to identify co-expressed gene modules without the need across different datasets to share the same genes or samples and exploiting known molecular interactions in module detection, thus overcoming some common limitations of current methods. They applied their method to expression data from several cancer and normal tissue dataset finding higher precision and biological relevance in modules retrieval and a higher number of gene ontology (GO) terms in enrichment analysis in comparison to other known methods.
Besides the necessity of improving data integration methods and guidelines, there are still several limitations to the multi-omics approach. Among the most evident, is the lack of external validation of the results obtained by multi-omics analysis in different biological samples. In addition, some of the common issues in human diseases studies are even more crucial in multi-omics studies. The risk of sampling bias and the importance of detailed phenotypes and clinical classification remains of utmost importance, including the definition of covariates related to clinical outcomes (Lipman et al., 2023). Finally, the sample size is of concern, since achieving the statistical power becomes even more relevant than in the single omics.
In conclusion, multi-omics analysis can improve the knowledge of the complex underlying biology and identify prognostic and diagnostic biomarkers, as attempted in the studies published in the present Research Topic. This might eventually pave the way to better prevention and personalized medicine: multi-omics are still at their dawn, with some methods still lacking maturity, but these approaches are definitely promising and worthy of being increasingly applied to the study of human traits and diseases.
Author contributions
MR: Writing–original draft, Conceptualization, Writing–review and editing. FL: Writing–original draft, Writing–review and editing, Conceptualization.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Almeida, L. S., Pereira, C., Aanicai, R., Schröder, S., Bochinski, T., Kaune, A., et al. (2022). An integrated multiomic approach as an excellent tool for the diagnosis of metabolic diseases: our first 3720 patients. Eur. J. Hum. Genet. 30 (9), 1029–1035. doi:10.1038/s41431-022-01119-5
Athieniti, E., and Spyrou, G. M. (2023). A guide to multi-omics data collection and integration for translational medicine. Comput. Struct. Biotechnol. J. 21, 134–149. doi:10.1016/j.csbj.2022.11.050
Chong, D., Jones, N. C., Schittenhelm, R. B., Anderson, A., and Casillas-Espinosa, P. M. (2023). Multi-omics integration and epilepsy: towards a better understanding of biological mechanisms. Prog. Neurobiol. 227, 102480. doi:10.1016/j.pneurobio.2023.102480
Correa-Aguila, R., Alonso-Pupo, N., and Hernández-Rodríguez, E. W. (2022). Multi-omics data integration approaches for precision oncology. Mol. Omics 18 (6), 469–479. doi:10.1039/d1mo00411e
Diray-Arce, J., Fourati, S., Doni Jayavelu, N., Patel, R., Maguire, C., Chang, A. C., et al. (2023). Multi-omic longitudinal study reveals immune correlates of clinical course among hospitalized COVID-19 patients. Cell. Rep. Med. 4 (6), 101079. doi:10.1016/j.xcrm.2023.101079
Lipman, D., Safo, S. E., and Chekouo, T. (2023). Integrative multi-omics approach for identifying molecular signatures and pathways and deriving and validating molecular scores for COVID-19 severity and status. BMC Genomics 24 (1), 319. doi:10.1186/s12864-023-09410-5
Park, Y. H., Im, S. A., Park, K., Wen, J., Lee, K. H., Choi, Y. L., et al. (2023). Longitudinal multi-omics study of palbociclib resistance in HR-positive/HER2-negative metastatic breast cancer. Genome Med. 15 (1), 55. doi:10.1186/s13073-023-01201-7
Poos, A. M., Prokoph, N., Przybilla, M. J., Mallm, J. P., Steiger, S., Seufert, I., et al. (2023). Resolving therapy resistance mechanisms in multiple myeloma by multi-omics subclone analysis. Blood J. Online ahead of print. doi:10.1182/blood.2023019758
Keywords: omic analyses, multi-omic, human diseases, bioinformatics, integration
Citation: Rusmini M and Lantieri F (2023) Editorial: Elucidation of the causes of human disease by multi-omics integration. Front. Genet. 14:1271406. doi: 10.3389/fgene.2023.1271406
Received: 02 August 2023; Accepted: 03 August 2023;
Published: 09 August 2023.
Edited and reviewed by:
Jared C. Roach, Institute for Systems Biology (ISB), United StatesCopyright © 2023 Rusmini and Lantieri. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Marta Rusmini, bWFydGFydXNtaW5pQGdhc2xpbmkub3Jn