Editorial: Multi-omics approaches in the study of human disease mechanisms

Wang, Dapeng; Agapito, Giuseppe

doi:10.3389/fbinf.2024.1546680

EDITORIAL article

Front. Bioinform., 07 January 2025

Sec. Integrative Bioinformatics

Volume 4 - 2024 | https://doi.org/10.3389/fbinf.2024.1546680

This article is part of the Research Topic: Multi-omics approaches in the study of human disease mechanisms

Dapeng Wang^1,2*^†

Giuseppe Agapito^3,4^†

¹Shandong Key Laboratory of Intelligent Oil and Gas Industrial Software, Qingdao Institute of Software, College of Computer Science and Technology, China University of Petroleum (East China), Qingdao, China
²National Heart and Lung Institute, Imperial College London, London, United Kingdom
³Department of Law, Economics and Social Sciences, University Magna Græcia, Catanzaro, Italy
⁴Data Analytics Research Center, University Magna Græcia, Catanzaro, Italy

Editorial on the Research Topic
Multi-omics approaches in the study of human disease mechanisms

With the development and popularity of high-throughput next-generation sequencing technologies, omics approaches such as genomics, transcriptomics, proteomics and radiomics have gradually become essential tools of modern biological and medical research. In the early years, most studies used a single omics type to profile one specific class of biological molecules, which can yield inconsistent biomarkers with different rankings across omics types. As omics technologies have advanced and become more cost-effective, high-quality key biomarkers, as well as molecular pathways and regulatory networks causally associated with diseases, can be identified through so-called multi-omics, which combines more than one type of omics (Hasin et al., 2017). In a typical multi-omics study, one compares disease samples with controls, and samples of different severities or progressive stages with one another, to explore disease-specific or stage-specific molecular features pending further experimental verification. The combination of demographic and clinical data with multi-omics data from patients with a specific disease offers a unique opportunity to make full use of cutting-edge artificial intelligence methods, including machine learning and deep learning, to accumulate knowledge and experience in interdisciplinary research fields (Reel et al., 2021; Ballard et al., 2024). The most informative analyses use multi-omics data collected from the same set of samples with longitudinal information, in order to illuminate the time-dependent dynamics of disease progression. For multifaceted and complex diseases, multi-omics could define groups of patients with distinct endotypes who exhibit heterogeneous treatment responses owing to the particular underlying molecular mechanisms connecting genotype with phenotype (Tyler and Bunyavanich, 2019).
The findings from these studies could inform early diagnosis, prediction of prognosis and implementation of the most appropriate and effective treatment strategies, leading to improved quality of life for patients and the realisation of personalised medicine. Recently, multi-omics has been extensively employed in studies of human diseases, including rare diseases, cancers, and other common diseases. For example, it has proven instrumental in predicting response to treatment in breast cancers (Sammut et al., 2022), identifying epigenetic changes in human brains with Alzheimer’s disease (AD) (Nativio et al., 2020), and improving the diagnostic yield and clinical management of patients with rare diseases (Lunke et al., 2023). The advent of single-cell omics and spatial omics has revolutionised the discovery of new cell types at enhanced resolution, the elucidation of cellular heterogeneity and cell-cell interactions, and the measurement of the three-dimensional architecture and organisation of molecular profiles in a whole tissue (Bressan et al., 2023). Even though the integration of multi-omics has shown powerful performance in the molecular characterisation of human disease aetiology, data collection, analysis and harmonisation present enormous challenges because the different omics techniques are at varied stages of development. In particular, sequencing-based transcriptomics has more established and standardised pipelines for both experimental and bioinformatic processes than mass spectrometry-based omics types such as proteomics and metabolomics. In addition, transcriptomics can cover all human protein-coding genes whereas proteomics can only screen a selection of human proteins, which makes the integration of transcriptomics and proteomics less comprehensive.
Furthermore, the complexity and high-dimensional structure of high-volume multi-omics data offer new avenues for the development of mathematical, statistical, computer science and data science approaches, including data housing, management strategies and data visualisation.
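The coverage gap between transcriptomics and proteomics noted above can be made concrete with a minimal sketch. It assumes each layer is summarised by a list of gene symbols; the function name `integration_coverage` and the toy gene panels are hypothetical and are not taken from any of the cited studies.

```python
def integration_coverage(transcript_genes, protein_genes):
    """Return the genes measured in both layers and the fraction of
    transcript-level genes that survive the intersection."""
    rna = set(transcript_genes)
    prot = set(protein_genes)
    shared = rna & prot
    coverage = len(shared) / len(rna) if rna else 0.0
    return sorted(shared), coverage


# Hypothetical toy panels: the proteomics panel covers only part of the
# genes seen at the transcript level.
rna_genes = ["STAT3", "TP53", "MAPK3", "APOE", "SMN1"]
protein_genes = ["TP53", "APOE", "ALB"]

shared, coverage = integration_coverage(rna_genes, protein_genes)
print(shared, coverage)
```

In this toy case only two of the five transcript-level genes remain after intersecting the layers, which is the sense in which a feature-matched integration is less comprehensive than transcriptomics alone.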

The Research Topic on “Multi-omics approaches in the study of human disease mechanisms” comprises three original research articles and one brief research report. The contributions to this Research Topic explore how the combined use of multiple techniques can help researchers gain new insights into disease pathogenesis and drug discovery and development. In particular, they present innovative omics data analysis pipelines and methods that improve the ability to interpret complex data sets.

Gene set enrichment analysis is a routine part of omics studies, and because of the limitations of existing methods, continual efforts have been made to improve how biological insights are extracted after differential expression or abundance analysis between disease and control groups. To account for the uncertainties in the gene set generated by the differential expression analysis of omics studies, Hemandhar Kumar et al. developed bootGSEA, which uses a bootstrap approach to analyse randomly selected subsets of the data and calculates an integrated score based on rank aggregation across bootstrap replicates and multiple datasets. They also devised an evaluation framework to assess the robustness of the analysis by comparing the results with and without a bootstrap step. Applying the method to transcriptomics data from renal cell carcinoma, and to transcriptomics and proteomics data from a spinal muscular atrophy (SMA) mouse model, demonstrated increased robustness with improved biological interpretation, as well as the effectiveness of the new method across different datasets from single-omics or multi-omics studies.
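The general idea of combining bootstrap resampling with rank aggregation can be sketched as follows. This is a simplified illustration and not the bootGSEA implementation: it ranks individual genes by absolute mean difference and aggregates by mean rank, whereas the scoring and aggregation used by Hemandhar Kumar et al. are not described here. The function names, gene labels and expression values are all hypothetical.

```python
import random
from statistics import mean


def rank_genes(disease, control):
    """Rank genes for one set of samples (1 = largest absolute mean difference).

    disease / control: lists of samples, each a dict mapping gene -> value.
    """
    genes = sorted(disease[0])
    scores = {
        g: abs(mean(s[g] for s in disease) - mean(s[g] for s in control))
        for g in genes
    }
    ordered = sorted(genes, key=lambda g: (-scores[g], g))  # ties broken by name
    return {g: i + 1 for i, g in enumerate(ordered)}


def bootstrap_mean_ranks(disease, control, n_boot=200, seed=0):
    """Resample each group with replacement and average the per-replicate ranks."""
    rng = random.Random(seed)
    genes = sorted(disease[0])
    totals = dict.fromkeys(genes, 0.0)
    for _ in range(n_boot):
        d = [rng.choice(disease) for _ in disease]
        c = [rng.choice(control) for _ in control]
        ranks = rank_genes(d, c)
        for g in genes:
            totals[g] += ranks[g]
    return {g: totals[g] / n_boot for g in genes}


# Hypothetical toy expression values (not from any real dataset).
disease = [
    {"GENE_A": 10, "GENE_B": 3, "GENE_C": 2},
    {"GENE_A": 11, "GENE_B": 5, "GENE_C": 1},
    {"GENE_A": 9, "GENE_B": 2, "GENE_C": 3},
    {"GENE_A": 10, "GENE_B": 4, "GENE_C": 2},
]
control = [
    {"GENE_A": 1, "GENE_B": 2, "GENE_C": 2},
    {"GENE_A": 0, "GENE_B": 3, "GENE_C": 3},
    {"GENE_A": 1, "GENE_B": 2, "GENE_C": 1},
    {"GENE_A": 0, "GENE_B": 3, "GENE_C": 2},
]

mean_ranks = bootstrap_mean_ranks(disease, control)
print(mean_ranks)
```

A gene whose rank is stable across replicates (here `GENE_A`, which is far apart between groups in every resample) keeps a mean rank of 1, while genes with weaker or noisier differences receive intermediate mean ranks that reflect how often they swap places.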

Complex diseases usually involve multiple factors from multiple molecular dimensions, and multi-omics analysis could facilitate the systematic investigation of their pathogenesis. Vacher et al. used machine learning algorithms to build a classification model for patients with Alzheimer’s disease based on four individual omics domains (SNP, methylation, RNA and proteomics) and their combination. The evaluation suggested that integrating the four omics datasets gave better prediction performance than any individual dataset, demonstrating the feasibility of applying machine learning approaches to multi-omics datasets. The optimal features identified through the multi-omics analysis spanned all four omics categories and included features involved in neurodevelopmental pathways as well as uncharacterised features with unknown functions.
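One simple way to compare single-omics and combined models is early integration, in which the feature vectors of the layers are concatenated before training. The sketch below uses a nearest-centroid classifier as a stand-in, since the specific algorithms used by Vacher et al. are not detailed here. The data are hypothetical and deliberately constructed so that each layer misleads on one test sample; they illustrate the mechanics only and are not evidence for any result.

```python
from statistics import mean

LAYERS = ["snp", "methylation", "rna", "protein"]


def make(snp, methylation, rna, protein):
    """Build one sample with a single (hypothetical) feature per omics layer."""
    return {"snp": [snp], "methylation": [methylation],
            "rna": [rna], "protein": [protein]}


def concat(sample, layers):
    """Early integration: join the feature vectors of the chosen layers."""
    return [x for layer in layers for x in sample[layer]]


def fit_centroids(samples, labels, layers):
    """Compute one mean feature vector per class."""
    centroids = {}
    for label in set(labels):
        rows = [concat(s, layers) for s, y in zip(samples, labels) if y == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids


def predict(sample, centroids, layers):
    """Assign the class whose centroid is closest in squared Euclidean distance."""
    x = concat(sample, layers)
    return min(
        sorted(centroids),
        key=lambda label: sum((a - b) ** 2 for a, b in zip(x, centroids[label])),
    )


def accuracy(train, y_train, test, y_test, layers):
    centroids = fit_centroids(train, y_train, layers)
    hits = sum(predict(s, centroids, layers) == y for s, y in zip(test, y_test))
    return hits / len(test)


# Hypothetical toy data, not taken from the study.
train = [make(0.1, -0.1, 0.0, 0.1), make(-0.1, 0.1, 0.0, -0.1),
         make(0.9, 1.1, 1.0, 1.1), make(1.1, 0.9, 1.0, 0.9)]
y_train = ["control", "control", "AD", "AD"]
test = [make(0.2, 0.9, 0.9, 0.9), make(0.9, 0.2, 0.9, 0.9),
        make(0.1, 0.1, 0.8, 0.1), make(0.1, 0.1, 0.1, 0.8)]
y_test = ["AD", "AD", "control", "control"]

single = {layer: accuracy(train, y_train, test, y_test, [layer]) for layer in LAYERS}
combined = accuracy(train, y_train, test, y_test, LAYERS)
print(single, combined)
```

Concatenation is only one integration strategy; it was chosen here because it is the shortest to write down, and real analyses typically also need scaling, feature selection and cross-validation.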

Association rule mining is a powerful tool for elucidating directional relationships among genes and discovering the rules most relevant to disease status. Mallik et al. reported MOOVARM (multi-objective optimized variable cutoff-based association rule mining), a framework for identifying the top rules from multi-omics datasets according to multiple, dynamic support, confidence and lift thresholds estimated from the integrative analysis of the data. They tested the new method on three different types of omics datasets: gene expression and DNA methylation in high-grade soft tissue sarcomas, together with protein-protein interaction data. The top-ranked optimised rule yielded a signature of three genes (STAT3, TP53, MAPK3), suggesting a potential directional regulatory role for them in the pathogenesis of the disease. The top ten rules identified through MOOVARM produced the best overall classification accuracy compared with those identified by two other methods, Apriori and Eclat.
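The three quantities that MOOVARM thresholds on (support, confidence and lift) have standard definitions, which the following sketch computes for a single rule. Each "transaction" is assumed to be the set of genes flagged in one sample. The toy transactions, the direction of the example rule and the fixed cutoffs are hypothetical; in particular, MOOVARM estimates its thresholds from the data, which this sketch does not attempt.

```python
def rule_metrics(transactions, antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent.

    transactions: list of sets, e.g. the genes flagged in each sample.
    """
    a, c = set(antecedent), set(consequent)
    n = len(transactions)
    n_a = sum(a <= t for t in transactions)         # samples containing the antecedent
    n_c = sum(c <= t for t in transactions)         # samples containing the consequent
    n_ac = sum((a | c) <= t for t in transactions)  # samples containing both
    support = n_ac / n
    confidence = n_ac / n_a if n_a else 0.0
    lift = confidence / (n_c / n) if n_c else 0.0
    return support, confidence, lift


def passes(metrics, min_support, min_confidence, min_lift):
    """Keep a rule only if it clears all three (here fixed, hypothetical) cutoffs."""
    support, confidence, lift = metrics
    return support >= min_support and confidence >= min_confidence and lift >= min_lift


# Hypothetical toy transactions; the rule direction is chosen for illustration only.
samples = [
    {"STAT3", "TP53", "MAPK3"},
    {"STAT3", "TP53", "MAPK3"},
    {"STAT3", "TP53"},
    {"MAPK3"},
    {"STAT3"},
]

metrics = rule_metrics(samples, {"STAT3", "TP53"}, {"MAPK3"})
print(metrics, passes(metrics, 0.3, 0.6, 1.0))
```

A lift above 1 indicates that the consequent appears more often alongside the antecedent than it would if the two were independent, which is why lift is commonly used alongside support and confidence when ranking rules.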

To provide guidance on choosing appropriate deconvolution methods, Slabowska et al. evaluated the performance of three major methods, namely Cell2location, RCTD, and spatialDWLS, on spatial transcriptomic data from patients with cardiovascular disease and chronic kidney disease, based on comparison with annotations provided by histologists. All three methods achieved similar accuracies, and all performed poorly on certain cell types, such as endothelial cells. The running time of Cell2location for deconvolution was much longer than that of the other two methods, and Cell2location was able to generate consistent deconvolution results at a smaller reference size. Among the three pairwise comparisons, Cell2location (C2L) versus RCTD showed greater similarity for a number of cell types.
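A pairwise comparison of deconvolution outputs can be sketched as a per-cell-type correlation between the proportions that two methods estimate across the same spots. The similarity measure actually used by Slabowska et al. is not specified here, so Pearson correlation is an assumption, and the method labels, cell types other than endothelial cells, and proportion values are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (NaN if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return float("nan")
    return cov / sqrt(vx * vy)


def per_cell_type_similarity(props_a, props_b):
    """Correlate estimated proportions per cell type shared by both methods.

    props_*: dict mapping cell type -> list of proportions, one per spot,
    with spots in the same order for both methods.
    """
    return {ct: pearson(props_a[ct], props_b[ct]) for ct in props_a if ct in props_b}


# Hypothetical outputs of two unnamed methods over four spots.
method_a = {"fibroblast": [0.1, 0.2, 0.3, 0.4], "endothelial": [0.4, 0.3, 0.2, 0.1]}
method_b = {"fibroblast": [0.2, 0.4, 0.6, 0.8], "endothelial": [0.1, 0.2, 0.3, 0.4]}

similarity = per_cell_type_similarity(method_a, method_b)
print(similarity)
```

Reporting the similarity per cell type, rather than as a single pooled number, makes it visible when two methods agree on some cell types but diverge on others.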

The research papers in the Research Topic highlight the remarkable achievements in the field and the need for further and continuous improvement in technologies and methodologies for investigating multi-omics data. Integrating different types of complex omics data can revolutionise the current approach to healthcare, enabling more precise and effective interventions for many diseases. In conclusion, this Research Topic showcases the recent developments and promising implementations of bioinformatic methods in multi-omics studies on various human disease types.

Author contributions

DW: Writing–original draft, Writing–review and editing. GA: Writing–review and editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. DW is supported by the Taishan Scholars Program of Shandong Province (tsqn202312110).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Ballard, J. L., Wang, Z., Li, W., Shen, L., and Long, Q. (2024). Deep learning-based approaches for multi-omics data integration and analysis. BioData Min. 17 (1), 38. doi:10.1186/s13040-024-00391-z

Bressan, D., Battistoni, G., and Hannon, G. J. (2023). The dawn of spatial omics. Science 381 (6657), eabq4964. doi:10.1126/science.abq4964

Hasin, Y., Seldin, M., and Lusis, A. (2017). Multi-omics approaches to disease. Genome Biol. 18 (1), 83. doi:10.1186/s13059-017-1215-1

Lunke, S., Bouffler, S. E., Patel, C. V., Sandaradura, S. A., Wilson, M., Pinner, J., et al. (2023). Integrated multi-omics for rapid rare disease diagnosis on a national scale. Nat. Med. 29 (7), 1681–1691. doi:10.1038/s41591-023-02401-9

Nativio, R., Lan, Y., Donahue, G., Sidoli, S., Berson, A., Srinivasan, A. R., et al. (2020). An integrated multi-omics approach identifies epigenetic alterations associated with Alzheimer's disease. Nat. Genet. 52 (10), 1024–1035. doi:10.1038/s41588-020-0696-0

Reel, P. S., Reel, S., Pearson, E., Trucco, E., and Jefferson, E. (2021). Using machine learning approaches for multi-omics data analysis: a review. Biotechnol. Adv. 49, 107739. doi:10.1016/j.biotechadv.2021.107739

Sammut, S. J., Crispin-Ortuzar, M., Chin, S. F., Provenzano, E., Bardwell, H. A., Ma, W., et al. (2022). Multi-omic machine learning predictor of breast cancer therapy response. Nature 601 (7894), 623–629. doi:10.1038/s41586-021-04278-5

Tyler, S. R., and Bunyavanich, S. (2019). Leveraging -omics for asthma endotyping. J. Allergy Clin. Immunol. 144 (1), 13–23. doi:10.1016/j.jaci.2019.05.015

Keywords: multi-omics, human disease, data integration, bioinformatics, data analysis

Citation: Wang D and Agapito G (2025) Editorial: Multi-omics approaches in the study of human disease mechanisms. Front. Bioinform. 4:1546680. doi: 10.3389/fbinf.2024.1546680

Received: 17 December 2024; Accepted: 23 December 2024;
Published: 07 January 2025.

Edited and reviewed by:

Zhi-Ping Liu, Shandong University, China

Copyright © 2025 Wang and Agapito. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dapeng Wang, dapeng.wang@upc.edu.cn

^†ORCID: Dapeng Wang, orcid.org/0000-0002-9925-4574; Giuseppe Agapito, orcid.org/0000-0003-2868-7732