Skip to main content

EDITORIAL article

Front. Genet., 01 July 2021
Sec. Computational Genomics
This article is part of the Research Topic Cross-Domain Analysis for “All of Us” Precision Medicine View all 6 articles

Editorial: Cross-Domain Analysis for “All of Us” Precision Medicine

  • 1CAS Key Laboratory of Computational Biology, Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
  • 2Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
  • 3Department of Computer Science, Aberystwyth University, Aberystwyth, United Kingdom

Precision medicine aims to combat complex human diseases by providing preventative, diagnostic, and treatment tools for each patient rather than each cohort. Along with the development of precision medicine (formerly called personalized medicine) (Aronson and Rehm, 2015), the recent “All of Us” initiative seeks to gather data related to “lifestyle, environment, and biology” from a wide range of people in order to better understand individual differences and improve precision medicine (All of Us Research Program Investigators et al., 2019). “All of Us” precision medicine is seeking easy, low-cost, non-invasive, and effective early clinical detection approaches, which include novel biological technology (BT) and information technology (IT) in precision healthcare research. Importantly, many types/domains of data from new technologies can be utilized, such as voice/text recording, image scanning, behavioral monitoring, and feature phenotyping, in addition to omics data from high-throughput technologies (Zeng, 2021). These consist of cross-domain data, where the micro cross-domain can cover conventional omics data and the macro cross-domain can bridge the omics with other data types. Therefore, the cross-domain analysis is a key to understanding “All of Us” precision medicine.

In particular, the artificial intelligence (AI) model and method (e.g., deep learning) has achieved great success in analyzing image data in many fields and is bringing new opportunities to clinical research (Zeng et al., 2021). However, for cross-domain analysis in “All of Us” precision medicine, several challenges still exist: the numbers of variables in different domains are unbalanced, and the bias coming from the dominant domain should be tackled (Zhang et al., 2020); the number of observations is small or not large enough in clinical or biological fields, thus small-sample modeling is required (Lai et al., 2020); the samples in different domains may be not matched, so that heterogeneous integration should be considered (Yang et al., 2020); and many data organizations exist, so data without a vector or matrix form needs a reasonable and powerful transformation/representation (Wang et al., 2021).

This Research Topic serves as an update of our previous topic (Zeng et al., 2019), “Machine Learning Advanced Dynamic Omics Data Analysis for Precision Medicine,” with a focus on the application of artificial intelligence with cross-domain data for further developing “All of Us” precision medicine.

At present, we do not have a clear understanding of the molecular pathogenesis of patent ductus arteriosus (PDA). Chen et al. screened the rare copy number variants of 39 PDA patients and 100 healthy children by whole exome sequencing. This study was narrowed down by network and enrichment based analysis, and finally selected 7 candidate genes significantly expressed in human embryonic heart, which include CHRNA4 as potential genetic cause of PDA.

Yan et al. investigated the potential of serum miRNAs as signatures for pancreatic cancer (PC). First, they analyzed the expression differences of miRNAs between PC patients and healthy controls by feature selection approaches, obtaining a group of PC signatures. Next they compared miRNAs' expressions between inoperable and operable PC patients, identifying lots of operability miRNAs. These findings revealed a non-invasive way on the diagnosis and therapy of pancreatic cancer.

Xu et al. applied network-based bioinformatics analysis for reanalyzing temporal gene expression data related to Infantile hemangioma (IH), with the purpose of distinguishing genes for proliferative and regressive IH lesions. This analysis pipeline included Short Time-series Expression Miner, weighted gene co-expression network analysis, and area under curve ranking, which determined key hub genes which were verified using qRTPCR.

Restoration of altered transcriptional regulations in disease states should be an important action for the pathogen genes associated with disease diagnosis and prognosis (Lu et al., 2020). The quantitative relation between transcriptional regulation and disease phenotypes can help detect such targeted genes for transcriptional modification. Duan et al. proposed a model-based quantitative relationship measurement between mRNA (non-TF) genes and Transcription Factor (TF) genes. They further applied this method to detect quantitatively altered mRNA genes in lung cancer on the basis of one learning dataset and several independent evaluation datasets, which provided new candidates for lung cancer diagnosis, prognosis, and treatment.

Complex diseases are determined by many heterogeneous biological factors (Su et al., 2020), thus, omics integration analysis should help capture new prognostic signatures and models for disease prognosis. Ren et al. explored the key biomarkers of early-stage Lung adenocarcinoma and built a prognostic model on the combination of gene transcription and methylation. In the re-analysis of public RNA-seq and methylation microarray data, with a series of statistical, network-based and machine learning approaches, a prognostic index based on 17 methylation driver genes, and 4 CD8 T cell-related genes was established for predicting the survival risk of lung cancer patients in early-stage.

Finally, we sincerely thank the reviewers; their great efforts ensure the high quality of all articles in this topic. This Research Topic should bring more attention to the important topics of “All of Us” precision medicine based on machine learning and cross-domain data.

Author Contributions

TZ drafted the manuscript. TZ, TH, and CL revised the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This study was supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (No. XDB38050200), the National Natural Science Foundation of China (11871456), and the Shanghai Municipal Science and Technology Major Project (2017SHZDZX01).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

All of Us Research Program Investigators, Denny, J. C., Rutter, J. L., Goldstein, D. B., Philippakis, A., Smoller, J. W., and Jenkins, G, et al. (2019). The “All of Us” research program. N. Engl. J. Med. 381, 668–676. doi: 10.1056/NEJMsr1809937

CrossRef Full Text | Google Scholar

Aronson, S. J., and Rehm, H. L. (2015). Building the foundation for genomics in precision medicine. Nature 526, 336–342. doi: 10.1038/nature15816

PubMed Abstract | CrossRef Full Text | Google Scholar

Lai, N., Kan, M., Han, C., Song, X., and Shan, S. (2020). Learning to learn adaptive classifier-predictor for few-shot learning. IEEE Trans. Neural. Netw. Learn. Syst. doi: 10.1109/TNNLS.2020.3011526. [Epub ahead of print].

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, Y., Xue, J., Deng, T., Zhou, X., Yu, K., Deng, L., et al. (2020). Safety and feasibility of CRISPR-edited T cells in patients with refractory non-small-cell lung cancer. Nat. Med. 26, 732–740. doi: 10.1038/s41591-020-0840-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Su, Y., Chen, D., Yuan, D., Lausted, C., Choi, J., Dai, C. L., et al. (2020). Multi-omics resolves a sharp disease-state shift between mild and moderate COVID-19. Cell 183, 1479–1495 e1420. doi: 10.1016/j.cell.2020.10.037

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Ma, A., Chang, Y., Gong, J., Jiang, Y., Qi, R., et al. (2021). scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses. Nat. Commun. 12:1882. doi: 10.1038/s41467-021-22197-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, H., Cao, H., He, T., Wang, T., and Cui, Y. (2020). Multilevel heterogeneous omics data integration with kernel fusion. Brief Bioinform. 21, 156–170. doi: 10.1093/bib/bby115

PubMed Abstract | CrossRef Full Text | Google Scholar

Zeng, T. (2021). “Repurpose analysis expanding biomedical benefits by omics data integration,” in Systems Medicine, ed. O. Wolkenhauer (Oxford: Academic Press) 94–102. doi: 10.1016/B978-0-12-801238-3.11387-X

CrossRef Full Text

Zeng, T., Huang, T., and Lu, C. (2019). Editorial: Machine learning advanced dynamic omics data analysis for precision medicine. Front. Genet. 10:1343. doi: 10.3389/fgene.2019.01343

CrossRef Full Text | Google Scholar

Zeng, T., Yu, X., and Chen, Z. (2021). Applying artificial intelligence in the microbiome for gastrointestinal diseases: a review. J. Gastroenterol. Hepatol. 36, 832–840. doi: 10.1111/jgh.15503

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, M. L., Li, Y. K., Yang, H., and Liu, X. Y. (2020). Towards class-imbalance aware multi-label learning. IEEE Trans. Cybern. doi: 10.1109/TCYB.2020.3027509. [Epub ahead of print].

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: cross-domain, precision medicine, artificial intelligence, machine learning, omics

Citation: Zeng T, Huang T and Lu C (2021) Editorial: Cross-Domain Analysis for “All of Us” Precision Medicine. Front. Genet. 12:713771. doi: 10.3389/fgene.2021.713771

Received: 24 May 2021; Accepted: 07 June 2021;
Published: 01 July 2021.

Edited and reviewed by: Richard D. Emes, University of Nottingham, United Kingdom

Copyright © 2021 Zeng, Huang and Lu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tao Zeng, emVuZ3RhbyYjeDAwMDQwO3NpYnMuYWMuY24=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.