Skip to main content

EDITORIAL article

Front. Genet., 29 November 2023
Sec. Computational Genomics
This article is part of the Research Topic Computational Mechanism of Genetic/Evolutionary Operator and Optimizations in Genomic Data Applications View all 5 articles

Editorial: Computational mechanism of genetic/evolutionary operator and optimizations in genomic data applications

  • 1School of Biomedical Engineering, Guangzhou Medical University, Guangzhou, China
  • 2School of Information Engineering, Ningxia University, Yinchuan, China

Introduction

The exponential growth of genomic data, driven by advancements in high-throughput sequencing technologies, has precipitated the need for innovative data management solutions (Vandereyken et al., 2023). The challenge extends beyond mere storage to encompass the swift transmission and processing of large datasets, which are essential for timely and effective data analysis and interpretation (Caudai et al., 2021; Yan et al., 2022). The principles of genetics and evolutionary theory have fundamentally explained the emergence and progression of the biological realm, bringing transformative concepts of growth and variation to biology (Moczek et al., 2015). This evolutionary perspective has not only accelerated advancements in genetic research but also significantly propelled other scientific fields forward (Shi et al., 2021; Diaz-Flores et al., 2022). Motivated by genetic and evolutionary theories, researchers have developed numerous computational strategies rooted in genetic and evolutionary operations and stochastic search techniques (Ünal and Başçiftçi, 2022).

In recent times, the application of genomic data has encountered optimization challenges where conventional mathematical approaches fall short (Zhou et al., 2019). Genetic and evolutionary algorithms stand apart from traditional calculus-based and exhaustive methods due to their ability to achieve global optimization with remarkable robustness and broad applicability (Viriyasitavat et al., 2021; Shi et al., 2022). Characterized by self-organization, self-adaptation, and self-learning, these algorithms can adeptly handle complex optimizations irrespective of the problem’s nature (Gheibi et al., 2021; Liu et al., 2022). Genomic data analysis often deals with intricate patterns and complex regulations unsuitable for traditional optimization methods (Hassija et al., 2023; Xi et al., 2023). Integrating genetic and evolutionary algorithm-based complex optimizations in genomic analysis can alleviate bottlenecks in bioinformatics tasks (Xi et al., 2020a; Mandal et al., 2023). Therefore, this research theme focuses on exploring complex optimization challenges in genomic data applications using genetic and evolutionary algorithms (Jiao et al., 2023).

Advancements in genomic sequencing technology have led to an explosion of data, marking the advent of the genomics big data era (Xi et al., 2020b). This surge brings both possibilities and challenges, especially in terms of data storage, processing, and interpretation (Ahmed et al., 2022). The articles in this special edition, titled “Computational Mechanism of Genetic/Evolutionary Operator and Optimizations in Genomic Data Applications,” collectively tackle these issues through cutting-edge computational methods. They delve into the complexities of genomic data and introduce innovative ways to optimize its application across various biological and clinical settings.

Optimizing genomic data storage and processing

In this Research Topic, the paper titled “Enhancing Genomic Mutation Data Storage Optimization based on the Compression of Asymmetry of Sparsity” tackles the formidable challenge of managing the deluge of genomic data. It presents a novel compression algorithm, CA_SAGM, specifically designed for sparse asymmetric gene mutations (Ding et al.). This development is particularly pertinent for massive genomic databases like The Cancer Genome Atlas (TCGA), where efficient data handling is paramount. The study’s comparative analysis of CA_SAGM with other algorithms underscores the critical role of data compression in navigating the complexities of large-scale genomic datasets.

Advancing genomic research through computational estimation techniques

Building on the theme of computational innovation, the paper “A Noise-tolerance Learning Method for Efficiently Estimating Open Chromatin Regions via cfDNA sequencing data” focuses on open chromatin regions (OCRs), crucial for understanding cellular functions and gene expression (Ren et al.). The introduction of OCRFinder, a learning-based, noise-tolerant approach, marks a significant stride in addressing the dynamic challenges of chromatin accessibility in cfDNA-seq data. By integrating ensemble learning and semi-supervised strategies, this study exemplifies the importance of sophisticated computational methods in genomic research, especially in areas like chromatin accessibility.

Genomic data in clinical and prognostic applications

Shifting the spotlight to clinical implications, “Development and validation of focal adhesion-related genes signature in gastric cancer” illustrates the power of genomic data in disease prognosis (Zhao et al.). This paper offers a prognostic signature based on focal adhesion-related genes, showcasing how genomic data can be instrumental in identifying critical prognostic genes for gastric cancer. This research bridges computational methodologies and genomic insights, enhancing our understanding of cancer biology and providing valuable tools for cancer prognosis.

Exploring disease mechanisms through gene modification analysis

The issue concludes with “Comprehensive Analysis of Key m5C Modification-Related Genes in Type 2 Diabetes,” which delves into the role of 5-methylcytosine (m5C) RNA methylation in type 2 diabetes (T2D) (Song et al.). Employing a variety of computational techniques, including LASSO regression and Gene Set Enrichment Analysis, the study sheds light on the molecular mechanisms of T2D. It highlights potential biomarkers and therapeutic targets, demonstrating the utility of genomic data in deciphering the complexities of disease processes.

Collectively, these articles represent the multifaceted applications of computational techniques in genomic data analysis. From optimizing data storage and processing to enhancing disease prognosis and understanding molecular mechanisms, these studies underscore the transformative impact of computational methods in the era of genomic big data. The innovations and insights showcased in this Research Topic are set to significantly shape future research and applications in genomics, bridging computational prowess with biological discovery.

Author contributions

JX: Conceptualization, Funding acquisition, Writing–original draft, Writing–review and editing. ZY: Conceptualization, Funding acquisition, Writing–original draft, Writing–review and editing. WS: Conceptualization, Funding acquisition, Writing–original draft, Writing–review and editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work is supported partially by National Natural Science Foundation of China (Grant No. 62202117), partially by the Tertiary Education Scientific Research Project of Guangzhou Municipal Education Bureau (No. 202235388), partially by the Special Foundation in Department of Higher Education of Guangdong (Grant No. 2022ZDX 2053), partially by the Guangzhou Basic and Applied Basic Research Foundation (No. 2023A04J0386).

Acknowledgments

We would like to thank Dr. Janet Ajibade for her helpful suggestions on organizing this Research Topic.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Ahmed, I., Jeon, G., and Piccialli, F. (2022). From artificial intelligence to explainable artificial intelligence in industry 4.0: a survey on what, how, and where. IEEE Trans. Industrial Inf. 18, 5031–5042. doi:10.1109/tii.2022.3146552

CrossRef Full Text | Google Scholar

Caudai, C., Galizia, A., Geraci, F., Le Pera, L., Morea, V., Salerno, E., et al. (2021). Ai applications in functional genomics. Comput. Struct. Biotechnol. J. 19, 5762–5790. doi:10.1016/j.csbj.2021.10.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Diaz-Flores, E., Meyer, T., and Giorkallos, A. (2022). “Evolution of artificial intelligence-powered technologies in biomedical research and healthcare,” in Smart biolabs of the future (Springer), 23–60.

CrossRef Full Text | Google Scholar

Gheibi, O., Weyns, D., and Quin, F. (2021). Applying machine learning in self-adaptive systems: a systematic literature review. ACM Trans. Aut. Adapt. Syst. (TAAS) 15, 1–37. doi:10.1145/3469440

CrossRef Full Text | Google Scholar

Hassija, V., Chamola, V., Mahapatra, A., Singal, A., Goel, D., Huang, K., et al. (2023). Interpreting black-box models: a review on explainable artificial intelligence. Cogn. Comput., 1–30. doi:10.1007/s12559-023-10179-8

CrossRef Full Text | Google Scholar

Jiao, R., Nguyen, B. H., Xue, B., and Zhang, M. (2023). A survey on evolutionary multiobjective feature selection in classification: approaches, applications, and challenges. IEEE Trans. Evol. Comput., 1. doi:10.1109/tevc.2023.3292527

CrossRef Full Text | Google Scholar

Liu, Y., Wang, Q., and Xi, J. (2022). Deepda-ace: a novel domain adaptation method for species-specific acetylation site prediction. Mathematics 10, 2364. doi:10.3390/math10142364

CrossRef Full Text | Google Scholar

Mandal, A. K., Sarma, P. K. D., and Dehuri, S. (2023). A study of bio-inspired computing in bioinformatics: a state-of-the-art literature survey. Open Bioinforma. J. 16. doi:10.2174/18750362-v16-e230517-2022-17

CrossRef Full Text | Google Scholar

Moczek, A. P., Sears, K. E., Stollewerk, A., Wittkopp, P. J., Diggle, P., Dworkin, I., et al. (2015). The significance and scope of evolutionary developmental biology: a vision for the 21st century. Evol. Dev. 17, 198–219. doi:10.1111/ede.12125

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi, W., Chen, W.-N., Kwong, S., Zhang, J., Wang, H., Gu, T., et al. (2021). A coevolutionary estimation of distribution algorithm for group insurance portfolio. IEEE Trans. Syst. Man, Cybern. Syst. 52, 6714–6728. doi:10.1109/tsmc.2021.3096013

CrossRef Full Text | Google Scholar

Shi, W., Hu, X.-M., and Chen, W.-N. (2022). An estimation of distribution algorithm with clustering for scenario-based robust financial optimization. Complex & Intelligent Syst. 8, 3989–4003. doi:10.1007/s40747-021-00640-2

CrossRef Full Text | Google Scholar

Ünal, H. T., and Başçiftçi, F. (2022). Evolutionary design of neural network architectures: a review of three decades of research. Artif. Intell. Rev., 1–80.

Google Scholar

Vandereyken, K., Sifrim, A., Thienpont, B., and Voet, T. (2023). Methods and applications for single-cell and spatial multi-omics. Nat. Rev. Genet. 24, 494–515. doi:10.1038/s41576-023-00580-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Viriyasitavat, W., Da Xu, L., Dhiman, G., Sapsomboon, A., Pungpapong, V., and Bi, Z. (2021). Service workflow: state-of-the-art and future trends. IEEE Trans. Serv. Comput. 16, 757–772. doi:10.1109/tsc.2021.3121394

CrossRef Full Text | Google Scholar

Xi, J., Deng, Z., Liu, Y., Wang, Q., and Shi, W. (2023). Integrating multi-type aberrations from dna and rna through dynamic mapping gene space for subtype-specific breast cancer driver discovery. PeerJ 11, e14843. doi:10.7717/peerj.14843

PubMed Abstract | CrossRef Full Text | Google Scholar

Xi, J., Li, A., and Wang, M. (2020a). HetRCNA: a novel method to identify recurrent copy number alternations from heterogeneous tumor samples based on matrix decomposition framework. IEEE/ACM Trans. Comput. Biol. Bioinforma. 17, 422–434. doi:10.1109/TCBB.2018.2846599

PubMed Abstract | CrossRef Full Text | Google Scholar

Xi, J., Yuan, X., Wang, M., Li, A., Li, X., and Huang, Q. (2020b). Inferring subgroup-specific driver genes from heterogeneous cancer samples via subspace learning with subgroup indication. Bioinformatics 36, 1855–1863. doi:10.1093/bioinformatics/btz793

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, J., Xi, J., and Yu, Z. (2022). “A parametric model for clustering single-cell mutation data,” in 2022 IEEE international conference on bioinformatics and biomedicine (BIBM) (IEEE), 253–260.

CrossRef Full Text | Google Scholar

Zhou, T., Song, Z., and Sundmacher, K. (2019). Big data creates new opportunities for materials research: a review on methods and applications of machine learning for materials design. Engineering 5, 1017–1026. doi:10.1016/j.eng.2019.02.011

CrossRef Full Text | Google Scholar

Keywords: genetics, evolution, optimization, genomics, computational mechanism

Citation: Xi J, Yu Z and Shi W (2023) Editorial: Computational mechanism of genetic/evolutionary operator and optimizations in genomic data applications. Front. Genet. 14:1343199. doi: 10.3389/fgene.2023.1343199

Received: 23 November 2023; Accepted: 23 November 2023;
Published: 29 November 2023.

Edited and reviewed by:

Quan Zou, University of Electronic Science and Technology of China, China

Copyright © 2023 Xi, Yu and Shi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jianing Xi, xjn@gzhmu.edu.cn; Zhenhua Yu, zhyu@nxu.edu.cn; Wen Shi, shiwen@gzhmu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.