The final, formatted version of the article will be published soon.
ORIGINAL RESEARCH article
Front. Ecol. Evol.
Sec. Paleoecology
Volume 13 - 2025 |
doi: 10.3389/fevo.2025.1459690
Functional annotation of eukaryotic genes from sedimentary ancient DNA
Provisionally accepted- 1 Polar Terrestrial Environmental Systems, Department of Earth Sciences, Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research (AWI), Potsdam, Brandenburg, Germany
- 2 Institute for Biochemistry and Biology, University of Potsdam, Potsdam, Brandenburg, Germany
- 3 Institute of Enviornmental Science and Geography, Faculty of Mathematics and Natural Sciences, University of Potsdam, Potsdam, Germany
- 4 Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research, Computing and Data Centre, Bremerhaven, Germany
Sedimentary ancient DNA (sedaDNA) provides valuable insights into past ecosystems, yet its functional diversity has remained unexplored due to potential limitations in gene annotation for short-read data. Eukaryotes, especially, are typically underrepresented and have low coverage in complex metagenomic datasets from sediments. In this study, we evaluate the potential of eukaryotic gene annotation in sedimentary ancient DNA (sedaDNA) time-series data covering the last 23,000 years. We compared four gene annotation pipelines (GAPs) that apply Prodigal (ProkGAP) and MetaEuk (EukGAP) with and without taxonomic pre-classification. We identify ProkGAP as the pipeline which recovers the largest gene catalog with 6,568,483 functional genes and the highest number of eukaryotic functional genes (5,895 unique KEGG orthologs). Our findings show that ProkGAP, originally invented for prokaryotic gene prediction, yields the largest share of functional genes among all GAPs tested. At the same time, it allows the analysis of prokaryotic and eukaryotic gene functions in parallel and predicts most gene diversity. Interestingly, in our time-series data the gene catalog size and diversity show an increasing trend towards recent times indicating a more complex eukaryotic functional community during the Holocene. However, all gene annotation pipelines are limited by incomplete functional reference databases, which hamper the link between taxonomic-functional relationships when considering lower taxonomic levels. Future research on functional gene prediction from short read sedaDNA data should focus on expanding the eukaryotic databases and increasing sequencing depth to explore eukaryotic and prokaryotic functional composition and diversity in past ecosystems and their relationships to environmental change.
Keywords: Eukaryotes, functional genes, Paleoecosystem, paleo metagenomics, Sedimentary ancient DNA (sedaDNA)
Received: 04 Jul 2024; Accepted: 24 Jan 2025.
Copyright: © 2025 Cabuk, Herzschuh, Harms, von Hippel and Stoof-Leichsenring. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Kathleen Stoof-Leichsenring, Polar Terrestrial Environmental Systems, Department of Earth Sciences, Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research (AWI), Potsdam, 14401, Brandenburg, Germany
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.