AUTHOR=Vasar Martti , Davison John , Moora Mari , Sepp Siim-Kaarel , Anslan Sten , Al-Quraishy Saleh , Bahram Mohammad , Bueno C. Guillermo , Cantero Juan José , Fabiano Ezequiel Chimbioputo , Decocq Guillaume , Drenkhan Rein , Fraser Lauchlan , Oja Jane , Garibay-Orijel Roberto , Hiiesalu Inga , Koorem Kadri , Mucina Ladislav , Öpik Maarja , Põlme Sergei , Pärtel Meelis , Phosri Cherdchai , Semchenko Marina , Vahter Tanel , Doležal Jiři , Palacios Aida M. Vasco , Tedersoo Leho , Zobel Martin TITLE=Metabarcoding of soil environmental DNA to estimate plant diversity globally JOURNAL=Frontiers in Plant Science VOLUME=14 YEAR=2023 URL=https://www.frontiersin.org/journals/plant-science/articles/10.3389/fpls.2023.1106617 DOI=10.3389/fpls.2023.1106617 ISSN=1664-462X ABSTRACT=Introduction

Traditional approaches to collecting large-scale biodiversity data pose huge logistical and technical challenges. We aimed to assess how a comparatively simple method based on sequencing environmental DNA (eDNA) characterises global variation in plant diversity and community composition compared with data derived from traditional plant inventory methods.

Methods

We sequenced a short fragment (P6 loop) of the chloroplast trnL intron from from 325 globally distributed soil samples and compared estimates of diversity and composition with those derived from traditional sources based on empirical (GBIF) or extrapolated plant distribution and diversity data.

Results

Large-scale plant diversity and community composition patterns revealed by sequencing eDNA were broadly in accordance with those derived from traditional sources. The success of the eDNA taxonomy assignment, and the overlap of taxon lists between eDNA and GBIF, was greatest at moderate to high latitudes of the northern hemisphere. On average, around half (mean: 51.5% SD 17.6) of local GBIF records were represented in eDNA databases at the species level, depending on the geographic region.

Discussion

eDNA trnL gene sequencing data accurately represent global patterns in plant diversity and composition and thus can provide a basis for large-scale vegetation studies. Important experimental considerations for plant eDNA studies include using a sampling volume and design to maximise the number of taxa detected and optimising the sequencing depth. However, increasing the coverage of reference sequence databases would yield the most significant improvements in the accuracy of taxonomic assignments made using the P6 loop of the trnL region.