| Literature DB >> 34002951 |
Laetitia Mathon1,2, Alice Valentini2, Pierre-Edouard Guérin1, Eric Normandeau3, Cyril Noel4, Clément Lionnet5, Emilie Boulanger1,6, Wilfried Thuillier5, Louis Bernatchez3, David Mouillot6,7, Tony Dejean2, Stéphanie Manel1.
Abstract
Bioinformatic analysis of eDNA metabarcoding data is crucial toward rigorously assessing biodiversity. Many programs are now available for each step of the required analyses, but their relative abilities at providing fast and accurate species lists have seldom been evaluated. We used simulated mock communities and real fish eDNA metabarcoding data to evaluate the performance of 13 bioinformatic programs and pipelines to retrieve fish occurrence and read abundance using the 12S mt rRNA gene marker. We used four indices to compare the outputs of each program with the simulated samples: sensitivity, F-measure, root-mean-square error (RMSE) on read relative abundances, and execution time. We found marked differences among programs only for the taxonomic assignment step, both in terms of sensitivity, F-measure and RMSE. Running time was highly different between programs for each step. The fastest programs with best indices for each step were assembled into a pipeline. We compare this pipeline to pipelines constructed from existing toolboxes (OBITools, Barque, and QIIME 2). Our pipeline and Barque obtained the best performance for all indices and appear to be better alternatives to highly used pipelines for analyzing fish eDNA metabarcoding data with a complete reference database. Real eDNA metabarcoding data also indicated differences for taxonomic assignment and execution time only. This study reveals major differences between programs during the taxonomic assignment step. The choice of algorithm for the taxonomic assignment can have a significant impact on diversity estimates and should be made according to the objectives of the study. This article is protected by copyright. All rights reserved.Keywords: benchmark; bioinformatics; eDNA; metabarcoding; sensitivity; species identification
Year: 2021 PMID: 34002951 DOI: 10.1111/1755-0998.13430
Source DB: PubMed Journal: Mol Ecol Resour ISSN: 1755-098X Impact factor: 7.090