To denoise or to cluster, that is not the question: optimizing pipelines for COI metabarcoding and metaphylogeography.

Adrià Antich, Creu Palacín, Owen S Wangensteen, Xavier Turon.

Abstract

BACKGROUND: The recent blooming of metabarcoding applications to biodiversity studies comes with some relevant methodological debates. One such issue concerns the treatment of reads by denoising or by clustering methods, which have been wrongly presented as alternatives. It has also been suggested that denoised sequence variants should replace clusters as the basic unit of metabarcoding analyses, missing the fact that sequence clusters are a proxy for species-level entities, the basic unit in biodiversity studies. We argue here that methods developed and tested for ribosomal markers have been uncritically applied to highly variable markers such as cytochrome oxidase I (COI) without conceptual or operational (e.g., parameter setting) adjustment. COI has a naturally high intraspecies variability that should be assessed and reported, as it is a source of highly valuable information. We contend that denoising and clustering are not alternatives. Rather, they are complementary and both should be used together in COI metabarcoding pipelines.
RESULTS: Using a COI dataset from benthic marine communities, we compared two denoising procedures (based on the UNOISE3 and the DADA2 algorithms), set suitable parameters for denoising and clustering, and applied these steps in different orders. Our results indicated that the UNOISE3 algorithm preserved a higher intra-cluster variability. We introduce the program DnoisE to implement the UNOISE3 algorithm taking into account the natural variability (measured as entropy) of each codon position in protein-coding genes. This correction increased the number of sequences retained by 88%. The order of the steps (denoising and clustering) had little influence on the final outcome.
CONCLUSIONS: We highlight the need for combining denoising and clustering, with adequate choice of stringency parameters, in COI metabarcoding. We present a program that uses the coding properties of this marker to improve the denoising step. We recommend that researchers report their results in terms of both denoised sequences (a proxy for haplotypes) and clusters formed (a proxy for species), and that they avoid collapsing the sequences of each cluster into a single representative. This will allow studies at the cluster level (ideally equating to species-level diversity) and at the intra-cluster level, and will ease additivity and comparability between studies.
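
The entropy-based correction mentioned in the RESULTS can be illustrated with a minimal Python sketch. It is not the DnoisE implementation. It assumes aligned, equal-length reads in a known reading frame, averages per-site Shannon entropy over each codon position, and uses the abundance-skew threshold of the published UNOISE algorithm, beta(d) = 1 / 2^(alpha*d + 1). The function names, the `first_codon_pos` parameter, and the rescaling of the distance by 3 / (sum of the three entropies) are illustrative assumptions.

```python
import math
from collections import Counter


def site_entropy(column):
    """Shannon entropy (bits) of the nucleotides observed at one alignment site."""
    counts = Counter(column)
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def codon_position_entropy(seqs, first_codon_pos=1):
    """Mean site entropy for codon positions 1, 2 and 3.

    Assumes aligned sequences of equal length; first_codon_pos is the codon
    position (1-3) of the first base (hypothetical parameter).
    """
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    sums, sites = [0.0] * 3, [0] * 3
    for i in range(length):
        pos = (i + first_codon_pos - 1) % 3
        sums[pos] += site_entropy(s[i] for s in seqs)
        sites[pos] += 1
    return [sums[k] / sites[k] if sites[k] else 0.0 for k in range(3)]


def weighted_distance(a, b, entropies, first_codon_pos=1):
    """Mismatch count with each mismatch weighted by its codon-position entropy.

    Weights are rescaled so that equal entropies give the plain Hamming
    distance (an assumed normalisation).
    """
    total = sum(entropies)
    if total == 0:
        raise ValueError("at least one codon position must be variable")
    scale = 3 / total
    return sum(
        entropies[(i + first_codon_pos - 1) % 3] * scale
        for i, (x, y) in enumerate(zip(a, b))
        if x != y
    )


def max_abundance_ratio(d, alpha=2.0):
    """UNOISE threshold: a read at distance d from a more abundant read is
    treated as its error if abundance(read) / abundance(parent) <= this value."""
    return 1.0 / 2 ** (alpha * d + 1)


# Toy data (not from the study): variation only at third codon positions.
reads = ["ATGAAA", "ATGAAC", "ATGAAG", "ATGAAT"]
entropies = codon_position_entropy(reads)
d = weighted_distance(reads[0], reads[1], entropies)
threshold = max_abundance_ratio(d)
```

The threshold shrinks as the weighted distance grows, so the weighting changes which low-abundance reads are merged into a more abundant neighbour and which are retained as separate variants.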

Keywords:  COI; Clustering; Denoising; Metabarcoding; Metaphylogeography; Operational taxonomic units

Year:  2021        PMID: 33820526      PMCID: PMC8020537          DOI: 10.1186/s12859-021-04115-6

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  9 in total

1.  The critical role of natural history museums in advancing eDNA for biodiversity studies: a case study with Amazonian fishes.

Authors:  C David de Santana; Lynne R Parenti; Casey B Dillman; Jonathan A Coddington; Douglas A Bastos; Carole C Baldwin; Jansen Zuanon; Gislene Torrente-Vilara; Raphaël Covain; Naércio A Menezes; Aléssio Datovo; T Sado; M Miya
Journal:  Sci Rep       Date:  2021-09-13       Impact factor: 4.379

2.  Airborne environmental DNA for terrestrial vertebrate community monitoring.

Authors:  Christina Lynggaard; Mads Frost Bertelsen; Casper V Jensen; Matthew S Johnson; Tobias Guldberg Frøslev; Morten Tange Olsen; Kristine Bohmann
Journal:  Curr Biol       Date:  2022-01-06       Impact factor: 10.834

3.  Feces DNA analyses track the rehabilitation of a free-ranging beluga whale.

Authors:  Babett Günther; Eve Jourdain; Lindsay Rubincam; Richard Karoliussen; Sam L Cox; Sophie Arnaud Haond
Journal:  Sci Rep       Date:  2022-04-19       Impact factor: 4.996

4.  Coming of age for COI metabarcoding of whole organism community DNA: Towards bioinformatic harmonisation. (Review)

Authors:  Thomas J Creedy; Carmelo Andújar; Emmanouil Meramveliotakis; Victor Noguerales; Isaac Overcast; Anna Papadopoulou; Hélène Morlon; Alfried P Vogler; Brent C Emerson; Paula Arribas
Journal:  Mol Ecol Resour       Date:  2021-09-30       Impact factor: 8.678

5.  Estimating biodiversity across the tree of life on Mount Everest's southern flank with environmental DNA.

Authors:  Marisa C W Lim; Anton Seimon; Batya Nightingale; Charles C Y Xu; Stephan R P Halloy; Adam J Solon; Nicholas B Dragone; Steven K Schmidt; Alex Tait; Sandra Elvin; Aurora C Elmore; Tracie A Seimon
Journal:  iScience       Date:  2022-08-15

6.  MetaWorks: A flexible, scalable bioinformatic pipeline for high-throughput multi-marker biodiversity assessments.

Authors:  Teresita M Porter; Mehrdad Hajibabaei
Journal:  PLoS One       Date:  2022-09-29       Impact factor: 3.752

7.  Managing human-mediated range shifts: understanding spatial, temporal and genetic variation in marine non-native species.

Authors:  Luke E Holman; Shirley Parker-Nance; Mark de Bruyn; Simon Creer; Gary Carvalho; Marc Rius
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2022-01-24       Impact factor: 6.671

8.  DnoisE: distance denoising by entropy. An open-source parallelizable alternative for denoising sequence datasets.

Authors:  Adrià Antich; Creu Palacín; Xavier Turon; Owen S Wangensteen
Journal:  PeerJ       Date:  2022-01-19       Impact factor: 2.984

9.  Establishment, Genetic Diversity, and Habitat Suitability of Aedes albopictus Populations from Ecuador.

Authors:  Andrés Carrazco-Montalvo; Patricio Ponce; Stephany D Villota; Emmanuelle Quentin; Sofía Muñoz-Tobar; Josefina Coloma; Varsovia Cevallos
Journal:  Insects       Date:  2022-03-19       Impact factor: 2.769
