Literature DB >> 33580619

eDNAFlow, an automated, reproducible and scalable workflow for analysis of environmental DNA sequences exploiting Nextflow and Singularity.

Mahsa Mousavi-Derazmahalleh1,2,3, Audrey Stott4, Rose Lines1,2,3, Georgia Peverley3, Georgia Nester1, Tiffany Simpson1,3, Michal Zawierta5, Marco De La Pierre4, Michael Bunce1,6, Claus T Christophersen1,2,7.   

Abstract

Metabarcoding of environmental DNA (eDNA) when coupled with high throughput sequencing is revolutionising the way biodiversity can be monitored across a wide range of applications. However, the large number of tools deployed in downstream bioinformatic analyses often places a challenge in configuration and maintenance of a workflow, and consequently limits the research reproducibility. Furthermore, scalability needs to be considered to handle the growing amount of data due to increase in sequence output and the scale of project. Here, we describe eDNAFlow, a fully automated workflow that employs a number of state-of-the-art applications to process eDNA data from raw sequences (single-end or paired-end) to generation of curated and noncurated zero-radius operational taxonomic units (ZOTUs) and their abundance tables. This pipeline is based on Nextflow and Singularity which enable a scalable, portable and reproducible workflow using software containers on a local computer, clouds and high-performance computing (HPC) clusters. Finally, we present an in-house Python script to assign taxonomy to ZOTUs based on user specified thresholds for assigning lowest common ancestor (LCA). We demonstrate the utility and efficiency of the pipeline using an example of a published coral diversity biomonitoring study. Our results were congruent with the aforementioned study. The scalability of the pipeline is also demonstrated through analysis of a large data set containing 154 samples. To our knowledge, this is the first automated bioinformatic pipeline for eDNA analysis using two powerful tools: Nextflow and Singularity. This pipeline addresses two major challenges in the analysis of eDNA data; scalability and reproducibility.
© 2021 John Wiley & Sons Ltd.

Keywords:  Nextflow; Singularity; environmental DNA; metabarcoding

Year:  2021        PMID: 33580619     DOI: 10.1111/1755-0998.13356

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  4 in total

1.  Ancient plant DNA reveals High Arctic greening during the Last Interglacial.

Authors:  Sarah E Crump; Bianca Fréchette; Matthew Power; Sam Cutler; Gregory de Wet; Martha K Raynolds; Jonathan H Raberg; Jason P Briner; Elizabeth K Thomas; Julio Sepúlveda; Beth Shapiro; Michael Bunce; Gifford H Miller
Journal:  Proc Natl Acad Sci U S A       Date:  2021-03-30       Impact factor: 11.205

2.  Biomonitoring via DNA metabarcoding and light microscopy of bee pollen in rainforest transformation landscapes of Sumatra.

Authors:  Carina Carneiro de Melo Moura; Christina A Setyaningsih; Kevin Li; Miryam Sarah Merk; Sonja Schulze; Rika Raffiudin; Ingo Grass; Hermann Behling; Teja Tscharntke; Catrin Westphal; Oliver Gailing
Journal:  BMC Ecol Evol       Date:  2022-04-26

3.  Rainfall as a trigger of ecological cascade effects in an Australian groundwater ecosystem.

Authors:  Mattia Saccò; Alison J Blyth; William F Humphreys; Steven J B Cooper; Nicole E White; Matthew Campbell; Mahsa Mousavi-Derazmahalleh; Quan Hua; Debashish Mazumder; Colin Smith; Christian Griebler; Kliti Grice
Journal:  Sci Rep       Date:  2021-02-12       Impact factor: 4.379

4.  Secondary predation constrains DNA-based diet reconstruction in two threatened shark species.

Authors:  Mark de Bruyn; Matteo Barbato; Joseph D DiBattista; Matt K Broadhurst
Journal:  Sci Rep       Date:  2021-09-15       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.