Literature DB >> 30782112

SLIM: a flexible web application for the reproducible processing of environmental DNA metabarcoding data.

Yoann Dufresne1,2, Franck Lejzerowicz1,3, Laure Apotheloz Perret-Gentil1, Jan Pawlowski1, Tristan Cordier4.   

Abstract

BACKGROUND: High-throughput amplicon sequencing of environmental DNA (eDNA metabarcoding) has become a routine tool for biodiversity survey and ecological studies. By including sample-specific tags in the primers prior PCR amplification, it is possible to multiplex hundreds of samples in a single sequencing run. The analysis of millions of sequences spread into hundreds to thousands of samples prompts for efficient, automated yet flexible analysis pipelines. Various algorithms and software have been developed to perform one or multiple processing steps, such as paired-end reads assembly, chimera filtering, Operational Taxonomic Unit (OTU) clustering and taxonomic assignment. Some of these software are now well established and widely used by scientists as part of their workflow. Wrappers that are capable to process metabarcoding data from raw sequencing data to annotated OTU-to-sample matrix were also developed to facilitate the analysis for non-specialist users. Yet, most of them require basic bioinformatic or command-line knowledge, which can limit the accessibility to such integrative toolkits. Furthermore, for flexibility reasons, these tools have adopted a step-by-step approach, which can prevent an easy automation of the workflow, and hence hamper the analysis reproducibility.
RESULTS: We introduce SLIM, an open-source web application that simplifies the creation and execution of metabarcoding data processing pipelines through an intuitive Graphic User Interface (GUI). The GUI interact with well-established software and their associated parameters, so that the processing steps are performed seamlessly from the raw sequencing data to an annotated OTU-to-sample matrix. Thanks to a module-centered organization, SLIM can be used for a wide range of metabarcoding cases, and can also be extended by developers for custom needs or for the integration of new software. The pipeline configuration (i.e. the modules chaining and all their parameters) is stored in a file that can be used for reproducing the same analysis.
CONCLUSION: This web application has been designed to be user-friendly for non-specialists yet flexible with advanced settings and extensibility for advanced users and bioinformaticians. The source code along with full documentation is available on the GitHub repository ( https://github.com/yoann-dufresne/SLIM ) and a demonstration server is accessible through the application website ( https://trtcrd.github.io/SLIM/ ).

Entities:  

Keywords:  Amplicon sequencing; High-throughput sequencing; Molecular ecology; Pipeline; Reproducibility; eDNA metabarcoding

Mesh:

Year:  2019        PMID: 30782112      PMCID: PMC6381720          DOI: 10.1186/s12859-019-2663-2

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  8 in total

1.  APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data.

Authors:  Dominik Buchner; Till-Hendrik Macher; Florian Leese
Journal:  Bioinformatics       Date:  2022-10-14       Impact factor: 6.931

2.  Applying convolutional neural networks to speed up environmental DNA annotation in a highly diverse ecosystem.

Authors:  Benjamin Flück; Laëtitia Mathon; Stéphanie Manel; Alice Valentini; Tony Dejean; Camille Albouy; David Mouillot; Wilfried Thuiller; Jérôme Murienne; Sébastien Brosse; Loïc Pellissier
Journal:  Sci Rep       Date:  2022-06-17       Impact factor: 4.996

3.  High-throughput DNA barcoding of oligochaetes for abundance-based indices to assess the biological quality of sediments in streams and lakes.

Authors:  Régis Vivien; Laure Apothéloz-Perret-Gentil; Jan Pawlowski; Inge Werner; Michel Lafont; Benoit J D Ferrari
Journal:  Sci Rep       Date:  2020-02-06       Impact factor: 4.379

Review 4.  Environmental DNA and RNA as Records of Human Exposome, Including Biotic/Abiotic Exposures and Its Implications in the Assessment of the Role of Environment in Chronic Diseases.

Authors:  Indu Shekhar Thakur; Deodutta Roy
Journal:  Int J Mol Sci       Date:  2020-07-10       Impact factor: 5.923

5.  Cascabel: A Scalable and Versatile Amplicon Sequence Data Analysis Pipeline Delivering Reproducible and Documented Results.

Authors:  Alejandro Abdala Asbun; Marc A Besseling; Sergio Balzano; Judith D L van Bleijswijk; Harry J Witte; Laura Villanueva; Julia C Engelmann
Journal:  Front Genet       Date:  2020-11-20       Impact factor: 4.599

6.  Monitoring newt communities in urban area using eDNA metabarcoding.

Authors:  Léo Charvoz; Laure Apothéloz-Perret-Gentil; Emanuela Reo; Jacques Thiébaud; Jan Pawlowski
Journal:  PeerJ       Date:  2021-11-26       Impact factor: 2.984

7.  Planktonic foraminifera genomic variations reflect paleoceanographic changes in the Arctic: evidence from sedimentary ancient DNA.

Authors:  Joanna Pawłowska; Jutta E Wollenburg; Marek Zajączkowski; Jan Pawlowski
Journal:  Sci Rep       Date:  2020-09-15       Impact factor: 4.379

8.  Patterns of eukaryotic diversity from the surface to the deep-ocean sediment.

Authors:  Tristan Cordier; Inès Barrenechea Angeles; Nicolas Henry; Franck Lejzerowicz; Cédric Berney; Raphaël Morard; Angelika Brandt; Marie-Anne Cambon-Bonavita; Lionel Guidi; Fabien Lombard; Pedro Martinez Arbizu; Ramon Massana; Covadonga Orejas; Julie Poulain; Craig R Smith; Patrick Wincker; Sophie Arnaud-Haond; Andrew J Gooday; Colomban de Vargas; Jan Pawlowski
Journal:  Sci Adv       Date:  2022-02-04       Impact factor: 14.136

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.