CANGS: a user-friendly utility for processing and analyzing 454 GS-FLX data in biodiversity studies.

Ram Vinay Pandey, Viola Nolte, Christian Schlötterer.

Abstract

BACKGROUND: Next generation sequencing (NGS) technologies have substantially increased sequence output while dramatically reducing costs. In addition to its use in whole genome sequencing, the 454 GS-FLX platform is becoming a widely used tool for biodiversity surveys based on amplicon sequencing. To use NGS for biodiversity surveys, software tools are required that perform quality control, trimming of the sequence reads, removal of PCR primers, and generation of input files for downstream analyses. A user-friendly software utility that carries out these steps is still lacking.
FINDINGS: We developed CANGS (Cleaning and Analyzing Next Generation Sequences), a flexible and user-friendly integrated software utility designed for amplicon-based biodiversity surveys using the 454 sequencing platform. CANGS filters low-quality sequences, removes PCR primers, filters singletons, identifies barcodes, and generates input files for downstream analyses. The downstream analyses rely either on third-party software (e.g., rarefaction analyses) or on CANGS-specific scripts. The latter include modules that link each 454 sequence to the name of the closest taxonomic reference retrieved from the NCBI database and report the sequence divergence between them. The software can be easily adapted to handle sequencing projects with different amplicon sizes, primer sequences, and quality thresholds, which makes it especially useful for non-bioinformaticians.
CONCLUSION: CANGS clips PCR primers, filters low-quality sequences, links sequences to NCBI taxonomy, and provides input files for common rarefaction analysis programs. CANGS is written in Perl, runs on Mac OS X and Linux, and is available at http://i122server.vu-wien.ac.at/pop/software.html.
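
The cleaning steps named in the abstract (barcode identification, primer clipping, quality filtering, and singleton removal) can be illustrated with a minimal sketch. This is not the CANGS implementation, which is written in Perl. It is a Python illustration in which every function name, the default thresholds (min_len, min_qual, max_n), and the assumed read layout (barcode first, then primer) are hypothetical choices made for the example.

```python
from collections import Counter


def assign_barcode(seq, barcodes):
    """Return (sample, remainder) if seq starts with a known barcode, else (None, seq).

    `barcodes` maps a hypothetical sample name to its barcode tag.
    """
    for sample, tag in barcodes.items():
        if seq.startswith(tag):
            return sample, seq[len(tag):]
    return None, seq


def clip_primer(seq, primer, max_mismatches=0):
    """Remove the primer from the 5' end; return None if it is not found there."""
    if len(seq) < len(primer):
        return None
    # zip stops at the shorter input, so only the first len(primer) bases are compared.
    mismatches = sum(1 for a, b in zip(seq, primer) if a != b)
    if mismatches > max_mismatches:
        return None
    return seq[len(primer):]


def passes_quality(seq, quals, min_len=50, min_qual=20, max_n=0):
    """Keep a read that is long enough, has few ambiguous bases, and has no low-quality base."""
    if len(seq) < min_len:
        return False
    if seq.upper().count("N") > max_n:
        return False
    return all(q >= min_qual for q in quals)


def drop_singletons(seqs):
    """Discard sequences that occur exactly once in the data set."""
    counts = Counter(seqs)
    return [s for s in seqs if counts[s] > 1]


def clean_reads(reads, primer, **thresholds):
    """Clip the primer, apply the quality filter, then remove singletons.

    `reads` is an iterable of (sequence, list of per-base quality scores) pairs.
    """
    kept = []
    for seq, quals in reads:
        clipped = clip_primer(seq, primer)
        if clipped is None:
            continue
        clipped_quals = quals[len(primer):]
        if passes_quality(clipped, clipped_quals, **thresholds):
            kept.append(clipped)
    return drop_singletons(kept)
```

In this sketch, assign_barcode would be called on each raw read before clean_reads, so that reads are first split by sample and then cleaned per sample. The thresholds are passed as keyword arguments to mirror the abstract's point that amplicon size, primer sequence, and quality cutoffs should be adjustable per project.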

Year:  2010        PMID: 20180949      PMCID: PMC2830946          DOI: 10.1186/1756-0500-3-3

Source DB:  PubMed          Journal:  BMC Res Notes        ISSN: 1756-0500

