Literature DB >> 20360363

Identifying and removing artificial replicates from 454 pyrosequencing data.

Tracy K Teal1, Thomas M Schmidt.   

Abstract

An intrinsic artifact of 454-based pyrosequencing leads to artificial overrepresentation of >10% of the original DNA sequencing templates. This artificial amplification of sequences is unbiased with regard to position on the pyrosequencing plate or sequence identity, and it occurs in all currently available 454 technologies. The amplified sequences start at the same position and are identical (duplicates), or vary in length, or contain a sequencing discrepancy. If the abundance of any sequence in a data set is going to be enumerated, either for comparative community analysis, transcriptional analysis or other applications, it is important to remove these artificial replicates before analysis. A web-based tool that incorporates the clustering algorithm cd-hit was developed to identify and remove artificially replicated sequences in 454-based pyrosequencing data sets. This tool cannot be used for data sets that have an initial amplification step before the standard pyrosequencing procedure, because artificial replicates cannot be distinguished from expected replication due to polymerase chain reaction (PCR) amplification, e.g., in sequencing of amplified gene "tags." This protocol provides details on how to use the replicate filter and obtain a file of unique sequences for use in metagenomic or transcriptomic analyses.

Mesh:

Year:  2010        PMID: 20360363     DOI: 10.1101/pdb.prot5409

Source DB:  PubMed          Journal:  Cold Spring Harb Protoc        ISSN: 1559-6095


  9 in total

1.  Comparative Metagenomics of Eight Geographically Remote Terrestrial Hot Springs.

Authors:  Peter Menzel; Sóley Ruth Gudbergsdóttir; Anne Gunn Rike; Lianbing Lin; Qi Zhang; Patrizia Contursi; Marco Moracci; Jakob K Kristjansson; Benjamin Bolduc; Sergey Gavrilov; Nikolai Ravin; Andrey Mardanov; Elizaveta Bonch-Osmolovskaya; Mark Young; Anders Krogh; Xu Peng
Journal:  Microb Ecol       Date:  2015-02-25       Impact factor: 4.552

2.  Changes in diversity, abundance, and structure of soil bacterial communities in Brazilian Savanna under different land use systems.

Authors:  Pabulo Henrique Rampelotto; Adão de Siqueira Ferreira; Anthony Diego Muller Barboza; Luiz Fernando Wurdig Roesch
Journal:  Microb Ecol       Date:  2013-04-27       Impact factor: 4.552

3.  Metagenomics - a guide from sampling to data analysis.

Authors:  Torsten Thomas; Jack Gilbert; Folker Meyer
Journal:  Microb Inform Exp       Date:  2012-02-09

4.  Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak.

Authors:  Saneyoshi Ueno; Grégoire Le Provost; Valérie Léger; Christophe Klopp; Céline Noirot; Jean-Marc Frigerio; Franck Salin; Jérôme Salse; Michael Abrouk; Florent Murat; Oliver Brendel; Jérémy Derory; Pierre Abadie; Patrick Léger; Cyril Cabane; Aurélien Barré; Antoine de Daruvar; Arnaud Couloux; Patrick Wincker; Marie-Pierre Reviron; Antoine Kremer; Christophe Plomion
Journal:  BMC Genomics       Date:  2010-11-23       Impact factor: 3.969

5.  Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence.

Authors:  Frank M You; Naxin Huo; Karin R Deal; Yong Q Gu; Ming-Cheng Luo; Patrick E McGuire; Jan Dvorak; Olin D Anderson
Journal:  BMC Genomics       Date:  2011-01-25       Impact factor: 3.969

6.  Human gut microbiome viewed across age and geography.

Authors:  Tanya Yatsunenko; Federico E Rey; Mark J Manary; Indi Trehan; Maria Gloria Dominguez-Bello; Monica Contreras; Magda Magris; Glida Hidalgo; Robert N Baldassano; Andrey P Anokhin; Andrew C Heath; Barbara Warner; Jens Reeder; Justin Kuczynski; J Gregory Caporaso; Catherine A Lozupone; Christian Lauber; Jose Carlos Clemente; Dan Knights; Rob Knight; Jeffrey I Gordon
Journal:  Nature       Date:  2012-05-09       Impact factor: 49.962

7.  Filtering duplicate reads from 454 pyrosequencing data.

Authors:  Susanne Balzer; Ketil Malde; Markus A Grohme; Inge Jonassen
Journal:  Bioinformatics       Date:  2013-02-01       Impact factor: 6.937

8.  A metagenomic framework for the study of airborne microbial communities.

Authors:  Shibu Yooseph; Cynthia Andrews-Pfannkoch; Aaron Tenney; Jeff McQuaid; Shannon Williamson; Mathangi Thiagarajan; Daniel Brami; Lisa Zeigler-Allen; Jeff Hoffman; Johannes B Goll; Douglas Fadrosh; John Glass; Mark D Adams; Robert Friedman; J Craig Venter
Journal:  PLoS One       Date:  2013-12-11       Impact factor: 3.240

Review 9.  Metagenomics: Retrospect and Prospects in High Throughput Age.

Authors:  Satish Kumar; Kishore Kumar Krishnani; Bharat Bhushan; Manoj Pandit Brahmane
Journal:  Biotechnol Res Int       Date:  2015-11-17
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.