Literature DB >> 21716311

Impact of training sets on classification of high-throughput bacterial 16s rRNA gene surveys.

Jeffrey J Werner1, Omry Koren, Philip Hugenholtz, Todd Z DeSantis, William A Walters, J Gregory Caporaso, Largus T Angenent, Rob Knight, Ruth E Ley.   

Abstract

Taxonomic classification of the thousands-millions of 16S rRNA gene sequences generated in microbiome studies is often achieved using a naïve Bayesian classifier (for example, the Ribosomal Database Project II (RDP) classifier), due to favorable trade-offs among automation, speed and accuracy. The resulting classification depends on the reference sequences and taxonomic hierarchy used to train the model; although the influence of primer sets and classification algorithms have been explored in detail, the influence of training set has not been characterized. We compared classification results obtained using three different publicly available databases as training sets, applied to five different bacterial 16S rRNA gene pyrosequencing data sets generated (from human body, mouse gut, python gut, soil and anaerobic digester samples). We observed numerous advantages to using the largest, most diverse training set available, that we constructed from the Greengenes (GG) bacterial/archaeal 16S rRNA gene sequence database and the latest GG taxonomy. Phylogenetic clusters of previously unclassified experimental sequences were identified with notable improvements (for example, 50% reduction in reads unclassified at the phylum level in mouse gut, soil and anaerobic digester samples), especially for phylotypes belonging to specific phyla (Tenericutes, Chloroflexi, Synergistetes and Candidate phyla TM6, TM7). Trimming the reference sequences to the primer region resulted in systematic improvements in classification depth, and greatest gains at higher confidence thresholds. Phylotypes unclassified at the genus level represented a greater proportion of the total community variation than classified operational taxonomic units in mouse gut and anaerobic digester samples, underscoring the need for greater diversity in existing reference databases.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21716311      PMCID: PMC3217155          DOI: 10.1038/ismej.2011.82

Source DB:  PubMed          Journal:  ISME J        ISSN: 1751-7362            Impact factor:   10.302


  28 in total

1.  Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons.

Authors:  Brian J Haas; Dirk Gevers; Ashlee M Earl; Mike Feldgarden; Doyle V Ward; Georgia Giannoukos; Dawn Ciulla; Diana Tabbaa; Sarah K Highlander; Erica Sodergren; Barbara Methé; Todd Z DeSantis; Joseph F Petrosino; Rob Knight; Bruce W Birren
Journal:  Genome Res       Date:  2011-01-06       Impact factor: 9.043

2.  Genome sequencing in microfabricated high-density picolitre reactors.

Authors:  Marcel Margulies; Michael Egholm; William E Altman; Said Attiya; Joel S Bader; Lisa A Bemben; Jan Berka; Michael S Braverman; Yi-Ju Chen; Zhoutao Chen; Scott B Dewell; Lei Du; Joseph M Fierro; Xavier V Gomes; Brian C Godwin; Wen He; Scott Helgesen; Chun Heen Ho; Chun He Ho; Gerard P Irzyk; Szilveszter C Jando; Maria L I Alenquer; Thomas P Jarvie; Kshama B Jirage; Jong-Bum Kim; James R Knight; Janna R Lanza; John H Leamon; Steven M Lefkowitz; Ming Lei; Jing Li; Kenton L Lohman; Hong Lu; Vinod B Makhijani; Keith E McDade; Michael P McKenna; Eugene W Myers; Elizabeth Nickerson; John R Nobile; Ramona Plant; Bernard P Puc; Michael T Ronan; George T Roth; Gary J Sarkis; Jan Fredrik Simons; John W Simpson; Maithreyan Srinivasan; Karrie R Tartaro; Alexander Tomasz; Kari A Vogt; Greg A Volkmer; Shally H Wang; Yong Wang; Michael P Weiner; Pengguang Yu; Richard F Begley; Jonathan M Rothberg
Journal:  Nature       Date:  2005-07-31       Impact factor: 49.962

3.  Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy.

Authors:  Qiong Wang; George M Garrity; James M Tiedje; James R Cole
Journal:  Appl Environ Microbiol       Date:  2007-06-22       Impact factor: 4.792

4.  Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities.

Authors:  Patrick D Schloss; Sarah L Westcott; Thomas Ryabin; Justine R Hall; Martin Hartmann; Emily B Hollister; Ryan A Lesniewski; Brian B Oakley; Donovan H Parks; Courtney J Robinson; Jason W Sahl; Blaz Stres; Gerhard G Thallinger; David J Van Horn; Carolyn F Weber
Journal:  Appl Environ Microbiol       Date:  2009-10-02       Impact factor: 4.792

5.  Compilation of small ribosomal subunit RNA structures.

Authors:  J M Neefs; Y Van de Peer; P De Rijk; S Chapelle; R De Wachter
Journal:  Nucleic Acids Res       Date:  1993-07-01       Impact factor: 16.971

6.  Rapidly denoising pyrosequencing amplicon reads by exploiting rank-abundance distributions.

Authors:  Jens Reeder; Rob Knight
Journal:  Nat Methods       Date:  2010-09       Impact factor: 28.547

7.  Simrank: Rapid and sensitive general-purpose k-mer search tool.

Authors:  Todd Z DeSantis; Keith Keller; Ulas Karaoz; Alexander V Alekseyenko; Navjeet N S Singh; Eoin L Brodie; Zhiheng Pei; Gary L Andersen; Niels Larsen
Journal:  BMC Ecol       Date:  2011-04-27       Impact factor: 2.964

8.  The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

Authors:  Jonas Binladen; M Thomas P Gilbert; Jonathan P Bollback; Frank Panitz; Christian Bendixen; Rasmus Nielsen; Eske Willerslev
Journal:  PLoS One       Date:  2007-02-14       Impact factor: 3.240

9.  DNA bar coding and pyrosequencing to identify rare HIV drug resistance mutations.

Authors:  Christian Hoffmann; Nana Minkah; Jeremy Leipzig; Gary Wang; Max Q Arens; Pablo Tebas; Frederic D Bushman
Journal:  Nucleic Acids Res       Date:  2007-06-18       Impact factor: 16.971

10.  Exploring microbial diversity and taxonomy using SSU rRNA hypervariable tag sequencing.

Authors:  Susan M Huse; Les Dethlefsen; Julie A Huber; David Mark Welch; David Mark Welch; David A Relman; Mitchell L Sogin
Journal:  PLoS Genet       Date:  2008-11-21       Impact factor: 5.917

View more
  189 in total

1.  Comparison of Illumina paired-end and single-direction sequencing for microbial 16S rRNA gene amplicon surveys.

Authors:  Jeffrey J Werner; Dennis Zhou; J Gregory Caporaso; Rob Knight; Largus T Angenent
Journal:  ISME J       Date:  2011-12-15       Impact factor: 10.302

2.  Accurate, rapid taxonomic classification of fungal large-subunit rRNA genes.

Authors:  Kuan-Liang Liu; Andrea Porras-Alfaro; Cheryl R Kuske; Stephanie A Eichorst; Gary Xie
Journal:  Appl Environ Microbiol       Date:  2011-12-22       Impact factor: 4.792

3.  Pyrosequencing-derived bacterial, archaeal, and fungal diversity of spacecraft hardware destined for Mars.

Authors:  Myron T La Duc; Parag Vaishampayan; Henrik R Nilsson; Tamas Torok; Kasthuri Venkateswaran
Journal:  Appl Environ Microbiol       Date:  2012-06-22       Impact factor: 4.792

4.  Intergenerational effects of macroalgae on a reef coral: major declines in larval survival but subtle changes in microbiomes.

Authors:  Deanna S Beatty; Cody S Clements; Frank J Stewart; Mark E Hay
Journal:  Mar Ecol Prog Ser       Date:  2018-02-23       Impact factor: 2.824

5.  From genus to phylum: large-subunit and internal transcribed spacer rRNA operon regions show similar classification accuracies influenced by database composition.

Authors:  Andrea Porras-Alfaro; Kuan-Liang Liu; Cheryl R Kuske; Gary Xie
Journal:  Appl Environ Microbiol       Date:  2013-11-15       Impact factor: 4.792

6.  The circulating microbiome signature and inferred functional metagenomics in alcoholic hepatitis.

Authors:  Puneet Puri; Suthat Liangpunsakul; Jeffrey E Christensen; Vijay H Shah; Patrick S Kamath; Gregory J Gores; Susan Walker; Megan Comerford; Barry Katz; Andrew Borst; Qigui Yu; Divya P Kumar; Faridoddin Mirshahi; Svetlana Radaeva; Naga P Chalasani; David W Crabb; Arun J Sanyal
Journal:  Hepatology       Date:  2018-02-22       Impact factor: 17.425

7.  Contrasting taxonomic stratification of microbial communities in two hypersaline meromictic lakes.

Authors:  Adrian-Ştefan Andrei; Michael S Robeson; Andreea Baricz; Cristian Coman; Vasile Muntean; Artur Ionescu; Giuseppe Etiope; Mircea Alexe; Cosmin Ionel Sicora; Mircea Podar; Horia Leonard Banciu
Journal:  ISME J       Date:  2015-05-01       Impact factor: 10.302

8.  Comparative metagenomic and rRNA microbial diversity characterization using archaeal and bacterial synthetic communities.

Authors:  Migun Shakya; Christopher Quince; James H Campbell; Zamin K Yang; Christopher W Schadt; Mircea Podar
Journal:  Environ Microbiol       Date:  2013-02-06       Impact factor: 5.491

9.  Stable-Isotope Probing-Enabled Cultivation of the Indigenous Bacterium Ralstonia sp. Strain M1, Capable of Degrading Phenanthrene and Biphenyl in Industrial Wastewater.

Authors:  Jibing Li; Chunling Luo; Dayi Zhang; Xixi Cai; Longfei Jiang; Gan Zhang
Journal:  Appl Environ Microbiol       Date:  2019-07-01       Impact factor: 4.792

10.  Effect of cytomegalovirus and Epstein-Barr virus replication on intestinal mucosal gene expression and microbiome composition of HIV-infected and uninfected individuals.

Authors:  Sara Gianella; Antoine Chaillon; Ece A Mutlu; Phillip A Engen; Robin M Voigt; Ali Keshavarzian; John Losurdo; Prachi Chakradeo; Steven M Lada; Masato Nakazawa; Alan L Landay
Journal:  AIDS       Date:  2017-09-24       Impact factor: 4.177

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.