Literature DB >> 24936364

Up to Species-level Community Analysis of Human Gut Microbiota by 16S rRNA Amplicon Pyrosequencing.

Jiro Nakayama1, Jiahui Jiang1, Koichi Watanabe2, Kangting Chen3, Huang Ninxin3, Kazunori Matsuda2, Takashi Kurakawa2, Hirokazu Tsuji2, Kenji Sonomoto1, Yuan-Kun Lee3.   

Abstract

Pyrosequencing-based 16S rRNA profiling has become a common powerful tool to obtain the community structure of gastrointestinal tract microbiota, but it is still hard to process the massive amount of sequence data into microbial composition data, especially at the species level. Here we propose a new approach in combining the quantitative insights into microbial ecology (QIIME), Mothur and ribosomal database project (RDP) programs to efficiently process 454 pyrosequence data to bacterial composition data up to the species level. It was demonstrated to precisely convert batch sequence data of 16S rRNA V6-V8 amplicons obtained from adult Singaporean fecal samples to taxonomically annotated biota data.

Entities:  

Keywords:  16S rRNA gene; human gut microbiota; pyrosequence

Year:  2013        PMID: 24936364      PMCID: PMC4034321          DOI: 10.12938/bmfh.32.69

Source DB:  PubMed          Journal:  Biosci Microbiota Food Health        ISSN: 2186-3342


Pyrosequence-based 16S rRNA profiling has become a very powerful tool to obtain the structure of complex gut microbiota. However, in most pipelines established already, the 16S rRNA sequence data are processed into operational taxonomic units (OTUs) instead of species units [1,2,3]. This is reasonable because the partial sequence determined by a pyrosequencer is not enough to identify the species with high confidence, but OTU processing on partial 16S rRNA sequences often loses valuable microbiological information apart from systematic biology. For example, Bifidobacterium longum subsp. longum and Bifidobacterium longum subsp. infantis, which are classified into two distinct taxa as well as biotypes, are grouped in the same OTU when their partial 16S rRNA sequences are clustered with an identity of 97%. Therefore, we have proposed our unique approach using RDP seqmatch followed by the SeqmatchQ400 program in which pyrosequenced 16S rRNA sequences are directly subjected to a database search one by one to find the closest species [4]. An in silico demonstration of this approach with V6-V8 sequences in the database showed that the species of approximately more than 80% of the strains in the database are correctly identified. Furthermore, the above-mentioned two subspecies of B. longum could be identified correctly for more than 80% of the strains in the database. However, this approach is hard to apply for massive data analysis because it requires redundant sequence search. Thus, here we propose a new scheme, as shown in Fig. 1, in which a prepared non-redundant 16S rRNA sequence set is uploaded instead of whole sequences. In this study with this approach, 16S rRNA V6-V8 amplicon sequences obtained by pyrosequencing could be precisely and efficiently converted to bacterial composition data annotated with taxomic information from the phylum to species levels (partially up to the subspecies level) as described below.
Fig. 1.

Flowchart of computational processing of 454 pyrosequence data to individual microbiota data.

Flowchart of computational processing of 454 pyrosequence data to individual microbiota data. Fecal samples were collected from 28 healthy adult Singaporeans aged 20 to 25 years. Feces were collected in individual sterile Faeces Containers (Sarstedt, Nümbrecht, Germany) containing 2 mL RNAlater® (Ambion, Inc., Austin, TX, USA) and stored at room temperature, taken to the laboratory within 12 hr, and then stored at 4°C until DNA extraction. The Ethics Committee of the National University of Singapore provided ethical clearance for this microbiological research study in accordance with the Helsinki Declaration. Bacterial DNA was extracted from samples by the bead-beating method and purified as described previously [5]. The V6-V8 regions of bacterial 16S rRNA genes were amplified by PCR with a bacterial universal primer set, Q-968F-# (5′-CWSWSWWSHTWACGCGARGAACCTTACC-3′) and Q-1390R-# (5′-CWSWSWWSHTTGACGGGC GGTGWGTAC-3′) (# indicates a series of 128 barcode sequence tags underlined in the sequence). The PCR was performed in a 50 µL volume containing 10 ng to 100 ng extracted DNA as a template, 10 mM Tris-HCl (pH 8.3), 50 mM KCl, 1.5 mM MgCl2, 200 µM deoxynucleoside triphosphate (dNTP) mixture, 10 pmol of each primer and 1.25 U TaKaRa Ex Taq HS (Takara Bio, Otsu, Shiga, Japan). The PCR condition was as follows: 98°C for 2.5 min; 20 cycles at 98°C for 15 sec, 54°C for 30 sec, and 72°C for 20 sec; and finally 72°C for 5 min. The PCR products were purified using a QIAquick PCR Purification Kit (Qiagen, Valencia, CA, USA) according to the manufacturer’s protocol. The purified products were quantified using a NanoDrop ND-1000 microphotometer (NanoDrop Technologies, Wilmington, DE, USA). After that, equal amounts (100 ng) of the amplicons from different samples were pooled and purified prior to pyrosequencing using the ethanol precipitation method. The amplicon mixture DNAs were clonally amplified by emulsion PCR (emPCR) with GS FLX Titanium LV emPCR Kit (Lib-L) v2 according to manufacturer’s protocol (454 Life Sciences / Roche Diagnostics). Beads with amplified DNA were loaded onto a GS FLX Titanium PicoTiterPlate with dividers with separate reaction chambers to accommodate two mixture pools. Sequencing was carried out using an FLX Genome Sequencer (454 Life Sciences) with GS FLX Titanium Sequencing Kit XLR70 according to the manufacturer’s protocol (454 Life Sciences). The obtained 454 batch sequence data were sorted into each sample batch by using the QIIME split_library.py script (http://qiime.org/scripts/split_libraries.html) with the barcode sequences. The parameters used in this script were as follows: l (minimum sequence length) = 408, e (maximum number of errors in barcode) = 0, reverse primer mismatches = 3, a (maximum number of ambiguous bases) = 3, L (maximum sequence length) = 500. As a result, 131,768 reads were assigned to the 28 subjects, in which the average ± standard deviation of the number of reads per sample was 4361 ± 2143. To prepare a nonredundant sequence set, the 131,768 reads were dereplicated within 99% nucleotide sequence identity by using the pick_otus_through_otu_table.py script of QIIME (http://qiime.org/scripts/pick_otus_through_otu_table.html). As a result, a set of 15,578 non-redundant sequences was obtained. Then, these non-redundant sequences were subjected to a PCR chimera check using the Chimera.uchime program in Mothur 1.25.1 (http://www.mothur.org/wiki/Download_mothur), which is one of the fastest chimera check program at present [6,7,8]. The chimera check was performed using a known 16S rRNA sequence dataset, gg_97_otus_4feb2011.fasta (http://greengenes.lbl.gov/cgi-bin/nph-index.cgi) as a reference [database (DB) Uchime], following a check in de novo (de novo Uchime). As shown in Fig. 2, a dense cluster was observed in the region where the scores of both database (DB) Uchime and de novo Uchime were higher than approximately 0.5. Frequent sequences read more than 100 times were surrounded by square lines (171 sequencs). Some of these sequences showed scores of more than 0.3, which is a default cut off of this program. In principal, frequently read sequences must have less possibility to be chimeras because PCR chimeras are incidentally generated by misannealing between template and incomplete products during the reaction cycles. Therefore, nine frequent sequences that were read more than 100 times but showed scores of more than 0.3 in both de novo Uchime and DB uchime were subjected to a BLAST search to determine whether these sequences were really chimeras or not. As a result, eight sequences showed 100% identity to 16S rRNA sequences of uncultured organism clones in the NCBI database, and the rest showed 99.5% indentity, suggesting that these were not chimeras. Thus, the default cutoff score of 0.3 was considered to be too strict and was shifted to 0.6 for both de novo and DB Uchime to let most of the frequent sequences be classified as nonchimeras. SG.6_C1696, which had 696 read counts but was still categorized as a chimera was exceptionally removed from the chimera lists; approximately a thousand sequences in the database that came from uncultured fecal bacterial clones showed more than 98% identity to this sequence, suggesting that this sequences represented one of the yet-to-be-cultured members in the human gut. As a result of the chimera check, 6,612 sequences corresponding to 9,671 reads were recognized as PCR chimeras, and the other 8,966 sequences corresponding to 122,097 reads were selected for further analysis as nonchimera sequences.
Fig. 2.

Result of a chimera check of 15,578 nonredundant sequences. The scores of each sequence in de novo Uchime and DB Uchime were two-dimensionally plotted. Grey dots represent sequences with read counts of less than 10. Black dots represent sequences with the read counts of more than 9, and sequences with more than 99 counts are surrounded by a square. Vertical and horizontal dotted lines show the cut off threshold (score = 0.6) used for the chimera detection except in the case of SG.9_C1697.

Result of a chimera check of 15,578 nonredundant sequences. The scores of each sequence in de novo Uchime and DB Uchime were two-dimensionally plotted. Grey dots represent sequences with read counts of less than 10. Black dots represent sequences with the read counts of more than 9, and sequences with more than 99 counts are surrounded by a square. Vertical and horizontal dotted lines show the cut off threshold (score = 0.6) used for the chimera detection except in the case of SG.9_C1697. In order to get taxonomic information, the selected 8,966 sequences were subjected to two web-based searches, RDP Classifier [9] and RDP Seqmatch [10] (http://rdp.cme.msu.edu/). In the RDP Classifier search, the cut off value of the confidence threshold for taxonomic classification was set to be 80% as generally recommended. The number of sequences identified to any known taxa and their total read counts are summarized in Table 1. In the taxonomic level higher than family, almost all sequences were identified to known taxa, and more than 90% of the sequences corresponding to 96.9% of the total nonchimera reads were done at the family level. At the genus level, the identified rate remarkably decreased to 65.5%, while approximately 80% of the nonchimera sequence reads were identified to a known genus.
Table 1.

The number and percentage of sequences and reads identified to known taxa*

NonchimeraaPhylumbClassbOrderbFamilybGenusbSpeciesc
Identified taxa #-9172547107276
Sequence #      8,9668,782 (97.9)8,689 (96.9)8,677 (96.8)8,309 (92.7)5,869 (65.5)3,992 (44.5)
Read #      122,097120,305 (98.5)119,666 (98.0)119,651 (98.0)118,267 (96.9)95,822 (78.5)95,112 (77.9)

*The values in parentheses represent percentages of the number of nonchimera OTUs or reads. a Nonchimeras were selected by de novo Uchime (cut-off score = 0.6) and DB Uchime (cut-off score = 0.6) searches except in the cases of SG.6_C1697, which was recognized as a nonchimera by BLAST and RDP Seqmatch searches. b Identification to known taxonomic groups at these hierarcy levels was performed by RDP Classifier with a confidence threshold of 80%. c Identification to known species was performed by RDP Seqmatch with an S_ab score higher than 0.9.

*The values in parentheses represent percentages of the number of nonchimera OTUs or reads. a Nonchimeras were selected by de novo Uchime (cut-off score = 0.6) and DB Uchime (cut-off score = 0.6) searches except in the cases of SG.6_C1697, which was recognized as a nonchimera by BLAST and RDP Seqmatch searches. b Identification to known taxonomic groups at these hierarcy levels was performed by RDP Classifier with a confidence threshold of 80%. c Identification to known species was performed by RDP Seqmatch with an S_ab score higher than 0.9. RDP Seqmatch was employed to gain species level community data. However, less than 50% of the sequences were matched to known species with an S_ab score of more than 0.9, which is the theoretical threshold for species identification. When RDP Seqmatch was performed against the 16S rRNA database including the sequences of uncultured organisms in addition to those of the isolated bacteria, more than 80% of the sequences showed significant identity to 16S rRNA in the database (data not shown). This result suggests that the majority of the unidentified sequences came from yet-to-be cultured bacteria. Indeed, the RDP Seqmatch search was performed to find the 20 closest 16S rRNA sequences of the cultured strains, and if more than 2 different species showed the same highest score, the one with the highest count in the top 20 list was selected. This algorithm was processed by an excel macro file named SeqmatchQ400 [4]. As a result of the RDP Seqmatch search followed by processing with SeqmatchQ400, a total of 276 species were identified in our Singaporean subjects. The Chao1 value was calculated by using alpha_diversity.py script of QIIME (http://qiime.org/scripts/alpha_diversity.html), which is based on a rarefaction curve [11] of the species frequency. As a result, the total number of described species existing in our Singaporean subjects was estimated to be 315 species. The averaged number of species detected in each individual was 78, and the average Chao1 value of each individual was 117. These numbers are comparative to those previously found by OTU-based estimation using full-length 16S rRNA gene sequences [12]. Among the 276 species, 53 species were commonly detected in more than a half of our subjects (Table 2). This number is also comparative to the data obtained by a metagenomic sequencing, in which 75 species were commonly detected in more than a half of 124 European individuals at the 1% level of genome coverage [13]. As shown in Table 2, 26 species were shared between the common species of our study and the European metagenomic study.
Table 2.

Species commonly detected in more than a half of the 28 Singaporean subjects

Ranka               SpeciesAbundanceb(%)CarrierscMetagenome rankd
1Faecalibacterium prausnitzii9.27281
2Bacteroides vulgatus5.17234
3Collinsella aerofaciens4.902220
4Bifidobacterium adolescentis4.902531
5Clostridium clostridioforme4.0128NL
6Bifidobacterium longum subsp longum3.6627NL
7Bifidobacterium pseudocatenulatum2.722261
8Eubacterium hadrum2.3227NL
9Eubacterium rectale2.042311
10Bacteroides dorei1.952227
11Gemmiger formicilis1.7220NL
12Bifidobacterium stercoris1.7020NL
13Roseburia faecis1.6722NL
14Blautia wexlerae1.5727NL
15Bacteroides uniformis1.56286
16Ruminococcus bromii1.351433
17Dorea longicatena1.272113
18Ruminococcus torques1.092717
19Ruminococcus obeum0.912630
20Ruminococcus gnavus0.842164
21Escherichia coli0.761955
22Clostridium mayombei0.7326NL
23Bacteroides ovatus0.702523
24Megasphaera elsdenii0.6714NL
25Blautia luti0.6423NL
26Roseburia inulinivorans0.6223NL
27Parabacteroides distasonis0.612721
28Streptococcus salivarius0.5827NL
29Eubacterium eligens0.5620NL
30Lachnospira pectinoschiza0.5522NL
31Bacteroides fragilis0.441640
32Dorea formicigenerans0.37233
33Clostridium nexile0.361769
34Parabacteroides merdae0.311728
35Phascolarctobacterium faecium0.3022NL
36Alistipes putredinis0.291519
37Bacteroides caccae0.281732
38Bacteroides thetaiotaomicron0.252826
39Clostridium bartlettii0.211753
40Klebsiella pneumoniae0.1916NL
41Sutterella wadsworthensis0.1815NL
42Flavonifractor plautii0.1725NL
43Coprococcus catus0.1717NL
44Eubacterium ventriosum0.142035
45Streptococcus parasanguinis0.1423NL
46Eggerthella lenta0.1120NL
47Bilophila wadsworthia0.1024NL
48Clostridium disporicum0.0914NL
49Odoribacter splanchnicus0.0817NL
50Alistipes massiliensis0.0514NL
51Clostridium leptum0.051546
52Clostridium innocuum0.0517NL
53Actinomyces odontolyticus0.0317NL

a Ranked by the relative abundance in the third column. b Average of relative abundance among the 28 Singaporean subjects. c The number of subjects in whom the species was detected. d Rank in the metagenomic catalogue of the study of 124 European individuals (13). NL means not listed in the catalogue.

a Ranked by the relative abundance in the third column. b Average of relative abundance among the 28 Singaporean subjects. c The number of subjects in whom the species was detected. d Rank in the metagenomic catalogue of the study of 124 European individuals (13). NL means not listed in the catalogue. As a final step, individual biota data were profiled based on the catalogue of the 8,782 nonredundant sequences taxonomically annotated. Relative abundance of each taxonomic group was calculated by dividing the read count of identified sequences by the total read count in individuals. Figure 3 shows the population distribution of common genera, families and phyla. Four phyla, Firmicutes, Actinobacteria, Bacteroidetes and Proteobacteria, were detected in all subjects. Firmicutes mainly consists of two dominant families, Lachnospiraceae and Ruminococcaceae, and was the most dominant phylum, occupying more than 50% of the total population in most subjects. The next dominant phyla were Actinobacteria, which mainly consists of genus Bifidobacterium, and Bacteroidetes, which mainly consists of genus Bacteroides. As shown in Table 2, Faecalibacterium prausnitzii, which belongs to the Ruminococcaceae family, was the most dominant species and detected in all subjects as in the European metagenomic study [13]. Bifidobacterium adolescentis, B. longum subsp. longum and B. pseudocatenulatum were found to be dominant and ranked within the top 10 species in the genus Bifidobacterium. This is different from the European metagenomic study in which no Bifidobacterium species were ranked within the top 10, suggesting the difference in gut bacterial community between Asian and European individuals.
Fig. 3.

Population distribution of 37 (A), 19 (B) and 4 (C) common genera, families and phyla, respectively, among the 28 Singaporean subjects. The relative abundance of each taxonomic group was calculated by dividing the read counts of identified sequences by the individual`s total read number. The 37 genera, 19 families, and 4 phyla were selected as they were detected in more than a half of our Singaporean subjects.

Population distribution of 37 (A), 19 (B) and 4 (C) common genera, families and phyla, respectively, among the 28 Singaporean subjects. The relative abundance of each taxonomic group was calculated by dividing the read counts of identified sequences by the individual`s total read number. The 37 genera, 19 families, and 4 phyla were selected as they were detected in more than a half of our Singaporean subjects. To examine the quantitative accuracy, the 454 population data of two dominant genera, Bacteroides and Bifidobacterium, were compared with those measured by quantitative real-time PCR. As shown in Fig. 4, these two groups of data were consistent with each other even though Bacteroides was more sensitive and Bifidobacterium was less sensitive in the pyrosequencing-based analysis than in quantitative real-time PCR. This is probably due to the difference in the copy numbers of 16S rRNA genes on the chromosome; the average among 16 Bifidobacterium species in the rrNDB database (http://rrndb.mmg.msu.edu/search.php) is 3.56, while that among 11 Bacteroides species is 6.0.
Fig. 4.

Comparison of the relative abundances of Bacteroides (A) and Bifidobacterium (B) determined by 16S rRNA amplicon pyrosequencing with those determined by quantitative real-time PCR. The relative abundance in the pyrosequencing data was calculated by dividing the number of reads identified to genus Bacteroides or Bifidobacterium by the total read counts in each subject. In the quantitative PCR, group-specific primers targeting the Bacteroides fragilis group and genus Bifidobacterium were used, respectively [5].

Comparison of the relative abundances of Bacteroides (A) and Bifidobacterium (B) determined by 16S rRNA amplicon pyrosequencing with those determined by quantitative real-time PCR. The relative abundance in the pyrosequencing data was calculated by dividing the number of reads identified to genus Bacteroides or Bifidobacterium by the total read counts in each subject. In the quantitative PCR, group-specific primers targeting the Bacteroides fragilis group and genus Bifidobacterium were used, respectively [5]. In conclusion, the newly designed approach combining the QIIME, Mothur and RDP programs precisely and efficiently converted the 454 pyrosequence data to bacterial composition data up to the species level. Details of the whole process are summarized in Table 3. The total processing time was less than 12 hr, and the largest amount of time was spent on the DB Uchime and RDP Seqmatch searches. Ongoing rapid fulfillment of 16S rRNA sequence database should be going to compensate the blank of unidentified bacterial groups appeared in our process. The output data annotated with known bacterial taxa from phylum to species (subspecies in part) should supply missing information more relevant to microbiology in addition to systematic understanding of the microbial community structure realized by the well-established OTU-based profiling technology.
Table 3.

Summary of the computational processing of 454 pyrosequence data

StepProcessProgramProgram source (URL or e-mail)Calculator machine# of queryseqs.OutputTime
1Barcode sortingsplit_libraries.pyQIIME (http://qiime.org/scripts/split_libraries.html)Windows 64 bit PC*1,583,218**106 to 8618 reads per subject10 min
2OTU clusteringpick_otus_through_otu_table.pyQIIME (http://qiime.org/scripts/pick_otus_through_otu_table.html)Windows 64 bit PC131,76815,578 redundant seqs.17 min
3de novo chimera checkde novo UchimeMothur 1.25.1 (http://www.mothur.org/wiki/Download_mothur)Windows 64 bit PC15,5787,200 chimeric seqs.6 min
4DB chimera checkDB UchimeMothur 1.25.1 (http://www.mothur.org/wiki/Download_mothur)Windows 64 bit PC15,5787,852 chimeric seqs. (6,612 chimeric seqs.)***7 hrs
5Sequence search up to genus levelRDP ClassifierRDP II (http://rdp.cme.msu.edu/)RDP host computer8,9669 phyla–109 genera10 min
6Sequence search in species levelRDP SeqmatchRDP II (http://rdp.cme.msu.edu/)RDP host computer8,9663,992 seqs identified with know species3 hrs
7Data processingof RDP Seqmatch SeqmatchQ400Kyushu Univ(nakayama @ agr.kyushu-u.ac.jp)Windows 64 bit PC8,945276 species30 min

*Intel Core i7-3930 K CPU (3.20 GHz). **Batch sequence data included all sequences from 2 x half PicoTiterPlate regions, which contained 256 samples including non-Singaporean samples. ***The number of chimeric sequences determined by taking into account both de novo and DB chimera checks.

*Intel Core i7-3930 K CPU (3.20 GHz). **Batch sequence data included all sequences from 2 x half PicoTiterPlate regions, which contained 256 samples including non-Singaporean samples. ***The number of chimeric sequences determined by taking into account both de novo and DB chimera checks.
  11 in total

1.  DECIPHER, a search-based approach to chimera identification for 16S rRNA sequences.

Authors:  Erik S Wright; L Safak Yilmaz; Daniel R Noguera
Journal:  Appl Environ Microbiol       Date:  2011-11-18       Impact factor: 4.792

2.  Diversity of the human intestinal microbial flora.

Authors:  Paul B Eckburg; Elisabeth M Bik; Charles N Bernstein; Elizabeth Purdom; Les Dethlefsen; Michael Sargent; Steven R Gill; Karen E Nelson; David A Relman
Journal:  Science       Date:  2005-04-14       Impact factor: 47.728

3.  Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy.

Authors:  Qiong Wang; George M Garrity; James M Tiedje; James R Cole
Journal:  Appl Environ Microbiol       Date:  2007-06-22       Impact factor: 4.792

4.  A human gut microbial gene catalogue established by metagenomic sequencing.

Authors:  Junjie Qin; Ruiqiang Li; Jeroen Raes; Manimozhiyan Arumugam; Kristoffer Solvsten Burgdorf; Chaysavanh Manichanh; Trine Nielsen; Nicolas Pons; Florence Levenez; Takuji Yamada; Daniel R Mende; Junhua Li; Junming Xu; Shaochuan Li; Dongfang Li; Jianjun Cao; Bo Wang; Huiqing Liang; Huisong Zheng; Yinlong Xie; Julien Tap; Patricia Lepage; Marcelo Bertalan; Jean-Michel Batto; Torben Hansen; Denis Le Paslier; Allan Linneberg; H Bjørn Nielsen; Eric Pelletier; Pierre Renault; Thomas Sicheritz-Ponten; Keith Turner; Hongmei Zhu; Chang Yu; Shengting Li; Min Jian; Yan Zhou; Yingrui Li; Xiuqing Zhang; Songgang Li; Nan Qin; Huanming Yang; Jian Wang; Søren Brunak; Joel Doré; Francisco Guarner; Karsten Kristiansen; Oluf Pedersen; Julian Parkhill; Jean Weissenbach; Peer Bork; S Dusko Ehrlich; Jun Wang
Journal:  Nature       Date:  2010-03-04       Impact factor: 49.962

5.  Use of 16S rRNA gene-targeted group-specific primers for real-time PCR analysis of predominant bacteria in human feces.

Authors:  Takahiro Matsuki; Koichi Watanabe; Junji Fujimoto; Toshihiko Takada; Ryuichiro Tanaka
Journal:  Appl Environ Microbiol       Date:  2004-12       Impact factor: 4.792

6.  QIIME allows analysis of high-throughput community sequencing data.

Authors:  J Gregory Caporaso; Justin Kuczynski; Jesse Stombaugh; Kyle Bittinger; Frederic D Bushman; Elizabeth K Costello; Noah Fierer; Antonio Gonzalez Peña; Julia K Goodrich; Jeffrey I Gordon; Gavin A Huttley; Scott T Kelley; Dan Knights; Jeremy E Koenig; Ruth E Ley; Catherine A Lozupone; Daniel McDonald; Brian D Muegge; Meg Pirrung; Jens Reeder; Joel R Sevinsky; Peter J Turnbaugh; William A Walters; Jeremy Widmann; Tanya Yatsunenko; Jesse Zaneveld; Rob Knight
Journal:  Nat Methods       Date:  2010-04-11       Impact factor: 28.547

7.  UCHIME improves sensitivity and speed of chimera detection.

Authors:  Robert C Edgar; Brian J Haas; Jose C Clemente; Christopher Quince; Rob Knight
Journal:  Bioinformatics       Date:  2011-06-23       Impact factor: 6.937

8.  Reducing the effects of PCR amplification and sequencing artifacts on 16S rRNA-based studies.

Authors:  Patrick D Schloss; Dirk Gevers; Sarah L Westcott
Journal:  PLoS One       Date:  2011-12-14       Impact factor: 3.240

9.  Human gut microbiome viewed across age and geography.

Authors:  Tanya Yatsunenko; Federico E Rey; Mark J Manary; Indi Trehan; Maria Gloria Dominguez-Bello; Monica Contreras; Magda Magris; Glida Hidalgo; Robert N Baldassano; Andrey P Anokhin; Andrew C Heath; Barbara Warner; Jens Reeder; Justin Kuczynski; J Gregory Caporaso; Catherine A Lozupone; Christian Lauber; Jose Carlos Clemente; Dan Knights; Rob Knight; Jeffrey I Gordon
Journal:  Nature       Date:  2012-05-09       Impact factor: 49.962

10.  The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis.

Authors:  J R Cole; B Chai; R J Farris; Q Wang; S A Kulam; D M McGarrell; G M Garrity; J M Tiedje
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  5 in total

1.  Age-related changes in the gut microbiota and the core gut microbiome of healthy Thai humans.

Authors:  Orawan La-Ongkham; Massalin Nakphaichit; Jiro Nakayama; Suttipun Keawsompong; Sunee Nitisinprasert
Journal:  3 Biotech       Date:  2020-05-30       Impact factor: 2.406

2.  The use of genus-specific amplicon pyrosequencing to assess phytophthora species diversity using eDNA from soil and water in Northern Spain.

Authors:  Santiago Català; Ana Pérez-Sierra; Paloma Abad-Campos
Journal:  PLoS One       Date:  2015-03-16       Impact factor: 3.240

3.  Diversity in gut bacterial community of school-age children in Asia.

Authors:  Jiro Nakayama; Koichi Watanabe; Jiahui Jiang; Kazunori Matsuda; Shiou-Huei Chao; Pri Haryono; Orawan La-Ongkham; Martinus-Agus Sarwoko; I Nengah Sujaya; Liang Zhao; Kang-Ting Chen; Yen-Po Chen; Hsueh-Hui Chiu; Tomoko Hidaka; Ning-Xin Huang; Chikako Kiyohara; Takashi Kurakawa; Naoshige Sakamoto; Kenji Sonomoto; Kousuke Tashiro; Hirokazu Tsuji; Ming-Ju Chen; Vichai Leelavatcharamas; Chii-Cherng Liao; Sunee Nitisinprasert; Endang S Rahayu; Fa-Zheng Ren; Ying-Chieh Tsai; Yuan-Kun Lee
Journal:  Sci Rep       Date:  2015-02-23       Impact factor: 4.379

4.  Effects of Japanese pepper and red pepper on the microbial community during nukadoko fermentation.

Authors:  Hiroshi Ono; Shoko Nishio; Jun Tsurii; Tetsuhiro Kawamoto; Kenji Sonomoto; Jiro Nakayama
Journal:  Biosci Microbiota Food Health       Date:  2014-09-27

5.  SPINGO: a rapid species-classifier for microbial amplicon sequences.

Authors:  Guy Allard; Feargal J Ryan; Ian B Jeffery; Marcus J Claesson
Journal:  BMC Bioinformatics       Date:  2015-10-08       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.