| Literature DB >> 22699610 |
.
Abstract
A variety of microbial communities and their genes (the microbiome) exist throughout the human body, with fundamental roles in human health and disease. The National Institutes of Health (NIH)-funded Human Microbiome Project Consortium has established a population-scale framework to develop metagenomic protocols, resulting in a broad range of quality-controlled resources and data including standardized methods for creating, processing and interpreting distinct types of high-throughput metagenomic data available to the scientific community. Here we present resources from a population of 242 healthy adults sampled at 15 or 18 body sites up to three times, which have generated 5,177 microbial taxonomic profiles from 16S ribosomal RNA genes and over 3.5 terabases of metagenomic sequence so far. In parallel, approximately 800 reference strains isolated from the human body have been sequenced. Collectively, these data represent the largest resource describing the abundance and variety of the human microbiome, while providing a framework for current and future studies.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22699610 PMCID: PMC3377744 DOI: 10.1038/nature11209
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962
HMP donor samples examined by 16S and WGS
| Body Region | Body Site | Total Samples | Total 16S Samples | V13 Samples | V13 Read Depth (M) | V35 Samples | V35 Read depth (M) | Samples V13&V35 | Total WGS Samples | Total Read Depth (G) | % Filtered Reads | % Human Reads | Remaining Read Depth (G) | Samples 16S&WGS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gut | Stool | 352 | 337 | 193 | 1.4 | 328 | 2.38 | 184 | 139 | 1720.7 | 15 | 1 | 1450.6 | 124 |
| Buccal mucosa | 346 | 330 | 184 | 1.3 | 314 | 1.74 | 168 | 107 | 1438 | 9 | 82 | 136.68 | 91 | |
| Hard palate | 325 | 325 | 179 | 1.2 | 310 | 1.67 | 164 | 1 | 10.87 | 20 | 25 | 5.92 | 1 | |
| Keratinized gingiva | 335 | 329 | 183 | 1.3 | 319 | 1.74 | 173 | 6 | 72.3 | 5 | 47 | 34.42 | 0 | |
| Palatine Tonsils | 337 | 332 | 189 | 1.2 | 315 | 1.87 | 172 | 6 | 74.75 | 2 | 80 | 13.45 | 1 | |
| Oral Cavity | Saliva | 315 | 310 | 166 | 0.9 | 292 | 1.45 | 148 | 5 | 55.69 | 1 | 91 | 4.24 | 0 |
| Subgingival plaque | 334 | 328 | 186 | 1.2 | 314 | 1.84 | 172 | 7 | 92.06 | 5 | 79 | 15.29 | 1 | |
| Supragingival plaque | 345 | 331 | 192 | 1.3 | 316 | 1.88 | 177 | 115 | 1500.7 | 15 | 40 | 674.81 | 101 | |
| Throat | 331 | 325 | 176 | 1 | 312 | 1.67 | 163 | 7 | 78.78 | 4 | 79 | 13.57 | 1 | |
| Tongue dorsum | 348 | 332 | 193 | 1.3 | 320 | 2.04 | 181 | 122 | 1620.1 | 15 | 19 | 1084.3 | 106 | |
| Airway | Anterior nares | 316 | 302 | 169 | 1 | 283 | 1.17 | 150 | 84 | 1129.9 | 3 | 96 | 14.31 | 70 |
| Left Antecubital fossa | 269 | 269 | 158 | 0.7 | 221 | 0.47 | 110 | 0 | na | na | na | 0 | na | |
| Left Retroauricular crease | 313 | 312 | 188 | 1.6 | 295 | 1.46 | 171 | 9 | 126.34 | 9 | 73 | 22.07 | 8 | |
| Skin | Right Antecubital fossa | 274 | 274 | 158 | 0.7 | 229 | 0.52 | 113 | 0 | na | na | na | 0 | na |
| Right Retroauricular crease | 319 | 316 | 190 | 1.4 | 304 | 1.56 | 178 | 15 | 181.94 | 18 | 59 | 42.38 | 12 | |
| Mid vagina | 145 | 143 | 91 | 0.6 | 140 | 0.96 | 88 | 2 | 22.58 | 0 | 99 | 0.18 | 0 | |
| Vagina | Posterior fornix | 152 | 142 | 89 | 0.6 | 136 | 0.98 | 83 | 53 | 702.13 | 6 | 90 | 25.24 | 43 |
| Vaginal introitus | 142 | 140 | 87 | 0.6 | 131 | 0.85 | 78 | 3 | 36.48 | 1 | 98 | 0.58 | 1 | |
|
| ||||||||||||||
| total | 5298 | 5177 | 2971 | 19 | 4879 | 26.3 | 2673 | 681 | 8863.3 | 11 | 49 | 3538.1 | 560 | |
1×106 reads post-processing with the mothur pipeline (Supplementary Information)
1×109 reads post-processing with the mothur pipeline (Supplementary Information)
Fraction of reads with low quality bases that were removed (Supplementary Information)
Fraction of human reads that were removed (Supplementary Information)
Figure 1Rates of gene and OTU discovery from HMP taxonomic and metagenomic data
Accumulation curves for a, OTU counts from 16S data (all body sites) b, clustered gene index counts from metagenomic data (all applicable body sites) and c, the ratio of average unique genes contributed versus unique OTUs encountered with increasing sample counts (Supplementary Information). Ratios given for each curve in c represent the average number of unique genes contributed per unique OTU at the final sample count. Curves for stool, buccal mucosa and anterior nares suggest that the proportion of gene-to-taxa discovery has stabilized. In contrast, the curve for supragingival plaque suggests relatively fewer new genes are being contributed per additional OTU. Error bars represent 95% confidence intervals.