| Literature DB >> 24349140 |
Shibu Yooseph1, Cynthia Andrews-Pfannkoch2, Aaron Tenney1, Jeff McQuaid3, Shannon Williamson3, Mathangi Thiagarajan4, Daniel Brami1, Lisa Zeigler-Allen3, Jeff Hoffman3, Johannes B Goll4, Douglas Fadrosh3, John Glass2, Mark D Adams3, Robert Friedman3, J Craig Venter3.
Abstract
Understanding the microbial content of the air has important scientific, health, and economic implications. While studies have primarily characterized the taxonomic content of air samples by sequencing the 16S or 18S ribosomal RNA gene, direct analysis of the genomic content of airborne microorganisms has not been possible due to the extremely low density of biological material in airborne environments. We developed sampling and amplification methods to enable adequate DNA recovery to allow metagenomic profiling of air samples collected from indoor and outdoor environments. Air samples were collected from a large urban building, a medical center, a house, and a pier. Analyses of metagenomic data generated from these samples reveal airborne communities with a high degree of diversity and different genera abundance profiles. The identities of many of the taxonomic groups and protein families also allows for the identification of the likely sources of the sampled airborne bacteria.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24349140 PMCID: PMC3859506 DOI: 10.1371/journal.pone.0081862
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
DNA yields after amplification.
| Sample | Estimated final DNA yield (ng) |
| NY_INDOOR (0.1 µm) | 100 |
| NY_INDOOR (3.0 µm) | 200 |
| NY_OUTDOOR (0.1 µm) | 500 |
| NY_OUTDOOR (3.0 µm) | 1000 |
| SD_IHOSP (0.1 µm) | 468 |
| SD_IHOSP (3.0 µm) | 259 |
| SD_OHOSP (0.1 µm) | 235 |
| SD_OHOSP (3.0 µm) | 327 |
| SD_IHOUS (0.1 µm) | 110 |
| SD_IHOUS (3.0 µm) | 308 |
| SD_SCRPP (0.1 µm) | 294 |
| SD_SCRPP (3.0 µm) | 370 |
The MDA products for the NYC samples were quantified by UV spectroscopy and the linker amplification products for the SD samples were quantified by fluorescence spectroscopy.
Taxonomic classification of metagenomic reads.
| NY_INDOOR | NY_OUTDOOR | SD_IHOSP | SD_OHOSP | SD_IHOUS | SD_SCRPP | |
| Archaea | 1,048 | 269 | 286 | 606 | 90 | 324 |
| Bacteria | 256,691 | 55,681 | 347,562 | 263,396 | 192,326 | 44,092 |
| Eukaryota | 663,225 | 281,601 | 68,998 | 167,883 | 40,334 | 375,198 |
| Mixed | 6,012 | 1,915 | 5,975 | 7,010 | 5,936 | 2,356 |
| Viruses | 958 | 495 | 1,926 | 514 | 4,186 | 2,026 |
| Other | 1,084 | 1,389 | 491 | 167 | 206 | 67 |
| Unclassified | 504,660 | 620,628 | 152,468 | 384,538 | 148,683 | 729,639 |
| Total | 1,433,678 | 961,978 | 577,706 | 824,114 | 391,761 | 1,153,702 |
Mixed refers to a read that had matches to multiple kingdoms and could not be definitively assigned to one kingdom. Other refers to sequences that could not be identified (NCBI taxonomy ID 32644), mostly synthetic constructs. Unclassified refers to a read that could not be assigned to any kingdom (i.e. had no match in the reference databases). Kingdom taxonomy:
Figure 1Taxonomic classification of metagenomic reads.
Figure 2Taxonomic classification and diversity of the NYC 16S data.
In the stacked barcharts, only those taxonomic groups that have ≥2% abundance in at least one of the samples is reported. The rarefaction curves along with the richness and diversity estimates were calculated using mothur and were based on averages of 25 random samples of 10,000 sequences from each dataset.
Figure 3Abundances of KEGG functional categories.
Figure 4Ordination of the six metagenomic samples by principal component analysis based on KO abundance profiles.
The first three principal components (PC1, PC2, and PC3) account for 83.1% of the total variation. The dashed lines in the 3D plot shows the height (PC3) of the sample points when projected on to the PC1-PC2 plane.