| Literature DB >> 24846174 |
Nur A Hasan1, Brian A Young2, Angela T Minard-Smith2, Kelly Saeed3, Huai Li3, Esley M Heizer2, Nancy J McMillan2, Richard Isom3, Abdul Shakur Abdullah3, Daniel M Bornman2, Seth A Faith2, Seon Young Choi1, Michael L Dickens2, Thomas A Cebula4, Rita R Colwell5.
Abstract
Human saliva is clinically informative of both oral and general health. Since next generation shotgun sequencing (NGS) is now widely used to identify and quantify bacteria, we investigated the bacterial flora of saliva microbiomes of two healthy volunteers and five datasets from the Human Microbiome Project, along with a control dataset containing short NGS reads from bacterial species representative of the bacterial flora of human saliva. GENIUS, a system designed to identify and quantify bacterial species using unassembled short NGS reads was used to identify the bacterial species comprising the microbiomes of the saliva samples and datasets. Results, achieved within minutes and at greater than 90% accuracy, showed more than 175 bacterial species comprised the bacterial flora of human saliva, including bacteria known to be commensal human flora but also Haemophilus influenzae, Neisseria meningitidis, Streptococcus pneumoniae, and Gamma proteobacteria. Basic Local Alignment Search Tool (BLASTn) analysis in parallel, reported ca. five times more species than those actually comprising the in silico sample. Both GENIUS and BLAST analyses of saliva samples identified major genera comprising the bacterial flora of saliva, but GENIUS provided a more precise description of species composition, identifying to strain in most cases and delivered results at least 10,000 times faster. Therefore, GENIUS offers a facile and accurate system for identification and quantification of bacterial species and/or strains in metagenomic samples.Entities:
Mesh:
Year: 2014 PMID: 24846174 PMCID: PMC4028220 DOI: 10.1371/journal.pone.0097699
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Multi-platform BLASTn analysis of two salivary samples against various databases.
| Sample | Sequencing Platform | ||
| Illumina HiSeq 2000 | Illumina GAIIx | Ion Torrent (16S, V4) | |
| Reference Database used with BLASTn | |||
|
| Greengenes 16s1 | Greengenes 16s | Greengenes 16s3 |
| NCBI 16s | NCBI 16s | NCBI 16s3 | |
| NCBI RefSeq (microbial subset)1 | NCBI RefSeq (microbial subset) | NCBI RefSeq (microbial subset)3 | |
|
| Greengenes 16s1 | Greengenes 16s2 | Greengenes 16s3 |
| NCBI 16s1 | NCBI 16s2 | NCBI 16s3 | |
| NCBI RefSeq (microbial subset)1 | NCBI RefSeq (microbial subset)2 | NCBI RefSeq (microbial subset)3 | |
1Illumina HiSeq2000 sequencing carried out at Nationwide Children’s Hospital, Columbus, OH.
2Illumina GAIIx sequencing carried out at CosmosID.
3Ion Torrent sequencing performed by SeqWright, Inc., Houston, TX.
Illumina GAIIx sequencing performed at The Ohio State University, Columbus, OH.
Species composition and simulation statistics of the synthetic metagenomic dataset.
| Simulations Statistics | BLAST (RefSeq_genomic) | |||
| Species | Genome Coverage | Number of Reads | Relative Abundance | Number of Reads |
|
| 0.08 | 4034394 | 72.81 | 3349 |
|
| 47.21 | 537919 | 9.71 | 554 |
|
| 52.69 | 430335 | 7.77 | 440 |
|
| 5.44 | 53791 | 0.97 | 58 |
|
| 12.08 | 107583 | 1.94 | 106 |
|
| 11.02 | 107583 | 1.94 | 88 |
|
| 11.1 | 107583 | 1.94 | 83 |
|
| 0.99 | 46261 | 0.83 | NA |
|
| 0.94 | 46261 | 0.83 | NA |
|
| 0.84 | 46261 | 0.83 | 46 |
|
| 1.21 | 23130 | 0.42 | 23 |
The right most column represents number of reads from the sample that BLAST was able to assign to species. NA: Not Assigned due to lack of RefSeq entries.
Figure 1Screenshot of GENIUS client software displaying tabular output of in silico metagenomic data.
Left panel indicates projects loaded to this graphical user interface. Three table views to the rightmost panel represent output of 5VCE (a)NmerCE (b) and merged output from both 5VCE and NmerCE (c) algorithms, respectively.
Figure 2Genus overlap for sample VFD10-018 (a) and VFD12-006 (b) estimated by 16S sequencing/NCBI 16S BLAST and GAIIx sequencing/5VCE-NmerCE.
Figure 3Relative abundance of species in VFD10-018 estimated by GAIIx sequencing and BLAST (microbial reference database), 5VCE, and NmerCE algorithms.
Figure 4Relative abundance of species in VFD12-006 estimated by GAIIx sequencing and BLAST (microbial reference database), 5VCE, and NmerCE algorithms.
Figure 5Principal component analysis of data for seven saliva samples analyzed by GENIUS.
Figure 6Double hierarchical dendrogram showing bacterial distribution at the species level for seven saliva samples.
The relative values for bacterial species are depicted by color intensity, with legend indicated at the top of the figure.