| Literature DB >> 22300522 |
Vladimir Lazarevic1, Katrine Whiteson, Nadia Gaïa, Yann Gizard, David Hernandez, Laurent Farinelli, Magne Osterås, Patrice François, Jacques Schrenzel.
Abstract
BACKGROUND: The salivary microbiota is a potential diagnostic indicator of several diseases. Culture-independent techniques are required to study the salivary microbial community since many of its members have not been cultivated.Entities:
Year: 2012 PMID: 22300522 PMCID: PMC3296672 DOI: 10.1186/2043-9113-2-4
Source DB: PubMed Journal: J Clin Bioinforma ISSN: 2043-9113
Description of the 4 sequence datasets and 14 subsets
| Dataset | |||||||
|---|---|---|---|---|---|---|---|
| 16S-HTS, 20 cycles | 16S-HTS, 25 cycles | 16S-HTS, 30 cycles | WGS | ||||
| 16,345,598 | 10,732,108 | 6,150,466 | 14'724,838 | ||||
| 1,200,000 | 1,200,000 | 1,200,000 | 1,000,000 | ||||
| V1 | V3 | V1 | V3 | V1 | V3 | ||
| 592,322 | 566,310 | 576,701 | 564,774 | 546,710 | 569,459 | 1,000,000 | |
| 532,945 | 530,504 | 534,229 | 523,548 | 548,009 | 495,531 | 1,000,000 | |
Figure 1Heat map showing changes in taxa proportions as a function of PCR cycle number. The taxa shared by all (twelve) 16S-HTS subsets were analyzed. The relative abundances of taxa after 20 PCR cycles were used as baselines for comparisons. These values are represented according to the grey scale below the heat map. Changes (%) in the relative abundance of taxa after 25 or 30 cycles are represented by rectangles according to the color scale below the heat map. The corresponding values are given in the Additional files 1 and 3. Differences were significant (P < 0.01; chi-square test) unless marked by an asterisk.
Number of BLATSN hits against human, bacterial and viral databases
| BLASTN hita counts | R/F ratio | ||
|---|---|---|---|
| F | R | ||
| Human | 801677 | 642018 | 80.0 |
| Bacteria | 7898 | 6807 | 86.7 |
| Viruses | 38 | 34 | 89.5 |
| Fungi | 0 | 0 | - |
F, forward run; R, reverse run.
The following BLASTN parameters were used: wordsize = 16, e ≤ 10-30, sequence identity ≥ 94%, alignment length ≥ 90 nt.
combined entries from ftp://ftp.ebi.ac.uk/pub/databases/fastafiles/emblrelease/em_rel_std_hum.gz as of 4 March 2011, and ftp://ftp.ncbi.nlm.nih.gov/blast/db/FASTA/human_genomic.gz as of 12 March 2011.
ftp://ftp.ncbi.nih.gov/genomes/Bacteria/ as of 16 November 2010.
ftp://ftp.ncbi.nih.gov/genomes/Viruses/ as of 25 November 2010.
ftp://ftp.ncbi.nih.gov/genomes/Fungi/ as of 3 March 2011.
Figure 2Comparison of WGS, 16S-WGS and 16S-HTS datasets to study bacterial community of the saliva sample. Group-average clustering of data was performed using a Bray-Curtis similarity matrix in PRIMER-E (Plymouth), based on square-root-transformed genera abundance. Only genera which occurred at a frequency of > 0.1% in at least one of 14 subsets were included in the analysis. V1 and V3 designate the sequenced hypervariable region of 16S rDNA. 20, 25 and 30 indicate the number of PCR cycles performed. F, forward reads; R, reverse reads.
Similarity between the V3 16S and other datasets
| Datasets to which the F and R 30-cycle V3 subsets were compared | Average similaritya ± SD (%) | Pearson's correlation coefficient (F, R)b |
|---|---|---|
| Itself (F vs R) | 94.8 | 0.998 |
| 25-cycle V3 | 97.1 ± 0.3 | 0.998, 0.999 |
| 20-cycle V3 | 93.2 ± 1.5 | 0.974, 0.977 |
| 30-cycle V1 | 82.7 ± 0.8 | 0.987, 0.990 |
| 16S-WGS | 68.5 ± 3.7 | 0.933, 0.958 |
| WGS | 64.1 ± 0.5 | 0.650, 0.652 |
| V3 from other individualsc | 64.8 ± 3.5 | 0.453-0.871, 0.458-0.867 |
F, forward run; R, reverse run.
a The genera frequency were square-root transformed and used to construct the Bray-Curtis similarity matrix. When applicable, the similarities were calculated separately for the pairs of forward and the pairs of reversed subsets prior to averaging.
b Based on untransformed data.
c Samples 1.1, 2.1, 3.1, 4.1 and 5.1 from a previous study [21] in which the initial PCR cycles were carried out in the same conditions as those in the present work. Then, 28 cycles were performed using bar-coded primers and the amplicons were sequenced by the Roche/454 platform. The average similarity observed among these five microbiomes was 65.2 ± 7.2%.
Detection of viral sequences in the WGS dataset using BLASTN
| Number of hits a | ||
|---|---|---|
| F | R | |
| Human herpesvirus 7 | 17 | 15 |
| Porcine endogenous retrovirus E | 1 | - |
| 1 | 1 | |
| Enterobacteria phage lambda | 12 | 11 |
| Enterobacteria phage phiX174 | 6 | 7 |
| 1 | - | |
F, forward run; R, reverse run.
The following BLASTN parameters were used: wordsize = 16, e ≤ 10-30, sequence identity ≥ 94%, alignment length ≥ 90 nt.