| Literature DB >> 27000765 |
Bo Yang1, Yong Wang2, Pei-Yuan Qian3.
Abstract
BACKGROUND: Prokaryotic 16S ribosomal RNA (rRNA) sequences are widely used in environmental microbiology and molecular evolution as reliable markers for the taxonomic classification and phylogenetic analysis of microbes. Restricted by current sequencing techniques, the massive sequencing of 16S rRNA gene amplicons encompassing the full length of genes is not yet feasible. Thus, the selection of the most efficient hypervariable regions for phylogenetic analysis and taxonomic classification is still debated. In the present study, several bioinformatics tools were integrated to build an in silico pipeline to evaluate the phylogenetic sensitivity of the hypervariable regions compared with the corresponding full-length sequences.Entities:
Keywords: 16S rRNA; 16S rRNA gene; Geodesic distance; Phylogenetic; Primer; Variable regions
Mesh:
Substances:
Year: 2016 PMID: 27000765 PMCID: PMC4802574 DOI: 10.1186/s12859-016-0992-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Workflow of the data processing. As described in Materials and methods, the sequences downloaded from the SILVA database were filtered, randomly selected and grouped. Phylogenetic trees were then built, and geodesic distances were calculated
Positions of the hypervariable sub-regions of the 16S rRNA sequences
| Region | Start position | End position | Start postion (E. Coli) | End position (E. Coli) |
|---|---|---|---|---|
| V1 | 8 | 789 | 8 | 96 |
| V2 | 790 | 2697 | 97 | 306 |
| V3 | 2698 | 4069 | 307 | 487 |
| V4 | 4070 | 7044 | 488 | 746 |
| V5 | 7045 | 9533 | 747 | 885 |
| V6 | 9534 | 10454 | 886 | 1029 |
| V7 | 10455 | 12258 | 1030 | 1180 |
| V8 | 12259 | 13597 | 1181 | 1372 |
| V9 | 13598 | 14371 | 1373 | 1468 |
| VT | 8 | 14371 | 8 | 1468 |
Fig. 2Geodesic distance between trees based on sub-regions and trees based on VT
Fig. 3Illustration of different variable regions. Red regions (V2, V8) have a poor phylogenetic resolution at the phylum level. Green regions (V4, V5, V6) are associated with the shortest geodesic distance, which suggests that they may be the best choice for phylogeny-related analyses and the phylogenetic analysis of novel bacterial phyla. The figure refers to the primer map from Lutzonilab (http://lutzonilab.org/16s-ribosomal-dna/). Use of this information was approved by the original authors of the website
Fig. 4AHC results for different regions based on the geodesic distances of the phylogenetic trees
Fig. 5The 2D-3D structures of the 16S rRNA gene. Individual regions are identified by the same color in both the 2D and 3D structures. Some important structures are colored with blocks