| Literature DB >> 21483493 |
Toïdi Adékambi1, Ray W Butler, Finnian Hanrahan, Arthur L Delcher, Michel Drancourt, Thomas M Shinnick.
Abstract
Comparative genomic sequencing is shedding new light on bacterial identification, taxonomy and phylogeny. An in silico assessment of a core gene set necessary for cellular functioning was made to determine a consensus set of genes that would be useful for the identification, taxonomy and phylogeny of the species belonging to the subclass Actinobacteridae which contained two orders Actinomycetales and Bifidobacteriales. The subclass Actinobacteridae comprised about 85% of the actinobacteria families. The following recommended criteria were used to establish a comprehensive gene set; the gene should (i) be long enough to contain phylogenetically useful information, (ii) not be subject to horizontal gene transfer, (iii) be a single copy (iv) have at least two regions sufficiently conserved that allow the design of amplification and sequencing primers and (v) predict whole-genome relationships. We applied these constraints to 50 different Actinobacteridae genomes and made 1,224 pairwise comparisons of the genome conserved regions and gene fragments obtained by using Sequence VARiability Analysis Program (SVARAP), which allow designing the primers. Following a comparative statistical modeling phase, 3 gene fragments were selected, ychF, rpoB, and secY with R2>0.85. Selected sets of broad range primers were tested from the 3 gene fragments and were demonstrated to be useful for amplification and sequencing of 25 species belonging to 9 genera of Actinobacteridae. The intraspecies similarities were 96.3-100% for ychF, 97.8-100% for rpoB and 96.9-100% for secY among 73 strains belonging to 15 species of the subclass Actinobacteridae compare to 99.4-100% for 16S rRNA. The phylogenetic topology obtained from the combined datasets ychF+rpoB+secY was globally similar to that inferred from the 16S rRNA but with higher confidence. It was concluded that multi-locus sequence analysis using core gene set might represent the first consensus and valid approach for investigating the bacterial identification, phylogeny and taxonomy.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21483493 PMCID: PMC3069002 DOI: 10.1371/journal.pone.0014792
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Strategy to establish a comprehensive gene set useful for MLSA following the literature recommended criteria.
Figure 2Mean variability for successive windows of 50 nucleotide positions using SVARAP.
(A) ychF gene sequence,(B) rpoB gene sequence (C), secY gene sequence (D), infB gene sequence and (E) rpoC gene sequence The arrows correspond to the forward and reverse amplification and sequencing primers.
Testing amplification with broad range primers on some representative genera of the subclass Actinobacteridae. (+) PCR positif.
| 16S rRNA |
|
|
| |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | + |
|
| + | + | + | − |
Broad range primer sequences for amplification and sequencing of the species of the subclass Actinobacteridae.
| Genes name | Sequence of the primers | Excepted sequence size ( | Hybridization Temperature |
|
|
| 754 bp | 60°C |
|
|
| 787 bp | 52°C |
|
|
| 703 bp | 52°C |
R = A or G; Y = C or T; K = G or T; S = G or C; W = A or T; B = C, G, or T; D = A, G, or T; H = A, C, or T; V = A, C, or G; N = A, C, G, or T.
Core gene set intraspecies similarities among species of the subclass Actinobacteridae.
| Species | Number of isolates | 16S rRNA similarity (%) |
|
|
|
|
|
|
|
|
|
|
|
| 3 | 100 | 99.1–99.9 | 99.7 | 99.9–100 |
|
| 2 | 100 | 100 | 100 | 100 |
|
| 3 | 100 | 100 | 100 | 100 |
|
| 20 | 100 | 100 | 100 | 100 |
|
| 3 | 100 | 99.0–100 | 100 | 99.6–100 |
|
| 2 | 100 | 96.9 | 99.3 | 96.3 |
|
| 2 | 100 | 100 | 99.7 | 100 |
|
| 2 | 99.4 | 99.9 | 98.8 | 99.2 |
|
| 2 | 100 | 100 | 99.9 | 100 |
|
| 2 | 99.9–100 | 100 | 99.9 | 100 |
|
| 2 | 98.7–99.9 | 97.6 | 99.2 | 99.2 |
|
| 4 | 99.8–100 | 100 | 100 | 100 |
|
| 4 | 99.9–100 | 98.3–100 | 97.8–99.9 | 98.1–100 |
|
| 2 | 100 | 99.8 | 99.7 | 100 |
(a) The intraspecies similarities were performed experimentally on 20 clinical isolates of Mycobacterium abscessus.
Figure 3Scatter plot of the relationship between genome conserved similarity and gene sequence fragment similarity.
(A) ychF fragment, (B) rpoB fragment, (C) secY fragment, (D), infB fragment and (E) ychF+rpoB+secY fragment.
Figure 4Phylogenetic tree of actinobacterial species using the neighbour-joining method with Kimura's two parameter distance correction.
(A) ychF+rpoB+secY fragment, (B) 16S rRNA gene. The support of each branch, as determined from 1000 bootstrap samples, is indicated by the value at each node (as a percentage). Bar represent difference in nucleotide sequences.