| Literature DB >> 24358875 |
Cristiane C Thompson1, Vanessa E Emmel1, Erica L Fonseca1, Michel A Marin1, Ana Carolina P Vicente1.
Abstract
The identification of the clinically relevant viridans streptococci group, at species level, is still problematic. The aim of this study was to extract taxonomic information from the complete genome sequences of 67 streptococci, comprising 19 species, by means of genomic analyses, multilocus sequence analysis (MLSA), average amino acid identity (AAI), genomic signatures, genome-to-genome distances (GGD) and codon usage bias. We then attempted to determine the usefulness of these genomic tools for species identification in streptococci. Our results showed that MLSA, AAI and GGD analyses are robust markers to identify streptococci at the species level, for instance, S. pneumoniae, S. mitis, and S. oralis. A Streptococcus species can be defined as a group of strains that share ≥ 95% DNA similarity in MLSA and AAI, and > 70% DNA identity in GGD. This approach allows an advanced understanding of bacterial diversity.Entities:
Year: 2013 PMID: 24358875 PMCID: PMC3799547 DOI: 10.12688/f1000research.2-67.v1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Genomic features of the streptococci.
G+C content (%): guanine + cytosine content (%). No. of CDs: number of coding DNA sequence. Nc: effective number of codons.
| Organism | GenBank
| Genome
| G+C content
| No. of
|
|
|---|---|---|---|---|---|
|
| CP000114 | 2,127,839 | 35 | 1996 | 44.9 |
|
| AL732656 | 2,211,485 | 35 | 2094 | 45.2 |
|
| AE009948 | 2,160,267 | 35 | 2124 | 45.1 |
|
| AECT00000000 | 1,993,709 | 38 | 2035 | 50.6 |
|
| AEEL00000000 | 2,050,893 | 37 | 2088 | 44.5 |
|
| AEKN00000000 | 2,239,421 | 43 | 2204 | 54.4 |
|
| AP010935 | 2,106,340 | 39 | 2094 | 50.3 |
|
| FM204883 | 2,253,793 | 41 | 2001 | 52.6 |
|
| FM204884 | 2,149,868 | 41 | 1869 | 52.4 |
|
| CP001129 | 2,024,171 | 41 | 1893 | 52.3 |
|
| AEEM00000000 | 2,214,091 | 37 | 2218 | 44.5 |
|
| FN597254 | 2,350,911 | 37 | 2223 | 44.4 |
|
| CP000725 | 2,196,662 | 40 | 2051 | 52.4 |
|
| AEDY00000000 | 1,792,252 | 39 | 2102 | 48.9 |
|
| ABJK00000000 | 1,925,087 | 37 | 2051 | 44.0 |
|
| FN568063 | 2,146,611 | 39 | 2004 | 50.4 |
|
| AEDT00000000 | 1,873,702 | 40 | 1757 | 49.8 |
|
| AP010655 | 2,013,587 | 36 | 1895 | 46.4 |
|
| AE014133 | 2,030,921 | 36 | 1960 | 46.5 |
|
| AEDW00000000 | 1,884,712 | 41 | 1793 | 51.4 |
|
| ADVN00000000 | 2,124,730 | 41 | 2035 | 52.8 |
|
| AEKM00000000 | 2,050,302 | 41 | 1978 | 52.9 |
|
| CP002121 | 2,130,580 | 39 | 2216 | 50.3 |
|
| FM211187 | 2,221,315 | 39 | 1990 | 50.0 |
|
| CP001033 | 2,209,198 | 39 | 2206 | 50.3 |
|
| CP000410 | 2,046,115 | 39 | 1914 | 49.8 |
|
| CP001015 | 2,078,953 | 39 | 2114 | 50.0 |
|
| CP000936 | 2,245,615 | 39 | 2155 | 50.2 |
|
| FQ312030 | 2,142,122 | 39 | 1824 | 49.9 |
|
| FQ312029 | 2,093,317 | 39 | 1930 | 50.0 |
|
| CP000919 | 2,120,234 | 39 | 2123 | 50.2 |
|
| FQ312027 | 2,036,867 | 39 | 1824 | 49.9 |
|
| CP000920 | 2,111,882 | 39 | 2073 | 50.1 |
|
| AE007317 | 2,038,615 | 39 | 2042 | 50.1 |
|
| CP000921 | 2,112,148 | 39 | 2044 | 50.1 |
|
| CP001993 | 2,088,772 | 39 | 2275 | 50.4 |
|
| AE005672 | 2,160,842 | 39 | 2105 | 50.0 |
|
| CP002176 | 2,240,045 | 39 | 2352 | 50.4 |
|
| CP000918 | 2,184,682 | 39 | 2202 | 50.1 |
|
| AENS00000000 | 2,111,372 | 36 | 2030 | 48.6 |
|
| AE014074 | 1,900,521 | 38 | 1865 | 49.1 |
|
| CP000261 | 1,860,355 | 38 | 1898 | 49.4 |
|
| CP000017 | 1,838,554 | 38 | 1865 | 48.9 |
|
| CP000056 | 1,897,573 | 38 | 1894 | 48.9 |
|
| AE009949 | 1,895,017 | 38 | 1839 | 49.0 |
|
| CP000259 | 1,836,467 | 38 | 1877 | 49.0 |
|
| CP000260 | 1,928,252 | 38 | 1986 | 49.0 |
|
| CP000003 | 1,899,877 | 38 | 1886 | 49.2 |
|
| CP000262 | 1,937,111 | 38 | 1979 | 49.1 |
|
| AE004092 | 1,852,441 | 38 | 1696 | 48.8 |
|
| CP000829 | 1,815,785 | 38 | 1700 | 48.8 |
|
| BA000034 | 1,894,275 | 38 | 1859 | 49.1 |
|
| AM295007 | 1,841,271 | 38 | 1745 | 48.9 |
|
| ACLO00000000 | 2,128,332 | 40 | 1992 | 47.0 |
|
| AEPO00000000 | 2,054,852 | 41 | 2013 | 51.7 |
|
| CP000387 | 2,388,435 | 43 | 2270 | 54.5 |
|
| AEVH00000000 | 2,311,949 | 43 | 2260 | 54.5 |
|
| FM252032 | 2,146,229 | 41 | 1932 | 52.0 |
|
| CP000837 | 2,038,034 | 41 | 1979 | 52.4 |
|
| AM946016 | 2,007,491 | 41 | 1824 | 51.9 |
|
| FM252031 | 2,095,898 | 41 | 1898 | 52.0 |
|
| CP000024 | 1,796,226 | 39 | 1915 | 47.0 |
|
| CP000419 | 1,856,368 | 39 | 1709 | 46.8 |
|
| CP000023 | 1,796,846 | 39 | 1888 | 46.9 |
|
| CP002340 | 1,831,949 | 39 | 1919 | 46.8 |
|
| AM946015 | 1,852,352 | 36 | 1762 | 46.4 |
|
| AEKO00000000 | 2,022,289 | 39 | 1979 | 47.1 |
Taxonomic resolution of genomic analyses of streptococci species.
MLSA: multilocus sequence analysis. AAI: amino acid identity. GGD: genome to genome distance. Nc: effective number of codons.
| 16S rRNA
| MLSA
| AAI
| GGD
| Codon usage
| |
|---|---|---|---|---|---|
|
| ≥99 | ≥95 | ≥95 | >70 | - |
|
| ≥99 | ≥98 | >97 | >70 | 49 |
|
| 99 | 100 | 98 | >70 | 45 |
|
| 99 | 98 | >96 | >70 | 52 |
|
| 100 | 100 | 100 | >70 | 52 |
|
| 99 | ≥97 | >97 | >70 | 50 |
|
| 99 | 100 | >97 | >70 | 47 |
|
| ≤99 | <95 | <95 | <70 | 44-54 |
|
| 99 | <94 | <92 | <70 | 47 |
|
| >99 | <94 | <93 | <70 | 50–51 |
Figure 1. Neighbor-joining tree based on 16S rRNA gene sequences and MLSA concatenated sequences of Streptococcus.
The numbers at the nodes indicate the values of bootstrap statistics after 2000 replications, and values below 50% are not shown. Bars, 0.005% and 0.02% estimated sequence divergence.