| Literature DB >> 19860885 |
Cristiane C Thompson1, Ana Carolina P Vicente, Rangel C Souza, Ana Tereza R Vasconcelos, Tammi Vesth, Nelson Alves, David W Ussery, Tetsuya Iida, Fabiano L Thompson.
Abstract
BACKGROUND: Vibrio taxonomy has been based on a polyphasic approach. In this study, we retrieve useful taxonomic information (i.e. data that can be used to distinguish different taxonomic levels, such as species and genera) from 32 genome sequences of different vibrio species. We use a variety of tools to explore the taxonomic relationship between the sequenced genomes, including Multilocus Sequence Analysis (MLSA), supertrees, Average Amino Acid Identity (AAI), genomic signatures, and Genome BLAST atlases. Our aim is to analyse the usefulness of these tools for species identification in vibrios.Entities:
Mesh:
Year: 2009 PMID: 19860885 PMCID: PMC2777879 DOI: 10.1186/1471-2148-9-258
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Genomic features of the vibrios genomes.
| Aliivibrio salmonicida FLI1238 | 3325164 | 39 | 77 | 48 | ||
| Chromosome I | 1206461 | 38 | 2820 | 77 | ||
| Chromosome II | 984 | |||||
| Photobacterium profundum SS9 | ||||||
| Chromosome I | 4085304 | 41 | 3416 | 82 | 51 | |
| Chromosome II | 2237943 | 41 | 2006 | 80 | ||
| 5234286 | 45 | 4341 | 81 | 53 | ||
| Vibrio alginolyticus 12G01 | 5160431 | 44 | 4732 | 86 | 53 | |
| Vibrio angustum S14+ | 5101447 | 39 | 4558 | 84 | 48 | |
| Vibrio campbellii AND4 | 4255798 | 44 | 3935 | 85 | 53 | |
| Vibrio cholerae N16961 | 2961149 | 47 | 87 | 52 | ||
| Chromosome I | 1072315 | 46 | 2742 | 84 | ||
| Chromosome II | 1093 | |||||
| Vibrio cholerae 0395 | 3024069 | 47 | 88 | 52 | ||
| Chromosome I | 1108250 | 46 | 2742 | 86 | ||
| Chromosome II | 1133 | |||||
| Vibrio cholerae 1587 | 4137501 | 47 | 3758 | 82 | 52 | |
| Vibrio cholerae 2740-80 | 3945478 | 47 | 3771 | 87 | 52 | |
| Vibrio cholerae 623-39 | 3975259 | 47 | 3777 | 86 | 52 | |
| Vibrio cholerae B33 | 4026835 | 47 | 3677 | 83 | 53 | |
| Vibrio cholerae MAK757 | 3917446 | 47 | 3501 | 82 | 52 | |
| Vibrio cholerae MZO-2 | 3862985 | 47 | 3425 | 83 | 52 | |
| Vibrio cholerae MZO-3 | 4146039 | 47 | 3897 | 86 | 52 | |
| Vibrio cholerae NCTC8457 | 4063388 | 47 | 3975 | 86 | 53 | |
| Vibrio cholerae V52 | 3974495 | 47 | 3815 | 86 | 52 | |
| Vibrio fischeri ES114+ | 2897536 | 38 | 45 | |||
| Chromosome I | 1330333 | 37 | 2586 | 86 | ||
| Chromosome II | 1175 | 87 | ||||
| Vibrio fischeri MJ11+ | 45 | |||||
| Chromosome I | 2905029 | 38 | 2590 | 86 | ||
| Chromosome II | 1418848 | 37 | 1254 | 87 | ||
| 5989646 | 46 | 4954 | 66 | 51 | ||
| Vibrio harveyi ATCC BAA-1116 | 3765351 | 45 | 85 | 53 | ||
| Chromosome I | 2204018 | 45 | 3546 | 86 | ||
| Chromosome II | 2374 | |||||
| Vibrio harveyi HY01 | 5400985 | 45 | 4327 | 75 | 51 | |
| 4373300 | 46 | 3744 | 86 | 53 | ||
| 4421792 | 46 | 3790 | 86 | 53 | ||
| Vibrio parahaemolyticus RIMD2210633 | ||||||
| Chromosome I | 3288558 | 45 | 3080 | 86 | 52 | |
| Chromosome II | 1877212 | 45 | 1752 | 86 | ||
| Vibrio parahaemolyticus AQ3810 | 5771228 | 45 | 5509 | 80 | 53 | |
| Vibrio shilonii AK1 | 5701826 | 43 | 5360 | 88 | 54 | |
| Vibrio sp Ex25 | 4844262 | 44 | 4240 | 84 | 53 | |
| Vibrio sp MED222 | 4891901 | 43 | 4590 | 85 | 52 | |
| Vibrio splendidus 12B01 | 5596386 | 44 | 5231 | 85 | 53 | |
| Vibrio vulnificus CMCP6 | ||||||
| Chromosome I | 3281944 | 46 | 2915 | 83 | 53 | |
| Chromosome II | 1844853 | 47 | 1557 | 86 | ||
| Vibrio vulnificus YJ016 | 3354505 | 46 | 87 | 53 | ||
| Chromosome I | 1857073 | 47 | 3259 | 89 | ||
| Chromosome II | 1696 |
The genomes sequenced by this study are in bold. +Vibrio angustum and Vibrio fischeri were reclassified as Photobacterium angustum [67] and Aliivibrio fischeri [35], respectively. *calculated using concatenated chromosome sequences.
Figure 1Genome BLAST atlas. The chromosome I of vibrios. The Atlas was constructed using the genome of V. cholerae N16961 as the reference strain on which the genes of the other strains are mapped. Genomic regions unique to this strain and not appearing in other vibrio strains are lightly colored. The position of the genes in the different replicons may not be the same.
Figure 2Genome BLAST atlas. The chromosome II of vibrios. The Atlas was constructed using the genome of V. cholerae N16961 as the reference strain on which the genes of the other strains are mapped. Genomic regions unique to this strain and not appearing in other vibrio strains are lightly colored. Notice the hyper-variable region (midpoint at 375 K) in the chromosome II of vibrios (the super-integron), corresponding to 1/6 of this replicon. The position of the genes in the different replicons may not be the same.
Figure 3Taxonomic resolution of AAI, BLAST proteome and genome dissimilarity [. Mean and standard deviations. Red = AAI; Green = proteome; yellow = [δ*(f,g)]. The taxonomic resolution of AAI is down to the intergenera level, whereas [δ*(f,g)] has a resolution at interspecies level. The dashed lines delimit (p < 0.001) the different taxonomic levels for AAI and [δ*(f,g)] but not for the proteome. The proteome did not completely fit this figure (and dashed lines limits), showing some noise signal for V. harveyi-V.harveyi.
Figure 4A-C. Phylogenetic trees based on the maximum likelihood method using 16S rRNA gene, MLSA (i.e. ftsZ, gyrB, mreB, pyrH, recA, rpoA and topA; 10,141 bp), and supertree (i.e. aminopeptidase P, alaS, aspS, ftsZ, gltX, gyrB, hisS, ileS, infB, metG, mreB, pntA, pheT, pyrH, recA, rpoA, rpoB, rpsH, signal recognition particle protein, threonyl-tRNA synthetase, topA, valS and 30S ribosomal protein S11; 41,617 bp). Bootstrap percentages after 2000 replications are shown. Because some genomes used in this study are not completely sequenced, for the comparison of 16S rRNA, MLSA and supertree, we used 16 genomes of vibrios. The genes used in MLSA and supertree were found only in these 16 genomes.
Figure 5Codon usage rose plots for four representatives of different vibrio genera. The vibrios with a higher AT content (bottom rose plots) have a codon usage bias of A or U at the third position of the codon. The frequency scale is represented at the right side of the rose plots.
Pearson correlation coefficient (expressed as percentage) between different methods
| 1. 16S rRNA gene identity | 100 | |||||
| 2. Identity in MLSA | 86.5 | 100 | ||||
| 3. Identity in supertree analysis | 91.1 | 98.4 | 100 | |||
| 4. Average aminoacid identity (AAI) | 85.9 | 97.7 | 96.9 | 100 | ||
| 5. Karlin genome signature dissimilarity | 71.5 | 85.3 | 82.3 | 84.9 | 100 | |
| 6. BLAST proteome identity | 77.1 | 89.0 | 86.4 | 92.5 | 85.5 | 100 |
Figure 6Pangenome plot of vibrios. Pangenome consists of panel A and panel B. The vibrio pangenome is around 26504 genes. V. cholerae has a pangenome of 6923 genes with clear increment of genes of its sister species V. mimicus (8306 genes).