| Literature DB >> 24278351 |
Mohammad J Hossain1, Geoffrey C Waldbieser, Dawei Sun, Nancy K Capps, William B Hemstreet, Kristen Carlisle, Matt J Griffin, Lester Khoo, Andrew E Goodwin, Tad S Sonstegard, Steven Schroeder, Karl Hayden, Joseph C Newton, Jeffery S Terhune, Mark R Liles.
Abstract
To investigate the molecular basis of the emergence of Aeromonas hydrophila responsible for an epidemic outbreak of motile aeromonad septicemia of catfish in the Southeastern United States, we sequenced 11 A. hydrophila isolates that includes five reference and six recent epidemic isolates. Comparative genomics revealed that recent epidemic A. hydrophila isolates are highly clonal, whereas reference isolates are greatly diverse. We identified 55 epidemic-associated genetic regions with 313 predicted genes that are present in epidemic isolates but absent from reference isolates and 35% of these regions are located within genomic islands, suggesting their acquisition through lateral gene transfer. The epidemic-associated regions encode predicted prophage elements, pathogenicity islands, metabolic islands, fitness islands and genes of unknown functions, and 34 of the genes encoded in these regions were predicted as virulence factors. We found two pilus biogenesis gene clusters encoded within predicted pathogenicity islands. A functional metabolic island that encodes a complete pathway for myo-inositol catabolism was evident by the ability of epidemic A. hydrophila isolates to use myo-inositol as a sole carbon source. Testing of A. hydrophila field isolates found a consistent correlation between myo-inositol utilization as a sole carbon source and the presence of an epidemic-specific genetic marker. All epidemic isolates and one reference isolate shared a novel O-antigen cluster. Altogether we identified four different O-antigen biosynthesis gene clusters within the 11 sequenced A. hydrophila genomes. Our study reveals new insights into the evolutionary changes that have resulted in the emergence of recent epidemic A. hydrophila strains.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24278351 PMCID: PMC3835674 DOI: 10.1371/journal.pone.0080943
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of draft genome sequences from 11 different A. hydrophila isolates obtained from diseased fish.
| Strains | Source of isolates | Year of isolation | Type | Contigs >200bp | N50 contig size (bp) | %G+C | Total bp in assembly | GenBank Accession no. |
|---|---|---|---|---|---|---|---|---|
| AL97-91 | Diseased Tilapia | 1997 | RAh | 111 | 159,889 | 61.2 | 4826834 | SRX157795 |
| TN97-08 | Diseased Blue Gill | 1997 | RAh | 94 | 144,878 | 60.8 | 5197004 | SRX157873 |
| MN98-04 | Diseased Tilapia | 1998 | RAh | 98 | 140,863 | 61.1 | 4863171 | SRX157796 |
| AL06-01 | Diseased catfish | 2006 | RAh | 122 | 120,531 | 61.3 | 4750603 | SRX157912 |
| AL06-06 | Diseased Goldfish | 2006 | RAh | 133 | 104,809 | 61.4 | 4844135 | SRX157794 |
| AL09-79 | Diseased Catfish | 2009 | VAh | 91 | 111,260 | 60.9 | 4975016 | SRX157791 |
| ML09-119 | Diseased Catfish | 2009 | VAh | 100 | 167,870 | 60.9 | 5003533 | SRX157759 |
| ML09-121 | Diseased Catfish | 2009 | VAh | 93 | 182,452 | 60.9 | 4998164 | SRX157784 |
| ML09-122 | Diseased Catfish | 2009 | VAh | 156 | 89,294 | 60.9 | 4979378 | SRX157790 |
| AL10-121 | Diseased Catfish | 2010 | VAh | 98 | 167,914 | 60.9 | 5010737 | SRX157792 |
| PB10-118 | Diseased catfish | 2010 | VAh | 100 | 143,368 | 60.9 | 5060794 | SRX157793 |
Note: VAh and RAh stand for virulent Aeromonas hydrophila and reference Aeromonas hydrophila, respectively.
Figure 1Evolutionary relationships of 37 A. hydrophila taxa based on gyrB gene sequences (out of a larger dataset of 107 A. hydrophila gyrB sequences).
The evolutionary history was inferred using the Maximum Parsimony method. Tree #1 out of 67 most parsimonious trees (length = 218) is shown. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) are shown next to the branches. The tree is drawn to scale, with branch lengths calculated using the average pathway method and are in the units of the number of changes over the whole sequence.
Figure 2Pan and core-genome plot of 11 different A. hydrophila isolates.
The red and blue lines indicated the number of genes within the core and pan-genomes, respectively. The A. hydrophila core genome contained 3,511 core genes whereas the A. hydrophila pan-genome contained 6,856 genes. Note that the addition of other VAh strains after A. hydrophila ML09-119 did not significantly increase the number of new gene families, which was in agreement with the highly clonal nature of VAh strains.
Figure 3BLAST MATRIX of 12 different A. hydrophila isolates.
The proteomes of each of the A. hydrophila strains were compared using all-against-all BLASTp according to the methods described previously [29]. This matrix showed the output from pairwise comparison of conserved protein families of each of the isolates to each other. The green color represents the % homology between proteomes and the red color represents % homology within proteomes. This matrix showed that all the epidemic A. hydrophila isolates are similar to each other but substantially different to reference isolates. All of the reference isolates, except for MN98-04 and AL97-91 that were highly similar to each other, possessed substantial amount of diversity in their protein families.
Summary of ORFs encoded by the O-antigen biosynthesis gene cluster of VAh strain ML09-119.
| ORF ID | Nucleotide positions in Accession no. KC999973 | Predicted Function | Gene | Nearest Neighbor | % Identity | Accession number |
|---|---|---|---|---|---|---|
| ORFu | 1.672 | lipoprotein YmcC |
|
| 43 | ZP_06715179.1 |
| ORF1 | 1114.1842 | O-antigen length determinant protein |
|
| 59 | YP_003912353.1 |
| ORF2 | 2160..4784 | periplasmic protein involved in polysaccharide export |
|
| 45 | YP_001339663.1 |
| ORF3 | 4930.6012 | O-antigen chain length determinant protein |
|
| 80 | YP_004393425.1 |
| ORF4 | 6067.6606 | dTDP-4-dehydrorhamnose 3,5-epimerase |
|
| 58 | ZP_08635572.1 |
| ORF5 | 6620.7561 | glucose-1-phosphate thymidylyltransferase |
|
| 61 | ZP_03699710.1 |
| ORF6 | 7561.8445 | rmlD gene product |
|
| 54 | YP_003809930.1 |
| ORF7 | 8432.9493 | undecaprenyl-phosphate N-acetylglucosaminyl 1-phosphate transferase |
|
| 57 | ZP_06154788.1 |
| ORF8 | 9544.10776 | phosphomannomutase |
|
| 69 | ZP_08734182.1 |
| ORF9 | 10965.11717 | Colanic acid biosynthesis glycosyl transferase |
|
| 67 | ZP_03065870.1 |
| ORF10 | 11717.13018 | mannose-1-phosphate guanylyltransferase |
|
| 71 | ZP_01218698.1 |
| ORF11 | 13128.13610 | GDP-mannose mannosyl hydrolase |
|
| 63 | YP_002893236.1 |
| ORF12 | 13594.14574 | GDP-fucose synthetase |
|
| 84 | NP_668408.1 |
| ORF13 | 14578.15684 | GDP-mannose 4,6-dehydratase |
|
| 86 | ZP_01235027.1 |
| ORF14 | 15705.16940 | group 1 glycosyl transferase |
|
| 49 | ZP_03830724.1 |
| ORF15 | 16937.17986 | group 1 glycosyl transferase |
|
| 52 | YP_003016893 |
| ORF16 | 17986.19167 | glycosyl transferase group 1 |
|
| 52 | ZP_08780763.1 |
| ORF17 | 19164.19655 | acetyltransferase |
|
| 59 | ZP_08780764.1 |
| ORF18 | 19648.20826 | O-antigen polymerase |
|
| 29 | ZP_05287114.1 |
| ORF19 | 20877.21968 | group 1 glycosyl transferase protein |
|
| 42 | ZP_08475479.1 |
| ORF20 | 22444.23694 | O-antigen flippase |
|
| 81 | EHC06312.1 |
| ORF21 | 23691.24794 | aminotransferase |
|
| 81 | YP_001555451.1 |
| ORF22 | 24796.25233 | dTDP-D-Fucp3N acetyltransferase |
|
| 85 | YP_001555452.1 |
| ORF23 | 25235.25657 | dTDP-6-deoxy-3,4-keto-hexulose isomerase |
|
| 72 | ADV52549.1 |
| ORF24 | 25668.26534 | glucose-1-phosphate thymidylyltransferase |
|
| 79 | NP_718742 |
| ORF25 | 26531.27619 | dTDP-glucose-4-6-dehydratase |
|
| 93 | AAM22544.1 |
| ORFd | 28234.31383 | AcrB protein |
|
| 99 | YP_857414.1 |
Summary of ORFs encoded by the O-antigen biosynthesis gene cluster of RAh strain TN97-08.
| ORF ID | Nucleotide positions in Accession no. KC999968 | Predicted Function | Gene | Nearest Neighbor | % Identity | Accession number |
|---|---|---|---|---|---|---|
| ORFu | 1.672 | lipoprotein YmcC |
|
| 43 | ZP_06715179.1 |
| ORF1 | 1114.1842 | O-antigen length determinant protein |
|
| 59 | YP_003912353 |
| ORF2 | 2160..4784 | periplasmic protein involved in polysaccharide export |
|
| 45 | YP_001339663 |
| ORF3 | 4930.6012 | O-antigen chain length determinant protein |
|
| 80 | YP_004393425 |
| ORF4 | 6067.6606 | dTDP-4-dehydrorhamnose 3,5-epimerase |
|
| 58 | ZP_08635572 |
| ORF5 | 6620.7561 | glucose-1-phosphate thymidylyltransferase |
|
| 61 | YP_005093462 |
| ORF6 | 7561.8445 | rmlD gene product |
|
| 54 | YP_003809930 |
| ORF7 | 8432.9493 | undecaprenyl-phosphate N-acetylglucosaminyl 1-phosphate transferase |
|
| 57 | ZP_06154788 |
| ORF8 | 9544.10776 | phosphomannomutase |
|
| 69 | YP_005021705 |
| ORF9 | 10965.11717 | Colanic acid biosynthesis glycosyl transferase |
|
| 67 | ZP_03065870 |
| ORF10 | 11717.13018 | mannose-1-phosphate guanylyltransferase |
|
| 71 | ZP_01218698 |
| ORF11 | 13128.13610 | GDP-mannose mannosyl hydrolase |
|
| 63 | YP_002893236 |
| ORF12 | 13594.14574 | GDP-fucose synthetase |
|
| 84 | NP_668408 |
| ORF13 | 14578.15684 | GDP-mannose 4,6-dehydratase |
|
| 86 | ZP_01235027 |
| ORF14 | 15705.16940 | group 1 glycosyl transferase |
|
| 49 | ZP_03830724 |
| ORF15 | 16937.17986 | group 1 glycosyl transferase |
|
| 52 | YP_003016893 |
| ORF16 | 17986.19167 | glycosyl transferase group 1 |
|
| 52 | ZP_08780763 |
| ORF17 | 19164.19655 | acetyltransferase |
|
| 59 | ZP_08780764 |
| ORF18 | 19648.20826 | O-antigen polymerase |
|
| 29 | ZP_05287114 |
| ORF19 | 20877.21968 | group 1 glycosyl transferase protein |
|
| 42 | ZP_08475479 |
| ORF20 | 22444.23694 | O-antigen flippase |
|
| 81 | EHC06312 |
| ORF21 | 23691.24794 | aminotransferase |
|
| 81 | YP_001555451 |
| ORF22 | 24796.25233 | dTDP-D-Fucp3N acetyltransferase |
|
| 85 | YP_001555452 |
| ORF23 | 25235.25657 | dTDP-6-deoxy-3,4-keto-hexulose isomerase |
|
| 72 | ADV52549 |
| ORF24 | 25668.26534 | glucose-1-phosphate thymidylyltransferase |
|
| 79 | ADV52548 |
| ORF25 | 26531.27619 | dTDP-glucose-4-6-dehydratase |
|
| 93 | AAM22544 |
| ORFd | 28234.31383 | AcrB protein |
|
| 99 | YP_857414.1 |
Figure 4Mauve Alignment of the O-antigen cluster from 16 different A. hydrophila isolates.
Segments with a similar color indicate homologous regions. The alignment is on scale based on the size of the O-antigen cluster of A. hydrophila ATCC 7966 which is 44 kb in length. All of the EAh strains including RAh strain TN97-08 shared the ML09-119-type O-antigen biosynthesis gene cluster.
Figure 5Schematic organization of the four different types of O-antigen biosynthesis gene cluster present within the genome of the 11 A. hydrophila isolates sequenced in this study.
All of the genes on the cluster are transcribed in the same direction. All VAh strains along with RAh strain TN97-08 shared the ML09-119-type O-antigen biosynthesis gene cluster. This cluster encodes proteins predicted to be involved in the biosynthesis of the nucleotide sugars D-rhamnose, D-mannose, D-Fucose, and 3-acetamido-3, 6-dideoxy-d-galactose (D-Fucp3NAc). The AL97-91-type cluster (that was also shared with MN98-04) encodes genes predicted to be required for S-layer biosynthesis and transport in addition to O-antigen biosynthesis. Genes that encode conserved proteins with similar functions are marked with the same color. The number displayed next to the maps indicates the nucleotide positions on the respective contig from each strain. The designation of each of the genes presented on the schematic map of the AL06-01, ML09-119, AL06-06 and AL97-91 O-antigen clusters are found in Tables 2, 4, 5 and 6, respectively.
Summary of ORFs encoded by the O-antigen biosynthesis gene cluster of RAh strain AL06-01.
| ORF ID | Nucleotide positions in Accession no. KC999970 | Predicted function | Gene | Nearest Neighbor | %Identity | Accession Number |
|---|---|---|---|---|---|---|
| ORFu | 1..429 | low molecular weight protein-tyrosine-phosphatase |
|
| 99 | YP_857382 |
| ORF1 | 665..1780 | capsular polysaccharide transport protein |
|
| 99 | YP_857383 |
| ORF2 | 2116..3282 | UDP-glucose 6-dehydrogenase |
|
| 99 | YP_857384 |
| ORF3 | 3295..4308 | nucleotide sugar epimerase |
|
| 99 | YP_857385 |
| ORF4 | 4305..5459 | O-antigen polymerase |
|
| 32 | ZP_06180045 |
| ORF5 | 5549..6343 | glycosyl transferase family protein |
|
| 86 | YP_857387 |
| ORF6 | 6345..7166 | glycosyl transferase family protein |
|
| 98 | YP_857388 |
| ORF7 | 7170..8264 | glycoside hydrolase family protein |
|
| 99 | YP_857389 |
| ORF8 | 8327..9541 | polysaccharide biosynthesis protein |
|
| 100 | YP_857390 |
| ORF9 | 9543..10658 | NAD(P) transhydrogenase subunit alpha |
|
| 100 | YP_857391 |
| ORF10 | 10651..11673 | RimK-like protein |
|
| 100 | YP_857392 |
| ORF11 | 11732..12811 | UDP-phosphate alpha-N-acetylglucosaminyl 1-phosphatetransferase |
|
| 100 | YP_857393 |
| ORF12 | 13296..15281 | protein WbgZ |
|
| 98 | YP_857395 |
| ORF13 | 15278..16156 | glycoside hydrolase family protein |
|
| 92 | YP_857396 |
| ORF14 | 16303..17259 | UDP-glucose 4-epimerase |
|
| 91 | YP_857397 |
| ORF15 | 17259..18068 | glycosyl transferase family protein |
|
| 65 | YP_001141293 |
| ORF16 | 18059..19144 | glycosyltransferase |
|
| 53 | YP_004594810 |
| ORF17 | 19141..20250 | UDP-N-acetylglucosamine 2-epimerase |
|
| 80 | YP_001141289 |
| ORF18 | 20262..21338 | group 1 glycosyl transferase |
|
| 43 | YP_004594808 |
| ORF19 | 21325..24027 | family 2 glycosyl transferase |
|
| 47 | YP_003444096 |
| ORF20 | 24040..25344 | ABC transporter ATP-binding protein |
|
| 64 | YP_004594806 |
| ORF21 | 25334..26143 | ABC-2 type transporter |
|
| 60 | ZP_10171957 |
| ORF22 | 26145..26684 | dTDP-4-dehydrorhamnose 3,5-epimerase |
|
| 78 | ZP_08521430 |
| ORF23 | 26747..27625 | glucose-1-phosphate thymidylyltransferase 1 |
|
| 92 | YP_001141271 |
| ORF24 | 27737..28624 | dTDP-4-dehydrorhamnose reductase |
|
| 95 | YP_857411 |
| ORF25 | 28624..29709 | putative dTDP-glucose-4-6-dehydratase |
|
| 96 | AAM22544 |
| ORFd | 30397..31806 | outer membrane protein OprM |
|
| 97 | YP_857413 |
Summary of ORFs encoded by the O-antigen biosynthesis gene cluster of RAh strain AL06-06.
| ORF ID | Nucleotide positions in Accession no. KC999971 | Predicted function | Gene | Nearest Neighbor | %Identity | Accession Number |
|---|---|---|---|---|---|---|
| ORFu | 1..1719 | lipid A core - O-antigen ligase |
|
| 97 | YP_857377 |
| ORF1 | 1814..2875 | O-antigen chain length determinant protein |
|
| 69 | ZP_08521419 |
| ORF2 | 2948..3487 | dTDP-4-dehydrorhamnose 3,5-epimerase |
|
| 57 | ZP_08635572 |
| ORF3 | 3592..4479 | glucose-1-phosphate thymidylyltransferase 1 |
|
| 92 | YP_004392190 |
| ORF4 | 4591..5478 | dTDP-4-dehydrorhamnose reductase |
|
| 97 | YP_857411 |
| ORF5 | 5478..6566 | dTDP-glucose-4, 6-dehydratase |
|
| 98 | AAM22544 |
| ORF6 | 6761..8722 | epimerase/dehydratase family WbfY-like protein |
|
| 96 | ZP_08521420 |
| ORF7 | 8781..9335 | lipid carrier : UDP-N-acetylgalactosaminyltransferase |
|
| 96 | YP_004393428 |
| ORF8 | 9338..10078 | UDP-glucose 4-epimerase |
|
| 77 | ZP_05883342 |
| ORF9 | 10301..11422 | glycosyl transferase, group 1 family protein |
|
| 59 | NP_718730 |
| ORF10 | 11419..13353 | asparagine synthetase, glutamine-hydrolyzing |
|
| 74 | NP_718731 |
| ORF11 | 13380..14491 | glycosyl transferase, group 1 |
|
| 46 | ZP_08744363 |
| ORF12 | 14505..15575 | group 1 glycosyl transferase |
|
| 35 | ZP_06689986 |
| ORF13 | 15572..16861 | virulence factor MVIN family protein |
|
| 39 | ZP_02383242 |
| ORF14 | 16858..18057 | Cna B domain-containing protein |
|
| 33 | ZP_10479786 |
| ORF15 | 18286..19314 | UDP-GlcNAc 4-epimerase |
|
| 81 | NP_718745 |
| ORF16 | 19417..20697 | UDP-glucose dehydrogenase |
|
| 88 | YP_004190001 |
| ORF17 | 21479..21766 | dTDP-D-glucose-4,6-dehydratase |
|
| 90 | AAM74474 |
| ORFd | 22510..23925 | outer membrane protein OprM |
|
| 99 | AAM22559 |
Summary of ORFs encoded by the O-antigen biosynthesis gene cluster of RAh strain AL97-91.
| ORF ID | Nucleotide positions in Accession no. KC999966 | Predicted function | Gene | Nearest Neighbor | % Identity | Accession Number |
|---|---|---|---|---|---|---|
| ORFu | 1.1710 | lipid A core - O-antigen ligase |
|
| 97 | YP_857377 |
| ORF1 | 1715..2740 | UDP-glucose lipid carrier transferase |
|
| 73 | ABX39510 |
| ORF2 | 3008.4144 | glycosyltransferase |
|
| 75 | YP_001141302 |
| ORF3 | 4141.5235 | glycosyltransferase, group 2 family protein |
|
| 70 | YP_001141301 |
| ORF4 | 5237.6016 | glycosyltransferase, group 2 family protein |
|
| 48 | ACH97156 |
| ORF5 | 6013.7269 | integral membrane protein AefA/O-antigen flippase |
|
| 70 | YP_004730750 |
| ORF6 | 7300.9282 | epimerase/dehydratase family WbfY-like protein |
|
| 94 | YP_857395 |
| ORF7 | 9279.10157 | glycosyl transferase, group 4 family protein |
|
| 91 | YP_857396 |
| ORF8 | 10304.11260 | UDP-glucose 4-epimerase |
|
| 92 | YP_857397 |
| ORF9 | 11260.12066 | Glycosyltransferase, family 2 |
|
| 65 | YP_001141293 |
| ORF10 | 12068.12787 | acyltransferase family protein |
|
| 30 | ZP_01733088 |
| ORF11 | 13147.16497 | putative N-acetyl glucosaminyl transferase |
|
| 50 | ZP_06189367 |
| ORF12 | 16515.17831 | transporter |
|
| 73 | ZP_06189366 |
| ORF13 | 17821.18639 | ABC-2 type transporter |
|
| 65 | ZP_01665322 |
| ORF14 | 18841.20259 | surface layer protein |
|
| 100 | ACV89427 |
| ORF15 | 21019.23250 | S-protein secretion D |
|
| 100 | AAA79322 |
| ORF16 | 23254.23754 | ORF2 | ORF2 |
| 100 | AAA79321 |
| ORF17 | 23751.24347 | ORF1 | ORF1 |
| 100 | AAA79320 |
| ORF18 | 24344.25090 | ORFJ, partial | ORFJ |
| 99 | AAA79319 |
| ORF19 | 25284.26306 | general secretion pathway protein K |
|
| 49 | YP_002798801 |
| ORF20 | 26306.26911 | type II secretion system protein |
|
| 48 | YP_002798802 |
| ORF21 | 26908.27312 | type II secretion system protein |
|
| 58 | AEA85764 |
| ORF22 | 27306.27677 | type II secretion system protein |
|
| 48 | AEA85763 |
| ORF23 | 27680.28114 | General secretion pathway protein G |
|
| 84 | AEA85762 |
| ORF24 | 28133.29332 | type II secretion system protein |
|
| 60 | YP_002798806 |
| ORF25 | 29332.30987 | type II secretion system protein E |
|
| 76 | YP_002798807 |
| ORF26 | 31755.32303 | dTDP-4-dehydrorhamnose 3,5-epimerase |
|
| 81 | ZP_09461002 |
| ORF27 | 32368.33246 | glucose-1-phosphate thymidylyltransferase |
|
| 94 | YP_001141271 |
| ORF28 | 33359.34246 | dTDP-4-dehydrorhamnose reductase |
|
| 98 | YP_857411 |
| ORF29 | 34246.35331 | dTDP-glucose 4,6-dehydratase |
|
| 97 | YP_857412 |
| ORFd | 36022.37437 | outer membrane protein OprM |
|
| 98 | YP_857413 |
Figure 6Predicted genomic islands (GIs) within the genome of A. hydrophila ML09-119.
GIs were predicted using the IslandViewer tool [32]. The black line indicates the %G+C content. All of the predicted GIs showed a %G+C content bias much lower than the average %G+C content of A. hydrophila (61.0%).
The distribution of 16 different VAh-specific genomic islands in different A. hydrophila isolates used in this study.
| GI # | Nucleotide positions | Size (kb) | Number of ORFs | %G+C |
| Epidemic-associated GIs |
|---|---|---|---|---|---|---|
| GI 1 | 8728.22581 | 13,853 | 12 | 44.0 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118 | Unique |
| GI 2 | 30195.41895 | 11,700 | 13 | 57.8 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118 | Unique |
| GI 3 | 71487.98770 | 27,283 | 28 | 41.3 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118 | Unique |
| GI 4 | 263720.284778 | 21,058 | 32 | 54.0 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118, AL97-91, MN98-04, TN97-08, AL06-01 | - |
| GI 5 | 76843.95300 | 18,457 | 22 | 51.5 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118, AL97-91, MN98-04, TN97-08, AL06-01 | - |
| GI 6 | 95365.107861 | 12,496 | 12 | 48.1 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118 | Unique |
| GI 7 | 123898.133969 | 10,071 | 12 | 44.5 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118, TN97-08, AL06-01 | - |
| GI 8 | 277559.288667 | 11,108 | 13 | 54.2 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118, AL97-91, MN98-04 | - |
| GI 9 | 91274.118831 | 27,557 | 36 | 51.1 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118, TN97-08 | - |
| GI 10 | 119893.134877 | 14,984 | 20 | 57.4 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118, TN97-08 | - |
| GI 11 | 21645.43624 | 21,979 | 33 | 42.3 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118 | Unique |
| GI 12 | 25686.46472 | 20,787 | 19 | 49.9 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118 | Unique |
| GI 13 | 1.8099 | 8,098 | 12 | 35.2 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118 | Unique |
| GI 14 | 51060.60399 | 9,339 | 12 | 63.6 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118 | Unique |
| GI 15 | 60579.71065 | 10,486 | 13 | 43.6 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118, TN97-08, | - |
| GI 16 | 351791.364972 | 13,182 | 18 | 56.4 | ML09-119, ML09-122, ML09-121, AL09-79, AL10-121, PB10-118 | Unique |
Summary of ORFs encoded within GI 13 of EAh isolate ML09-119 involved in CS5 pilus biogenesis.
| ORF ID | Nucleotide positions in contig 27 | Putative function | Gene | Top BLASTx hit | % Identity | E-value | % Identity to |
|---|---|---|---|---|---|---|---|
| ORF1 | 106.456 | Extracellular solute-binding protein family 3 | - |
| 48 | 2.27917 | - |
| ORF2 | 909.1490 | CS5 fimbrial major pilin subunit |
|
| 38 | 2E-24 | 38 (CAA11820) |
| ORF3 | 1555..2247 | 25.9 kDa protein in CS5 3' region precursor |
|
| 36 | 3.82E-30 | 28 (CAA11821) |
| ORF4 | 2303.3643 | P pilus assembly protein porin PapC-like protein |
|
| 29 | 9.65E-45 | 25 (CAA11822) |
| ORF5 | 3945.4679 | P pilus assembly protein porin PapC-like protein |
|
| 28 | 6.78E-19 | 22 (CAA11822) |
| ORF6 | 4669.5244 | Hypothetical protein |
|
| 26 | 4.91E-08 | 26 (CAA11823) |
| ORF7 | 5205.6023 | CS5 fimbrial minor pilin subunit |
|
| 35 | 2.56E-19 | 35 (CAA11825) |
| ORF8 | 6369.7127 | EAL domain protein | - |
| 45 | 1.92E-56 | - |
| ORF9 | 7258.8034 | Alpha/beta hydrolase, putative | - |
| 49 | 3.35E-64 | - |
Predicted genes that have homology to putative virulence factors and are present within VAh-associated genomic regions.
| Unique region ID | VFDB ID | GIs | Gene | Putative functions | Organisms | % Identity | E-value |
|---|---|---|---|---|---|---|---|
| C8R1 | VFG1693 | GI 1 |
| Prophage P4 integrase |
| 33 | 1.1E-45 |
| C10R1 | VFG0893 | - |
| PapA protein |
| 33 | 2.1E-16 |
| C10R1 | VFG0075 | - |
| Internalin A |
| 30 | 1.4E-16 |
| C13R2 | VFG0783 | - |
| Putative integrase for prophage 933L and the LEE pathogenicity island |
| 27 | 1.3E-16 |
| C15R4 | VFG0038 | GI 2 |
| probable oxidoreductase |
| 28 | 2.3E-19 |
| C15R4 | VFG0344 | GI 2 |
| iron(III) ABC transporter, ATP |
| 33 | 6.7E-23 |
| C15R5 | VFG0082 | - |
| aldehyde dehydrogenase |
| 31 | 1.2E-47 |
| C15R6 | VFG0598 | - |
| Sai integrase |
| 47 | 4.9E-93 |
| C18R3 | VFG0672 | GI 6 |
| integrase |
| 66 | 5E-162 |
| C20R4 | VFG1124 | - |
| conserved hypothetical protein |
| 41 | 4.2E-39 |
| C20R7 | VFG0925 | - |
| Ferric enterobactin transport ATP |
| 33 | 2.3E-23 |
| C20R7 | VFG0922 | - |
| Putative permease of iron compound ABC transport system |
| 39 | 3.8E-29 |
| C20R8 | VFG0358 | - |
| yersiniabactin receptor protein |
| 26 | 1.4E-34 |
| C20R8 | VFG0167 | - |
| transcriptional regulator PchR |
| 41 | 1.1E-26 |
| C26R1 | VFG1102 | GI 12 |
| DNA methylase HsdM |
| 25 | 4.6E-17 |
| C26R1 | VFG1098 | GI 12 |
| type I restriction enzyme HsdR |
| 24 | 1.3E-21 |
| C26R2 | VFG2417 | - |
| hypothetical protein |
| 47 | 5.7E-41 |
| C26R2 | VFG2416 | - |
| putative receptor |
| 56 | 1.2E-108 |
| C26R2 | VFG2415 | - |
| putative enzyme |
| 55 | 8.5E-142 |
| C26R2 | VFG2412 | - |
| hypothetical protein |
| 48 | 1.1E-50 |
| C26R2 | VFG2414 | - |
| hypothetical protein |
| 57 | 7.2E-44 |
| C26R2 | VFG2044 | - |
| Virulence factors transcription regulator |
| 47 | 7.7E-53 |
| C27R1 | VFG1433 | GI 13 |
| CS7 fimbria major subunit CsvA precursor |
| 39 | 6.8E-22 |
| C27R1 | VFG0584 | GI 13 |
| putative diguanylate cyclase |
| 29 | 4.8E-28 |
| C32R1 | VFG1584 | GI 14 |
| hypothetical protein |
| 26 | 1.9E-13 |
| C32R1 | VFG1584 | GI 14 |
| hypothetical protein |
| 26 | 4.7E-18 |
| C32R2 | VFG1888 | - |
| sensory box histidine kinase |
| 39 | 1.1E-41 |
| C32R2 | VFG0584 | - |
| putative diguanylate cyclase |
| 32 | 6.9E-32 |
| C36R3 | VFG1092 | GI 16 |
| integrase, phage family |
| 26 | 9.3E-15 |
| C36R3 | VFG1443 | GI 16 |
| outer membrane protein A |
| 38 | 1.8E-54 |
| C36R3 | VFG1548 | GI 16 |
| PrfC protein |
| 39 | 1E-164 |
| C36R3 | VFG1547 | GI 16 |
| PrfD protein |
| 51 | 2.7E-61 |
| C39R1 | VFG0266 | - |
| hemoglobin receptor |
| 31 | 1.4E-45 |
| C39R1 | VFG0266 | - |
| hemoglobin receptor |
| 36 | 6.5E-34 |
Figure 7Genetic elements involved in myo-inositol utilization in VAh strains.
The schematic organization depicts the cluster of genes involved in myo-inositol utilization in epidemic A. hydrophila ML09-119. The presence of a functional myo-inositol utilization pathway in VAh strains was confirmed by their ability to grow on myo-inositol as a sole carbon source.
Distribution of five prophages in different A. hydrophila isolates used in this study.
|
| AH1 | AH2 | AH3 | AH4 | AH5 |
|---|---|---|---|---|---|
| AL06-06 | Absent | Partial | Present | Absent | Absent |
| AL06-01 | Absent | Absent | Absent | Absent | Absent |
| AL97-91 | Absent | Partial | Partial | Partial | Absent |
| TN97-08 | Absent | Partial | Partial | Absent | Absent |
| MN98-04 | Absent | Present | Partial | Partial | Absent |
| ML09-119 | Present | Present | Present | Present | Present |
| ML09-121 | Present | Present | Present | Present | Present |
| ML09-122 | Present | Present | Present | Present | Present |
| AL09-79 | Present | Present | Present | Present | Present |
| AL10-121 | Present | Present | Present | Present | Present |
| PB10-118 | Present | Present | Present | Present | Present |
The reference mapping of TN97-08 against the prophage AH5 found that some reads from TN97-08 matched with this prophage genome but none of them encoded complete ORFs.
Figure 8Identification of a VAh-specific genetic region associated with prophage AH4 that is induced after mitomycin C treatment.
(Panel A) Mauve multiple genome alignment of the prophage A4 region from AH strains MN98-04 and AL97-91 with VAh strain ML09-119, depicting the two upstream ORFs (in red) and two within-prophage regions (shown by lack of Mauve alignment) that are VAh-associated. (Panel B) Induced phage DNAs were subjected to 454 pyrosequencing and were reference mapped against the AH4 prophage region of the A. hydrophila Ml09-119 genome. Each predicted ORF is indicated as an arrow, and the four VAH-associated ORFs are depicted in red.