| Literature DB >> 23409014 |
Rachel A Mann1, Theo H M Smits, Andreas Bühlmann, Jochen Blom, Alexander Goesmann, Jürg E Frey, Kim M Plummer, Steven V Beer, Joanne Luck, Brion Duffy, Brendan Rodoni.
Abstract
The plant pathogen Erwinia amylovora can be divided into two host-specific groupings; strains infecting a broad range of hosts within the Rosaceae subfamily Spiraeoideae (e.g., Malus, Pyrus, Crataegus, Sorbus) and strains infecting Rubus (raspberries and blackberries). Comparative genomic analysis of 12 strains representing distinct populations (e.g., geographic, temporal, host origin) of E. amylovora was used to describe the pan-genome of this major pathogen. The pan-genome contains 5751 coding sequences and is highly conserved relative to other phytopathogenic bacteria comprising on average 89% conserved, core genes. The chromosomes of Spiraeoideae-infecting strains were highly homogeneous, while greater genetic diversity was observed between Spiraeoideae- and Rubus-infecting strains (and among individual Rubus-infecting strains), the majority of which was attributed to variable genomic islands. Based on genomic distance scores and phylogenetic analysis, the Rubus-infecting strain ATCC BAA-2158 was genetically more closely related to the Spiraeoideae-infecting strains of E. amylovora than it was to the other Rubus-infecting strains. Analysis of the accessory genomes of Spiraeoideae- and Rubus-infecting strains has identified putative host-specific determinants including variation in the effector protein HopX1(Ea) and a putative secondary metabolite pathway only present in Rubus-infecting strains.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23409014 PMCID: PMC3567147 DOI: 10.1371/journal.pone.0055644
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Strain metadata and genome sequence statistics for the 12 E. amylovora strains analyzed in this study.
| Chromosome metrics | ||||||||||||||
| Strain name (synonyms) | Isolated from | Origin | Genome status | Replicon | Accession numbers | Length (bp) | G+C content | Number of CDS | % coding | CD density (per kb) | Reference | |||
| Spiraeoideae-infecting isolates | ||||||||||||||
| CFBP 1430 |
| France, 1972 | Complete | Chromosome | FN434113 | 3805573 | 53.6 | 3706 | 85.6 | 0.973 |
| |||
| pEA29 | FN434114 | 28259 | ||||||||||||
| ATCC 49946 (Ea273) |
| New York, USA, 1973 | Complete | Chromosome | FN666575 | 3805874 | 53.6 | 3712 | 85.1 | 0.903 |
| |||
| pEA72 | FN666577 | 71487 | ||||||||||||
| pEA29 | FN666576 | 28243 | ||||||||||||
| CFBP 1232T (NCPPB 683T) |
| UK, 1959 | Draft | Chromosome | CAPB01000001-42 | 3767276 | 53.6 | 3780 | 86.6 | 1.003 |
| |||
| pEA29 | HF560650 | 28251 | ||||||||||||
| CFBP 2585 (Ea495) |
| Ireland, 1986 | Draft | Chromosome | CAOZ01000001-12 | 3767556 | 53.6 | 3734 | 86.0 | 0.992 |
| |||
| pEA30 | HF560646 | 29586 | ||||||||||||
| pEA29 | HF560645 | 28258 | ||||||||||||
| Ea266 (E4001A) |
| Ontario, Canada | Draft | Chromosome | CAOY01000001-38 | 3758663 | 53.6 | 3804 | 86.1 | 1.011 |
| |||
| pEA29 | HF560644 | 28261 | ||||||||||||
| Ea356 (Ea1/79) |
| N-Germany, 1979 | Draft | Chromosome | CAOX01000001-14 | 3763948 | 53.6 | 3760 | 86.2 | 0.998 |
| |||
| pEA29 | HF560643 | 28258 | ||||||||||||
| 01SFR-BO |
| Ravenna, Italy, 1991 | Draft | Chromosome | CAPA01000001-11 | 3767556 | 53.6 | 3744 | 86.1 | 0.993 |
| |||
| pEA29 | HF560647 | 28259 | ||||||||||||
| ACW 56400 |
| Fribourg, Switzerland, 2007 | Draft | Chromosome | AFHN01000001-22 | 3766903 | 53.6 | 3758 | 86.5 | 0.995 |
| |||
| pEI70 | CP002951 | 65831 | ||||||||||||
| pEA29 | AFHN01000023 | 28251 | ||||||||||||
| UPN527 |
| Navarra, Spain, 1997 | Draft | Chromosome | CAPC01000001-18 | 3766971 | 53.6 | 3746 | 86.1 | 0.994 |
| |||
|
| ||||||||||||||
| ATCC BAA-2158 (BB1, IL5, Ea246, BC204) |
| Illinois, USA, 1972 | Draft | Chromosome | FR719181-209 | 3808219 | 53.6 | 3827 | 86.2 | 1.004 |
| |||
| pEA29 | FR719212 | 28138 | ||||||||||||
| pEAR5.2 | FR719211 | 5251 | ||||||||||||
| pEAR4.3 | FR719210 | 4369 | ||||||||||||
| Ea644 |
| Massachusetts, USA, 2003 | Draft | Chromosome | CAPD01000001-40 | 3803638 | 53.3 | 3937 | 86.4 | 1.034 |
| |||
| pEA29 | HF560648 | 28689 | ||||||||||||
| MR1 (Ea574) |
| Michigan, USA | Draft | Chromosome | CAPE01000001-29 | 3789707 | 53.4 | 4042 | 84.7 | 1.067 |
| |||
| pEA29 | HF560649 | 27604 | ||||||||||||
Figure 1Circular plot of the pan-genome of E. amylovora.
The CDS of the pan-genome (forward and reverse) are depicted in the two outermost circles (aqua). Moving inwards, the core genome is depicted in yellow and the accessory genome in black. The accessory genome of the individual strains of E. amylovora continue inwards as follows: Rubus-infecting strains MR1 (red), Ea644 (pink) and ATCC BAA-2158 (purple), and Spiraeoideae-infecting strains CFBP 1430 (light blue), ATCC 49946 (royal blue), Ea266 (dark green), CFBP 2585 (tan), 01SFR-BO (sky blue), Ea356 (teal), UPN527 (navy blue), ACW 56400 (orange) and CFBP 1232T (light green). Variable regions of interest are numbered with a pan-genome locus (PL) of 1 to 32 and are described in Supplementary Tables 1 and 2. Of note are PL 4 (ICE flanking PAI-1), PL 20 (secondary metabolite cluster only found in Rubus-infecting strains), PL 27 (sequence from the Rubus-infecting strains that could not be assembled into contiguous sequence), PL 28 (pEA72), PL 29 (pEA29), PL 30 (pEI70), PL 31 (pEAR5.2 and pEAR4.3) and PL 32 (pEA30).
Percent average amino acid identities (AAI) calculated from the core genome data set using EDGAR and MUMi scores of genomic distance between the 12 E. amylovora strains and closely related Erwinia spp.
| Average amino acid identities (%) | MUMi Scores | |||||
| Data set | Average (Stdev) | Min | Max | Average (Stdev) | Min | Max |
| All | 99.72 (0.34) | 99.19 | 100 | 0.044 (0.050) | 0.000 | 0.122 |
| All Spiraeoideae-infecting isolates | 99.98 (0.02) | 99.93 | 100 | 0.005 (0.003) | 0.000 | 0.010 |
| All | 99.42 (0.36) | 99.19 | 99.83 | 0.089 (0.050) | 0.031 | 0.119 |
| ATCC BAA-2158 to all Spiraeoideae-infecting isolates | 99.79 (0.02) | – | – | 0.044 (0.002) | – | – |
| ATCC BAA-2158 to Ea644 and MR1 | 99.23/99.19 | – | – | 0.116/0.118 | – | – |
| MR1 to all Spiraeoideae-infecting isolates | 99.21 (0.00) | – | – | 0.118 (0.002) | – | – |
| Ea644 to all Spiraeoideae-infecting isolates | 99.24 (0.02) | – | – | 0.116 (0.002) | – | – |
| MR1 to Ea644 | 99.83 | – | – | 0.031 | – | – |
|
| 95.44 (0.05) | – | – | 0.588 (0.004) | – | – |
|
| 92.66 (0.01) | – | – | 0.804 (0.001) | – | – |
|
| 85.07 (0.01) | – | – | 0.940 (0.000) | – | – |
MUMi score values vary from 0 for identical genomes to 1 for very distant genomes.
Figure 2Phylogenetic analysis of Erwinia species created in EDGAR based on concatenated sequence of the core genome.
All strains of E. amylovora cluster together and are separate from the other Erwinia species. The Spiraeoideae-infecting strains form a distinct cluster within E. amylovora and the Rubus-infecting strain ATCC BAA-2158 (Rubus-infecting 1) clusters more closely with these strains than with the other two Rubus-infecting strains labeled Rubus-infecting 2.
Percentage of CDS predicted to be core in the genome of each selected bacterial species.
| Species | % core CDS | No. Genomes | Reference | |
|
| 44% | 17 | Open |
|
|
| 48% | 6 | Unknown |
|
|
| 63% | 4 | Open |
|
|
| 64% | 19 | Unknown |
|
|
| 74% | 44 | Open |
|
|
| 75% | 5 | Open |
|
|
| 80% | 26 | Open |
|
|
| 85–89% | 17 | Closed |
|
|
| 89% | 12 | Open | This study |
|
| 98% | 9 | Closed |
|
Figure 3Singleton development plot analysis.
Single development plots defined using 12 strains of E. amylovora including plasmids (A) and excluding plasmids (B), and 9 Spiraeoideae-infecting strains of E. amylovora including plasmids (C) and excluding plasmids (D). All plots indicate that the pan-genome of E. amylovora is ‘open’, predicting that each additional strain sequenced will add 52 (Plot A), 40 (Plot B), 30 (Plot C) and 11 (Plot D) new singletons to their respective pan-genome sets.