| Literature DB >> 35592008 |
Michela Ruinelli1, Jochen Blom2, Theo H M Smits1, Joël F Pothier1.
Abstract
Members of the Pseudomonas syringae species complex cause symptoms that are ranging from leaf spots to cankers on a multitude of plant species, including some of the genus Prunus. To date, a total of two species of the P. syringae species complex and six different pathovars have been associated with diseases on Prunus spp., which were shown to belong to different phylogenetic units (phylogroups, PG) based on sequence similarity of housekeeping genes or whole genomes, suggesting that virulence to Prunus spp. may be the result of convergent pathoadaptation. In this study, a comparative genomics approach was used to determine genes significantly associated with strains isolated from Prunus spp. across a phylogeny of 97 strains belonging to the P. syringae species complex. Our study revealed the presence of a set of orthologous proteins which were significantly associated with strains isolated from Prunus spp. than in strains isolated from other hosts or from non-agricultural environments. Among them, the type III effector HopAY predicted to encode for a C58 cysteine protease was found to be highly associated with strains isolated from Prunus spp. and revealed patterns supporting co-evolution and host adaptation.Entities:
Keywords: Pseudomonas syringae species complex; co-evolution; comparative genomics; host adaptation; pathogenicity
Year: 2022 PMID: 35592008 PMCID: PMC9111521 DOI: 10.3389/fmicb.2022.804681
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
List of strains used for this study.
| Strain | Code | Origin | Host | GenBank accession | Reference | Genome subset |
|---|---|---|---|---|---|---|
| Pavii CFBP 3846 | FR, 1991 |
| LT963402- |
| A | |
| Ppe CFBP 1573 | FR, 1974 |
| ODAL01 |
| A | |
| Ppe NCPPB 2254 | FR, 1969 |
| ODAM01 |
| A | |
| Ppe NCPPB 2254* | FR, 1969 |
| LAZV01 |
| – | |
| Pmp2 HRIW5261 | UK, 1990 |
| LIIA01 |
| B | |
| Pmp2 CFBP 3800 | UK, N.D. |
| OLMQ01 |
| B | |
| Pmp2 CFBP 6411 | UK, 1995 |
| LT963408 |
| B | |
| Pmp2 MAFF 302280 | US, N.D. |
| AEAE01 |
| B | |
| P. cerasi PL58 | PL, 2007 |
| LT222313- |
| D | |
| P. cerasi PL963 | PL, 2007 |
| LT963395- |
| D | |
| Psy 2339 | HU, 1984 |
| LIHU01 |
| – | |
| Psy CFBP 2118 | FR, 1979 |
| LT962481 |
| – | |
| Psy CFBP 4215 | FR. 1997 |
| LT962480 |
| – | |
| P. amygdali CFBP 3205 | GR, 1967 |
| JYHB01 |
| – | |
| Pmp1 2341 | HU, 1988 |
| LIIB01 |
| C | |
| Pmp FTRSU7805* | JP, 1978 |
| LGLQ01 | N.A. | – | |
| Pmp1 HRIW5269 | UK, 1990 |
| LIHZ01 |
| C | |
| Pmp1 CFBP 2116 | FR, 1974 |
| LT985192- |
| C | |
| Pmp1 CFBP 3840 | FR, 1996 |
| LT963409- |
| C | |
| Pscer CFBP 6109 | JP, 1995 |
| LT963391- |
| C | |
| Pscer CFBP 6110 | JP, 1995 |
| LT985210- |
| C | |
| Psa3 ICMP 18884 | NZ, 2010 |
| CP011972- |
| B | |
| Psa2 ICMP 19073 | KR, 1998 |
| AOJR01 |
| B | |
| Psa1 ICMP 9617 | JP, 1984 |
| CM002753- |
| B | |
| Pfm ICMP 18883 | NZ, 2010 |
| AOKH01 |
| B | |
| Pto DC3000 | UK, 1960 |
| AE016853- |
| A | |
| Pto NCPPB 1108 | UK, 1960 |
| ADGA01 | N.A. | A | |
| Pto NYS-T1 | US, 2009 |
| JRRA01 |
| A | |
| Pto T1 | 1986 |
| ABSM01 |
| A | |
| Pav BPIC631 | GR, 1976 |
| AKBS01 |
| B | |
| Pav CRAFRUEC1 | IT, 2003 |
| ATLL01 |
| B | |
| Pav PaVt10 | IT, 2010 |
| JYHC01 |
| B | |
| Psav ISPAVE013 | IT, 1992 |
| AKCJ01 |
| – | |
| Psav ISPAVE037 | IT, 1992 |
| AKCK01 |
| – | |
| Pla M302278 | US, 1935 |
| AEAM01 |
| A | |
| CC440 | FR, 2002 |
| AVEC02 |
| – | |
| CC457 | FR, 2003 |
| AVEB02 |
| – | |
| CC94 | FR, 1997 |
| AVEA02 |
| – | |
| Pla MAFF 301315 | JP, 1975 |
| AEAF01 |
| C | |
| Pae 2250 | United Kingdom |
| ACXT01 |
| – | |
| Pae 0893_23 | IN, 1969 |
| AEAD01 |
| – | |
| Pae NCPPB 3681 | IN, 1980 |
| ACXS01 |
| – | |
| Paf DSM 50255 | CA, 1942 |
| AWUI01 | N.A. | – | |
| Psy B64 | N.D. |
| ANZF01 |
| – | |
| Psy SM | United States |
| APWT01 |
| – | |
| BRIP39023 | AU, 1971 |
| AMZX01 |
| D | |
| BRIP34876 | AU, 1971 |
| AMXK01 |
| – | |
| BRIP34881 | AU, 1971 |
| AMXL01 |
| – | |
| Psy B728a | US, 1987 |
| CP000075 |
| – | |
| Pph 1448a | ET, 1985 |
| CP000058- |
| C | |
| Psv DAPP-PG722 | IT, 2007 |
| JOJV01 |
| – | |
| Psv PseNe107 | NP, 2007 |
| JYHF01 |
| – | |
| CC1458 | US, 2005 |
| AVEN02 |
| – | |
| CC1466 | US, 2005 |
| AVEM02 |
| – | |
| Pgy B076 | 2007 |
| AEGG01 |
| C | |
| Pgy r4 | 1977 |
| AEGH01 |
| C | |
| Pta 6605 | JP | AJXI01 | N.A. | C | ||
| Pta ATCC 11528 | US, 1905 | AEAP01 |
| C | ||
| Psy A2 | N.D. |
| LGKU01 | N.A. | – | |
| Psy B301D-R | UK, 1969 | JALJ01 |
| – | ||
| CC1630 | US, 2007 | AVED02 |
| – | ||
| Pma CFBP1657 | NZ, 1965 |
| JYHH01 |
| A | |
| Pth ICMP 3923 | JP, 1970 |
| LJRU01 | N.A. | – | |
| Pvi ICMP 3963 | US, N.d. | LJRR01 | N.A. | – | ||
| Ppp ICMP 4048 | CN, 1973 |
| LJRB01 | N.A. | D | |
| UMAF0158 | ES, 1993 | CP005970- |
| D | ||
| Ppa LMG 2367 | US, 1963 | ALAC01 |
| – | ||
| Psy DSM 10604 | UK, 1950 |
| JALK01 | N.A. | – | |
| Psy HS191 | AU, 1979 |
| CP006256- |
| – | |
| Psy 642 | US, 2007 | Unidentified weed | ADGB01 |
| – | |
| Psy 1212 | United Kingdom |
| AVCR02 |
| – | |
| Pde CFBP 3226 | JP, 1979 |
| JYHG01 |
| – | |
| P.meliae CFBP 3225 | JP, 1974 |
| JYHE01 |
| – | |
| Par ICMP 4457 | JP, 1967 |
| LJPS01 | N.A. | – | |
| CC1513 | FR, 2006 |
| AVEL02 |
| – | |
| CC1629 | US, 2007 |
| AVEE02 |
| – | |
| Pcb ICMP 2823 | HU, 1957 |
| LJPX01 | N.A. | – | |
| Pal ES4326 | US, 1965 |
| AEAK01 |
| – | |
| Phe ICMP4531 | MX, 1972 |
| LJQM01 | N.A. | – | |
| Ptg ICMP4091 | ZW, 1972 |
| LJRM01 | N.A. | – | |
| Pvir TA043 | FR, 2007 |
| AVDV01 |
| – | |
| Pvir UASWS0038 | CH, 2007 | AMQP01 |
| – | ||
| CC1416 | US, 2004 | Epilithon | AVEP02 |
| – | |
| CC1544 | FR, 2006 | Lake water | AVEI02 |
| – | |
| CC1559 | FR, 2006 | Snow | AVEG02 |
| – | |
| USA007 | US, 2007 | Stream water | AVDY02 |
| – | |
| CC1543 | FR, 2006 | Lake water | AVEJ02 |
| – | |
| UB0390 | FR,2007 | River water | JPQV01 | N.A. | – | |
| UB303 | FR, 2006 | Lake water | AVDZ02 |
| – | |
| USA011 | US, 2007 | Stream water | AVDX02 |
| – | |
| Pvir CC1582 | FR, 2006 | Epilithon | AVDW01 |
| – | |
| CC1417 | US, 2004 | Epilithon | AVEO02 |
| – | |
| CC1524 | FR, 2006 | Stream water | AVEK02 |
| – | |
| CC1583 | FR, 2006 | Epilithon | AVEF02 |
| – | |
| CC1557 | FR, 2006 | Snow | CP007014- | N.A. | – | |
| GAW0119 | FR, 2010 | Irrigation canal | JPQU01 | N.A. | – | |
| CEB003 | FR, 2010 | Stream water | JPQT01 | N.A. | – | |
| Pfl Pf0-1 | US, 1987 | Soil | CP000094 |
| – | |
| Ppu KT2440 | N.D. | Soil | AE015451 |
| – |
Superscript following strain names indicate .
Superscript asterisk following strain name indicates strains with a suspected misnaming based on core genome phylogeny. Culture collections providing strains are abbreviated in the strain names as ATCC (American Type Culture Collection, Manassas, Virginia, United States), CFBP (Collection Française de Bactéries associées aux Plantes, FR), DSM (German Collection of Microorganisms and Cell Cultures, DE), ICMP (International Collection of Microorganisms from Plants, NZ), LMG (Bacteria collection of the Laboratory for Microbiology of the Faculty of Sciences of the Ghent University, BE), NCPPB (National Collection of Plant Pathogenic Bacteria, United Kingdom), and MAFF (NIAS Genebank of the Ministry of Agriculture, Forestry and Fisheries, JP).
N.D.: not determined.
For Whole Genome Shotgun (WGS) sequences, accession numbers are provided as four letters prefixes and two digits for the version number of the data set.
N.A.: not applicable.
Figure 1Neighbor-joining (NJ) phylogeny based on the core genome of the Pseudomonas syringae species complex and distribution profile of 13 proteins considered to be significantly associated with Prunus spp. among these strains. A set of 99 genomes of strains belonging to the P. syringae species complex as well as one Pseudomonas fluorescens and one Pseudomonas putida strains were used for this analysis (Table 1). The tree was built using EDGAR 2.2 (Blom et al., 2016) out of a core genome of 1,344 CDS giving a total alignment of 5,36,722 amino acids per genome. Percent bootstrap (bt) support values calculated for 500 reiterations are indicated near nodes. Only bt values over 51 are displayed. The strain names refer to the code field from Table 1. Phylogroups (PG) and clades are indicated on the right. Strains isolated from Prunus spp. are indicated in red, from other woody hosts in blue, from herbaceous hosts in black, and from non-agricultural environments in green. Strain names followed by an asterisk (*) indicate strains which were excluded from further comparative analyses due to a presumed misnaming of the genome. Genome subsets (A-D) used to determine the correlation between gene presence and Prunus spp. association are indicated with color highlights. Arrows indicate the node at which Prunus-associated strains are diverging from the non-Prunus-associated strains within the same genome subset. Protein orthologs were retrieved out of these 97 genomes using EDGAR 2.2 (Blom et al., 2016). Black squares indicate presence of the protein based on FIGURE 1the orthology criteria of EDGAR 2.2. Proteins highlighted in pink are involved in virulence based on their annotation. Protein descriptions followed by two asterisks (**) indicate that orthologs were also found using online TBLASTN analysis against 13 additional Pseudomonas species closely related to the P. syringae species complex as reported in Supplementary Figure 4. The proteins are ordered by decreasing significance of the likelihood ratio (LR) statistic when exceeding the p ≤ 0.05 threshold of 5.36. This order is not indicative of any physical proximity. Locus tags and LR statistic are reported over each considered protein; h.p.: hypothetical protein; NA: not applicable; and ND: not determined.
Figure 2Venn diagram showing groups of ortholog proteins within Prunus-associated Pseudomonas syringae strains belonging to different PG but absent from phylogenetically closely related non-Prunus-associated strains. Subsets of genomes to be compared were defined based on core genome phylogeny within PG1a (subset A), PG1b (subset B), PG2a (subset C), and PG3 (subset D; see Figure 1). For each subset of genomes, the core genome of the Prunus-associated strains was calculated, and at the same time, all ortholog proteins found in non-Prunus-associated strains within the same subset were discarded using EDGAR 2.2 (Blom et al., 2016). The protein sequences resulting for each of the subsets (gene sets A–D) were used as BLASTP query against each other using standalone BLAST v.2.2.29+ (Camacho et al., 2009) and considered as ortholog if identity and coverage were higher or equal to 60%.
Number of genes significantly more present in Prunus spp. isolated strains.
| LR value | Number of genes | Proportion (%) | ||||
|---|---|---|---|---|---|---|
| Expected | Observed | Retained | Tested | Flexible | ||
| 0.05 | 5.36 | 3 | 19 | 13 | 38.78 | 1.80 |
| 0.01 | 6.40 | <1 | 17 | 12 | 34.69 | 1.61 |
| 0.001 | 9.08 | <1 | 11 | 7 | 22.45 | 1.04 |
| 0.0001 | 12.62 | <1 | 3 | 3 | 6.12 | 0.28 |
| 0.00001 | 12.63 | <1 | 3 | 3 | 6.12 | 0.28 |
Expected number of Type I (false-positive) errors under the null model.
Retained based on the criteria present in 60% of the Prunus isolated strains.
Proportion of the 49 tested genes (three genes skipped based on the criteria occurring in either greater than six or fewer than 92 strains).
Proportion of the total flexible genome of subsets A-D (1,058 genes).
Figure 3Comparison of the serine/threonine protein kinase PBS1 among different plant species (A) and relation to the HopAY sequence (B). (A) Maximum likelihood phylogeny of the PBS1 protein among different plant species and relative PBS1 sequence stretch corresponding to the amino acids 233–252 of the Arabidopsis thaliana PBS1 sequence (NCBI Acc. Nr. NP_196820) containing the HopAR1 cleavage site GDK (brown block). The corresponding EDK region in the PBS1 sequence of the Prunus spp. is highlighted in red. In the phylogeny Prunus spp. members are reported in red, whereas plants other than Prunus spp. from which Pseudomonas syringae strains possessing the HopAY gene were isolated are indicated in blue. The gray block indicates species belonging to the Rosaceae family. The evolutionary distances were computed using the Jones–Taylor–Thornton model matrix-based with a gamma distribution (shape parameter = 2.25). Percent bootstrap (bt) support values calculated for 1,000 reiterations are indicated near node. Only bt values over 51 are displayed. All ambiguous positions were removed for each sequence pair giving a total of 598 positions. Alignments were obtained using ClustalW and phylogenetic analysis was performed with MEGA 6.0. (B) Alignment of HopAY and PBS1 around the EDK region. The first 133 amino acids (aa) of the HopAY sequence of Pseudomonas syringae pv. morsprunorum race 2 CFBP 3800 (Pmp2 CFBP3800) were aligned to the aa 170–302 of the Prunus persica PBS1 sequence (NCBI Acc. Nr. XP_007225732). The EDK motif is highlighted in red, identical residues are highlighted in orange, whereas residues sharing similar side chain properties at a specific position are indicated by asterisks.
Figure 4Insertion–deletion profiles of hopAY retrieved by BLASTN and corresponding HopAY truncation scheme. Five major groups (1–5) were defined based on conserved indel mutations. Numbers in bracket indicate the position of the insertion or deletion based on the reference hopAY sequence from Pseudomonas syringae pv. phaseolicola 1448a (indicated in bold) available in the Hop database (PPI). HopAY proteins with a truncation affecting the catalytic domains characteristic for C58 cysteine protease and thus considered as non-functional are represented in red. The strain names used refer to the code field from Table 1. Figure is not to scale.
Figure 5Comparison of the neighbor-joining phylogeny based on the core genome (A) and on HopAY (B). (A) The core genome of the 36 strains possessing an hopAY ortholog based on the BLASTN search was determined using EDGAR 2.2 (Blom et al., 2016) out of a core genome of 2,511 CDS giving a total alignment of 872,675 amino acids per genome. The strain names used refer to the code field from Table 1. Phylogroups and clades are indicated on the left and on the right, respectively. (B) The phylogeny of the 43 retrieved HopAY sequences was computed using the Jones–Taylor–Thornton model matrix-based with a gamma distribution (shape parameter = 2.25). Percent bootstrap (bt) support values calculated for 1,000 reiterations are indicated near nodes. Only bt values over 51 are displayed. All ambiguous positions were removed for each sequence pair giving a total of 323 positions in the final data set. Evolutionary analyses were conducted in MEGA 6.0. Indel groups (gray dashed lines boxes) refer to Figure 4. Asterisks (*) indicate strains with unique indel profiles. If nothing stated, sequences belong to indel group 1. For strains possessing multiple copies of HopAY, the locus tag is indicated in brackets. Strains isolated from Prunus spp. are highlighted in red. PG and clades are indicated with the same color code as used on the left in panel (A).