Literature DB >> 30103675

Horizontal gene transfer plays a major role in the pathological convergence of Xanthomonas lineages on common bean.

Nicolas W G Chen1, Laurana Serres-Giardi1, Mylène Ruh1, Martial Briand1, Sophie Bonneau1, Armelle Darrasse1, Valérie Barbe2, Lionel Gagnevin3,4, Ralf Koebnik4, Marie-Agnès Jacques5.   

Abstract

BACKGROUND: Host specialization is a hallmark of numerous plant pathogens including bacteria, fungi, oomycetes and viruses. Yet, the molecular and evolutionary bases of host specificity are poorly understood. In some cases, pathological convergence is observed for individuals belonging to distant phylogenetic clades. This is the case for Xanthomonas strains responsible for common bacterial blight of bean, spread across four genetic lineages. All the strains from these four lineages converged for pathogenicity on common bean, implying possible gene convergences and/or sharing of a common arsenal of genes conferring the ability to infect common bean.
RESULTS: To search for genes involved in common bean specificity, we used a combination of whole-genome analyses without a priori, including a genome scan based on k-mer search. Analysis of 72 genomes from a collection of Xanthomonas pathovars unveiled 115 genes bearing DNA sequences specific to strains responsible for common bacterial blight, including 20 genes located on a plasmid. Of these 115 genes, 88 were involved in successive events of horizontal gene transfers among the four genetic lineages, and 44 contained nonsynonymous polymorphisms unique to the causal agents of common bacterial blight.
CONCLUSIONS: Our study revealed that host specificity of common bacterial blight agents is associated with a combination of horizontal transfers of genes, and highlights the role of plasmids in these horizontal transfers.

Entities:  

Keywords:  Common bean; Horizontal gene transfer; Host adaptation; TAL effectors; Xanthomonas

Mesh:

Substances:

Year:  2018        PMID: 30103675      PMCID: PMC6090828          DOI: 10.1186/s12864-018-4975-4

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

In nature, most pathogens are generalists, meaning that they are able to infect multiple hosts, while other pathogens are specialists, meaning that they are highly adapted to a single or few host species [1]. For plant pathogens, adaptation to a specific host plant is a complex process possibly involving diverse molecular determinants and leading to host specificity [2, 3]. Understanding the molecular basis of host specificity can provide new insights into the evolution and ecology of specialist pathogens, and their potential to shift species and to infect new hosts. Bacteria from the genus Xanthomonas infect at least 392 plant species including important crops and ornamentals [4]. Yet, each individual strain is able to infect only one or few plant species. Strains able to cause the same symptoms on the same host range are grouped into pathovars [5]. Although our understanding of the molecular basis of host specificity is still limited, chemotactic sensors, adhesins and type III effectors emerge as key determinants for shaping host specificity in Xanthomonas [6-8]. Chemotactic sensors enable the bacteria to detect attractant or repellent molecules and trigger flagellar motility towards entry sites of the host plant, while adhesins allow the attachment on the host plant surface and biofilm formation, and type III effectors are delivered into the plant cells where they can have different functions including providing pathogen-associated molecular pattern triggered immunity (PTI). The Xanthomonas axonopodis species complex sensu Vauterin [9, 10] groups more than 30 pathovars infecting a wide range of plants including economically important crops and ornamentals, such as Citrus, Anthurium and Dieffenbachia species, as well as pepper, cassava, cotton, mango, soybean, and common bean. Based on repetitive-sequence-based Polymerase Chain Reaction (rep-PCR) fingerprints, X. axonopodis has been subdivided into six subclusters named 9.1 to 9.6 [11]. More recently, this species complex has been split into the four species X. citri, X. euvesicatoria, X. phaseoli and X. axonopodis [12, 13]. Common bacterial blight of bean (CBB) is the most devastating bacterial disease infecting common bean (Phaseolus vulgaris L.). CBB occurs everywhere where common bean is cultivated and may cause up to 75% yield loss in the most severe cases [14, 15]. Xanthomonas strains responsible for CBB are distributed across four different genetic lineages [16]. The fuscous lineage (fuscans) and the non-fuscous lineages 2 (NF2) and 3 (NF3) belong to X. citri pv. fuscans while the non-fuscous lineage 1 (NF1) belongs to X. phaseoli pv. phaseoli [9, 11, 12]. Pathological convergence between the NF1 and fuscans lineages is associated with horizontal gene transfers (HGT) involving dozens of genes [17]. Horizontal transfer of genes encoding Transcription Activator-Like (TAL) type III effectors was also observed between the four lineages of CBB agents [18]. In particular, all strains from the four genetic lineages display an allele of the tal23A gene, suggesting that this gene is important for Xanthomonas adaptation to common bean. In order to search for Xanthomonas genes putatively involved in the adaptation leading to common bean specificity in Xanthomonas, we have generated the whole genome sequences of 17 X. citri pv. fuscans and X. phaseoli pv. phaseoli strains. A combination of approaches including a comparison between the phylogeny of genes and the phylogeny of organisms, a parsimony approach to infer gene gains and losses, and a genome-wide search for specific k-mers, was used to search for genes presenting common characteristics unique to strains belonging to the four bean-pathogenic lineages of X. citri pv. fuscans and X. phaseoli pv. phaseoli.

Results

Genome sequencing and phylogeny

In order to obtain genomic data representative of the diversity of Xanthomonas strains responsible for CBB, we produced whole genome sequences for 17 strains from the four genetic lineages of X. citri pv. fuscans and X. phaseoli pv. phaseoli that affect beans. In addition, we sequenced two strains of X. citri pv. mangiferaeindicae, one strain of X. citri pv. anacardii, three strains of X. oryzae pv. oryzicola, and used 51 other publically available Xanthomonas genomes for a total of 72 whole genome sequences (Table 1). Stenotrophomonas maltophilia strain R551–3 and Xylella fastidiosa strains 9a5c and Temecula1 were used as outgroups for further analyses [19-21]. Annotation revealed from 3209 to 5405 coding sequences (CDS) per Xanthomonas genome (Additional file 1). Among Xanthomonas strains responsible for CBB, chromosome size ranged from 4,957,446 bp in strain CFBP1815 to 5,517,999 bp in strain CFBP6992, with an average GC content ranging from 64.3 to 64.9%. The phylogeny of strains was assessed based on the amino acid sequences of all annotated CDS using CVTree (Fig. 1). The overall topology of this tree was congruent with previous Xanthomonas phylogenies [11, 12, 22]. As described previously, strains responsible for CBB are distributed into four distinct genetic lineages belonging to two different species, X. citri and X. phaseoli [12, 13, 16, 23].
Table 1

Informations on the sequenced strains used in this study

IdentifierSpecies/PathovarStrainHost of isolationCountry and date of isolationContigsTotal size (bp)GC%Accession numberReference
stenma5513 Stenotrophomonas maltophilia R551-3 Populus trichocarpa na14,573,96966.29GCA_000020665.1Taghavi et al., 2008 [21]
albiliPC73 X. albilineans GPE PC73Saccharum spp. cv. H63-1418.Guadeloupe (France), 200313,768,69562.97GCA_000087965.1Pieretti et al., 2009 [77]
aberra6865X. campestris pv. campestrisCFBP6865Brassica oleracea var. capitata.Australia, 197545,169,51864.93UnpublishedNoël, comm. pers
barbar5825X. campestris pv. barbareaeCFBP5825R Barbarea vulgaris USA, 193925,055,18665.09ATNQ00000000Roux et al., 2015 [78]
campes8004X. campestris pv. campestris8004Brassica oleracea var. botrytis.United Kingdom, 195815,148,70864.95GCA_000012105.1Qian et al., 2005 [79]
campesAT33X. campestris pv. campestrisATCC 33913Brassica oleracea var. gemmiferaUnited Kingdom, 195715,076,18865.06GCA_000007145.1da Silva et al., 2002 [80]
campesB100X. campestris pv. campestrisB100nana15,079,00265.04GCA_000070605.1Vorhölter et al., 2008 [81]
incana2527X. campestris pv. incanaeCFBP2527R Matthiola incana USA, 195014,926,20565.12ATNO00000000Roux et al., 2015 [78]
incana1606X. campestris pv. incanaeCFBP1606R Matthiola incana France, 197414,967,65165.17ATNN00000000Roux et al., 2015 [78]
musace4381X. campestris pv. musacearumNCPPB4381Musa sp.Uganda, 20051154,810,03861.96ACHT00000000Studholme et al., 2010 [82]
raphan5828X. campestris pv. raphaniCFBP5828R Raphanus sativus USA, na24,912,70965.35ATNP00000000Roux et al., 2015 [78]
raphan756CX. campestris pv. raphani756CBrassica oleracea var. capitataeast asia, na14,941,21465.28GCA_000221965.1Bogdanove et al., 2011 [83]
cassav4642 X. cassavae CFBP4642 Manihot esculenta Malawi, 195175,278,19265.22GCA_000454545.1Bolot et al., 2013 [84]
axanac2913X. citri pv. anacardiiCFBP2913 Mangifera indica Brazil, na15,203,49664.57CP024057this study
axaura1035X. citri pv. aurantifoliiICPB10535 Citrus aurantiifolia Brazil, na3515,060,89664.81ACPY00000000Moreira et al., 2010 [85]
axcitr1083X. citri pv. citriFDC1083 Citrus reticulata Brazil, 198015,218,31464.72CCVZ01000000Gordon et al., 2015 [86]
axcitrJ902X. citri pv. citriJF90-2 Citrus aurantiifolia Oman, 198615,251,22564.66CCWA01000000Gordon et al., 2015 [86]
axcitrJ238X. citri pv. citriJJ238-24 Citrus aurantiifolia Thailand, 198915,282,62264.78CCVX01000000Gordon et al., 2015 [86]
axcitrLC80X. citri pv. citriLC80Citrus reticulata x C. sinensisMali, 200615,229,12764.7CCWJ01000000Gordon et al., 2015 [86]
axcitrLE20X. citri pv. citriLE20-1 Citrus aurantiifolia Ethiopia, 200815,309,24064.71CCWK01000000Gordon et al., 2015 [86]
axcitr9322X. citri pv. citriLMG9322 Citrus aurantiifolia Florida, 198615,194,13164.75CCVY01000000Gordon et al., 2015 [86]
axcitri306X. citri pv. citri306nana15,175,55464.77GCA_000007165.1da Silva et al., 2002 [80]
phafus1815X. citri pv. fuscans (fuscans)CFBP1815Phaseolus spGreece, 197814,957,44664.82GCA_900234415this study
phafus4834X. citri pv. fuscans (fuscans)4834-RPhaseolus vulgaris cv. MicheletFrance, 199814,981,99564.81FO681494Darrasse et al., 2013 [26]
phafus6166X. citri pv. fuscans (fuscans)CFBP6166 Phaseolus vulgaris South Africa, 196314,987,58764.8GCA_900234455this study
phafus6960X. citri pv. fuscans (fuscans)CFBP6960 Phaseolus vulgaris Reunion island, France, 200014,992,13164.79GCA_900234515this study
phafus6970X. citri pv. fuscans (fuscans)CFBP6970Phaseolus spUSA, 199015,006,65664.9GCA_900234505this study
pha2GL6988X. citri pv. fuscans (NF2)CFBP6988Phaseolus vulgaris cv. marlaReunion island, France, 200015,132,43364.64Depositedthis study
pha2GL6990X. citri pv. fuscans (NF2)CFBP6990Phaseolus vulgaris cv. marlaReunion island, France, 200015,124,65364.63GCA_900234485this study
pha2GL6991X. citri pv. fuscans (NF2)CFBP6991Phaseolus vulgaris cv. marlaReunion island, France, 200015,342,56564.35GCA_900234525this study
pha3GL6992X. citri pv. fuscans (NF3)CFBP6992Phaseolus vulgaris cv. marlaReunion island, France, 200015,517,99964.48GCA_900234495this study
pha3GL6994X. citri pv. fuscans (NF3)CFBP6994 Phaseolus vulgaris Tanzania, 199015,250,26664.57GCA_900234565this study
pha3GL6996X. citri pv. fuscans (NF3)CFBP6996Phaseolus vulgaris cv. marlaReunion island, France, 200045,095,42064.72AVET00000000this study
phafus7766X. citri pv. fuscans (fuscans)CFBP7766 Phaseolus vulgaris Cameroon, 200915,185,89164.63GCA_900234475this study
phafus7767X. citri pv. fuscans (fuscans)CFBP7767 Phaseolus vulgaris Cameroon, 200915,107,38864.7GCA_900234465this study
axglyc2526X. citri pv. glycinesCFBP2526 Glycine hispida Sudan, 195645,255,15264.63GCA_000495275.1Darrasse et al., 2013 [26]
axglyc7119X. citri pv. glycinesCFBP7119 Glycine max Brazil, 198145,521,78364.39GCA_000488895.1Darrasse et al., 2013 [26]
axmalv1386X. citri pv. malvacearumGSPB1386nana1274,991,41164.7GCA_000309905.1Hainan University
axmalv2388X. citri pv. malvacearumGSPB2388nana615,127,01664.53GCA_000309925.1Hainan University
axmalvaX18X. citri pv. malvacearumX18Gossypium spp.Burkina Faso, 1980s44,993,69264.7ATMA00000000Cunnac et al., 2013 [87]
axmalvaX20X. citri pv. malvacearumX20Gossypium spp.Burkina Faso, 1980s45,219,64864.49ATMB00000000Cunnac et al., 2013 [87]
axmang5610X. citri pv. mangiferaeindicaeLG56-10 Mangifera indica na45,281,95264.56PEBY00000000this study
axmang8127X. citri pv. mangiferaeindicaeLG81-27 Mangifera indica na65,209,59464.68PEBZ00000000this study
axmangL941X. citri pv. mangiferaeindicaeLMG941 Mangifera indica India, 19571955,144,32364.84CAHO01000001Midha et al., 2012 [88]
axalfa3836X. euvesicatoria pv. alfalfaeCFBP3836 Medicago sativa Sudan, na65,081,43864.74GCA_000488955.1Jacques et al., 2013 [3, 22]
axalli6369X. euvesicatoria pv. alliiCFBP6369 Allium cepa Reunion island, France, 199635,427,48864.36GCA_000730305.1Gagnevin et al., 2014 [89]
axmeloniF1X. euvesicatoria pv. citrumelonisF1Citrus sp.FL, USA, 198414,967,46964.91GCA_000225915.1Jalan et al., 2011 [90]
axeuve8510X. euvesicatoria pv. euvesicatoria85-10 Capsicum annuum FL, USA, 198515,178,46664.74GCA_000009165.1Thieme et al., 2005 [91]
perfor9118X. euvesicatoria pv. perforans91-118 Lycopersicon esculentum FL, USA, na2915,296,24165.04AEQW00000000Potnis et al., 2011 [92]
gardne1965 X. gardneri ATCC19865 Lycopersicon esculentum Yugoslavia, 19535525,594,68763.68AEQX00000000Potnis et al., 2011 [92]
carotaM081X. hortorum pv. carotaeM081 Daucus carota OR, USA, na1545,039,26963.83GCA_000505565.1Kimbrel et al., 2011 [93]
oryzaeKA10X. oryzae pv. oryzaeKACC10331 Oryza sativa Korea, na14,941,43963.69GCA_000007385.1Lee et al., 2005 [94]
oryzaeMA31X. oryzae pv. oryzaeMAFF 311018 Oryza sativa Japan, na14,940,21763.7GCA_000010025.1unpublished
oryzaePX99X. oryzae pv. oryzaePXO99A Oryza sativa Philippines, na15,240,07563.63GCA_000019585.1Salzberg et al., 2008 [95]
oryzicBA15X. oryzae pv. oryzicolaBAI15 Oryza sativa Burkina Faso, 200914,315,32764.16GCA_002189395.1this study
oryzicBA20X. oryzae pv. oryzicolaBAI20 Oryza sativa Burkina Faso, 200914,498,64364.05GCA_002189435.1this study
oryzicBA21X. oryzae pv. oryzicolaBAI21 Oryza sativa Burkina Faso, 200914,419,87064.07GCA_002189465.1this study
oryzicB256X. oryzae pv. oryzicolaBLS256 Oryza sativa Philippines, 198414,831,73964.05AAQN01000001Bogdanove et al., 2011 [83]
axdieff695X. phaseoli pv. dieffenbachiaeLMG695 Anthurium andreanum Brazil, 196515,037,35764.88GCA_001564415.1Robène et al., 2016 [96]
axmaniC151X. phaseoli pv. manihotisCIO151Manihot sp.Colombia, na365,154,53264.7GCA_000265845.1Rodriguez et al., 2012 [97]
pha1GLC412X. phaseoli pv. phaseoli (NF1)CFBP412 Phaseolus vulgaris USA, na15,028,35164.94GCA_900234435this study
pha1GL6164X. phaseoli pv. phaseoli (NF1)CFBP6164 Phaseolus vulgaris Romania, 199615,319,06164.64GCA_900234535this study
pha1GL6546X. phaseoli pv. phaseoli (NF1)CFBP6546 Phaseolus vulgaris USA, na44,994,66564.92Depositedthis study
pha1GL6984X. phaseoli pv. phaseoli (NF1)CFBP6984 Phaseolus vulgaris Reunion island, France, 200015,106,66364.82GCA_900234425this study
pha1GL7430X. phaseoli pv. phaseoli (NF1)CFBP7430Phaseolus vulgaris cv. CFO16Iran, 200615,085,97864.89GCA_900234445this study
axsyng9055X. phaseoli pv. syngonii (NF1)LMG9055 Syngonium podophyllum na65,010,90564.82GCA_001640215.1Robène et al., 2016 [96]
saccha4393 X. sacchari NCPPB4393Musa sp.Tanzania, 20074704,955,09969.03AGDB00000000Studholme et al., 2011 [98]
cereal2541X. translucens pv. cerealisCFBP2541 Bromus inermis USA, 1941104,524,87067.37GCA_000807145.1Pesce et al., 2015a [99]
trgramXt29X. translucens pv. graminisART-Xtg29 Lolium multiflorum Switzerland, na7884,203,85568.58ANGG00000000Wichman et al., 2013 [101]
gramin2053X. translucens pv. graminisCFPB2053 Dactylis glomerata Switzerland, 197324,344,93668.33LHSI00000000Pesce et al., 2015b [100]
trtran1874X. translucens pv. translucensDSM18974 Hordeum vulgare USA, 19585514,550,92167.67GCA_000331775.1unpublished
vasculN702X. vasicola pv. vasculorumNCPPB702 Saccharum officinarum Zimbabwe, 1959975,491,45756.86ACHS00000000Studholme et al., 2010 [82]
vesica3537 X. vesicatoria ATCC35937 Lycopersicon esculentum New Zealand, 19552965,567,56164.06AEQV01000000Potnis et al., 2011 [92]
xylefaTem1Xylella fastidiosa subsp. fastidiosaTemecula1 Vitis vinifera USA, na12,519,80251.77GCA_000007245.1van Sluys et al., 2003 [102]
xylefa9a5cXylella fastidiosa subsp. pauca9a5cCitrus sinensis cv. ValenciaBrazil, 199212,679,30652.67GCA_000006725.1Simpson et al., 2000 [19]

na not available

Fig. 1

Phylogeny of Xanthomonas strains used in this study. Phylogeny of Xanthomonas strains used in this study with indication of gene gains and losses. The phylogenetic tree is based on whole genome analysis using CVTree [66] with default parameters. Strain aliases are described in Table 1. Stenotrophomonas and Xylella genomes have been used as outgroups. Xanthomonas main phylogenetic groups 1 and 2 [24] and the X. axonopodis species complex [9, 10] are indicated by arrows. Groups 9.2, 9.4, 9.5 and 9.6 [11] are indicated in brackets. Fuscans, NF2, NF3 and NF1 refer to the four genetic lineages of strains responsible for CBB. A parsimony approach was performed to infer gene gains (blue) and losses (red) at levels higher than the pathovar rank, and numbers are displayed at each branch. Red stars highlight cases where gene loss was greater that gene gain. Curved arrows represent horizontal gene transfers (HGT) retrieved by Ks analysis on alignments of 115 candidate genes for bean specificity, with HGT from X. citri pv. fuscans to X. phaseoli pv. phaseoli in green, HGT from X. phaseoli pv. phaseoli to X. citri pv. fuscans in purple, and HGT between X. citri pv. fuscans lineages in red. Numbers in circles correspond to the numbers of candidate genes involved for each HGT. Question marks indicate events for which the origin or end of the HGT was not precise enough to assign any particular lineage

Informations on the sequenced strains used in this study na not available Phylogeny of Xanthomonas strains used in this study. Phylogeny of Xanthomonas strains used in this study with indication of gene gains and losses. The phylogenetic tree is based on whole genome analysis using CVTree [66] with default parameters. Strain aliases are described in Table 1. Stenotrophomonas and Xylella genomes have been used as outgroups. Xanthomonas main phylogenetic groups 1 and 2 [24] and the X. axonopodis species complex [9, 10] are indicated by arrows. Groups 9.2, 9.4, 9.5 and 9.6 [11] are indicated in brackets. Fuscans, NF2, NF3 and NF1 refer to the four genetic lineages of strains responsible for CBB. A parsimony approach was performed to infer gene gains (blue) and losses (red) at levels higher than the pathovar rank, and numbers are displayed at each branch. Red stars highlight cases where gene loss was greater that gene gain. Curved arrows represent horizontal gene transfers (HGT) retrieved by Ks analysis on alignments of 115 candidate genes for bean specificity, with HGT from X. citri pv. fuscans to X. phaseoli pv. phaseoli in green, HGT from X. phaseoli pv. phaseoli to X. citri pv. fuscans in purple, and HGT between X. citri pv. fuscans lineages in red. Numbers in circles correspond to the numbers of candidate genes involved for each HGT. Question marks indicate events for which the origin or end of the HGT was not precise enough to assign any particular lineage

Genome expansion occurred during the evolution of Xanthomonas

To identify the genes shared by different clades of Xanthomonas, we constructed an orthology matrix using OrthoMCL (Additional file 2). Based on this orthology matrix, we performed a parsimony approach to infer gene gains and losses at each branch of the phylogenetic tree (Fig. 1). We did not take into account events occurring on the most distal branches to reduce the bias due to the difference of quality between sequenced genomes. At every branch, one to several hundreds of genes were either gained or lost. A general observation was that gene gains were higher than gene losses, suggesting that genome expansion occurred during the evolution of Xanthomonas (Fig. 1). Only four cases of genome reduction were observed (i) at the origin of the Xylella genus, (ii and iii) along two consecutive branches before and after the split between the X. oryzae species and the X. vasculorum and X. musacearum species, and (iv) at the origin of X. citri pv. malvacearum. The largest gene gain (418) was observed at the origin of Xanthomonas phylogenetic group 2 as defined by Young et al. [24], while the largest gene loss (819) was observed at the origin of the Xylella genus. Few gene losses (9 to 32) were observed before the diversification of each of the four genetic lineages involved in CBB. Of those, the NF1 lineage was the one which gained the most genes (271) followed by the NF2 (225), NF3 (108) and fuscans (83) lineages, respectively.

The pan and core genomes of Xanthomonas reveal extensive horizontal gene transfers between strains pathogenic on common bean

Individuals that are closely related to each other typically share more orthologs than unrelated individuals. Therefore, groups of closely related individuals tend to have a smaller pan genome and a larger core genome than groups of more divergent individuals. As such, the pan and core genomes for the 72 Xanthomonas strains comprised 32,602 and 1144 CDS, respectively, while the pan and core genomes for the 75 strains including the outgroups comprised 34,723 and 816 CDS, respectively (Fig. 2). Similarly, each Rademaker group alone, i.e. 9.2, 9.4, 9.5 and 9.6, had a smaller pan genome (6578, 8222, 9387 and 9437 CDS, respectively) and a larger core genome (3493, 2949, 3056 and 3213 CDS, respectively) than the X. axonopodis species complex, which had pan and core genomes of 19,010 and 2297 CDS, respectively. Strikingly, when grouping strains responsible for CBB belonging to X. citri pv. fuscans and X. phaseoli pv. phaseoli, both the pan and core genomes (10,750 and 3222 CDS, respectively) were larger than the pan and core genomes from groups 9.4 (8222 and 2949 CDS, respectively) or 9.6 (9437 and 3213 CDS, respectively) (Fig. 2). Thus, strains responsible for CBB, although being phylogenetically diverse, had more genes in common than they had with other strains belonging to their respective clades, which was suggestive of extensive HGT among these strains. This result was reminiscent of previous comparative analyses showing that dozens of genes have been horizontally transferred between the fuscans and NF1 lineages [17].
Fig. 2

The core- and pan-genome of Xanthomonas. Gene numbers correspond to the number of ortholog groups retrieved for each group of strains. All: all strains used in this study (n = 75); Xantho: strains from the Xanthomonas genus (n = 72); X.axo: strains from the Xanthomonas axonopodis species complex (n = 44). 9.2, 9.4, 9.5, 9.6: strains belonging to rep-PCR groups 9.2 (n = 5), 9.4 (n = 8), 9.5 (n = 16) and 9.6 (n = 15), respectively, as defined in Rademaker et al. [103]. Xcf-Xpp: strains pathogenic on common bean belonging to Xanthomonas citri pv. fuscans or Xanthomonas phaseoli pv. phaseoli (n = 18)

The core- and pan-genome of Xanthomonas. Gene numbers correspond to the number of ortholog groups retrieved for each group of strains. All: all strains used in this study (n = 75); Xantho: strains from the Xanthomonas genus (n = 72); X.axo: strains from the Xanthomonas axonopodis species complex (n = 44). 9.2, 9.4, 9.5, 9.6: strains belonging to rep-PCR groups 9.2 (n = 5), 9.4 (n = 8), 9.5 (n = 16) and 9.6 (n = 15), respectively, as defined in Rademaker et al. [103]. Xcf-Xpp: strains pathogenic on common bean belonging to Xanthomonas citri pv. fuscans or Xanthomonas phaseoli pv. phaseoli (n = 18)

Strains pathogenic on common bean share 115 CDS presenting unique characteristics

To search for genes potentially involved in the convergence between X. phaseoli pv. phaseoli (i.e. the NF1 lineage) and X. citri pv. fuscans (i.e. the NF2, NF3 and fuscans lineages) to infect common bean, we performed a combination of four different analyses. First, within the OrthoMCL matrix, we searched for CDS specifically present in the genomes of CBB agents and absent from any other Xanthomonas genome, or present in the genomes from all Xanthomonas but not in the genomes of CBB agents. No CDS was retrieved by this analysis. We also searched for CDS specifically present or absent when grouping the NF1 lineage to each of the NF2, NF3, or fuscans lineages. Only one CDS was specifically retrieved in the NF1 and fuscans lineages. Second, we used the results from the CDS gains and losses approach described above to search for genes shared by all strains from X. phaseoli pv. phaseoli and X. citri pv. fuscans, and gained in the ancestor of one pathovar or the other. This approach unveiled nine CDS shared by all strains responsible for CBB, and gained in either X. phaseoli pv. phaseoli or X. citri pv. fuscans (Table 2). We also searched for CDS shared by the NF1 and each of the NF2, NF3, or fuscans lineage and gained in at least one of these lineages. Four CDS were shared by the NF1 and NF2 lineages, or the NF1 and NF3 lineages, while three CDS were shared by the NF1 and fuscans lineages.
Table 2

Numbers of CDS presenting similarities among the lineages of CBB agents

Lineages studiedPresence/absenceaGainedbMonophyleticc24-mersd
NF1/NF2/NF3/fuscans0928108
NF1/NF204933
NF1/NF304528
NF1/fuscans13105231
Total120147400

aCDS specifically present or absent in all the lineages studied compared to all other X. axonopodis strains

bCDS present in all the lineages studied and gained in a least one of these lineages

cCDS monophyletic for the lineages studied

dCDS containing 24-mers specifically present or absent in all the lineages studied compared to all other X.axonopodis strains

Numbers of CDS presenting similarities among the lineages of CBB agents aCDS specifically present or absent in all the lineages studied compared to all other X. axonopodis strains bCDS present in all the lineages studied and gained in a least one of these lineages cCDS monophyletic for the lineages studied dCDS containing 24-mers specifically present or absent in all the lineages studied compared to all other X.axonopodis strains Third, we used a phylogenetic approach to search for genes for which strains from X. citri pv. fuscans and X. phaseoli pv. phaseoli formed a monophyletic group. For this, we constructed phylogenetic trees on 3202 CDS present in every X. citri pv. fuscans and X. phaseoli pv. phaseoli strain and in at least one additional strain from group 9.2 or 9.4 and one other strain from group 9.5 or 9.6. The additional strains from groups 9.2, 9.4, 9.5 and 9.6 were located inbetween X. citri pv. fuscans and X. phaseoli pv. phaseoli (Fig. 1). Thus, CDS found as monophyletic for CBB strains could be potential traces of HGT between both pathovars. This approach unveiled 28 CDS for which the four genetic lineages formed a monophyletic group, suggesting that they were horizontally transferred among these lineages (Table 2). Nine CDS were specifically monophyletic for the NF1 and NF2 lineages, five for the NF1 and NF3 lineages, and 105 for the NF1 and fuscans lineages, suggesting that most horizontal transfers occurred among the NF1 and fuscans lineages. Finally, we used the SkIf tool [25] on the 72 Xanthomonas genomes to search for genes containing short 24-bp sequences (24-mers) specific to strains responsible for CBB, or alternatively genes from strains responsible for CBB lacking 24-mers present in all other strains from the X. axonopodis species complex. In all, we identified 108 CDS containing 24-mers either specifically present or absent from the four lineages (Table 2). Moreover, 33 CDS contained 24-mers specific for the NF1 and NF2 lineages, 28 for the NF1 and NF3 lineages and 231 for the NF1 and fuscans lineages. Similarly to the analysis based on phylogeny, this analysis based on k-mers pointed an overrepresentation of CDS with specific 24-mers shared by the NF1 and fuscans lineages compared to NF2 and NF3 lineages. Together, these four analyses unveiled respectively 0, 9, 28, or 109 CDS presenting features unique to CBB agents. The analysis based on presence/absence seemed to be too stringent for unveiling any CDS, while the analysis based on k-mers was the most sensitive, suggesting that SkIf was an appropriate tool for finding common traits shared by phylogenetically distant strains. Most of these CDS were found redundantly by two or more analyses, for a total of 115 non-redundant CDS (Table 3). The most represented functions encoded by these 115 predicted CDS were hypothetical proteins (26 CDS), followed by membrane-related proteins (10 CDS), two-component system proteins (six CDS), putative secreted proteins (five CDS), reductases (five CDS), RNA-related proteins (five CDS), Type III secretion system-related proteins (five CDS), TonB-dependent proteins (four CDS), Type IV secretion system-related proteins (three CDS), Type VI secretion system-related proteins (three CDS), DNA-related proteins (three CDS) and transcription regulators (three CDS) (Table 3).
Table 3

Overview of the 115 genes putatively involved in bean specificity

IdentifieraAccession numberbPredicted functionGeneGainedMonophyletic24-mersRecombinantHGTAtypical GC%Nonsynonymous sitesAritua et al. [17]c
1m00100560TonB-dependent transporterYesYes
2m00100580type III effector avrBs2 YesYesYes
3m00100590hypothetical proteinYesYesYes
4m00101230two compoment system sensor proteinYes
5m00101980two component system protein glnG Yes
6m00102200two component system response regulatorYesYesYesYes
7m00104250type III secretion system protein hrpF YesYesYes
8m00104520type III effector xopA YesYesYes
9m00104530lytic transglycosylase-like protein hpaH YesYesYesYes
10m00104540hypothetical proteinYesYesYes
11m00104640diguanylate cyclaseYesYesYes
12m001046904-alpha-glucaNotransferase malQ YesYes
13m00105290threonine aldolaseYesYes
14m00105330flavin reductaseYesYesYesYes
15m00105340anthranilate phosphoribosyl transferase trpD YesYes
16m00105390S-adenosylmethionine decarboxylase speD YesYesYes
17m00105750hypothetical membrane proteinYes
18m00107500TonB-dependent transporterYesYesYes
19m00108560general secretion pathway protein D xpsD YesYesYesYesYes
20m00108570hypothetical proteinYesYesYes
21m00108600TonB-dependent transporterYesYesYesYes
22m00109610TonB-dependent transporterYesYesYes
23m00109620phosphoanhydride phosphohydrolase appA YesYesYesYesYes
24m00109670hypothetical proteinYesYes
25m00109990bis(5′nucleosyl)-tetraphosphatase apaH Yes
26m00110970peptidyl-tRNA hydrolaseYesYesYesYes
27m0011116050S ribosomal protein rplW YesYes
28m00112070two-component system sensor protein phoR Yes
29m00113470xanthine dehydrogenase subunitYesYesYesYesYes
30m00113490xanthine dehydrogenase subunitYesYes
31m00114150membrane proteinYesYesYes
32m00114680two component system sensor proteinYesYesYes
33m00116790NAD/FAD binding proteinYesYes
34m00117590acetyltransferaseYesYes
35m00117600transcriptional regulatorYesYesYesYesYes
36m00117610hypothetical proteinYesYesYes
37m00118890hypothetical proteinYesYes
38m00119700putative secreted proteinYes
39m00119970phosphomethylpyrimidine kinase thiD YesYesYes
40m00121810threonine synthase thrC YesYesYesYes
41m00121850histidyl tRNA synthetase hisS YesYes
42m00121900histidinol phosphatase hisB YesYesYes
43m00123210recombination factor protein rarA YesYes
44m00126530ethanolamine permease eutP YesYesYes
45m00126540ethanolamine ammona-lyase eutB YesYes
46m00127140membrane proteinYesYes
47m00127150cGMP specific phosphodiesteraseYesYesYesYes
48m00127220hypothetical proteinYesYes
49m00127250ubiquinol-cytochrome c reductase subunitYesYes
50m00127260ubiquinol-cytochrome c reductase subunitYesYes
51m001278501-phosphofructokinase fruK YesYes
52m00127860PTS fructose porter IIBC component fruA YesYes
53m00130650RNA polymerase ECF-type sigma factor rpoE2 YesYesYes
54m00130660hypothetical proteinYesYesYesYes
55m00130670RNA-binding proteinYesYesYes
56m00130690inner membrane proteinYesYes
57m00132430DNA topoisomerase IAYesYes
58m00135840membrane fusion proteinYes
59m00135870sulfite reductase (NADPH) subunit cysJ YesYesYes
60m00135880sulfite reductase (NADPH) subunit cysI YesYes
61m00135970transcriptional regulator protein cysB YesYes
62m00135980siroheme synthase cysG YesYesYes
63m00136050phosphoglycerate kinase pgk YesYes
64m00136550hypothetical proteinYes
65m00138140two-compoment system sensor proteinYesYes
66m00139330dihydrolipoamide acetyltransferase aceF YesYesYes
67m00139350DNA glycosylaseYesYesYes
68m00139420membrane proteinYesYesYesYes
69m00139860putative secreted proteinYes
70m00139930flavodoxin proteinYesYes
71m00140860peptidoglycan binding protein lysM YesYes
72m00140880hypothetical proteinYes
73m00141310PhnB like proteinYesYesYes
74m00141340hypothetical proteinYesYes
75m00141400hypothetical proteinYesYes
76m00141430transmembrane proteinYesYesYes
77m00141440hypothetical proteinYesYesYes
78m00141490dipeptide epimeraseYesYes
79m00142140putative glucosyltransferaseYesYesYes
80m00144120type VI secretion system protein icmF YesYesYesYes
81m00144240type VI secretion system kinase protein tagE YesYesYes
82m00144270transmembrane sensor protein fecR YesYes
83m00144420type VI secretion system virulence protein impE YesYesYes
84m00145330type III effector xopAD YesYesYesYes
85m00146940membrane proteinYesYesYes
86m00146970LysR family transcriptional regulatorYesYes
87m00146980short chain dehydrogenaseYesYes
88m00146990RNA 2′-phosphotransferase family proteinYesYesYes
89m00147020hypothetical proteinYesYesYes
90m00147060esterase family proteinYesYes
91m00147090putative secreted proteinYesYes
92m00147100putative secreted proteinYesYesYes
93m00147120ribosomal pseudouridine synthaseYesYesYesYes
94m00147130hypothetical proteinYesYesYes
95m00147140putative secreted proteinYesYesYes
96m00200020plasmid partitioning proteinYesYesnaYes
97m00200060hypothetical proteinYesYesnaYes
98m00200080hypothetical proteinYesYesnaYes
99m00200130hypothetical proteinYesYesYes
100m00200150hypothetical proteinYesYesnaYes
101m00200160hypothetical proteinYesnaYesYes
102m00200170hypothetical proteinYesnaYes
103m00200190hypothetical proteinYesnaYes
104m00200260hypothetical proteinYesYesYes
105m00200790transposase tnpA YesYesYes
106m00200820resolvase tnpR YesYes
107m00200880DNA topoisomerase IAYesnaYes
108m00200980type IV secretion system protein trbI YesYesYesnaYes
109m00201120membrane proteinYes
110m00201220type IV conjugal transfer protein traG YesnaYes
111m00201230hypothetical proteinYesnaYes
112m00201250lytic transglycolaseYesYesYesnaYes
113m00201260type IV conjugal transfer protein traF YesYesnaYes
114m00201270hypothetical proteinYesYesYesnaYes
115m00201290hypothetical proteinYesYesYesnaYes
Total115 92810818882449

– not found in corresponding analysis

na not applicable due to the absence of corresponding CDS in outgroup

aidentifiers correponding to the numbers shown in Fig. 3

baccession numbers for strain CFBP6456

cgenes with 100% identity over 95% of their length according to Aritua et al. [17]

Overview of the 115 genes putatively involved in bean specificity – not found in corresponding analysis na not applicable due to the absence of corresponding CDS in outgroup aidentifiers correponding to the numbers shown in Fig. 3
Fig. 3

Mapping of the 24-mers specific for strains pathogenic on common bean. The innermost rings represent the reference chromosome or plasmids with associated coordinates. Colored lines represent 24-mers specifically retrieved in X. citri pv. fuscans and X. phaseoli pv. phaseoli strains (purple), or in the NF1 plus fuscans lineages (green), or in the NF1 plus NF2 lineages (blue), or in the NF1 plus NF3 lineages (black). Red numbers correspond to the identifiers of the 115 genes listed in Table 3

baccession numbers for strain CFBP6456 cgenes with 100% identity over 95% of their length according to Aritua et al. [17] We hypothesized that the CDS potentially involved in the specific adaptation to common bean should bear nonsynonymous polymorphisms specific to Xanthomonas strains pathogenic on common bean. Analysis of the alignments for the 115 candidate CDS highlighted 44 CDS with nonsynonymous sites retrieved exclusively in X. citri pv. fuscans and X. phaseoli pv. phaseoli (Table 3). More than one third of these CDS (16/44) encoded hypothetical proteins. Among the other CDS, three encoded type IV secretion system proteins TrbI TraG and TraF, two encoded putative secreted proteins, two encoded type III secretion system proteins XopA and XopAD, two encoded DNA topoisomerases, and others encoded proteins of various functions (Table 3).

Specificity to common bean is associated with successive waves of horizontal gene transfers

Strain CFBP6546 from the NF1 lineage was used as reference for further analyses. Its genome contained one chromosome and three extrachromosomal plasmids formerly described as plasmid a, plasmid b and plasmid c in strain 4834-R [26]. Most candidate genes (95/115) were located on the chromosome, while 20 were located on the plasmid a (Fig. 3). This corresponds to a density of one candidate gene per 50.9 kbp in the chromosome, and one per 3.5 kbp for the plasmid a, while none were retrieved in plasmids b or c. Interestingly, all the CDS found in plasmid a contained specific nonsynonymous sites (Table 3). Thus, plasmid a appeared as an important vector of genes involved in the adaptation to common bean. Another observation was that the chromosome contained regions with various 24-mers shared by the NF1 lineage and any of the fuscans, NF2 or NF3 lineages (in green, blue or black in Fig. 3, respectively). This suggests that the regions shared by the NF1 and the other lineages diverged since the split between the NF2, NF3 and fuscans lineages. By contrast to what was observed for the chromosome, all specific 24-mers found in plasmid a were simultaneously shared by the four genetic lineages of strains responsible for CBB (in purple in Fig. 3), indicating that these regions have been shared between the four lineages recently enough to still have 100% identity between each other. Together, these results suggest that 24-mers retrieved in the chromosome correspond to more ancient HGT events than those retrieved in plasmid a. Mapping of the 24-mers specific for strains pathogenic on common bean. The innermost rings represent the reference chromosome or plasmids with associated coordinates. Colored lines represent 24-mers specifically retrieved in X. citri pv. fuscans and X. phaseoli pv. phaseoli strains (purple), or in the NF1 plus fuscans lineages (green), or in the NF1 plus NF2 lineages (blue), or in the NF1 plus NF3 lineages (black). Red numbers correspond to the identifiers of the 115 genes listed in Table 3 Nucleotide synonymous substitution rates at silent sites (Ks) is an estimation of neutral evolution because it does not take into account the nonsynonymous sites that can be under selection pressure. Therefore, Ks can be used as an approximation of the time of divergence between genes or taxa, with higher Ks value meaning longer time of divergence between two sequences [27, 28]. For each of the 115 candidate genes found in CBB agents, we performed multiple alignments. We could not perform Ks analysis on 15 genes that were lacking outgroups (Table 3), therefore we tested only 100 out of 115 genes. Among these 100 genes, 18 were recombinants according to RDP software analysis (Table 3). For these 18 recombinants, Ks values were independently calculated on both sides of the breakpoints. We calculated pairwise Ks values for different combinations of strains including X. citri pv. fuscans, X. phaseoli pv. phaseoli and closely related strains including X. citri pv. anacardii, X. citri pv. aurantifolii, X. phaseoli pv. syngonii, X. phaseoli pv. dieffienbachiae and X. phaseoli pv. manihotis (Fig. 1, Additional file 3). We then used these Ks as relative time divergence estimations to infer if a HGT occurred between NF1, NF2, NF3 and/or fuscans lineages, as well as the direction of this HGT. For example, for gene m00100580b the mean Ks value between strains from the NF1 and fuscans lineages was 7.40e-03 +/− 2.85e-10. This value was lower than the Ks values when comparing the fuscans lineage to its closest relatives from the NF2 or NF3 lineages (Ks = 4.58e-02 +/− 1.32e-09 or 4.08e-02 +/− 1.54e-09, respectively), or when comparing the NF1 lineage to its closest relative X. phaseoli pv. manihotis (Ks = 5.33e-01 +/− 0.00). These results indicate that m00100580b was more similar between the NF1 and fuscans lineages than between these lineages and their closest relatives, meaning that m00100580b was horizontally transferred between the ancestors of the NF1 and fuscans lineages. Moreover, the Ks value between the NF1 lineage and the NF2 or NF3 lineages (Ks = 4.36e-02 +/− 1.04e-09 or 4.01e-02 +/− 0.00, respectively) was lower than between the NF1 lineage and X. phaseoli pv. manihotis (Ks = 5.33e-01 +/− 0.00). Therefore, m00100580b was closer between NF1 strains and other strains from group 9.6 than from it’s closest relatives, meaning that the horizontal transfer was directed from the fuscans lineage to the NF1 lineage. This analysis confirmed HGT for 88 out of 100 genes tested (Fig. 1, Table 3, Additional file 3). The vast majority of HGT was directed from X. citri pv. fuscans to X. phaseoli pv. phaseoli, while only four HGT occurred from X. phaseoli pv. phaseoli to X. citri pv. fuscans. In particular, 55 HGT events were detected from the fuscans lineage to the NF1 lineage. In addition to having been transferred between distant lineages, 16 and 1 genes were also transferred between the fuscans and the NF2 lineages, or the fuscans and the NF3 lineages, respectively (Fig. 1). Moreover, eight genes had Ks = 0.00 +/− 0.00 between the NF2 and NF3 lineages, and nine between the NF2, NF3 and fuscans lineages, suggesting that HGT events also occurred between these lineages (Fig. 1, Additional file 3). Together, our results show that several more or less important waves of HGT occurred between the ancestors of phylogenetically distant strains responsible for CBB. Finally, GC content is often used as a mean to detect HGT from foreign origin [29]. Out of the 115 candidate CDS, only two CDS (m00105390 and m00200160) presented an atypical GC content (α < 0.05) within the genome of strain CFBP6546 (Table 3). This result was not unexpected, as all strains from the NF1, NF2, NF3 and fuscans lineages have a similar GC content around 64% (Table 1), therefore HGT between these strains was not expected to result in a shift of GC content.

Discussion

We performed a comparative genomics analysis to detect genes putatively involved in Xanthomonas specificity to common bean. For this, we generated the whole genome sequence from 17 strains representing the diversity of the four genetic lineages belonging to X. citri pv. fuscans and X. phaseoli pv. phaseoli. We used a combination of comparative genomics approaches that led to the discovery of 115 genes bearing features unique to CBB agents. Out of these 115 genes, 108 were retrieved using the SkIf tool based on specific 24-mer search [25]. Previous analyses based on identity percentage unveiled 63 genes sharing 100% identity over at least 95% of their length among strains from the NF1 and fuscans linages [17]. Only nine of these genes were retrieved within our list of 115 genes (Table 3). This difference can be explained by the fact that we discarded most of the genes shared only by the NF1 and fuscans lineages and retained genes similar in all four genetic lineages of CBB agents (Table 2). On the other hand, we unveiled numerous genes that did not share 100% identity among the NF1 and fuscans lineages for their whole length, but instead shared small specific sequences of 24 nucleotides or more. Whether these similarities lie within functionally important domains of the encoded proteins remains to be studied. Ks comparisons, showed that a majority of these genes were involved in HGT between X. citri pv. fuscans and X. phaseoli pv. phaseoli. Therefore, HGT was the predominant force leading to similarities between the genomes of X. citri pv. fuscans and X. phaseoli pv. phaseoli. Finding HGT events within these genes validated our approach. In particular, SkIf was an interesting tool because in addition to being more sensitive than gene gain and loss or phylogenetic approaches, it was not based on gene alignments, thus less sensitive to annotation and/or sequencing biases. HGT events occurred at different moments of the evolution of Xanthomonas strains having common bean as host. The vast majority of these HGT were directed from X. citri pv. fuscans to X. phaseoli pv. phaseoli. This strongly suggest that X. citri pv. fuscans was originally pathogenic on common bean, and that X. phaseoli pv. phaseoli subsequently acquired the ability to cause CBB on bean due to successive acquisitions of novel genes and/or novel alleles coming from the three X. citri pv. fuscans lineages. This can be compared to our knowledge on the origin of the lineages and their genetic diversity. The causal agent of CBB was first isolated and identified by Smith in 1897 [30] as a yellow pigmented strain, later shown as belonging to the NF1 lineage. Burkholder later isolated the first fuscous strains from beans grown in Switzerland in 1924 [31]. However there are no data to document a putative pre-existence of one, the other, or both types of strains prior to their first identifications. The genetic diversity of the yellow and fuscous strains was revealed by various methods. Amplified or restriction fragment length polymorphism analyses [16, 32–34], amplified polymorphic DNA fragments [32, 35], pulse field gel electrophoresis [33], and multilocus sequence analysis [23] all revealed that both types of strains are more or less equivalent in terms of genetic diversity. This suggests that diversification of both lineages occurred around the same time, and thus that the ancestors of these two lineages may have coexisted. As a consequence, X. citri pv. fuscans may be the descendant of the original CBB agent that had transferred determinants useful for adaptation on common bean to the ancestor of X. phaseoli pv. phaseoli. Therefore, the ancestor of X. phaseoli pv. phaseoli appears as a recombinant that emerged as a new common bean pathogen through the acquisition of novel genes and alleles. In quarantine areas such as Europe, Turkey, Barhain, Azerbaijan and Israel, seed lots are routinely tested using a method from the International Seed Testing Association involving isolation of bacterial strains, pathogenicity tests, and specific PCR assays [36, 37]. Our results could serve for improving PCR-based monitoring of CBB agents by designing PCR primers on genes presenting sequences unique to CBB agents and potentially important for common bean specificity. Such primers could potentially detect novel HGT of these genes in strains unrelated to X. phaseoli pv. phaseoli and X. citri pv. fuscans, thus allowing to forecast new threads potentially dangerous for common bean production. Very diverse functions were retrieved among the proteins encoded by the 115 candidate genes. Interestingly, 44 genes contained nonsynonymous polymorphisms specific for strains responsible for CBB, suggesting that they may play an important role in common bean specificity. Although 17 of these 44 genes encoded hypothetical proteins, it appears that most other genes encoded proteins involved in pathogenicity or in the interaction with the plant environment. The type IV secretion system was particularly represented with genes encoding TraF, TraG and TraI, and is involved in the translocation of macromolecules such as proteins important for pathogenicity, or DNA for mediating HGT [38, 39]. Thus, sharing similar proteins of the type IV secretion system could favour HGT among strains responsible for CBB. Indeed, strains from the NF1, NF2, NF3 and fuscans lineages have been found in La Réunion Island in 2000 (Table 1), indicating that sympatry exists among all these lineages, rendering further HGT events possible [18]. Three genes encoding proteins related to the type III secretion system were retrieved. XopA [40] and HpaH [41] are two proteins that may be involved in the structure of the type III secretion system, while XopAD is a type III effector of unknown function consisting of multiple semi-conserved 42 amino acids SKW repeats [42, 43]. The type III secretion system is pivotal for the virulence of most Gram negative plant pathogenic bacteria, and repertoires of type III effectors have been described as potentially important factors for host specificity and host adaptation in Xanthomonas [7, 44, 45] and other genera such as Pseudomonas [46]. Moreover, our analysis pointed out one diguanilate cyclase and one cGMP specific phosphodiesterase, two proteins involved in the metabolism of cyclic di-GMP that may play a role in biofilm formation [47] and pathogenicity [48]. One TonB-dependent transporter was also retrieved. TonB-dependent transporter are outer membrane receptors involved in molecule uptake such as iron siderophore complexes or nutrients and may play a role in host specificity [3, 49]. Other proteins putatively involved in pathogenicity were retrieved, such as ThrC, a threonine synthase involved in the virulence of X. oryzae pv. oryzicola in rice [50], XpsD, an outer membrane protein from the type II secretion system that is putatively involved in the secretion of cell wall degradative enzymes during infection [51], or IcmF, a protein of the type VI secretion system, which is involved in the interaction with other bacteria and may participate in pathogenicity [52, 53]. One flavine reductase and one xanthine dehydrogenase, two proteins putatively involved in oxydoreduction pathways were retrieved, and may be involved in the response to stress during the interaction with common bean. In addition to genes putatively involved in pathogenicity or virulence, our analysis unveiled genes involved in more general metabolism pathways, such as PhnB involved in the biosynthesis of tryptophan [54], or two DNA topoisomerases involved in the relaxation of the supercoiled DNA molecule during transcription, replication or recombination [55]. On one hand, our analysis unveiled only one or few genes within a given function, while the functions retrieved correspond to pathways often involving dozens of genes. This suggested that slight modifications within a given pathway would be sufficient to impact host specificity. On the other hand, the genes retrieved here encompass almost all the stages of host plant colonization by the bacteria, from the ability to mobilize trophic resources for multiplication to the interaction with other microorganisms, biofilm formation, response to oxidative stress, and inhibition of plant defences. Therefore, the ability to infect a particular plant seems to require not just one or a few adaptative determinants but an arsenal of factors allowing a global adaptation to a specific niche including the plant and, as a consequence a fine tuning and coordination of the activity of these determinants. Interestingly, 19 of these 44 candidate genes were retrieved on plasmid a, suggesting that this plasmid played a major role for pathological convergence of CBB agents. Plasmid a carries an additional type III effector gene encoding an effector from the Transcription Activator-Like (TAL) family that was horizontally transferred between the NF1, NF2, NF3 and fuscans lineages [18]. Plasmids are genetic elements that favour HGT, but transfers of whole plasmids often induce a fitness cost for the bacteria [56]. More generally, horizontally transferred genes tend to be lost if not providing selective advantages for recipient strains [57]. Interestingly, the nine candidate genes retrieved by the gain and loss approach were all located on plasmid a (Table 3). The maintenance of these novel genes in the four genetic lineages of CBB agents is a testament to the importance of these genes for the bacteria. Except for two genes encoding proteins involved in the type IV secretion system, the other seven genes encoded proteins of unknown function. It would be interesting to perform functional characterization of these genes, and further analyse their implication in common bean specificity. Analysing the expression patterns during infection would be a natural extension of this study, and a first step towards functional validation of these genes.

Conclusion

Together, our results indicate that consecutive waves of HGT occurred between phylogenetically distant Xanthomonas strains able to cause the same symptoms on the same host plant: common bean. These HGT led to specific combinations of genes only retrieved in strains responsible for CBB, which provided new insights into the evolution of these bacteria towards infecting common bean. Mining for candidate genes for host specificity could be generalized to other polyphyletic pathovars such as pathovars euvesicatoria, vesicatoria, perforans, and gardneri forming a group of strains pathogenic on pepper and tomato [8]. Such analyses could both give new information on the molecular bases of host specificity, and provide new tools for enhancing epidemiological surveillance of strains pathogenic on a given host, or detecting recombinant strains presenting a high potential of emergence through the acquisition of novel genes.

Methods

Bacterial genomes and strains

All strains used in this study are listed in Table 1. The strains used for genome sequencing were provided by the CIRM-CFBP (International Center for Microbial Ressources - French Collection for Plant-associated Bacteria, https://www6.inra.fr/cirm_eng/). Genome sequencing was performed using the following procedure. Genomic DNA was prepared from overnight liquid cultures of bacteria previously grown on 10% TSA medium (tryptone at 1.7 g/L, soybean peptone at 0.3 g/L, glucose at 0.25 g/L, NaCl at 0.5 g/L, K2HPO4 at 0.5 g/L, agar at 15 g/L, pH 7.2) for 2 days at 28 °C. DNA was extracted and purified by the method of Klotz and Zimm [58]. Illumina sequencing was performed by Genoscope (20 strains, paired end reads of 300/500 bp) or GATC Biotech (three strains, with combined paired-end reads of ca. 250 bp and 3 kb mate-pair reads). Genome assembly was performed using a combination of SOAPdenovo (version 2.04) [59], SOAPGapCloser (version 1.12) [60] and Velvet (version 1.2.02) [61] assemblers. Sequenced genomes were estimated to be > 93% complete and < 3% contaminated (Additional file 4) using CheckM (version 1.0) [62]. The pathogenicity of all CBB strains was confirmed on common bean plants from cultivar Flavert as described in Ruh et al. [18]. The seeds from cultivar Flavert were kindly provided by Vilmorin (La Ménitré, France) and are available at the bean collection of the CIAT (Center for Tropical Agriculture, Colombia, http://genebank.ciat.cgiar.org/genebank/main.do).

Annotation and phylogenomic analyses

Structural and functional annotation of whole genome assemblies was performed using the automated pipeline Eugene-PP (version 1.2) [63], using SWISS-PROT as protein database and training protein database (http://www.uniprot.org/). Additional functional annotation of all predicted CDS was performed with InterProScan (version 4) [64]. A presence/absence matrix of ortholog groups was constructed using OrthoMCL (version 2.0) on amino acid sequences from all predicted CDS at an inflation index of 1.5 [65]. This matrix was then used for defining core and pan genomes. Phylogenetic trees were constructed using CVTree (version 4.2) [66] using the aminoacid sequences of all predicted CDS from the 75 genomes used in this study. CDS gains and losses were analysed using the Most Parcimonious Reconstruction function from the APE package (version 3.2) [67] to search for the most parsimonious succession of events explaining the repertoire of ortholog groups at each node of the phylogenetic tree.

Searching for genes monophyletic for X. citri pv. fuscans and X. phaseoli pv. phaseoli strains

A phylogenetic approach was used to search for genes for which strains from X. citri pv. fuscans and X. phaseoli pv. phaseoli form a monophyletic group. For this we selected 3202 CDS using an R script to search the orthology matrix for genes that were present in all X. citri pv. fuscans and X. phaseoli pv. phaseoli strains and in at least one strain from Rep-PCR group 9.2 or 9.4 [11] plus one another strain from Rep-PCR group 9.5 or 9.6, in order to avoid getting trees were X.citri pv. fuscans and X. phaseoli pv. phaseoli appear monophyletic due to a lack of correspondig genes in the strains inbetween. CDS were aligned using MAFFT (version 7) with L-INS-I strategy [68]. Neighbour-joining trees were constructed using APE (version 3.2) under the Kimura 80 model [67]. CDS monophyletic for all X. citri pv. fuscans and X. phaseoli pv. phaseoli strains, or alternatively for the NF1 and another lineage (i.e. NF2, NF3, or fuscans), were retrieved using the APE package (version 3.2) [67].

Searching for genes containing k-mers specific for X. citri pv. fuscans and X. phaseoli pv. phaseoli strains

A k-mer-based approach was used to search for genes containing short specific sequences present in all strains from X. citri pv. fuscans and X. phaseoli pv. phaseoli but absent in other strains. For this, we used SkIf (version 1.0) [25] with a k-mer size of 24 (or 24-mer), and using X. citri pv. fuscans strain 4834-R genome as reference. The same approach was used to search for genes containing 24-mers absent in strains belonging to X. citri pv. fuscans and X. phaseoli pv. phaseoli but conserved in all other strains from the X. axonopodis species complex, using X. citri pv. anacardii strain CFBP2913 genome as reference.

Recombination and HGT analyses

The 115 genes presenting specific traits of adaptation to common bean were aligned using MAFFT (version 7) with L-INS-I strategy [68]. Intragenic recombination events were then searched using a suite of programs implemented in RDP (version 4.16) [69], RDP [70], Geneconv [71], MaxChi [72], Chimaera [73], Bootscan [74] and 3seq [75]. Default parameters were used for each method except for Bootscan (window = 150, step = 20, neighbor joining trees, 200 replicates, 95% cut-off, J&N model with Ti:Tv = 2, coefficient of variation = 2). Ks was calculated using DNAsp (version 5) [76]. For each gene, the occurrence and dating of HGT events were estimated by comparing Ks values from 28 different pairwise combinations listed in Additional file 3. For example, NF1 and fuscans strains belong to phylogenetically distant strains, thus if the Ks between strains from genetic lineages NF1 and fuscans was lower than the mean Ks between other lineages, it was indicative of recent HGT between the ancestors of NF1 and fuscans. Direction of events were assessed by comparing the Ks values for outgroups belonging to Rep-PCR groups 9.4 and 9.6 (Fig. 1). For recombinants, separate analyses were performed for each region on both sides of the recombination point. CDS content per strain. (PDF 406 kb) OrthoMCL matrix. (XLSX 10182 kb) Ks analysis on 100 CDS presenting characteristics unique to CBB agents. (XLSX 64 kb) CheckM analysis of the genomes sequenced in this study. (XLSX 11 kb)
  86 in total

Review 1.  Analyzing the mosaic structure of genes.

Authors:  J M Smith
Journal:  J Mol Evol       Date:  1992-02       Impact factor: 2.395

2.  Size of DNA determined by viscoelastic measurements: results on bacteriophages, Bacillus subtilis and Escherichia coli.

Authors:  L C Klotz; B H Zimm
Journal:  J Mol Biol       Date:  1972-12-30       Impact factor: 5.469

3.  Comparison of the genomes of two Xanthomonas pathogens with differing host specificities.

Authors:  A C R da Silva; J A Ferro; F C Reinach; C S Farah; L R Furlan; R B Quaggio; C B Monteiro-Vitorello; M A Van Sluys; N F Almeida; L M C Alves; A M do Amaral; M C Bertolini; L E A Camargo; G Camarotte; F Cannavan; J Cardozo; F Chambergo; L P Ciapina; R M B Cicarelli; L L Coutinho; J R Cursino-Santos; H El-Dorry; J B Faria; A J S Ferreira; R C C Ferreira; M I T Ferro; E F Formighieri; M C Franco; C C Greggio; A Gruber; A M Katsuyama; L T Kishi; R P Leite; E G M Lemos; M V F Lemos; E C Locali; M A Machado; A M B N Madeira; N M Martinez-Rossi; E C Martins; J Meidanis; C F M Menck; C Y Miyaki; D H Moon; L M Moreira; M T M Novo; V K Okura; M C Oliveira; V R Oliveira; H A Pereira; A Rossi; J A D Sena; C Silva; R F de Souza; L A F Spinola; M A Takita; R E Tamura; E C Teixeira; R I D Tezza; M Trindade dos Santos; D Truffi; S M Tsai; F F White; J C Setubal; J P Kitajima
Journal:  Nature       Date:  2002-05-23       Impact factor: 49.962

4.  Novel insights into the genomic basis of citrus canker based on the genome sequences of two strains of Xanthomonas fuscans subsp. aurantifolii.

Authors:  Leandro M Moreira; Nalvo F Almeida; Neha Potnis; Luciano A Digiampietri; Said S Adi; Julio C Bortolossi; Ana C da Silva; Aline M da Silva; Fabrício E de Moraes; Julio C de Oliveira; Robson F de Souza; Agda P Facincani; André L Ferraz; Maria I Ferro; Luiz R Furlan; Daniele F Gimenez; Jeffrey B Jones; Elliot W Kitajima; Marcelo L Laia; Rui P Leite; Milton Y Nishiyama; Julio Rodrigues Neto; Letícia A Nociti; David J Norman; Eric H Ostroski; Haroldo A Pereira; Brian J Staskawicz; Renata I Tezza; Jesus A Ferro; Boris A Vinatzer; João C Setubal
Journal:  BMC Genomics       Date:  2010-04-13       Impact factor: 3.969

Review 5.  The structural biology of type IV secretion systems.

Authors:  Rémi Fronzes; Peter J Christie; Gabriel Waksman
Journal:  Nat Rev Microbiol       Date:  2009-10       Impact factor: 60.633

6.  Selection-driven gene loss in bacteria.

Authors:  Sanna Koskiniemi; Song Sun; Otto G Berg; Dan I Andersson
Journal:  PLoS Genet       Date:  2012-06-28       Impact factor: 5.917

7.  Sensing and adhesion are adaptive functions in the plant pathogenic xanthomonads.

Authors:  Nadia Mhedbi-Hajri; Armelle Darrasse; Sandrine Pigné; Karine Durand; Stéphanie Fouteau; Valérie Barbe; Charles Manceau; Christophe Lemaire; Marie-Agnès Jacques
Journal:  BMC Evol Biol       Date:  2011-03-11       Impact factor: 3.260

8.  Dynamic evolution of pathogenicity revealed by sequencing and comparative genomics of 19 Pseudomonas syringae isolates.

Authors:  David A Baltrus; Marc T Nishimura; Artur Romanchuk; Jeff H Chang; M Shahid Mukhtar; Karen Cherkis; Jeff Roach; Sarah R Grant; Corbin D Jones; Jeffery L Dangl
Journal:  PLoS Pathog       Date:  2011-07-14       Impact factor: 6.823

9.  Lateral gene transfer and ancient paralogy of operons containing redundant copies of tryptophan-pathway genes in Xylella species and in heterocystous cyanobacteria.

Authors:  Gary Xie; Carol A Bonner; Tom Brettin; Raphael Gottardo; Nemat O Keyhani; Roy A Jensen
Journal:  Genome Biol       Date:  2003-01-29       Impact factor: 13.583

10.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

View more
  4 in total

Review 1.  Mechanistic insights into host adaptation, virulence and epidemiology of the phytopathogen Xanthomonas.

Authors:  Shi-Qi An; Neha Potnis; Max Dow; Frank-Jörg Vorhölter; Yong-Qiang He; Anke Becker; Doron Teper; Yi Li; Nian Wang; Leonidas Bleris; Ji-Liang Tang
Journal:  FEMS Microbiol Rev       Date:  2020-01-01       Impact factor: 16.408

2.  Using Genomes and Evolutionary Analyses to Screen for Host-Specificity and Positive Selection in the Plant Pathogen Xylella fastidiosa.

Authors:  Tiffany N Batarseh; Abraham Morales-Cruz; Brian Ingel; M Caroline Roper; Brandon S Gaut
Journal:  Appl Environ Microbiol       Date:  2022-09-12       Impact factor: 5.005

3.  Identification of a virulence tal gene in the cotton pathogen, Xanthomonas citri pv. malvacearum strain Xss-V2-18.

Authors:  Fazal Haq; Shiwang Xie; Kunxuan Huang; Syed Mashab Ali Shah; Wenxiu Ma; Lulu Cai; Xiameng Xu; Zhengyin Xu; Sai Wang; Lifang Zou; Bo Zhu; Gongyou Chen
Journal:  BMC Microbiol       Date:  2020-04-15       Impact factor: 3.605

4.  Common bacterial blight of bean: a model of seed transmission and pathological convergence.

Authors:  Nicolas W G Chen; Mylène Ruh; Armelle Darrasse; Justine Foucher; Martial Briand; Joana Costa; David J Studholme; Marie-Agnès Jacques
Journal:  Mol Plant Pathol       Date:  2021-05-04       Impact factor: 5.663

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.