Literature DB >> 31014247

Genomic sequence analysis reveals diversity of Australian Xanthomonas species associated with bacterial leaf spot of tomato, capsicum and chilli.

R Roach1,2, R Mann3, C G Gambley4, T Chapman5, R G Shivas6, B Rodoni3.   

Abstract

BACKGROUND: The genetic diversity in Australian populations of Xanthomonas species associated with bacterial leaf spot in tomato, capsicum and chilli were compared to worldwide bacterial populations. The aim of this study was to confirm the identities of these Australian Xanthomonas species and classify them in comparison to overseas isolates. Analysis of whole genome sequence allows for the investigation of bacterial population structure, pathogenicity and gene exchange, resulting in better management strategies and biosecurity.
RESULTS: Phylogenetic analysis of the core genome alignments and SNP data grouped strains in distinct clades. Patterns observed in average nucleotide identity, pan genome structure, effector and carbohydrate active enzyme profiles reflected the whole genome phylogeny and highlight taxonomic issues in X. perforans and X. euvesicatoria. Circular sequences with similarity to previously characterised plasmids were identified, and plasmids of similar sizes were isolated. Potential false positive and false negative plasmid assemblies were discussed. Effector patterns that may influence virulence on host plant species were analysed in pathogenic and non-pathogenic xanthomonads.
CONCLUSIONS: The phylogeny presented here confirmed X. vesicatoria, X. arboricola, X. euvesicatoria and X. perforans and a clade of an uncharacterised Xanthomonas species shown to be genetically distinct from all other strains of this study. The taxonomic status of X. perforans and X. euvesicatoria as one species is discussed in relation to whole genome phylogeny and phenotypic traits. The patterns evident in enzyme and plasmid profiles indicate worldwide exchange of genetic material with the potential to introduce new virulence elements into local bacterial populations.

Entities:  

Keywords:  CAZymes; Cell wall degrading enzymes; Pan genome; Secretion system

Mesh:

Year:  2019        PMID: 31014247      PMCID: PMC6480910          DOI: 10.1186/s12864-019-5600-x

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

In recent years, whole genome sequences of a variety of bacterial plant pathogens have been used to investigate the phylogenetic relationships between species, as well as the genetic basis for pathogenicity and potential diagnostic target genes [1]. Next generation sequencing (NGS) and population genomics provides insight into many facets of host-pathogen interactions [2]. The wealth of information generated with NGS technology gives plant pathologists an opportunity to investigate pathogen movement, infection strategies and phenotypic trait association with the ultimate goal of providing targeted management strategies and better biosecurity. For example, genome sequence analysis of pathogenic and non-pathogenic Xanthomonas species on Prunus spp. resulted in a molecular diagnostic assay to differentiate pathogenic and non-pathogenic strains where previous tests did not [3]. Similar studies have examined the genetic diversity of Xanthomonas species that cause bacterial leaf spot (BLS) of tomato, capsicum and chilli worldwide [1], but not yet to Australian Xanthomonas strains associated with this disease. Xanthomonas species reported to cause BLS in Australian tomato and pepper have been assigned to X. euvesicatoria, X. perforans and X. vesicatoria, with non-pathogenic strains of X. arboricola and Xanthomonas sp. also isolated [4]. A draft genome comparison of BLS-causing X. vesicatoria, X. euvesicatoria, X. gardneri and X. perforans provided the basis for many subsequent studies using genomic data. Insights into the virulence and pathogenicity of Xanthomonas has been provided by genomic studies that have revealed much about plasmids and secretion systems that deliver effectors and host cell wall degrading enzymes [1, 5]. Plasmid transferral via conjugation is a major mechanism of gene transfer throughout bacterial populations, accounting for rapid shifts in pathogen response to chemicals, antibiotics and host resistance genes [6-8]. Plasmids of BLS-causing Xanthomonas species vary in size and carry virulence and resistance genes [9, 10]. Gene cassettes and integrons are also responsible for genome diversity in Xanthomonas [11]. The characteristic structure and content of a number of Xanthomonas species has been described as an open pan genome that readily exchanges mobile elements within a population [12]. Other features of Xanthomonas genomes include products of bacterial secretion systems involved with host interactions such as effectors and carbohydrate active enzymes. Understanding these elements of the bacterial genome are key to understanding how genetics reflects species phylogeny and pathogenicity. Plant defence responses to bacterial pathogens involve recognition of molecular patterns or proteins associated with bacterial secretion systems [13]. Pathogen associated molecular patterns are recognised by pattern receptors in the host that then triggers immunity. Proteins introduced by bacterial secretion systems, known as effectors may also induce immunity. Effectors of the type III secretion system (T3SS) were shown to be the main source of virulence in X. campestris pv. campestris [14], and integral to pathogenicity in Xanthomonas [13]. The T3SS introduces a complex of proteins to the host cell that target plant cell structures, alter the regulation of host genes or act as chaperones and delivery systems for the secreted effectors [15]. Effectors of plant pathogens are complex and diverse; some of the better studied include the TAL/ TALE (transcription activator-like) classes of effectors [16]. The Xop (Xanthomonas outer protein) effector classes and general effector nomenclature is described by White et al. [13], and are identified in strains of many Xanthomonas species. They note that the complex interactions between secreted proteins and host cells will likely be expanded and refined with additional genomic data. The need to understand the impact of effectors is demonstrated by the X. perforans host range expansion partially correlated with the loss of the effector AvrBsT [17]. Interestingly, AvrBsT has been described as a fitness factor [18], demonstrating that effectors may influence disease severity as well as host range. Other effectors have been linked to pathogenic function, such as AvrHah1 inducing a water soaking effect common in many bacterial diseases [19] by upregulating the intake of water into cells. Even as more genomes are sequenced every year, there is still much to be investigated about effector function [20]. In addition to the T3SS, the type II secretion system (T2SS) has also been described as important for pathogenicity in Xanthomonas species [21]. The type II secretion system is a common feature of many plant and animal pathogens as well as non-pathogenic species, involved in a range of infection and colonisation processes [22]. The T2SS is typically associated with the secretion of carbohydrate active enzymes (CAZymes), families of enzymes involved in carbohydrate processing pathways. Carbohydrate degradation has traditionally been used as a diagnostic trait in bacteriology [23], and have also been discussed in structural biology as therapeutic targets [24]. Determining which CAZyme families are present in bacterial strains may indicate substrate preference and pathogenicity. As they are currently classified, CAZymes are described by protein sequence as numbered families of six classes; glycoside hydrolases (GH), glycosyl transferases (GT), polysaccharide lyases (PL), carbohydrate esterases (CE), carbohydrate binding modules (CBM), and auxiliary activity families (AA) [25, 26]. The variety of secreted proteins in Xanthomonas and their impact on pathogenicity has been reviewed previously [1, 27], highlighting the potential for effector and CAZyme profiles to infer pathogenicity and bacterial growth strategies. The analyses of genome structure and secretion system products contribute to the understanding of bacterial relatedness and function. By comparing genomes of Australian BLS-associated Xanthomonas strains we aim to improve our understanding of the taxonomic status of these species as well as incorporating Australian BLS-causing strains into the global understanding of this pathogen complex. These analyses will provide a foundation for further identification of targets for resistance breeding or future population genetics studies.

Results

Taxonomy and pathogenicity

Genome statistics of all 50 Australian Xanthomonas draft genomes are reported in Table 1. All draft genomes were approximately 5 Mbp in length, ranging from 4,806,110 bp to 5,379,097 bp and had a GC content ranging from 64.02 to 66.14% (average of 64.74%) which is consistent with reference genomes of the sequenced species [10, 27].
Table 1

Draft genome statistics of 50 Australian Xanthomonas strains and reported genome statistics of public assemblies from Genbank

StrainGenbank ID bSpecies cHostRegionCollection dateContig numberMbCDS fN50GC%Reference
BRIPa 62410QFAP01xaTomatoStanthorpe2015364.94077224,44665.76this study
BRIP 62412QFAO01xaTomatoStanthorpe2015535.124295403,34165.65this study
BRIP 62414QFAN01xaTomatoStanthorpe2015455.144304307,90265.63this study
BRIP 62416QFAM01xaTomatoStanthorpe2015635.174266455,33765.66this study
BRIP 62432QFAQ01xaTomatoTenterfield2015494.954060227,84565.61this study
NCPPB 1630JPHE01xa pv ce dBananaNew Zealand19601214.98406280,99465.5Harrison et al. 2014 (Unpublished)
NCPPB 1832JPHC01xa pv ceBananaNew Zealand1960754.914044172,77265.6Harrison et al. 2016
NCPPB 1447AJTL01xa pv j dna enana3715.02425726,88365.4Noh et al. (Unpublished)
NCCB 100457APMC02xa pv co dHazelUSA20102815.23434635,47965.5Ibarra Caballero et al. 2013
xa3004AZQY01xaBarleyRussia19981324.76393661,66566Ignatov et al. 2015
IVIA 2626.1LJGN01xa pv p dPlum (Prunus salicina)Spain20092145.03426847,65065.4Garita-Cambronero et al. 2016
CITA 44LJGM01xaCherry (Prunus mahaleb)Spain2002714.763951120,98165.8Garita-Cambronero et al. 2016
J303LSGZ01xa pv jWalnutChile20111615.07422753,67365.5Santander et al. (Unpublished)
CITA 124LXKK01xaPeach (Prunus persica)Spain20121284.75404160,81865.8Garita-Cambronero et al. (Unpublished)
CITA 14LXIB01xaPeach (Prunus persica)Spain2008724.864001100,94665.6Garita-Cambronero et al. (Unpublished)
CFBP 2528JZEF01xa pv jWalnutNew Zealand1956195.084264431,85965.5Cesbron et al. (Unpublished)
CFBP 7634JZEH01xaWalnutFrance200254.9341023,063,88665.6Cesbron et al. (Unpublished)
CFBP 7179JZEG01xa pv jWalnutFrance2002235.164358479,11365.4Cesbron et al. (Unpublished)
CFBP 7651JZEI01xaWalnutFrance200885.0341882,811,77065.5Cesbron et al. (Unpublished)
CFBP 3894LOMI01xa pv pPlum (Prunus salicina)New Zealand1953785.064204162,13565.4Lopez-Soriano et al. (Unpublished)
MAFF301420BAVC01xa pv pnanana3965.002448521,37365.3Fujikawa et al. (Unpublished)
MAFF311562BAVB01xa pv pnanana4325.08463023,37465.3Fujikawa et al. (Unpublished)
MAFF301427BAVD01xa pv pnanana3564.9444427,52965.4Fujikawa et al. (Unpublished)
Xap33JHUQ01xa pv pAlmondSpain20094745.1438834,43865.4Garita-Cambronero et al. 2014
BRIP 63464QEZX01xeCapsicumBundaberg20151405.344561194,82264.53this study
DAR 82542QEYY01xeCapsicumBundaberg19814265.12448230,59064.66this study
BRIP 62441QFAC01xeChilliBundaberg20151355.344565159,12264.52this study
BRIP 62555QFAB01xeChilliBundaberg20151395.344570164,57464.53this study
BRIP 62757QFAA01xeChilliBundaberg20151495.334558167,01564.53this study
BRIP 62858QEZZ01xeChilliBundaberg20151445.344554146,01464.53this study
BRIP 62959QEZY01xeChilliBundaberg20151295.344560169,17564.53this study
BRIP 38997QEZV01xeChilliBundaberg19861125.34513163,56564.55this study
BRIP 39016QEZW01xeApple of Peru (Nicandra physalodes)Christmas creek1973965.064290171,82764.6this study
DAR 34895QEYX01xeCapsicumDural1981695.234455176,95264.62this study
BRIP 62390QFAL01xeCapsicumGatton20141375.344550153,55164.53this study
BRIP 62391QFAK01xeCapsicumGatton20141315.344559113,98264.53this study
BRIP 62392QFAJ01xeCapsicumGatton20141835.344562169,14164.53this study
BRIP 62395QFAI01xeCapsicumGatton20141695.354574179,35164.53this study
BRIP 62396QFAH01xeChilliGatton20141615.384604102,92064.5this study
BRIP 62400QFAG01xeChilliGatton20141335.344548194,82264.53this study
BRIP 62403QFAF01xeChilliGatton20141445.344557135,45364.53this study
BRIP 62438QFAD01xeCapsicumHawkesbury Heights2015995.124310161,93364.71this study
BRIP 62425QFAE01xeCapsicumStanthorpe20151635.344576164,09164.53this study
DAR 26930QEYW01xeTomatoLiverpool19761015.054201103,61764.9this study
66bJSZG01xeCapsicum (C. annuum cv. Kambi)Bulgaria20123845.31455028,79364.6Vancheva et al. 2015
83 MJSZH01xeCapsicum (C. annuum cv. Kurtovska kapija)Macedonia20133175.1433031,38564.7Vancheva et al. 2015
LMG 27970JPYC01xepepperBelgium194714915.044380918664.7Constantin et al. 2015
Xe329JZLQ01xePepperUSA19972035.094242210,54164.7Schwartz et al. 2015
Xe206JZLN01xePepperUSA19951335.444648133,86464.5Schwartz et al. 2015
Xe515JZLV01xePepperUSA19991185.324525163,35064.5Schwartz et al. 2015
Xe526JZLW01xePepperUSA20001785.394598176,62764.5Schwartz et al. 2015
Xe586JZLX01xePepperUSA20031205.314522140,45364.6Schwartz et al. 2015
Xe679JZLZ01xePepperUSA20081095.144324144,31064.7Schwartz et al. 2015
Xe681JZMA01xePepperUSA20091045.164360210,55564.7Schwartz et al. 2015
Xe678JZLY01xePepperUSA20082025.424643132,31264.4Schwartz et al. 2015
Xe259JZLO01xePepperUSA19951395.254423187,53564.6Schwartz et al. 2015
Xe199JZLM01xePepperUSA19941105.274470174,84264.6Schwartz et al. 2015
Xe315JZLP01xePepperUSA19966295.514638164,44264.3Schwartz et al. 2015
Xe354JZLR01xePepperUSA19982875.434629176,62764.4Schwartz et al. 2015
Xe376JZLS01xePepperUSA19951285.34500176,63564.5Schwartz et al. 2015
Xe455JZLT01xePepperUSA19981375.334549148,78364.5Schwartz et al. 2015
Xe490JZLU01xePepperUSA19991325.374587187,43964.5Schwartz et al. 2015
Xe683JZRP01xePepperUSA20101085.324544163,37164.5Schwartz et al. (Unpublished)
Xe684JZRQ01xePepperUSA20101575.384594148,78364.5Schwartz et al. (Unpublished)
Xe685JZRR01xePepperUSA20101315.324536148,77064.5Schwartz et al. (Unpublished)
F4–2JZRU01xePepperUSA20041235.354591159,11364.5Schwartz et al. (Unpublished)
H3–2JZRW01xePepperUSA20041085.324531163,30664.5Schwartz et al. (Unpublished)
Xe689JZRS01xePepperUSA20111435.374588163,33664.5Schwartz et al. (Unpublished)
Xe695JZRT01xePepperUSA20121105.364588163,31064.5Schwartz et al. (Unpublished)
G4–1JZRV01xePepperUSA20041125.354586163,30664.5Schwartz et al. (Unpublished)
L3–2JZRX01xePepperUSA20041525.364584159,11464.5Schwartz et al. (Unpublished)
Xe181JZRY01xePepperUSA19941075.314518163,97564.5Schwartz et al. (Unpublished)
ATCC 19865AEQX01xgTomatoYugoslavia19615525.53478622,68663.7Potnis et al. 2011
SM406–11JZJW01xgTomatoUSA20114615.26450818,99563.5Schwartz et al. 2015
SM234–10JZJV01xgTomatoUSA20102505.33452142,58363.5Schwartz et al. 2015
SM413–11JZJX01xgTomatoUSA20113135.31452729,04963.5Schwartz et al. 2015
SM605–11JZJY01xgTomatoUSA20112425.34451745,83063.5Schwartz et al. 2015
SM194–10JZJR01xgTomatoUSA20103095.39455041,67863.5Schwartz et al. 2015
SM177–10JZJS01xgTomatoUSA20102455.31450941,49163.5Schwartz et al. 2015
SM182–10JZJT01xgTomatoUSA20102515.3450536,74363.5Schwartz et al. 2015
SM220–10JZJU01xgTomatoUSA20102605.29450638,54463.5Schwartz et al. 2015
SM795–12JZKA01xgTomatoUSA20122935.13432537,71263.6Schwartz et al. 2015
SM775–12JZJZ01xgTomatoUSA20122415.24442045,19063.6Schwartz et al. 2015
BRIP 62383QEZU01xpTomatoBowen2012615.274462302,79964.68this study
BRIP 62384QEZT01xpTomatoBowen2012775.254450312,50964.66this study
BRIP 62398QEZP01xpTomatoBowen2014575.234410187,43864.67this study
BRIP 62404QEZO01xpTomatoBowen20141095.254446140,26064.66this study
BRIP 62397QEZQ01xpTomatoBrisbane2014454.974178288,61264.99this study
BRIP 62386QEZS01xpTomatoBundaberg2012805.294484187,31164.67this study
BRIP 62405QEZN01xpTomatoBundaberg2014775.284478293,98064.67this study
BRIP 63262QEZM01xpTomatoBundaberg20151005.334545372,91564.65this study
BRIP 63565QEZL01xpTomatoBundaberg2015735.234421312,48764.68this study
BRIP 63666QEZK01xpTomatoBundaberg2015595.284473312,51264.68this study
BRIP 62389QEZR01xpTomatoSouth Turramurra2013705.274466225,39464.68this study
Xp91–118CP019725.1xpTomatoUSA200414.8944504,898,34965Potnis et al. 2011
Xp5–6JZVA01xpTomatoUSA2006705.34501202,26264.6Schwartz et al. 2015
Xp7–12JZVB01xpTomatoUSA2006655.124343207,80264.9Schwartz et al. 2015
Xp9–5JZVD01xpTomatoUSA20061135.384624193,59764.6Schwartz et al. 2015
Xp10–13JZVE01xpTomatoUSA2006785.174410158,83064.8Schwartz et al. 2015
Xp11–2JZVF01xpTomatoUSA2006505.354568302,42264.6Schwartz et al. 2015
Xp15–11JZVG01xpTomatoUSA2006635.314520236,13764.6Schwartz et al. 2015
GEV839JZVK01xpTomatoUSA20121095.444669107,80264.4Schwartz et al. 2015
Xp3–15JZUY01xpTomatoUSA2006935.464692226,88664.4Schwartz et al. 2015
Xp8–16JZVC01xpTomatoUSA2006625.264469241,32864.6Schwartz et al. 2015
Xp4–20JZUZ01xpTomatoUSA2006715.294515250,79664.7Schwartz et al. 2015
Xp17–12JZVH01xpTomatoUSA2006845.24387201,08364.7Schwartz et al. 2015
Xp18–15JZVI01xpTomatoUSA2006785.354568178,51964.6Schwartz et al. 2015
GEV909JZVO01xpTomatoUSA2012715.144388128,79264.9Schwartz et al. 2015
GEV940JZVS01xpTomatoUSA2012985.144384118,06664.9Schwartz et al. 2015
GEV968JZVT01xpTomatoUSA2012965.144392114,91364.9Schwartz et al. 2015
GEV1001JZVV01xpTomatoUSA2012785.184402123,60664.8Schwartz et al. 2015
GEV872JZVL01xpTomatoUSA2012805.144401143,70964.9Schwartz et al. 2015
GEV893JZVM01xpTomatoUSA20121165.134387105,78364.9Schwartz et al. 2015
Xp4BJZUX01xpTomatoUSA19982145.22446973,79664.6Schwartz et al. 2015
GEV904JZVN01xpTomatoUSA20121345.14440577,14564.9Schwartz et al. 2015
GEV936JZVR01xpTomatoUSA20121045.144390111,23464.9Schwartz et al. 2015
GEV915JZVP01xpTomatoUSA2012705.124356180,79564.9Schwartz et al. 2015
GEV917JZVQ01xpTomatoUSA20121225.14439085,81464.9Schwartz et al. 2015
GEV1054JZVY01xpTomatoUSA2012855.234450128,85064.7Schwartz et al. 2015
GEV1044JZVX01xpTomatoUSA2012825.244456128,85064.7Schwartz et al. 2015
TB6JZWA01xpTomatoUSA20131485.274473249,90364.6Schwartz et al. 2015
TB9JZWB01xpTomatoUSA20132685.324483236,00764.6Schwartz et al. 2015
GEV993JZVU01xpTomatoUSA2012695.154385176,93864.9Schwartz et al. 2015
GEV1026JZVW01xpTomatoUSA2012705.154373158,06264.9Schwartz et al. 2015
GEV1063JZVZ01xpTomatoUSA2012935.164384128,85064.8Schwartz et al. 2015
TB15JZWC01xpTomatoUSA20132615.334505233,35364.6Schwartz et al. 2015
Xp2010JZVJ01xpTomatoUSA2010135.2545751,113,51164.6Schwartz et al. 2015
4P1S2JRWW01xpTomatoSicily20114425.51521641,48064.7Torelli et al. (Unpublished)
DAR 33341QEYV01x spCapsicumPeats Ridge1979575.184387199,70064.66this study
BRIP 62409QEZJ01x spTomatoStanthorpe2015374.924033210,84065.92this study
BRIP 62411QEZI01x spTomatoStanthorpe2015464.814002334,51966.14this study
BRIP 62415QEZH01x spTomatoStanthorpe2015494.853983325,85766.06this study
BRIP 62418QEZG01x spTomatoStanthorpe2015675.094274193,02465.8this study
BRIP 38864QEZA01xvWild tomatoBowen1982695.224439377,67964.23this study
DAR 26931QEYU01xvCapsicumBurrundulla1977465.184386243,08964.25this study
DAR 26933QEYT01xvCapsicumRedland bay197718514.934899459664.09this study
BRIP 62388QEZD01xvTomatoSouth Turramurra2012855.314488142,44964.02this study
BRIP 62413QEZF01xvTomatoStanthorpe2015565.374574377,58864.12this study
BRIP 62423QEZE01xvTomatoStanthorpe2015695.374582467,77564.12this study
BRIP 62428QEZC01xvTomatoTenterfield2015665.364575298,83364.12this study
BRIP 62429QEZB01xvTomatoTenterfield2015605.364567198,56764.13this study
BRIP 38861QEYZ01xvTomatoVictoria Point1981465.234442377,69764.22this study
ATCC 35937AEQV01xvTomatoNew Zealand19552965.53476841,38064.1Potnis et al. 2011
53 MJSYJ01xvCapsicum annuum cv. Kurtovska kapijaMacedonia20053445.29452629,34064Vancheva et al. 2015
15bJSXZ01xvCapsicum annumBulgaria20053385.29451731,44964Vancheva et al. 2015
LMG 919JTEF01xvTomatoZimbabwena4205.12439121,73164.3Lefeuvre et al. (Unpublished)
LMG 920JTEG01xvTomatoItaly19563265.16443028,85064.2Lefeuvre et al. (Unpublished)

aBrisbane pathology herbarium ID

bWGS project number or Genbank ID where complete genome is available

cxa X. arboricola, xe X. euvesicatoria, xp X. perforans, xv X. vesicatoria, x sp Xanthomonas sp.

dxa pv ce: X. arboricola pv. celebensis, xa pv j: X. arboricola pv. juglandis, xa pv co: X. arboricola pv. corylina, xa pv p: X. arboricola pv. pruni

eData not available

fCDS as determined by the annotation of this study for consistency

Included statistics are contig number, length in Mbp, CDS, N50 and GC content. All reported CDS numbers were generated in this study. The metadata for Australian strains was sourced from Roach et al. 2017

Draft genome statistics of 50 Australian Xanthomonas strains and reported genome statistics of public assemblies from Genbank aBrisbane pathology herbarium ID bWGS project number or Genbank ID where complete genome is available cxa X. arboricola, xe X. euvesicatoria, xp X. perforans, xv X. vesicatoria, x sp Xanthomonas sp. dxa pv ce: X. arboricola pv. celebensis, xa pv j: X. arboricola pv. juglandis, xa pv co: X. arboricola pv. corylina, xa pv p: X. arboricola pv. pruni eData not available fCDS as determined by the annotation of this study for consistency Included statistics are contig number, length in Mbp, CDS, N50 and GC content. All reported CDS numbers were generated in this study. The metadata for Australian strains was sourced from Roach et al. 2017 The SNP-based phylogenetic tree arranges most strains in this study into distinct clades (Additional file 1: Figure S1). The X. euvesicatoria and X. perforans clades grouped distinctly from the X. vesicatoria, X. gardneri and X. arboricola clades. The X. vesicatoria and X. arboricola clades contain three and four distinct subclades respectively (branch support values of 1). Four strains from tomato in Stanthorpe (BRIP 62409, 62411, 62415, and 62418, designated the uncharacterised Xanthomonas sp. clade) resolved in a clade distinct from its closest relative X. arboricola. The core genome phylogenies of individual BLS-causing species (excluding X. gardneri, X. arboricola and the uncharacterised Xanthomonas clade) clustered Australian strains in clades with multiple strains from other countries (Fig. 1). Australian strains of X. perforans cluster with overseas isolates xp 91–118, xp 4p1s2 and xp 17–12. Nine Australian X. euvesicatoria strains cluster in a clade by themselves, with the other ten distributed across clades with overseas strains. Australian X. vesicatoria strains cluster in three clades with overseas strains, distant from the type strain ATCC 35937.
Fig. 1

Phylogeny of Australian and Genbank genomes of a) X. euvesicatoria, b) X. perforans and c) X. vesicatoria based on core genome alignments generated by the Roary program. Australian strains are indicated by BRIP and DAR collection prefixes and highlighted; all others are public genomes of related species. Type strains are indicated in bold and branch support values are displayed to clade level (measured with the Shimodaira-Hasegawa test within FastTree). Branch length is indicated by the scale bar

Phylogeny of Australian and Genbank genomes of a) X. euvesicatoria, b) X. perforans and c) X. vesicatoria based on core genome alignments generated by the Roary program. Australian strains are indicated by BRIP and DAR collection prefixes and highlighted; all others are public genomes of related species. Type strains are indicated in bold and branch support values are displayed to clade level (measured with the Shimodaira-Hasegawa test within FastTree). Branch length is indicated by the scale bar All X. euvesicatoria strains apart from BRIP 39016 tested were pathogenic on both capsicum [4] and tomato, where BRIP 39016 was determined to be non-pathogenic on both hosts. Strains of X. vesicatoria and X. perforans were pathogenic on tomato as determined previously [4], and non-pathogenic on capsicum. Strains of the uncharacterised Xanthomonas sp. were non-pathogenic on both hosts, and strains of X. arboricola were designated non-pathogenic on tomato [4] and capsicum. Pathogenicity of X. euvesicatoria on tomato was observed as small, dark lesions with yellow halo that displayed bacterial streaming. Isolations resulted in yellow, gram negative colonies. Average nucleotide identity (ANI) of core genome sequence analysed in this study are presented in Fig. 2. An ANI of 93% supports the separation of X. arboricola and the four uncharacterised Xanthomonas sp. strains into two separate species. ANI values of > 98% showed the genetic similarity of X. perforans and X. euvesicatoria while also displaying conserved separation. ANI analysis indicates that BRIP 39016 and DAR 26930 are also strains of X. euvesicatoria (ANI > 98%). Strain DAR 33341 has an ANI < 95% to all other strains in the analysis and 94% to X. euvesicatoria and was therefore excluded from X. euvesicatoria.
Fig. 2

A heat map and dendrogram of average nucleotide identity (ANI) between 147 Xanthomonas genomes. The coloured bars represent the species as indicated in the SNP phylogeny and supported ANI values shown here. Xanthomonas perforans strains of X. euvesicatoria are indicated separately to highlight ANI differences. ANI is depicted as the colour gradient indicated by the legend: darker = 1 (100% ANI), lighter = 0.88 (88% ANI)

A heat map and dendrogram of average nucleotide identity (ANI) between 147 Xanthomonas genomes. The coloured bars represent the species as indicated in the SNP phylogeny and supported ANI values shown here. Xanthomonas perforans strains of X. euvesicatoria are indicated separately to highlight ANI differences. ANI is depicted as the colour gradient indicated by the legend: darker = 1 (100% ANI), lighter = 0.88 (88% ANI)

Pan genome composition

The nucleotide homologue cluster matrix grouped all strains (Fig. 3) in a generally similar topology to the phylogeny while also highlighting distinct differences between species. All species contained unique homologues (280 in X. arboricola, 70 in X. euvesicatoria, 69 in X. perforans, 416 in the combined X. euvesicatoria and X. perforans clades, 1639 in X. gardneri, 1646 in X. vesicatoria, and 609 in the uncharacterised Xanthomonas sp. clade). Pan genome pie charts (Fig. 4) based on the homologue matrix (Fig. 3) describe the core, soft core, shell and cloud genome content of each species. Gene discovery plots for each species (Additional file 2: Figure S2) showed that the number of new genes approached zero as genome number increased.
Fig. 3

Cluster matrix of 147 Xanthomonas genomes with dendrogram based on homologue presence (dark) and absence (light). Species groupings are indicated with coloured bars as determined by phylogeny and ANI. Xanthomonas perforans strains of X. euvesicatoria are indicated separately to highlight homologue differences. The four Australian strains most closely related to X. arboricola are designated in the text as an uncharacterised Xanthomonas species

Fig. 4

Pie plots of gene content in core, soft core, shell and cloud genomes describing the pan genome for X. euvesicatoria, X. perforans, X. vesicatoria and X. gardneri. The core genome is defined as genes present in 99–100% of strains; soft core, shell and cloud genomes are defined as 95–99%, 15–95% and 0–15% respectively. Number of genomes in each pan genome is indicated as ‘n’

Cluster matrix of 147 Xanthomonas genomes with dendrogram based on homologue presence (dark) and absence (light). Species groupings are indicated with coloured bars as determined by phylogeny and ANI. Xanthomonas perforans strains of X. euvesicatoria are indicated separately to highlight homologue differences. The four Australian strains most closely related to X. arboricola are designated in the text as an uncharacterised Xanthomonas species Pie plots of gene content in core, soft core, shell and cloud genomes describing the pan genome for X. euvesicatoria, X. perforans, X. vesicatoria and X. gardneri. The core genome is defined as genes present in 99–100% of strains; soft core, shell and cloud genomes are defined as 95–99%, 15–95% and 0–15% respectively. Number of genomes in each pan genome is indicated as ‘n’

Predicted and isolated plasmids and predicted effector content

Contigs originating from plasmids were assembled for 48 of 50 sequenced Australian strains, resulting in a total of 61 plasmids reconstructed for 41 strains (Table 2). A 31 kb plasmid (13 plasmids of 31,328 bp and two of 31,318 bp) was present in 15 of the X. euvesicatoria strains studied, although it was not found in any X. perforans strains. A 150 kb plasmid (six plasmids of 159,114 bp and one of 159,115 bp) was present in seven X. euvesicatoria strains and none of the X. perforans strains. An 88 kb plasmid (3 plasmids of 88,047 bp, 88,057 bp, 88,063 bp) was present in three of the X. vesicatoria strains. A 47 kb plasmid (three plasmids of 47,218 bp, one of 47,214 bp) was present in four strains of X. vesicatoria. The remaining plasmids were not unique to individual species. A 17 kb plasmid (two of 17,360 kb, one of 17,359 kb, 17,382 kb, and 17,377 kb) was present in four X. euvesicatoria strains and one X. arboricola strain. Plasmids of 40–41 kb were present in two X. euvesicatoria strains, one X. perforans strain and two X. vesicatoria strains. A single X. euvesicatoria strain contained two plasmids of 28,462 kb and 32,900 kb, while a strain of the uncharacterised Xanthomonas sp. contained a plasmid of 28,836 kb. The 19 plasmids less than 10 kb in size were found in five X. euvesicatoria strains, nine strains of X. perforans, one strain of the uncharacterised Xanthomonas clade, and DAR 33341. Protein sequence with homology to established effectors were detected in reconstructed plasmid sequence. Homologues of AvrBsT were detected in each of seven strains of X. vesicatoria (four 47 kb plasmids and three 88 kb plasmids); AvrBs3 in two strains of X. euvesicatoria on 40 kb plasmids and XopH in an X. perforans strain on a 41 kb plasmid.
Table 2

Reconstructed circular sequence assembled for Australian Xanthomonas draft genomes from whole genome sequence data

StrainSpecies aPlasmid number bPlasmid .0 cPlasmid .1Plasmid .2Plasmid .3
BRIP 62410xa0
BRIP 62412xa0
BRIP 62414xa0
BRIP 62416xa117,377
BRIP 62432xe117,382
BRIP 62441xe131,328
BRIP 62555xe131,328
BRIP 62757xe2159,11431,328
BRIP 62858xe38247159,11431,328
BRIP 62959xe1159,115
BRIP 38997xe217,360***40,439
BRIP 63464xe2159,11431,328
BRIP 39016xe232,90028,462
BRIP 62396xe231,3284316
BRIP 62400xe131,328
BRIP 62403xe231,328159,114
BRIP 62390xe2159,11431,328
BRIP 62391xe2824731,328
BRIP 62392xe131,328
BRIP 62395xe131,328
BRIP 62438xe0
BRIP 62425xe331,328159,11417,360
DAR 34895xe131,318
DAR 82542xe4258717,350***40,42931,318
DAR 26930xe18201
BRIP 62383xp11418
BRIP 62384xp12808
BRIP 62398xp17812
BRIP 62404xp12808
BRIP 62397xp1**41,842
BRIP 62386xp12808
BRIP 62405xp225895031
BRIP 63262xp225895031
BRIP 63565xp12808
BRIP 63666xp11418
BRIP 62389xp0
BRIP 62409x sp0
BRIP 62411x sp128,836
BRIP 62415x sp0
BRIP 62418x sp11418
DAR 33341x sp271544921
BRIP 38864xv2*8805741,641
BRIP 62388xv0
BRIP 62413xv1*47218
BRIP 62423xv1*47218
BRIP 62428xv1*47218
BRIP 62429xv1*47214
BRIP 38861xv2*8806341,641
DAR 26931xv1*88047
DAR 26933xv0

aspecies as determined by whole genome SNP phylogeny and ANI

bcircular sequences assembled by recycler for each strain

ccircular sequence ID is indicated by decimal value after strain ID

Length is described in base pairs. * indicates the predicted plasmid contains AvrBsT, ** XopH, ***AvrBs3

Reconstructed circular sequence assembled for Australian Xanthomonas draft genomes from whole genome sequence data aspecies as determined by whole genome SNP phylogeny and ANI bcircular sequences assembled by recycler for each strain ccircular sequence ID is indicated by decimal value after strain ID Length is described in base pairs. * indicates the predicted plasmid contains AvrBsT, ** XopH, ***AvrBs3 Predicted plasmids were investigated by isolating and visualising plasmid DNA of strains BRIP 38864, BRIP 62858, BRIP 62416, BRIP 62423, BRIP 62388, BRIP 62397, BRIP 63464, BRIP 38997 and the extraction control DC 3000. (Additional file 3: Figure S3). Bands approximately of the size predicted were observed in strains BRIP 38864, BRIP 62858, BRIP 62423, BRIP 62397, BRIP 63464 and DC 3000. Where no plasmids were predicted for strain BRIP 62388, bands similar to those of other extractions were observed. For strains BRIP 62416, no plasmids were recovered despite the prediction of 17 kbp circular sequence. BRIP 62423, BRIP 62397, BRIP 38997, and BRIP 62858 may have additional large bands that could not be separated effectively. Sizing is only approximate due to the possibility of multiple plasmid structures (nicked circular sequence, linear sequence, supercoiled plasmids) migrating through the gel at different rates.

Effector and CAZyme content

The effector profiles of the dataset (Fig. 5) grouped species in the same general topology as the whole genome SNP phylogeny (Additional file 1: Figure S1). Effectors that were core to each species and the entire dataset, as well as effectors discussed in other studies, are listed in supplementary material (Additional file 4: Table S1). The occurrence of important effectors identified in previous studies [1, 17] are also listed here. Retained as core to BLS-causing species (X. euvesicatoria, X. perforans, X. vesicatoria and X. gardneri) as listed previously (AvrBs2, XopR, XopX, XopZ1, XopAD, XopN, XopF1, XopK, XopL, XopQ, XopD) [1] are AvrBs2, XopR, XopX, XopZ1 and XopAD. Several of these effectors previously considered core were detected in all but a few strains of certain species; XopN was absent in one X. vesicatoria and one X. gardneri strain, XopF1 was absent in one X. gardneri strain, XopK was absent in one X. euvesicatoria strain, and XopL was absent in four X. gardneri strains. XopQ was absent in all X. vesicatoria strains and XopD was absent in eight X. vesicatoria strains.
Fig. 5

Presence/ absence matrix with dendrogram of effectors identified in 147 Xanthomonas genomes. Effector presence is indicated by colour as described in the legend (presence = blue, absence = red). Names and Genbank numbers of identified effectors are listed vertically. Species groupings as determined by phylogeny and ANI are indicated by the horizontal coloured bar as follows: X. euvesicatoria; orange, X. perforans; pink, X. vesicatoria; blue, X. gardneri; dark green, X. arboricola; green, Xanthomonas sp.; light green

Presence/ absence matrix with dendrogram of effectors identified in 147 Xanthomonas genomes. Effector presence is indicated by colour as described in the legend (presence = blue, absence = red). Names and Genbank numbers of identified effectors are listed vertically. Species groupings as determined by phylogeny and ANI are indicated by the horizontal coloured bar as follows: X. euvesicatoria; orange, X. perforans; pink, X. vesicatoria; blue, X. gardneri; dark green, X. arboricola; green, Xanthomonas sp.; light green The dendrogram based on CAZyme family data (Fig. 6) grouped species distinctly in the same general topology as seen in the genome SNP phylogeny (Additional file 1: Figure S1). In contrast to the SNP phylogeny, the CAZyme dendrogram clusters some X. arboricola and X. perforans strains outside of their group. A total of 92 carbohydrate active and facilitator enzyme families were identified, revealing groups present or absent in certain species and clades (Fig. 6, Additional file 5: Table S2). CAZyme families of cell wall degrading enzyme genes identified in BLS-causing Xanthomonas by Potnis et al. [27] were also identified in this dataset (Additional file 5: Table S2).
Fig. 6

Cluster matrix and dendrogram based on number of CAZyme families identified in 147 Xanthomonas genomes. Number of CAZyme families present in each strain is indicated by the red-blue scale of the figure legend (17 families = red, 1 = blue, 0 = white). CAZyme families are listed vertically. Horizontal coloured bars represent species as indicated by SNP phylogeny and ANI. Xanthomonas perforans strains of X. euvesicatoria are indicated separately to highlight differences in cazyme profile

Cluster matrix and dendrogram based on number of CAZyme families identified in 147 Xanthomonas genomes. Number of CAZyme families present in each strain is indicated by the red-blue scale of the figure legend (17 families = red, 1 = blue, 0 = white). CAZyme families are listed vertically. Horizontal coloured bars represent species as indicated by SNP phylogeny and ANI. Xanthomonas perforans strains of X. euvesicatoria are indicated separately to highlight differences in cazyme profile

Discussion

Genomic analysis of BLS-associated Xanthomonas strains revealed diverse groups with distinguishing features that will have implications for future pathogenicity and taxonomic studies. Phylogenetic analysis (SNP, core genome, ANI) supports the close relationship between X. euvesicatoria and X. perforans. An uncharacterised Xanthomonas species (BRIP 62409, BRIP 62411, BRIP 62415 and BRIP 62418) was demonstrated to be distinct from closely related strains of X. arboricola according to the SNP phylogeny and ANI. The effector and CAZyme profiles of species that differ in pathogenicity displayed clear differences that may reflect differences in epidemiology and niche survival.

Taxonomic status of BLS-causing Xanthomonas species

The phylogenies and homologue matrix generated in this study support the current taxonomic status of X. euvesicatoria, X. perforans and X. vesicatoria [28], also confirming previous findings [4] that X. gardneri was not detected in Australian strains of this study. A recent study based on whole genome ANI determined that X. euvesicatoria and X. perforans were not genetically distinct enough to be considered separate species [29]. Our study found an ANI of 98.6% between strains of X. euvesicatoria and X. perforans, supporting these findings. While genetically very similar, strains of X. perforans were still clearly distinguished from X. euvesicatoria and other species in the phylogenetic analyses and the analyses of secretion systems. This may reflect differences in phenotype and pathogenicity, as Australian strains of X. euvesicatoria and X. perforans are generally isolated from capsicum and tomato respectively [4]. This example of genetically similar species being reliably differentiated by other measures is important to consider in the ongoing debate of how to classify bacteria, as a name could reflect phylogenetic groupings or phenotypic (and pathogenic) differences. The core genome phylogenies of the individual BLS-causing species X. euvesicatoria, X. perforans and X. vesicatoria cluster Australian strains in multiple clades with strains from overseas (Fig. 1). Australian X. vesicatoria strains are similar to strains from Italy (LMG 920), Zimbabwe (LMG 919), Macedonia (53 M) and Bulgaria (15b). Australian strains of X. perforans generally resolved in a clade of their own, closely related to the type strain. A subclade clustered BRIP 62398 and BRIP 62397 with xp 4p1s2 and xp 17–12, two strains from Sicily and the USA respectively (Table 1). Australian strains of X. euvesicatoria generally clustered in their own clade, with strains BRIP 62425, BRIP 38997, BRIP 39016, BRIP 62438, DAR34895 and DAR82542 dispersed throughout the phylogeny with overseas strains, generally from the USA. The presence of Australian and overseas strains together in different clades may represent direct introductions of pathogens or the general distribution of the species across the world over time.

Pan genome of collected Xanthomonas species

The homologue matrix of strains in this study reflected the whole genome SNP phylogeny, while also highlighting blocks of unique and shared regions containing hundreds of genes that may be relevant to host specificity, virulence, other phenotypic traits and niche adaptations. This matrix shows that the genomes of X. euvesicatoria strains BRIP 39016 and DAR 26930 were included in the X. perforans group, indicating there may be some recombination events present in these strains/ species. The Australian strains were not significantly differentiated from overseas strains by this matrix, indicating a certain level of species homogeneity distributed across the world. The core genomes of species in this dataset represent conserved functionally essential genes, while the larger accessory and cloud genomes contained genes that may be specific to growth or pathogenicity, particularly as species with different pathogenic capabilities are present in the homologue matrix. Most species analysed individually reflected this trend of large accessory genomes, with the exception of X. gardneri, which was influenced by the small sample size of highly similar strains. The gene discovery plots for the pan genomes of these species showed that the genomes of X. euvesicatoria and X. perforans can be considered closed. Plotting gene discovery indicates there is some potential that additional genomes would result in detection of new genes for X. vesicatoria and X. gardneri. The large accessory genomes observed in many of these species reflects the genetic diversity seen in other studies of Xanthomonas species [3], and also suggests that genome plasticity could result in new genes being added to the population.

Predicted plasmids reconstructed from collected Xanthomonas strains

The seven 150 kb plasmids from X. euvesicatoria had high homology to the X. euvesicatoria plasmid pLMG930.2 of similar size (167,496 bp). Similarly, the 31 kb group of Australian plasmids found in X. euvesicatoria shared homology with X. euvesicatoria and X. citri plasmids pLMG930.4 (GenBank, unpublished) and CFBP6167 plasmid pG [7]. This homology, and their presence in X. euvesicatoria strains (notably absent in X. perforans strains) indicates these 150 kb and 31 kb sequences are previously characterised plasmids. This is likely also the case for plasmids of 47 kb found in X. vesicatoria, as they are similar in size and homology to the X. vesicatoria plasmid pLMG911.2 (CP018727.1, 47 kb). By contrast, three 88 kb plasmids in X. vesicatoria, although similar by sequence homology, are much smaller than the X. vesicatoria plasmid pLMG911.1 (192,558 bp). Plasmids of 40 kb found in X. vesicatoria, X. euvesicatoria and X. perforans were most similar to previously reported plasmids of X. campestris pv. campestris strains CN14 (GenBank: CP017318.1) and CN15 (CP017325.1), and X. perforans pLH3.3 (NZCP018474.1) and pLH3.2 (NZCP018476.1), all of varying sizes. Three plasmids of approximately 28–32 kb were slightly different in size and homology to the 31 kb plasmids, their presence in older and uncharacterised Xanthomonas strains indicating they may be more distantly related. Interestingly, five 17 kb plasmids of X. euvesicatoria did not significantly match any plasmid sequence and were not recovered in the plasmid isolations of BRIP 62416 and BRIP 38997. Evidence for large (80–150 kbp) plasmids was observed in the plasmid extractions, as well as bands that are likely 30–40 kbp in size (Additional file 3: Figure S3). No definitive bands at 8 kbp or 17 kbp were observed in strains BRIP 62858, BRIP 62416 and BRIP 38997, indicating some smaller plasmids may be a result of computational reconstruction. There also appeared to be some plasmids present that were not detected by the selected programs, as in BRIP 62388. False positives and negatives may be a result of integrative conjugative elements or repeat regions that may require further sequencing to fully resolve. Avirulence genes have been found in many described plasmids of most genera, including Xanthomonas pathogens [8]. Sequence with homology to three effectors (AvrBs3, AvrBsT and XopH) were detected in assembled plasmids of ca. 40–47 kb. AvrBsT was detected in most X. vesicatoria plasmids (seven of nine), and has been known to exist on plasmids since its characterisation [9, 30]. As in other studies of plasmid-borne effectors [31], the presence of effectors here demonstrates these circular elements have the potential to influence pathogenicity. XopH, detected in one plasmid of X. perforans, has been suggested as a potential determinant of pathogenicity in X. arboricola pv. corylina [32]. It has also been found in X. campestris pv. campestris [33], and here was found in the majority of X. gardneri and X. vesicatoria chromosomes. Other genes, for example copper tolerance genes, have been found on Xanthomonas plasmids [8], suggesting other significant adaptive genes in addition to effectors may be investigated in future studies.

Effector profiles of Xanthomonas

Many studies have presented core effector lists for Xanthomonas pathogens and found that these effectors are integral to certain strains/species and play key roles in pathogenicity [1, 13, 17, 29, 34]. The effector profiles determined by this study show distinct patterns specific to species and clades. We have revised the core and specific effector list for Xanthomonas species causing BLS and contrast them with species displaying different pathogenic abilities.

Core and specific effectors

Few effectors were found to be shared between closely related phylogenetic groups, a finding consistent with a previous study on the type strains of four BLS-causing Xanthomonas species [27]. Subsequent studies have noted that strains may display different effector profiles to that of the type strain of their species [17], a pattern also observed in this study. The core effectors previously identified in the type strains of the BLS pathogens X. gardneri, X. perforans, X. vesicatoria and X. campestris pv. vesicatoria (X. euvesicatoria) [27] were expanded [17] with the addition of XopE2 and a member of the YopJ family (AvrBsT and XopJ1). Barak et al. [29] further refined the list of core effectors, finding all effectors previously identified [27], with the exception of XopAD that displayed internal stop codons in some X. euvesicatoria strains. The analysis of effectors in species that do not cause BLS provides an opportunity to compare and contrast effector profiles with X. vesicatoria, X. euvesicatoria and X. perforans. The Xanthomonas sp. clade has few effectors, most of which are shared with some X. arboricola and X. vesicatoria strains. Reduced T3SS effector repertoires do not necessarily indicate a lack of pathogenic capability [35], however it is likely these effectors (Additional file 4: Table S1) are not directly involved in pathogenicity on tomato or pepper due to their presence in non-pathogenic strains. The Australian X. arboricola strains in this study have varied effector profiles with few common effectors. As there are relatively few sequenced strains of X. arboricola from different hosts, it is difficult to draw meaningful conclusions about effector profiles in relation to their pathogenicity. The variation observed in these profiles is likely a result of wider host range, presenting a point of contrast to the other groups. In this study, the core effectors AvrBs2, XopAD, XopR, XopX and XopZ1 were found in the majority of strains of X. euvesicatoria, X. gardneri, X. perforans and X. vesicatoria. All of these strains together with those of X. arboricola and the uncharacterised Xanthomonas sp. clade contained rpfA, rpfB and rpfF, members of the rpf gene family that regulate pathogenicity factors and biofilm production [36]. Homologues of AvrBs2, involved in the modulation of effector delivery [37] were also found in all strains of this study. Other effectors previously listed as core to BLS-causing Xanthomonas species were detected in most strains of these species with some exceptions as listed above. Interestingly, many of these effectors are also present in X. arboricola strains and the uncharacterised Xanthomonas clade. For example, XopF1 was only absent in one strain of X. gardneri, but was detected in BLS-causing species as well as most X. arboricola and all strains of the uncharacterised clade. The core effectors XopK, XopL and XopN, were also found in strains not isolated from tomato or pepper, which indicates these proteins may not be associated with specificity to these hosts. No single effector in this study appeared to be consistently associated with pathogenicity on tomato based on comparison with the X. arboricola and Xanthomonas spp. clades. This was also the case for pepper pathogenic strains, though XopAA and XopJ3 were present only in the pepper pathogenic X. euvesicatoria and the non- pepper pathogenic BRIP 39016. The profiles presented here represent homologues in predicted protein sequence, so it is possible inactivation in effector gene sequences play a role in pathogenicity as well. As demonstrated by Barak et al. [29], the core effectors listed above do not determine pathogenicity on tomato due to their presence in an X. euvesicatoria strain isolated from rose. One particular clade of X. arboricola (containing the MAFF strains) shared many effectors with BLS-causing species, further emphasising the need for comprehensive pathogenicity studies to tie effector profile to functional traits.

Effectors and host range of the X. euvesicatoria and X. perforans clades

While X. euvesicatoria is commonly reported as a pathogen of tomato and pepper, all but two X. euvesicatoria strains (BRIP 39016 and DAR 26930) from Australian crops were found in capsicum and chilli [4]. Recent reports indicate it is more common to observe X. perforans (and X. gardneri) in tomato and X. euvesicatoria in peppers [17, 38, 39]. Prior to 1991, X. euvesicatoria was the main BLS pathogen on tomato in Florida [1]. This indicates it was once more common to find X. euvesicatoria in tomato than it is today. As the only Australian X. euvesicatoria strains isolated from tomato were from 1973 and 1976, Australian X. euvesicatoria populations reflect this host shift observed overseas. One X. perforans strain (Xp2010) from Florida displayed dual infecting ability in pepper and tomato [27]. An Australian strain, BRIP 62398, phylogenetically related to Xp2010 did not share this trait, as all tested Australian X. perforans strains were pathogenic only on tomato. While this indicates pathogenicity traits are not necessarily reflected in phylogenies, variation in virulence on pepper of certain phylogenetic groups has been noted [17]. Strains of X. euvesicatoria and X. perforans are genetically similar and share a similar effector profile, while still displaying notable differences. The core effectors XopF1, XopL, XopN, XopQ, XopR, XopX, XopAK, were conserved in the X. euvesicatoria and X. perforans strains in this study as well as in a previous study (that did not include X. perforans strains) [29]. While core effectors may indicate evolutionary history, several studies note that functionality of effector genes must be investigated in addition to presence or absence [17, 29]. Australian strains of X. euvesicatoria displayed almost identical profiles to those of overseas strains, apart from a group of 11 that contain an AvrBs3 homologue along with xe678 and xe685 that likely reflect pathogenicity differences. Australian X. perforans effector profiles were also similar to other X. perforans strains, though they (as well as xp91–118 and xp4p1s2) lack XopE2. Australian X. perforans (excluding BRIP 62397) appear to have XopE3 where all other X. perforans strains do not. These presences and absences may have pathogenicity implications according to the description of the XopE family [40], though this pattern does also reflect their clade groupings in the core genome phylogeny. XopE3 has also been implicated in citrus pathogenicity [41]. Further investigation into the function of these effectors may reveal the significance of these patterns. An effector that has been used to track population changes is the 600 amino acid protein XopAE, which is a fusion of the HpaF and hpaG effectors [27]. The majority of Australian X. euvesicatoria and X. perforans strains contained a XopAE homologue, while four X. euvesicatoria strains (BRIP 62438, BRIP 38997, DAR 34895 and DAR 82542) had hpaG and hpaF as separate effectors. These strains were collected from locations and/ or time points different from the rest of the collection, possibly reflecting different introductions or outbreaks. Barak et al. [29] suggested the presence of the translational mutation and the single alleles represented separate introductions, as they observed in strain LMG918. The difference in effector profiles between strains separated by time has also been noted previously [17] and is reflected in historical strains of X. euvesicatoria in this study, in particular BRIP 39016 and DAR29630 that are also separated geographically.

Effectors of X. vesicatoria and X. gardneri

Strains of X. vesicatoria have a distinct effector profile similar to X. gardneri, which reflects their position in the whole genome SNP phylogeny. The variation of effector profile within X. vesicatoria reflects the phylogenetic clades identified, rather than specific differences in Australian and overseas strains. Homologues of XopAG and XopAI have previously been identified as specific to X. vesicatoria [27]. However, we have shown that homologues of XopAG exist in DAR 33341 (Xanthomonas sp.) and an X. arboricola strain (NCPPB 100457). XopAI was also detected in these and an additional five strains of X. arboricola. Previous studies of the X. gardneri effector profile found differences between the type and other strains, which is also evident in this study [17].

Key CAZymes

Similar to the effector profiles, the cazyme profiles grouped strains mostly into species, highlighting regions of difference. No differences between Australian and overseas strains within species were detected. CAZyme genes and families have been identified previously in the type strains of BLS-causing Xanthomonas species [27] and reflect the profiles seen in this study. Cellulases are known to be common to the gammaproteobacteria [42], and the abundance of GH families was expected. The xylanase families GH10 and GH30 [25, 43] were present in all strains, indicating these groups share similar strategies for degrading plant cells. Identifying core carbohydrate active enzymes needs further investigation into the proteins and genes of these enzyme families [27]. Carbohydrate utilisation reactions have been used as a diagnostic tool for many years to differentiate bacterial species based on substrate usage [23]. It has also been suggested that secreted cell wall degrading enzymes play a role in host adaptation, with several studies linking these enzymes and pathogenicity [35].

CAZyme differences between species

The CAZyme family profiles of the X. euvesicatoria and X. perforans strains were nearly identical, apart from the absence of three families (CE8, CE14, and GH39) in the X. euvesicatoria strains. The absence of these families of plant polysaccharides, acetylases and pectinases highlights a distinct difference between these two groups. The absence of CE8 (a pectin methylesterase family) in X. euvesicatoria strains correlates with their lack of ability to degrade pectin [25, 43]. The X. euvesicatoria strains had mostly glycosyl hydrolase (GH) and polysaccharide lyase (PL) families (PL10, PL3, CBM63, GH16, PL17, PL6, GH4, GH89, GT84) and lacked many CAZyme families present in other species. The CAZyme families PL3, CBM63 and GH16 were found in all X. vesicatoria and X. gardneri strains and many X. arboricola strains. The conserved presence of some CAZyme families that are present or absent in some species may indicate different substrate utilisation capabilities. Groups of cellulose degrading enzymes also display different profiles, which indicates that species have different modes of action on this substrate. The families GH5, GH9 and GH12 were found in most or all of the strains in this study, GH8 and GH6 had a more restricted distribution. In particular, GH6 was identified only in strains of X. vesicatoria and DAR 33341, which may reflect different strategies or evolutionary pathways for degrading cellulose.

Conclusions

This study has provided an overview of the genome structure and content of several Xanthomonas species and expanded the original identification of Australian species associated with BLS. We support the taxonomic status of X. euvesicatoria and X. perforans as one species, though it is clear these strains also have conserved differences that complicate taxonomy. Our analysis of effector proteins and carbohydrate active enzymes links pathogenic data with proteins detected in the genomic analysis, demonstrating that while these profiles differ between species no single pathogenicity factor was identified. It is clear that some differences may also exist in Australian populations regarding effector content. The limitations of bioinformatic reconstruction of plasmids was also highlighted. This study has furthered the understanding of species that cause BLS and provided several points of future study to improve the understanding of Australian bacterial populations.

Methods

Isolate collection, pathogenicity testing and sequencing

Strains of Xanthomonas spp. associated with BLS in Australia were collected as described in Roach et al. [4]. To determine pathogenicity on tomato and capsicum, all isolates (excluding DAR strains as only genomic data was available) were inoculated on susceptible Capsicum annuum var. Yolo Wonder and susceptible Solanum lycopersicum var. Grosse Lisse. Overnight cultures of bacteria were diluted in distilled H20 to concentrations of 1 × 108 cfu/ml and sprayed onto plants until run-off. Pathogenicity was recorded after approximately 7 days. Pathogenicity on host of isolation has been reported [4]. Pathogenicity on the alternative host was observed as small, dark lesions with yellow halo that displayed bacterial streaming. The dataset of 50 Australian strains was comprised of 44 strains held in the Queensland Plant Pathology Herbarium (BRIP) culture collection and six sequenced Australian strains provided by the NSW Plant Pathology Herbarium (DAR) (Table 1). Selected strains of each identified species represented a range of taxa, host and geographical distribution. Strains were grown overnight in lysogeny broth (Luria-Bertani) [44] and the DNA was extracted using a Qiagen DNeasy Blood and Tissue kit (Qiagen; Hilden, Germany). Genomic libraries were prepared using an Illumina Nextera XT Library Preparation Kit according to the manufacturer instructions (Illumina; San Diego, USA). Sequencing was conducted using a Miseq v3 reagent kit on an Illumina Miseq®.

Genome construction

Sequence read adaptors were trimmed with Cutadapt version 1.8.1 and quality trimmed using Trim Galore (q = 25 with ‘paired’ and ‘nextera’ flags) version 0.4.0 [45]. Contigs were assembled with SPAdes version 3.5.0 [46] (with kmers of 127, 117, 107, 97, 87, 77, 67 using the ‘careful’ flag), and annotated with Prokka version 1.11 [47] using the packaged database (using the ‘genus’ and ‘force’ flags). In addition to the 44 sequenced strains, sequence data for 6 Australian strains from DAR were processed with SPAdes and Prokka as above. Genome statistics including GC content, contig number, N50 and genome length were calculated with QUAST version 4.5 [48]. An additional 97 genomes of Xanthomonas strains available in GenBank were downloaded and re-annotated with Prokka (as described above) for standardisation and included in analyses (Table 1). These public genomes represent the majority of sequenced X. arboricola, X. euvesicatoria (X. campestris pv. vesicatoria), X. gardneri, X. perforans, and X. vesicatoria strains in GenBank. Average nucleotide identity of scaffolds was calculated for the entire dataset of 147 genomes with pyani version 0.2.4 [49] using the default settings. Strains were determined to belong to the same species if ANI values were above the 95–96% zone as set in Konstantinidis et al. [50] and utilised in Barak [29].

Plasmid prediction and isolation

Plasmid prediction from the draft genomes of Australian strains was achieved using the plasmidSPAdes option (‘plasmid’ flag) of SPAdes version 3.8.0 [46]. Circular sequences from these assemblies were finished using recycler version 0.62 [51]. Bandage version 0.8.1 [52] was used to view the Recycler paths. The Blast+ algorithm version 2.6.0 [53] was used to compare the plasmid sequences to a custom database of complete Xanthomonas plasmids obtained from GenBank [54]. Plasmid isolation was carried out on a subset of strains (BRIP 38864, BRIP 62858, BRIP 62416, BRIP 62423, BRIP 62388, BRIP 62397, BRIP 63464, BRIP 38997) using the alkaline lysis method described in Chakrabarty [55]. The Pseudomonas strain DC 3000 [56] was used as an extraction control. Strains were grown in LB broth and processed with the described buffers, resuspending the pelleted DNA in distilled H20. Plasmid DNA was visualised on 0.7% agarose gels using standard electrophoresis at 40 V for 4–12 h.

Analysis of genome content

A phylogeny that displays general species relationships was generated using the RedDog pipeline version V1beta.10.3 [57]. Briefly, the RedDog pipeline assembles and aligns raw reads against a reference genome (X. campestris pv. vesicatoria strain 85–10, Table 1), then creates a phylogeny using SNP data generated within the pipeline. Simulated reads were generated for public genomes using WgSim v. 0.3.1-r13 for inclusion in analysis [58]. Support values of the resulting phylogeny calculated by FastTree version 2.1.8 [59] are displayed as a range of 0 to 1. The final tree was annotated in FigTree ver. 1.4.2 [60] and GIMP version 2.8.14 [61]. Roary version 3.8.2 [62] was used to cluster homologues, generate a core alignment (−e and -n flags), and generate the pan-genus homologue matrix. The core alignment was then used to create individual phylogenies of BLS-causing species X. euvesicatoria, X. perforans, and X. vesicatoria with FastTree and annotated with FigTree and GIMP. The homologue matrix was then used to generate pan-genome pie plots using scripts available within the Roary package. Core genes were defined as present in 99–100% strains, soft core genes in 95–99% strains, shell genes in 15–95% strains and cloud genes in 0–15% strains. Gene discovery graphs were plotted using Roary scripts and R version 1.0.136 [62] to determine if the pan-genome was open or closed. These analyses were done for each species as defined in Table 1 (DAR 33341 was not included). The homologue matrix was filtered in R to identify unique genes of each species. Carbohydrate-active and associated enzyme (CAZyme) coding sequences within each genome were identified using HMMER version 3.1b2 [63] and the DbCAN database [64]. The CAZyme hits were then clustered manually and presented as a heat map and dendrogram in R version 3.3.0 [65] using the pheatmap package [66] and annotated in GIMP. T3SS enzyme protein (effector) sequence and select regulatory protein sequence (rpf) were sourced from GenBank as listed in the Xanthomonas resource ‘effector’ page [67] and compared to the genome sequence and reconstructed plasmid sequence using the Blast+ algorithm. Hits were filtered (e-value 0.00001) and presented as a heat map using R and GIMP as described above. Figure S1. Phylogeny of Australian and Genbank genomes based on whole genome SNP data. Australian strains are indicated by BRIP and DAR collection prefixes; all others are public genomes of related species. Type strains are indicated in bold and branch support values are displayed to clade level (measured with the Shimodaira-Hasegawa test). Branch length is indicated by the scale bar. Clade colouring is based on phylogeny and ANI values to assign strains to species. The four Australian strains most closely related to X. arboricola are designated in the text as an uncharacterised Xanthomonas species. (PNG 278 kb) Figure S2. Gene discovery graphs for X. euvesicatoria, X. perforans, X. vesicatoria and X. gardneri plot number of new genes in the species pan-genome as genome number increases. The graph curves demonstrate how many new genes will be added with the addition of more sequenced genomes to estimate pan-genome completeness. X axis: genome number; Y axis: number of new genes. (PNG 45 kb) Figure S3. Standard electrophoresis of plasmid isolations with predicted circular sequence in base pairs below each lane. Ladder = Generuler™ DNA Ladder Mix, ThermoFisher Scientific, Waltham, Massachusetts. The 10 kbp label marks the largest ladder fragment, and the 60–70 kbp label marks the band present in DC 3000 (plasmid extraction control). A) gel was run for approx. 12 h at 40 V B) gel was run for approx. 4 h at 40 V. A and B represent two different extraction experiments (PNG 79 kb) Table S1. Homologues of effector protein families present in all strains of each Xanthomonas species and unique to each species. Effectors listed include all alleles displayed in the effector matrix for each effector family. a present in all strains of a species and possibly present in other strains/ species. b uncharacterised Xanthomonas sp. of four strains. c core to species in Schwartz et al. 2015. d core to species in Potnis et al. 2011. e Xp4B and Xp4p1S2 have truncated HpaG protein annotations, Xp4p1s2 has truncated XopAE protein annotation. f BRIP 39016 has a larger (650 aa) XopAE protein than all others (546 aa). g present in all Australian strains except BRIP 62397 (DOCX 13 kb) Table S2. Cazyme families present in all strains of each Xanthomonas species and families unique to each species. Function is described according to the CAZy database.a all three present in DAR 26930, GH39 present in BRIP 39016, CE14 absent in GEV915. b genes of CAZyme family present in all strains of species. c genes of CAZyme family absent in all strains of species. d genes of CAZyme family present in some strains of species. e absent in 1–3 strains. (DOCX 13 kb)
  3 in total

1.  Comparative Genomic Analysis of the Lettuce Bacterial Leaf Spot Pathogen, Xanthomonas hortorum pv. vitians, to Investigate Race Specificity.

Authors:  Emma Rosenthal; Neha Potnis; Carolee T Bull
Journal:  Front Microbiol       Date:  2022-04-18       Impact factor: 5.640

2.  Responses of Soybean Genes in the Substituted Segments of Segment Substitution Lines Following a Xanthomonas Infection.

Authors:  Jianan Zou; Zhanguo Zhang; Siyang Yu; Qinglin Kang; Yan Shi; Jinhui Wang; Rongsheng Zhu; Chao Ma; Lin Chen; Jieqi Wang; Jianyi Li; Qingying Li; Xueying Liu; Jingyi Zhu; Xiaoxia Wu; Zhenbang Hu; Zhaoming Qi; Chunyan Liu; Qingshan Chen; Dawei Xin
Journal:  Front Plant Sci       Date:  2020-07-02       Impact factor: 5.753

3.  Phylogenetic Analyses of Xanthomonads Causing Bacterial Leaf Spot of Tomato and Pepper: Xanthomonas euvesicatoria Revealed Homologous Populations Despite Distant Geographical Distribution.

Authors:  Upasana Dhakal; Shefali Dobhal; Anne M Alvarez; Mohammad Arif
Journal:  Microorganisms       Date:  2019-10-16
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.