Literature DB >> 30973888

A new Multi Locus Variable Number of Tandem Repeat Analysis Scheme for epidemiological surveillance of Xanthomonas vasicola pv. musacearum, the plant pathogen causing bacterial wilt on banana and enset.

Gloria Valentine Nakato1,2, Juan Luis Fuentes Rojas3, Christian Verniere4, Laurence Blondin4, Teresa Coutinho2, George Mahuku5, Emmanuel Wicker3,6.   

Abstract

Xanthomonas vasicola pv. musacearum (Xvm) which causes Xanthomonas wilt (XW) on banana (Musa accuminata x balbisiana) and enset (Ensete ventricosum), is closely related to the species Xanthomonas vasicola that contains the pathovars vasculorum (Xvv) and holcicola (Xvh), respectively pathogenic to sugarcane and sorghum. Xvm is considered a monomorphic bacterium whose intra-pathovar diversity remains poorly understood. With the sudden emergence of Xvm within east and central Africa coupled with the unknown origin of one of the two sublineages suggested for Xvm, attention has shifted to adapting technologies that focus on identifying the origin and distribution of the genetic diversity within this pathogen. Although microbiological and conventional molecular diagnostics have been useful in pathogen identification. Recent advances have ushered in an era of genomic epidemiology that aids in characterizing monomorphic pathogens. To unravel the origin and pathways of the recent emergence of XW in Eastern and Central Africa, there was a need for a genotyping tool adapted for molecular epidemiology. Multi-Locus Variable Number of Tandem Repeat Analysis (MLVA) is able to resolve the evolutionary patterns and invasion routes of a pathogen. In this study, we identified microsatellite loci from nine published Xvm genome sequences. Of the 36 detected microsatellite loci, 21 were selected for primer design and 19 determined to be highly typeable, specific, reproducible and polymorphic with two- to four- alleles per locus on a sub-collection. The 19 markers were multiplexed and applied to genotype 335 Xvm strains isolated from seven countries over several years. The microsatellite markers grouped the Xvm collection into three clusters; with two similar to the SNP-based sublineages 1 and 2 and a new cluster 3, revealing an unknown diversity in Ethiopia. Five of the 19 markers had alleles present in both Xvm and Xanthomonas vasicola pathovars holcicola and vasculorum, supporting the phylogenetic closeliness of these three pathovars. Thank to the public availability of the haplotypes on the MLVABank database, this highly reliable and polymorphic genotyping tool can be further used in a transnational surveillance network to monitor the spread and evolution of XW throughout Africa.. It will inform and guide management of Xvm both in banana-based and enset-based cropping systems. Due to the suitability of MLVA-19 markers for population genetic analyses, this genotyping tool will also be used in future microevolution studies.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30973888      PMCID: PMC6459536          DOI: 10.1371/journal.pone.0215090

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Plant disease emergences constitute a major threat for global food security worldwide. These emergences may gain in importance due to the increase of plant transnational exchanges, crop intensification, and climate change [1]. For better prevention and control of these emerging diseases, it is essential to understand the origin of plant pathogens, decipher the evolutionary mechanisms leading to pathogen adaptation to new crops, and identify the ecological factors (including the crop system composition) favoring this adaptation. Understanding how the pathogen genetic diversity is distributed and structured over time and space, can allow to infer migration patterns, identify source populations, estimate the efficient population size. Thus, assessing the pathogen genetic diversity and structure can give access to the driving forces of pathogen population dynamics. Moreover, an unbiased assessment of the pathogen genetic diversity is a prerequisite for building up reliable pathogen-informed management strategies, notably involving crop resistance. The exploration of these questions requires the implementation of a molecular epidemiology approach, including properly sampled bacterial collections and adequate genotyping tools. From the early 2000s, an outbreak of Xanthomonas wilt disease caused by the economically important Xanthomonas vasicola pv. musacearum (Xvm, synonym Xanthomonas campestris pv. musacearum) threatened banana (Musa accuminata x balbisiana) and enset (Ensete ventricosum) production within the East and Central African (ECA) countries [2]. The disease affects all cultivated types of both crops, although damaging more severely the ABB type in banana [3, 4]. Despite its economic impact on banana and enset production, little is known about the population biology and epidemiology of Xvm. Several comparative genomics studies revealed the phylogenetic closeness of Xvm to the vasculorum and holcicola pathovars of Xanthomonas vasicola [5, 6], supporting a reclassification of the originally described Xanthomonas campestris pv. musacearum into the species X. vasicola. We will thus refer to the pathogen as Xanthomonas vasicola pv. musacearum (Xvm) in this research. Molecular typing tools including Rep-PCR, random amplification of polymorphic DNAs (RAPDs), enterobacterial repetitive intergenic consensus-PCR (ERIC-PCR) and single-nucleotide polymorphisms (SNPs) were developed to elucidate genetic relationships among Xvm populations [2, 5, 6]. However, due to the low inter- or intra-laboratory reproducibility for the first two methods and to the low sequence diversity contained in Xvm, such markers have been unable to resolve the Xvm evolutionary patterns and invasion routes [7]. SNP typing revealed four haplotypes within the Xvm population from east Africa and Ethiopia, and low levels of polymorphism outside Ethiopia [8]; as an example, populations sampled 10 years apart (2005 and 2014) in Uganda shared the same unique haplotype [8]. With these results, discriminative molecular typing tools for population studies such as Multi-Locus Variable Number of Tandem Repeat Analysis (MLVA) are needed. Such markers have the power to discriminate between closely related strains providing a better understanding of outbreaks and tracing back the pathways of spread from inoculum sources. MLVA is based on the detection of tandem repeat (TR) polymorphisms within the genome of an organism [9, 10]. Tandem repeats (TR) with short repetitive DNA sequences, called microsatellites when smaller than a 9 bp unit, are highly variable within bacterial genomes [11, 12] and may resolve the genetic diversity of monomorphic pathogens [7, 13]. Variability in the number of TRs is mainly generated through slipped strand mispairing during DNA replication [14]. Short TR loci mainly evolve following either a stepwise mutation model (SMM) where new alleles are created by the addition or deletion of a single repeat unit [15] or a generalized two-phase model [16] where the distribution of the numbers of multiple-repeat mutations approximate a geometric distribution [17]. The mutation rates differ between TR loci within a wide range as a result of TR locus specific properties [18]. A high mutation rate increases the likelihood of size homoplasy of TR alleles, i.e. they are identical by state (a same allele size) but have different identity by descent. Size homoplasy may distort the information provided by highly variable loci but this distortion can be minimized by increasing the number of loci and combining markers with different genetic diversity values [16, 19]. MLVAs were first used to characterize highly monomorphic bacterial pathogens within the medical field [20-22], and have now been successfully used to evaluate diversity in plant pathogens such as Ralstonia solanacearum [23-25], Xanthomonas citri subsp. citri [26, 27], Xanthomonas oryzae pathovars [28, 29], Xylella fastidiosa [30], Xanthomonas axonopodis pv. manihotis (Xam) [31], Clavibacter michiganesis [32] and Erwinia amylovora [33]. The advantages of MLVAs include ease of performance, high reproducibility and discriminatory power, portability and rapidity, as well as tremendous reduction in cost since they can be amplified directly from a bacterial colony without need for DNA extraction [31, 34, 35]. In addition, MLVAs allow for analysis of larger numbers of samples and loci due to reduction in sequencing costs [36, 37]. In this study, we describe the development of a new genotyping method for Xvm, based on MLVA. The MLVA scheme was evaluated on a collection of 335 Xvm strains from five countries, and its discriminatory power was compared to the SNP-derived typing method. This work demonstrates the usefulness and power of the MLVA scheme targeting 19 TR loci for monitoring Xvm populations and epidemics at different temporal and geographical scales.

Materials and methods

Bacterial strains

A total of 335 strains of Xvm from known hosts and potential alternative hosts were collected. They were sampled from Ethiopia (n = 122), Uganda (n = 150) and other parts of Eastern and Central Africa (ECA; namely DR. Congo, Kenya, Rwanda, Tanzania, n = 63) (Table 1). Within this collection, 20 strains were obtained from the National collection of Plant Pathogenic Bacteria (NCPPB) (Table 1). All 335 strains were confirmed to be Xvm using GspDm primers [38] and five new Xvm-specific primers [8]. All bacterial strains were grown on Wilbrink or YPGA medium for 48h at 28°C and stored in W broth/glycerol or YPGA slants at -80°C.
Table 1

Summary of the collections of Xanthomonas vasicola pv. musacearum.

CountryYearSNP-based haplotypeStrain number
Part 1: Collection genotyped with SNPs and MLVA (n = 63)
DRC2015Hap_13
Ethiopia1966Hap_11a
Ethiopia1967Hap_11a
Ethiopia2004Hap_11
Ethiopia2004Hap_3a2
Ethiopia2004Hap_3b5
Ethiopia2004Hap_44
Ethiopia2015Hap_31
Rwanda2005Hap_11a
Rwanda2015Hap_14
Tanzania2015Hap_215
Uganda2014Hap_225
Part 2: Collection genotyped with MLVA only (n = 272)
Burundi2007-1a
DRC2005-2a
DRC2015-29
Ethiopia2017-103
Ethiopia2004-2
Ethiopia2015-2
Kenya2006-1
Rwanda2005-3a
Tanzania2007-4a
Uganda2005-6a
Uganda2012-78
Uganda2014-6
Uganda2016-35

a Strain obtained from the National Collection of Plant Pathogenic Bacteria (NCPPB).

a Strain obtained from the National Collection of Plant Pathogenic Bacteria (NCPPB). Furthermore, the specificity of the MLVA scheme was assessed on phylogenetically close strains. Since Xvm genomes are most similar to members of the species X. vasicola, we tested a collection of reference strains of X. vasicola pv. vasculorum (n = 6), X. vasicola pv. holcicola (n = 5), originating from Africa, Reunion Island, and USA (DNA kindly given by J. Lang—Colorado State University) (Table 2). Other Xanthomonas species, phylogenetically closely related to X. vasicola (Jacques et al., 2016) were tested: Xanthomonas oryzae pv. oryzae (n = 2), Xanthomonas campestris pv. cannabis (n = 1), and Xanthomonas citri pv. citri (n = 1).
Table 2

Core-collection of Xanthomonas vasicola pv. musacearum and Xanthomonas vasicola strains.

CodeCFBPNCPPBSpeciesHostCountryLocationYear of isolationGenomeaXvm sublineaged
Core-collection Xanthomonas vasicola pv. musacearum
CIX 319CFBP7123NCPPB2005X. vasicola pv. musacearumEnsete ventricosumEthiopiaNA1966AKBE00000000.1bI
CIX 318CFBP7122NCPPB2251X. vasicola pv. musacearumMusa sp.EthiopiaNA1967LGTY00000000.1bI
CIX 308CFBP7166NCPPB4378X. vasicola pv. musacearumMusa sp.UgandaNA2005NAUnknown
CIX 309CFBP7167NCPPB4386X. vasicola pv. musacearumMusa sp.UgandaNA2005NAUnknown
Xvm 4387CFBP7168NCPPB4387X. vasicola pv. musacearumMusa sp.DR. CongoNA2005SRR494494.1bI
CIX 310CFBP7169NCPPB4388X. vasicola pv. musacearumMusa sp.DR. CongoNA2005NAUnknown
CIX 311CFBP7170NCPPB4389X. vasicola pv. musacearumMusa sp.RwandaNA2005SRR494495.2bI
CIX 312CFBP7171NCPPB4390X. vasicola pv. musacearumMusa sp.RwandaNA2005NAUnknown
CIX 317CFBP7172NCPPB4391X. vasicola pv. musacearumMusa sp.RwandaNA2005NAUnknown
CIX 316CFBP7173NCPPB4392X. vasicola pv. musacearumMusa sp.TanzaniaNA2007AKBI01000000.1bII
CIX 313CFBP7174NCPPB4393X. vasicola pv. musacearumMusa sp.TanzaniaNA2007NAUnknown
CIX 314CFBP7175NCPPB4394X. vasicola pv. musacearumMusa sp.TanzaniaNA2007AKBJ00000000.1bII
CIX 315CFBP7176NCPPB4395X. vasicola pv. musacearumMusa sp.TanzaniaNA2007SRR494490.2bII
Xvm 4433NANCPPB4433X. vasicola pv. musacearumMusa sp.BurundiNA2007SRR494496.1bII
Xvm 4434NANCPPB4434X. vasicola pv. musacearumMusa sp.KenyaNA2006SRR494497.1bII
Collection X. vasicola
19NANCPPB 1241X. vasicola pv. holcicolaSorghum vulgareAustraliaNA1962NA
423NANAX. vasicola pv. holcicolaSorghum bicolorSouth AfricaNANA
453NANAX. vasicola pv. holcicolaUSANebraskaNA
CFBP 2543CFBP 2543NCPPB 2417X. vasicola pv. holcicolaSorghum vulgareNew ZealandNA1969JSBW00000000.2c
989NANCPPB 989X. vasicola pv. holcicolaHolcus sp.USAKansas1961NA
454NANAX. vasicola pv.vasculorumSaccharum officarumMauritiusNANA
387NANAX. vasicola pv.vasculorumZea maysUSANebraskaNA
444NANAX. vasicola pv.vasculorumZea maysUSAKansasNA
459NANAX. vasicola pv.vasculorumZea maysUSAColoradoNA
Xvv 1381NANCPPB 1381X. vasicola pv.vasculorumSaccharum officinarumZimbabweNA1962AKBL00000000.1c
Xvv 0206CFBP 7162NCPPB 206X. vasicola pv. vasculorumZea maysSouth AfricaNA1948AKBM00000000.1c

a Genome identified by GenBank accession number, or SRA accession number. NA: not available.

b Source: Wasukira et al 2012.

c Source: Sapp M. and Studholme D, unpublished.

d Xvm sublineages (SL) as named by Wasukira et al. (2012).

a Genome identified by GenBank accession number, or SRA accession number. NA: not available. b Source: Wasukira et al 2012. c Source: Sapp M. and Studholme D, unpublished. d Xvm sublineages (SL) as named by Wasukira et al. (2012).

DNA extraction

Strains were first grown on a carbon source-free agar medium (Yeast extract-Peptone-Agar) at 28°C for 24h, and DNA extracted using the Wizard Genomic DNA Purification kit (Promega, Charbonnières-les-Bains, France). DNA quantification and quality control were performed using a Tecan Infinite 200 NanoQuant microplate reader (Tecan Trading AG, Switzerland); DNA was diluted to a final concentration of 10 ng.μL−1 using milliQ water, and stored at −20 °C until use.

In silico identification of MLVA loci

Nine Xvm genomes (NCPPB 2005, 2251, 4379, 4380, 4381, 4384, 4392, 4394, and 4434) from the National Centre for Biotechnology Information (NCBI) public database were screened for VNTR loci using the online tool ‘Polloc-V’ (http://bioinfo-web.mpl.ird.fr/xantho/utils/) developed by Luis-Miguel Rodriguez-R and Ralf Koebnik. We used Tandem Repeat Finder, TRF [39] as a TR detection algorithm proposed with the following selection criteria: full loci size from 50 to 400 bp; pattern size from 5 to 9 bp; number of repetitions set to 6 or above; stringency set to maximal values 2 (match) -7 (mismatch) -7 (indel); minimum percentage of similarity between repeat units: 80%. In prokaryotic genomes, short tandem repeats (1–4 nt) are very rare in large genomes with high GC-content [40]. A preliminary screen gave no results for pattern size below 5, so we further focused on larger motifs. Polloc-V identifies groups of loci based on the similarity rate of their flanking regions. After visual inspection of the 36 loci groups identified, 32 groups were further retained based on the criteria that: each was present in a single locus per genome, with a locus of the same sequence. For each identified locus, the repetition sequences of the motif and the 500 bp-upstream and downstream flanking regions were transferred and concatenated using Geneious v.9.2 [41], to design PCR primers.

Definition and selection of the PCR primers

PCR primers (20- to 27- nucleotides long) were designed in the flanking regions of the tandem repeat sequence using Geneious v. 9.2. Melting temperatures were set around 68 °C to allow downstream primer multiplexing with the QIAGEN Multiplex kit. Primer design parameters were set to be stringent, to avoid the formation of primer dimers and hairpins and to allow downstream multiplexing, and the annealing temperature parameter was set at Melting Temperature minus 4°C (Tm—4°C). Using BLASTN [42] under Geneious, primers and their corresponding TRs were searched for within the nine Xvm reference genomes to determine the location of each locus (inter- or intragenic), and to verify that (i) each motif corresponded to a single locus per genome, and (ii) the locus-corresponding primers exactly targeted the genomic region containing the locus. Only primers fulfilling these criteria were selected for subsequent analyses. Hence, 21 loci and their corresponding primers were selected for development of the MLVA scheme.

Preliminary PCR screening

This first screening assessed typeability (ability to amplify all strains of a given lineage or species), reproducibility, and polymorphism of the identified loci. These criteria were assessed on a core-collection comprising of 15 Xvm strains from Burundi, DR. Congo, Ethiopia, Kenya, Rwanda, Tanzania and Uganda representing the geographical distribution of the pathogen. PCR amplifications were done using the Multiplex PCR kit (Qiagen) in a total volume of 15μL. PCR cycles consisted of one initial denaturation (95°C for 15 minutes) to activate the « hot start » Polymerase, followed by 25 cycles of denaturation at 94°C for 30s, annealing (55°C to 62°C) for 90s, elongation at 72°C for 90 s; and a final elongation step at 60°C for 30 minutes. Electrophoresis of PCR products was done on 3% agarose gel at 100V for 45 minutes. From this screening, 19 polymorphic loci and primer pairs were retained for the following steps.

Primer multiplexing

The multiplexing consisted of three to four loci mixes per PCR reaction. Each of the multiplex PCR reactions was optimized by testing three hybridization temperatures (57, 60 and 63°C) per mix, according to QIAGEN indications, on seven bacterial strains of the core Xvm collection. The optimal hybridization temperature was determined by visualization of amplicon intensity on 3% agarose gel electrophoresis. The combinations of the different loci in each reaction mix were chosen according to the size ranges of the PCR products, in order to avoid overlapping fragment sizes.

Genotyping on ABI3500 capillary sequencer

The "forward" primers of each pair were labeled with a fluorophore: 6-FAM, blue (Eurogentec, Angers, France); VIC, green; NED, black; or PET, red (Applied Biosystems, Life Technologies, Saint Aubin, France). The labeling of the different primers was chosen according to the size and intensity of each PCR product: the NED and PET fluorophores being assigned to the smaller fragments, 6-FAM to the primers giving fragments of weak intensity, and VIC to primers giving larger fragments. Pools of four pairs of labeled primers corresponding to each locus were established (Table 3) and tested in multiplex PCR, using the Multiplex PCR kit (QIAGEN, Courtaboeuf, France) according to manufacturer recommendations. Reaction mixtures (15 μL) consisted of 0.2 μM of each primer (forward primer labelled with one of the fluorescent dyes 6-carboxyfluorescein FAM, NED, PET, and VIC, 2X QIAGEN Multiplex MasterMix, 5X Q-solution and 2 μL of bacterial genomic DNA (10 ng.μL−1). PCR reactions consisted of an initial denaturation step of 15 min at 95 °C; 25 cycles of 30 s at 94 °C, at annealing temperatures of either 60 or 63°C, 90 s at 72 °C, and a final 30 min step at 60 °C. Each PCR product was diluted to 100−1 and 1.5 μL of diluted PCR product was added to 1.5 μL Hi-Di Formamide (for GeneScan -500 LIZ) and 12 μl GeneScan -500 LIZ internal size standard (Applied Biosystems). The 100-fold dilution was chosen following preliminary tests of different PCR loading volumes, because it gave peaks of good intensity (3000–10000 fluorescence units) with no stutter peaks and rare fluorescence saturation phenomena. This was done to avoid peaks saturating the electropherogram and enable accurate analysis. Capillary electrophoresis was conducted on the ABI3500XL DNA Analyzer 24-channel sequencer (Applied Biosystems).
Table 3

Microsatellites, primer description and amplification conditions of the MLVA-19 scheme.

LocusPrimerSequence (5’– 3’)Genome of originTandem repeat sequenceProduct SizeMultiplex mixTan (°C)a
XVM022XVM022_289FTGTCCGCATATCCAGCACGCNCPPB4379GGCTGCT313MIX0160
XVM022_601RGATATCCCAGCCGCACGTCTTG
XVM002XVM002_353 FCCAGGCCACCACGTAATTCAGTCAGGNCPPB4379TCGCTG236
XVM002_588 RCGAACTGCAAAAGCCAAGCCAGAG
XVM028XVM028_459FTGAGGGCAACTAGATCGACGGGTTCNCPPB4379GGGAATC200
XVM028_658RCAGACGGATTTGTTCAACGCATCGCAT
XVM029XVM029_310FCGATATTGGGGTTCTGGCTAGGGTCNCPPB2005TTGCAC375
XVM029_684RAAGTGACGTTTGAGGGGCGC
XVM020XVM020_450FCGTTATTGATCTGACGTATTGCCCATCGNCPPB4380GACGCAC365MIX0260
XVM020_814RACTTCATGCCACCCACGTTGC
XVM030XVM030_333 FTGGTGGATGGATGGGTGTTGGTGGTNCPPB2005TTGTTGC250
XVM030_582 RCCCCGGAGAAGCAAGAACCTAGAACCT
XVM027XVM027_367 FGCTCCCGATCCAACGCTTGCTCATGNCPPB4379AGAGCCG228
XVM027_594 RCGCTGCTCCTGGTTCAATTTCCCGATT
XVM015XVM015_460FCGACCAGACCGCCTTGTTCAGAGAAATNCPPB4379AGCGCACGG249
XVM015_708RGGGATGGTGTTGCTGATGTGGTTTTGC
XVM016XVM016_295FACTTCTCCACGCCTCTGTTTGCCNCPPB4384GGCTATT377MIX0363
XVM016_671RGATCTTAACGCTTCCTTGACATCGGC
XVM021XVM021_454FGTCGTTGAAGCGTTCCATGAAGCCGNCPPB4379CTTCTGCG238
XVM021_691RTGTCCTTGGATGAACAAAAGCCCTCGA
XVM035XVM035_419FTTGAATCCAACGGTGCCCTGTCCNCPPB4380GCACCAA225
XVM035_643RGTGCCATGTGTTTCCCCTAGTGTGC
XVM024XVM024_387FCGATCCCAACTCGCCGATGANCPPB4379GCATCGT323
XVM024_709RCGTACTTCAAGATCACCGCAGAGCAT
XVM006XVM006_107 FGGTAGCGGTGTGGGTTGCGAAGACNCPPB4379GGGATTC427MIX0463
XVM006_533 RGGCTACGAGGTGGATGTGCAGGTG
XVM018XVM018_440FGAACTGCTGTAACCGTCGATTGCCTCNCPPB4434TGAGTGC269
XVM018_708RGCGTCACCTACTCCGTTGCCAGAT
XVM038XVM038_404FACGGTAGTAATGGGCAGCAGGGTGNCPPB4379CGGTGGTGGCTT227
XVM038_630RCGGTGTCGTTCGAGAAGCTCAAGATAGA
XVM014XVM014_402FAGGTTCCAGGTCACGCAGATTCTTGTNCPPB4384GAATTGG316
XVM014_718RGACTGGTGTGGATGGGCGTTCT
XVM005XVM005_376FAAGCAGCCACGGAAAGGACAGGNCPPB2005CGCCAG362MIX0563
XVM005_737RTGACCACTGCCGCACACCAA
XVM036XVM036_164 FCTGGCTGCTCAAGGACATCACCAACCNCPPB4384TCCCGAA386
XVM036_549 RGGTAGCGGTGTGGGTTGCGAAGA
XVM012XVM012_223FTGCGTACCGAACTCTGTGGCTACNCPPB2005TGGCGG376
XVM012_598RGGTTGTGCGAACTTTTGTCGTGCTAC

a Tan: annealing temperature.

a Tan: annealing temperature. Analyses were conducted at the GenSeq technical facilities of the « Institut des Sciences de l’Evolution de Montpellier »—Labex CEMEB “Centre Méditerranéen de l’Environnement et de la Biodiversité”.

Analysis of a core-collection using SNP-derived RFLP markers

A collection of 63 strains was analyzed using the MLVA-19 scheme and SNP-derived RFLP typing tools. Two sets of SNP-derived markers were used. The first set, named WAS-SNPs, targeting 500-bp loci was adopted from Wasukira et al. [6] while the second set, named VN-SNPs, targeting between 200 to 600-bp were newly designed to further characterize the Xvm population [8].

Data analysis

Data scoring

Electrophoregramms were analyzed with GeneMapper 4.0 (Applied Biosystems). Peaks were first automatically detected using the analysis panels we defined for each mix. Each peak was then carefully checked by eye, and false peaks (rare artefacts due to fluorescence saturation) were discarded. The reproducibility of the allelic patterns was checked by running several DNA extractions of the strains NCPPB2005 and NCPPB2251, as well as duplicate analyses of eight Ugandan Xvm DNAs. Fragment sizes obtained for each TR locus were transformed to tandem repeats numbers. Subsequently, the allele sizes were transformed into repeat numbers using a home-made script using R version 3.4.0 [43]. The tandem repeat numbers obtained were rounded up to the next integer [12]. All alleles scored by size, and their corresponding “raw” and rounded repeat numbers, are summarized in the S2 Table in Supplemental Information.

Analysis of genetic data

The typeability and specificity of each MLVA locus to Xvm, pathovars of X. vasicola, and other Xanthomonas species were evaluated by comparing the percentage of strains amplified. Principal component analysis (PCA) was performed using the FactoMineR package [44] in R to estimate the contribution of each locus and how they account for the genetic variability described in the current Xvm collection. We estimated the genotypic resolution of the MLVA scheme [45] and described the genotypic diversity in relation to different combinations of TR loci by a genotype accumulation curve using R (R::poppr:: genotype_curve [46]). The curve is generated by sampling x loci randomly and counting the number of multilocus genotypes (MLG) observed. This sampling is repeated r times from 1 to n-1 loci, creating n-1 distributions of observed MLGs [47].

Reconstructing evolutionary relationships across Xvm African haplotypes

Haplotype networks were constructed using the algorithm combining global optimal eBURST (goeBURST) and Euclidean distances in the Phyloviz 2 software [48]. It allowed the visualization of the different clonal complexes (groups of haplotypes differing by a single locus, or Single locus variant (SLV)). The mutation model followed by the MLVA molecular markers was estimated by looking at the locus variation of recently diverging haplotypes, i.e. single-locus variants (SLV) and double-locus variants (DLV), along the haplotype network of the minimum spanning tree. Furthermore, the number of TR repeats involved in the mutation event was examined to determine whether the stepwise mutation model (SMM), i.e. addition or deletion of a single repeat, was supported for these TR loci.

Comparison of the discriminatory power and congruence of the typing techniques

MLVA-19 and SNP-derived RFLP typing techniques were compared using the Hunter and Gaston discriminatory Index (HGDI) [49]. Distance matrices calculated from each MLVA-19 and SNP dataset were calculated and compared using the Mantel test performed by the CADM.post function of the R package ape 5.0, with 9,999 permutations [50]. The Mantel correlation coefficients were computed on rank-transformed distance matrices.

Genetic structure

The genetic structure of the Xvm population was assessed by Discriminant Analysis of Principal Components (DAPC) using the adegenet package for the R software [51-53] since DAPC is free of any assumption linked to a population genetic model (such as Hardy-Weinberg equilibrium or absence of linkage disequilibrium). The number of clusters was assessed using the function find.clusters, which runs successive k-means clustering with increasing number of clusters (k) and the optimal number of clusters selected based on lowest Bayesian information criterion (BIC) [52]. Eleven independent k-means and DAPC runs were performed to assess the stability of clustering.

Deposition to MLVABank website

The MLVA-19 allelic profiles were deposited to the MLVA website dedicated to plant bacterial pathogens http://bioinfo-web.mpl.ird.fr/MLVA_bank/Genotyping/, corresponding to the MLVAbank at http://microbesgenotyping.i2bc.paris-saclay.fr/, to make MLVA-19 data accessible in an interactive way [54]. The website allows viewing databases with sorting and clustering options, submitting queries, and sharing databases which are maintained and managed by different owners once a common agreement is achieved among partners [26].

Results

The MLVA scheme MLVA-19 is based on 19 highly polymorphic loci that are evenly distributed on the genome

From the in silico screening of VNTR loci and corresponding primers (detailed in Materials and methods), 21 loci were selected that were unique in each genome, and whose corresponding primers were specific to the locus flanking regions. From the initial screening of a representative Xvm core-collection strains from different countries (n = 15), 19 loci out of the 21 tested were polymorphic, with two to four alleles per locus. Loci XVM013 and XVM023 did not amplify in any strain and were therefore excluded from the downstream analyses. The primers targeting the 19 loci were multiplexed in sets of either four- or three-loci mixes (Table 3). All but one (XVM038) were considered as microsatellite loci, with motif sizes ranging from 6 to 9 nucleotides. The majority (12 of 19) consisted of 7 nucleotide repeats, and most (12 of 19) had an intergenic location (Table 4). Most loci were evenly distributed on the Xvm genome (S1 Fig). The distance between two adjacent loci ranged from 26.9 kb to 1293.7 kbp, except between XVM006 and XVM036 (652 bp), and between XVM030 and XVM002 (1835 bp).
Table 4

Nomenclature, location, function and genetic diversity of the 19 TR loci retained in the MLVA scheme of Xanthomonas vasicola pv. musacearum.

The basic statistics were obtained from the Xvm collection (n = 335).

LocusOfficial nomenclatureaLocation (intra /intergenic)bCoordinates in NCPPB4379 (bp)cRepeat sizeBasic statisticsdTR numbers range
StartEndHENA
XVM002NCPPB4379_3.746_6_61_10.2Hypothetical protein2152939215221760.73216–35
XVM005NCPPB2005_18.431_6_36_6intergene2519193251814660.672–8
XVM006NCPPB4379_0.145_7_67_9.6intergene1288216128892670.63147–20
XVM012NCPPB2005_28.458_6_36_6intergene72884272830760.62102–14
XVM014NCPPB4384_7.903_7_72_10.3intergene699187099670.66135–25
XVM015NCPPB4379_7.618_9_61_6.8intergene32277232383290.66133–27
XVM016NCPPB4384_1.353_7_53_7.6intergene3154338315539070.5676–12
XVM018Kenyan_21.741_7_44_6.3intergene51238651342270.6964–10
XVM020NCPPB4380_14.211_7_61_8.7intergene4481478448202970.85165–21
XVM021NCPPB4379_28.913_8_64_8Hypothetical protein4175076417613980.82104–13
XVM022NCPPB4379_30.295_7_55_7.9Hypothetical protein4220957422149970.6485–13
XVM024NCPPB4379_20.274_7_90_12.9intergene3580348357925970.79114–14
XVM027NCPPB4379_47.799_7_63_9RNA binding protein3745391374645370.92257–32
XVM028NCPPB4379_7.722_7_57_8.1Hypothetical protein4397186439824970.77155–21
XVM029NCPPB2005_15.36_6_48_8Hypothetical protein3936107393714860.6175–13
XVM030NCPPB2005_4.034_7_42_6intergene2151016215171970.77176–31
XVM035NCPPB4380_1.724_7_59_8.4intergene3310409330937270.5674–10
XVM036NCPPB4384_0.443_7_67_9.6intergene1288274128898470.63167–23
XVM038NCPPB4379_2.022_12.00_86_7.20Hypothetical protein41481854149270120.555–9

a Loci were named according to their genome of origin, physical position (kb), TR unit size, total length in the genome of origin, number of repeats [55].

b The intra/intergenic location was assessed on all nine Xvm genomes available.

c The coordinates are based on the Xvm complete genome NCPPB4379 (ASM27789v2, https://www.ncbi.nlm.nih.gov/assembly/GCF_000277895.2), composed of a 4,79 Mb-chromosome and a 49.4 kb-plasmid named pXCM49; all loci were located on the chromosome.

d HE, Nei’s gene diversity; NA, number of alleles.

Nomenclature, location, function and genetic diversity of the 19 TR loci retained in the MLVA scheme of Xanthomonas vasicola pv. musacearum.

The basic statistics were obtained from the Xvm collection (n = 335). a Loci were named according to their genome of origin, physical position (kb), TR unit size, total length in the genome of origin, number of repeats [55]. b The intra/intergenic location was assessed on all nine Xvm genomes available. c The coordinates are based on the Xvm complete genome NCPPB4379 (ASM27789v2, https://www.ncbi.nlm.nih.gov/assembly/GCF_000277895.2), composed of a 4,79 Mb-chromosome and a 49.4 kb-plasmid named pXCM49; all loci were located on the chromosome. d HE, Nei’s gene diversity; NA, number of alleles. Of the 19 loci, 12 contained perfect repeat motifs (no variation of the TR sequence) among which six were interrupted in 3’ (S1 Table). Seven loci contained imperfect repeat motifs (XVM006, XVM014, XVM016, XVM024, XVM029, XVM036, XVM038) with alternative TR sequence displaying only one mismatch from the reference sequence (mostly transitions between Thymine and Cytosin). Loci XVM027, XVM002, and XVM030 were the most polymorphic, whereas XVM018 and XVM038 were the least polymorphic (Table 4). There was no clear relationship between polymorphism and motif size (S2 Fig), nor with location on the genome, although the least polymorphic locus XVM038 had the greatest motif size (12 bp) and considered a minisatellite.

The MLVA-19 scheme allowed for good genotypic resolution within Xvm

Contribution of each MLVA marker to the scheme was determined from the Principal Component Analysis (PCA). The first three dimensions explained 66.5% of variance, with axes 1 and 2 being the most informative (respectively 41.9% and 16.88%). Loci XVM030, XVM016, XVM024, XVM022 and XVM020 contributed most to the axis 1, while XVM035, XVM018, XVM012, XVM006 and XVM036 contributed most to axis 2 (S3 Fig); loci XVM002 and XVM028 contributed most to axis 3 (S3 Table, S4 Fig). Some variables were correlated, e.g. XVM006 and XVM036; XVM002 and XVM022; XVM005 and XVM030; XVM015 and XVM038; XVM012 and XVM028 (S3 and S4 Figs). The genotypic resolution of the MLVA-19 scheme is represented by the genotype accumulation curve (Fig 1). Our set of loci has been shown to be sufficient to accurately resolve the different haplotypes in our sample as the curve reached a plateau with 19 loci. The genotype accumulation curve revealed that more than 90% of the genotypes could be detected with 16 markers, hence the MLVA-19 scheme accurately estimates the clonal diversity of our sample (Fig 1).
Fig 1

Genotype accumulation curve for 335 strains of Xanthomonas vasicola pv. musacearum genotyped over 19 loci.

The horizontal axis represents the number of loci randomly sampled without replacement up to n-1 loci, the vertical axis shows the number of unique multilocus genotypes observed (n = 208) in the data set. The level of 95% of unique multilocus genotype detected is indicated with a dotted red line.

Genotype accumulation curve for 335 strains of Xanthomonas vasicola pv. musacearum genotyped over 19 loci.

The horizontal axis represents the number of loci randomly sampled without replacement up to n-1 loci, the vertical axis shows the number of unique multilocus genotypes observed (n = 208) in the data set. The level of 95% of unique multilocus genotype detected is indicated with a dotted red line.

Most of the mutations of our MLVA-19 loci involve single repeat events

In order to estimate the mutation model of the MLVA microsatellite marker evolution, we looked at the variation of repeat numbers between recently diverging haplotypes, i.e. SLVs within the clonal complexes and also with double-locus variants (DLV); most loci were analyzed on more than five evolutionary steps. Thirteen loci (XVM002, 5, 6, 12, 14, 20, 21, 22, 24, 27, 28, 35, and 38) revealed that SLVs resulted from more than 60% of single TR variation, with eight above 80%. XVM012 has a majority (44.44%) of single steps, and XVM014 had 50% of both single and double repeat variation. The locus XVM015 displayed a majority of multiple repeats variations ranging from 2 to 13 repeats. Such events involving large number of repeats could result from recombination mechanism [56]. The loci XVM016, XVM018, XVM029, XVM030 and XVM036 were not involved in SLVs or DLVs. But for those loci, almost all the alleles or number of repeats fulfilled the allelic range observed within our collection (S2 Table).

MLVA-19 loci are highly typeable on Xvm and specific to X. vasicola

The typeability of most loci was excellent when used on Xvm strains (n = 335), with 18 of the 19 loci amplifying in more than 97% of strains. On the other hand, the locus XVM 015 was amplified in 90.4% of strains (Table 5).
Table 5

Typeability and specificity of each XVM locus considering X.

vasicola pv. musacearum (Xvm), X. vasicola pathovars holcicola (Xvh), and vasculorum (Xvv), and other Xanthomonas species: X. oryzae pv. oryzae (Xoo); X. campestris pv. cannabis (Xcc); X. citri pv. citri (Xc), estimated by the number (percentage) of amplified strains.

LocusStrains amplified within
Xvm(n = 335)Xvv(n = 5)Xvh(n = 5)Xoo(n = 2)Xcc(n = 1)Xc(n = 1)
XVM002335 (100)2 (40)3 (60)0 (0)0 (0)0 (0)
XVM005335 (100)2 (40)0 (0)0 (0)0 (0)0 (0)
XVM006327 (97.6)1 (20)3 (60)1 (50)0 (0)0 (0)
XVM012328 (97.9)0 (0)0 (0)0 (0)0 (0)0 (0)
XVM014332 (99.1)3 (60)1 (20)0 (0)0 (0)0 (0)
XVM015303 (90.4)5 (100)0 (0)0 (0)0 (0)0 (0)
XVM016332 (99.1)4 (80)2 (40)0 (0)0 (0)0 (0)
XVM018333 (99.4)0 (0)1 (20)0 (0)0 (0)0 (0)
XVM020333 (99.4)3 (60)2 (40)0 (0)0 (0)0 (0)
XVM021332 (99.1)2 (40)1 (20)0 (0)0 (0)0 (0)
XVM022334 (99.7)5 (100)3 (60)0 (0)0 (0)0 (0)
XVM024335 (100)5 (100)0 (0)0 (0)0 (0)0 (0)
XVM027332 (99.1)3 (60)1 (20)0 (0)0 (0)0 (0)
XVM028335 (100)3 (60)2 (40)0 (0)0 (0)0 (0)
XVM029333 (99.4)5 (100)3 (60)0 (0)0 (0)0 (0)
XVM030334 (99.7)4 (80)2 (40)0 (0)0 (0)0 (0)
XVM035332 (99.1)4 (80)0 (0)0 (0)0 (0)0 (0)
XVM036326 (97.3)2 (40)2 (40)1 (50)0 (0)0 (0)
XVM038333 (99.4)3 (60)0 (0)0 (0)0 (0)0 (0)
Total33555211

Typeability and specificity of each XVM locus considering X.

vasicola pv. musacearum (Xvm), X. vasicola pathovars holcicola (Xvh), and vasculorum (Xvv), and other Xanthomonas species: X. oryzae pv. oryzae (Xoo); X. campestris pv. cannabis (Xcc); X. citri pv. citri (Xc), estimated by the number (percentage) of amplified strains. Most of the TR loci of the MLVA scheme for Xvm amplified at least a few strains of X. vasicola pv. vasiculorum (17/19 loci) and pv. holcicola (13/19 loci) (Table 5). However, specificity of the loci was variable between Xvm and X. vasicola. XVM012 was the only Xvm-specific locus and was not amplified in any other X. vasicola pathovar. Four loci (XVM002, XVM006, XVM022, XVM029) amplified 60% of strains belonging to the pathovar holcicola (n = 5), while amplification in X. vasculorum ranged between 20% and 40%. Seven loci (XVM015, XVM016, XVM022, XVM024, XVM029, XVM030, and XVM035) amplified 40 to 100% of pathovar vasculorum strains (n = 5) that were tested (Table 5). Importantly, some loci produced alleles that were both present in Xvm and another pathovar of X. vasicola (XVM002, XVM005, XVM006, XVM014, XVM015, XVM020, XVM027, XVM028, XVM029, and XVM036). For instance three alleles of the locus XVM006 were shared by Xvm and Xvh, while one allele was shared by Xvm and Xvv (S2 Table, summarizing all alleles by size and repeat numbers). No locus or very few loci were amplified with other Xanthomonas species used in this study.

MLVA-19 is congruent with the SNP-RFLP based typing method but more discriminative

We estimated the congruence between the MLVA-19 scheme and the SNP-derived typing method from Wasukira et al.[6] by three complementary approaches. We first assessed the distribution of SNP sublineages and SNP-based haplotypes on the goeBURST minimum spanning tree drawn from the 53 MLVA-19 haplotypes. Two MLVA clusters were consistent with the SNP-sub-lineages SLI and SLII described by Wasukira et al. (Fig 2). Moreover, the MLVA19-haplotypes corresponding to SNP-haplotypes 3 and 4 are also clustered together.
Fig 2

Minimum spanning tree (MST) of Xanthomonas vasicola pv. musacearum based on SNP haplotypes identified using goeBURST in PHYLOVIZ.

Each sequence type (ST) is displayed as a circle with a size proportional to the number of strains by which is represented. The different colors indicate the SNP haplotype. Numbers on the branches indicate the number of locus differences between the neighboring haplotypes.

Minimum spanning tree (MST) of Xanthomonas vasicola pv. musacearum based on SNP haplotypes identified using goeBURST in PHYLOVIZ.

Each sequence type (ST) is displayed as a circle with a size proportional to the number of strains by which is represented. The different colors indicate the SNP haplotype. Numbers on the branches indicate the number of locus differences between the neighboring haplotypes. We also assessed whether the 12 genetic clusters identified by DAPC (named DAPC clusters, and described in detail below) based on MLVA-19, corresponded to SNP-based haplotypes [57] (Fig 3). The SLI haplotypes (Hap1) were distributed within clusters 4, 5, 10 and 11, while SLII haplotypes (Hap2) corresponded to clusters 2, 3, 8 and 11; Hap3 and Hapl4 belonged to clusters 1, 2, and 10, and 7 and 9, respectively (Fig 3). Hence, SLI and SLII strains were distributed in distinct clusters, with the exception of cluster 11 (Fig 3). Cluster 11 contained one Hap1-SLI strain, and the SLII strain T35C; but T35C was actually poorly assigned (53%) to cluster 11 while also assigned to SL II-associated cluster 8 (44%). Clusters 2 and 10 were also shared between Hap3 and Hap2 and Hap1, respectively.
Fig 3

Correspondence between SNP haplotypes with generic clusters inferred by discriminant analysis of principal components.

The clusters are shown on the horizontal axis and the countries and SNP haplotypes are indicated on the vertical axis. The black squares represent the number of strains per SNP haplotype within the given cluster. The Figs were created using the table.value function in the R “Adegenet” package [53].

Correspondence between SNP haplotypes with generic clusters inferred by discriminant analysis of principal components.

The clusters are shown on the horizontal axis and the countries and SNP haplotypes are indicated on the vertical axis. The black squares represent the number of strains per SNP haplotype within the given cluster. The Figs were created using the table.value function in the R “Adegenet” package [53]. We also performed the Mantel tests between distance matrices obtained on a strain subset (n = 32) genotyped by both methods. Distance matrices (Euclidean and Manhattan distances) were highly correlated, with respective Mantel correlation coefficients of 0.412 and 0.492 (P< 0.001 for both Euclidean and Manhattan matrices, 9999 permutations), indicating that genetic distances from MLVA-19 and SNP-based RFLPs were significantly congruent. A collection of 63 strains was typed with both SNP-derived markers [6, 57] and the MLVA-19 scheme. The discriminatory index was 0.981 for MLVA (n = 63 haplotypes) and 0.564 for the SNPs (n = 6 haplotypes).

MLVA-19 reveals epidemiological relationships between countries and is discriminative at the field scale

The MLVA-19 scheme distinguished 208 haplotypes among the Xvm collection (n = 335). Twenty-nine clonal complexes (CC) defined as groups of single locus variant (SLV) grouped 51.34% of the strains using the goeBURST algorithm. Numbers of haplotypes per CC ranged from 2 to 11; 13 CC grouped three haplotypes and above. Fifty-five per cent of the haplotypes (n = 114) remained as singletons, differing from each other by four to thirteen loci. Interestingly, no haplotype was shared by strains from different countries. Most CC (n = 26) were country-specific (13 from Uganda, 12 from Ethiopia, one from Tanzania), while three were shared among countries (CC1: Uganda, Tanzania, Burundi, DRC; CC13 and CC29: DRC, Rwanda) (S4 Table). We also analyzed strains that were isolated from the same field. At this scale, the MLVA-19 scheme discriminated from 8 to 13 haplotypes per field (Ethiopia, n = 2 and Uganda, n = 1) (Table 6). Hence, the MLVA-19 was discriminative enough to distinguish different haplotypes from the country scale to the field scale.
Table 6

MLVA-19 is discriminative at the field scale.

Number of MLVA-19 haplotypes detected in three fields of Ethiopia and Uganda.

Country and locationFieldHostNo. of strainsNo. of haplotypes
ETHIOPIA, YemMelekaEnset1613
Oya FretoEnset148
UGANDA, MukonoKiifuBanana108

MLVA-19 is discriminative at the field scale.

Number of MLVA-19 haplotypes detected in three fields of Ethiopia and Uganda.

Discussion

Understanding invasion routes, biology and epidemiology of pathogens is important to elucidate the main factors involved in the invasion process, to develop epidemiological surveillance strategies aimed at preventing new introductions, as well as building pathogen-informed breeding strategies. At small-scales, it is important to determine the source of outbreaks and the means of transmission to limit the pathogen dispersion. In this study, we developed a highly discriminative typing tool that allowed us to elucidate the population structure and diversity of Xanthomonas vasicola pv. musacearum. The MLVA approach has become a standard in evaluating the population structure and dynamics of bacterial pathogens affecting human, animal and plant health especially due to its ability to detect small yet significant genetic differences.

MLVA-19 scheme is well suited for molecular epidemiological analysis of Xanthomonas vasicola pv. musacearum

To our knowledge, MLVA-19 is the first scheme of this type to be developed for Xvm. This scheme consists of 19 loci evenly distributed on the Xvm genome. When choosing markers for analysis, it is important to ensure that the combination of markers selected allow for accurate discrimination of the haplotypes [58]. Principal component analysis and the genotype accumulation curve indicated that these 19 loci are sufficient to accurately discriminate the Xvm population and thus adding more markers would not identify many new genotypes. The genotype accumulation curve assesses the power to discriminate among unique haplotypes given a random combination of loci [47] and resolved almost all the haplotypes with 16 to 17 microsatellite markers, indicating that such a number of loci would be sufficient to detect most of the genetic diversity within Xvm. MLVA-19 is also well suited for molecular epidemiology analyses. We determined that most loci of the MLVA-19 scheme evolved by gaining or losing a single repeat at one time, supporting a stepwise mutation model. This may facilitate the relatedness analyses and gene genealogies, and makes this MLVA-19-scheme useful in molecular epidemiology study. Several authors have noted that although SMM is considered to be the predominant mutational model for the microsatellites within bacteria, precise data remain scarce on the actual mutation model and possible variations around this model within the Proteobacteria phylum [26, 33, 59, 60]. MLVA-19 was also discriminative enough to identify and monitor different haplotypes of Xvm at the field scale, as shown in the banana and enset fields in Uganda, and Ethiopia respectively. This paves the way for future studies addressing the importance of multi-infection events and the spatial dynamics of bacterial infection within and across farm fields, comparing the spatial structures of aerial infestations and soilborne or tool-mediated infestations, among others.

MLVA-19 are partially transferable to other pathovars of X. vasicola

All MLVA-19 loci were amplified in Xvm, but several loci amplified within the other pathovars of X. vasicola. One MLVA scheme initially developed for Xanthomonas citri pv. citri has also been partly and successfully adapted to a close pathogenic bacterium from the same species [61, 62]. On the other hand, the phylogenetically closest species (X. oryzae, X. cannabis [63]) were not amplified by the MLVA-19. Collectively, these findings further provided evidence for a close phylogenetic relatedness between Xvm and other pathovars of Xanthomonas vasicola.

MLVA-19 is congruent with SNP markers, while revealing an unexpected diversity

From our data, the MLVA typing scheme was much more discriminatory than the SNP typing method described by Wasukira et al.[6]. Indeed, SNP and MLVA markers differ in mutation rates and mechanisms with independent evolutionary processes. Although SNP markers are robust phylogenic markers, less prone to distortion via selective pressure, as is the case for repetitive sequences [64]; their mutation rate is much lower than that of TR, and as such, do not offer enough polymorphisms to discover recent evolutionary events. Whereas, MLVAs mutate faster through the addition or subtraction of tandem repeats, producing greater levels of variation and often providing more discriminatory power per marker. Due to their different evolutionary dynamics, MLVAs and SNPs offer complementary information [65]. MLVA is considered suitable for short-term epidemiological analysis, while SNPs are suited to long-term or global epidemiological analyses [66]. MLVA-19 typing confirmed the differentiation of Xvm into two main sublineages as defined by Wasukira et al [6]. Furthermore, MLVA also identified several homogeneous clusters within each of these sublineages with three to four DAPC clusters per sublineage. Interestingly, some genetic groups (DAPC 6, 7, 9, 12) were not discriminated by SNP-derived markers. The phylogenetic relationship of these clusters to sublineages I and II remains to be determined, and should be clarified by additional genomic sequence analyses.

MLVA-19 as part of a hierarchical Xvm typing system

While SNPs reveal little polymorphism, their phylogenetic signal is informative as it is not disturbed by homoplasic events. The little diversity obtained with SNP markers defined sub-lineages within Xvm but is not sufficient to track the strains of this genetically monomorphic pathogen during outbreaks [7]. We developed highly discriminatory TR markers suitable to separate Xvm isolates within populations which are congruent with SNP typing. In the future, both genotyping systems could be used together within a hierarchical typing procedure [16, 67], with the SNP markers being used to define the higher evolutionary groups at the lineage level, and our MLVA-19 scheme being used for outbreak investigations, regional surveillance, amount and directions of gene flows.

Conclusion

We have established that the MLVA-19 scheme developed in this study is highly resolvent from the regional scale to the field scale. This genotyping tool is thus perfectly suited for exploring the genetic diversity of the recently emerging Xvm populations in East and Central Africa and could in future be helpful in addressing evolutionary and ecological questions that are important to address for deciphering the epidemiology of Xanthomonas wilt on banana, including the reconstruction of Xvm invasion routes throughout Africa. With the MLVA profiles deposited in MLVA Bank (http://bioinfo-web.mpl.ird.fr/MLVA_bank/Genotyping/), it will be possible to share data from new outbreaks or new emerging situations and compare them to the Xvm known genetic diversity for epidemiological investigations. This portable and simple genotyping tool can also be used in the future to assist the regional deployment of new Xvm-resistant banana and enset genitors.

TR Loci characteristics: Motif sequence, array homogeneity and completeness.

aThe nucleotides differing from the reference TR sequence are written in bold. b Occurrence frequency of the alternative TR sequence. (XLSX) Click here for additional data file.

MLVA-19 alleles in X.vasicola pv.

musacearum: Summary of allele sizes and correspondence with raw number of repeats and rounded number of repeats. The alleles shared with other pathovars of X.vasicola are indicated in the column “shared with”. Xvv: X.vasicola pv. vasculorum; Xvh: X.vasicola pv. holcicola. a Raw repeat numbers were rounded to the next integer, following Pourcel [12]. (XLSX) Click here for additional data file.

Relative contributions (A) and correlations (B) of each variable (locus) to each of the first 5 dimensions of the PCA (R::FactoMineR::PCA).

(XLSX) Click here for additional data file.

Composition of clonal complexes obtained from a collection of Xanthomonas vasicola pv.

musacearum (n = 335) with their number of haplotypes and countries of origin. (XLSX) Click here for additional data file.

Comparison of the haplotypes obtained from a core-collection of 36 Xvm strains using the MLVA-19 scheme and the two sets of SNP-derived RFLP markers and their discriminative Hunter-Gaston index.

(XLSX) Click here for additional data file.

Physical positions of the MLVA-19 loci on the chromosome of the Xvm complete genome NCPPB 4379.

Loci are designated by their name, and their coordinates on the chromosome. The intragenic loci are framed. (EPS) Click here for additional data file.

Relationship between VNTR loci repeat sizes and locus polymorphism, as assessed by Nei’s gene diversity (A) and number of alleles (B).

(EPS) Click here for additional data file.

Principal Component Analysis (PCA) showing the correlation circle of each variable to the MLVA-19 scheme on the most informative axes 1 and 2.

(EPS) Click here for additional data file.

Principal Component Analysis (PCA) showing the correlation circle of each variable to the MLVA-19 scheme on the most informative axes 1 and 3.

(EPS) Click here for additional data file.
  56 in total

1.  Simple sequence repeats in prokaryotic genomes.

Authors:  Jan Mrázek; Xiangxue Guo; Apurva Shah
Journal:  Proc Natl Acad Sci U S A       Date:  2007-05-07       Impact factor: 11.205

2.  Highly polymorphic markers reveal the establishment of an invasive lineage of the citrus bacterial pathogen Xanthomonas citri pv. citri in its area of origin.

Authors:  Christian Vernière; Lan Bui Thi Ngoc; Philippe Jarne; Virginie Ravigné; Fabien Guérin; Lionel Gagnevin; Nhat Le Mai; Nguyen M Chau; Olivier Pruvost
Journal:  Environ Microbiol       Date:  2014-01-21       Impact factor: 5.491

3.  Clostridium botulinum group I strain genotyping by 15-locus multilocus variable-number tandem-repeat analysis.

Authors:  Silvia Fillo; Francesco Giordani; Fabrizio Anniballi; Olivier Gorgé; Vincent Ramisse; Gilles Vergnaud; Julia M Riehm; Holger C Scholz; Wolf D Splettstoesser; Jasper Kieboom; Jaran-Strand Olsen; Lucia Fenicia; Florigio Lista
Journal:  J Clin Microbiol       Date:  2011-10-19       Impact factor: 5.948

4.  A complex population structure of the cassava pathogen Xanthomonas axonopodis pv. manihotis in recent years in the Caribbean Region of Colombia.

Authors:  César A Trujillo; Juan C Ochoa; María Fernanda Mideros; Silvia Restrepo; Camilo López; Adriana Bernal
Journal:  Microb Ecol       Date:  2014-04-24       Impact factor: 4.552

5.  New Multilocus Variable-Number Tandem-Repeat Analysis (MLVA) Scheme for Fine-Scale Monitoring and Microevolution-Related Study of Ralstonia pseudosolanacearum Phylotype I Populations.

Authors:  Jérémy Guinard; Anne Latreille; Fabien Guérin; Stéphane Poussier; Emmanuel Wicker
Journal:  Appl Environ Microbiol       Date:  2017-02-15       Impact factor: 4.792

6.  Development of variable-number tandem repeat typing of Mycobacterium bovis: comparison of results with those obtained by using existing exact tandem repeats and spoligotyping.

Authors:  Solvig Roring; Alistair Scott; David Brittain; Ian Walker; Glyn Hewinson; Sydney Neill; Robin Skuce
Journal:  J Clin Microbiol       Date:  2002-06       Impact factor: 5.948

7.  Multiple-locus variable-number tandem repeat analysis reveals genetic relationships within Bacillus anthracis.

Authors:  P Keim; L B Price; A M Klevytska; K L Smith; J M Schupp; R Okinaka; P J Jackson; M E Hugh-Jones
Journal:  J Bacteriol       Date:  2000-05       Impact factor: 3.490

8.  Effect of repeat copy number on variable-number tandem repeat mutations in Escherichia coli O157:H7.

Authors:  Amy J Vogler; Christine Keys; Yoshimi Nemoto; Rebecca E Colman; Zack Jay; Paul Keim
Journal:  J Bacteriol       Date:  2006-06       Impact factor: 3.490

9.  High resolution, on-line identification of strains from the Mycobacterium tuberculosis complex based on tandem repeat typing.

Authors:  Philippe Le Flèche; Michel Fabre; France Denoeud; Jean-Louis Koeck; Gilles Vergnaud
Journal:  BMC Microbiol       Date:  2002-11-27       Impact factor: 3.605

10.  DNA fingerprinting of Shiga-toxin producing Escherichia coli O157 based on Multiple-Locus Variable-Number Tandem-Repeats Analysis (MLVA).

Authors:  Bjørn-Arne Lindstedt; Even Heir; Elisabet Gjernes; Traute Vardund; Georg Kapperud
Journal:  Ann Clin Microbiol Antimicrob       Date:  2003-12-10       Impact factor: 3.944

View more
  3 in total

1.  SNP-based genotyping and whole-genome sequencing reveal previously unknown genetic diversity in Xanthomonas vasicola pv. musacearum, causal agent of banana xanthomonas wilt, in its presumed Ethiopian origin.

Authors:  Gloria V Nakato; David J Studholme; Guy Blomme; Murray Grant; Teresa A Coutinho; Evans M Were; Emmanuel Wicker; George Mahuku
Journal:  Plant Pathol       Date:  2020-11-27       Impact factor: 2.590

2.  Contrasting genetic diversity and structure among Malagasy Ralstonia pseudosolanacearum phylotype I populations inferred from an optimized Multilocus Variable Number of Tandem Repeat Analysis scheme.

Authors:  Hasina Rasoamanana; Santatra Ravelomanantsoa; Noura Yahiaoui; Niry Dianzinga; Emeline Rébert; Miharisoa-Mirana Gauche; Yann Pecrix; Laurent Costet; Adrien Rieux; Philippe Prior; Isabelle Robène; Gilles Cellier; Fabien Guérin; Stéphane Poussier
Journal:  PLoS One       Date:  2020-12-08       Impact factor: 3.240

3.  Molecular Epidemiology of Xanthomonas euvesicatoria Strains from the Balkan Peninsula Revealed by a New Multiple-Locus Variable-Number Tandem-Repeat Analysis Scheme.

Authors:  Taca Vancheva; Nevena Bogatzevska; Penka Moncheva; Sasa Mitrev; Christian Vernière; Ralf Koebnik
Journal:  Microorganisms       Date:  2021-03-05
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.