| Literature DB >> 21533033 |
Kevin L Schneider1, Glorimar Marrero, Anne M Alvarez, Gernot G Presting.
Abstract
A DNA marker that distinguishes plant associated bacteria at the species level and below was derived by comparing six sequenced genomes of Xanthomonas, a genus that contains many important phytopathogens. This DNA marker comprises a portion of the dnaA replication initiation factor (RIF). Unlike the rRNA genes, dnaA is a single copy gene in the vast majority of sequenced bacterial genomes, and amplification of RIF requires genus-specific primers. In silico analysis revealed that RIF has equal or greater ability to differentiate closely related species of Xanthomonas than the widely used ribosomal intergenic spacer region (ITS). Furthermore, in a set of 263 Xanthomonas, Ralstonia and Clavibacter strains, the RIF marker was directly sequenced in both directions with a success rate approximately 16% higher than that for ITS. RIF frameworks for Xanthomonas, Ralstonia and Clavibacter were constructed using 682 reference strains representing different species, subspecies, pathovars, races, hosts and geographic regions, and contain a total of 109 different RIF sequences. RIF sequences showed subspecific groupings but did not place strains of X. campestris or X. axonopodis into currently named pathovars nor R. solanacearum strains into their respective races, confirming previous conclusions that pathovar and race designations do not necessarily reflect genetic relationships. The RIF marker also was sequenced for 24 reference strains from three genera in the Enterobacteriaceae: Pectobacterium, Pantoea and Dickeya. RIF sequences of 70 previously uncharacterized strains of Ralstonia, Clavibacter, Pectobacterium and Dickeya matched, or were similar to, those of known reference strains, illustrating the utility of the frameworks to classify bacteria below the species level and rapidly match unknown isolates to reference strains. The RIF sequence frameworks are available at the online RIF database, RIFdb, and can be queried for diagnostic purposes with RIF sequences obtained from unknown strains in both chromatogram and FASTA format.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21533033 PMCID: PMC3080875 DOI: 10.1371/journal.pone.0018496
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Completely sequenced genomes discussed in this study.
| Strain Name | Accession | Abbreviations |
|
| NC_009480.1 | Cmm_32 |
|
| NC_010407.1 | Cms_33113 |
|
| NC_013971.1 | Ea_273 |
|
| CP002038 | Dd_3937 |
|
| NC_004547.2 | Pa_1043 |
|
| NC_012214.1 | Ep_1/96 |
|
| NC_010694.1 | Et_1/99 |
|
| NC_012912.1 | Dz_1591 |
|
| NC_012880.1 | Dd_703 |
|
| NC_013592.1 | Dd_586 |
|
| NC_013421.1 | Pw_Wpp163 |
|
| NC_012917.1 | Pcc_PC1 |
|
| NC_008313.1 | Re_H16 |
|
| NC_007347.1 | Re_JMP134 |
|
| NC_007973.1 | Rm_CH34 |
|
| NC_003295.1 | Rs_GMI1000 |
|
| NC_010682.1 | Rp_12J |
|
| NC_010688.1 | Xcc_B100 |
|
| NC_003902.1 | Xcc_33913 |
|
| NC_007086.1 | Xcc_8004 |
|
| NC_007508.1 | Xe_85-10 |
|
| NC_010717.1 | Xoo_PXO99 |
|
| NC_006834.1 | Xoo_10331 |
|
| NC_007705.1 | Xoo_311018 |
|
| NC_003919.1 | Xccit_306 |
|
| NC_019043.1 | Sm_279 |
|
| NC_011071.1 | Sm_R551 |
*Set of 15 genomes used for the identification of RIF.
Figure 1Alignment of the dnaA gene from six Xanthomonas strains.
The dnaA genes of X. campestris pv. campestris strains differ by 5 SNPs (pink highlight) and those of two X. oryzae pv. oryzae strains (green highlight) differ by 1 SNP. Conserved 20+-mers in all six Xanthomonas genomes are highlighted in blue. The only set of primers that flank all SNPs, have a melting temperature of 60±2°C and a G+C content of 50–60%, end in a G or C and have no complementary bases at the ends, are highlighted in yellow.
Genes in Xoo_311018 and Xcc_33913 containing regions that match criteria for potential markers.
| Strain | Location | Gene Name | Number of regions meeting marker criteria | Greater resolution than the ITS | Complete gene is present in all 15 strains |
| Xoo_311018 | 2251324–2254065 | XOO2036 | 1 | no | no |
| Xoo_311018 | 4934856–4938381 | XOO4369 | 4 | no | no |
| Xoo_311018 | 552362–553945 | XOO0506 | 13 | no | no |
| Xoo_311018 | 4512534–4515648 | XOO4004 | 13 | no | no |
| Xoo_311018 | 4897866–4899412 | XOO4332 | 1 | no | no |
| Xoo_311018 | 8754–9959 | XOO0006 | 3 | no | no |
| Xoo_311018 | 4906929–4908174 | XOO4342 | 5 | no | no |
| Xoo_311018 | 4937637–4939494 | XOO4370 | 32 | yes | no |
| Xcc_33913 | 2175534–2176610 | XCC1866 | 1 | no | no |
| Xcc_33913 | 1–1370 | XCC0001 | 27 | yes | yes |
| Xcc_33913 | 4748693–4749484 | XCC4004 | 4 | no | no |
| Xcc_33913 | 4506913–4510053 | XCC3789 | 16 | yes | no |
| Xcc_33913 | 4516069–4517528 | XCC3796 | 1 | no | no |
| Xcc_33913 | 4513098–4514465 | XCC3793 | 3 | no | no |
| Xcc_33913 | 4749117–4751483 | XCC4005 | 6 | no | no |
- Xoo_10331 was compared to Xoo_311018; Xcc_8004 was compared to Xcc_33913.
- Genes identified in Xoo_311018 do not overlap with those identified in Xcc_33913.
- Marker criteria: conserved 20+-mers are separated by >550 nt and produce an amplicon <1000 nt, flank 2 or more SNPs between Xoo_10331 and Xoo_311018 or Xcc_8004 and Xcc_33913.
- Strains are listed in Table 1 with an asterisk.
- Gene XCC0001 met all criteria for use as a potential marker.
- Orthologs differ by a single SNP.
Figure 2Neighbor-joining cladograms for Xanthomonas, Clavibacter and Ralstonia using ITS and RIF sequences.
RIF resolves closely related strains of Xanthomonas that remain unresolved with ITS. The neighbor-joining cladograms for A) Xanthomonas, B) Clavibacter and C) Ralstonia using the ITS and RIF markers from fully sequenced strains in GenBank (Table 1). Bootstrap values >50% (shown at the node) are expressed as a percentage of 5,000 replicates. Red boxes contain two X. oryzae pv. oryzae strains resolved by one SNP between RIF sequences and zero SNPs between ITS sequences. Likewise, blue boxes contain two X. campestris pv. campestris strains resolved by five SNPs between RIF sequences and zero SNPs between ITS sequences. The RIF and ITS sequences of Xoo strains 10331 and PXO99 are identical. Xylella fastidiosa, Burkholderia mallei and Leifsonia xyli are included as outgroups.
RIF primers developed in this study.
| Genus | Forward Primer | Reverse Primer |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
No genome sequence was available for Pantoea agglomerans when these primers were developed.
Primer pairs for three genera of Enterobacteriaceae were initially developed by comparison of the dnaA gene of Dickeya, Pectobacterium and Erwinia available in 2007 (Table 1, Supplemental Figure S1). Primers were developed using the criteria applied for selection of the Clavibacter and Ralstonia primers and alignment of the respective dnaA gene (see methods). The initial Pectobacterium and Dickeya primers failed to produce amplicons for two strains of Dickeya (one D. chrysanthemi and one D. paradisiaca), three strains of Pantoea (one that also failed to amplify with the ITS marker) and nine strains of Pectobacterium (eight of which failed to amplify with the ITS marker). Pectobacterium strains K0509, K0522 and K0574 and Pantoea strains required an annealing temperature of 51°C.
Primer sets were derived using newly sequenced strains of Dickeya (Dz_1591, Dd_703) and Pectobacterium (Pc_PCC1 and Pw_Wpp163). The Dickeya primers amplified all nine reference Dickeya strains. The new Pectobacterium primers no longer amplified any Pantoea strains at 51°C and they amplified strain K0522, K0574 and one additional Pectobacterium strain, K0510, but still did not amplify strain K0509 at 61°C.
Primers developed from Pseudomonas accessions NZ_CH482384, NZ_CM001020, NC_011770, NC_009656, NC_002516, NC_008463, NC_008027, NC_004129, NC_012660, NZ_CM001025, NC_009439, NC_009512, NC_010322, NC_002947, NC_010501, NZ_GG774590, NZ_DS997060, NC_005773, NC_007005, NZ_GG700508, NC_004578, NZ_GG699506 and NC_009434.
Primers developed from Xylella fastidiosa accessions CP002165, CP001011, AE009442, CP000941 and AE003849.
RIF sequencing success rate for Xanthomonas, Ralstonia and Clavibacter is higher than ITS.
|
|
|
| |
| Total strains | 120 | 120 | 23 |
| ITS amplicon | 120 | 119 | 23 |
| ITS sequence |
|
|
|
| RIF amplicon | 119 | 117 | 23 |
| RIF sequence |
|
|
|
| RIF amplicon, no ITS amplicon | 0 | 2 | 0 |
| RIF contig, no ITS contig | 15 | 17 | 4 |
| ITS amplicon, no RIF amplicon | 1 | 6 | 0 |
| ITS contig, no RIF contig | 2 | 2 | 0 |
| ITS amplicon, no ITS contig | 33 | 20 | 4 |
| No ITS contig but ITS read | 18 | 15 | 1 |
| RIF amplicon, no RIF contig | 9 | 4 | 0 |
| No RIF contig, but RIF read | 7 | 3 | 0 |
| Size of ITS sequence (nt) | 557–590 | 574–590 | 504–505 |
| Size of RIF sequence (nt) | 700 | 654 | 680 |
| Number of strains with both the ITS and RIF sequenced | 84 | 97 | 19 |
| ITS average nucleotide differences | 29.3 | 16.1 | 10.5 |
| RIF average nucleotide differences | 44.8 | 11.1 | 23.8 |
| Number of different ITS sequences | 18 | 10 | 11 |
| Number of different RIF sequences | 26 | 13 | 8 |
RIF and ITS marker characteristics for a set of 263 strains from the PBC. Sequence contigs are assembled from read pairs. The total number of RIF and ITS sequences are in bold.
*- Computed using a single representative for each unique sequence.
RIF sequences for Xanthomonas, Ralstonia and Clavibacter.
|
|
|
| |
| Total strains | 444 (361 PBC/84 ICMP) | 248 (239) | 126 (115) |
| RIF amplicon (characterized PBC/ICMP reference strains) | 383 (340 PBC/43 ICMP) | 228 (217) | 125 (114) |
|
| N/A | 198 | N/A |
|
| N/A | 34 | N/A |
| RIF sequence (characterized PBC/ICMP reference strains) | 358 ( | 219( | 125 ( |
|
| N/A | 159/191 | N/A |
| RIF amplicon, no RIF sequence | 29 (29 PBC/0 ICMP) | 10 | 1 |
| No RIF contig, but RIF read | 31 (31 PBC/0 ICMP) | 6 | 1 |
|
| N/A | 32 | N/A |
| No | N/A | 19 | N/A |
| Size of RIF sequence (nt) | 558 | 617 | 660 |
| Size of | N/A | 495 | N/A |
| RIF average nucleotide differences | 44 | 12.9 | 18.6 |
|
| N/A | 25.3 | N/A |
| Number of different RIF sequences | |||
| (characterized PBC/ICMP reference strains) | 82 (82) | 17 (17) | 11 (10) |
| Number of different | N/A | 22 | N/A |
RIF and ITS marker characteristics for a set of 818 strains from the PBC and ICMP collections. Sequence contigs are assembled from read pairs.
-Total strains include characterized strains (separated from the total in parentheses) and unknown accessions.
- One strain did not produce a RIF amplicon that was visible on an agarose gel but did produce sequence data.
- the 191 strains only include those that produced an egl amplicon (Supplemental Table S2). RIF was sequenced in both directions more often (183/191 = 95.8%) than the egl gene (159/191 = 83.2%) for the same strains (Supplemental Table S2).
- Of the 32 failed sequencing reactions with egl 19 had a single direction read.
*- Computed using a single representative for each unique sequence.
Figure 3Classifying Xanthomonas based on RIF genotypes.
The rooted neighbor-joining cladogram was constructed from 358 characterized Xanthomonas strains from the PBC (Table 2), ten reference strains from GenBank (see Table 1 for strain names) and the outgroup Xylella fastidiosa. Xanthomonas abbreviations are given in the text box. Identical sequences are represented only once and the number of sequenced strains is indicated on each leaf. Bootstrap values of >50% (shown at the node) are expressed as a percentage of 5,000 replicates. Blue boxes contain strains of different species with identical RIF sequences. X. axonopodis pv. dieffenbachiae strains are found throughout the tree (red text).
Figure 4Ralstonia solanacearum strains classified based on RIF genotypes.
The rooted neighbor-joining cladogram was constructed from 210 characterized Ralstonia strains from the PBC (Table 2), six reference strains from GenBank (see Table 1 for strain names)and the outgroup Burkholderia mallei. Identical sequences are represented only once and the number of sequenced strains is indicated on each leaf. Bootstrap values >50% (shown at the node) are expressed as a percentage of 5,000 replicates. All Rs strains (clade A) are clearly separated from other Ralstonia species. Rs1, Rs2, Rs3 and Rs4 are the four races of R. solanacearum, BDB is the blood disease bacterium and Rs† is R. solanacearum UW433 (ACH0732) which is an atypical race 1 biovar 2 strain [19]. Blue boxes contain strains of Rs races 1, 2, 4 and BDB that share an identical RIF sequence. In contrast, orange boxes contain only Rs3 strains showing that these RIF sequences are specific to race 3. Numbers in boxes next to tree leaves are egl sequevars obtained for those strains. Red text indicates Rs strains with no previous race designation.
Figure 5The RIF marker separates Clavibacter michiganensis strains into the three subspecies.
The rooted neighbor-joining cladogram was constructed from 114 characterized Clavibacter strains from the PBC (Table 2), two reference strains from GenBank (see Table 1 for strain names) and the outgroup Leifsonia xyli. Identical sequences are represented only once and the number of sequenced strains is indicated on each leaf. Bootstrap values >50% (shown at the node) are expressed as a percentage of 5,000 replicates. Red text indicates C. michiganensis strains with no previous subspecies designation. Cm† is a non-pathogenic strain isolated from tomato that is most similar to Cmi. Unknowns #1-10 were isolated from a recent bacterial canker outbreak of tomato (see text) and perfectly match two of the Cmm RIF reference sequences. Cmm: C. michiganensis subsp. michiganensis. Cmi: C. michiganensis subsp. insidiosus. Cms: C. michiganensis subsp. sepedonicus.
Figure 6Neighbor-joining cladograms for the family Enterobacteriaceae using ITS and RIF sequences.
Sequences for Dickeya, Pectobacterium, Pantoea and Erwinia were extracted from fully sequenced strains in GenBank (Table 1). Bootstrap values >50% (shown at the node) are expressed as a percentage of 5,000 replicates. Note the longer branch lengths for the Erwinia and Dickeya species in the RIF tree.
Comparison of ITS and RIF markers for three genera of Enterobacteriaceae.
|
|
|
| |
| Total Strains (characterized PBC reference strains) | 64 (9) | 23 (19) | 9 (9) |
| ITS amplicon | 59 | 10 | 8 |
| ITS sequence | 35 | 0 | N/A |
| RIF amplicon (characterized PBC reference strains) | 60 (7) | 18 (8) | 6(2) |
| RIF sequence (characterized PBC reference strains) | 53 (7) | 15 (11) | 6 (6) |
| RIF amplicon, no ITS amplicon | 1 | 5 | 0 |
| RIF sequence, no ITS sequence | 25 | N/A | N/A |
| ITS amplicon, no RIF amplicon | 1 | 0 | 4 |
| ITS sequence, no RIF sequence | 6 | N/A | N/A |
| ITS amplicon, no ITS sequence | 24 | N/A | N/A |
| No ITS contig but ITS read | 23 | N/A | N/A |
| RIF amplicon, no RIF sequence | 6 | 1 | 0 |
| No RIF contig, but RIF read | 6 | 0 | 0 |
| Size of ITS sequence (nt) | 416–425 | N/A | N/A |
| Size of RIF sequence (nt) | 722 | 722 | 722 |
| Number of strains with both the ITS and RIF sequenced | 29 | N/A | N/A |
| ITS average nucleotide differences | 10.5 | N/A | N/A |
| RIF average nucleotide differences | 40.3 | 55.8 | 44 |
| Number of different ITS sequences | 11 | N/A | N/A |
| Number of different RIF sequences | |||
| (characterized PBC reference strains) | 17 (6) | 10 (8) | 2 (2) |
RIF and ITS marker characteristics for a set of 96 strains from the PBC collection. Sequence contigs are assembled from read pairs.
*- Computed using a single representative for each unique sequence.
- Eight Pectobacterium strains and one Pantoea strain failed to yield either ITS or RIF amplicons after multiple attempts to produce an amplicon from reisolated DNA (Supplemental Table S2).
- Pantoea strains were not sequenced with ITS.
- 15 strains were amplified at 61°C and 3 strains at 51°C.
- 2 strains were amplified at 61°C and 4 strains at 51°C.
- 2 strains were amplified at 61°C and 2 strains at 51°C.
Figure 7RIF tree of plant pathogenic Enterobacteriaceae.
The rooted neighbor-joining cladogram was constructed from 74 Enterobacteriaceae strains, including 24 characterized strains of Dickeya, Pectobacterium and Pantoea from the PBC (Supplemental Table S2), eight reference strains from GenBank (see Table 1 for strain names) and the outgroup Yersinia pestis. Identical sequences are represented only once and the number of sequenced strains is indicated on each leaf. Bootstrap values >50% (shown at the node) are expressed as a percentage of 5,000 replicates. The true Erwinia species E. tasmaniensis separates from the renamed Erwinia species that branch into three different genera: Dickeya (clade A), Pectobacterium (clades G and H) and Pantoea (clade M). Red text indicates uncharacterized strains of Dickeya and green text indicates uncharacterized strains of Pectobacterium. K numbers are shown for the accessions discussed in the text (Supplemental Table S2).