| Literature DB >> 23408961 |
Hafiz Sohail Naushad1, Radhey S Gupta.
Abstract
The species from the order Xanthomonadales, which harbors many important plant pathogens and some human pathogens, are currently distinguished primarily on the basis of their branching in the 16S rRNA tree. No molecular or biochemical characteristic is known that is specific for these bacteria. Phylogenetic and comparative analyses were conducted on 26 sequenced Xanthomonadales genomes to delineate their branching order and to identify molecular signatures consisting of conserved signature indels (CSIs) in protein sequences that are specific for these bacteria. In a phylogenetic tree based upon sequences for 28 proteins, Xanthomonadales species formed a strongly supported clade with Rhodanobacter sp. 2APBS1 as its deepest branch. Comparative analyses of protein sequences have identified 13 CSIs in widely distributed proteins such as GlnRS, TypA, MscL, LysRS, LipA, Tgt, LpxA, TolQ, ParE, PolA and TyrB that are unique to all species/strains from this order, but not found in any other bacteria. Fifteen additional CSIs in proteins (viz. CoxD, DnaE, PolA, SucA, AsnB, RecA, PyrG, LigA, MutS and TrmD) are uniquely shared by different Xanthomonadales except Rhodanobacter and in a few cases by Pseudoxanthomonas species, providing further support for the deep branching of these two genera. Five other CSIs are commonly shared by Xanthomonadales and 1-3 species from the orders Chromatiales, Methylococcales and Cardiobacteriales suggesting that these deep branching orders of Gammaproteobacteria might be specifically related. Lastly, 7 CSIs in ValRS, CarB, PyrE, GlyS, RnhB, MinD and X001065 are commonly shared by Xanthomonadales and a limited number of Beta- or Gamma-proteobacteria. Our analysis indicates that these CSIs have likely originated independently and they are not due to lateral gene transfers. The Xanthomonadales-specific CSIs reported here provide novel molecular markers for the identification of these important plant and human pathogens and also as potential targets for development of drugs/agents that specifically target these bacteria.Entities:
Mesh:
Year: 2013 PMID: 23408961 PMCID: PMC3568101 DOI: 10.1371/journal.pone.0055216
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sequence Characteristics of Xanthomonadales genomes.
| Organism | GenBank Accession No. | Size(Mbp) | No. of Proteins | % GC content | Reference |
|
| AM743169.1 | 4.8 | 4386 | 66 |
|
|
| CP001111.1 | 4.6 | 4039 | 66 |
|
|
| ACDV00000000 | 4.9 | 4469 | 66 | JCVI |
|
| FP565176.1 | 3.7 | 3114 | 63 |
|
|
| AE008923.1 | 5.3 | 4312 | 64 |
|
|
| CP002914.1 | 5.0 | 4181 | 65 |
|
|
| CP002789.1 | 4.9 | 4520 | 65 |
|
|
| CP000050.1 | 5.1 | 4271 | 64 |
|
|
| AE008922.1 | 5.1 | 4179 | 65 |
|
|
| AM920689.1 | 5.1 | 4466 | 65 |
|
|
| AM039952.1 | 5.4 | 4487 | 64 |
|
|
| AEQV00000000 | 5.5 | 4927 | 65 |
|
|
| AEQX00000000 | 5.5 | 5027 | 65 |
|
|
| CP003057.1 | 4.8 | 4474 | 64 |
|
|
| AE013598.1 | 5.0 | 4064 | 63 |
|
|
| AP008229.1 | 5.0 | 4372 | 63 | NIAS |
|
| CP000967.1 | 5.2 | 4988 | 63 |
|
|
| AEQW00000000 | 5.2 | 4637 | 63 |
|
|
| AE003849.1 | 2.8 | 2766 | 52 |
|
|
| CP000941.1 | 2.5 | 2104 | 51 |
|
|
| CP001011.1 | 2.6 | 2161 | 51 |
|
|
| AE009442.1 | 2.5 | 2034 | 51 |
|
|
| CP002165 | 2.5 | 2216 | 51 |
|
|
| CP003093.2 | 3.5 | 3149 | 67 |
|
|
| CP002446.1 | 3.4 | 3070 | 70 | DOE-JGI |
|
| AGIL00000000 | 4.0 | 3800 | 68 | DOE-JGI |
NIAS = Genome was sequenced by National Institute of Agrobiological Sciences, Japan.
DOE-JGI = Genome was sequenced by DOE Joint Genome Institute USA.
JCVI = Genome was sequenced by J. Craig Venter Institute, USA.
Figure 1Phylogenetic tree for Xanthomonadales based on concatenated sequences for 28 conserved proteins.
The tree shown is a NJ distance tree, however, similar branching was observed in the ML tree (Figure S1). The observed bootstrap scores for various nodes are shown on the branch points. The tree was rooted using sequences from Alphaproteobacteria.
Figure 2Examples of conserved signature indels (CSIs) that are specific for the order Xanthomonadales.
Excerpts are shown from the sequence alignments of (A) Glutaminyl t-RNA synthetase and (B) GTP-binding elongation factor proteins showing two CSIs that are uniquely found in various sequenced Xanthomonadales species, but not found in any other bacteria. Information for other CSIs that are specific for the Xanthomonadales is provided in Figures S2–S12 and Table 2. The dashes in these as well as all other alignments show identity with the amino acid on the top line. The Gene bank identification numbers of various sequences are shown in the second column and the numbers on the top indicate the position of this sequence in the species shown on the top line. The sequence information is shown here for only representative species. However, unless otherwise indicated, these CSIs are highly specific for the indicated group of species.
Conserved Signatures Indels that are specific for Xanthomonadales.
| Protein Name | Gene Name | GenBankIdentifier | Figure No | Indel Size | Indel Position | Exceptions |
| Glutaminyl-tRNA synthetase | glnS | 194364460 |
| 18 aa ins | 239–295 | None |
| GTP-binding elongation factor protein | typA | 58580596 |
| 4 aa ins | 303–350 | None |
| Amino acid/peptide transporter | – | 71275790 |
| 7 aa ins | 164–213 | None |
| Large-conductance mechanosensitive channel | mscL | 294667079 |
| 5 aa ins | 40–85 | None |
| Lysyl-tRNA synthetase | lysS | 194365604 |
| 3 aa ins | 34–85 | None |
| Lipoyl synthase | lipA | 58583575 |
| 2 aa ins | 156–209 | None |
| Queuine tRNA-ribosyltransferase | tgt | 194365393 |
| 1 aa ins | 289–339 | None |
| Acyl-(acyl-carrier-protein)–UDP-N-acetyl-glucosamine O-acyltransferase | lpxA | 71275623 |
| 1 aa ins | 164–210 | None |
| TolQ protein | tolQ | 21232451 |
| 1 aa ins | 177–217 | None |
| Alpha-2-macroglobulin domain-containing protein | – | 194366795 |
| 13 aa del | 607–661 | None |
| DNA topoisomerase IV subunit B | parE | 84624476 |
| 1 aa del | 282–326 | None |
| DNA polymerase I | polA | 194367713 |
| 1 aa del | 28–65 | None |
| Aromatic amino acid aminotransferase | tyrB | 28197970 |
| 1 aa del | 306–354 | None |
| Glutaminyl-tRNA synthetase | glnS | 194364460 |
| 1 aa del | 77–131 |
|
| DNA polymerase III subunit beta | RpoB | 194363780 |
| 1 aa del | 44–81 |
|
| Lipid-A-disaccharide synthase | lpxB | 190573490 |
| 2 aa ins | 317–358 |
|
| Carbamoyl phosphate synthase large subunit | carB | 166711938 |
| 1 aa ins | 403–457 |
|
| Putative secreted protein | – | 188992701 |
| 1 aa ins | 1285–1318 |
|
| Aminopeptidase P | pepP | 294627124 |
| 1 aa del | 211–246 |
|
The indel position provided indicates the region of the protein containing the CSI.
For details go to respective figures.
Figure 3Partial sequence alignment of glutaminyl t-RNA synthetase showing a CSI that is specifically present in various sequenced Xanthomonadales and some other Gammaproteobacteria.
This CSI as well as a few other CSIs identified in this work (see Table 2 and Figures S13–S17) suggest a possible relationship of Xanthomonadales to these deep branching orders of Gammaproteobacteria.
Figure 4Examples of CSIs those are present in various Xanthomonadales species except Rhodanobacter sp. 2APBS1.
Excerpts are shown from the sequence alignments of (A) uroporphyrinogen decarboxylase (HemE) and (B) tRNA delta(2)-isopentenylpyrophosphate transferase (MiaA) proteins showing two conserved signature indels (boxed) that are specifically found in various sequenced Xanthomonadales species, except Rhodanobacter sp. 2APBS1. These CSIs were likely introduced in these genes in a common ancestor of the Xanthomonadales after branching of Rhodanobacter. Information for CSIs in other proteins showing similar species specificities is provided in Figures S18–S30 and Table 3.
CSIs that are specific for Xanthomonadales except Rhodanobacter sp. 2APBS1.
| Protein Name | Gene Name | GenBankIdentifier | Figure No | Indel Size | Indel Position | Specificity within Xanthomonadales |
| Uroporphyrinogen decarboxylase | hemE | 294625972 |
| 5 aa ins | 295–340 | All except |
| tRNA delta(2)-isopentenylpyrophosphate transferase | miaA | 194365248 |
| 5 aa ins | 219–256 | All except |
| Protoheme IX farnesyltransferase | coxD | 15837961 |
| 4 aa ins | 150–192 | All except |
| DNA polymerase III subunit alpha | dnaE | 21242159 |
| 1 aa ins | 583–638 | All except |
| DEAD box helicase domain-containing protein | – | 194364258 |
| 1 aa ins | 155–200 | All except |
| Ribose-5-phosphate isomerase A | rpiA | 194367055 |
| 1 aa ins | 127–169 | All except |
| DNA polymerase I | polA | 21244827 |
| 1 aa ins | 136–180 | All except |
| Aspartyl-tRNA synthetase | aspS | 194366904 |
| 4 aa del | 343–391 | All except |
| 2-oxoglutarate-dehydrogenase E1 component | sucA | 194366403 |
| 1 aa del | 782–830 | All except |
| Coproporphyrinogen III oxidase | cpoX | 194367710 |
| 1 aa del | 166–215 | All except |
| 2-oxoglutarate-dehydrogenase E1 component | sucA | 194366403 |
| 1 aa del | 106–164 | All except |
| Asparagine synthase b protein | asnB | 285018780 |
| 4–5 aa ins | 404–445 | All except |
| Asparagine synthase b protein | asnB | 194365058 |
| 1–2 aa ins | 96–132 | All except |
| DNA polymerase III subunit alpha | dnaE | 77747494 |
| 4 aa ins | 522–576 | All except |
| DNA repair protein RecA | recA | 15836728 |
| 2 aa ins | 172–238 | All except |
| 5′-nucleotidase | – | 21231001 |
| 11–13 aa ins | 123–188 | All except |
| CTP synthetase | pyrG | 194365226 |
| 2 aa ins | 253–291 | All except |
| DNA mismatch repair protein MutS | mutS | 15838317 |
| 5 aa ins | 765–806 | All except |
| DNA polymerase III subunit alpha | dnaE | 194365029 |
| 2 aa del | 65–120 | All except |
| tRNA (guanine-N(1)-)-methyltransferase | trmD | 194364933 |
| 2 aa ins | 140–200 | All except |
| Glucose-6-phosphate 1-dehydrogenase | zwf | 190573773 |
| 4 aa del | 290–334 | All except |
| DNA ligase NAD dependent | ligA | 77747612 |
| 57–65 aa ins | 461–583 | All except |
The indel position provided indicates the region of the protein containing the CSI.
These CSIs have been previously described [28].
Figure 5Example of CSIs those are able to distinguish two different clades of Xanthomonadales.
Partial sequence alignments are shown of the proteins (A) 5′-nucleotidase and (B) CTP synthetase showing two CSI, which due to their different lengths are able to distinguish between two different clades of Xanthomonadales. In (A), a 13 aa insert is present in all of the Xanthomonas and Xylella species, whereas the two Stenotrophomonas spp. contain an 11 aa insert in this position. Similarly, in (B), all of the Xanthomonas, Xylella and Stenotrophomonas species have a 2 aa insert, whereas the two Pseudoxanthomonas spp. contain a 1 aa insert in this position. Different possibilities to account for these CSIs are discussed in the text.
Figure 6Partial sequence alignments of valyl t-RNA synthetase showing a 13 aa insert that is commonly shared by various Xanthomonadales and a subgroup of Alphaproteobacteria.
Other Alpha- and Gamma-proteobacteria do not contain this insert.
Figure 7Partial sequence alignment of carbamoyl phosphate synthase showing a 1 aa insert that is commonly shared by Xanthomonadales and a subgroup of Betaproteobacteria.
The distinct branching of these two groups in a phylogenetic tree based upon CarB sequence (Figure S36) provides evidence that this shared CSIs is not a result of LGT.
Figure 8Phylogenetic tree based upon valyl t-RNA synthetase sequences.
The distinct branching of Xanthomonadales and the Alphaproteobacteria containing this insert suggests that the shared presence of this CSIs in these two groups is not due to a LGT.
Figure 9A summary diagram showing the species specificity of various CSIs identified in this work and the evolutionary stages where the genetic changes responsible for them were likely introduced.