| Literature DB >> 25566316 |
Amandeep Sawana1, Mobolaji Adeolu1, Radhey S Gupta1.
Abstract
The genus Burkholderia contains large number of diverse species which include many clinically important organisms, phytopathogens, as well as environmental species. However, currently, there is a paucity of biochemical or molecular characteristics which can reliably distinguish different groups of Burkholderia species. We report here the results of detailed phylogenetic and comparative genomic analyses of 45 sequenced species of the genus Burkholderia. In phylogenetic trees based upon concatenated sequences for 21 conserved proteins as well as 16S rRNA gene sequence based trees, members of the genus Burkholderia grouped into two major clades. Within these main clades a number of smaller clades including those corresponding to the clinically important Burkholderia cepacia complex (BCC) and the Burkholderia pseudomallei groups were also clearly distinguished. Our comparative analysis of protein sequences from Burkholderia spp. has identified 42 highly specific molecular markers in the form of conserved sequence indels (CSIs) that are uniquely found in a number of well-defined groups of Burkholderia spp. Six of these CSIs are specific for a group of Burkholderia spp. (referred to as Clade I in this work) which contains all clinically relevant members of the genus (viz. the BCC and the B. pseudomallei group) as well as the phytopathogenic Burkholderia spp. The second main clade (Clade II), which is composed of environmental Burkholderia species, is also distinguished by 2 identified CSIs that are specific for this group. Additionally, our work has also identified multiple CSIs that serve to clearly demarcate a number of smaller groups of Burkholderia spp. including 3 CSIs that are specific for the B. cepacia complex, 4 CSIs that are uniquely found in the B. pseudomallei group, 5 CSIs that are specific for the phytopathogenic Burkholderia spp. and 22 other CSI that distinguish two groups within Clade II. The described molecular markers provide highly specific means for the demarcation of different groups of Burkholderia spp. and they also offer novel and useful targets for the development of diagnostic assays for the clinically important members of the BCC or the pseudomallei groups. Based upon the results of phylogenetic analyses, the identified CSIs and the pathogenicity profile of Burkholderia species, we are proposing a division of the genus Burkholderia into two genera. In this new proposal, the emended genus Burkholderia will correspond to the Clade I and it will contain only the clinically relevant and phytopathogenic Burkholderia species. All other Burkholderia spp., which are primarily environmental, will be transferred to a new genus Paraburkholderia gen. nov.Entities:
Keywords: Burkholderia; Burkholderia cepacia complex; conserved signature indels; molecular signatures; phylogenetic trees
Year: 2014 PMID: 25566316 PMCID: PMC4271702 DOI: 10.3389/fgene.2014.00429
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Genome characteristics of the sequenced members of the genus .
| PRJNA57953 | 8.06 | 66.9 | 3 | 7116 | Holden et al., | |
| PRJNA57733 | 7.25 | 68.1 | 2 | 5727 | Holden et al., | |
| PRJNA57725 | 5.84 | 68.5 | 2 | 5022 | Nierman et al., | |
| PRJNA58081 | 6.72 | 67.6 | 2 | 5632 | Kim et al., | |
| PRJNA54789 | 6.99 | 67.0 | – | 6954 | NMRC | |
| PRJNA58909 | 7.01 | 66.7 | 3 | 6111 | DOE | |
| PRJNA58303 | 7.53 | 66.8 | 3 | 6610 | Coenye et al., | |
| PRJNA59397 | 7.28 | 67.9 | 2 | 5773 | Lim et al., | |
| PRJNA57823 | 9.73 | 62.6 | 3 | 8702 | Chain et al., | |
| PRJNA42523 | 7.88 | 63.3 | 3 | 6889 | Ormeno-Orrillo et al., | |
| PRJNA42975 | 6.83 | 63.6 | 2 | 5965 | DOE | |
| PRJNA46253 | 7.04 | 63.2 | 2 | 5988 | DOE | |
| PRJNA48975 | 8.74 | 62.4 | – | 7742 | DOE | |
| PRJNA55101 | 7.89 | 62.9 | – | 7460 | Ormeno-Orrillo et al., | |
| PRJNA58073 | 8.68 | 66.3 | 3 | 7716 | DOE | |
| PRJNA66661 | 7.76 | 63.2 | – | – | DOE | |
| PRJNA81081 | 8.90 | 63.3 | 3 | 7804 | Lim et al., | |
| PRJNA160003 | 7.88 | 61.4 | – | 7268 | Kumar et al., | |
| PRJNA165309 | 6.31 | 63.1 | – | – | DOE | |
| PRJNA165871 | 6.63 | 67.2 | 3 | 6024 | Kwak et al., | |
| PRJNA179699 | 7.38 | 66.9 | – | 8940 | Khan et al., | |
| PRJNA180532 | 10.64 | 61.9 | – | 10126 | Oak Ridge | |
| PRJNA182741 | 7.21 | 63.1 | – | – | DOE | |
| PRJNA182743 | 7.68 | 61.8 | – | – | DOE | |
| PRJNA190816 | 7.24 | 62.8 | – | – | DOE | |
| PRJNA199219 | 9.07 | 62.9 | – | 8336 | DOE | |
| PRJNA199221 | 8.61 | 63.1 | – | 7873 | DOE | |
| PRJNA199222 | 6.39 | 63.1 | – | 5963 | DOE | |
| PRJNA205541 | 6.96 | 63.1 | 3 | 6498 | Shibata et al., | |
| PRJNA58075 | 8.39 | 65.7 | 3 | 7617 | DOE | |
| PRJNA54351 | 6.42 | 66.8 | – | 4795 | Broad Institute | |
| PRJNA58699 | 8.68 | 62.3 | 2 | 7496 | Vandamme et al., | |
| PRJNA58729 | 8.21 | 62.3 | 2 | 7241 | Weilharter et al., | |
| PRJNA54793 | 6.93 | 67.3 | – | 7181 | NMRC | |
| PRJNA54887 | 7.48 | 62.9 | – | 6747 | DOE | |
| PRJNA60487 | 3.75 | 60.7 | 1 | 3870 | Lackner et al., | |
| PRJNA66301 | 9.05 | 67.4 | 2 | 7411 | Seo et al., | |
| PRJNA173858 | 6.47 | 66.7 | 2 | 5825 | Hong et al., | |
| PRJNA74017 | 4.01 | 62.9 | – | 2069 | Van Oevelen et al., | |
| PRJNA163559 | 8.41 | 63.9 | – | – | DOE | |
| PRJNA168186 | 11.29 | 61.8 | – | 10234 | Nazir et al., | |
| PRJNA199595 | 8.05 | 67.4 | – | 7324 | Song et al., | |
| PRJNA199910 | 7.13 | 65.0 | – | 6311 | Coutinho et al., | |
| PRJNA176370 | 7.65 | 63.1 | 2 | 6496 | Oliveira Cunha et al., | |
| PRJNA201182 | 7.38 | 61.9 | – | 6722 | DOE |
The Broad Institute Genome Sequencing Platform (Broad Institute).
Naval Medical Research Center/ Biological Defense Research Directorate (NMRC).
Oak Ridge National Lab (Oak Ridge).
DOE Joint Genome Institute (DOE).
Figure 1A maximum likelihood phylogenetic tree of the genome sequenced members of the genus . The tree was rooted using Cupriavidus necator N-1, Bordetella pertussis Tohama I, and Neisseria meningitides MC58. Bootstrap analysis scores are indicated for each node. The major Burkholderia clades (Clades I and II) and their main sub-clades are indicated by brackets.
Figure 2A maximum likelihood tree based on the 16S rRNA gene sequences of 97 members of the genus . Accession numbers for the 16S rRNA sequenced used for each organism are provided in the brackets following the name of the organism. The tree was rooted using four species from the genera Cupriadivus and Ralstonia. Bootstrap analysis scores are indicated for each node. The major Burkholderia clades (Clades I and II) and the subclades within Clade I are indicated by brackets.
Figure 3Partial sequence alignments of (A) a periplasmic amino acid-binding protein showing a 1 amino acid deletion identified in all members of Clade I of the genus . These CSIs were not found in the sequence homologs of these proteins from any other sequenced bacteria. In each case, sequence information for a Burkholderia species and a limited number other bacteria are shown, but unless otherwise indicated, similar CSIs were detected in all members of the indicated group and not detected in any other bacterial species in the top 250 BLAST hits. The dashes (–) in the alignments indicate identity with the residue in the top sequence. GenBank identification (GI) numbers for each sequence are indicated in the second column. Sequence information for other CSIs specific to the members of Clade I and Clade II of the genus Burkholderia are presented in Supplemental Figures 1–5 and Supplemental Figure 6, respectively, and their characteristics are summarized in Table 2.
Conserved signature indels specific for the two major clades within the genus .
| Periplasmic amino acid-binding protein | 385357135 | Figure | 1 aa del | 135–195 | Clade I |
| Putative lyase | 167724527 | Supplemental Figure | 1 aa del | 70–121 | Clade I |
| 4-hydroxybenzoate 3-monooxygenase | 238023559 | Supplemental Figure | 1 aa ins | 101–171 | Clade I |
| 6-phosphogluconate dehydrogenase, decarboxylating | 330820932 | Supplemental Figure | 1 aa ins | 137–202 | Clade I |
| Putative lipoprotein | 121598811 | Supplemental Figure | 1 aa del | 363–393 | Clade I |
| Sarcosine oxidase subunit alpha | 493818877 | Supplemental Figure | 3 aa ins | 904–965 | Clade I |
| Dehydrogenase | 497456569 | Figure | 1 aa ins | 279–333 | Clade II |
| LysR family transcriptional regulator | 187919777 | Supplemental Figure | 2 aa del | 260–294 | Clade II |
The region of the specified protein that contains the indel.
Figure 4Partial sequence alignments of (A) a histidine utilization repressor showing a 2 amino acid insertion (boxed) identified in all members of the . These CSIs were not found in the sequence homologs of these proteins from any other sequenced bacteria in the top 250 BLAST hits. Sequence information for other CSIs specific to subclades within Clade I of the genus Burkholderia are presented in Supplemental Figures 7–15 and their characteristics are summarized in Table 3.
Conserved signature indels specific for groups within Clades I and II.
| Histidine utilization repressor | 172064454 | Figure | 2 aa ins | 157–196 | Clade Ia |
| Molybdate ABC transporter substrate-binding protein | 189352411 | Supplemental Figure | 1 aa ins | 110–158 | Clade Ia |
| Acid phosphatase | 221203041 | Supplemental Figure | 1 aa ins | 305–338 | Clade Ia |
| Periplasmic oligopeptide-binding protein | 488606492 | Figure | 1 aa ins | 332–372 | Clade Ib |
| OpgC protein | 53716883 | Supplemental Figure | 1 aa ins | 137–204 | Clade Ib |
| Polysaccharide deacetylase family protein | 167725414 | Supplemental Figure | 1 aa ins | 29–63 | Clade Ib |
| Thioredoxin domain protein | 497613277 | Supplemental Figure | 1 aa ins | 247–294 | Clade Ib |
| SMP-30/gluconolaconase/LRE-like region-containing protein | 238024002 | Figure | 1 aa ins | 403–438 | Clade Ic |
| Cation efflux protein | 330820376 | Supplemental Figure | 1 aa ins | 129–160 | Clade Ic |
| putative peptidoglycan-binding LysM/M23B peptidase | 238024763 | Supplemental Figure | 1 aa ins | 155–198 | Clade Ic |
| SMP-30/gluconolaconase/LRE-like region-containing protein | 238024002 | Supplemental Figure | 2 aa del | 80–130 | Clade Ic |
| hypothetical protein bgla_2g22890 | 330821370 | Supplemental Figure | 1 aa ins | 322–358 | Clade Ic |
| 3-phosphoglycerate dehydrogenase | 494056927 | Figure | 1 aa ins | 61–100 | Clade IIa |
| Hypothetical protein BYI23_A021470 | 377821591 | Supplemental Figure | 1 aa del | 16–76 | Clade IIa |
| Prepilin peptidase | 377821714 | Supplemental Figure | 1 aa ins | 179–230 | Clade IIa |
| Uracil-DNA glycosylase | 495619839 | Supplemental Figure | 2 aa ins | 191–230 | Clade IIa |
| Hypothetical protein BYI23_A015260 | 377820970 | Supplemental Figure | 2 aa ins | 221–270 | Clade IIa |
| Carboxylate-amine ligase | 377822128 | Supplemental Figure | 1 aa del | 321–362 | Clade IIa |
| NADH:ubiquinone oxidoreductase subunit M | 494056355 | Supplemental Figure | 3 aa ins | 303–348 | Clade IIa |
| NADH:ubiquinone oxidoreductase subunit L | 494056354 | Supplemental Figure | 1 aa ins | 538–585 | Clade IIa |
| ABC transporter | 377821271 | Supplemental Figure | 1 aa del | 59–99 | Clade IIa |
| Hypothetical protein BYI23_A002220 | 377819666 | Supplemental Figure | 2 aa ins | 133–172 | Clade IIa |
| 16S rRNA-processing protein RimM | 494056031 | Supplemental Figure | 1 aa ins | 147–201 | Clade IIa |
| FAD linked oxidase domain-containing protein | 377819737 | Supplemental Figure | 1 aa ins | 106–144 | Clade IIa |
| Preprotein translocase subunit SecD | 495626933 | Supplemental Figure | 1 aa del | 306–341 | Clade IIa |
| Mechanosensitive ion channel protein MscS | 494057445 | Supplemental Figure | 3 aa ins | 101–143 | Clade IIa |
| Hypothetical protein BYI23_A006130 | 377820057 | Supplemental Figure | 1 aa ins | 199–253 | Clade IIa |
| Uroporphyrinogen-III synthase | 494056428 | Supplemental Figure | 7 aa ins | 37–79 | Clade IIa |
| 4-hydroxyacetophenone monooxygenase | 496202984 | Figure | 1 aa ins | 380–449 | Clade IIb |
| Transposase A-like protein | 187923943 | Supplemental Figure | 1 aa ins | 5–50 | Clade IIb |
| Group 1 glycosyl transferase | 186475830 | Supplemental Figure | 1 aa ins | 153–194 | Clade IIb |
| 4-hydroxyacetophenone monooxygenase | 496202984 | Supplemental Figure | 3 aa ins | 145–219 | Clade IIb |
| Undecaprenyl-phosphate glucose phosphotransferase | 209521823 | Supplemental Figure | 1 aa ins | 208–275 | Clade IIb |
| putative flavin-binding monooxygenase-like protein | 186476032 | Supplemental Figure | 3 aa ins | 102–148 | Clade IIb |
The region of the specified protein that contains the indel.
Figure 5Partial sequence alignments of (A) 3-phosphoglycerate dehydrogenase showing a 1 amino acid insertion (boxed) identified in all members of Clade IIa of the genus . These CSIs were not found in the sequence homologs of these proteins from any other sequenced bacteria in the top 250 BLAST hits. Sequence information for other CSIs specific to subclades within Clade II of the genus Burkholderia are presented in Supplemental Figures 16–35 and their characteristics are summarized in Table 3.
Figure 6A summary diagram depicting the distribution of identified CSIs and the proposed names of the two major groups (Clade I and II) within . The major Burkholderia clades are indicated by brackets and highlighting.
Descriptions of the new combinations in the genus .
| SA33 | Aizawa et al., | ||
| NBRC 101816 | |||
| VTCC-D6-6 | |||
| — | Lemaire et al., | ||
| ATCC 23061 | Gillis et al., | ||
| CCUG 32772 | |||
| CFBP 2421 | |||
| CIP 105771 | |||
| DSM 9511 | |||
| ICMP 2807 | |||
| JCM 10487 | |||
| LMG 2129 | |||
| NCPPB 934 | |||
| NRRL B-14296 | |||
| VG1C | Mavengere et al., | ||
| DSM 27239 | |||
| LMG 27731 | |||
| E25 | Aizawa et al., | ||
| BCC 36998 | |||
| NBRC 103871 | |||
| 1S18 | Vandamme et al., | ||
| CCUG 52993 | |||
| LMG 23644 | |||
| TNe-841 | Martínez-Aguilar et al., | ||
| CIP 110324 | |||
| LMG 26416 | |||
| W50D | Coenye et al., | ||
| CCUG 42236 | |||
| CIP 107098 | |||
| JCM 21561 | |||
| LMG 19076 | |||
| NBRC 102488 | |||
| — | Van Oevelen et al., | ||
| MWAP64 | Achouak et al., | ||
| CCUG 42847 | |||
| CIP 106784 | |||
| DSM 13236 | |||
| LMG 18531 | |||
| ATCC 25418 | Yabuuchi et al., | ||
| CCUG 20834 | |||
| CFBP 2429 | |||
| CFBP 3818 | |||
| CIP 105770 | |||
| DSM 50341 | |||
| HAMBI 2159 | |||
| ICMP 512 | |||
| JCM 9310 | |||
| JCM 10488 | |||
| LMG 2155 | |||
| NCPPB 2151 | |||
| LMG 22940 | Vandamme et al., | ||
| CCUG 63063 | |||
| KIS30-44 | Lee et al., | ||
| DSM 24336 | |||
| KACC 12733 | |||
| JPY461 | Sheu et al., | ||
| NKMU-JPY461 | |||
| BCRC 80259 | |||
| KCTC 23308 | |||
| LMG 26031 | |||
| WSM3556 | De Meyer et al., | ||
| LMG 27173 | |||
| HAMBI 3353 | |||
| RR11 | Kang et al., | ||
| KEMC 7302-065 | |||
| JCM 18070 | |||
| HKI 456 | Partida-Martinez et al., | ||
| CIP 109454 | |||
| DSM 19003 | |||
| FeGl01 | Valverde et al., | ||
| CECT 7171 | |||
| DSM 18251 | |||
| LMG 23612 | |||
| Croize P763-2 | Coenye et al., | ||
| CCUG 31961 | |||
| CIP 107096 | |||
| JCM 21562 | |||
| LMG 16225 | |||
| NBRC 102489 | |||
| KMY03 | Kim et al., | ||
| KCTC 12389 | |||
| NBRC 100965 | |||
| ATCC 29195 | Vandamme et al., | ||
| CFBP 4791 | |||
| CIP 105421 | |||
| DSM 50014 | |||
| JCM 10563 | |||
| LMG 14190 | |||
| C4D1M | Viallard et al., | ||
| ATCC 700544 | |||
| CCUG 42231 | |||
| CIP 106649 | |||
| LMG 18924 | |||
| R27 | Tian et al., | ||
| CGMCC 1.11013 | |||
| DSM 25160 | |||
| SA41 | Aizawa et al., | ||
| NBRC 101817 | |||
| VTCC-D6-7 | |||
| — | Lemaire et al., | ||
| LMG 20598 | Goris et al., | ||
| CCUG 43658 | |||
| LMG 22934 | Vandamme et al., | ||
| CCUG 63059 | |||
| — | Van Oevelen et al., | ||
| KP23 | Zhang et al., | ||
| ATCC 700977 | |||
| CCUG 43663 | |||
| CIP 106643 | |||
| DSM 13646 | |||
| JCM 10599 | |||
| LMG 19447 | |||
| A3 | Vandamme et al., | ||
| CCUG 53006 | |||
| LMG 23650 | |||
| PAS44 | Chen et al., | ||
| BCRC 17516 | |||
| LMG 23256 | |||
| — | Van Oevelen et al., | ||
| Br3437 | Chen et al., | ||
| BCRC 17575 | |||
| LMG 23741 | |||
| OX-01 | Otsuka et al., | ||
| DSM 22550 | |||
| NBRC 105797 | |||
| — | Lemaire et al., | ||
| ATCC 33666 | Viallard et al., | ||
| CCUG 20836 | |||
| CFBP 4793 | |||
| CIP 106502 | |||
| DSM 10684 | |||
| JCM 10564 | |||
| LMG 2247 | |||
| NCIMB 11027 | |||
| AC1100 | Coenye et al., | ||
| CCUG 48558 | |||
| LMG 22037 | |||
| STM815 | Vandamme et al., | ||
| LMG 21445 | |||
| CCUG 47179 | |||
| PsJN | Sessitsch et al., | ||
| CCUG 49060 | |||
| LMG 22146 | |||
| HKI 454 | Partida-Martinez et al., | ||
| CIP 109453 | |||
| DSM 19002 | |||
| WSM3937 | De Meyer et al., | ||
| LMG 27174 | |||
| HAMBI 3354 | |||
| — | Lemaire et al., | ||
| Br3407 | Chen et al., | ||
| BCRC 17587 | |||
| LMG 24235 | |||
| CCT 6771 | Brämer et al., | ||
| CCUG 46043 | |||
| CIP 107211 | |||
| IPT 101 | |||
| LMG 19450 | |||
| RP007 | Vanlaere et al., | ||
| CCUG 53604 | |||
| ICMP 13529 | |||
| LMG 24000 | |||
| — | Lemaire et al., | ||
| HU2-65W | Lim et al., | ||
| KCTC 22086 | |||
| LMG 24238 | |||
| SRMrh-20 | Perin et al., | ||
| ATCC BAA-1244 | |||
| LMG 23149 | |||
| GP25-8 | Yoo et al., | ||
| DSM 18235 | |||
| KACC 11589 | |||
| CCUG 49583 | Lim et al., | ||
| JCM 11778 | |||
| KCTC 12081 | |||
| WSM5005 | De Meyer et al., | ||
| LMG 27175 | |||
| HAMBI 3357 | |||
| JPY-345 | Sheu et al., | ||
| NKMU-JPY-345 | |||
| BCRC 80258 | |||
| KCTC 23309 | |||
| LMG 26032 | |||
| LMG 22936 | Vandamme et al., | ||
| CCUG 63060 | |||
| KMY02 | Yang et al., | ||
| KCTC 12388 | |||
| NBRC 100964 | |||
| LMG 22937 | Vandamme et al., | ||
| CCUG 63062 | |||
| CCUG 44527 | Goris et al., | ||
| LMG 20594 | |||
| Ppe8 | Reis et al., | ||
| ATCC BAA-831 | |||
| DSM 15359 | |||
| LMG 22274 | |||
| STM678 | Vandamme et al., | ||
| CCUG 47178 | |||
| LMG 21444 | |||
| LMG 27134 | Vandamme et al., | ||
| CCUG 63061 | |||
| MTl-641 | Caballero-Mellado et al., | ||
| ATCC BAA-744 | |||
| CIP 107921 | |||
| LB400 | Goris et al., | ||
| CCUG 46959 | |||
| LMG 21463 | |||
| NRRL B-18064 | |||
| OP-1 | Lu et al., | ||
| KCTC 23300 |