| Literature DB >> 25803742 |
Jason W Sahl1, Christopher J Allender2, Rebecca E Colman3, Katy J Califf2, James M Schupp3, Bart J Currie4, Kristopher E Van Zandt5, H Carl Gelhaus5, Paul Keim1, Apichai Tuanyok6.
Abstract
Burkholderia pseudomallei is the causative agent of melioidosis and a potential bioterrorism agent. In the development of medical countermeasures against B. pseudomallei infection, the US Food and Drug Administration (FDA) animal Rule recommends using well-characterized strains in animal challenge studies. In this study, whole genome sequence data were generated for 6 B. pseudomallei isolates previously identified as candidates for animal challenge studies; an additional 5 isolates were sequenced that were associated with human inhalational melioidosis. A core genome single nucleotide polymorphism (SNP) phylogeny inferred from a concatenated SNP alignment from the 11 isolates sequenced in this study and a diverse global collection of isolates demonstrated the diversity of the proposed Animal Rule isolates. To understand the genomic composition of each isolate, a large-scale blast score ratio (LS-BSR) analysis was performed on the entire pan-genome; this demonstrated the variable composition of genes across the panel and also helped to identify genes unique to individual isolates. In addition, a set of ~550 genes associated with pathogenesis in B. pseudomallei were screened against the 11 sequenced genomes with LS-BSR. Differential gene distribution for 54 virulence-associated genes was observed between genomes and three of these genes were correlated with differential virulence observed in animal challenge studies using BALB/c mice. Differentially conserved genes and SNPs associated with disease severity were identified and could be the basis for future studies investigating the pathogenesis of B. pseudomallei. Overall, the genetic characterization of the 11 proposed Animal Rule isolates provides context for future studies involving B. pseudomallei pathogenesis, differential virulence, and efficacy to therapeutics.Entities:
Mesh:
Year: 2015 PMID: 25803742 PMCID: PMC4372212 DOI: 10.1371/journal.pone.0121052
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Details of isolates sequenced in current study.
| Isolate | Isolation source | Isolation country | Isolation Year | Passages | SRA | Genbank Accession | BEI accession |
|---|---|---|---|---|---|---|---|
| 1106a | liver abscess | Thailand | 1993 | 10 | SRX263957 | N/A | NR-44208 |
| K96243 | blood | Thailand | 1996 | 22 | SRR797065 | N/A | NR-44206 |
| MSHR305 | brain | Australia | 1994 | 10 | SRX263963 | N/A | NR-44225 |
| MSHR668 | blood | Australia | 1995 | 11 | SRX259746 | N/A | NR-44224 |
| 406e | toe swab | Thailand | 1988 | 10 | SRX256398 | AQTK00000000 | NR-44207 |
| NCTC 13392 | unknown | Thailand | 1996 | unknown | SRX245558 | AOUG00000000 | N/A |
| 1026b | blood | Thailand | 1993 | 9 | SRX259661 | N/A | NR-9910 |
| MSHR5855 | sputum | Australia | 2011 | 7 | SRX526461 | JMMV00000000 | NR-45120 |
| MSHR5858 | sputum | Australia | 2011 | 2 | SRX264496 | AVAK00000000 | NR-45120 |
| HBPUB10134a | sputum | Thailand | 2010 | 10 | SRR796658 | AVAL00000000 | NR-44220 |
| HBPUB10303a | sputum | Thailand | 2011 | 10 | SRX254948 | AVAM00000000 | NR-44221 |
*genome has been sequenced previously
Nucleotide variant information for re-sequencing projects conducted in current study.
| Name | Chromosome | Coordinate | Reference | Query | Locus | Effect | Annotation | Proportion | Depth |
|---|---|---|---|---|---|---|---|---|---|
| K96243 | NC006350.1 | 549058 | G | A | BPSL0500 | non-synonymous | hexosaminidase | 0.99 | 165 |
| K96243 | NC006350.1 | 549059 | T | G | BPSL0500 | synonymous | hexosaminidase | 1.00 | 163 |
| K96243 | NC006350.1 | 549061 | C | T | BPSL0500 | non-synonymous | hexosaminidase | 1.00 | 162 |
| K96243 | NC006350.1 | 549062 | C | T | BPSL0500 | synonymous | hexosaminidase | 1.00 | 165 |
| K96243 | NC006350.1 | 2399742 | C | G | BPSL2010 | non-synonymous | lipid metabolism-like protein | 0.99 | 138 |
| K96243 | NC006350.1 | 2399743 | C | G | BPSL2010 | non-synonymous | lipid metabolism-like protein | 0.99 | 140 |
| K96243 | NC006351.1 | 1607761 | T | C | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 1.00 | 8 |
| K96243 | NC006351.1 | 1607796 | G | C | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 0.98 | 100 |
| K96243 | NC006351.1 | 1607820 | G | C | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 0.99 | 99 |
| K96243 | NC006351.1 | 1607822 | G | C | BPSS1194 | synonymous | peptide synthase/polyketide synthase | 0.99 | 99 |
| K96243 | NC006351.1 | 1607825 | G | C | BPSS1194 | synonymous | peptide synthase/polyketide synthase | 0.97 | 102 |
| K96243 | NC006351.1 | 1607838 | T | G | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 0.99 | 117 |
| K96243 | NC006351.1 | 1607851 | T | C | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 0.99 | 111 |
| K96243 | NC006351.1 | 1607874 | G | T | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 0.97 | 62 |
| K96243 | NC006351.1 | 1607887 | G | C | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 1.00 | 58 |
| K96243 | NC006351.1 | 1607894 | G | A | BPSS1194 | synonymous | peptide synthase/polyketide synthase | 0.99 | 72 |
| K96243 | NC006351.1 | 1607902 | G | C | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 1.00 | 68 |
| K96243 | NC006351.1 | 1607910 | C | A | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 0.95 | 57 |
| K96243 | NC006351.1 | 1607917 | G | C | BPSS1194 | non-synonymous | peptide synthase/polyketide synthase | 1.00 | 48 |
| K96243 | NC006351.1 | 1607997 | C | A | intergenic | N/A | N/A | 0.96 | 49 |
| K96243 | NC006351.1 | 1608005 | T | C | BPSS1195 | stop codon destroyed | non-ribosomal peptide synthase | 1.00 | 53 |
| K96243 | NC006351.1 | 1608012 | G | C | BPSS1195 | non-synonymous | non-ribosomal peptide synthase | 0.98 | 55 |
| K96243 | NC006351.1 | 1608015 | G | T | BPSS1195 | non-synonymous | non-ribosomal peptide synthase | 1.00 | 55 |
| K96243 | NC006351.1 | 1608017 | T | A | BPSS1195 | non-synonymous | non-ribosomal peptide synthase | 1.00 | 55 |
| K96243 | NC006351.1 | 1608029 | G | C | BPSS1195 | synonymous | non-ribosomal peptide synthase | 1.00 | 98 |
| K96243 | NC006351.1 | 1615675 | C | T | BPSS1197 | non-synonymous | BPSS1197 | 0.98 | 192 |
| K96243 | NC006351.1 | 1764438 | G | T | intergenic | N/A | N/A | 1.00 | 96 |
| K96243 | NC006351.1 | 1764448 | G | T | intergenic | N/A | N/A | 0.93 | 101 |
| K96243 | NC006351.1 | 2337386 | A | C | BPSS1703 | stop codon destroyed | hypothetical protein | 1.00 | 188 |
| 1106a | NC_009076.1 | 797819 | T | G | BURPS1106A_0812 | non-synonymous | sensor histidine kinase | 0.99 | 125 |
| 1026b | NC_017832.1 | 2020919 | G | A | BP1026B_II1596 | non-synonymous | type VI secretion system, VGR | 0.91 | 297 |
| 668 | NC_009074.1 | 3755785 | G | C | BURPS668_3852 | synonymous | chemotaxis protein methyltransferase | 0.99 | 289 |
| 668 | NC_009075.1 | 92668 | CG | C | intergenic | N/A | N/A | 0.91 | 276 |
Fig 1A maximum likelihood phylogeny inferred from a concatenation of ~63,000 core-genome single nucleotide polymorphisms (SNPs) identified in the eleven genomes sequenced in this study, shown in red, and a reference set of genomes (S2 Table).
The tree was inferred with RAxML v8 [31, 32] using the ASC_GTRGAMMA model and 100 bootstrap replicates. Filled circles are placed at nodes where the bootstrap support values are >90%.
Fig 2Plots of single nucleotide polymorphism (SNP) density and homoplasy density (HD), across the two chromosomes of the reference isolate, K96243 [30].
The outer ring represents the number of informative SNPs across 1-kb genomic intervals. The inner ring indicates the number of homoplasious SNPs, as determined by a retention index (RI) value <0.5 calculated by Paup [35], divided by the total number of informative SNPs over the same 1-kb genomic interval. HD and SD values were visualized with Circos [36].
Annotation for unique genes identified in genomes sequenced in the current study.
| Genome | closest BLAST match | closest BLAST annotation | nearest BLAST organism | protein ID (%) | query length |
|---|---|---|---|---|---|
| MSHR5858 | BUPH_05469 | integrase catalytic subunit |
| 99 | 99 |
| NCTC13392 | BDB_110343 | hypothetical | blood disease bacterium R229 | 83 | 99 |
| NCTC13392 | BDB_110341 | hypothetical | blood disease bacterium R229 | 82 | 98 |
| 10134a | YP_582472 | tyrosine-based site-specific recombinase |
| 64 | 98 |
| 10134a | YP_005995055 | hypothetical |
| 62 | 98 |
| 10134a | YP_005995053 | putative integrase |
| 49 | 98 |
| 10134a | WP_017232947 | hypothetical |
| 91 | 99 |
| 10134a | WP_017232948 | DEAD/DEAH box helicase |
| 91 | 100 |
| 10134a | WP_008918033 | N-6 DNA methylase |
| 78 | 99 |
| 10134a | WP_017232950 | ATP-dependent helicase |
| 87 | 99 |
| 10134a | WP_006395564 | hypothetical |
| 60 | 83 |
| HBPUB10303a | YP_443256 | type I restriction-modification system |
| 78 | 72 |
| HBPUB10303a | YP_443255 | hypothetical |
| 91 | 99 |
| HBPUB10303a | YP_005028223 | hypothetical |
| 74 | 99 |
| HBPUB10303a | WP_008248767 | hypothetical |
| 51 | 99 |
| HBPUB10303a | YP_006030638 | helicase domain-containing protein |
| 79 | 99 |
| MSHR5855 | Bamb_2400 | phage integrase family protein |
| 89 | 96 |
| MSHR5855 | BTQ_1983 | putative membrane proten |
| 99 | 100 |
| MSHR5855 | BTJ_373 | hypothetical |
| 99 | 100 |
| MSHR5855 | bglu_1g24070 | hypothetical |
| 91 | 99 |
| MSHR5855 | BTI_1944 | hypothetical |
| 99 | 100 |
| MSHR5855 | BTI_1943 | hypothetical |
| 78 | 100 |
| MSHR5855 | BTI_1942 | helix-turn-helix family protein |
| 99 | 100 |
| MSHR5855 | Rpic12D_1056 | lipoprotein releasing system |
| 98 | 100 |
Fig 3A heatmap of blast score ratio (BSR) values [44] calculated from a known set of virulence factors characterized in B. pseudomallei (S3 Table) with the large-scale blast score ratio (LS-BSR) pipeline [43].
A maximum likelihood phylogeny was inferred on a concatenation of single nucleotide polymorphisms (SNPs) and was correlated to the heatmap.
Survival data of 10 strains injected intranasally in BALB/c mice.
| Challenge Strain | Target CFU:10 | Target CFU: 100 | Target CFU: 1000 | Target CFU: 10000 | ||||
|---|---|---|---|---|---|---|---|---|
| CFUs | %survival | CFUs | %survival | CFUs | %survival | CFUs | %survival | |
| K96243 | 13(+/-0) | 57 | 170(+/-21) | 14 | 1581(+/-212) | 0 | 11940(+/-1485) | 0 |
| MSHR406e | 13(+/-0) | 0 | 135(+/-4) | 0 | 1365(+/-106) | 0 | 12840(+/-636) | 0 |
| 1026b | 12(+/-0) | 57 | 127(+/-11) | 0 | 1215(+/-106) | 0 | 11355(+/-785) | 0 |
| 1106a | 12(+/-0) | 100 | 128(+/-23) | 71 | 1220(+/-112) | 29 | 12090(+/-2121) | 0 |
| MSHR305 | 16(+/-1) | 29 | 149(+/-15) | 0 | 1524(+/-21) | 0 | 14100(+/-2121) | 0 |
| MSHR668 | 14(+/-0) | 0 | 122(+/-2) | 0 | 1095(+/-8) | 0 | 12945(+/-912) | 0 |
| MSHR5855 | 16(+/-1) | 0 | 185(+/-48) | 0 | 1575(+/-136) | 0 | 14550(+/-933) | 0 |
| MSHR5858 | 14(+/-2) | 86 | 154(+/-8) | 14 | 1485(+/-119) | 0 | 15750(+/-2630) | 0 |
| HBPUB10303a | 6(+/-0) | 71 | 59(+/-1) | 14 | 537(+/-38) | 0 | 5280(+/-424) | 0 |
| HBPUB10134a | 16 | 0 | 163 | 0 | 1779 | 0 | 18810 | 0 |
Correlations of LS-BSR values with observed differential virulence in BALB/c mice.
| Accession | Annotation | BSR average (high) | BSR average (intermediate) | BSR average (low) | FDR p-value |
|---|---|---|---|---|---|
| BPSS1185 | undefined product | 73.5 | 87.2 | 14 | 0.0000 |
| BPSL2990 | histone H1-like protein | 100 | 71 | 41 | 0.0003 |
| BPSL0016 | general secretory pathway protein L | 100 | 99.8 | 51 | 0.0022 |
| BDL_4286 | response regulator | 97.25 | 87.2 | 49 | 0.0082 |
| BPSS1308 | isoaspartyl peptidase | 100 | 99.6 | 54 | 0.0082 |
| BPSL0859 | N-acetylmuramoyl-L-alanine amidase | 100 | 100 | 55 | 0.0106 |
| BPSS0771 | hypothetical protein | 98 | 81.6 | 50 | 0.0147 |
| BPSS1269 | peptide_synthase/polyketide_synthase | 87.75 | 99.2 | 54 | 0.0324 |
| BPSS1212 | hypothetical protein | 99.75 | 78.4 | 54 | 0.0454 |
*High, intermediate and low virulence determined by intranasal challenge at ~10 colony forming units.
Single nucleotide polymorphisms (SNPs) unique to high virulence isolates.
| Chrom | Coordinate | K96243 call | query call | refAA | derivedAA | locus tag | Annotation |
|---|---|---|---|---|---|---|---|
| NC_006350.1 | 220073 | C | T | R | Q | BPSL0211 | lipid A biosynthesis lauroyl acyltransferase |
| NC_006350.1 | 220163 | C | T | R | Q | BPSL0211 | lipid A biosynthesis lauroyl acyltransferase |
| NC_006350.1 | 240744 | T | C | M | V | BPSL0230 | fliF; flagellar MS-ring protein |
| NC_006350.1 | 534205 | G | T | A | D | BPSL0492 | hypothetical protein |
| NC_006350.1 | 554241 | T | C | A | A | BPSL0504 | rpoH; RNA polymerase factor sigma-32 |
| NC_006350.1 | 1839157 | C | G | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 1839166 | G | A | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 1839172 | T | C | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 1839197 | A | C | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 1839256 | A | G | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 1839334 | C | T | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 1839988 | T | C | K | R | BPSL1583 | hypothetical protein |
| NC_006350.1 | 1840050 | G | A | A | A | BPSL1583 | hypothetical protein |
| NC_006350.1 | 2402720 | T | C | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 2440328 | A | G | D | D | BPSL2041 | hypothetical protein |
| NC_006350.1 | 2553415 | C | G | A | A | BPSL2126 | transport-related, membrane protein |
| NC_006350.1 | 2555149 | G | A | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 2555151 | C | T | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 2820597 | A | G | I | T | BPSL2334 | hypothetical protein |
| NC_006350.1 | 2847891 | T | C | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 3021077 | A | C | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 3403668 | G | T | T | T | BPSL2842 | FAD-binding oxidase |
| NC_006350.1 | 3654489 | C | G | N/A | N/A | intergenic | N/A |
| NC_006350.1 | 3924581 | G | A | H | H | BPSL3305 | cheW; chemotaxis protein |
| NC_006350.1 | 4073158 | T | C | E | G | BPSL3430 | glutamine amidotransferase |
| NC_006351.1 | 702758 | G | A | G | S | BPSS0515 | hypothetical protein |
| NC_006351.1 | 1236140 | T | G | P | P | BPSS0936 | hypothetical protein |
| NC_006351.1 | 1269935 | T | C | N/A | N/A | intergenic | N/A |
| NC_006351.1 | 1398575 | C | T | P | P | BPSS1026 | hypothetical protein |
| NC_006351.1 | 1398581 | A | G | * | W | BPSS1026 | hypothetical protein |
| NC_006351.1 | 1917766 | T | C | N/A | N/A | intergenic | N/A |
| NC_006351.1 | 2455301 | T | G | S | A | BPSS1795 | hypothetical protein |
| NC_006351.1 | 2514965 | G | A | N/A | N/A | intergenic | N/A |
| NC_006351.1 | 2695450 | C | T | N/A | N/A | intergenic | N/A |
| NC_006351.1 | 3000485 | G | A | N/A | N/A | intergenic | N/A |
| NC_006351.1 | 3042744 | A | G | D | D | BPSS2265 | monooxygenase |
| NC_006351.1 | 3097464 | G | A | N/A | N/A | intergenic | N/A |
| NC_006351.1 | 3097471 | C | T | N/A | N/A | intergenic | N/A |
| NC_006351.1 | 3097776 | C | A | N/A | N/A | intergenic | N/A |
| NC_006351.1 | 3160929 | G | A | N/A | N/A | intergenic | N/A |