| Literature DB >> 16953889 |
Claudia M Romero1, David DeShazer, Tamara Feldblyum, Jacques Ravel, Donald Woods, H Stanley Kim, Yan Yu, Catherine M Ronning, William C Nierman.
Abstract
BACKGROUND: More than 12,000 simple sequence repeats (SSRs) have been identified in the genome of Burkholderia mallei ATCC 23344. As a demonstrated mechanism of phase variation in other pathogenic bacteria, these may function as mutable loci leading to altered protein expression or structure variation. To determine if such alterations are occurring in vivo, the genomes of various single-colony passaged B. mallei ATCC 23344 isolates, one from each source, were sequenced from culture, a mouse, a horse, and two isolates from a single human patient, and the sequence compared to the published B. mallei ATCC 23344 genome sequence.Entities:
Mesh:
Year: 2006 PMID: 16953889 PMCID: PMC1574311 DOI: 10.1186/1471-2164-7-228
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Perfect simple sequence repeats (SSRs) identified in the B. mallei ATCC 23344 genome.
| Chromosome | Coding | Intergenic | Total | ||
| 5' end | Middle | 3' end | |||
| 1 | 1809 | 1811 | 1786 | 1789 | 7195 |
| 2 | 1401 | 1433 | 1310 | 1208 | 5352 |
| Total | 3210 | 3244 | 3096 | 2997 | 12547 |
Locations of the SSRs in the genome are denoted with the coordinates of their start and end points (i.e. match 5' end and 3' end) in the relevant chromosomes (i.e. 1 or 2) and also with their relative positions within the coding region of a gene: 5' end, middle, and 3' end.
Indels within intergenic regions.
| SSR | Nearest ORF | |||||||
| Isolate | Indel | Unit | Reference | Query | Near Promoter | Near Palindrome | Locus | Annotation |
| Lab Culture | 1 | TTGCCCGGCGA | 7 | 6 | yes | no | BMA1136 | Hypothetical protein |
| 2 | TTCGACGC | 29 | 28 | yes | no | BMA2062 | Hypothetical protein | |
| 3 | AGTCGGCA | 38 | 39 | no | no | BMA1136 | Hypothetical protein | |
| 4 | CGATTGCCCGG | 7 | 8 | yes | no | BMA1138 | ABC transporter, putative | |
| 5 | GGGGCTTC | 9 | 8 | no | no | BMA2063 | Transcriptional regulator | |
| 6 | TGCGCGA | 19 | 15 | no | no | BMA2374 | THUMP domain protein | |
| 7 | No (-G) | NA | NA | yes | no | BMAA0389 | Hypothetical protein | |
| 8 | CTGTCGTG | 21 | 22 | no | no | BMAA0376 | Transporter | |
| 9 | GTGCGAT | 19 | 20 | no | no | BMAA1878 | Transcriptional regulator | |
| Mouse Spleen | 1 | GAGGCGT | 26 | 25 | no | no | BMA2774 | Secretory path protein |
| 2 | No (+TT) | NA | NA | yes | no | BMA1596 | Acetyltransferase | |
| 3 | CGCGAGG | 23 | 22 | yes | no | BMAA0247 | Oxidoreductase | |
| 4 | GTGGCGA | 7 | 6 | no | no | BMAA0375 | Transcriptional regulator | |
| 5 | AAGTTCCG | 3 | 4 | yes | no | BMAA0242 | Acyl-CoA dehydrogenase | |
| 6 | CTGTCGTG | 21 | 22 | no | no | BMAA0376 | Transporter | |
| 7 | TGGCGTT | 26 | 27 | yes | no | BMAA0242 | Acyl-CoA dehydrogenase | |
| 8 | GAAAGAGAC | 10 | 11 | yes | no | BMAA0815 | DNA-binding regulator | |
| Horse Lung | 1 | GTGAGCC | 13 | 14 | no | no | BMA0984 | Hypothetical protein |
| 2 | No (-C) | NA | NA | no | yes | BMAA1128 | ABC Transporter | |
| 3 | GGGAAACGCGAAAC | 6 | 5 | yes | no | BMAA1873 | Hypothetical protein | |
| 4 | No (-C) | NA | NA | no | yes | BMAA1868 | Aconitate hydratase | |
| 5 | No (+C) | NA | NA | no | yes | BMAA1420 | Synthetase protein | |
| 6 | No (+T) | NA | NA | no | yes | BMAA1237 | Carboxyvinyltransferase | |
| 7 | GCGAAAC | 5 | 6 | no | yes | BMAA1872 | Chemotaxis protein | |
| 8 | GATGAGC | 19 | 20 | no | yes | BMAA0612 | Signal sequence protein | |
| Human Liver | 1 | GGCAAGTC | 38 | 40 | no | yes | BMA1135 | Drug resistance transporter |
| 2 | No (-C) | NA | NA | no | yes | BMAA1868 | Aconitate hydratase | |
| 3 | GTGCTGTC | 21 | 22 | no | yes | BMAA0375 | Transcriptional regulator | |
| Human Blood | 1 | TTGGCGC | 111 | 109 | no | no | BMAA1866 | Conserved hypothetical protein |
| 2 | AAGCAGC | 42 | 40 | no | yes | BMAA0117 | 6-phosphofructokinase (pfk) | |
| 3 | GTGCTGTC | 21 | 22 | no | yes | BMAA0375 | Transcriptional regulator | |
| 4 | No (-C) | NA | NA | no | yes | BMAA1128 | ABC transporter | |
| 5 | No (-C) | NA | NA | no | yes | BMAA1868 | Aconitate hydratase | |
NA: Not applicable.
Indels within coding regions.
| Isolate | Indel | Reference Protein Length | Frameshift Length (bp change) | Query Protein Length | Locus | Annotation |
| Lab Culture | 1 | 638 aa | 29 aa (+C) | No stop codonb | BMAA1927 | Hypothetical Protein |
| 2 | 154 aa | 50 aa (-T) | 75 aa | BMA1435 | Hypothetical Protein | |
| 3 | 145 aa | 64 aa (-G) | 85 aa | BMA2147 | N utilization substance protein B | |
| 4 | 711 aa | 586 aa (+A) | No stop codonb | BMA2914 | Oxidoreductase | |
| Mouse spleen | 1a | 942 aa | No | 938 aa | BMAA0680 | Penicillin-binding protein |
| 2 | 357 aa | 13 aa (+G) | 321 aa | BMA0161 | Rod shape-determining protein MreC | |
| Horse Lung | 1 | 787 aa | 525 aa (+G) | 721 aa | BMAA0367 | Acetyltransferase, GNAT family |
| 2a | 136 aa | No | 138 aa | BMAA0623 | Hypothetical protein | |
| 3a | 120 aa | 66 aa | 100 aa | BMA2996 | Hypothetical protein | |
| Human Liver | 1a | 659 aa | 52 aa | 62 aa | BMAA0729 | Hypothetical protein |
| 2a | 122 aa | 69 aa | 85 aa | BMA3028 | Conserved domain protein | |
| 3 | No translation | 49 aa (+A) | 361 aa | BMAA1903 | Conserved hypoth. protein | |
| 4 | 685 aa | 164 aa (+T) | 328 aa | BMA0685 | Vit. B12 receptor BtuB, putative | |
| Human Blood | 1a | 122 aa | 69 aa | 85 aa | BMA3028 | Conserved domain protein |
| 2a | 193 aa | 157 aa | No stop codonb | BMAA0789 | Hypothetical protein | |
| 3 | No translation | 49 aa (+A) | 361 aa | BMA1903 | Conserved hypoth. protein |
aCoding region indels within SSRs. Mouse spleen 1a: repeat unit AACACCGAACCG; Horse lung 2a: repeat unit GGTGCC, 3a: repeat unit GAGCGGT; Human liver 1a : repeat unit CGAGTCAT extra copy in reference, 2a: repeat unit GCCGATT extra copy in query; Human blood 1a: GCCGATT extra copy in query, 2a: GCGCCTC two extra copies in reference.
bReference protein lost the stop codon at the original position due to the frameshift; query protein has a new stop codon in a different position.
Expression profile of unpassaged reference strain (ATCC 23344) relative to the human blood isolate (FMH), expressed as the log2 ratio of intensities.
| 2.15 | Rhs element Vgr protein | |
| 1.64 | BMAA1895 | conserved domain protein |
| 1.60 | lipoprotein, putative | |
| 1.56 | hypothetical protein | |
| 1.52 | BMAA1663 | hypothetical protein |
| 1.51 | hypothetical protein | |
| 1.49 | BMAA0810 | YadA-like C-terminal region protein |
| 1.44 | C4-dicarboxylate transport protein | |
| 1.43 | BMAA2044 | conserved hypothetical protein |
| 1.42 | hypothetical protein | |
| 1.42 | BMA0632 | conserved hypothetical protein |
| 1.42 | hypothetical protein | |
| 1.41 | BMAA1999 | hypothetical protein |
| 1.41 | BMAA0682 | hypothetical protein |
| 1.39 | hypothetical protein | |
| 1.38 | conserved hypothetical protein | |
| 1.38 | BMAA0268 | rubrerythrin |
| 1.38 | BMA1006 | hypothetical protein |
| 1.34 | BMA0985 | hypothetical protein |
| 1.34 | hypothetical protein | |
| 1.31 | BMA0040 | conserved hypothetical protein |
| 1.31 | drug resistance transporter, EmrB/QacA family | |
| 1.30 | BMA0813 | conserved hypothetical protein |
| 1.29 | BMA0833 | DNA-binding response regulator |
| 1.28 | acyltransferase family protein | |
| 1.28 | BMAA0059 | conserved hypothetical protein |
| 1.26 | BMAA0976 | dipeptide ABC transporter, permease protein, putative |
| 1.25 | BMAA2019 | hypothetical protein |
| 1.25 | BMAA1885 | membrane protein, putative |
| 1.24 | BMA2676 | DNA-binding response regulator |
| 1.24 | BMA1631 | hypothetical protein |
| 1.20 | BMAA0737 | Rhs element Vgr protein |
| 1.20 | BMA1854 | Ser/Thr protein phosphatase family protein |
| 1.19 | hypothetical protein | |
| 1.18 | BMA0036 | hypothetical protein |
| 1.14 | BMAA1974 | conserved hypothetical protein |
| 1.13 | BMAA0656 | hypothetical protein |
| 1.13 | BMA1633 | dioxygenase, TauD/TfdA |
| 1.12 | BMAA1879 | hypothetical protein |
| 1.11 | BMAA0651 | H-NS histone family protein |
| 1.09 | BMAA0585 | secretory lipase family protein |
| 1.09 | BMAA0076 | conserved domain protein |
| 1.08 | BMAA0178 | hypothetical protein |
| 1.08 | BMAA0053 | membrane protein, putative |
| 1.07 | BMAA0935 | hypothetical protein |
| 1.06 | BMAA0204 | ortho-halobenzoate 1,2-dioxygenase beta-ISP protein OhbA |
| 1.06 | BMAA1888 | hypothetical protein |
| 1.05 | BMAA2035 | stress response protein |
| 1.05 | BMA2983 | ethanolamine ammonia-lyase heavy chain |
| 1.05 | BMAA1652 | MoaC domain protein |
| 1.04 | BMA1132 | hypothetical protein |
| 1.04 | BMAA1916 | hypothetical protein |
| 1.04 | BMAA0112 | hypothetical protein |
| 1.04 | BMAA1627 | type III secretion inner membrane protein SctS |
| 1.02 | BMAA0391 | monooxygenase family protein |
| 1.01 | BMAA0752 | hypothetical protein |
| 1.00 | BMA3275 | oxidoreductase, GMC family |
| 1.00 | BMAA0470 | hypothetical protein |
| 1.00 | BMAA0061 | RNA polymerase sigma-70 factor, ECF subfamily |
| -1.05 | BMAA0866 | hypothetical protein |
| -1.06 | BMA1987 | dTDP-4-dehydrorhamnose reductase |
Genes exhibiting ≥ 2-fold intensity (mRNA abundance) difference are listed. Highlighted genes are also differentially expressed in the human liver isolate (JHU) (see Table 5).
Expression profile of the unpassaged reference strain (ATCC 23344) relative to the human liver isolate (JHU), expressed as the log2 ratio of intensities.
| 1.82 | lipoprotein, putative | |
| 1.81 | BMA3047 | heat shock protein, Hsp20 family |
| 1.79 | Rhs element Vgr protein | |
| 1.68 | BMA3048 | heat shock protein, Hsp20 family |
| 1.46 | C4-dicarboxylate transport protein | |
| 1.24 | BMA0118 | RNA polymerase sigma factor RpoD, putative |
| 1.22 | drug resistance transporter, EmrB/QacA family | |
| 1.19 | hypothetical protein | |
| 1.18 | conserved hypothetical protein | |
| 1.16 | acyltransferase family protein | |
| 1.15 | hypothetical protein | |
| 1.14 | hypothetical protein | |
| 1.11 | hypothetical protein | |
| 1.06 | hypothetical protein | |
| 1.06 | hypothetical protein | |
| 1.04 | BMA0361 | thioredoxin, authentic frameshift |
| 1.01 | hypothetical protein | |
| -1.02 | BMAA0427 | TonB-dependent copper receptor |
| -1.04 | BMA0665 | phosphoadenosine phosphosulfate reductase, putative |
| -1.07 | BMAA1196 | transcriptional regulator, LysR family |
Genes exhibiting ≥ 2-fold intensity (mRNA abundance) difference are listed. Loci in bold type are also differentially expressed in the human blood isolate (FMH) (see Table 4).
Expression ratios of genes at or near sites of indels.
| BMAA1865 | Human liver 2, BMAA1868 |
| BMAA1865 | Human blood 1, BMAA1866 |
| BMAA1865 | Human blood 5, BMAA1868 |
| BMAA0112 | FMH2, BMAA0117 |
| 0.08 | Human liver 1, BMA1135 |
| 0.29 | Human liver 2, BMAA1868 |
| 0 | Human liver 3, BMAA0375 |
| 0.47 | Human blood 1, BMAA1866 |
| -0.34 | Human blood 2, BMAA0117 |
| 0.13 | Human blood 3, BMAA0375 |
| 0.35 | Human blood 4, BMAA1128 |
| 0.42 | Human blood 5, BMAA1868 |
| 0.62 | Human liver 1, BMAA0729 |
| 0.63 | Human liver 2, BMA0328 |
| 0.04 | Human liver 3, BMAA1903 |
| 0.06 | Human liver 4, BMA0685 |
| 0.93 | Human blood 1, BMA3028 |
| -0.15 | Human blood 2, BMAA0789 |
| 0.48 | Human blood 3, BMA1903 |