| Literature DB >> 19503843 |
Anu Chakicherla1, Carol L Ecale Zhou, Martha Ligon Dang, Virginia Rodriguez, J Norman Hansen, Adam Zemla.
Abstract
Here we introduce a quantitative structure-driven computational domain-fusion method, which we used to predict the structures of proteins believed to be involved in regulation of the subtilin pathway in Bacillus subtilis, and used to predict a protein-protein complex formed by interaction between the proteins. Homology modeling of SpaK and SpaR yielded preliminary structural models based on a best template for SpaK comprising a dimer of a histidine kinase, and for SpaR a response regulator protein. Our LGA code was used to identify multi-domain proteins with structure homology to both modeled structures, yielding a set of domain-fusion templates then used to model a hypothetical SpaK/SpaR complex. The models were used to identify putative functional residues and residues at the protein-protein interface, and bioinformatics was used to compare functionally and structurally relevant residues in corresponding positions among proteins with structural homology to the templates. Models of the complex were evaluated in light of known properties of the functional residues within two-component systems involving His-Asp phosphorelays. Based on this analysis, a phosphotransferase complexed with a beryllofluoride was selected as the optimal template for modeling a SpaK/SpaR complex conformation. In vitro phosphorylation studies performed using wild type and site-directed SpaK mutant proteins validated the predictions derived from application of the structure-driven domain-fusion method: SpaK was phosphorylated in the presence of (32)P-ATP and the phosphate moiety was subsequently transferred to SpaR, supporting the hypothesis that SpaK and SpaR function as sensor and response regulator, respectively, in a two-component signal transduction system, and furthermore suggesting that the structure-driven domain-fusion approach correctly predicted a physical interaction between SpaK and SpaR. Our domain-fusion algorithm leverages quantitative structure information and provides a tool for generation of hypotheses regarding protein function, which can then be tested using empirical methods.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19503843 PMCID: PMC2686270 DOI: 10.1371/journal.pcbi.1000401
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Homology model of SpaK based on PDB entries 2c2a and 2ftk.
Modeled region: 219–459. The 218-residue long N-terminal membrane spanning region (residues 1–218) was not modeled. A: Model of the oligomeric state: homodimer. Coloring scheme reflects in each modeled monomer a consecutive ordering of amino acids in the N-to-C-terminal direction, whereby N-most residues are colored blue and C-most residues are red. Blue-cyan (residues 219–300): central four-helix bundle formed by interaction of 2 helixes from each monomer; Green-red (residues 301–459): C-terminal ATPase-c domain. The labels H247 and G392 show the location of two residues that were changed using site-directed mutagenesis to construct mutants for the phosphorylation studies (see Materials and Methods). B: Homology model of SpaK with marked domains: P1 (dark pink; 219–254), P2 (pink helix; 255–305), P3 (brown; 306–310), P4 (red; 311–455), and P5 (pink strand; 456–459) that are considered as 5 separate functional units. Characteristic sequence motifs (“boxes”) are colored as follows: H (yellow), N (plum), G1 (pale green), F (blue), and G2 (green). Highlighted motifs correspond to those in Fig. 1 from [41] (see Table 3).
Examples of pairwise residue-residue correspondences between SpaK, Beryllofluoride Spo0F, and CheA histidine kinase.
|
| |||||
|
|
|
|
|
|
|
| S | 28_A | L | 245_A | 0.411 | 0.076 |
| R | 29_A | A | 246_A | 0.532 | 0.071 |
|
| 30_A |
| 247_A | 0.597 | 0.149 |
| D | 31_A | E | 248_A | 0.668 | 0.119 |
| W | 32_A | I | 249_A | 0.949 | 0.064 |
| M | 33_A | K | 250_A | 1.52 | 0.329 |
| N | 34_A | I | 251_A | 1.505 | 0.044 |
| K | 35_A | P | 252_A | 1.523 | 0.207 |
| L | 36_A | I | 253_A | 1.299 | 0.106 |
| Q | 37_A | T | 254_A | 1.22 | 0.265 |
|
| |||||
|
|
|
|
|
|
|
| L | 403_A | L | 356_A | 0.48 | 0.172 |
| L | 404_A | L | 357_A | 0.67 | 0.163 |
| H | 405_A | N | 358_A | 0.716 | 0.183 |
| L | 406_A | I | 359_A | 0.512 | 0.159 |
| L | 407_A | L | 360_A | 0.334 | 0.271 |
| R | 408_A | T | 361_A | 0.564 | 0.289 |
|
| 409_A |
| 362_A | 0.558 | 0.277 |
| A | 410_A | A | 363_A | 0.623 | 0.202 |
| I | 411_A | V | 364_A | 0.615 | 0.33 |
|
| |||||
|
|
|
|
|
|
|
| E | 446_A | F | 387_A | 0.898 | 0.169 |
| V | 447_A | V | 388_A | 0.354 | 0.13 |
| E | 448_A | K | 389_A | 0.134 | 0.18 |
|
| 449_A |
| 390_A | 0.803 | 0.202 |
| D | 450_A | T | 391_A | 0.595 | 0.323 |
|
| 451_A |
| 392_A | 1.041 | 0.321 |
| R | 452_A | N | 393_A | 0.862 | 0.322 |
|
| 453_A |
| 394_A | 0.758 | 0.62 |
| I | 454_A | F | 395_A | 0.989 | 0.982 |
| D | 455_A | S | 396_A | 2.154 | 0.845 |
|
| |||||
|
|
|
|
|
|
|
| L | 483_A | L | 400_A | 0.819 | 2.499 |
| N | 484_A | K | 401_A | 1.193 | 0.703 |
| F | 485_A | K | 402_A | 1.008 | 0.233 |
| L | 486_A | A | 403_A | 0.84 | 0.306 |
|
| 487_A | T | 404_A | 0.987 | 0.45 |
| V | 488_A | E | 405_A | 1.894 | 0.474 |
| P | 489_A | L | 406_A | 2.433 | 0.365 |
| G | 490_A |
| 407_A | 2.514 | 0.611 |
|
| 491_A | Y | 408_A | 2.078 | 0.773 |
|
| |||||
|
|
|
|
|
|
|
| S | 501_A | G | 418_A | 3.312 | 1.066 |
| G | 502_A | H | 419_A | 0.966 | 1.007 |
| R | 503_A | Y | 420_A | 2.398 | 1.666 |
|
| 504_A |
| 421_A | 1.198 | 1.07 |
| V | 505_A | M | 422_A | 3.453 | 1.131 |
|
| 506_A |
| 423_A | 0.755 | 1.293 |
| M | 507_A | L | 424_A | 1.089 | 0.793 |
Comparisons are made in presumed functional “box” motifs, the highly conserved sequences termed H, N, G1, F, and G2 boxes, characteristic of histidine kinases [40]. 2ftk corresponds to Beryllofluoride (PDB: 2ftk) and 2ch4 corresponds to CheA histidine kinase (PDB: 2ch4). Highly conserved residues among the histidine kinase proteins are indicated in bold type [21],[43]. See Table 2 for column header abbreviations.
Figure 2Homology model of the SpaR N-terminal (residues 1–117) and C-terminal (residues 118–220) domains.
Modeling of the N-terminal domain was based on PDB template 1mvo_A, and the C-terminal domain was based on PDB template 2gwr_A. The conformation between domains was modeled based on 2gwr (response regulator protein MTRA from Mycobacterium tuberculosis). Coloring scheme reflects consecutive ordering of amino acids from the N-terminal region (blue) to the C-terminal region (red). Residues in SpaR that correspond to the functional residues in response regulator 2ftk (Spo0F; see Table 2B) are displayed as sticks.
Residue-residue correspondences between functional motifs in domain-fusion template 2ftk and SpaK (A) or SpaR (B) homology models.
| A | |||||
| 2ftk_A | SpaK | ||||
| Res | ResName | Res | ResName | Distance | RMSD(3) |
| R | 29_A | A | 246_A | 0.508 | 0.14 |
|
| 30_A |
| 247_A | 0.565 | 0.236 |
| D | 31_A | E | 248_A | 0.644 | 0.203 |
Residue.
Residue name in PDB or model file.
Distance between C-alpha carbons (under global superposition).
RMSD(3): Root mean square deviation along the mainchain atoms (N,CA,C,O) averaged over three residues: current and immediate neighbors along peptide chain (local superposition).
X – aspartic acid (ASP) modified to aspartate beryllium trifluoride (BFD).
2ftk_A corresponds to Spo0B, and 2ftk_E corresponds to Spo0F. Letters in bold represent corresponding functional residues. Neighboring residues within 1 position of functional residues are included in order to provide a sequence-structure context in which highlighted residues were located. A) Residue-residue correspondences between histidine phosphorylation site and neighboring residues of 2ftk chain A and those of SpaK. B) Residue-residue correspondences between regions containing 6 functional residues of 2ftk chain E and SpaR.
Candidate domain-fusion templates for structure modeling of a SpaK/SpaR complex.
| Template - SpaK/R | N1 | N2 | N | RMSD | Seq_ID | LGA_S |
| 1f51_A - SpaK_d2 | 181 | 159 | 104 | 2.56 | 6.73 | 42.32 |
| 1f51_E - SpaR_d1 | 119 | 117 | 116 | 1.41 | 25 | 93.11 |
| 2ftk_A - SpaK_d2 | 181 | 159 | 106 | 2.58 | 6.6 | 42.86 |
| 2ftk_E - SpaR_d1 | 119 | 117 | 116 | 1.11 | 24.14 | 95.71 |
| 1th8_A - SpaK_d2 | 132 | 159 | 95 | 2.34 | 17.89 | 42.99 |
| 1th8_B - SpaR_d1 | 115 | 117 | 76 | 2.71 | 7.89 | 39.3 |
| 1thn_A - SpaK_d2 | 136 | 159 | 99 | 2.23 | 17.17 | 45.15 |
| 1thn_B - SpaR_d1 | 114 | 117 | 75 | 2.75 | 6.67 | 38.68 |
| 1tid_A - SpaK_d2 | 136 | 159 | 98 | 2.23 | 17.35 | 44.47 |
| 1tid_B - SpaR_d1 | 119 | 117 | 76 | 2.88 | 6.58 | 38.52 |
| 1til_A - SpaK_d2 | 141 | 159 | 101 | 2.19 | 16.83 | 45.47 |
| 1til_B - SpaR_d1 | 117 | 117 | 71 | 2.96 | 4.23 | 37.11 |
The domains from the structure models of SpaK and SpaR were compared with all structures from PDB. Listed are those domain-fusion templates for which at least one domain from each of SpaK and SpaR had structure similarity LGA_S> = 35%.
The residue ranges in modeled SpaK domains are: SpaK_d1: 219–300 and SpaK_d2: 301–459, and the residue ranges in modeled SpaR domains are: SpaR_d1: 1–117 and SpaR_d2: 118–220.
N1 denotes a number of residues in the structural domain-fusion template.
N2 denotes the number of residues in the corresponding domain from SpaK or SpaR.
N denotes the number of superimposed C-alpha atoms that fit under a distance of 5.0 Angstroms.
RMSD is the root mean square deviation of N corresponding C-alpha atom pairs from the calculated structural alignment.
Seq_ID denotes the sequence identity in % between the domain-fusion template and the corresponding SpaK or SpaR domain calculated from the structural alignment.
LGA_S is a measure of the level of structure similarity [20] identified between the domain-fusion template and the corresponding domain from SpaK or SpaR.
Domains from the structural models of SpaK and SpaR were compared with all structures from PDB. Listed are the domain-fusion templates that for at least one domain from the SpaK or SpaR model had a level of structure similarity LGA_S above 37%. LGA_S scores are reported for alignments between each modeled domain of SpaK or SpaR and a domain-fusion template domain. The residue ranges in modeled SpaK domains were: SpaK_d1: 219–300 and SpaK_d2: 301–459, and the residue ranges in modeled SpaR domains were: SpaR_d1: 1–117 and SpaR_d2: 118–220.
Figure 3Homology model of a SpaK-SpaR complex.
A: Model is based on the A and E chains of SPO0B, a phosphotransferase, complexed with SPO0F, a beryllofluoride (PDB template 2ftk). Blue, red: monomers of SpaK; Green: SpaR. B: Close up view of interacting residues (SpaK: H247; SpaR: D8, D9, D51; shown as stick) believed to mediate transfer of phosphate group from SpaK to SpaR.
Figure 4In vitro phosphorylation studies of SpaK and SpaR.
A, B: SDS-PAGE of 6xHis-SpaK and 6xHis-SpaR in isolation or in combination and at various mass ratios, in the presence of ATP. A: Coomassie blue staining. B: Autoradiography; lane a: molecular weight markers. C: Phosphorimage analysis of SpaK incubated with [g-32P]-ATP (lane 1) followed by addition of 4 mM (lane 2), 10 mM (lane3), or 50 mM non-labeled (cold) ATP. D: PEI cellulose thin-layer chromatography of 6xHis-SpaK in isolation, or in combination with 6xHis-SpaR with and without EDTA.
Figure 5In vitro phosphorylation studies involving SpaK mutants.
A, B: Polyacrylamide gel electrophoresis of 6xHis-SpaR and 6xHis-SpaK wild type or mutants in isolation or in combination, in the presence of ATP. Lanes 1, 7: molecular weight markers. A: Coomassie blue staining. B: Autoradiography. Mutant1: H247Q, Mutant 2: G392A.