| Literature DB >> 22808083 |
Sandeep Kumar Srivastava1, Savitha Gayathri, Babu A Manjasetty, Balasubramanian Gopal.
Abstract
Experimental conditions or the presence of interacting components can lead to variations in the structural models of macromolecules. However, the role of these factors in conformational selection is often omitted by in silico methods to extract dynamic information from protein structural models. Structures of small peptides, considered building blocks for larger macromolecular structural models, can substantially differ in the context of a larger protein. This limitation is more evident in the case of modeling large multi-subunit macromolecular complexes using structures of the individual protein components. Here we report an analysis of variations in structural models of proteins with high sequence similarity. These models were analyzed for sequence features of the protein, the role of scaffolding segments including interacting proteins or affinity tags and the chemical components in the experimental conditions. Conformational features in these structural models could be rationalized by conformational selection events, perhaps induced by experimental conditions. This analysis was performed on a non-redundant dataset of protein structures from different SCOP classes. The sequence-conformation correlations that we note here suggest additional features that could be incorporated by in silico methods to extract dynamic information from protein structural models.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22808083 PMCID: PMC3392262 DOI: 10.1371/journal.pone.0039993
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Summary of the dataset of molecular models examined for structural variations and conformational selection by experimental methods.
(A) The initial dataset of proteins was compiled for a representative sampling of folds and families. After selecting protein-structural pairs based on experimental and sequence criteria, the dataset for analysis included 31 different protein pairs across five different structural classes. (B) Bar diagrams represent the protein-protein, protein-nucleic acid complexes and peptides used in this study. Dark blue bars in all the classes represent the initial selection from a set of 183 protein-protein complexes, 82 protein-nucleic acid complexes and 110 peptide structures. The final composition of this dataset (shown here in gray and light blue bars) is based on the sequence and structural criteria described in the methods section of this manuscript.
Comparison between X-ray and NMR structures in different classes of proteins.
| α-class of proteins | ||||
|
|
|
|
|
|
| 1K96/1A03 | 90 | 42∶51 - LTIGSKLQDA | α-helix in crystal structure; Unstructured in solution | α-helix |
| 1GU2/1E8E | 100 | 28∶36 - ITDGKIFFN | α-helix in crystal structure; Unstructured in solution | β-strand +Coil |
| 48∶54 - ACASCHT | α-helix in crystal structure; Hydrogen bonded turn in solution | β-strand +Coil | ||
| 61∶70- GKNIVTGKEI | β-strand and turn in crystal structure; β- bridge in solution | β-strand +Coil | ||
| 1NZN/1PC2 | 78.1 | 5∶13 - EAVLNELVSVED | α-helix in crystal structure; Unstructured in solution | α-helix |
| 1I27/1NHA | 86.6 | 478∶483- QTKKTGL | α-helix in crystal structure; Unstructured in solution | α-helix |
| 1OMR/1JSA | 100 | 97∶109- TNQKLEWAFSLY | 310-helix and α-helix in crystal structure; Unstructured in solution | α-helix |
| 1HH5/1F22 | 100 | 17∶29 - HKAHAEKLGCDAC | α-helix and 310-helix in crystal structure; α-helix and coil in solution | Coil |
| 61∶66 - KCGGCH | α-helix in crystal structure; Unstructured in solution | Coil + β-strand | ||
| 1H0A/1INZ | 88.9 | 3∶15 - TSSLRRQMKNIVH | α-helix in crystal structure; Unstructured in solution | α-helix |
| 5P2P/1SFV | 91.9 | 18∶22 - PLMDF | α-helix in crystal structure; Unstructured in solution | Coil |
| 113∶115- KEH | 310-helix and β-bridge in crystal structure; β-bridge in solution | ND | ||
| 120∶123- TKKY | α-helix in crystal structure; Unstructured in solution | ND | ||
|
| ||||
| 1OPA/1B4M | 100 | 27∶35 - FATRKIAVR | α-helix in crystal structure; Unstructured in solution | α -helix |
| 5∶14 - NGTWEMESNE | β-strand in crystal structure; Small β-strand and unstructuredin solution | β-strand | ||
| 72∶75 - EHTK | β-strand and turn in crystal structure; unstructured in solution | ND | ||
| 1XCA/1BLR | 99.3 | 28∶38 - LRKIAVAAASK | α-helix in crystal structure; Unstructured in solution | α -helix |
| 127∶137- DVVCTRVYVRE | β-strand in crystal structure; Small β-strand and turn in solution | β-strand | ||
| 60∶66 - TTEINFK | β-strand in crystal structure, unstructured in solution | β-strand | ||
| 2GIM/1FA4 | 99.1 | 2∶7 - ETYTVKL | β-strand in crystal structure; Unstructured in solution | β-strand |
| 52∶60 - SADLAKSLS | α-helix in crystal structure; Unstructured in solution | α-helix | ||
| 83∶89 - GEYTFYC | β-strand in crystal structure; Unstructured in solution | β-strand | ||
| 90∶96 - EPHRGAG | α-helix in crystal structure; Unstructured in solution | Coil | ||
| 1SPD/1RK7 | 96.1 | 41∶48 - GLHGFHVH | β-strand in crystal structure; Unstructured in solution | β-strand |
| 85∶89 - NVTA | β-strand in crystal structure; Unstructured in solution | ND | ||
| 97∶99 - VSI | β-strand in crystal structure; β-strand in solution | ND | ||
| 116∶120- TLVVH | β-strand in crystal structure; Unstructured in solution | ND | ||
| 54∶60 - TAGCTSA | Turn in crystal structure; Unstructured in solution | Coil | ||
| 132∶137- EESTKT | α-helix in crystal structure; Hydrogen bonded in solution | Coil | ||
| 1J2A/1CLH | 99.4 | 38∶45 - SGFYNNTT | Hydrogen bonded turn and β-sheet; Unstructured in solution | Coil |
| 48∶57 - RVIPGFMIQG | Anti-parallel β-sheet in crystal structure; Short anti-parallelβ-sheet in solution | Anti-parallel β-sheet | ||
| 77∶80 - ADNG | 310-helix in crystal structure; Unstructured in solution | ND | ||
| 1IAZ/1KD6 | 99.2 | 8∶15 - VIDGSALS | β-strand and 310-helix in crystal; Hydrogen bonded turnin solution | β-strand + Coil |
| 129∶138 - DQRMYEELYY | α-helix in crystal structure; Short α-helix followed by unstructuredregion in solution | α-helix | ||
| 1WHO/1BMW | 100 | 4∶8 - VTFTV | β-strand in crystal structure; Unstructured in solution | ND |
| 16∶23 - HLAVLVKY | β-strand in crystal structure; Isolated β-bridge mostly unstructuredin solution | β-strand | ||
| 28∶34 - MAEVELR | β-strand in crystal structure; Small β-strand mostly unstructuredin solution | β-strand | ||
| 51∶55 - VWTFD | β-strand in crystal structure; Unstructured, isolated β-strand,bridge | ND | ||
| 64∶70 - FNFRFLT | β-strand in crystal structure; Unstructured, isolated β-bridge | β-strand | ||
| 75∶82 - KNVFDDVV | β-strand in crystal structure; Unstructured, isolated β-strand,bridge in solution | β-strand+ Coil | ||
|
| ||||
| 2SAK/1SSN | 89 | 38∶48 - ELLSPHYVEFP | β-strand in X-ray structure; Unstructured in solution | Coil+ β-strand |
| 76∶81 - FRVVEL | β-strand in crystal structure; Unstructured in solution | β-strand | ||
| 1QVE/1HPW | 96.1 | 31∶55 - AQLSEAMTLASGLKTKVSDIFSQDG | Two helices connected through a turn in crystal structure;Single helix in solution | α-helix-coil-α-helix |
| 78∶88 - VAKVTTGGTA | β-strand in crystal structure; Unstructured in solution | β-strand | ||
| 1C44/1QND | 97.6 | 90∶95 - PQSAFF | α-helix-coil-β-strand in crystal structure; Hydrogen bondedturn in solution | α-helix-coil-β-strand |
| 99∶102- LKIT | β-strand in crystal structure; Unstructured in solution | ND | ||
| 105∶112- MGLAMKLQ | α-helix in crystal structure; Hydrogen bonded turn in solution | α-helix | ||
| 3IL8/1IKM | 100 | 19∶28 - PKFIKELRVI | 310-helix followed by β-strand in crystal structure; β-bridge andβ-strand in solution | Coil + β-strand |
| 66∶72 - LKRAENS | α-helix in crystal structure; Unstructured in solution | α-helix | ||
| 1TN3/1RJH | 86.1 | 58∶68 - MKCFLAFTQTK | β-strand in crystal structure; Unstructured in solution | β-strand + Coil |
|
| ||||
| 1RRF/1U81 | 90.6 | 18∶24 - MRILMVG | β-strand in crystal structure; Unstructured, isolated β-strand,bridge in solution | α-helix + β-strand |
| 43∶48 - VTTIPT | β-strand in crystal structure; Unstructured, isolated β-bridgein solution | β-strand | ||
| 76∶92- PLWRFQNTQGLIFVV | 310-helix, unstructured followed by β-strand in crystal structure;α-helical followed by β-strand in solution | α-helix-coil- β-strand | ||
| 99∶113- RVNEAREELMRMLAE | α-helix in crystal structure; Unstructured, short helix, turnin solution | α-helix | ||
| 1EZ9/1EZO | 99.7 | 145∶147- SAL | β-strand in crystal structure; unstructured in solution | ND |
| 222∶227- TAMTIN | β-strand in crystal structure; unstructured in solution | ND | ||
| 258∶266- FVGVLSAGI | β-strand in crystal structure; unstructured in solution | β-strand | ||
| 305∶311- KSYEEEL | α-helix in crystal structure; turn and short helix in solution | α-helix | ||
| 5P21/1CRP | 99.4 | 37∶46 - EDSYRKQVVI 49∶58 - ETCLLDILDT | β-strand-turn- β-strand in crystal structure; Shortened β-strandin solution | β-strand-coil- β-strand |
|
| ||||
| 1PSP/1PCP | 100 | 5∶10 - ACRCSR | α-helix in crystal structure; turn in solution | β-strand |
| 13∶15- PKN | 310-helix in crystal structure; unstructured in solution | ND | ||
| 55∶59 - SEECV | 310-helix in crystal structure; turn in solution | ND | ||
| 61∶64 - QVSA | 310-helix in crystal structure; turn in solution | ND | ||
| 1NTN/1W6B | 97.3 | 50∶52 - ESY | Turn in crystal structure; unstructured in solution | ND |
| 62∶68 - NCNPHPK | Mix of turn in crystal structure; 310-helix in solution | Coil | ||
| 1BRF/1RWD | 94.3 | 2∶13 - KWVCKICGYIYD | β-strand-turn- β-strand in crystal structure; isolated β-bridgein solution | β-strand |
| 43∶51 - APKSEFEKL | Mix of β-bridge, 310-helix and β-strand in crystal structure; unstructuredin solution | α-helix | ||
| 9PTI/1OA5 | 100 | 3∶6 - DFCL | 310-helix in crystal structure; unstructured in solution | ND |
| 1RDG/1E8J | 100 | 18∶24 - GDPDSGI | Mix of β-bridge, 310-helix and β-bridge in crystal structure; unstructuredin solution | Coil |
| 30∶33 - FEDL | 310-helix in crystal structure; unstructured in solution | ND | ||
| 44∶49 - ASKDAF | Mix of β-bridge, 310-helix in crystal structure; unstructuredin solution | ND | ||
ND - Not Determined.
Structural variations in protein complexes.
|
|
|
|
|
|
|
| 1 | 3HRY/3K33 | 50–73: AALDAEFASLFDTLDSTNKELVNR | α-helix in complex, turns and coil inindividual protein structure | α-helix | 72–73 |
| 2 | 3FII/1SFC | 27–57: TSNRRLQQTQAQVDEVVDIMRVNVDKVLERD | Largely unstructured in one complexand α-helix in another complex | α-helix | 28–36 |
| 3 | 1N7S/1XTG | 167–204: MGNEIDTQNRQIDRIMEKADSNKTRIDEANQRATKMLG | α-helix in one complex and largelyunstructured in the other | α-helix | 167–169, 171–173, 198, 204 |
| 4 | 3C98/3HD7 | 189–248: KQALSEIETRHSEIIKLENSIRELHDMFMDMAMLVESQGEMIDRIEYNVEHAVDYVERAV | α-helix in one complex, unstructuredwith distorted helix in another complex | α-helix | 189, 248 |
| 5 | 2GRX/1IHR | 164–182: PARAQALRIEGQVKVKFDV | α-helix, β-strand in complex,β-strand in individual protein | α-helix, β-strand | 164–168 |
| 221–235: GSGIVVNILFKINGT | Coil and β-strand in complex,β-strand in individual protein | β-strand | 221, 236 | ||
| 6 | 2JKR/2BP5 | 316–325: LAQKIEVRIP | β-strand in one complex, unstructuredin the other | β-strand | 316–319 |
| 419–434: IKWVRYIGRSGIYETR | β-strand in one complex, unstructuredin the other | β-strand | 431–434 | ||
| 7 | 1CDJ/1G9M | 54–69: RADSRRSLWDQG | α-helix in complex, turn in individualprotein | α-helix | 58–59, 62–64 |
| 12–18: VELTCTA | β-strand in individual protein, coilin complex | β-strand | 12,18 | ||
| 8 | 3B2V/1IVO | 19–32: FEDHFLSLQRMFNN | α-helix in one complex and unstructuredin other | α-helix | 19,32 |
| 92–97: YALAVL | β-strand in one complex and no electrondensity in other | α-helix | – | ||
| 9 | 1BGW/2RGR | 630–682: LQGNDKDYIDLAFSKKKADDRKEWLRQYEPGTVLDPTLKEIPISDFINKELI | α-helix in complex. Unstructured inindividual protein structure | α-helix | 634 |
| 10 | 1K4S/1A36 | 633–710: QRAPPKTFEKSMMNLQTKIDAKKEQLADARRDLKSAKADAKVMKDAKTKKVVESKKKAVQRLEEQLMKLEVQATDREE | α-helix in one complex and unstructuredin the other | α-helix | 633–636,639–640,669–687,700–701,703–710 |
| 11 | 1SER/1SRY | 36–86: EVQELKKRLQEVQTERNQVAKRVPKAPPEEKEALIARGKALGEEAKRLEEA | Unstructured in complex and α-helix inindividual structure | Α-helix | 48, 50–69 |
| 12 | 1HLO/1R05 | 12–27: ADKRAHHNALERKRRD | α-helix in the complex (crystal) andunstructured in the individual protein(NMR) | α-helix | 27-Dec |
| 13 | 1B70/1EIY | 6–85: LAAIQNARDLEELKALKARYLGKKGLLTQEMKGLSALPLEERRKRGQELNAIKAALEAALEAREKALEEAALKEALERER | α-helix in one complex and unstructuredin the other | α-helix | – |
| 14 | 1JYE/1EFA | 5–13: TLYDVAEYA | α-helix in complex, unstructured inindividual protein | α-helix | 11 |
| 16–25: SYQTVSRVVN | α-helix in complex, unstructured inindividual protein | α-helix and β-strand | 16–17 | ||
| 32–46: AKTREKVEAAMAELN | α-helix in complex, unstructured inindividual protein | α-helix | 32–37 | ||
| 15 | 1D1U/1D0E | 67–84: SQEARLGIKPHIQRLLDQ | α-helix in one complex and unstructuredin other | α-helix | 69–71, 83–84 |
| 99–114: LPVKKPGTNDYRPVQD | α-helix in one complex, β-strand and coilin other | Coil | 99–114 | ||
| 237–273: QQGTRALLQTLGNLGYRASAKKAQICQ | α-helix and coil in one complex, β-strandin the other | α-helix | 238–241, 264 | ||
| 16 | 1VS5/2QOU | 19–30: SIVVAIERFVKH | β-strand in one complex, mostlyunstructured in other | β-strand and α-helix | - |
| 34–46: GKFIKRTTKLHVH | β-strand in one complex, mostlyunstructured in other | β-strand | 34–35, 45–46 | ||
| 57–67: VVEIRECRPLS | β-strand in one complex, mostlyunstructured in other | β-strand | – |
Conformational variation in peptide structures.
|
|
|
|
|
|
|
|
|
| 1 | 1QWP | GSNKGAIIGLM | 80% α-helix | 63% β- strand | 3MOQ(A) | 100 | 63% β-strand |
| 1HZ3(A) | 100 | Coil | |||||
| 2BEG(A) | 100 | 72% β-strand | |||||
| 1Z0Q(A) | 100 | 45% α-helix | |||||
| 1IYT(A) | 100 | α-helix | |||||
| 2WK3(C) | 100 | – | |||||
| 2G47(C) | 100 | – | |||||
| 1AML(A) | 100 | 45% α-helix | |||||
| 1BA4(A) | 100 | α-helix | |||||
| 2OTK(C) | 100 | 72% β-strand | |||||
| 2 | 1CFG | TRYLRIHPQSWVHQIALRMEVL | 30% α-helix | 3HNB(M) | 100 | 80% β-strand | |
| 3HNY(M) | 100 | 80% β-strand | |||||
| 3HOB(M) | 100 | 80% β-strand | |||||
| 1D7P(M) | 100 | 80% β-strand | |||||
| 3CDZ(B) | 100 | 80% β-strand | |||||
| 2R7E(B) | 100 | 30% β-strand | |||||
| 3 | 1P5A | AVGIGALFLGFLGAAGSTMGARSX | 25% α-helix | 58% α-helix | 2ARI(A) | 100 | α-helix |
| 3ABI(A) | 80 | β-strand + α-helix | |||||
| 4 | 1OMQ | RQIKIWFQNRRMKWKK | 70% α-helix | 90% β-strand | 1HOM(A) | 100 | 80% α-helix |
| 1AHD(P) | 100 | α-helix | |||||
| 5 | 1DVW | TLAVPGMTCAACPITVKK | Coil | 66% β-strand | 1AFI(A) | 100 | 22% β-strand, 27%α-helix |
| 1AFJ(A) | 100 | 22% β-strand, 27%α-helix | |||||
| 2HQI(A) | 100 | 22% β-strand, 27%α-helix | |||||
| 6 | 1IBN | GLFGAIAGFIENGWEGMIDG | 90% α-helix | 55% α-helix | 3EYM(B) | 100 | Coil |
| 1MQL(B) | 100 | Coil | |||||
| 1MQN(B) | 100 | Coil | |||||
| 5HMG(B) | 100 | Coil | |||||
| 3EYK(B) | 90 | Coil | |||||
| 7 | 1HZ3 | YEVHHQKLVFFAEDVGSNKGAIIGLM | Coil | 80% β-strand | 2BEG(A) | 100 | 80% β- strand |
| 1Z0Q(A) | 100 | 80% α-helix | |||||
| 1IYT(A) | 100 | a-helix | |||||
| 2WK3(C) | 100 | 10% β-strand | |||||
| 3IFN(P) | 100 | – | |||||
| 1AML(A) | 100 | 50% α-helix | |||||
| 1BA4(A) | 100 | 80% α-helix | |||||
| 2OTK(C) | 100 | 80% β-strand | |||||
| 1AMB(A) | 100 | 90% α-helix | |||||
| 8 | 1XV7 | FQWQRNIRKVRX | Coil | 1LFH(A) | 91 | α-helix | |
| 1B0L(A) | 91 | α-helix | |||||
| 9 | 2BP4 | DAEFRHDSGYEVHHQK | 70% α-helix | 50% β-strand | 2WK3 | 100 | Coil |
| 1BA6 | 100 | Coil | |||||
| 1IYT | 100 | 50% α-helix | |||||
| 10 | 2RMV | GNDYEDRYYRENMARYPNQVYYRPVC | Coil | 50% α-helix, 15% β-strand | 3O79 | 96 | 73% α-helix |
| 1XYW | 96 | 73% α-helix | |||||
| 2KU4 | 96 | 45% α-helix |
Figure 2Representative examples of conformational variations.
(A) All α class (B) All β class (C) α+β class (D) α/β class (E) Small proteins. A comprehensive list of these parameters is compiled in Table 1.
Figure 3Conformational variations induced by interactions with proteins and nucleic acids.
Structural differences in (A) human splicing protein Prp-8 (Full length and N-terminal deletion) variants. These structures illustrate sequence length-dependent structural changes. (B) & (C) depict structural changes in protein-protein and protein-nucleic acid complexes.
Figure 4Structural variability in peptide sequences.
(A) Context dependent conformational changes of a peptide from the amyloid β A4 protein (PDB: 1QWP) and (B) C2 domain of Factor VIII (PDB: 1CFG).
Figure 5In silico methods to extract dynamic information.
CONCOORD and temperature factor analysis of Prevent host death protein (Phd: 3HRY) that shows a disordered-to-ordered conformational transition upon forming a complex with the Death on curing protein (Phd-Doc complex: 3K33). The grey bar represents the region in the Phd protein that undergoes structural change upon forming the Phd-Doc complex.
Figure 6Summary of the potential cause of variations in protein structural models.
This data is based on information presented in Tables 1–3. The abbreviations used here are- psipred score: differences between predicted and observed secondary structure; Disorder promoting residues, Chameleon sequences: Classification based on aminoacid composition; Salt, pH, PEG: Effects of ionic strength, pH, high concentration of polyethylene glycol; Packing induced, Technique/Resolution: Differences between solution and crystal structural models.