| Literature DB >> 25573921 |
Jobie Kirkwood1, David Hargreaves1, Simon O'Keefe1, Julie Wilson2.
Abstract
MOTIVATION: The identification of suitable conditions for crystallization is a rate-limiting step in protein structure determination. The pH of an experiment is an important parameter and has the potential to be used in data-mining studies to help reduce the number of crystallization trials required. However, the pH is usually recorded as that of the buffer solution, which can be highly inaccurate.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25573921 PMCID: PMC4410668 DOI: 10.1093/bioinformatics/btv011
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
AstraZeneca dataset details
| (a) | ||||
|---|---|---|---|---|
| Protein | Source* | Conc. (mg/ml) | Buffer solution | pI |
| Protease K212A | a | 13 | 1 | 4.93 |
| Protease K234A | a | 13.4 | 1 | 5.03 |
| Protease K249A | a | 12.1 | 1 | 5.03 |
| ProteaseE171A | a | 13.2 | 1 | 5 |
| Concanavalin A | b | 15 | 2 | 5.47 |
| Bovine catalase | b | 15 | 2 | 6.79 |
| Pig Trypsin | b | 31 | 2 | 7 |
| Thaumatin | b | 50 | 2 | 8.46 |
| α- Chymo A | b | 15 | 2 | 8.52 |
| Lysozyme | b | 20 | 2 | 9.36 |
| Glycolytic A | a | 30.1 | 3 | 7.52 |
| Glycolytic D | a | 21.9 | 3 | 6.75 |
| Glycolytic wt | a | 9.76 | 3 | 6.75 |
| Kinase 1 | a | 12.2 | 4 | 5.18 |
| (b) | ||||
| 1 | 20 mM MES, 5 mM calcium chloride, 5 mM DTT, 100 mM sodium chloride, 300 mM AHA, pH 6.5 | |||
| 2 | 10 mM PCTP, 100 mM sodium chloride, 0.5 mM TCEP, pH 7.6 | |||
| 3 | 20 mM Tris–HCL, 150 sodium chloride, 2 mM TCEP, pH 7.5 | |||
| 4 | 10 mM Tris, 50 mM sodium chloride, 1 mM DTT, 50 µM Zinc Acetate, pH 7.5 | |||
Note: pI values for the commercially available and in-house protein targets that were screened over a range of pH values are shown in (a) with buffer solution details in (b). Source: a, In-house; b, Sigma-Aldrich.
EMBOSS acid dissociation constants
| Amino acid | Charge | |
|---|---|---|
| Amine group | 8.6 | Positive |
| Carboxyl group | 3.6 | Negative |
| Cysteine (C) | 8.5 | Negative |
| Aspartic acid (D) | 3.9 | Negative |
| Glutamic acid (E) | 4.1 | Negative |
| Histidine (H) | 6.5 | Positive |
| Lysine (K) | 10.8 | Positive |
| Arginine (R) | 12.5 | Positive |
| Tyrosine (Y) | 10.1 | Negative |
Fig. 1.Net charge of the sequence CRV with varying pH
Fig. 2.Organization of data used for linear regression modelling
Fig. 3.Accuracy of pH values. The pH obtained by spectrophotometry is shown plotted against the pH predicted by an artificial neural network in (A) and the pH of the buffer in (B)
Fig. 4.Histogram showing errors in predicted and buffer pH values in relation to the spectrophotometric pH
Fig. 5.Histograms showing the absolute difference between the pI and the closest pH at which crystals were obtained for proteins in the SGC dataset. (a) shows group 1 (structure determined), (b) shows group 2 (diffraction to at least 3.6 Å), (c) shows group 3 (at least one protein crystal) and (d) shows group 4 (crystal to be followed up)