| Literature DB >> 16545116 |
Colleen D Behrsin1, Chris J Brandl, David W Litchfield, Brian H Shilton, Lindi M Wahl.
Abstract
BACKGROUND: Unigenic evolution is a powerful genetic strategy involving random mutagenesis of a single gene product to delineate functionally important domains of a protein. This method involves selection of variants of the protein which retain function, followed by statistical analysis comparing expected and observed mutation frequencies of each residue. Resultant mutability indices for each residue are averaged across a specified window of codons to identify hypomutable regions of the protein. As originally described, the effect of changes to the length of this averaging window was not fully eludicated. In addition, it was unclear when sufficient functional variants had been examined to conclude that residues conserved in all variants have important functional roles.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16545116 PMCID: PMC1434776 DOI: 10.1186/1471-2105-7-150
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Nucleotide Substitutions in a Random Sample of 18 Unscreened Pin1 Clones.
| A to G | 38 |
| T to C | 35 |
| G to A | 25 |
| C to T | 28 |
| T to A | 11 |
| A to T | 16 |
| C to A | 8 |
| G to T | 5 |
| A to C | 4 |
| T to G | 0 |
| C to G | 0 |
| G to C | 1 |
Estimated mutation probabilities based on 18 unscreened Pin1 clones.
| mA-C and mT-G | 0.0012 |
| mA-G and mT-C | 0.0223 |
| mA-T and mT-A | 0.0082 |
| mC-A and mG-T | 0.0023 |
| mC-G and mG-C | 0.0002 |
| mC-T and mG-A | 0.0095 |
Comparison of fexpected missense values for Pin1 codons.
| Met (ATG) | 1.00 | 1.00 |
| Trp (TGG) | 1.00 | 1.00 |
| Phe (TTT) | 0.77 | 0.76 |
| Lys (AAA) | 0.77 | 0.76 |
| Ala (GCC) | 0.67 | 0.67 |
| Ala (GCG) | 0.67 | 0.67 |
| Gly (GGC) | 0.67 | 0.67 |
| Gly (GGG) | 0.67 | 0.67 |
| Pro (CCC) | 0.67 | 0.67 |
| Pro (CCG) | 0.67 | 0.67 |
| Arg (AGA) | 0.69 | 0.72 |
Boldface indicates rows in which the two methods differ by 5% or more.
Figure 1Mutability plots. Mutability plots were determined as described in the text (grey bars). The mutability of individual residues was averaged over a window of 1, 5, 11 or 25 codons. The hypo- or hypermutability was then plotted as a bar in the center of the specified window and the window was shifted downstream one codon at a time. Individual hypomutable regions, designated A, B, C, and D are indicated on the plot for the 11 codon window. For comparison, the difference between mutability calculated by previous methods (5) and mutability as described in this manuscript is also shown (circles).
Figure 2Region-independent chi-square analysis. Three-dimensional plot illustrating χ2 of each significantly hypomutable region plotted against region length and amino acid residue number. Calculations were performed as described in the text. Only regions that are significant at the 0.005 level are plotted; the whole window is plotted whenever this significance level is achieved. If a residue is involved in more than one significant region of the same length, the region with the highest χ2 value is plotted. Colours indicate χ2 of the region and range from deep red (χ2 > 15, corresponding to α < 0.0001) to pale green (χ2 > 8, α < 0.005). Four hypomutable regions approximating regions A-D (Figure 1) are evident.
Summary of non-mutated residues and corresponding Q values.
| Gly 10 (GGC) | 0.135* |
| Trp 11 (TGG) | 0.010 |
| Arg 21 (CGA) | 0.013 |
| Asn 30 (AAC) | 0.004 |
| Ala 31 (GCC) | 0.135* |
| Ser 32 (AGC) | 0.020 |
| Pro 52 (CCT) | 0.135* |
| Val 55 (GTC) | 0.026 |
| Cys 57 (TGC) | 0.021 |
| His 59 (CAC) | 0.020 |
| Leu 60 (CTG) | 0.058* |
| Leu 61 (CTG) | 0.058* |
| Lys 63 (AAG) | 0.004 |
| His 64 (CAC) | 0.020 |
| Ser 67 (TCA) | 0.026 |
| Trp 73 (TGG) | 0.010 |
| Arg 74 (CGG) | 0.164* |
| Arg 80 (CGG) | 0.164* |
| Glu 84 (GAG) | 0.020 |
| Ala 85 (GCC) | 0.135* |
| Tyr 92 (TAC) | 0.004 |
| Gly 99 (GGA) | 0.135* |
| Leu 106 (CTG) | 0.058* |
| Ser 108 (TCA) | 0.026 |
| Ser 111 (AGC) | 0.026 |
| Asp 112 (GAC) | 0.020 |
| Cys 113 (TGC) | 0.021 |
| Ser 115 (TCA) | 0.026 |
| Ala 116 (GCC) | 0.135* |
| Gly 120 (GGA) | 0.135* |
| Leu 122 (CTG) | 0.058* |
| Ala 137 (GCC) | 0.135* |
| Glu 145 (GAG) | 0.020 |
| Met 146 (ATG) | 0.002 |
| Ser 147 (AGC) | 0.026 |
| Val 150 (GTG) | 0.026 |
| Gly 155 (GGC) | 0.135* |
| His 157 (CAC) | 0.020 |
| Thr 162 (ACT) | 0.025 |
* Q value greater than 5%