| Literature DB >> 17708757 |
David F Burke1, Catherine L Worth, Eva-Maria Priego, Tammy Cheng, Luc J Smink, John A Todd, Tom L Blundell.
Abstract
BACKGROUND: Genome-wide association studies of common diseases for common, low penetrance causal variants are underway. A proportion of these will alter protein sequences, the most common of which is the non-synonymous single nucleotide polymorphism (nsSNP). It would be an advantage if the functional effects of an nsSNP on protein structure and function could be predicted, both for the final identification process of a causal variant in a disease-associated chromosome region, and in further functional analyses of the nsSNP and its disease-associated protein.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17708757 PMCID: PMC1978506 DOI: 10.1186/1471-2105-8-301
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Databases used. Summary of databases mentioned in this analysis
| Database | Description | nsSNPs used for this study | Availability |
| dbSNP | Database of Single Nucleotide Polymorphisms | 21471 | [8] |
| OMIM | Online database of Mendelian Inherited dimorphisms in Man | 2256 | [15] |
| HapMap | Database quantifying frequencies of common haplotypes in four populations | 5770 | [3] |
| HGMD | Human Gene Mutation Database of mutations within the coding regions, splicing and regulatory regions of human genes causing inherited disease | N/A | [12] |
| GAD | Genetic association database of medically relevant polymorphisms identified in published scientific papers | N/A | [13] |
| HGVBase | Human Genome Variation Database | N/A | [6] |
| LS-SNP | Database of large scale annotation of predicted effects of human SNPs. | 11220 | [24] |
| Polyphen | Database of predicted functional effect of human nsSNPs | 4459 | [27] |
| HOMSTRAD | Database of protein structures classified by protein family | N/A | [40] |
Figure 1Distribution of fold recognition results. Distribution of the number of HOMSTRAD families predicted by FUGUE for a gene.
Prediction methods. Summary of computational methods mentioned in this analysis
| Method | Predicts effect on Structure | Predicts effect on Function | Requires structural information | Method | Number of nSNPs with predictions | Availability |
| SIFT | Implicit | implicit | No | SequenceConservation | 22728 | [22] |
| LS-SNP | Yes | yes | No | Knowledge-based rules and support vector machine | 11220 | [24] |
| POLYPHEN | yes | yes | Yes | Sequence Profile and structural properties | 4459 | [27] |
| SDM | Yes | no | Yes | Structure based Substitution Table | 5705 | Available On request |
| Imutant | yes | no | Yes | Support vector machine | 8879 | [19] |
| env_score | Yes | no | Yes | Structure based Substitution Table | 10625 | Available On request |
| Crescendo | No | yes | Yes | Structure based Substitution Table | 10625 | Available On request |
| FUGUE | N/A | N/A | No | Sequence Substitution Profile | N/A | [39] |
| Modeller | N/A | N/A | Yes | N/A | [43] |
Figure 2Model of IL21R/IL21 complex. Ribbon representation of the model of IL21R in complex with its ligand, IL21. The residue which is mutated, Arg191, is show is ball-and-stick representation.
Figure 3Model of TCF7 in complex with DNA. Ribbon representation of the model of TCF7 in complex with DNA. The residue which is mutated, Trp336, is show is ball-and-stick representation. The tryptophan residue interacts with the major groove of the DNA.
Figure 4Distribution of predicted functional residues. Pie chart showing percentage of mutations affecting predicted functional residues for OMIM and dbSNP
Figure 5Distribution of predicted structurally deleterious nsSNPs. Pie chart showing percentage of structurally deleterious predictions for OMIM and dbSNP datasets.
Deleterious nsSNPs in dbSNP. Mutations in dbSNP with MAF > 5% predicted to be deleterious by at least four of the methods
| Ensembl Gene Id | Hugoname | Gene description | dbSNP id | Mutation | MAF |
| ENSG00000100116 | GCAT | 2-amino-3-ketobutyrate coenzyme A ligase | rs710187 | R39C | 47 |
| ENSG00000133048 | CHI3L1 | Chitinase-3 like protein 1 | rs880633 | R145G | 47 |
| ENSG00000148773 | MKI67 | Antigen KI-67 | rs1063535 | P2608L | 47 |
| ENSG00000168124 | OR1F1 | Olfactory receptor 1F1 | rs1834026 | F75S | 44 |
| ENSG00000147576 | ADHFE1 | alcoho dehydrogenase | rs1060242 | C401R | 42 |
| ENSG00000122359 | ANXA11 | Annexin A11 | rs1049550 | R230C | 41 |
| ENSG00000137124 | ALDH1B1 | Aldehyde dehydrogenase X | rs2073478 | R107L | 41 |
| ENSG00000132677 | RHBG | Rhesus type B glycoprotein | rs3748569 | G315R | 39 |
| ENSG00000108759 | KRTHA2 | Keratin, type I cuticular HA2 | rs2071563 | T395M | 38 |
| ENSG00000006788 | MYH13 | Myosin heavy chain | rs2074877 | M1071V | 38 |
| ENSG00000114480 | GBE1 | 1,4-alpha-glucan branching enzyme | rs2305246 | R190G | 35 |
| ENSG00000104901 | DKKL1 | Dickkopf-like protein 1 | rs1054770 | G187S | 31 |
| ENSG00000163482 | STK36 | serine/threonine kinase 36 fused homolog | rs1863704 | G1003D | 30 |
| ENSG00000125775 | SDCBP2 | Syntenin-2 | rs1048621 | R138C | 28 |
| ENSG00000176937 | OR52R1 | Olfactory receptor 52R1. | rs7941731 | I128T | 28 |
| ENSG00000172071 | EIF2AK3 | Eukaryotic translation initiation factor 2a3 | rs867529 | S136C | 27 |
| ENSG00000137809 | ITGA11 | Integrin alpha-11 | rs7168069 | L524R | 26 |
| ENSG00000149305 | HTR3B | 5-hydroxytryptamine serotonin receptor 3B | rs1176744 | Y129S | 25 |
| ENSG00000132677 | RHBG | Rhesus type B glycoprotein | rs11586833 | V143D | 25 |
| ENSG00000168787 | OR12D2 | Olfactory receptor | rs2073152 | S121C | 23 |
| ENSG00000072571 | HMMR | Hyaluronan mediated motility receptor | rs299284 | R92C | 13 |
| ENSG00000143412 | ANXA9 | Annexin A9 | rs267733 | D159G | 13 |
| ENSG00000113108 | APBB3 | Amyloid beta A4 protein-binding family B3 | rs250430 | C240R | 13 |
| ENSG00000070371 | CLTCL1 | Clathrin heavy chain 2 | rs807459 | Y279C | 12 |
| ENSG00000070371 | CLTCL1 | Clathrin heavy chain 2 | rs712952 | R1046C | 12 |
| ENSG00000070371 | CLTCL1 | Clathrin heavy chain 2 | rs1633399 | I1394T | 12 |
| ENSG00000165799 | RNASE7 | Ribonuclease 7 | rs1243469 | H116Y | 12 |
| ENSG00000167207 | CARD15 | Caspase recruitment domain protein 15 | rs2066844 | R702W | 11 |
| ENSG00000141504 | SAT2 | Diamine acetyltransferase 2 | rs13894 | R126C | 10 |
| ENSG00000100033 | PRODH | Proline oxidase | rs450046 | R521Q | 7.5 |
| ENSG00000183059 | unknown | Annexin A2 pseudogene 2. | rs855523 | E53A | 7.5 |
| ENSG00000174942 | OR5R1 | Olfactory receptor 5R1 | rs7111634 | D121G | 7.5 |
| ENSG00000122679 | RAMP3 | Receptor activity-modifying protein 3 | rs2074654 | W56R | 5.8 |
| ENSG00000005844 | ITGAL | Leukocyte adhesion glycoprotein 1 | rs1064524 | R214W | 5.8 |
| ENSG00000187021 | PNLIPRP1 | Pancreatic lipase related protein 1 | rs11197744 | N61D | 5 |