| Literature DB >> 22369681 |
Yang Hu1, Yunlong Liu, Jeesun Jung, A Keith Dunker, Yadong Wang.
Abstract
BACKGROUND: Recent studies suggest that many proteins or regions of proteins lack 3D structure. Defined as intrinsically disordered proteins, these proteins/peptides are functionally important. Recent advances in next generation sequencing technologies enable genome-wide identification of novel nucleotide variations in a specific population or cohort.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22369681 PMCID: PMC3287498 DOI: 10.1186/1471-2164-12-S5-S2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Schematics of the workflow.
Figure 2Comparisons of average Δ. The ΔDS values the same amino acid change in the different proteins were collected and averaged. Shown are the results for all 150 different amino acid changes.
Rank of top 10 amino acid (AA) changes with + mean ΔDS
| AA changes | Mean Δ | STD |
|---|---|---|
| W → S | 0.17 | ± 0.08 |
| F→ S | 0.15 | ± 0.07 |
| W→ R | 0.14 | ± 0.07 |
| I→ S | 0.14 | ± 0.08 |
| Y→ S | 0.13 | ± 0.07 |
| V → D | 0.13 | ± 0.02 |
| L→ S | 0.13 | ± 0.06 |
| I→ K | 0.13 | ± 0.07 |
| Y→ D | 0.12 | ± 0.03 |
| Y→ N | 0.12 | ± 0.03 |
Rank of top 10 amino acid (AA) changes with - mean
| AA changes | Mean Δ | STD |
|---|---|---|
| K→I | -0.17 | ± 0.03 |
| S→W | -0.16 | ± 0.08 |
| G→W | -0.15 | ± 0.07 |
| E→V | -0.15 | ± 0.05 |
| R→W | -0.15 | ± 0.07 |
| S→C | -0.14 | ± 0.06 |
| S→F | -0.13 | ± 0.07 |
| D→Y | -0.12 | ± 0.06 |
| S→I | -0.12 | ± 0.07 |
| S→Y | -0.11 | ± 0.07 |
Rank of top 10 codon changes with + mean ΔDS
| Codon changes | Mean Δ | STD |
|---|---|---|
| TGG→ AGG | 0.23 | ± 0.002 |
| ATT →AGT | 0.18 | ± 0.038 |
| TAC→ AAC | 0.17 | ± 0.073 |
| TGG→TCG | 0.17 | ± 0.079 |
| TTT→ TCT | 0.16 | ± 0.086 |
| TTA→ TCA | 0.15 | ± 0.059 |
| GTA→ GAA | 0.15 | ± 0.080 |
| TGC→ AGC | 0.14 | ± 0.168 |
| TAT→ TCT | 0.14 | ± 0.065 |
| TTC→ TCC | 0.14 | ± 0.060 |
Rank of top 10 codon changes with - mean ΔDS
| Codon changes | Mean | STD |
|---|---|---|
| AGT→TGT | -0.20 | ± 0.040 |
| AAA→ATA | -0.17 | ± 0.034 |
| GAA→GTA | -0.17 | ± 0.034 |
| TCG→TGG | -0.16 | ± 0.081 |
| TCT→TAT | -0.16 | ± 0.044 |
| AGG→TGG | -0.16 | ± 0.070 |
| AGT →ATT | -0.16 | ± 0.064 |
| GGG→TGG | -0.15 | ± 0.070 |
| CGG→TGG | -0.15 | ± 0.073 |
| GAG→GTG | -0.14 | ± 0.055 |
Number of SNVs in synonymous/non-synonymous region
| Missense | ||||
|---|---|---|---|---|
| Synonymous | |Δ | |Δ | Nonsense | |
| +Δ | -- | 1,572 | 4,345 | -- |
| -- | 2,629 | 5,909 | -- | |
| Total | 7,511 | -- | 10,254 | 310 |
Figure 3Average MAFs for SNVs with different Δ. (A) Comparison of average MAFs for SNVs with |ΔDS| smaller than 0.04. (B) Comparison of average MAFs for SNVs with |ΔDS| greater than 0.04. *** stands for p-value<0.001
Summary of results on GAW17 simulated data
| Trait | # of genes | # of SNVs with | # of SNVs with |Δ | # of SNVs in the answer sheet |
|---|---|---|---|---|
| Q1 | 9 | 8 | 2 | 39 |
| Q2 | 13 | 12 | 7 | 72 |
| disease liability | 15 | 10 | 6 | 51 |
SNVs with |ΔDS |>0.04 in Q1, Q2 and disease liability
| trait | SNV name | Gene name | AA changes | Δ | Major Allele Score |
|---|---|---|---|---|---|
| Q1 | C19S4815 | HIF3A | R→C | -0.10 | 0.81 |
| Q1 | C4S1879 | KDR | V →M | 0.04 | 0.11 |
| Q2 | C9S377 | VLDLR | W→ C | 0.18 | 0.51 |
| Q2 | C9S444 | VLDLR | D→ Y | -0.16 | 0.36 |
| Q2 | C10S3050 | SIRT1 | P → L | -0.16 | 0.33 |
| Q2 | C8S476 | LPL | I→S | 0.15 | 0.17 |
| Q2 | C8S530 | LPL | V→ G | 0.13 | 0.32 |
| Q2 | C6S5446 | VNN3 | V→ I | -0.05 | 0.44 |
| Q2 | C8S442 | LPL | D→N | -0.04 | 0.29 |
| disease Liability | C17S4581 | PRKCA | V → E | 0.19 | 0.37 |
| disease Liability | C18S2492 | PIK3C3 | V→ G | 0.13 | 0.32 |
| disease Liability | C1S9266 | PIK3C2B | S → F | -0.13 | 0.77 |
| disease Liability | C8S900 | PTK2B | S → L | -0.13 | 0.38 |
| disease Liability | C2S2307 | BCL2L11 | M → R | 0.07 | 0.44 |
| disease Liability | C1S9267 | PIK3C2B | P → L | -0.06 | 0.83 |