| Literature DB >> 28288557 |
Christina McCarthy1,2,3, Alejandra Carrea1,2, Luis Diambra4,5.
Abstract
BACKGROUND: For a long time synonymous single nucleotide polymorphisms were considered as silent mutations. However, nowadays it is well known that they can affect protein conformation and function, leading to altered disease susceptibilities, differential prognosis and/or drug responses, among other clinically relevant genetic traits. This occurs through different mechanisms: by disrupting the splicing signals of precursor mRNAs, affecting regulatory binding-sites of transcription factors and miRNAs, or by modifying the secondary structure of mRNAs.Entities:
Keywords: Co-translational folding; Codon pairs; Genetic code; Human diseases; Synonymous codon usage
Mesh:
Substances:
Year: 2017 PMID: 28288557 PMCID: PMC5347174 DOI: 10.1186/s12864-017-3609-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Ribosome-mediated translational attenuation program and bicodons. a Graphical representation of how a sSNP can alter the ribosome-mediated translational attenuation program and, ultimately, final protein conformation. Consequently, this can affect protein function leading to pathological phenotypes. b Simplified graph showing codons (green boxes) and bicodons (red ellipses) within an mRNA sequence. Owing to the superposition of bicodons, each codon is part of two different bicodons in a given open reading frame. Alterations of the ribosome-mediated translational attenuation program, with the resulting alternative protein conformations and functions, could be encoded by bicodons, rather than by codons
Fig. 2Relationship between synonymous bicodons, protein abundances (PA) and pause propensity. a Frequency distributions associated with the bicodons that encode the amino acid pair SK, computed using sequences from the low PA sample (red bars), and from the high PA sample (orange bars). Some bicodons are more frequent within lowly abundant proteins (such as TCCAAG), some within highly abundant ones (such as AGCAAG), and other bicodons have similar frequencies in both groups of proteins (such as TCAAAA, TCAAAG and TCGAAG). b Raster plot of the p-values versus the pause propensity for all bicodons. To improve the visualization of this correlation we plot −S log10[p-value] instead of p-value, where S takes the values +1 or −1 when the bicodon has preference for sequences with low or with high PA, respectively. c Raster plot of the residual scores χ 2 versus the pause propensity and p-values for all bicodons. Small pause propensity values (orange zone) are related to bicodons with high PA, whereas large pause propensity values (red zone) are related to bicodons with low PA. Green lines represent the nine synonymous bicodon variants for the SK amino acid pair that involve a large change in pause propensity (Δ π≥0.754)
Fig. 3Pause propensity variation and Z-score. a Histogram of the pause propensity variation of synonymous bicodon variants associated with sSNPs. b Associated Z-score. The red region indicates the highest 10% pause propensity variation, i.e., those sSNPs with a pause propensity variation larger than Δ π=0.754
Pause propensity variation due to sSNPs linked to clinically relevant genetic diseases or traits
| Disease/trait | Gene | rsID | Bicondon change |
| ||
|---|---|---|---|---|---|---|
| from | to | bicodon | codon | |||
| Macular degeneration | CFHR5 | rs34533956 | GA | GA | 0.91 | 0.51 |
| Longevity | TERT | rs33954691 | CA | CA | 0.84 | 0.71 |
| rs33959226 | GC | GC | 0.83 | 0.55 | ||
| Asthma | SLC6A7 | rs2240794 | GA | GA | 0.11 | 0.51 |
| Pul. sarcoidosis | CARD15 | rs1861759 | GTGCG | GTGCG | 0.99 | 0.17 |
| Tuberculosis | TIRAP | rs7932766 | GC | GC | 0.74 | 0.62 |
| Cystic Fibrosis | CFTR | rs1800092 | AT | AT | 0.09 | 0.17 |
| Coeliac disease | CD44 | rs1071695 | CA | CA | 0.84 | 0.71 |
| APIP | rs1571133 | AC | AC | 0.32 | 0.39 | |
| Crohn’ disease | IRGM | rs10065172 | CTGATG |
| 0.77 | 0.93 |
| Smoking-related cancer | NBS1 | rs709816 | GA | GA | 0.80 | 0.51 |
| rs1061302 | AATCC | AATCC | 0.79 | 0.47 | ||
| Colorectal cancer | ERCC1 | rs11615 | AA | AA | 0.97 | 0.07 |
| Chronic myeloid leukemia | WT1 | rs2229069 | CGCAC | CGCAC | 0.80 | 0.24 |
| rs2227985 | CAGGA | CAGGA | 0.69 | 0.10 | ||
| Non-small-cell lung carcinoma | EGFR | rs2293347 | ACAGA | ACAGA | 0.48 | 0.51 |
| Cervical & vulvar cancer | IL2 | rs2069763 | CT | CT | 0.49 | 0.80 |
| Drug resistance | ABCB1 | rs1045642 | AT | AT | 0.94 | 0.55 |
| CHRNA4 | rs1044396 | CCGAG | CCGAG | 0.26 | 0.74 | |
| Alzheimer | COX6B1 | rs7991 | AAGAC | AAGAC | 0.84 | 0.36 |
| COX6C | rs1130569 | TA | TA | 0.59 | 0.24 | |
| COX8A | rs61759492 | AT | AT | 0.83 | 0.17 | |
| ADHD | NTF3 | rs6332 | CAGCC | CAGCC | 0.46 | 0.47 |
| Huntington | ADORA2A | rs5751876 | GGCTA | GGCTA | 0.99 | 0.24 |
| PADI2 | rs2076615 | GG | GG | 0.93 | 0.77 | |
| Schizophrenia | SYNGR1 | rs74681509 | ACCTT | ACCTT | 0.84 | 0.36 |
| DRD2 | rs6277 | ACTCC | ACTCC | 0.95 | 0.82 | |
| rs6275 | CACCA | CACCA | 0.81 | 0.71 | ||
| TMD | COMT | rs769223 ∗ | GC | GC | 0.67 | 0.55 |
| rs1121923 | GT | GT | 0.89 | 0.13 | ||
| Type III | rs248 | GA | GA | 0.34 | 0.10 | |
| hyperlipidemia | LPL | rs45607438 | CA | CA | 0.89 | 0.71 |
| rs316 | AAGAC | AAGAC | 0.91 | 0.39 | ||
| Chronic hepatitis C | IRF7 | rs1061501 | CG | CG | 0.84 | 0.17 |
| Osteoporosis | CD44 | rs11033026 | CATGA | CATGA | 0.93 | 0.10 |
We computed the Z-score associated to the change in the pause propensity values, or to the change in the differential RSCU, as a consequence of single nucleotide mutations in a subset of 22 human diseases or traits selected from [41]. The first column lists the diseases/traits, the following columns list the affected genes, the rs IDs, the bicodon changes, and Z-scores. Only the variant associated with larger Z-score is listed, for a more complete table, please see Additional file 3: Table S1. ∗ is validated by HapMap