| Literature DB >> 17201911 |
Abdel Aouacheria1, Vincent Navratil, Ricardo López-Pérez, Norma C Gutiérrez, Alexander Churkin, Danny Barash, Dominique Mouchiroud, Christian Gautier.
Abstract
BACKGROUND: A promising application of the huge amounts of genetic data currently available lies in developing a better understanding of complex diseases, such as cancer. Analysis of publicly available databases can help identify potential candidates for genes or mutations specifically related to the cancer phenotype. In spite of their huge potential to affect gene function, no systematic attention has been paid so far to the changes that occur in untranslated regions of mRNA.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17201911 PMCID: PMC1774567 DOI: 10.1186/1471-2164-8-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Overview of the EST-based data mining strategy.
| Total SNPs (after algorithm filtering) | ~51,760 |
| Total UTR-SNPs | 20,304 |
| UTR-SNPs with | 1354 |
| UTR-SNPs with | 358 |
| Total SNPs in 5'-UTR | 47 |
| Total SNPs in 3'-UTR | 311 |
Identification of tumor-associated polymorphisms located in human mRNA untranslated regions. UTR-located SNPs are referred to as UTR-SNPs. SNP counts in each analytical step.
Summary of cancer-associated UTR-SNPs.
| Description | SNP ID | UTR | Variation | mRNA secondary structure distance | Putative functional element | |||
| SPARC precursor [P09486/ENST00000231061] | rs1059829 | 38 | ||||||
| Histidine triad nucleotide-binding protein 1 [P49773/ENST00000304043] | 3' | t | 483 | g | 62 | IRES | ||
| Ig alpha-1 chain C region [P01876/ENST00000251006] | 3' | g | 1643 | a | 78 | IRES | ||
| Annexin A1 [P04083/ENST00000257497] | rs3739956 | 5' | a | 58 | g | 8 | ||
| Lithostathine 1 alpha precursor [P05451/ENST00000233735] | 3' | c | 718 | t | 0 | |||
| 26S proteasome-associated pad1 homolog [NM_005805/ENST00000263639] | rs9713 | 5' | a | 460 | t | 12 | ||
| Glucagon precursor [P01275/ENST00000233604] | 3' | g | 698 | a | 6 | IRES * | ||
| mitochondrial ribosomal protein L41 [NM_032477/ENST00000332183] | rs698148 | 3' | g | 526 | c | 6 | IRES | |
| Actin, cytoplasmic 2 (Gamma-actin) [P02571/ENST00000331925] | 3' | g | 1572 | a | 0 | IRES | ||
| Biglycan precursor [P21810/ENST00000331595] | 5' | g | 94 | t | 4 | 15-LOX-DICE | ||
| Mitochondrial processing peptidase alpha subunit [Q10713/ENST00000298536] | rs7628 | 3' | a | 1933 | g | 68 | ADH_DRE | |
| Beta-2-glycoprotein I precursor [P02749/ENST00000205948] | rs6933 | 3' | c | 1090 | t | 0 | ||
| SH3-containing GRB2-like protein 1 [Q99961/ENST00000269886] | 3' | c | 2116 | t | 38 | |||
| dynactin p62 subunit [NM_016221/ENST00000255263] | 3' | c | 3575 | t | 26 | IRES | ||
| Interleukin-4 receptor alpha chain precursor [P24394/ENST00000170630] | rs8832 | 3' | a | 3135 | g | 52 | ||
| Ran GTPase [P17080/ENST00000254675] | 3' | g | 900 | a | 0 | IRES * | ||
| Chemokine-like factor super family member 6 [Q9NX76/ENST00000205636] | 3' | t | 3549 | a | 14 | |||
| 60S ribosomal protein L29 [P47914/ENST00000294189] | 3' | c | 613 | a | 4 | |||
| 14-3-3 protein zeta/delta [P29312/ENST00000297569] | rs11551356 | 3' | t | 2351 | c | 0 | ||
| TC4 protein [Q96QB7/ENST00000316561] | 3' | g | 841 | a | 48 | |||
| Metallothionein-IE [P04732/ENST00000306061] | rs708274 | 3' | g | 209 | t | 66 | ||
| Retinoic acid- inducible E3 protein [Q13571/ENST00000294507] | rs1050739 | 3' | g | 2125 | a | 12 | IRES * | |
| Voltage-dependent anion-selective channel protein 2 [P45880/ENST00000298468] | rs11543 | 5' | g | 51 | c | 64 | ||
| PDZ and LIM domain protein 1 [O00151/ENST00000265995] | rs1049989 | 3' | t | 1193 | g | 0 | ||
| Pituitary tumor-transforming protein binding factor [P53801/ENST00000330938] | 3' | c | 2723 | t | 8 | |||
| Mitochondrial import receptor TOM22 homolog [Q9NS69/ENST00000216034] | rs1056691 | 3' | t | 966 | g | 48 | ||
| Ubiquitin-like protein SUMO-1 conjugating enzyme [P50550/ENST00000219558] | rs7302 | 3' | t | 1024 | g | 16 | IRES | |
| Dynein light chain 1 [Q15701/ENST00000242577] | 5' | t | 45 | c | 22 | |||
| 20 kDa nuclear cap binding protein [P52298/ENST00000321256] | 3' | a | 1683 | c | 0 | |||
| Cytochrome P450 1B1 [Q16678/ENST00000260630] | rs162549 | 3' | a | 4412 | t | 6 | ||
| PTD012 protein [NM_014039/ENST00000332038] | 3' | a | 2118 | g | 74 | |||
| Small proline-rich protein 3 [Q9UBC9/ENST00000295367] | rs1134220 | 3' | t | 958 | g | 68 | ||
| Large neutral amino acids transporter small subunit 2 [Q9UHI5/ENST00000316902] | 3' | c | 2689 | t | 0 | |||
| Transforming protein p21A Ki-Ras [P01116/ENST00000311936] | 3' | t | 1260 | c | 10 | IRES | ||
| nucleolar protein family A, member 3 [NM_018648/ENST00000328848] | rs1045238 | 3' | g | 327 | c | 58 | TOP | |
| Heterogeneous nuclear ribonucleoprotein K [Q07244/ENST00000297818] | rs167203 | 5' | c | 156 | g | 10 | ||
| RECS1 protein homolog [Q969X1/ENST00000258412] | 3' | g | 2920 | a | 0 | |||
| G1/S-specific cyclin D2 [P30279/ENST00000261254] | 3' | a | 6471 | c | 20 | |||
| Zinc finger protein 384 [Q8TF68/ENST00000319770] | rs6786 | 3' | t | 3145 | c | 0 | ||
| Myosin regulatory light chain 2 [P19105/ENST00000217652] | rs7811 | 3' | a | 1107 | g | 22 | ||
| Neuron specific protein family member 2 [Q9Y328/ENST00000303177] | rs4457100 | 3' | a | 1030 | g | 8 | ||
| Sorting nexin 4 [O95219/ENST00000251775] | 3' | t | 2178 | a | ND | IRES * | ||
| Paired amphipathic helix protein Sin3b [O75182/ENST00000248054] | rs1044880 | 3' | c | 4888 | t | ND | 15-LOX-DICE | |
| FK506-binding protein 1A [P20071/ENST00000262925] | 3' | a | 518 | g | ND | |||
| Epsin 4 [Q14677/ENST00000296951] | rs254682 | 3' | t | 3136 | c | ND | ||
| 40S ribosomal protein S5 [P46782/ENST00000196551] | 5' | c | 27 | t | ND | |||
| Rho-related GTP-binding protein RhoH [Q15669/ENST00000303700] | rs2245466 | 5' | g | 175 | c | ND | ||
| Death-associated protein. [Q9BUC9/ENST00000230895] | rs267927 | 5' | t | 163 | c | 1,69E-03 | ND | |
| Inhibitor of apoptosis protein 1 [Q13489/ENST00000263464] | 5' | t | 539 | g | 4,88E-03 | ND | ||
| Kruppel-like factor 6 [Q99612/ENST00000173785] | 3' | c | 1206 | t | 9,50E-03 | ND | ||
The Table shows a selection of 50 UTR-SNPs (out of 358) with significantly different allele frequency in normal versus tumoral tissues (exact Fisher's test; p < 0.01). UTR-SNPs are ranked by decreasing p-value. Swissprot protein accession references and Ensembl transcript accession references are indicated between brackets. Candidate positive after the multiple testing corrections are set in italics (Bonferroni), in bold (Benjamini and Hochberg) or underlined (candidate positive after a resampling procedure). Predictive effect of the polymorphisms on RNA secondary structure and putative UTR functional elements is indicated. Asterisk means that the reference allele sequence is modified by the cancer-associated UTR-SNP. For full data access, see Additional file 1. Accession numbers and SNP rs numbers are indicated in column 1 (description) and column 2 (SNP ID), respectively.
Results of SPARC genotyping analysis in AML samples.
| AML patients | Healthy donors | OR (95% CI) | ||
| SNP 1474 | GG | 13 | 20 | |
| GA | 0 | 0 | ||
| AA | 0 | 0 | ||
| Allelic Frequency A (%) | ||||
| SNP 1551 | GG | 2 | 7 | |
| GC | 8 | 7 | ||
| CC | 3 | 6 | ||
| Allelic Frequency C (%) | 53 | 47 | 2.9 (0.5–17.3) | |
| SNP 1922 | TT | 13 | 5 | |
| TG | 2 | 1 | ||
| GG | 0 | 0 | ||
| Allelic Frequency G (%) | 13 | 0.8 (0.05–10.5) | ||
| SNP 2072 | CC | 4 | 10 | |
| CT | 7 | 3 | ||
| TT | 4 | 4 | ||
| Allelic Frequency T (%) | 50 | 32 | 3.9 (0.9–17.5) | |
| SNP 2168 | GG | 3 | 11 | |
| GA | 6 | 1 | ||
| AA | 4 | 5 | ||
| Allelic Frequency A (%) | 54 | 32 | 6.1 (1.2–31.2) |
AML: acute myeloblastic leukaemia; OR: odds ratio; CI: confidence interval.
Prediction of UTR-SNPs affecting mRNA folding structures.
| number of sequences | |||||||||
| distance | dist>10 | dist>20 | dist>30 | dist>40 | dist>50 | dist>60 | dist>70 | dist>80 | average |
| Control (min-max) | 155 (148–162) | 126 (119–137) | 104 (97–115) | 81 (74–99) | 60 (49–82) | 38 (30–61) | 23 (16–32) | 8 (3–12) | 20.30 (19.27–23.39) |
| cancer (n = 358) | 179 | 136 | 115 | 94 | 77 | 52 | 20 | 4 | 22.85 |
| cancer (permutation positive pool, n = 92) | 49 | 39 | 32 | 27 | 22 | 17 | 7 | 2 | 25.24* |
'Control': average of experiments with UTR-SNPs not associated with cancer phenotype (n = 358, 10 independent control trials). Results were statistically analyzed using the two sample t-test ('*', p < 0.05).
Putative UTR regulatory elements affected by cancer-associated UTR-SNPs.
| UTR-SNPs Ref (n = 20,304) | UTR-SNPs Var (n = 20,304) | CANCER Ref (n = 358) | CANCER Var (n = 358) | CANCER Ref (permutation pool, n = 92) | CANCER Var (permutation pool, n = 92) | |
| 5766 | 5802 | 92 | 94 | 20 | 20 | |
| Gained | 625 | 11 | 3 | |||
| Lost | 589 | 9 | 3 | |||
| 1788 | 1744 | 37 | 36 | 8 | 8 | |
| Gained | 90 | 2 | ||||
| Lost | 134 | 3 | ||||
| 491 | 490 | 11 | 10 | 5 | 5 | |
| Gained | 1 | |||||
| Lost | 2 | 1 | ||||
| 287 | 287 | 3 | 3 | 0 | 0 | |
| Gained | 16 | |||||
| Lost | 16 | |||||
| 174 | 168 | 5 | 5 | 0 | 0 | |
| Gained | 9 | |||||
| Lost | 15 | |||||
| 111 | 116 | 2 | 1 | 1 | 0 | |
| Gained | 13 | |||||
| Lost | 8 | 1 | 1 | |||
| 44 | 50 | 1 | 1 | 1 | 1 | |
| Gained | 6 | |||||
| Lost | 0 | |||||
| 26 | 30 | 1 | 1 | 0 | 0 | |
| Gained | 12 | |||||
| Lost | 1 | |||||
| 12 | 10 | 1 | 0 | 1 | 0 | |
| Gained | 1 | |||||
| Lost | 3 | 1 | 1 |
The results identify cis-regulatory elements located in the immediate vicinity of or at the UTR-SNP sites. UTR regulatory elements can be 'gained' or 'lost' when reference allele sequences are modified by cancer-associated SNPs.