| Literature DB >> 17584784 |
Zhenbao Yu1, Zhen Li, Normand Jolicoeur, Linhua Zhang, Yves Fortin, Edwin Wang, Meiqun Wu, Shi-Hsiang Shen.
Abstract
MicroRNAs (miRNAs) are a class of noncoding small RNAs that regulate gene expression by base pairing with target mRNAs at the 3'-terminal untranslated regions (3'-UTRs), leading to mRNA cleavage or translational repression. Single-nucleotide polymorphisms (SNPs) located at miRNA-binding sites (miRNA-binding SNPs) are likely to affect the expression of the miRNA target and may contribute to the susceptibility of humans to common diseases. We herein performed a genome-wide analysis of SNPs located in the miRNA-binding sites of the 3'-UTR of various human genes. We found that miRNA-binding SNPs are negatively selected in respect to SNP distribution between the miRNA-binding 'seed' sequence and the entire 3'-UTR sequence. Furthermore, we comprehensively defined the expression of each miRNA-binding SNP in cancers versus normal tissues through mining EST databases. Interestingly, we found that some miRNA-binding SNPs exhibit significant different allele frequencies between the human cancer EST libraries and the dbSNP database. More importantly, using human cancer specimens against the dbSNP database for case-control association studies, we found that twelve miRNA-binding SNPs indeed display an aberrant allele frequency in human cancers. Hence, SNPs located in miRNA-binding sites affect miRNA target expression and function, and are potentially associated with cancers.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17584784 PMCID: PMC1935019 DOI: 10.1093/nar/gkm480
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Comparison of the abundance of SNPs located in the miRNA-binding sites versus all the 3′-UTRs in the human genome
| SNP number | Base number | SNP/kilobase | ||
|---|---|---|---|---|
| 3′UTR | 20 588 | 96 484 523 | 0.213 | 0.022 |
| miRNA target | 265 | 1 400 000 | 0.182 |
Note: P-value was calculated using Fisher's exact test.
Figure 1.Distribution of the minor allele frequencies of miRNA-binding SNPs. The number of each allele of a miRNA-binding SNP found in the dbSNP database was counted and the frequency of the minor allele (ratio of the number of minor allele to total number of the two alleles) was calculated. The SNPs were then grouped according to their minor allele frequencies.
Comparison of allele frequency of miRNA-binding SNPs in the dbEST database and in the dbSNP database
| Number of SNPs with a higher frequency of the non-target alleles in the dbEST database than in the dbSNP database | 59 |
| Number of SNPs with a higher frequency of the non-target alleles in the dbSNP database than in the dbEST database | 35 |
| Average ratio of non-target allele frequency to target allele frequency in the dbEST database | 0.127 |
| Average ratio of non-target allele frequency to target allele frequency in the dbSNP database | 0.086 |
Figure 2.Comparison of the allele frequency of miRNA-binding SNPs in the cancer EST libraries and in the dbSNP database. EST sequences containing 30 nt with 100% homology to one of the two SNP alleles were identified from the cancer EST libraries. For each individual SNP, the total number of ESTs with each allele was counted and compared with that found in the dbSNP database. A Fisher's exact test was performed to determine the significance of the different distribution of the two alleles in the cancer EST libraries versus the control population found in the dbSNP database. The samples above the red line have a P-value less than 0.01.
MiRNA-binding SNPs with an aberrant SNP allele frequency in human tumor tissues
| SNP | miRNA target | Count in dbSNP | Count in cancers | MAF (ctrl) | MAF (case) | Odds ratio [95% CI] | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Gene name | Gene ID | Major allele | Minor allele | Major allele | Minor allele | |||||
| rs1044129 | RYR3 | NM_001036 | 250 | 118 | 208 | 172 | 0.3207 | 0.4526 | 1.75 [1.30, 2.36] | 4.9 × 10−10 |
| rs4901706 | C14orf101 | NM_017799 | 644 | 4 | 346 | 18 | 0.0062 | 0.0495 | 8.37 [3.34, 21.00] | 1.1 × 10−5 |
| rs12583 | DAG1 | NM_00493 | 375 | 225 | 162 | 46 | 0.3750 | 0.2212 | 0.47 [0.33, 0.68] | 1.5 × 10−5 |
| rs1128665 | STK40 | NM_032017 | 353 | 15 | 356 | 0 | 0.0408 | 0 | 5.4 × 10−5 | |
| rs16917496 | SETD8 | NM_020382 | 287 | 81 | 248 | 136 | 0.2201 | 0.3542 | 1.94 [1.41, 2.68] | 5.5 × 10−5 |
| rs17703261 | AFF1 | NM_005935 | 62 | 26 | 336 | 48 | 0.2955 | 0.1250 | 0.34 [0.20, 0.58] | 2.5 × 10−4 |
| rs1053667 | KIAA0423 | NM_015091 | 681 | 19 | 174 | 16 | 0.0271 | 0.0842 | 3.29 [1.72, 6.32] | 1.0 × 10−3 |
| rs11337 | GOLGA7 | NM_00102296 | 690 | 14 | 310 | 18 | 0.0199 | 0.0549 | 2.86 [1.45, 5.66] | 3.6 × 10−3 |
| rs14109 | MATR3 | NM_108834 | 596 | 12 | 362 | 20 | 0.0197 | 0.0524 | 2.74 [1.36, 5.53] | 8.6 × 10−3 |
| rs3660 | KRT81 | NM_002281 | 385 | 281 | 175 | 177 | 0.4219 | 0.5028 | 1.39 [1.07, 1.80] | 0.014 |
| rs17107469 | SH2D4B | NM_207372 | 326 | 0 | 188 | 4 | 0 | 0.0208 | 0.018 | |
| rs10463 | USP9X | NM_001039590 | 478 | 26 | 171 | 19 | 0.0516 | 0.1000 | 2.04 [1.11, 3.74] | 0.025 |
*Fisher's exact test was used to calculate the P-values based on the minor allele frequencies found in the human tumor specimens against the total Caucasian subjects present in the dbSNP database. Minor allele frequencies (MAF) and odds ratios with 95% confidence intervals (95% CI) are shown for each SNP except rs1128665 and rs17197469, the counts of the minor alleles of which are zero.
aThe minor allele of rs1044129 is ‘A’ and the major allele is ‘G’ in the dbSNP database, but ‘G’ is the minor allele and ‘A’ is the major allele in the cancer samples. The minor allele frequency (MAF) shown in the table for this SNP is the frequency of ‘A’ in dbSNP and the frequency of ‘G’ in the cancer population. The frequency of ‘A’, which is 0.32 in the dbSNP database and 0.55 in the cancer population, was used to calculate odds ratio.
Figure 3.Representative sequencing results of two colon tumor samples validating the aberrant allele frequency of SNP rs16917496. (A) A colon tumor sample displays a normal homozygous T/T allele. (B) A colon tumor sample displays a heterozygous T/G allele. Arrows indicate the SNP position.
Contribution of EST sequence quality to the errors of the analysis
| Major alleles | Minor alleles | Non-SNP alleles | |
|---|---|---|---|
| Number of ESTs | 17016 | 1829 | 24 |
| Percentage of errors | 0.07% | 0.65% | |
| 0.13% | |||
*Errors were calculated as:
24/(17016 + 1829) × 100% = 0.13%; (24/2)/17016 × 100% = 0.07%; (24/2)/1829 × 100% = 0.65%.