| Literature DB >> 19127300 |
Ricardo Palacios1, Elodie Gazave, Joaquín Goñi, Gabriel Piedrafita, Olga Fernando, Arcadi Navarro, Pablo Villoslada.
Abstract
Allelic specific gene expression (ASGE) appears to be an important factor in human phenotypic variability and as a consequence, for the development of complex traits and diseases. In order to study ASGE across the human genome, we have performed a study in which genotyping was coupled with an analysis of ASGE by screening 11,500 SNPs using the Mapping 10 K Array to identify differential allelic expression. We found that from the 5,133 SNPs that were suitable for analysis (heterozygous in our sample and expressed in peripheral blood mononuclear cells), 2,934 (57%) SNPs had differential allelic expression. Such SNPs were equally distributed along human chromosomes and biological processes. We validated the presence or absence of ASGE in 18 out 20 SNPs (90%) randomly selected by real time PCR in 48 human subjects. In addition, we observed that SNPs close to -but not included in- segmental duplications had increased levels of ASGE. Finally, we found that transcripts of unknown function or non-coding RNAs, also display ASGE: from a total of 2,308 intronic SNPs, 1510 (65%) SNPs underwent differential allelic expression. In summary, ASGE is a widespread mechanism in the human genome whose regulation seems to be far more complex than expected.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19127300 PMCID: PMC2613524 DOI: 10.1371/journal.pone.0004150
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Differential allele expression ratios for imprinted genes.
| Imprinted gene | dbSNP RS ID | Alleles | Location | ASGE (p<0.01) | ASGE ratio | No. of individuals studied | Percent of individuals studied with ASGE |
|
| rs63934 | C+/T | 11p15.5 | Yes | 1,76±1,06 | 8 | 63% |
|
| rs721910 | A/C | 14q32.2 | No | 1,23 | 1 | — |
|
| rs721909 | A+/G | 14q32.2 | Yes | 1,38±0,41 | 2 | 50% |
|
| rs2374983 | A+/G | 7q21.3 | Yes | 1,49 | 1 | 100% |
|
| rs4128536 | A/G | 6p25.2 | No | 1,00 | 1 | — |
|
| rs4128535 | A/G | 6p25.2 | No | 1,05±0,07 | 2 | — |
|
| rs2174914 | C/G+ | 6q25.3 | Yes | 1,29±0,44 | 7 | 29% |
Results are presented as the mean±SD. Values are the ratios (Allele 1/Allele 2) between the two alleles. The values were inverted if less than one (Allele 2/Allele 1, when Allele 2 was preferentially expressed). The preferentially expressed allele is labelled with “+”.
Validation of differential allele expression ratios for 20 SNPs randomly selected in forty-eight new subjects by real-time quantitative PCR (rtPCR).
| dbSNP ID | Consequence type (Variation Feature) | Gene | Alleles | Location | ASGE by rtPCR (p<0.01) | ASGE ratio by rtPCR | Percent of individuals studied with ASGE by rtPCR | ASGE in 10 K array (p<0.01) | ASGE ratio in 10 K array | No of individuals studied in 10 K array | Percent of individuals studied with ASGE in 10 K array |
|
| Intergenic | — | A+/T | 12q23.2 | Yes | >8 | 100% | Yes | 1,75±1,00 | 9 | 44% |
|
| Intergenic | — | A/C | 4q21.1 | No | 2.86±3.69 | — | No | 1,14 | 1 | — |
|
| Intergenic | — | A/G+ | 10q11.21 | Yes | 1.39±0.18 | 75% | Yes | 1,49±0,38 | 10 | 40% |
|
| Intergenic | — | C+/T | 4q21.1 | Yes | 2.10±0.22 | 88% | Yes | 2,70±1,31 | 4 | 100% |
|
| Intronic | AP4M1 | C/T | 7q31.32 | Yes | >8 | 100% | No | 1,20±0,23 | 2 | — |
|
| Intergenic | — | C/T+ | 12q15 | Yes | 1.44±1.08 | 63% | Yes | 2,19±1,98 | 3 | 33% |
|
| Intergenic | — | C/T | 1p32.3 | Yes | 5.31±0.07 | 63% | Yes | 1,41±0,23 | 4 | 50% |
|
| Intronic | MMP2 | C/T | 16q12.2 | Yes | 6.75±7.34 | 88% | Yes | 2,78±1,06 | 4 | 100% |
|
| Intergenic | — | C/T | 2q33.2 | Yes | 1.21±0.50 | 38% | Yes | 1,40±0,31 | 3 | 67% |
|
| Intergenic | — | C/G | 7q21.11 | No | 1.81±0.85 | — | No | 1,16 | 1 | — |
|
| Intergenic | — | A/T | 10q21.3 | No | 1.06±0.10 | — | No | 1,07 | 1 | — |
|
| Intergenic | — | C/T | 12q15 | Yes | 1.80±0.78 | 88% | Yes | 1,71±0,97 | 7 | 43% |
|
| Intergenic | — | A+/G | 1p13.2 | Yes | 1.27±0.04 | 75% | Yes | 1,71±0,85 | 2 | 50% |
|
| Intronic | STYX | C/G | 14q22.1 | Yes | 1.92±3.12 | 100% | Yes | 1,27±0,19 | 6 | 17% |
|
| Intergenic | — | C/G | 12q15 | Yes | 3.32±1.95 | 100% | Yes | 1,96±1,10 | 6 | 67% |
|
| Intronic | ROBO1 | A/G+ | 3p12.3 | Yes | 1.64±0.15 | 75% | Yes | 1,28±0,43 | 8 | 25% |
|
| Intronic | FAM113B | A/C+ | 12q13.11 | Yes | 2.27±0.54 | 38% | Yes | 1,16±0,12 | 15 | 7% |
|
| Intergenic | — | A/G+ | 10p11.23 | No | 1.01±0.11 | — | Yes | 1,43±0,68 | 4 | 25% |
|
| Intronic | FSTL4 | C+/T | 5q31.1 | Yes | 4.35±0.09 | 100% | Yes | 3,51 | 1 | 100% |
|
| Intronic | HTR2A | A+/G | 13q14.2 | Yes | 5.00±0.15 | 88% | Yes | 1,56±0,76 | 2 | 50% |
Results are presented as the mean±SD. Values are the ratios (Allele 1/Allele 2) between the two alleles. The values were inverted if less than one (Allele 2/Allele 1, when Allele 2 was preferentially expressed). The preferentially expressed allele is labelled with “+”.
Distribution of differentially expressed alleles of SNPs across chromosomes (assembly March 2006; chr. Y not included).
| Chr | No. of SNPs available for analysis | Differentially expressed SNPs | % differentially expressed SNPs | No. Clusters |
| 1 | 405 | 250 | 62 | 4 |
| 2 | 459 | 271 | 59 | 3 |
| 3 | 397 | 231 | 58 | 9 |
| 4 | 339 | 168 | 50 | 4 |
| 5 | 343 | 198 | 58 | 12 |
| 6 | 362 | 229 | 63 | 5 |
| 7 | 273 | 154 | 56 | 3 |
| 8 | 257 | 141 | 55 | 3 |
| 9 | 239 | 138 | 58 | 36 |
| 10 | 284 | 165 | 58 | 5 |
| 11 | 271 | 149 | 55 | 5 |
| 12 | 253 | 138 | 55 | 6 |
| 13 | 196 | 90 | 46 | 2 |
| 14 | 196 | 111 | 57 | 1 |
| 15 | 149 | 85 | 57 | 4 |
| 16 | 117 | 59 | 50 | 1 |
| 17 | 89 | 58 | 65 | 1 |
| 18 | 126 | 63 | 50 | 1 |
| 19 | 52 | 34 | 65 | 15 |
| 20 | 112 | 80 | 71 | 5 |
| 21 | 86 | 47 | 55 | 3 |
| 22 | 38 | 23 | 61 | 4 |
| X | 42 | 24 | 57 | 1 |
| No mapped | 48 | 28 | 58 | – |
See Figure 1.
The total No. of SNPs mapped was 5,085 of the 5,133 found in the screening due to the fact that there is no annotation for 48 studied SNPs.
Figure 1Chromosome mapping of heterozygous SNPs expressed in PBMCs.
The position of each SNP on the chromosome is based on the annotation in dbSNP (version 126, May 2006). Differentially expressed SNP alleles are coloured in black. The vertical bar above the horizontal line means the SNP is on the forward strand, the one below means that it is on the reverse strand. SNP stretches with a p-value<0.00005 are highlighted in blue boxes.
Mapping of SNPs.
| Consequence type (Variation Feature) | No. of SNPs available for analysis | Differentially expressed SNPs | % differentially expressed SNPs |
|
| 57 | 31 | 54 |
|
| 43 | 32 | 74 |
|
| 5 | 2 | 40 |
|
| 119 | 76 | 64 |
|
| 2.455 | 1.190 | 48 |
|
| 2.311 | 1.511 | 65 |
|
| 14 | 12 | 86 |
|
| 26 | 20 | 77 |
|
| 103 | 60 | 58 |
|
| 5.133 | 2.934 | 57 |
Despite we study poliadenilated RNA, intronic and intergenic SNPs also present ASGE. Thus, ncRNA display ASGE similar to that of known genes revealing even more complexity in the system that regulates transcription.
Details of the chi-square tables and P-values for the dbSD database.
| dbSD | Distance of a SNP to the closest SD | |||||||||||
| Inside | 10 Kb | 100 Kb | 1 Mb | |||||||||
| Out | In | Out | In | Out | In | Out | In | |||||
|
| 2160 | 39 | 2199 | 2103 | 57 | 2160 | 1667 | 436 | 2103 | 244 | 1423 | 1667 |
|
| 2867 | 67 | 2934 | 2751 | 116 | 2867 | 2089 | 662 | 2751 | 243 | 1846 | 2089 |
| 5027 | 106 | 5133 | 4854 | 173 | 5027 | 3756 | 1098 | 4854 | 487 | 3269 | 3756 | |
|
| 1.62 | 7.34 | 7.56 | 1.20 | ||||||||
|
| 0.204 | 0.007 | 0.006 | 0.272 | ||||||||
The distances represent the different windows of distance we considered around each SNPs to test the effect of the proximity of a SD on ASGE. “In” means inside the windows of size considered.
Details of the permutation tests for the mean absolute values of allelic gene expression ratios depending on their position relatively to segmental duplications.
| Windows | Category | n | mean | P |
|
| Inside SD | 106 | 2.1014 | |
| Outside | 5027 | 1.5438 | <0.001 | |
|
| Inside 10 Kb | 173 | 1.6803 | |
| Outside | 4854 | 1.5389 | 0.051 | |
|
| Inside 100 Kb | 1098 | 1.5724 | |
| Outside | 3756 | 1.5292 | 0.183 | |
|
| Inside 1 Mb | 3269 | 1.5308 | |
| Outside | 487 | 1.5183 | 0.776 |
Windows of sizes are the same as previously described.
Details of the chi-square tables and P-values for the dbCNV database.
| dbCNV | Distance of a SNP to the closest CNV | |||||||||||
| Inside | 10 Kb | 100 Kb | 1 Mb | |||||||||
| Out | In | Out | In | Out | In | Out | In | |||||
|
| 1799 | 400 | 2199 | 1750 | 49 | 1799 | 1351 | 399 | 1750 | 182 | 1169 | 1351 |
|
| 2410 | 524 | 2934 | 2355 | 55 | 2410 | 1896 | 459 | 2355 | 241 | 1655 | 1896 |
| 4209 | 924 | 5133 | 4105 | 104 | 4209 | 3247 | 858 | 4105 | 423 | 2824 | 3247 | |
|
| 4.60 | 0.833 | 6.65 | 0.403 | ||||||||
|
| 0.032 | 0.361 | 0.010 | 0.526 | ||||||||
The distances represent the different windows of distance we considered around each SNPs to test the effect of the proximity of a CNV on ASGE. “In” means inside the windows of size considered.
Details of the permutation tests for the mean absolute values of allelic gene expression ratios depending on their position relatively to copy number variants from databases.
| Windows | Category | n | mean | P |
| Inside | Inside SD | 924 | 1.5934 | |
| Outside | 4209 | 1.5469 | 0.213 | |
| 10 Kb | Inside 10 Kb | 104 | 1.5162 | |
| Outside | 4105 | 1.5477 | 0.771 | |
| 100 Kb | Inside 100 Kb | 858 | 1.5724 | |
| Outside | 3247 | 1.5292 | 0.080 | |
| 1 Mb | Inside 1 Mb | 2824 | 1.5726 | |
| Outside | 423 | 1.4928 | 0.138 |
Windows of sizes are the same as previously described.
Details of the chi-square tables and P-values for the indCNV database.
| dbSD | Distance of a SNP to the closest indCNV | |||||||||||
| Inside | 10 Kb | 100 Kb | 1 Mb | |||||||||
| Out | In | Out | In | Out | In | Out | In | |||||
|
| 9549 | 3 | 9552 | 9545 | 4 | 9549 | 9535 | 10 | 9545 | 9566 | 69 | 9635 |
|
| 7522 | 4 | 7526 | 7518 | 4 | 7522 | 7503 | 15 | 7518 | 7370 | 52 | 7422 |
| 17071 | 7 | 17078 | 17063 | 8 | 17071 | 17038 | 25 | 17063 | 16936 | 121 | 17057 | |
|
| NA | NA | 2.58 | 0.014 | ||||||||
|
| NA | NA | 0.11 | 0.905 | ||||||||
The distances represent the different windows of distance we considered around each SNPs to test the effect of the proximity of a CNV on ASGE. “In” means inside the windows of size considered.