| Literature DB >> 27980619 |
Juan Manuel Peralta1, Marcio Almeida1, Lawrence J Abraham2, Eric Moses2, John Blangero3.
Abstract
We present a novel approach to detect potential cis-acting regulatory loci that combines the functional potential, an empirical DNase-seq based estimate of the allele-specificity of DNase-I hypersensitivity sites, with kernel-based variance component association analyses against expression phenotypes. To test our method we used public ENCODE whole genome DNase-I sequencing data, from a single sample, to estimate the functional potentials of the subset of 10,552 noncoding heterozygous single-nucleotide polymorphisms (SNPs) that were also present in the Genetic Analysis Workshop 19 (GAW19) family-based data set. We then built two covariance kernels, one nonweighted and one weighted by the functional potentials, and conducted kernel-based variance component association analyses against the 20,527 transcript expression phenotypes in the GAW19 family-based data set. We found signals of potential cis-regulatory effects, that surpassed the Bonferroni significance threshold, for ten transcripts. Stepwise removal of the cis-located SNPs from the weighted kernel lead to the disappearance of the association signal from our top transcript hit. We found compelling evidence of allele-specific cis-regulation for four transcripts using both kernels, and our results agree with previous research that suggests the involvement of specific cis-located variants in the regulation of their neighboring gene.Entities:
Year: 2016 PMID: 27980619 PMCID: PMC5133489 DOI: 10.1186/s12919-016-0013-1
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Fig. 1Quantile–quantile (Q-Q) plot of the p values obtained under a null hypothesis test. Analysis of 10,000 simulated phenotypes not associated with any of the GAW19 SNP loci. The obtained the p values follow the expected uniform distribution
Fig. 2Frequency distribution of the weights used for variants in the weighted covariance kernel. Each weight represents the relative proportion of the functional potential − loglikelihood estimation of each variant in the kernel. The large proportion of variants in the first bin have a very small weight, and correspond to variants with a low confidence of having an allele-specific chromatin accessibility effect
Transcripts for whom their variation in expression levels can be explained by a covariance kernel composed by SNP with FP estimates, at genome-wide significance
| Transcript | Covariance kernel | |||||||
|---|---|---|---|---|---|---|---|---|
| Non-weighted | Weighted | |||||||
| h2r | h2r_p | geff | geff_p | h2r | h2r_p | Geff | geff_p | |
| GI_42544126-I | 0.0000 | 0.5000 | 0.7074 | 4.03E-15 | 0.0000 | 0.5000 | 0.7145 | 4.55E-18 |
| GI_23097237-S | 0.0000 | 0.5000 | 0.7848 | 1.15E-14 | 0.0000 | 0.5000 | 0.7493 | 7.46E-12 |
| GI_10863968-S | 0.0000 | 0.5000 | 0.6109 | 5.68E-10 | 0.0000 | 0.5000 | 0.6122 | 9.05E-11 |
| Hs.283934-S | 0.0746 | 0.3138 | 0.8382 | 9.77E-10 | 0.1497 | 0.1443 | 0.7657 | 4.87E-09 |
| GI_12056480-A | 0.2357 | 0.0457 | 0.7069 | 1.69E-08 | 0.2792 | 0.0194 | 0.6628 | 5.43E-08 |
| GI_20986517-S | 0.0000 | 0.5000 | 0.7671 | 5.58E-08 | 0.0000 | 0.5000 | 0.7726 | 3.08E-08 |
| Hs.58104-S | 0.2230 | 0.0753 | 0.6886 | 6.89E-07 | 0.2705 | 0.0333 | 0.6415 | 8.47E-07 |
| GI_41393558-I | 0.0000 | 0.5000 | 0.5331 | 1.92E-06 | 0.0000 | 0.5000 | 0.5371 | 1.73E-06 |
| GI_4506738-S | NA | 0.0000 | 0.5000 | 0.4758 | 6.66E-07 | |||
| GI_15451941-S | NA | 0.2611 | 0.0441 | 0.6090 | 1.33E-06 | |||
geff, Gene-specific effect estimate (h 2 geff)
geff_p, significance of the gene-specific effect estimate
h2r, trait heritability estimate (h 2)
h2r_p, significance of the trait heritability estimate
Annotated transcript and SNP table
| Transcript | Gene | Chromosome | Start | Length | SNP | DBSnp rs | SNP annotation |
|---|---|---|---|---|---|---|---|
| GI_42544126-I | SF1 | chr11 | 64532075 | 14241 | 11_64511322 | rs2073798 | RASGRP2 intron |
| 11_64519345 | rs686171 | PYGM intron | |||||
| 11_64546106 | rs3741398 | SF1 2 kb upstream, nc transcript variant, 5’ UTR | |||||
| 11_64546257 | rs1633462 | SF1 2 kb upstream, nc transcript variant, 5’ UTR | |||||
| 11_64573589 | rs669976 | MEN1 intron | |||||
| 11_64576598 | rs67808744 | MEN1 intron | |||||
| 11_64577620 | rs7949944 | MEN1 5’ UTR, 2 kb upstream | |||||
| GI_23097237-S | CHST13 | chr3 | 126243130 | 19004 | 3_126218788 | rs6774768 | UROC1 intron |
| 3_126228953 | rs1873388 | UROC1 intron | |||||
| 3_126242964 | rs1388096 | CHST13 2 kb upstream | |||||
| 3_126245956 | rs4592980 | CHST13 intron/3’UTR | |||||
| 3_126246370 | rs1994642 | CHST13 intron/3’UTR | |||||
| 3_126247795 | rs11717719 | CHST13 intron | |||||
| 3_126247848 | rs11718493 | CHST13 intron | |||||
| GI_10863968-S | POLD4 | chr11 | 67119018 | 2034 | 11_67196237 | rs1476792 | |
| Hs.283934-S | TSPAN16 | chr19 | 11406815 | 30857 | 19_11340057 | rs17001244 | |
| 19_11358700 | rs4804579 | ||||||
| 19_11374675 | rs416231 | ||||||
| 19_11380295 | rs4804159 | ||||||
| 19_11406952 | rs374409 | ||||||
| GI_12056480-A | UTS2 | chr1 | 7907271 | 6280 | 1_7710810 | rs58905635 | CAMTA1 intron |
| 1_7725855 | rs4908471 | CAMTA1 intron | |||||
| 1_7749807 | rs3124797 | CAMTA1 intron | |||||
| GI_20986517-S | MAPK8IP1 | chr11 | 45907046 | 20970 | 11_45838926 | rs11038668 | |
| 11_45840939 | rs7112505 | ||||||
| 11_45891418 | rs7123390 | CRY2 intron | |||||
| Hs.58104-S | FAM101B | chr17 | 289771 | 8960 | 17_185027 | rs12951437 | |
| 17_198698 | rs11869174 | ||||||
| 17_206962 | rs11657163 | ||||||
| GI_41393558-I | KIF1B | chr1 | 10270763 | 97892 | 1_10270386 | rs3828081 | KIF1B 2 kb upstream |
| 1_10307453 | rs4240911 | KIF1B intron | |||||
| 1_10438687 | rs1536262 | KIF1B 3’UTR | |||||
| GI_4506738-S | RPS6KB2 | chr11 | 67195934 | 6945 | 11_67196237 | rs1476792 | RPS6KB2 intron |
| 11_67204342 | rs12787021 | PTPRCAP intron | |||||
| 11_67213956 | rs2109123 | ||||||
| 11_67253564 | rs7110021 | ||||||
| 11_67258805 | rs751567 | ||||||
| 11_67264679 | rs2276120 | ||||||
| GI_15451941-S | UBA52 | chr19 | 18682613 | 5657 | 19_18499151 | rs1059022 | |
| 19_18499238 | rs1804826 | ||||||
| 19_18715154 | rs72995445 | CRLF1 intron | |||||
| 19_18859680 | rs11085244 |
Gene symbols and coordinates for the ten transcripts that were detected as being potentially cis-regulated by SNPs in our covariance kernel. The closest SNPs to each gene are listed
Decrease in the association signal when cis-located SNPs are removed from the kernel
| Transcript | Gene | SNPs removed from the kernel | Covariance kernel | ||
|---|---|---|---|---|---|
| Weighted | |||||
| h2r | geff | geff_p | |||
| none | 0.0000 | 0.7145 | 4.55E-18 | ||
| GI_42544126-I | SF1 | 2 in SF1 | 0.0000 | 0.6809 | 1.32E-12 |
| all in transcript region | 0.1349 | 0.1349 | 2.00E-05 | ||