| Literature DB >> 27591082 |
Jie Zheng1,2, Santiago Rodriguez1,2, Charles Laurin1,2, Denis Baird1, Lea Trela-Larsen2, Mesut A Erzurumluoglu2,3, Yi Zheng4, Jon White5, Claudia Giambartolomei5, Delilah Zabaneh5, Richard Morris2, Meena Kumari5, Juan P Casas5,6, Aroon D Hingorani5,7, David M Evans1,8, Tom R Gaunt1,2, Ian N M Day2.
Abstract
MOTIVATION: Fine mapping is a widely used approach for identifying the causal variant(s) at disease-associated loci. Standard methods (e.g. multiple regression) require individual level genotypes. Recent fine mapping methods using summary-level data require the pairwise correlation coefficients ([Formula: see text]) of the variants. However, haplotypes rather than pairwise [Formula: see text], are the true biological representation of linkage disequilibrium (LD) among multiple loci. In this article, we present an empirical iterative method, HAPlotype Regional Association analysis Program (HAPRAP), that enables fine mapping using summary statistics and haplotype information from an individual-level reference panel.Entities:
Mesh:
Year: 2016 PMID: 27591082 PMCID: PMC5544112 DOI: 10.1093/bioinformatics/btw565
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1The SNP by haplotypes matrix for HAPRAP. The iteration of HAPRAP is built based on a matrix summarizing the haplotypes and haplotype frequencies for a certain population. ‘0’ in the matrix means the haplotype contains the baseline allele for the relevant SNP, whereas ‘1’ means the haplotype contains the effect allele for the relevant SNP. The small arrow (from left to right) is the marginal SNP effects estimation step. The large arrow (from right to left) is the haplotype effects adjustment step
Fig. 2Schematic diagram of HAPRAP
Notation of HAPRAP
| Par. | Description |
|---|---|
| The set of haplotypes containing the effect allele of SNP | |
| The set of haplotypes containing the baseline allele of SNP | |
| The set of haplotype frequencies containing the baseline allele of SNP | |
| The set of haplotype frequencies containing the baseline allele of SNP | |
| The SNP with the greatest deviation between the observed marginal SNP effect and the estimated marginal SNP effect in the g iteration | |
Column ‘Par.’ lists the parameters used in HAPRAP.
Fig. 3Performance comparison between HAPRAP and COJO in one of the three-SNPs Model MSE is MSE of HAPRAP (or that of GCTA) compare to joint effect from multiple regression mode. X-axis is the number of individuals in the reference panel on a log scale, which is equivalent to sample size of 10 000, 5000, 1000 or 500, respectively. In this simulation, SNP1 is a signal with a joint effect of 1, SNP2 is a bystander SNP with no effect, SNP3 is a secondary SNP with a joint effect of 0.3, between SNP1 and SNP2 was 0.8, between SNP1 and SNP3 was 0.5
Fig. 4Performance comparison of HAPRAP and COJO using parametric simulation of 20 SNPs from GIANT height meta-analysis MSE is the MSE of the method compare to multiple regression. X-axis is the number of individuals in the meta-analysis in Log scale. Horizontal line is the threshold line of MSE of 0.1
Summary of 20 associated SNPs at 3 loci for height with P < 5 × 10−8 in the HAPRAP step-wise model selection analysis using the ALSPAC cohort as a reference sample for LD
| SNP | COJO-GIANT | HAPRAP | ||
|---|---|---|---|---|
| Beta | Beta | |||
| rs1348002 | 0.020 | 1.5 × 10−10 | 0.018 | 2.8 × 10−09 |
| rs11633371 | 0.024 | 2.1 × 10−15 | 0.028 | 4.8 × 10−20 |
| rs16942341 | −0.114 | 3.0 × 10−29 | −0.122 | 3.4 × 10−34 |
| rs2280470 | 0.031 | 5.5 × 10−21 | 0.032 | 1.9 × 10−25 |
| rs3817428 | 0.022 | 2.6 × 10−09 | 0.019 | 1.2 × 10−08 |
| rs2238300 | −0.018 | 1.6 × 10−09 | −0.020 | 3.8 × 10−11 |
| rs2573625 | 0.030 | 3.7 × 10−22 | 0.025 | 2.4 × 10−15 |
| rs1529889 | Unselected | Unselected | 0.019 | 6.4 × 10−10 |
| rs4246302 | −0.027 | 1.4 × 10−16 | −0.028 | 1.4 × 10−17 |
| rs4548838 | 0.034 | 9.1 × 10−30 | 0.033 | 1.4 × 10−28 |
| rs7170986 | −0.019 | 1.1 × 10−08 | −0.018 | 4.5 × 10−08 |
| rs8042424 | −0.022 | 5.1 × 10−10 | −0.022 | 2.2 × 10−10 |
| rs1257763 | 0.071 | 9.4 × 10−14 | 0.078 | 2.2 × 10−12 |
| rs12347744 | −0.056 | 2.8 × 10−20 | −0.039 | 1.7 × 10−19 |
| rs357564 | Unselected | Unselected | −0.046 | 3.9 × 10−13 |
| rs4448343 | −0.035 | 1.1 × 10−28 | −0.035 | 2.0 × 10−17 |
| rs1329393 | 0.038 | 1.4 × 10−15 | 0.034 | 5.1 × 10−13 |
| rs817300 | −0.070 | 2.2 × 10−23 | −0.085 | 4.8 × 10−16 |
| rs10990303 | 0.032 | 1.4 × 10−19 | 0.036 | 5.4 × 10−18 |
| rs7870753 | −0.045 | 1.7 × 10−37 | −0.043 | 1.3 × 10−30 |
Beta and P-value under COJO-GIANT refer to the joint SNP effect and its P-value presented in the GIANT height paper. Beta and P-value under HAPRAP are the joint SNP effect and its P-value for HAPRAP. ‘Unselected’ means the SNP was not selected by COJO in the step-wise selection. The comparison details are presented in Table S5.