| Literature DB >> 29563569 |
Masoud Shirali1, Sara A Knott2, Ricardo Pong-Wong3, Pau Navarro1, Chris S Haley4,5.
Abstract
We propose a novel approach to analyze genomic data that incorporates haplotype information for detecting rare variants within a regional heritability mapping framework. The performance of our approach was tested in a simulation study based on human genotypes. The phenotypes were simulated by generating regional variance using either SNP(s) or haplotype(s). Regional genomic relationship matrices, constructed with either a SNP-based or a haplotype-based estimator, were employed to estimate the regional variance. The results from the study show that haplotype heritability mapping captures the regional effect, with its relative performance decreasing with increasing analysis window size. The SNP-based regional mapping approach often misses the effect of causal haplotype(s); however, it has a greater power to detect simulated SNP-based-variants. Heritability estimates suggest that the haplotype heritability mapping estimates the simulated regional heritability accurately for all phenotypes and analysis windows. However, the SNP-based analysis overestimates the regional heritability and performs less well than our haplotype-based approach for the simulated rare haplotype-based-variant. We conclude that haplotype heritability mapping is a useful tool to capture the effect of rare variants, and explain a proportion of the missing heritability.Entities:
Mesh:
Year: 2018 PMID: 29563569 PMCID: PMC5862984 DOI: 10.1038/s41598-018-23307-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Average LRT (A,C,E,G) and RH (B,D,F,H) for the 20 regions analyzed ordered by window size measured as number of SNPs for the 1SNP, AllSNP, 1Hap and AllHap simulations using a threshold of 5 cM/Mb to define block boundaries for both the HHM and the RHM methods.
Figure 2The relation between the frequency of the causal haplotype for the 1Hap simulation and estimated LRT (A and C) and RH (B and D) over all the regions analyzed using a threshold of 5 cM/Mb to define block boundaries.
Figure 3Plot of estimated RH against LRT for the four simulated scenarios (1SNP, AllSNP, 1Hap and AllHap) by using the HHM (A) and the RHM (B) for all replicates of the simulated regions.
Window size in number of SNPs for the 5 cM/Mb and 10 cM/Mb recombination rate boundaries for each region used in the simulation study.
| Region | 5 cM/Mb | 10 cM/Mb | ||||||
|---|---|---|---|---|---|---|---|---|
| Chr | Start | End | Size | Chr | Start | End | Size | |
| A | 2 | 19,615 | 19,615 | 1 | 2 | 19,615 | 19,615 | 1 |
| B | 16 | 2,786 | 2,786 | 1 | 16 | 2,786 | 2,786 | 1 |
| C | 16 | 4,025 | 4,025 | 1 | 16 | 4,025 | 4,027 | 3 |
| D | 11 | 7,263 | 7,264 | 2 | 11 | 7,239 | 7,264 | 26 |
| E | 9 | 8,298 | 8,301 | 4 | 9 | 8,298 | 8,317 | 20 |
| F | 9 | 9,793 | 9,796 | 4 | 9 | 9,793 | 9,797 | 5 |
| G | 15 | 3,293 | 3,296 | 4 | 15 | 3,293 | 3,297 | 5 |
| H | 8 | 12,690 | 12,694 | 5 | 8 | 12,675 | 12,697 | 23 |
| I | 8 | 3,059 | 3,065 | 7 | 8 | 3,057 | 3,065 | 9 |
| J | 11 | 2,268 | 2,275 | 8 | 11 | 2,268 | 2,275 | 8 |
| K | 11 | 5,470 | 5,477 | 8 | 11 | 5,437 | 5,485 | 49 |
| L | 16 | 3,790 | 3,798 | 9 | 16 | 3,790 | 3,798 | 9 |
| M | 11 | 10,464 | 10,473 | 10 | 11 | 10,464 | 10,473 | 10 |
| N | 15 | 4,413 | 4,424 | 12 | 15 | 4,413 | 4,424 | 12 |
| O | 2 | 2,441 | 2,461 | 21 | 2 | 2,441 | 2,465 | 25 |
| P | 8 | 1,503 | 1,523 | 21 | 8 | 1,503 | 1,547 | 45 |
| Q | 18 | 4,908 | 49,28 | 21 | 18 | 4,908 | 4,929 | 22 |
| R | 8 | 14,229 | 14,252 | 24 | 8 | 14,221 | 14,255 | 35 |
| S | 18 | 4,132 | 4,160 | 29 | 18 | 4,119 | 4,160 | 42 |
| T | 16 | 4,979 | 5,050 | 72 | 16 | 4,969 | 5,073 | 105 |
Chr: Chromosome; Start: Start SNP number; End: End SNP Number; Size: Window Size. NSNP: Number of SNPs in the window.