| Literature DB >> 25519382 |
Brunilda Balliu1, Hae-Won Uh2, Roula Tsonaka1, Stefan Boehringer1, Quinta Helmer2, Jeanine J Houwing-Duistermaat1.
Abstract
In this analysis, we investigate the contributions that linkage-based methods, such as identical-by-descent mapping, can make to association mapping to identify rare variants in next-generation sequencing data. First, we identify regions in which cases share more segments identical-by-descent around a putative causal variant than do controls. Second, we use a two-stage mixed-effect model approach to summarize the single-nucleotide polymorphism data within each region and include them as covariates in the model for the phenotype. We assess the impact of linkage disequilibrium in determining identical-by-descent states between individuals by using markers with and without linkage disequilibrium for the first part and the impact of imputation in testing for association by using imputed genome-wide association studies or raw sequence markers for the second part. We apply the method to next-generation sequencing longitudinal family data from Genetic Association Workshop 18 and identify a significant region at chromosome 3: 40249244-41025167 (p-value = 2.3 × 10(-3)).Entities:
Year: 2014 PMID: 25519382 PMCID: PMC4143620 DOI: 10.1186/1753-6561-8-S1-S34
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Description of genotypic data sets used in each part of the analysis
| Analysis | IBD mapping: regions with excess of IBD sharing | Association mapping: Two-stage approach | ||
|---|---|---|---|---|
| Type of data | GWAS | Imputed (dosage) WGS based on existing GWAS framework | Whole genome sequence | |
| No. markers | ~ 50,000 | 784 | ~1.2 million | ~1.7 million |
| No. individuals | 939 | 939 | 464 | |
Description of IBD between case-case and case-control pairs
| AllMark | CaCa | 0.295 | 0.499 | 0.206 | 58.27 | 144.48 | 25.98 |
| CaCo | 0.292 | 0.503 | 0.205 | 58.01 | 145.58 | 25.91 | |
| NoLD | CaCa | 0.006 | 0.950 | 0.044 | 44.81 | 316.00 | 21.27 |
| CaCo | 0.004 | 0.951 | 0.045 | 39.52 | 315.09 | 21.59 | |
Descriptions of regions
| 27279401-27292557 | 77 | 38 | 100 | 61 | 29239664-29531222 | 2153 | 919 | 2984 | 1659 |
| 52618319-52637439 | 105 | 46 | 168 | 111 | 34834899-35282759 | 2730 | 1284 | 4267 | 2715 |
| 52759860-52771468 | 77 | 44 | 117 | 82 | 35718847-36018767 | 1618 | 927 | 2446 | 1755 |
| 52830547-52866115 | 291 | 156 | 379 | 244 | 36815704-37526013 | 3738 | 2151 | 5669 | 4038 |
| 86269515-86282586 | 60 | 24 | 96 | 58 | 40249244-41025167 | 4247 | 2530 | 6168 | 4214 |
| 99537305-99580268 | 211 | 120 | 322 | 260 | 167635899-168125439 | 2665 | 1349 | 3926 | 2552 |
| 99621002-99676384 | 270 | 144 | 386 | 299 | 168621773-168859006 | 1508 | 708 | 2018 | 1207 |
| 99927237-100004117 | 396 | 185 | 575 | 427 | |||||
N, number of SNPs per region; n, number of rare variants (MAF <5%) per region
P-value for testing region effects using the NoLD data set
| 0.03 | 0.25 | 0.04 | 0.02 | 0.04 | 0.76 | 0.12 | 0.04 |
| 0.93 | 0.91 | 0.99 | 0.92 | 0.81 | 0.27 | 0.54 | 0.93 |
| 0.99 | 0.11 | 0.27 | 0.77 | 0.35 | 0.41 | 0.50 | 0.51 |
| 0.18 | 0.24 | 0.25 | 0.20 | 0.32 | 0.13 | 0.15 | 0.23 |
| 1.3 × 10−3 | 0.05 | 2.3 × 10−3 | 3.6 × 10−3 | 9.3 × 10−3 | 0.33 | 0.01 | 2.1 × 10−3 |
| 0.29 | 1.00 | 0.55 | 0.22 | 0.27 | 0.75 | 0.54 | 0.28 |
| 0.09 | 0.66 | 0.22 | 0.09 | 0.25 | 0.26 | 0.31 | 0.33 |
Two different models are fitted: one with and one without including the number of rare variants as covariates. The regions are in the same order as in Table 3.
aBased on fitting .
b Based on fitting .