| Literature DB >> 23979941 |
Riyan Cheng1, Clarissa C Parker, Mark Abney, Abraham A Palmer.
Abstract
Genome-wide association studies of complex traits often are complicated by relatedness among individuals. Ignoring or inappropriately accounting for relatedness often results in inflated type I error rates. Either genotype or pedigree data can be used to estimate relatedness for use in mixed-models when undertaking quantitative trait locus mapping. We performed simulations to investigate methods for controlling type I error and optimizing power considering both full and partial pedigrees and, similarly, both sparse and dense marker coverage; we also examined real data sets. (1) When marker density was low, estimating relatedness by genotype data alone failed to control the type I error rate; (2) this was resolved by combining both genotype and pedigree data. (3) When sufficiently dense marker data were used to estimate relatedness, type I error was well controlled and power increased; however, (4) this was only true when the relatedness was estimated using genotype data that excluded genotypes on the chromosome currently being scanned for a quantitative trait locus.Entities:
Keywords: quantitative trait locus (QTL); relatedness; statistical power; type I error rate
Mesh:
Year: 2013 PMID: 23979941 PMCID: PMC3789811 DOI: 10.1534/g3.113.007948
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Names of marker sets used to estimate genotype-based relationship matrices
| Chromosomes Used in Marker Set | Marker Set Name When Polygenes and Markers Are | |
|---|---|---|
| Completely Linked | Incompletely Linked | |
| 1–5 | CL (complete linkage only) | IL (incomplete linkage only) |
| 6–10 | UL (unlinked only) | |
| 1–10 | CUL (Both CL and UL) | IUL (Both IL and UL) |
| 1–10 + (11–15 choose one) | CUL + 1 | — |
QTL scans were only performed on chromosomes 11–15 with the following marker sets used to estimate relatedness. QTL, quantitative trait locus.
Estimates of relatedness included chromosomes 1–10 and an additional chromosome selected from 11–15 such that the chromosome selected is the one being scanned, as described in the text.
Marker set power and error rates
| AIL | STR | |||
|---|---|---|---|---|
| Type I Error Rate | Power | Type I Error Rate | Power | |
| Naive | 0.538 | – | 0.802 | – |
| Last3 | 0.059 | 0.682 | 0.083 | – |
| Last6 | 0.051 | 0.674 | 0.060 | – |
| Last12 | 0.058 | 0.678 | 0.057 | 0.653 |
| AllPed | 0.049 | 0.676 | 0.053 | 0.664 |
| CL | 0.055 | 0.893 | 0.056 | 0.882 |
| IL | 0.099 | – | 0.102 | – |
| UL | 0.213 | – | 0.241 | – |
| CUL | 0.048 | 0.805 | 0.042 | 0.800 |
| IUL | 0.078 | – | 0.080 | – |
| CUL + 1 | 0.008 | 0.559 | 0.014 | 0.527 |
| IUL+Last3 | 0.052 | 0.741 | 0.048 | 0.716 |
| IUL+AllPed | 0.040 | 0.734 | 0.052 | 0.716 |
Type 1 error rate and power at significance level 0.05 under different marker sets and variance models. AIL, advanced intercross line; STR, structured population; AllPed, entire pedigree; CL, complete linkage; IL, incomplete linkage; UL, unlinked; CUL, completely linked and unlinked; IUL, incompletely linked and unlinked.
Indicate that the estimated type I error rate is significantly different from 0.05 at significance levels 0.05.
Indicate that the estimated type I error rate is significantly different from 0.05 at significance levels 0.01.
Power results are not shown when the type I error rate is inflated.
Figure 1Estimated type I error rate and statistical power at significance level 0.05 for varying densities of markers. Marker densities on chromosomes 1–10 were varied, and markers on chromosomes 11–15 were either varied (left) or held fixed at 2 cM spacing (right). VM1: relationship matrices estimated using the entire pedigree (AllPed); VM2: relationship matrices estimated using genotypes on chromosomes 1–10 (IL); VM3: relationship matrices estimated using both genotypes on chromosomes 1–10 (IL) and the last three generations of the pedigree (Last3). Red symbols indicate conditions with inflated type I error rate.
Figure 2Mapping results of the B6xD2 F8 methamphetamine sensitivity data (Parker ). The green dashed line indicates the threshold for genome-wide significance at the 0.05 level. Red dots are the mapping results that estimate relatedness by combining AllSNP-1 and pedigree information. Black dots are the mapping results that estimate relatedness using (A) AllSNP or (B) AllPed.