| Literature DB >> 27980662 |
Elizabeth M Blue1, Lisa A Brown2, Matthew P Conomos2, Jennifer L Kirk2, Alejandro Q Nato1, Alice B Popejoy3, Jesse Raffa4, John Ranola4, Ellen M Wijsman5, Timothy Thornton2.
Abstract
BACKGROUND: Estimating relationships among subjects in a sample, within family structures or caused by population substructure, is complicated in admixed populations. Inaccurate allele frequencies can bias both kinship estimates and tests for association between subjects and a phenotype. We analyzed the simulated and real family data from Genetic Analysis Workshop 19, and were aware of the simulation model.Entities:
Year: 2016 PMID: 27980662 PMCID: PMC5133521 DOI: 10.1186/s12919-016-0056-3
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Fig. 1Box-plot comparison of pairwise kinship estimates from WGS vs. exome data. Blue, PLINK method of moments; cyan, PC-Relate; magenta, KING-robust; red, maximum likelihood; Un, unrelated; WGS, whole genome sequence
Rate of successfully classified relationships. Frequency pairs within each relationship are correctly assigned to this degree of relationship
| Degree of relationship | ||||||
|---|---|---|---|---|---|---|
| Design | Estimator | 1st | 2nd | 3rd | 4th | Unrelated |
| Agnostic WGS | MLE | 99.7 % | 91.0 % | 76.1 % | 60.2 % | 99.7 % |
| MoM | 100 % | 93.4 % | 54.4 % | 16.9 % | 14.5 % | |
| KING | 98.9 % | 65.2 % | 30.5 % | 14.9 % | 100 % | |
| PC-Relate | 99.7 % | 94.1 % | 81.9 % | 57.9 % | 93.9 % | |
| Selective WGS | MLE | 99.7 % | 92.7 % | 76.3 % | 61.3 % | 99.9 % |
| MoM | 100 % | 97.4 % | 78.4 % | 38.1 % | 58.0 % | |
| KING | 100 % | 96.7 % | 80.9 % | 47.3 % | 86.3 % | |
| PC-Relate | 99.5 % | 91.8 % | 78.4 % | 62.6 % | 100 % | |
| Selective exome | MLE | 99.7 % | 93.2 % | 80.2 % | 62.6 % | 99.2 % |
| MoM | 99.7 % | 96.5 % | 71.6 % | 33.0 % | 43.4 % | |
| KING | 99.5 % | 80.0 % | 52.4 % | 30.5 % | 99.5 % | |
| PC-Relate | 99.7 % | 96.5 % | 79.8 % | 52.4 % | 97.1 % | |
| Homogenizing WGS | MLE | 99.5 % | 91.0 % | 74.2 % | 57.3 % | 100 % |
| MoM | 99.5 % | 96.3 % | 79.6 % | 44.9 % | 61.7 % | |
| KING | 99.5 % | 88.1 % | 73.3 % | 59.5 % | 99.9 % | |
| PC-Relate | 99.7 % | 92.7 % | 79.6 % | 62.0 % | 100 % | |
Fig. 2Heritability estimate from longitudinal analysis as a function of age. Mean baseline age was 39.0 years (SD = 16.3) with a median follow-up of 10.4 years (Q1-Q3: 1.8–14.4). Number of SBP observations in each age (in years) strata: <30: 483, [30,40): 415,[40,50): 438, [50,60): 326, ≥60: 324. The upper/lower dotted lines represent 95 % confidence bands
Fig. 3EMMAX association test results for real first (a) and third (b) visit adjusted SBP, and third visit excluding an outlier (c). Bonferroni-corrected significance threshold is shown with a horizontal line
Fig. 4Admixture mapping (a) and association testing (b) for adjusted simulated SBP using local ancestry. Significance thresholds are shown with a red horizontal line, based on theory [16] (a) or Bonferroni correction (b)
Principal components analysis with rare and common WGS variants
| PCA-Seq | EIGENSTRAT | |||||||
|---|---|---|---|---|---|---|---|---|
| PCs | MAF | CEU | AMR | YRI | CEU | AMR | YRI | |
| R2 for rare variants | 10 | <0.01 | 0.18 | 0.16 | 0.33 | 0.17 | 0.15 | 0.33 |
| <0.05 | 0.79 | 0.80 | 0.55 | 0.78 | 0.79 | 0.54 | ||
| 20 | <0.01 | 0.42 | 0.42 | 0.43 | 0.42 | 0.42 | 0.42 | |
| <0.05 | 0.83 | 0.84 | 0.66 | 0.82 | 0.83 | 0.65 | ||
| R2 for common variants | 10 | <0.01 | 0.95 | 0.95 | 0.13 | 0.95 | 0.95 | 0.17 |
| <0.05 | 0.95 | 0.95 | 0.13 | 0.95 | 0.95 | 0.11 | ||
| 20 | <0.01 | 0.97 | 0.97 | 0.22 | 0.97 | 0.97 | 0.41 | |
| <0.05 | 0.97 | 0.97 | 0.22 | 0.97 | 0.97 | 0.23 | ||
| R2 for all variants | 10 | — | 0.95 | 0.96 | 0.18 | 0.96 | 0.96 | 0.22 |
| 20 | — | 0.97 | 0.97 | 0.41 | 0.97 | 0.97 | 0.49 | |
MAF minor allele frequency, R2, the coefficient of determination from a linear regression model with 10 PCs included as predictors and the proportion of CEU, AMR, and YRI ancestry proportions from ADMIXTURE analysis as the response