| Literature DB >> 20017970 |
Anthony L Hinrichs1, Robert Culverhouse, Carol H Jin, Brian K Suarez.
Abstract
Although identification of cryptic population stratification is necessary for case/control association analyses, it is also vital for linkage analyses and family-based association tests when founder genotypes are missing. However, including related individuals in an analysis such as EIGENSTRAT can result in bias; using only founders or one individual per pedigree results in loss of data and inaccurate estimates of stratification. We examine a generalization of principal-component analyses to allow for the inclusion of related individuals by down-weighting the significance of individual comparisons.Entities:
Year: 2009 PMID: 20017970 PMCID: PMC2795877 DOI: 10.1186/1753-6561-3-s7-s106
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Number of effective individuals for five samples and scaled principal components
| Data seta | Individuals | PC1 | PC2 | PC3 |
|---|---|---|---|---|
| MaxUnrel | 2014 | 0.5113 | 0.405 | |
| Singletons | 418 | 0.4808 | ||
| OnePer | 1180 | 0.4475 | ||
| Full | 6757 | 0.4002 | 0.3717 | |
| Weighted | 2898.7 | 0.4147 | 0.3652 |
aMaxUnrel, maximum number of unrelateds; Singletons, individuals without genotyped relatives; One per, one individual per pedigree chosen at random; Full, all individuals without weighting; and Weighted, all individuals with weighted PCA
bBold components were found significant by the acceleration method.
Figure 1PC1 for multiple samples. Consistency of normalized PC1 for subsets compared with maximal set of unrelateds.
Figure 2Samples used. Mean number of individuals used for number of typed individuals. Possible indicates the mean number of founders (typed or untyped).