| Literature DB >> 19697356 |
Cristopher V Van Hout1, Albert M Levin, Evadnie Rampersaud, Haiqing Shen, Jeffrey R O'Connell, Braxton D Mitchell, Alan R Shuldiner, Julie A Douglas.
Abstract
Knowledge of the extent and distribution of linkage disequilibrium (LD) is critical to the design and interpretation of gene mapping studies. Because the demographic history of each population varies and is often not accurately known, it is necessary to empirically evaluate LD on a population-specific basis. Here we present the first genome-wide survey of LD in the Old Order Amish (OOA) of Lancaster County Pennsylvania, a closed population derived from a modest number of founders. Specifically, we present a comparison of LD between OOA individuals and US Utah participants in the International HapMap project (abbreviated CEU) using a high-density single nucleotide polymorphism (SNP) map. Overall, the allele (and haplotype) frequency distributions and LD profiles were remarkably similar between these two populations. For example, the median absolute allele frequency difference for autosomal SNPs was 0.05, with an inter-quartile range of 0.02-0.09, and for autosomal SNPs 10-20 kb apart with common alleles (minor allele frequency > or =0.05), the LD measure r(2) was at least 0.8 for 15 and 14% of SNP pairs in the OOA and CEU, respectively. Moreover, tag SNPs selected from the HapMap CEU sample captured a substantial portion of the common variation in the OOA ( approximately 88%) at r(2) > or =0.8. These results suggest that the OOA and CEU may share similar LD profiles for other common but untyped SNPs. Thus, in the context of the common variant-common disease hypothesis, genetic variants discovered in gene mapping studies in the OOA may generalize to other populations. 2009 Wiley-Liss, Inc.Entities:
Mesh:
Year: 2010 PMID: 19697356 PMCID: PMC2811753 DOI: 10.1002/gepi.20444
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.135
Summary of autosomal SNPs
| OOA | CEU | Overlap | |
|---|---|---|---|
| Total genotyped | 489,922 | 489,922 | 489,922 |
| > 1 duplicate inconsistency | 51,459 | NA | NA |
| >5% missing data | 50,085 | 16,896 | 8,973 |
| Mendelian inconsistencies | 3,188 | 1,168 | 202 |
| | 379 | 217 | 116 |
| Passed QC filter | 415,440 | 472,851 | 409,071 |
| Passed QC in both OOA and CEU | |||
| Monomorphic | 68,869 | 57,669 | 52,467 |
| Polymorphic | |||
| MAF≥0.05 | 297,605 | 310,704 | 287,476 |
| MAF≥0.10 | 256,614 | 267,149 | 240,375 |
| MAF≥0.20 | 182,941 | 189,133 | 161,062 |
OOA, Old Order Amish; CEU, US Utah residents from HapMap; MAF, minor allele frequency; SNPs that failed a QC measure in either sample were excluded from further analysis, and SNPs with MAF≥0.05 passing QC in both samples (n = 287,476) were used for LD analysis.
Based on the 61 OOA individuals who were also genotyped on the Affymetrix 6.0 array; SNPs with more than one duplicated genotype discrepancy were excluded.
Based on 837 OOA and 90 CEU individuals (30 trios).
SNPs with >5 and >1 Mendelian inconsistencies in OOA and CEU, respectively.
Based on 60 unrelated individuals (30 men and 30 women) from each sample.
SNPs may fail QC in more than one way, so rows do not sum to the subtotal passing QC.
Summary of X chromosome SNPs
| OOA | CEU | Overlap | |
|---|---|---|---|
| Total genotyped | 10,525 | 10,525 | 10,525 |
| >1 duplicate inconsistency | 1,061 | NA | NA |
| >5% missing data | 547 | 461 | 261 |
| Mendelian inconsistencies | 44 | 246 | 10 |
| P<10−6 for HWE test | 0 | 0 | 0 |
| Passed QC filter | 9,139 | 10,064 | 8,972 |
| Passed QC in both OOA and CEU | |||
| Monomorphic | 2,272 | 1,905 | 1,805 |
| Polymorphic | |||
| MAF≥0.05 | 5,763 | 6,106 | 5,516 |
| MAF≥0.10 | 4,971 | 5,376 | 4,449 |
| MAF≥0.20 | 3,571 | 3,925 | 2,929 |
OOA, Old Order Amish; CEU, US Utah residents from HapMap; MAF, minor allele frequency. SNPs that failed a QC measure in either sample were excluded from further analysis, and SNPs with MAF≥0.05 passing QC in both samples (n=5,516) were used for LD analysis.
Based on the 61 OOA individuals who were also genotyped on the Affymetrix 5.0 array; SNPs with more than one duplicated genotype discrepancy were excluded.
Based on 837 OOA and 90 CEU individuals (30 trios).
SNPs with >5 and >1 Mendelian inconsistencies in OOA and CEU, respectively.
Based on 60 unrelated individuals (30 men and 30 women) from each sample.
SNPs may fail QC in more than one way, so rows do not sum to the subtotal passing QC.
Percentage of autosomal SNP pairsa showing no evidence of recombination (D′ = 1), perfect LD (r2 = 1), or where useful LD is observed (r2 ≥ 0.8)
| Inter-SNP distance (kb) | OOA | CEU | OOA | CEU | OOA | CEU |
|---|---|---|---|---|---|---|
| ≤10 | 79 | 75 | 20 | 19 | 30 | 29 |
| 10–20 | 60 | 53 | 9 | 7 | 15 | 14 |
| 20–50 | 43 | 34 | 4 | 3 | 9 | 7 |
| 50–100 | 28 | 20 | 1 | 1 | 3 | 2 |
| 100–200 | 20 | 11 | 0 | 0 | 1 | 1 |
| 200–500 | 14 | 7 | 0 | 0 | 0 | 0 |
| 500–1,000 | 12 | 6 | 0 | 0 | 0 | 0 |
| 1,000–2,000 | 11 | 5 | 0 | 0 | 0 | 0 |
| 2,000–5,000 | 10 | 5 | 0 | 0 | 0 | 0 |
| 5,000–10,000 | 8 | 5 | 0 | 0 | 0 | 0 |
OOA, Old Order Amish (n = 60); CEU, US Utah residents from HapMap (n = 60).
Restricted to SNPs with minor allele frequency ≥0.05 in both samples (n = 287,476).
Percentage of X chromosome SNP pairsa showing no evidence of recombination (D′ = 1), perfect LD (r2 = 1), or where useful LD is observed (r2≥0.8)
| Inter-SNP distance (kb) | OOA | CEU | OOA | CEU | OOA | CEU |
|---|---|---|---|---|---|---|
| ≤10 | 88 | 85 | 39 | 35 | 51 | 49 |
| 10–20 | 72 | 64 | 23 | 19 | 34 | 31 |
| 20–50 | 60 | 48 | 12 | 9 | 21 | 18 |
| 50–100 | 44 | 31 | 6 | 3 | 11 | 10 |
| 100–200 | 31 | 19 | 3 | 1 | 6 | 4 |
| 200–500 | 22 | 11 | 1 | 0 | 2 | 1 |
| 500–1,000 | 18 | 7 | 0 | 0 | 0 | 0 |
| 1,000–2,000 | 17 | 7 | 0 | 0 | 0 | 0 |
| 2,000–5,000 | 15 | 7 | 0 | 0 | 0 | 0 |
| 5,000–10,000 | 13 | 7 | 0 | 0 | 0 | 0 |
OOA, Old Order Amish (n = 60); CEU, US Utah residents from HapMap (n = 60).
Restricted to SNPs with minor allele frequency ≥0.05 in both samples (n = 5,516).