| Literature DB >> 23316436 |
Lei Jiang1, Dana Willner, Patrick Danoy, Huji Xu, Matthew A Brown.
Abstract
Most genome-wide association studies to date have been performed in populations of European descent, but there is increasing interest in expanding these studies to other populations. The performance of genotyping chips in Asian populations is not well established. Therefore, we sought to test the performance of widely used fixed-marker, genome-wide association studies chips in the Han Chinese population. Non-HapMap Chinese samples (n = 396) were genotyped using the Illumina OmniExpress and Affymetrix 6.0 platforms, whereas a subset also were genotyped using the Immunochip. Genotyped markers from the Affymetrix 6.0 and Illumina OmniExpress were used for full genome imputation based on the HapMap 2 JPT+CHB (Japanese from Tokyo, Japan and Chinese from Beijing, China) reference panel. The concordance between markers genotypes for the three platforms was very high whether directly genotyped or genotyped and imputed single nucleotide polymorphisms (SNPs; >99.8% for directly genotyped and >99.5% for genotyped and imputed SNPs, respectively) were compared. The OmniExpress chip data enabled more SNPs to be imputed, particularly SNPs with minor allele frequency >5%. The OmniExpress chip achieved better coverage of HapMap SNPs than the Affymetrix 6.0 chip (73.6% vs. 65.9%, respectively, for minor allele frequency >5%). The Affymetrix 6.0 and Illumina OmniExpress chip have similar genotyping accuracy and provide similar accuracy of imputed SNPs. The OmniExpress chip however provides better coverage of Asian HapMap SNPs, although its coverage of HapMap SNPs is moderate.Entities:
Keywords: Affymetrix 6.0; Illumina OmniExpress; Immunochip; genetic polymorphisms
Mesh:
Substances:
Year: 2013 PMID: 23316436 PMCID: PMC3538340 DOI: 10.1534/g3.112.004069
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1 Study sample size. The total number of individuals genotyped by each platform is indicated under the platform name. Sample sizes in overlap regions represent the number of study subjects genotyped by two or more platforms.
Concordance between genotyped SNPs
| Array 1 | Array 2 | No. Shared Genotyped SNPs Passing QC with MAF > 0.01 | Percent Concordance |
|---|---|---|---|
| Affymetrix 6.0 | OmniExpress | 146,885 | 99.89 |
| Affymetrix 6.0 | Immunochip | 13,644 | 99.89 |
| OmniExpress | Immunochip | 18,859 | 99.97 |
The third column indicates the number of SNPs shared by Array 1 and Array 2 after quality and MAF filtering. Concordance rates presented in the fourth column represent the sum of counts of concordant markers for each subject divided by the total number of markers for all subjects. MAF, minimum allele frequency; SNP, single-nucleotide polymorphism.
Coverage of JPT+CHB HapMap 2 SNPs
| No. JPT+CHB HapMap SNPs | HapMap SNPs Genotyped | HapMap SNPs Imputed with r2 > 0.8 | Coverage | Corrected Coverage | |
|---|---|---|---|---|---|
| MAF > 0.05 | |||||
| OmniExpress | 2,133,507 | 540,163 (25.3%) | 1,128,626 (52.9%) | 78.2% | 73.0% |
| Affymetrix 6.0 | 2,133,507 | 501,993 (23.5%) | 919,764 (43.1%) | 66.6% | 59.2% |
| MAF > 0.01 | |||||
| OmniExpress | 2,344,748 | 581,788 (24.8%) | 1,288,127 (54.9%) | 79.7% | 71.1% |
| Affymetrix 6.0 | 2,344,748 | 512,669 (21.9%) | 1,045,048 (44.6%) | 66.5% | 57.3% |
Only SNPs with a genotyping rate greater than 0.95 were considered. Coverage was calculated as the number of genotyped plus imputed SNPs (r2 > 0.8) divided by the number of genotyped HapMap markers for the JPT+CHB reference panel at a given MAF. Corrected coverage was calculated using the formula presented in Barrett and Cardon (2006), which adjusts coverage estimates for the total number of polymorphisms in the genome. MAF, minimum allele frequency; SNP, single-nucleotide polymorphism.
Imputable SNPs for each array
| Array 1 | Array 2 | No. SNPs Passing QC with MAF > 0.01 Genotyped on Array 1 Only | No. Imputable SNPs Using HapMap CHB+JPT Reference Set (Percent) |
|---|---|---|---|
| Affymetrix 6.0 | OmniExpress | 435,399 | 372,433 (85.53%) |
| OmniExpress | Affymetrix 6.0 | 446,697 | 391,758 (87.70%) |
| Immunochip | Affymetrix 6.0 | 28,305 | 25,844 (91.31%) |
| Immunochip | OmniExpress | 23,370 | 20,135 (86.16%) |
To calculate the number of imputable SNPs, SNPs dually genotyped by two arrays were first removed from consideration, as well as any SNPs failing QC. The number of remaining markers with phasing information in the HapMap CHB+JPT set (the reference set for imputations) was then tabulated. The percentages in the fourth column are the number of SNPs with phasing information divided by the number of non-shared SNPs (third column). MAF, minimum allele frequency; QC, quality control; SNP, single-nucleotide polymorphism.
Concordance and types of discordance in dually genotyped or genotyped and imputed SNPs
| Platform 1 | Platform 2 | Concordance (No. SNPs) | Homozygote to Homozygote | Heterozygote to Homozygote | Homozygote to Heterozygote |
|---|---|---|---|---|---|
| Affymetrix 6.0 | OmniExpress | 99.89% (146,885) | 45 (0.2%) | 12486 (64.8%) | 6750 (35.0%) |
| Affymetrix 6.0 | OmniExpress Imputed | 99.55% (252,679) | 388 (0.3%) | 55336 (44.1%) | 69891 (55.6%) |
| Affymetrix 6.0 | Immunochip | 99.89% (13,644) | 35 (3.9%) | 466 (51.5%) | 403 (44.6%) |
| OmniExpress | Affymetrix 6.0 Imputed | 99.52% (190,555) | 853 (0.3%) | 55336 (44.1%) | 69891 (55.6%) |
| OmniExpress | Immunochip | 99.97% (18,859) | 7 (0.8%) | 343 (39.2%) | 524 (60.0%) |
| Immunochip | Affymetrix 6.0 Imputed | 99.66% (19,727) | 73 (2.2%) | 1551 (45.7%) | 1769 (52.1%) |
| Immunochip | OmniExpress Imputed | 99.77% (15,024) | 186 (1.4%) | 6056 (47.6%) | 6492 (51.0%) |
The number of SNPs exhibiting heterozygote to homozygote mismatch vs. homozygote mis-calls are indicated. Percentages indicate the percent of discordant calls which fall into each category. SNP, single-nucleotide polymorphism.
Figure 2 Concordance vs. MAF and R2 for genotyping platforms. Each data point represents a genetic marker genotyped on one platform and imputed by the other. Concordance represents the proportion of subjects for which the directly obtained and imputed genotypes were the same, and MAF and R2 were calculated using PLINK.