Literature DB >> 23316436

Comparison of the performance of two commercial genome-wide association study genotyping platforms in Han Chinese samples.

Lei Jiang1, Dana Willner, Patrick Danoy, Huji Xu, Matthew A Brown.   

Abstract

Most genome-wide association studies to date have been performed in populations of European descent, but there is increasing interest in expanding these studies to other populations. The performance of genotyping chips in Asian populations is not well established. Therefore, we sought to test the performance of widely used fixed-marker, genome-wide association studies chips in the Han Chinese population. Non-HapMap Chinese samples (n = 396) were genotyped using the Illumina OmniExpress and Affymetrix 6.0 platforms, whereas a subset also were genotyped using the Immunochip. Genotyped markers from the Affymetrix 6.0 and Illumina OmniExpress were used for full genome imputation based on the HapMap 2 JPT+CHB (Japanese from Tokyo, Japan and Chinese from Beijing, China) reference panel. The concordance between markers genotypes for the three platforms was very high whether directly genotyped or genotyped and imputed single nucleotide polymorphisms (SNPs; >99.8% for directly genotyped and >99.5% for genotyped and imputed SNPs, respectively) were compared. The OmniExpress chip data enabled more SNPs to be imputed, particularly SNPs with minor allele frequency >5%. The OmniExpress chip achieved better coverage of HapMap SNPs than the Affymetrix 6.0 chip (73.6% vs. 65.9%, respectively, for minor allele frequency >5%). The Affymetrix 6.0 and Illumina OmniExpress chip have similar genotyping accuracy and provide similar accuracy of imputed SNPs. The OmniExpress chip however provides better coverage of Asian HapMap SNPs, although its coverage of HapMap SNPs is moderate.

Entities:  

Keywords:  Affymetrix 6.0; Illumina OmniExpress; Immunochip; genetic polymorphisms

Mesh:

Substances:

Year:  2013        PMID: 23316436      PMCID: PMC3538340          DOI: 10.1534/g3.112.004069

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Genetic polymorphism is known to contribute to phenotypic variation, disease risks, and an individual’s response to pharmaceuticals and the environment. For the past 20 years, genetic linkage combined with positional cloning has achieved tremendous success for mapping the variations that underlie monogenic Mendelian diseases (Jimenez-Sanchez ). However, it was only with the development of the genome-wide association study (GWAS) approach that significant progress was made mapping disease-associated loci in common complex human diseases (Wang ). Since the advent of the GWAS era with the publication of the Wellcome Trust Case Control Consortium studies in 2007 (The Wellcome Trust Case Control Consortium 2007), GWAS has become a routinely used tool to identify common, low-risk variants associated with or causative of a wide variety of human diseases. This has led to the robust identification of more than 500 loci associated with common human diseases, representing a huge leap in our understanding of the aetiopathogenesis of human disease (Hindorff ; Manolio ). The GWAS approach has matured considerably since its introduction. The key technical advances have come about through refinement of single-nucleotide polymorphism (SNP) selection on chips, marked increases in SNP density, the development of copy number variant tagging approaches, and improvements in chip throughput and reduction in per SNP genotyping costs. At the moment, Illumina (http://www.illumina.com) and Affymetrix (http://www.affymetrix.com) microarrays are the two most popular microarray platforms in the GWAS research area. The ability to identify associations depends heavily on the coverage of the genotyping chip used (Barrett and Cardon 2006). This coverage varies significantly between major ethnic groups and can be assessed in silico using previously genotyped samples of known ethnicity, typically from the HapMap study (Barrett and Cardon 2006; The International Hapmap Consortium 2005; Pei ). This approach does not take into account real-world factors, notably the genotyping success rate of the different platforms and chips. To date, most GWAS studies have been performed in populations of European descent. Interest in gene-mapping in Asian and African populations in particular has increased recently, driven at least partly by the fact that the so-called “low-hanging fruit” already have been identified for most common human diseases and the increasing evidence of specificity of a significant number of genetic associations to individual ethnic groups (e.g., Cai ; Carty ; Noguchi ; Smith ; Yoon ). This is particularly the case in East Asia, where large case collections have already been established, or where high-quality health services in large populations enable the establishment of suitable case collections for GWAS studies. Transethnic mapping, or comparing association findings in ethnically remote populations, can be highly informative about established loci in addition to identifying novel loci (Hughes ). When disease associations are shared (with the same polymorphisms) between ethnically remote groups, this implies that the same causal variant is in linkage disequilibrium with the genotyped variant in each population, indicating a common founder mutation (Franceschini ; Twee-Hee Ong ). Here, we test the coverage of two commonly used genotyping chips, the Affymetrix SNP Array 6.0 (abbreviated as Affymetrix 6.0) and Illumina Human OmniExpress Bead Chip (abbreviated as OmniExpress), in a Han Chinese population (n = 396). The Affymetrix 6.0 is designed to genotype more than 906,600 SNPs and 946,000 copy number variants. The OmniExpress is an Illumina Infinium HD BeadChip, which is designed to genotype 733,202 markers per sample. A subset of individuals also was genotyped using the Immunochip, an Illumina Infinium chip containing 196,524 polymorphisms for known immunogenetic loci (Cortes and Brown 2011). Chip performance was assessed in relation to data quality and genomic coverage, both for directly genotyped and imputed SNPs. Genotype concordance was evaluated between the Affymetrix 6.0 and OmniExpress, as well as between these two platforms and a third genotyping platform, the Immunochip.

Materials and Methods

Study samples

This study was approved by the research ethics committee of the Second Military Medical University, China. Blood samples were obtained from patients attending outpatient clinics in Changzheng Hospital, Shanghai, China. All the study subjects provided signed informed consent. Subjects genotyped with the Affymetrix 6.0 platform were a subset of a larger cohort of non-HapMap Han Chinese individuals at Changzheng Hospital, which included rheumatoid arthritis cases (n = 51) and controls (n = 188). Only control subjects were genotyped further using OmniExpress or Immunochip. Genomic DNA was extracted from peripheral blood leukocytes using the AxyPrep Blood Genomic DNA Miniprep Kit (Axygen, Union City, CA) according to the manufacturer’s instructions. All genomic DNA was resuspended in TE Buffer for the following study.

Genotyping

The concentration of genomic DNA was measured using the Quant-iT PicoGreen dsDNA assay (Invitrogen, Carlsbad, CA), and concentrations were standardized to 50 ng/μL for genotyping. All samples were genotyped initially using the Illumina OmniExpress (Illumina Inc., San Diego, CA) and Affymetrix 6.0 (Affymetrix Inc., Santa Clara, CA) chips. Correlation of genotypes was then checked with a further Illumina Infinium chip, the Immunochip (Illumina Inc.), which contains a mixture of common and rare variants (Cortes and Brown 2011). All platforms were operated according to the manufacturer’s instructions.

Genotype clustering and quality control

Illumina genotype clustering was performed using Illumina’s BeadStudio software (Illumina Inc.). SNPs were reclustered using the study samples, low quality subjects were removed, and a subset of clustering results were manually inspected and verified. All SNPs with GenCall scores less than 0.15 were excluded. For Affymetrix chips, clustering was performed using the program Birdseed version 2, as implemented using Affymetrix Genotyper Console v4.0 (Affymetrix Inc.), using the default quality control (QC) thresholds. Population stratification was assessed with Eigenstrat and SmartPCA (both available from http://genepath.med.harvard.edu/∼reich/Software.htm) using the HapMap phase 3 Release 27 data (Price ). This identified three ethnic outliers, which were removed before further analysis. Genotyped SNPs were evaluated using PLINK version 1.07 (pngu.mgh.harvard.edu/∼purcell/plink/), and markers with call rates less than 95%, minor allele frequency (MAF) less than 1%, and Hardy-Weinberg Equilibrium P-values less than 10−6 as were excluded from further analysis (Purcell ). Genotyping data in PLINK format has been deposited at the European Genome Phenome Archive (EGA, Genome-Phenome http://www.ebi.ac.uk/ega/) which is hosted at the European Bionformatics Institute, under accession number EGAS00000000131.

Imputation

Imputations were carried out using MACH (www.sph.umich.edu/csg/abecasis/MACH) with 50 rounds of model building (Li ). The phase 2 JPT+CHB (Japanese from Tokyo, Japan and Chinese from Beijing, China) HapMap Release 27 data were used as the reference panel. Imputation quality was assessed using the R2 metric. Imputed SNPs with R2 less than 0.3 were considered to be of poor quality and were excluded from analysis of imputation efficacy and genotyping concordance. MACH output files were converted to PLINK format using the free software GenGen [www.openbioinformatics.org/gengen/ (Wang )]. Coverage was calculated as the number of genotyped plus imputed SNPs (r2 > 0.8) divided by the number of genotyped HapMap markers for the JPT+CHB reference panel at a given MAF. Corrected coverage was calculated using the formula presented in Barrett and Cardon (2006), which adjusts coverage estimates for the total number of polymorphisms in the genome.

Concordance analysis

Genotype concordance was analyzed using PLINK version 1.07 with the merge function in merge-mode 7, which compares concordance ignoring missing genotypes (Purcell ). Prior to comparison SNPs genotyped on different platforms were adjusted for strand differences and allele coding to allow for accurate assessment of homozygote to homozygote miscalls as discussed in Results. Figure 1 shows the number of individuals genotyped by each pair of platforms, and Table 1 shows the number of shared markers. Types of discordance, MAF, and R2 values for each SNP were extracted from PLINK and GenGen files using Perl scripts. Correlations between concordance, MAF, R2 were analyzed using the R function cor.test, and scatterplots were made using the R plot function (R Development Core Team 2005).
Figure 1 

Study sample size. The total number of individuals genotyped by each platform is indicated under the platform name. Sample sizes in overlap regions represent the number of study subjects genotyped by two or more platforms.

Table 1

Concordance between genotyped SNPs

Array 1Array 2No. Shared Genotyped SNPs Passing QC with MAF > 0.01Percent Concordance
Affymetrix 6.0OmniExpress146,88599.89
Affymetrix 6.0Immunochip13,64499.89
OmniExpressImmunochip18,85999.97

The third column indicates the number of SNPs shared by Array 1 and Array 2 after quality and MAF filtering. Concordance rates presented in the fourth column represent the sum of counts of concordant markers for each subject divided by the total number of markers for all subjects. MAF, minimum allele frequency; SNP, single-nucleotide polymorphism.

Study sample size. The total number of individuals genotyped by each platform is indicated under the platform name. Sample sizes in overlap regions represent the number of study subjects genotyped by two or more platforms. The third column indicates the number of SNPs shared by Array 1 and Array 2 after quality and MAF filtering. Concordance rates presented in the fourth column represent the sum of counts of concordant markers for each subject divided by the total number of markers for all subjects. MAF, minimum allele frequency; SNP, single-nucleotide polymorphism.

Results

Genotyping success and concordance across chips

A total of 396 subjects were genotyped, with subsets subjects genotyped on one, two, or all three platforms (Figure 1). For each array type, data were first subjected to QC to remove SNPs with low call rates and MAF less than 1%. For the Affymetrix 6.0 array, 582,284 SNPs (62.28% of total markers) passed QC and had MAF greater than 1% for each array. The number of SNPs that were successfully genotyped by the OmniExpress chip was 593,582 (81.19% of total markers), which was slightly greater than for Affymetrix 6.0. For the Immunochip, 82,084 markers had MAF less than 0.01, and an additional 2,388 failed QC, leaving 112,112 (57.04% of total markers) genotyped markers. Genotype concordance between chip types was very high for all pairs of platforms; dually genotyped SNPs were more than 99.8% concordant in all cases (Table 1). No significant differences were noted between pairwise concordance, indicating that for genotypes that passed QC, genotyping accuracy did not vary substantially between the different platforms.

Imputation efficacy

Information on imputation efficacy for HapMap SNPs using genotyped Affymetrix 6.0 and OmniExpress markers is presented in Table S1. The number of imputable SNPs refers to the number of SNPs in the HapMap II CHB+JPT data set for a particular chromosome (Table S1A and B, second column). Imputation efficacy was initially evaluated using the MACH default value of r2 =0.3 as in Nothnagel . Overall, imputations were more efficacious using the OmniExpress chip, with 4.59% of SNPs failing QC vs. 7.10% for the Affymetrix 6.0. For both data sets, there was a high prevalence of SNPs with MAF less than 1%, but this was greater for the Affymetrix 6.0 chip (8.42%) than the OmniExpress (4.72%).

Coverage of HapMap SNPs

Imputed SNPs also were evaluated at r2 = 0.80 to calculate coverage of HapMap CHB+JPT markers (Table 2). The HapMap Phase II CHB+JPT panel contains 2,133,507 markers with MAF greater than 5%. Using the OmniExpress platform, 25.3% of these were genotyped, whereas 52.9% were successfully imputed, giving a naïve coverage rate of 78.2%. Using Barrett and Cardon’s corrected formula for coverage with an estimate of 7.5 million SNPs in the genome, we calculated coverage as 73.0% (Barrett and Cardon 2006), which is much lower than the 91% coverage for markers with MAF greater than 5% reported by Illumina for JPT+CHB populations (www.illumina.com/products/human_omni_express.ilmn). Corresponding coverage estimates for HapMap SNPs with MAF greater than 1% were 76.1% (uncorrected) and 71.1% (corrected; Table 2). Using the Affymetrix platform, we found that the empirical coverages for SNPs with MAF greater than 5% were 66.6% (uncorrected) and 59.2% (corrected), and for MAF greater than 1% were 66.4% (uncorrected) and 57.3% (corrected). This observed coverage was much lower than the theoretical coverage of 84% for JPT+CHB reported in Li .
Table 2

Coverage of JPT+CHB HapMap 2 SNPs

No. JPT+CHB HapMap SNPsHapMap SNPs GenotypedHapMap SNPs Imputed with r2 > 0.8CoverageCorrected Coverage
MAF > 0.05
 OmniExpress2,133,507540,163 (25.3%)1,128,626 (52.9%)78.2%73.0%
 Affymetrix 6.02,133,507501,993 (23.5%)919,764 (43.1%)66.6%59.2%
MAF > 0.01
 OmniExpress2,344,748581,788 (24.8%)1,288,127 (54.9%)79.7%71.1%
 Affymetrix 6.02,344,748512,669 (21.9%)1,045,048 (44.6%)66.5%57.3%

Only SNPs with a genotyping rate greater than 0.95 were considered. Coverage was calculated as the number of genotyped plus imputed SNPs (r2 > 0.8) divided by the number of genotyped HapMap markers for the JPT+CHB reference panel at a given MAF. Corrected coverage was calculated using the formula presented in Barrett and Cardon (2006), which adjusts coverage estimates for the total number of polymorphisms in the genome. MAF, minimum allele frequency; SNP, single-nucleotide polymorphism.

Only SNPs with a genotyping rate greater than 0.95 were considered. Coverage was calculated as the number of genotyped plus imputed SNPs (r2 > 0.8) divided by the number of genotyped HapMap markers for the JPT+CHB reference panel at a given MAF. Corrected coverage was calculated using the formula presented in Barrett and Cardon (2006), which adjusts coverage estimates for the total number of polymorphisms in the genome. MAF, minimum allele frequency; SNP, single-nucleotide polymorphism.

Concordance of imputed SNPs

The number of imputable SNPs, i.e., the number of SNPs for one platform that can be imputed using a second platform, was calculated as the number of unshared (not dually genotyped) markers with phasing information in the HapMap II CHB+JPT set (Table 3). Imputation efficacy between platforms corresponds to the percentage of imputable SNPs which passed imputation QC (r2 > 0.3, MAF > 0.1, call rate > 0.95; Table 4). The OmniExpress platform was more successful at imputing Affymetrix 6.0 markers than vice versa; however, both platforms had similar efficacy for imputation of Immunochip SNPs.
Table 3

Imputable SNPs for each array

Array 1Array 2No. SNPs Passing QC with MAF > 0.01 Genotyped on Array 1 OnlyNo. Imputable SNPs Using HapMap CHB+JPT Reference Set (Percent)
Affymetrix 6.0OmniExpress435,399372,433 (85.53%)
OmniExpressAffymetrix 6.0446,697391,758 (87.70%)
ImmunochipAffymetrix 6.028,30525,844 (91.31%)
ImmunochipOmniExpress23,37020,135 (86.16%)

To calculate the number of imputable SNPs, SNPs dually genotyped by two arrays were first removed from consideration, as well as any SNPs failing QC. The number of remaining markers with phasing information in the HapMap CHB+JPT set (the reference set for imputations) was then tabulated. The percentages in the fourth column are the number of SNPs with phasing information divided by the number of non-shared SNPs (third column). MAF, minimum allele frequency; QC, quality control; SNP, single-nucleotide polymorphism.

Table 4

Concordance and types of discordance in dually genotyped or genotyped and imputed SNPs

Platform 1Platform 2Concordance
(No. SNPs)Homozygote to HomozygoteHeterozygote to HomozygoteHomozygote to Heterozygote
Affymetrix 6.0OmniExpress99.89% (146,885)45 (0.2%)12486 (64.8%)6750 (35.0%)
Affymetrix 6.0OmniExpress Imputed99.55% (252,679)388 (0.3%)55336 (44.1%)69891 (55.6%)
Affymetrix 6.0Immunochip99.89% (13,644)35 (3.9%)466 (51.5%)403 (44.6%)
OmniExpressAffymetrix 6.0 Imputed99.52% (190,555)853 (0.3%)55336 (44.1%)69891 (55.6%)
OmniExpressImmunochip99.97% (18,859)7 (0.8%)343 (39.2%)524 (60.0%)
ImmunochipAffymetrix 6.0 Imputed99.66% (19,727)73 (2.2%)1551 (45.7%)1769 (52.1%)
ImmunochipOmniExpress Imputed99.77% (15,024)186 (1.4%)6056 (47.6%)6492 (51.0%)

The number of SNPs exhibiting heterozygote to homozygote mismatch vs. homozygote mis-calls are indicated. Percentages indicate the percent of discordant calls which fall into each category. SNP, single-nucleotide polymorphism.

To calculate the number of imputable SNPs, SNPs dually genotyped by two arrays were first removed from consideration, as well as any SNPs failing QC. The number of remaining markers with phasing information in the HapMap CHB+JPT set (the reference set for imputations) was then tabulated. The percentages in the fourth column are the number of SNPs with phasing information divided by the number of non-shared SNPs (third column). MAF, minimum allele frequency; QC, quality control; SNP, single-nucleotide polymorphism. The number of SNPs exhibiting heterozygote to homozygote mismatch vs. homozygote mis-calls are indicated. Percentages indicate the percent of discordant calls which fall into each category. SNP, single-nucleotide polymorphism. Concordance between imputed and genotyped SNPs for pairs of array platforms was very similar to concordance between pairs of genotyped SNPs (Table 1 and Table 4). Concordance rates exceeded 99.5% in all cases, and no differences were noted in the concordance rates for different pairs of genotyping arrays. This finding suggests that imputation is sufficiently robust to permit combining imputed and genotyped SNPs in studies using both Affymetrix 6.0 and OmniExpress arrays. Discordance between imputed and genotyped markers was classified into three categories: (1) a heterozygote mis-called as a homozygote, (2) a homozygote mis-called as a heterozygote, and (3) a homozygote called as the opposite homozygote. For all chip types, homozygote mis-calls as heterozygotes were more common than the converse (Table 4). Homozygote-to-homozygote mis-calls were uncommon with all chip types, comprising less than 5% of miscalls in all cases, and occurred slightly less frequently in the imputed OmniExpress data than the imputed Affymetrix 6.0 data (Table 4). Interestingly, homozygote mis-calls were noted between the Illumina OmniExpress and Immunochip arrays, which use the same strand designation system (Table 4).

Correlation between concordance, MAF, and R2

Concordance was analyzed on a per-marker basis to explore correlations with MAF and R2, as well as to determine the types of discordance present. It has been previously reported that imputation accuracy is positively correlated with MAF, which implies that concordance between genotyped and imputed markers also should increase with increasing MAF (Pei ; Marchini and Howie 2010). Here, we observed small but statistically significant positive correlations between MAF and concordance. A statistically significant but negligible correlation was observed between concordance and both MAF (Pearson’s r = 0.07, P < 0.001) and R2 (r = −0.09, P < 0.001) were detected when comparing the imputed OmniExpress SNPs to the Affymetrix 6.0 genotyped SNPs (Figure 2, top panels). Similarly, a negligible correlation was observed between concordance and MAF (r = −0.04, P < 0.001) for the imputed Affymetrix 6.0 markers compared with OmniExpress genotypes (Figure 2, bottom panels). The correlation between R2 and concordance was greater (r = −0.12, P < 0.001), but still quite low.
Figure 2 

Concordance vs. MAF and R2 for genotyping platforms. Each data point represents a genetic marker genotyped on one platform and imputed by the other. Concordance represents the proportion of subjects for which the directly obtained and imputed genotypes were the same, and MAF and R2 were calculated using PLINK.

Concordance vs. MAF and R2 for genotyping platforms. Each data point represents a genetic marker genotyped on one platform and imputed by the other. Concordance represents the proportion of subjects for which the directly obtained and imputed genotypes were the same, and MAF and R2 were calculated using PLINK.

Discussion

In this study we have compared the real-world performance of two widely used genotyping platforms, Affymetrix 6.0 and Illumina OmniExpress, in a non-HapMap Han Chinese population. Concordance between genotyped markers across the Affymetrix and Illumina platforms, including the Immunochip, was high, with no significant differences noted between different pairs of chips. Nothnagel et al. showed similarly high rates of concordance between the Affymetrix 6.0 and Illumina 550k platforms for genotyping in a German population (Nothnagel ). Concordance of imputed with directly genotyped SNPs was slightly lower than between genotyped SNPs alone, but the level of discordance was still quite low. Although this discordance is minor, care must be taken when combining information from different genotyping platforms for cases and controls because platform-specific differences can lead to spurious or inflated associations (Sinnott and Kraft 2011). Low MAF had little practical impact on the concordance or imputation quality as assessed by R2, suggesting that these GWAS chips, although not specifically designed to capture rare variants, may actually capture a significant fraction of genetic variation due to low frequency variants. Although discordance between both directly genotyped and imputed and genotyped SNPs was low, we noted differences in the type of discordance, with a greater frequency of homozygote genotypes being imputed as heterozygotes than the converse. A slightly greater error rate was observed when comparing imputed data from Affymetrix 6.0 chips with either the OmniExpress or Immunochip data, than with genotypes imputed from OmniExpress compared with directly genotyped Affymetrix 6.0 or Immunochip SNPs. This finding suggests that the imputation accuracy from Affymetrix SNPs is lower than from OmniExpress, although the difference was small. Imputation accuracy was lower on average for Affymetrix platforms as compared to Illumina platforms for CHB+JPT populations in a previous study by Li . Significant differences were noted in the ability to impute from the OmniExpress and Affymetrix 6.0, and imputation efficacy using MACH was much lower in general than previously reported [16]. Many imputed SNPs were not considered for further analysis due to low genotyping rates, poor QC values, and MAF less than 1% in the target panel (Table 2). Xu demonstrated the presence of significant substructure in Han Chinese populations, with notable differences between Northern and Southern Han subpopulations. The HapMap CHB+JPT reference panel includes genotypes from Northern Han (Beijing) and Japanese individuals, while our study population was from Shanghai, which is in the Southern Han region. Some markers which are polymorphic in the JPT+CHB reference panel with MAF greater than 1% may be less polymorphic or monomorphic in the Han Chinese population studied here, contributing to the reduced imputation efficacy. Greater imputation efficacy might be achieved by the use of mixed reference panels as described in Huang . Using HapMap data, Illumina reports that OmniExpress chips capture around 91% SNPs with MAF > 5% and r2 > 0.8 in CEU (Utah residents with European ancestry from the Centre du Etude Polymorphisme Humain collection) populations, with similarly good performance also in CHB+JPT (91% in HapMap). In our study, coverage was much lower for the OmniExpress platform (Table 3). This decreased coverage was largely attributable to imputation efficacy as described previously. Coverage estimates are known to be inflated when coverage is calculated using on the reference panel from which tagSNPs were selected during the chip design process (Barrett and Cardon 2006). This results in overfitting, which in combination with the small sample size of the HapMap data set can lead to exaggerated coverage estimates (Hao ). Markers for the Illumina OmniExpress platform were selected from HapMap SNPs representative of common variants (MAF > 5%) which may also resulted in overestimation of coverage (www.illumina.com/products/human_omni_express.ilmn). The use of more specialized reference panels with larger sample sizes, and genotyping platforms tailored to East Asian populations could improve coverage in future studies. Coverage for the Affymetrix 6.0 chip was lower than for the OmniExpress because a greater proportion of HapMap SNPs could be imputed using OmniExpress. Lower coverage of CEU, JPT+CHB, and YRI populations by Affymetrix arrays as compared with Illumina arrays has previously been reported and may be related to differences in SNP selection between the two platforms (Mägi ; Li , 2010). The Affymetrix microarray technology relies on the differential hybridization of genomic DNA to 25-mer probes which match SNP alleles, while the Illumina Infinium technology uses hybridization followed by primer extension (Kennedy ; Steemers ; Ragoussis 2009). Although Illumina emphasizes tagging SNPs, SNP selection for the Affymetrix system is limited by technical constraints. For example, SNPs chosen for the Affymetrix 6.0 assay must be located within fragments generated by a restriction digest by the enzymes Nsp I and Sty I (http://www.affymetrix.com), and are then required to ameliorate to universal hybridization conditions for adaptor ligation and subsequent inclusion on the array (Ding and Jin 2009). This study demonstrates that, in a real-world setting, both the Affymetrix 6.0 and OmniExpress chips achieve good coverage of genetic variation in Japanese and Chinese as defined by HapMap. From our results, we can conclude that despite the different mechanisms in chemistry, the Affymetrix 6.0 and OmniExpress platforms both give good call rates and similar genotype accuracy, in comparison with the Illumina Infinium Immunochip genotypes. Further, the imputation accuracy comparing SNPs imputed on one platform with SNPs genotyped on another platform was high, indicating that, with appropriate quality control, it is valid to combine imputed and genotyped SNPs in studies where samples have been genotyped either on the Affymetrix 6.0 or OmniExpress chips.
  31 in total

1.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

2.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

3.  Evaluating coverage of genome-wide association studies.

Authors:  Jeffrey C Barrett; Lon R Cardon
Journal:  Nat Genet       Date:  2006-05-21       Impact factor: 38.330

4.  Evaluation of coverage variation of SNP chips for genome-wide association studies.

Authors:  Mingyao Li; Chun Li; Weihua Guan
Journal:  Eur J Hum Genet       Date:  2008-02-06       Impact factor: 4.246

5.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

6.  Genomic dissection of population substructure of Han Chinese and its implication in association studies.

Authors:  Shuhua Xu; Xianyong Yin; Shilin Li; Wenfei Jin; Haiyi Lou; Ling Yang; Xiaohong Gong; Hongyan Wang; Yiping Shen; Xuedong Pan; Yungang He; Yajun Yang; Yi Wang; Wenqing Fu; Yu An; Jiucun Wang; Jingze Tan; Ji Qian; Xiaoli Chen; Xin Zhang; Yangfei Sun; Xuejun Zhang; Bailin Wu; Li Jin
Journal:  Am J Hum Genet       Date:  2009-12       Impact factor: 11.025

7.  Analyses and comparison of accuracy of different genotype imputation methods.

Authors:  Yu-Fang Pei; Jian Li; Lei Zhang; Christopher J Papasian; Hong-Wen Deng
Journal:  PLoS One       Date:  2008-10-29       Impact factor: 3.240

8.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

Authors: 
Journal:  Nature       Date:  2007-06-07       Impact factor: 49.962

9.  Evaluating the performance of commercial whole-genome marker sets for capturing common genetic variation.

Authors:  Reedik Mägi; Arne Pfeufer; Mari Nelis; Alexandre Montpetit; Andres Metspalu; Maido Remm
Journal:  BMC Genomics       Date:  2007-06-11       Impact factor: 3.969

10.  Calibrating the performance of SNP arrays for whole-genome association studies.

Authors:  Ke Hao; Eric E Schadt; John D Storey
Journal:  PLoS Genet       Date:  2008-06-27       Impact factor: 5.917

View more
  9 in total

1.  Modulative effects of COMT haplotype on age-related associations with brain morphology.

Authors:  Annie Lee; Anqi Qiu
Journal:  Hum Brain Mapp       Date:  2016-02-27       Impact factor: 5.038

2.  Coverage and efficiency in current SNP chips.

Authors:  Ngoc-Thuy Ha; Saskia Freytag; Heike Bickeboeller
Journal:  Eur J Hum Genet       Date:  2014-01-22       Impact factor: 4.246

3.  Population Genomics of Human Adaptation.

Authors:  Joseph Lachance; Sarah A Tishkoff
Journal:  Annu Rev Ecol Evol Syst       Date:  2013-11       Impact factor: 13.915

4.  FKBP5 Moderates the Association between Antenatal Maternal Depressive Symptoms and Neonatal Brain Morphology.

Authors:  Changqing Wang; Mojun Shen; Bryan Guillaume; Yap-Seng Chong; Helen Chen; Marielle V Fortier; Michael J Meaney; Anqi Qiu
Journal:  Neuropsychopharmacology       Date:  2017-10-03       Impact factor: 7.853

5.  Inter- and intra-reproducibility of genotypes from sheep technical replicates on Illumina and Affymetrix platforms.

Authors:  Donagh P Berry; Aine O'Brien; Eamonn Wall; Kevin McDermott; Shane Randles; Paul Flynn; Stephen Park; Jenny Grose; Rebecca Weld; Noirin McHugh
Journal:  Genet Sel Evol       Date:  2016-11-10       Impact factor: 4.297

6.  Psychiatric polygenic risk associates with cortical morphology and functional organization in aging.

Authors:  Annie Lee; Mojun Shen; Anqi Qiu
Journal:  Transl Psychiatry       Date:  2017-12-11       Impact factor: 6.222

7.  Identifying tagging SNPs for African specific genetic variation from the African Diaspora Genome.

Authors:  Henry Richard Johnston; Yi-Juan Hu; Jingjing Gao; Timothy D O'Connor; Gonçalo R Abecasis; Genevieve L Wojcik; Christopher R Gignoux; Pierre-Antoine Gourraud; Antoine Lizee; Mark Hansen; Rob Genuario; Dave Bullis; Cindy Lawley; Eimear E Kenny; Carlos Bustamante; Terri H Beaty; Rasika A Mathias; Kathleen C Barnes; Zhaohui S Qin
Journal:  Sci Rep       Date:  2017-04-21       Impact factor: 4.379

8.  Effects of Antenatal Maternal Depressive Symptoms and Socio-Economic Status on Neonatal Brain Development are Modulated by Genetic Risk.

Authors:  Anqi Qiu; Mojun Shen; Claudia Buss; Yap-Seng Chong; Kenneth Kwek; Seang-Mei Saw; Peter D Gluckman; Pathik D Wadhwa; Sonja Entringer; Martin Styner; Neerja Karnani; Christine M Heim; Kieran J O'Donnell; Joanna D Holbrook; Marielle V Fortier; Michael J Meaney
Journal:  Cereb Cortex       Date:  2017-05-01       Impact factor: 5.357

9.  M(3)-S: a genotype calling method incorporating information from samples with known genotypes.

Authors:  Gengxin Li; Hongyu Zhao
Journal:  BMC Bioinformatics       Date:  2015-12-03       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.