| Literature DB >> 26092718 |
Mark N Kvale1, Stephanie Hesselson2, Thomas J Hoffmann3, Yang Cao2, David Chan4, Sheryl Connell5, Lisa A Croen5, Brad P Dispensa2, Jasmin Eshragh2, Andrea Finn4, Jeremy Gollub4, Carlos Iribarren5, Eric Jorgenson5, Lawrence H Kushi5, Richard Lao2, Yontao Lu4, Dana Ludwig5, Gurpreet K Mathauda2, William B McGuire5, Gangwu Mei4, Sunita Miles5, Michael Mittman4, Mohini Patil4, Charles P Quesenberry5, Dilrini Ranatunga5, Sarah Rowell5, Marianne Sadler5, Lori C Sakoda5, Michael Shapero4, Ling Shen5, Tanu Shenoy2, David Smethurst5, Carol P Somkin5, Stephen K Van Den Eeden5, Lawrence Walter5, Eunice Wan2, Teresa Webster4, Rachel A Whitmer5, Simon Wong2, Chia Zau5, Yiping Zhan4, Catherine Schaefer6, Pui-Yan Kwok7, Neil Risch8.
Abstract
The Kaiser Permanente (KP) Research Program on Genes, Environment and Health (RPGEH), in collaboration with the University of California-San Francisco, undertook genome-wide genotyping of >100,000 subjects that constitute the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. The project, which generated >70 billion genotypes, represents the first large-scale use of the Affymetrix Axiom Genotyping Solution. Because genotyping took place over a short 14-month period, creating a near-real-time analysis pipeline for experimental assay quality control and final optimized analyses was critical. Because of the multi-ethnic nature of the cohort, four different ethnic-specific arrays were employed to enhance genome-wide coverage. All assays were performed on DNA extracted from saliva samples. To improve sample call rates and significantly increase genotype concordance, we partitioned the cohort into disjoint packages of plates with similar assay contexts. Using strict QC criteria, the overall genotyping success rate was 103,067 of 109,837 samples assayed (93.8%), with a range of 92.1-95.4% for the four different arrays. Similarly, the SNP genotyping success rate ranged from 98.1 to 99.4% across the four arrays, the variation depending mostly on how many SNPs were included as single copy vs. double copy on a particular array. The high quality and large scale of genotype data created on this cohort, in conjunction with comprehensive longitudinal data from the KP electronic health records of participants, will enable a broad range of highly powered genome-wide association studies on a diversity of traits and conditions.Entities:
Keywords: Affymetrix Axiom; GERA cohort; genome-wide genotyping; quality control; saliva DNA
Mesh:
Year: 2015 PMID: 26092718 PMCID: PMC4574249 DOI: 10.1534/genetics.115.178905
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562