Literature DB >> 28334151

Strategies for processing and quality control of Illumina genotyping arrays.

Shilin Zhao1, Wang Jing1, David C Samuels2, Quanghu Sheng1, Yu Shyr3, Yan Guo1.   

Abstract

Illumina genotyping arrays have powered thousands of large-scale genome-wide association studies over the past decade. Yet, because of the tremendous volume and complicated genetic assumptions of Illumina genotyping data, processing and quality control (QC) of these data remain a challenge. Thorough QC ensures the accurate identification of single-nucleotide polymorphisms and is required for the correct interpretation of genetic association results. By processing genotyping data on > 100 000 subjects from >10 major Illumina genotyping arrays, we have accumulated extensive experience in handling some of the most peculiar scenarios related to the processing and QC of Illumina genotyping data. Here, we describe strategies for processing Illumina genotyping data from the raw data to an analysis ready format, and we elaborate on the necessary QC procedures required at each processing step. High-quality Illumina genotyping data sets can be obtained by following our detailed QC strategies.

Mesh:

Year:  2018        PMID: 28334151      PMCID: PMC6171493          DOI: 10.1093/bib/bbx012

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  37 in total

1.  Detection of genotyping errors by Hardy-Weinberg equilibrium testing.

Authors:  Louise Hosking; Sheena Lumsden; Karen Lewis; Astrid Yeo; Linda McCarthy; Aruna Bansal; John Riley; Ian Purvis; Chun-Fang Xu
Journal:  Eur J Hum Genet       Date:  2004-05       Impact factor: 4.246

2.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

3.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

4.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors:  John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2008-06-11       Impact factor: 9.043

5.  Stem cell transcriptome profiling via massive-scale mRNA sequencing.

Authors:  Nicole Cloonan; Alistair R R Forrest; Gabriel Kolle; Brooke B A Gardiner; Geoffrey J Faulkner; Mellissa K Brown; Darrin F Taylor; Anita L Steptoe; Shivangi Wani; Graeme Bethel; Alan J Robertson; Andrew C Perkins; Stephen J Bruce; Clarence C Lee; Swati S Ranade; Heather E Peckham; Jonathan M Manning; Kevin J McKernan; Sean M Grimmond
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

6.  Rational inferences about departures from Hardy-Weinberg equilibrium.

Authors:  Jacqueline K Wittke-Thompson; Anna Pluzhnikov; Nancy J Cox
Journal:  Am J Hum Genet       Date:  2005-04-15       Impact factor: 11.025

7.  Data quality control in genetic case-control association studies.

Authors:  Carl A Anderson; Fredrik H Pettersson; Geraldine M Clarke; Lon R Cardon; Andrew P Morris; Krina T Zondervan
Journal:  Nat Protoc       Date:  2010-08-26       Impact factor: 13.491

Review 8.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

9.  Genome-wide association study for early-onset and morbid adult obesity identifies three new risk loci in European populations.

Authors:  David Meyre; Jérôme Delplanque; Jean-Claude Chèvre; Cécile Lecoeur; Stéphane Lobbens; Sophie Gallina; Emmanuelle Durand; Vincent Vatin; Franck Degraeve; Christine Proença; Stefan Gaget; Antje Körner; Peter Kovacs; Wieland Kiess; Jean Tichet; Michel Marre; Anna-Liisa Hartikainen; Fritz Horber; Natascha Potoczna; Serge Hercberg; Claire Levy-Marchal; François Pattou; Barbara Heude; Maithé Tauber; Mark I McCarthy; Alexandra I F Blakemore; Alexandre Montpetit; Constantin Polychronakos; Jacques Weill; Lachlan J M Coin; Julian Asher; Paul Elliott; Marjo-Riitta Järvelin; Sophie Visvikis-Siest; Beverley Balkau; Rob Sladek; David Balding; Andrew Walley; Christian Dina; Philippe Froguel
Journal:  Nat Genet       Date:  2009-01-18       Impact factor: 38.330

10.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

View more
  14 in total

Review 1.  Alternative Applications of Genotyping Array Data Using Multivariant Methods.

Authors:  David C Samuels; Jennifer E Below; Scott Ness; Hui Yu; Shuguang Leng; Yan Guo
Journal:  Trends Genet       Date:  2020-08-06       Impact factor: 11.639

2.  Establishing analytical validity of BeadChip array genotype data by comparison to whole-genome sequence and standard benchmark datasets.

Authors:  Praveen F Cherukuri; Melissa M Soe; David E Condon; Shubhi Bartaria; Kaitlynn Meis; Shaopeng Gu; Frederick G Frost; Lindsay M Fricke; Krzysztof P Lubieniecki; Joanna M Lubieniecka; Robert E Pyatt; Catherine Hajek; Cornelius F Boerkoel; Lynn Carmichael
Journal:  BMC Med Genomics       Date:  2022-03-14       Impact factor: 3.063

3.  Genetic Origins of the Two Canis lupus familiaris (Dog) Freight Dog Populations.

Authors:  Muhammad Basil Ali; Dayna L Dreger; Reuben M Buckley; Shahid Mansoor; Qaiser M Khan; Elaine A Ostrander
Journal:  J Hered       Date:  2022-05-16       Impact factor: 2.679

4.  Quality and concordance of genotyping array data of 12,064 samples from 5840 cancer patients.

Authors:  Mingsheng Guo; Wei Yue; David C Samuels; Hui Yu; Jing He; Ying-Yong Zhao; Yan Guo
Journal:  Genomics       Date:  2018-06-11       Impact factor: 5.736

5.  Overcoming polyploidy pitfalls: a user guide for effective SNP conversion into KASP markers in wheat.

Authors:  M Makhoul; C Rambla; K P Voss-Fels; L T Hickey; R J Snowdon; C Obermeier
Journal:  Theor Appl Genet       Date:  2020-06-04       Impact factor: 5.699

6.  Calculating Polygenic Risk Scores (PRS) in UK Biobank: A Practical Guide for Epidemiologists.

Authors:  Jennifer A Collister; Xiaonan Liu; Lei Clifton
Journal:  Front Genet       Date:  2022-02-18       Impact factor: 4.599

7.  GTQC: Automated Genotyping Array Quality Control and Report.

Authors:  Shilin Zhao; Limin Jiang; Hui Yu; Yan Guo
Journal:  J Genomics       Date:  2022-02-14

8.  Evaluating human autosomal loci for sexually antagonistic viability selection in two large biobanks.

Authors:  Katja R Kasimatis; Abin Abraham; Peter L Ralph; Andrew D Kern; John A Capra; Patrick C Phillips
Journal:  Genetics       Date:  2021-03-03       Impact factor: 4.562

9.  Imputation and Reanalysis of ExomeChip Data Identifies Novel, Conditional and Joint Genetic Effects on Parkinson's Disease Risk.

Authors:  Linduni M Rodrigo; Dale R Nyholt
Journal:  Genes (Basel)       Date:  2021-05-04       Impact factor: 4.096

Review 10.  Ten Years of EWAS.

Authors:  Siyu Wei; Junxian Tao; Jing Xu; Xingyu Chen; Zhaoyang Wang; Nan Zhang; Lijiao Zuo; Zhe Jia; Haiyan Chen; Hongmei Sun; Yubo Yan; Mingming Zhang; Hongchao Lv; Fanwu Kong; Lian Duan; Ye Ma; Mingzhi Liao; Liangde Xu; Rennan Feng; Guiyou Liu; The Ewas Project; Yongshuai Jiang
Journal:  Adv Sci (Weinh)       Date:  2021-08-11       Impact factor: 16.806

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.