Literature DB >> 22155947

M(3): an improved SNP calling algorithm for Illumina BeadArray data.

Gengxin Li1, Joel Gelernter, Henry R Kranzler, Hongyu Zhao.   

Abstract

SUMMARY: Genotype calling from high-throughput platforms such as Illumina and Affymetrix is a critical step in data processing, so that accurate information on genetic variants can be obtained for phenotype-genotype association studies. A number of algorithms have been developed to infer genotypes from data generated through the Illumina BeadStation platform, including GenCall, GenoSNP, Illuminus and CRLMM. Most of these algorithms are built on population-based statistical models to genotype every SNP in turn, such as GenCall with the GenTrain clustering algorithm, and require a large reference population to perform well. These approaches may not work well for rare variants where only a small proportion of the individuals carry the variant. A fundamentally different approach, implemented in GenoSNP, adopts a single nucleotide polymorphism (SNP)-based model to infer genotypes of all the SNPs in one individual, making it an appealing alternative to call rare variants. However, compared to the population-based strategies, more SNPs in GenoSNP may fail the Hardy-Weinberg Equilibrium test. To take advantage of both strategies, we propose a two-stage SNP calling procedure, named the modified mixture model (M(3)), to improve call accuracy for both common and rare variants. The effectiveness of our approach is demonstrated through applications to genotype calling on a set of HapMap samples used for quality control purpose in a large case-control study of cocaine dependence. The increase in power with M(3) is greater for rare variants than for common variants depending on the model. AVAILABILITY: M(3) algorithm: http://bioinformatics.med.yale.edu/group. CONTACT: name@bio.com; hongyu.zhao@yale.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2011        PMID: 22155947      PMCID: PMC3268244          DOI: 10.1093/bioinformatics/btr673

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

1.  Quality and completeness of SNP databases.

Authors:  David E Reich; Stacey B Gabriel; David Altshuler
Journal:  Nat Genet       Date:  2003-03-24       Impact factor: 38.330

2.  A genotype calling algorithm for affymetrix SNP arrays.

Authors:  Nusrat Rabbee; Terence P Speed
Journal:  Bioinformatics       Date:  2005-11-02       Impact factor: 6.937

3.  Whole-genome genotyping with the single-base extension assay.

Authors:  Frank J Steemers; Weihua Chang; Grace Lee; David L Barker; Richard Shen; Kevin L Gunderson
Journal:  Nat Methods       Date:  2006-01       Impact factor: 28.547

4.  Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data.

Authors:  Benilton Carvalho; Henrik Bengtsson; Terence P Speed; Rafael A Irizarry
Journal:  Biostatistics       Date:  2006-12-22       Impact factor: 5.899

5.  A genome-wide association study identifies novel risk loci for type 2 diabetes.

Authors:  Robert Sladek; Ghislain Rocheleau; Johan Rung; Christian Dina; Lishuang Shen; David Serre; Philippe Boutin; Daniel Vincent; Alexandre Belisle; Samy Hadjadj; Beverley Balkau; Barbara Heude; Guillaume Charpentier; Thomas J Hudson; Alexandre Montpetit; Alexey V Pshezhetsky; Marc Prentki; Barry I Posner; David J Balding; David Meyre; Constantin Polychronakos; Philippe Froguel
Journal:  Nature       Date:  2007-02-11       Impact factor: 49.962

6.  A new multipoint method for genome-wide association studies by imputation of genotypes.

Authors:  Jonathan Marchini; Bryan Howie; Simon Myers; Gil McVean; Peter Donnelly
Journal:  Nat Genet       Date:  2007-06-17       Impact factor: 38.330

7.  A second generation human haplotype map of over 3.1 million SNPs.

Authors:  Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

8.  Comparing genotyping algorithms for Illumina's Infinium whole-genome SNP BeadChips.

Authors:  Matthew E Ritchie; Ruijie Liu; Benilton S Carvalho; Rafael A Irizarry
Journal:  BMC Bioinformatics       Date:  2011-03-08       Impact factor: 3.169

9.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

Authors: 
Journal:  Nature       Date:  2007-06-07       Impact factor: 49.962

10.  A genotype calling algorithm for the Illumina BeadArray platform.

Authors:  Yik Y Teo; Michael Inouye; Kerrin S Small; Rhian Gwilliam; Panagiotis Deloukas; Dominic P Kwiatkowski; Taane G Clark
Journal:  Bioinformatics       Date:  2007-09-10       Impact factor: 6.937

View more
  13 in total

1.  AspWood: High-Spatial-Resolution Transcriptome Profiles Reveal Uncharacterized Modularity of Wood Formation in Populus tremula.

Authors:  David Sundell; Nathaniel R Street; Manoj Kumar; Ewa J Mellerowicz; Melis Kucukoglu; Christoffer Johnsson; Vikash Kumar; Chanaka Mannapperuma; Nicolas Delhomme; Ove Nilsson; Hannele Tuominen; Edouard Pesquet; Urs Fischer; Totte Niittylä; Björn Sundberg; Torgeir R Hvidsten
Journal:  Plant Cell       Date:  2017-06-27       Impact factor: 11.277

Review 2.  Benefits and limitations of genome-wide association studies.

Authors:  Vivian Tam; Nikunj Patel; Michelle Turcotte; Yohan Bossé; Guillaume Paré; David Meyre
Journal:  Nat Rev Genet       Date:  2019-08       Impact factor: 53.242

3.  optiCall: a robust genotype-calling algorithm for rare, low-frequency and common variants.

Authors:  T S Shah; J Z Liu; J A B Floyd; J A Morris; N Wirth; J C Barrett; C A Anderson
Journal:  Bioinformatics       Date:  2012-04-12       Impact factor: 6.937

4.  Efficiency and power as a function of sequence coverage, SNP array density, and imputation.

Authors:  Jason Flannick; Joshua M Korn; Pierre Fontanillas; George B Grant; Eric Banks; Mark A Depristo; David Altshuler
Journal:  PLoS Comput Biol       Date:  2012-07-12       Impact factor: 4.475

5.  Comparison of eleven methods for genomic DNA extraction suitable for large-scale whole-genome genotyping and long-term DNA banking using blood samples.

Authors:  Androniki Psifidi; Chrysostomos I Dovas; Georgios Bramis; Thomai Lazou; Claire L Russel; Georgios Arsenos; Georgios Banos
Journal:  PLoS One       Date:  2015-01-30       Impact factor: 3.240

6.  GStream: improving SNP and CNV coverage on genome-wide association studies.

Authors:  Arnald Alonso; Sara Marsal; Raül Tortosa; Oriol Canela-Xandri; Antonio Julià
Journal:  PLoS One       Date:  2013-07-03       Impact factor: 3.240

7.  Comparison of genotype clustering tools with rare variants.

Authors:  Louis-Philippe Lemieux Perreault; Marc-André Legault; Amina Barhdadi; Sylvie Provost; Valérie Normand; Jean-Claude Tardif; Marie-Pierre Dubé
Journal:  BMC Bioinformatics       Date:  2014-02-21       Impact factor: 3.169

8.  BCRgt: a Bayesian cluster regression-based genotyping algorithm for the samples with copy number alterations.

Authors:  Shengping Yang; Xiangqin Cui; Zhide Fang
Journal:  BMC Bioinformatics       Date:  2014-03-15       Impact factor: 3.169

9.  KRLMM: an adaptive genotype calling method for common and low frequency variants.

Authors:  Ruijie Liu; Zhiyin Dai; Meredith Yeager; Rafael A Irizarry; Matthew E Ritchie
Journal:  BMC Bioinformatics       Date:  2014-05-23       Impact factor: 3.169

10.  A new model calling procedure for Illumina BeadArray data.

Authors:  Gengxin Li
Journal:  BMC Genet       Date:  2016-06-24       Impact factor: 2.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.