Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Quantifying uncertainty in genotype calls.

Literature DB >> 19906825

Quantifying uncertainty in genotype calls.

Benilton S Carvalho¹, Thomas A Louis, Rafael A Irizarry.

Abstract

MOTIVATION: Genome-wide association studies (GWAS) are used to discover genes underlying complex, heritable disorders for which less powerful study designs have failed in the past. The number of GWAS has skyrocketed recently with findings reported in top journals and the mainstream media. Microarrays are the genotype calling technology of choice in GWAS as they permit exploration of more than a million single nucleotide polymorphisms (SNPs) simultaneously. The starting point for the statistical analyses used by GWAS to determine association between loci and disease is making genotype calls (AA, AB or BB). However, the raw data, microarray probe intensities, are heavily processed before arriving at these calls. Various sophisticated statistical procedures have been proposed for transforming raw data into genotype calls. We find that variability in microarray output quality across different SNPs, different arrays and different sample batches have substantial influence on the accuracy of genotype calls made by existing algorithms. Failure to account for these sources of variability can adversely affect the quality of findings reported by the GWAS.
RESULTS: We developed a method based on an enhanced version of the multi-level model used by CRLMM version 1. Two key differences are that we now account for variability across batches and improve the call-specific assessment of each call. The new model permits the development of quality metrics for SNPs, samples and batches of samples. Using three independent datasets, we demonstrate that the CRLMM version 2 outperforms CRLMM version 1 and the algorithm provided by Affymetrix, Birdseed. The main advantage of the new approach is that it enables the identification of low-quality SNPs, samples and batches. AVAILABILITY: Software implementing of the method described in this article is available as free and open source code in the crlmm R/BioConductor package. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities: Disease

Mesh：

Year: 2009 PMID： 19906825 PMCID： PMC2804295 DOI： 10.1093/bioinformatics/btp624

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

16 in total

1. Dynamic model based algorithms for screening and genotyping over 100 K SNPs on oligonucleotide microarrays.

Authors: Xiaojun Di; Hajime Matsuzaki; Teresa A Webster; Earl Hubbell; Guoying Liu; Shoulian Dong; Dan Bartell; Jing Huang; Richard Chiles; Geoffrey Yang; Mei-mei Shen; David Kulp; Giulia C Kennedy; Rui Mei; Keith W Jones; Simon Cawley
Journal: Bioinformatics Date: 2005-01-18 Impact factor: 6.937

2. Linear models and empirical bayes methods for assessing differential expression in microarray experiments.

Authors: Gordon K Smyth
Journal: Stat Appl Genet Mol Biol Date: 2004-02-12

3. A new multipoint method for genome-wide association studies by imputation of genotypes.

Authors: Jonathan Marchini; Bryan Howie; Simon Myers; Gil McVean; Peter Donnelly
Journal: Nat Genet Date: 2007-06-17 Impact factor: 38.330

4. Solving the riddle of the bright mismatches: labeling and effective binding in oligonucleotide arrays.

Authors: Felix Naef; Marcelo O Magnasco
Journal: Phys Rev E Stat Nonlin Soft Matter Phys Date: 2003-07-16

5. R/Bioconductor software for Illumina's Infinium whole-genome genotyping BeadChips.

Authors: Matthew E Ritchie; Benilton S Carvalho; Kurt N Hetrick; Simon Tavaré; Rafael A Irizarry
Journal: Bioinformatics Date: 2009-08-06 Impact factor: 6.937

6. Inflammation, hemostasis, and the risk of kidney function decline in the Atherosclerosis Risk in Communities (ARIC) Study.

Authors: Lori D Bash; Thomas P Erlinger; Josef Coresh; Jane Marsh-Manzi; Aaron R Folsom; Brad C Astor
Journal: Am J Kidney Dis Date: 2008-12-24 Impact factor: 8.860

7. Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection.

Authors: C Li; W H Wong
Journal: Proc Natl Acad Sci U S A Date: 2001-01-02 Impact factor: 11.205

8. Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application.

Authors: C Li; W Hung Wong
Journal: Genome Biol Date: 2001-08-03 Impact factor: 13.583

9. Validation and extension of an empirical Bayes method for SNP calling on Affymetrix microarrays.

Authors: Shin Lin; Benilton Carvalho; David J Cutler; Dan E Arking; Aravinda Chakravarti; Rafael A Irizarry
Journal: Genome Biol Date: 2008-04-03 Impact factor: 13.583

10. A method to address differential bias in genotyping in large-scale association studies.

Authors: Vincent Plagnol; Jason D Cooper; John A Todd; David G Clayton
Journal: PLoS Genet Date: 2007-04-05 Impact factor: 5.917

27 in total

1. Family-based association tests using genotype data with uncertainty.

Authors: Zhaoxia Yu
Journal: Biostatistics Date: 2011-12-08 Impact factor: 5.899

2. A framework for oligonucleotide microarray preprocessing.

Authors: Benilton S Carvalho; Rafael A Irizarry
Journal: Bioinformatics Date: 2010-08-05 Impact factor: 6.937

3. A multilevel model to address batch effects in copy number estimation using SNP arrays.

Authors: Robert B Scharpf; Ingo Ruczinski; Benilton Carvalho; Betty Doan; Aravinda Chakravarti; Rafael A Irizarry
Journal: Biostatistics Date: 2010-07-12 Impact factor: 5.899

4. A new statistic for identifying batch effects in high-throughput genomic data that uses guided principal component analysis.

Authors: Sarah E Reese; Kellie J Archer; Terry M Therneau; Elizabeth J Atkinson; Celine M Vachon; Mariza de Andrade; Jean-Pierre A Kocher; Jeanette E Eckel-Passow
Journal: Bioinformatics Date: 2013-08-19 Impact factor: 6.937

5. Genome-wide association study in East Asians identifies two novel breast cancer susceptibility loci.

Authors: Mi-Ryung Han; Jirong Long; Ji-Yeob Choi; Siew-Kee Low; Sun-Seog Kweon; Ying Zheng; Qiuyin Cai; Jiajun Shi; Xingyi Guo; Keitaro Matsuo; Motoki Iwasaki; Chen-Yang Shen; Mi Kyung Kim; Wanqing Wen; Bingshan Li; Atsushi Takahashi; Min-Ho Shin; Yong-Bing Xiang; Hidemi Ito; Yoshio Kasuga; Dong-Young Noh; Koichi Matsuda; Min Ho Park; Yu-Tang Gao; Hiroji Iwata; Shoichiro Tsugane; Sue K Park; Michiaki Kubo; Xiao-Ou Shu; Daehee Kang; Wei Zheng
Journal: Hum Mol Genet Date: 2016-06-27 Impact factor: 6.150

6. Genome-epigenome interactions associated with Myalgic Encephalomyelitis/Chronic Fatigue Syndrome.

Authors: Santiago Herrera; Wilfred C de Vega; David Ashbrook; Suzanne D Vernon; Patrick O McGowan
Journal: Epigenetics Date: 2018-12-05 Impact factor: 4.528

7. Genetic markers of comorbid depression and alcoholism in women.

Authors: Daniela O Procopio; Laura M Saba; Henriette Walter; Otto Lesch; Katrin Skala; Golda Schlaff; Lauren Vanderlinden; Peter Clapp; Paula L Hoffman; Boris Tabakoff
Journal: Alcohol Clin Exp Res Date: 2012-12-27 Impact factor: 3.455

8. SomaticSniper: identification of somatic point mutations in whole genome sequencing data.

Authors: David E Larson; Christopher C Harris; Ken Chen; Daniel C Koboldt; Travis E Abbott; David J Dooling; Timothy J Ley; Elaine R Mardis; Richard K Wilson; Li Ding
Journal: Bioinformatics Date: 2011-12-06 Impact factor: 6.937

9. A Comprehensive cis-eQTL Analysis Revealed Target Genes in Breast Cancer Susceptibility Loci Identified in Genome-wide Association Studies.

Authors: Xingyi Guo; Weiqiang Lin; Jiandong Bao; Qiuyin Cai; Xiao Pan; Mengqiu Bai; Yuan Yuan; Jiajun Shi; Yaqiong Sun; Mi-Ryung Han; Jing Wang; Qi Liu; Wanqing Wen; Bingshan Li; Jirong Long; Jianghua Chen; Wei Zheng
Journal: Am J Hum Genet Date: 2018-05-03 Impact factor: 11.025

10. TumorBoost: normalization of allele-specific tumor copy numbers from a single pair of tumor-normal genotyping microarrays.

Authors: Henrik Bengtsson; Pierre Neuvial; Terence P Speed
Journal: BMC Bioinformatics Date: 2010-05-12 Impact factor: 3.169