Literature DB >> 20625178

A multilevel model to address batch effects in copy number estimation using SNP arrays.

Robert B Scharpf1, Ingo Ruczinski, Benilton Carvalho, Betty Doan, Aravinda Chakravarti, Rafael A Irizarry.   

Abstract

Submicroscopic changes in chromosomal DNA copy number dosage are common and have been implicated in many heritable diseases and cancers. Recent high-throughput technologies have a resolution that permits the detection of segmental changes in DNA copy number that span thousands of base pairs in the genome. Genomewide association studies (GWAS) may simultaneously screen for copy number phenotype and single nucleotide polymorphism (SNP) phenotype associations as part of the analytic strategy. However, genomewide array analyses are particularly susceptible to batch effects as the logistics of preparing DNA and processing thousands of arrays often involves multiple laboratories and technicians, or changes over calendar time to the reagents and laboratory equipment. Failure to adjust for batch effects can lead to incorrect inference and requires inefficient post hoc quality control procedures to exclude regions that are associated with batch. Our work extends previous model-based approaches for copy number estimation by explicitly modeling batch and using shrinkage to improve locus-specific estimates of copy number uncertainty. Key features of this approach include the use of biallelic genotype calls from experimental data to estimate batch-specific and locus-specific parameters of background and signal without the requirement of training data. We illustrate these ideas using a study of bipolar disease and a study of chromosome 21 trisomy. The former has batch effects that dominate much of the observed variation in the quantile-normalized intensities, while the latter illustrates the robustness of our approach to a data set in which approximately 27% of the samples have altered copy number. Locus-specific estimates of copy number can be plotted on the copy number scale to investigate mosaicism and guide the choice of appropriate downstream approaches for smoothing the copy number as a function of physical position. The software is open source and implemented in the R package crlmm at Bioconductor (http:www.bioconductor.org).

Entities:  

Mesh:

Year:  2010        PMID: 20625178      PMCID: PMC3006124          DOI: 10.1093/biostatistics/kxq043

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  22 in total

1.  Determination of cancer risk associated with germ line BRCA1 missense variants by functional analysis.

Authors:  Marcelo A Carvalho; Sylvia M Marsillac; Rachel Karchin; Siranoush Manoukian; Scott Grist; Ramona F Swaby; Turan P Urmenyi; Edson Rondinelli; Rosane Silva; Luis Gayol; Lisa Baumbach; Rebecca Sutphen; Jennifer L Pickard-Brzosowicz; Katherine L Nathanson; Andrej Sali; David Goldgar; Fergus J Couch; Paolo Radice; Alvaro N A Monteiro
Journal:  Cancer Res       Date:  2007-02-15       Impact factor: 12.701

2.  Structural variation of chromosomes in autism spectrum disorder.

Authors:  Christian R Marshall; Abdul Noor; John B Vincent; Anath C Lionel; Lars Feuk; Jennifer Skaug; Mary Shago; Rainald Moessner; Dalila Pinto; Yan Ren; Bhooma Thiruvahindrapduram; Andreas Fiebig; Stefan Schreiber; Jan Friedman; Cees E J Ketelaars; Yvonne J Vos; Can Ficicioglu; Susan Kirkpatrick; Rob Nicolson; Leon Sloman; Anne Summers; Clare A Gibbons; Ahmad Teebi; David Chitayat; Rosanna Weksberg; Ann Thompson; Cathy Vardy; Vicki Crosbie; Sandra Luscombe; Rebecca Baatjes; Lonnie Zwaigenbaum; Wendy Roberts; Bridget Fernandez; Peter Szatmari; Stephen W Scherer
Journal:  Am J Hum Genet       Date:  2008-01-17       Impact factor: 11.025

3.  Integrated detection and population-genetic analysis of SNPs and copy number variation.

Authors:  Steven A McCarroll; Finny G Kuruvilla; Joshua M Korn; Simon Cawley; James Nemesh; Alec Wysoker; Michael H Shapero; Paul I W de Bakker; Julian B Maller; Andrew Kirby; Amanda L Elliott; Melissa Parkin; Earl Hubbell; Teresa Webster; Rui Mei; James Veitch; Patrick J Collins; Robert Handsaker; Steve Lincoln; Marcia Nizzari; John Blume; Keith W Jones; Rich Rava; Mark J Daly; Stacey B Gabriel; David Altshuler
Journal:  Nat Genet       Date:  2008-09-07       Impact factor: 38.330

4.  Quantifying uncertainty in genotype calls.

Authors:  Benilton S Carvalho; Thomas A Louis; Rafael A Irizarry
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

5.  Estimating genome-wide copy number using allele-specific mixture models.

Authors:  Wenyi Wang; Benilton Carvalho; Nathaniel D Miller; Jonathan Pevsner; Aravinda Chakravarti; Rafael A Irizarry
Journal:  J Comput Biol       Date:  2008-09       Impact factor: 1.479

6.  Mapping autism risk loci using genetic linkage and chromosomal rearrangements.

Authors:  Peter Szatmari; Andrew D Paterson; Lonnie Zwaigenbaum; Wendy Roberts; Jessica Brian; Xiao-Qing Liu; John B Vincent; Jennifer L Skaug; Ann P Thompson; Lili Senman; Lars Feuk; Cheng Qian; Susan E Bryson; Marshall B Jones; Christian R Marshall; Stephen W Scherer; Veronica J Vieland; Christopher Bartlett; La Vonne Mangin; Rhinda Goedken; Alberto Segre; Margaret A Pericak-Vance; Michael L Cuccaro; John R Gilbert; Harry H Wright; Ruth K Abramson; Catalina Betancur; Thomas Bourgeron; Christopher Gillberg; Marion Leboyer; Joseph D Buxbaum; Kenneth L Davis; Eric Hollander; Jeremy M Silverman; Joachim Hallmayer; Linda Lotspeich; James S Sutcliffe; Jonathan L Haines; Susan E Folstein; Joseph Piven; Thomas H Wassink; Val Sheffield; Daniel H Geschwind; Maja Bucan; W Ted Brown; Rita M Cantor; John N Constantino; T Conrad Gilliam; Martha Herbert; Clara Lajonchere; David H Ledbetter; Christa Lese-Martin; Janet Miller; Stan Nelson; Carol A Samango-Sprouse; Sarah Spence; Matthew State; Rudolph E Tanzi; Hilary Coon; Geraldine Dawson; Bernie Devlin; Annette Estes; Pamela Flodman; Lambertus Klei; William M McMahon; Nancy Minshew; Jeff Munson; Elena Korvatska; Patricia M Rodier; Gerard D Schellenberg; Moyra Smith; M Anne Spence; Chris Stodgell; Ping Guo Tepper; Ellen M Wijsman; Chang-En Yu; Bernadette Rogé; Carine Mantoulan; Kerstin Wittemeyer; Annemarie Poustka; Bärbel Felder; Sabine M Klauck; Claudia Schuster; Fritz Poustka; Sven Bölte; Sabine Feineis-Matthews; Evelyn Herbrecht; Gabi Schmötzer; John Tsiantis; Katerina Papanikolaou; Elena Maestrini; Elena Bacchelli; Francesca Blasi; Simona Carone; Claudio Toma; Herman Van Engeland; Maretha de Jonge; Chantal Kemner; Frederieke Koop; Frederike Koop; Marjolein Langemeijer; Marjolijn Langemeijer; Channa Hijmans; Channa Hijimans; Wouter G Staal; Gillian Baird; Patrick F Bolton; Michael L Rutter; Emma Weisblatt; Jonathan Green; Catherine Aldred; Julie-Anne Wilkinson; Andrew Pickles; Ann Le Couteur; Tom Berney; Helen McConachie; Anthony J Bailey; Kostas Francis; Gemma Honeyman; Aislinn Hutchinson; Jeremy R Parr; Simon Wallace; Anthony P Monaco; Gabrielle Barnby; Kazuhiro Kobayashi; Janine A Lamb; Ines Sousa; Nuala Sykes; Edwin H Cook; Stephen J Guter; Bennett L Leventhal; Jeff Salt; Catherine Lord; Christina Corsello; Vanessa Hus; Daniel E Weeks; Fred Volkmar; Maïté Tauber; Eric Fombonne; Andy Shih; Kacie J Meyer
Journal:  Nat Genet       Date:  2007-02-18       Impact factor: 38.330

7.  Gene copy number variation in schizophrenia.

Authors:  Smitha R Sutrala; Dirk Goossens; Nigel M Williams; Lien Heyrman; Rolf Adolfsson; Nadine Norton; Paul R Buckland; Jurgen Del-Favero
Journal:  Schizophr Res       Date:  2007-09-07       Impact factor: 4.939

8.  A genome-wide investigation of SNPs and CNVs in schizophrenia.

Authors:  Anna C Need; Dongliang Ge; Michael E Weale; Jessica Maia; Sheng Feng; Erin L Heinzen; Kevin V Shianna; Woohyun Yoon; Dalia Kasperaviciūte; Massimo Gennarelli; Warren J Strittmatter; Cristian Bonvicini; Giuseppe Rossi; Karu Jayathilake; Philip A Cola; Joseph P McEvoy; Richard S E Keefe; Elizabeth M C Fisher; Pamela L St Jean; Ina Giegling; Annette M Hartmann; Hans-Jürgen Möller; Andreas Ruppert; Gillian Fraser; Caroline Crombie; Lefkos T Middleton; David St Clair; Allen D Roses; Pierandrea Muglia; Clyde Francks; Dan Rujescu; Herbert Y Meltzer; David B Goldstein
Journal:  PLoS Genet       Date:  2009-02-06       Impact factor: 5.917

9.  Mapping and sequencing of structural variation from eight human genomes.

Authors:  Jeffrey M Kidd; Gregory M Cooper; William F Donahue; Hillary S Hayden; Nick Sampas; Tina Graves; Nancy Hansen; Brian Teague; Can Alkan; Francesca Antonacci; Eric Haugen; Troy Zerr; N Alice Yamada; Peter Tsang; Tera L Newman; Eray Tüzün; Ze Cheng; Heather M Ebling; Nadeem Tusneem; Robert David; Will Gillett; Karen A Phelps; Molly Weaver; David Saranga; Adrianne Brand; Wei Tao; Erik Gustafson; Kevin McKernan; Lin Chen; Maika Malig; Joshua D Smith; Joshua M Korn; Steven A McCarroll; David A Altshuler; Daniel A Peiffer; Michael Dorschner; John Stamatoyannopoulos; David Schwartz; Deborah A Nickerson; James C Mullikin; Richard K Wilson; Laurakay Bruhn; Maynard V Olson; Rajinder Kaul; Douglas R Smith; Evan E Eichler
Journal:  Nature       Date:  2008-05-01       Impact factor: 49.962

10.  Mechanisms for human genomic rearrangements.

Authors:  Wenli Gu; Feng Zhang; James R Lupski
Journal:  Pathogenetics       Date:  2008-11-03
View more
  27 in total

1.  CalMaTe: a method and software to improve allele-specific copy number of SNP arrays for downstream segmentation.

Authors:  Maria Ortiz-Estevez; Ander Aramburu; Henrik Bengtsson; Pierre Neuvial; Angel Rubio
Journal:  Bioinformatics       Date:  2012-05-09       Impact factor: 6.937

Review 2.  Current analysis platforms and methods for detecting copy number variation.

Authors:  Wenli Li; Michael Olivier
Journal:  Physiol Genomics       Date:  2012-11-06       Impact factor: 3.107

3.  Inheritance model introduces differential bias in CNV calls between parents and offspring.

Authors:  Sulgi Kim; Steven P Millard; Chang-En Yu; Lesley Leong; Allen Radant; Dorcas Dobie; Debby W Tsuang; Ellen M Wijsman
Journal:  Genet Epidemiol       Date:  2012-05-24       Impact factor: 2.135

Review 4.  Tackling the widespread and critical impact of batch effects in high-throughput data.

Authors:  Jeffrey T Leek; Robert B Scharpf; Héctor Corrada Bravo; David Simcha; Benjamin Langmead; W Evan Johnson; Donald Geman; Keith Baggerly; Rafael A Irizarry
Journal:  Nat Rev Genet       Date:  2010-09-14       Impact factor: 53.242

5.  Sample processing obscures cancer-specific alterations in leukemic transcriptomes.

Authors:  Heidi Dvinge; Rhonda E Ries; Janine O Ilagan; Derek L Stirewalt; Soheil Meshinchi; Robert K Bradley
Journal:  Proc Natl Acad Sci U S A       Date:  2014-11-10       Impact factor: 11.205

6.  Using the R Package crlmm for Genotyping and Copy Number Estimation.

Authors:  Robert B Scharpf; Rafael A Irizarry; Matthew E Ritchie; Benilton Carvalho; Ingo Ruczinski
Journal:  J Stat Softw       Date:  2011-05-01       Impact factor: 6.440

7.  Comparative genomic and genetic analysis of glioblastoma-derived brain tumor-initiating cells and their parent tumors.

Authors:  Brad Davis; Yaoqing Shen; Candice C Poon; H Artee Luchman; Owen D Stechishin; Carly S Pontifex; Wei Wu; John J Kelly; Michael D Blough
Journal:  Neuro Oncol       Date:  2015-08-05       Impact factor: 12.300

8.  A model of binding on DNA microarrays: understanding the combined effect of probe synthesis failure, cross-hybridization, DNA fragmentation and other experimental details of affymetrix arrays.

Authors:  Yasminka A Jakubek; David J Cutler
Journal:  BMC Genomics       Date:  2012-12-27       Impact factor: 3.969

9.  Getting DNA copy numbers without control samples.

Authors:  Maria Ortiz-Estevez; Ander Aramburu; Angel Rubio
Journal:  Algorithms Mol Biol       Date:  2012-08-16       Impact factor: 1.405

10.  Improving breast cancer survival analysis through competition-based multidimensional modeling.

Authors:  Erhan Bilal; Janusz Dutkowski; Justin Guinney; In Sock Jang; Benjamin A Logsdon; Gaurav Pandey; Benjamin A Sauerwine; Yishai Shimoni; Hans Kristian Moen Vollan; Brigham H Mecham; Oscar M Rueda; Jorg Tost; Christina Curtis; Mariano J Alvarez; Vessela N Kristensen; Samuel Aparicio; Anne-Lise Børresen-Dale; Carlos Caldas; Andrea Califano; Stephen H Friend; Trey Ideker; Eric E Schadt; Gustavo A Stolovitzky; Adam A Margolin
Journal:  PLoS Comput Biol       Date:  2013-05-09       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.