Literature DB >> 22260749

Discovery of novel variants in genotyping arrays improves genotype retention and reduces ascertainment bias.

John P Didion1, Hyuna Yang, Keith Sheppard, Chen-Ping Fu, Leonard McMillan, Fernando Pardo-Manuel de Villena, Gary A Churchill.   

Abstract

BACKGROUND: High-density genotyping arrays that measure hybridization of genomic DNA fragments to allele-specific oligonucleotide probes are widely used to genotype single nucleotide polymorphisms (SNPs) in genetic studies, including human genome-wide association studies. Hybridization intensities are converted to genotype calls by clustering algorithms that assign each sample to a genotype class at each SNP. Data for SNP probes that do not conform to the expected pattern of clustering are often discarded, contributing to ascertainment bias and resulting in lost information - as much as 50% in a recent genome-wide association study in dogs.
RESULTS: We identified atypical patterns of hybridization intensities that were highly reproducible and demonstrated that these patterns represent genetic variants that were not accounted for in the design of the array platform. We characterized variable intensity oligonucleotide (VINO) probes that display such patterns and are found in all hybridization-based genotyping platforms, including those developed for human, dog, cattle, and mouse. When recognized and properly interpreted, VINOs recovered a substantial fraction of discarded probes and counteracted SNP ascertainment bias. We developed software (MouseDivGeno) that identifies VINOs and improves the accuracy of genotype calling. MouseDivGeno produced highly concordant genotype calls when compared with other methods but it uniquely identified more than 786000 VINOs in 351 mouse samples. We used whole-genome sequence from 14 mouse strains to confirm the presence of novel variants explaining 28000 VINOs in those strains. We also identified VINOs in human HapMap 3 samples, many of which were specific to an African population. Incorporating VINOs in phylogenetic analyses substantially improved the accuracy of a Mus species tree and local haplotype assignment in laboratory mouse strains.
CONCLUSION: The problems of ascertainment bias and missing information due to genotyping errors are widely recognized as limiting factors in genetic studies. We have conducted the first formal analysis of the effect of novel variants on genotyping arrays, and we have shown that these variants account for a large portion of miscalled and uncalled genotypes. Genetic studies will benefit from substantial improvements in the accuracy of their results by incorporating VINOs in their analyses.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22260749      PMCID: PMC3305361          DOI: 10.1186/1471-2164-13-34

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


  27 in total

1.  Estimation and assessment of raw copy numbers at the single locus level.

Authors:  H Bengtsson; R Irizarry; B Carvalho; T P Speed
Journal:  Bioinformatics       Date:  2008-01-19       Impact factor: 6.937

2.  A new standard genetic map for the laboratory mouse.

Authors:  Allison Cox; Cheryl L Ackert-Bicknell; Beth L Dumont; Yueming Ding; Jordana Tzenova Bell; Gudrun A Brockmann; Jon E Wergedal; Carol Bult; Beverly Paigen; Jonathan Flint; Shirng-Wern Tsaih; Gary A Churchill; Karl W Broman
Journal:  Genetics       Date:  2009-06-17       Impact factor: 4.562

3.  Microindel detection in short-read sequence data.

Authors:  Peter Krawitz; Christian Rödelsperger; Marten Jäger; Luke Jostins; Sebastian Bauer; Peter N Robinson
Journal:  Bioinformatics       Date:  2010-02-09       Impact factor: 6.937

4.  Integrating common and rare genetic variation in diverse human populations.

Authors:  David M Altshuler; Richard A Gibbs; Leena Peltonen; David M Altshuler; Richard A Gibbs; Leena Peltonen; Emmanouil Dermitzakis; Stephen F Schaffner; Fuli Yu; Leena Peltonen; Emmanouil Dermitzakis; Penelope E Bonnen; David M Altshuler; Richard A Gibbs; Paul I W de Bakker; Panos Deloukas; Stacey B Gabriel; Rhian Gwilliam; Sarah Hunt; Michael Inouye; Xiaoming Jia; Aarno Palotie; Melissa Parkin; Pamela Whittaker; Fuli Yu; Kyle Chang; Alicia Hawes; Lora R Lewis; Yanru Ren; David Wheeler; Richard A Gibbs; Donna Marie Muzny; Chris Barnes; Katayoon Darvishi; Matthew Hurles; Joshua M Korn; Kati Kristiansson; Charles Lee; Steven A McCarrol; James Nemesh; Emmanouil Dermitzakis; Alon Keinan; Stephen B Montgomery; Samuela Pollack; Alkes L Price; Nicole Soranzo; Penelope E Bonnen; Richard A Gibbs; Claudia Gonzaga-Jauregui; Alon Keinan; Alkes L Price; Fuli Yu; Verneri Anttila; Wendy Brodeur; Mark J Daly; Stephen Leslie; Gil McVean; Loukas Moutsianas; Huy Nguyen; Stephen F Schaffner; Qingrun Zhang; Mohammed J R Ghori; Ralph McGinnis; William McLaren; Samuela Pollack; Alkes L Price; Stephen F Schaffner; Fumihiko Takeuchi; Sharon R Grossman; Ilya Shlyakhter; Elizabeth B Hostetter; Pardis C Sabeti; Clement A Adebamowo; Morris W Foster; Deborah R Gordon; Julio Licinio; Maria Cristina Manca; Patricia A Marshall; Ichiro Matsuda; Duncan Ngare; Vivian Ota Wang; Deepa Reddy; Charles N Rotimi; Charmaine D Royal; Richard R Sharp; Changqing Zeng; Lisa D Brooks; Jean E McEwen
Journal:  Nature       Date:  2010-09-02       Impact factor: 49.962

5.  A map of human genome variation from population-scale sequencing.

Authors:  Gonçalo R Abecasis; David Altshuler; Adam Auton; Lisa D Brooks; Richard M Durbin; Richard A Gibbs; Matt E Hurles; Gil A McVean
Journal:  Nature       Date:  2010-10-28       Impact factor: 49.962

6.  A customized and versatile high-density genotyping array for the mouse.

Authors:  Hyuna Yang; Yueming Ding; Lucie N Hutchins; Jin Szatkiewicz; Timothy A Bell; Beverly J Paigen; Joel H Graber; Fernando Pardo-Manuel de Villena; Gary A Churchill
Journal:  Nat Methods       Date:  2009-08-09       Impact factor: 28.547

7.  A simple genetic architecture underlies morphological variation in dogs.

Authors:  Adam R Boyko; Pascale Quignon; Lin Li; Jeffrey J Schoenebeck; Jeremiah D Degenhardt; Kirk E Lohmueller; Keyan Zhao; Abra Brisbin; Heidi G Parker; Bridgett M vonHoldt; Michele Cargill; Adam Auton; Andy Reynolds; Abdel G Elkahloun; Marta Castelhano; Dana S Mosher; Nathan B Sutter; Gary S Johnson; John Novembre; Melissa J Hubisz; Adam Siepel; Robert K Wayne; Carlos D Bustamante; Elaine A Ostrander
Journal:  PLoS Biol       Date:  2010-08-10       Impact factor: 8.029

8.  Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds.

Authors:  Richard A Gibbs; Jeremy F Taylor; Curtis P Van Tassell; William Barendse; Kellye A Eversole; Clare A Gill; Ronnie D Green; Debora L Hamernik; Steven M Kappes; Sigbjørn Lien; Lakshmi K Matukumalli; John C McEwan; Lynne V Nazareth; Robert D Schnabel; George M Weinstock; David A Wheeler; Paolo Ajmone-Marsan; Paul J Boettcher; Alexandre R Caetano; Jose Fernando Garcia; Olivier Hanotte; Paola Mariani; Loren C Skow; Tad S Sonstegard; John L Williams; Boubacar Diallo; Lemecha Hailemariam; Mario L Martinez; Chris A Morris; Luiz O C Silva; Richard J Spelman; Woudyalew Mulatu; Keyan Zhao; Colette A Abbey; Morris Agaba; Flábio R Araujo; Rowan J Bunch; James Burton; Chiara Gorni; Hanotte Olivier; Blair E Harrison; Bill Luff; Marco A Machado; Joel Mwakaya; Graham Plastow; Warren Sim; Timothy Smith; Merle B Thomas; Alessio Valentini; Paul Williams; James Womack; John A Woolliams; Yue Liu; Xiang Qin; Kim C Worley; Chuan Gao; Huaiyang Jiang; Stephen S Moore; Yanru Ren; Xing-Zhi Song; Carlos D Bustamante; Ryan D Hernandez; Donna M Muzny; Shobha Patil; Anthony San Lucas; Qing Fu; Matthew P Kent; Richard Vega; Aruna Matukumalli; Sean McWilliam; Gert Sclep; Katarzyna Bryc; Jungwoo Choi; Hong Gao; John J Grefenstette; Brenda Murdoch; Alessandra Stella; Rafael Villa-Angulo; Mark Wright; Jan Aerts; Oliver Jann; Riccardo Negrini; Mike E Goddard; Ben J Hayes; Daniel G Bradley; Marcos Barbosa da Silva; Lilian P L Lau; George E Liu; David J Lynn; Francesca Panzitta; Ken G Dodds
Journal:  Science       Date:  2009-04-24       Impact factor: 47.728

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

10.  Elusive copy number variation in the mouse genome.

Authors:  Avigail Agam; Binnaz Yalcin; Amarjit Bhomra; Matthew Cubin; Caleb Webber; Christopher Holmes; Jonathan Flint; Richard Mott
Journal:  PLoS One       Date:  2010-09-21       Impact factor: 3.240

View more
  44 in total

1.  Maximum likelihood inference of reticulate evolutionary histories.

Authors:  Yun Yu; Jianrong Dong; Kevin J Liu; Luay Nakhleh
Journal:  Proc Natl Acad Sci U S A       Date:  2014-11-03       Impact factor: 11.205

2.  The Mouse Universal Genotyping Array: From Substrains to Subspecies.

Authors:  Andrew P Morgan; Chen-Ping Fu; Chia-Yu Kao; Catherine E Welsh; John P Didion; Liran Yadgary; Leeanna Hyacinth; Martin T Ferris; Timothy A Bell; Darla R Miller; Paola Giusti-Rodriguez; Randal J Nonneman; Kevin D Cook; Jason K Whitmire; Lisa E Gralinski; Mark Keller; Alan D Attie; Gary A Churchill; Petko Petkov; Patrick F Sullivan; Jennifer R Brennan; Leonard McMillan; Fernando Pardo-Manuel de Villena
Journal:  G3 (Bethesda)       Date:  2015-12-18       Impact factor: 3.154

3.  A novel intronic single nucleotide polymorphism in the myosin heavy polypeptide 4 gene is responsible for the mini-muscle phenotype characterized by major reduction in hind-limb muscle mass in mice.

Authors:  Scott A Kelly; Timothy A Bell; Sara R Selitsky; Ryan J Buus; Kunjie Hua; George M Weinstock; Theodore Garland; Fernando Pardo-Manuel de Villena; Daniel Pomp
Journal:  Genetics       Date:  2013-09-20       Impact factor: 4.562

4.  Male Infertility Is Responsible for Nearly Half of the Extinction Observed in the Mouse Collaborative Cross.

Authors:  John R Shorter; Fanny Odet; David L Aylor; Wenqi Pan; Chia-Yu Kao; Chen-Ping Fu; Andrew P Morgan; Seth Greenstein; Timothy A Bell; Alicia M Stevans; Ryan W Feathers; Sunny Patel; Sarah E Cates; Ginger D Shaw; Darla R Miller; Elissa J Chesler; Leonard McMillian; Deborah A O'Brien; Fernando Pardo-Manuel de Villena
Journal:  Genetics       Date:  2017-06       Impact factor: 4.562

Review 5.  Informatics resources for the Collaborative Cross and related mouse populations.

Authors:  Andrew P Morgan; Catherine E Welsh
Journal:  Mamm Genome       Date:  2015-07-02       Impact factor: 2.957

6.  Adaptive evolution and effective population size in wild house mice.

Authors:  Megan Phifer-Rixey; François Bonhomme; Pierre Boursot; Gary A Churchill; Jaroslav Piálek; Priscilla K Tucker; Michael W Nachman
Journal:  Mol Biol Evol       Date:  2012-04-03       Impact factor: 16.240

Review 7.  Deconstructing Mus gemischus: advances in understanding ancestry, structure, and variation in the genome of the laboratory mouse.

Authors:  John P Didion; Fernando Pardo-Manuel de Villena
Journal:  Mamm Genome       Date:  2012-12-09       Impact factor: 2.957

8.  Development and evaluation of a transfusion medicine genome wide genotyping array.

Authors:  Yuelong Guo; Michael P Busch; Mark Seielstad; Stacy Endres-Dighe; Connie M Westhoff; Brendan Keating; Carolyn Hoppe; Aarash Bordbar; Brian Custer; Adam S Butterworth; Tamir Kanias; Alan E Mast; Steve Kleinman; Yontao Lu; Grier P Page
Journal:  Transfusion       Date:  2018-11-20       Impact factor: 3.157

9.  Striking Immune Phenotypes in Gene-Targeted Mice Are Driven by a Copy-Number Variant Originating from a Commercially Available C57BL/6 Strain.

Authors:  Vinay S Mahajan; Ezana Demissie; Hamid Mattoo; Vinay Viswanadham; Ajit Varki; Robert Morris; Shiv Pillai
Journal:  Cell Rep       Date:  2016-05-19       Impact factor: 9.423

10.  Genome-Wide Detection of Gene Coexpression Domains Showing Linkage to Regions Enriched with Polymorphic Retrotransposons in Recombinant Inbred Mouse Strains.

Authors:  Marie-Pier Scott-Boyer; Christian F Deschepper
Journal:  G3 (Bethesda)       Date:  2013-04-09       Impact factor: 3.154

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.