Literature DB >> 27153703

Phasing for medical sequencing using rare variants and large haplotype reference panels.

Kevin Sharp1, Warren Kretzschmar2, Olivier Delaneau3, Jonathan Marchini4.   

Abstract

MOTIVATION: There is growing recognition that estimating haplotypes from high coverage sequencing of single samples in clinical settings is an important problem. At the same time very large datasets consisting of tens and hundreds of thousands of high-coverage sequenced samples will soon be available. We describe a method that takes advantage of these huge human genetic variation resources and rare variant sharing patterns to estimate haplotypes on single sequenced samples. Sharing rare variants between two individuals is more likely to arise from a recent common ancestor and, hence, also more likely to indicate similar shared haplotypes over a substantial flanking region of sequence.
RESULTS: Our method exploits this idea to select a small set of highly informative copying states within a Hidden Markov Model (HMM) phasing algorithm. Using rare variants in this way allows us to avoid iterative MCMC methods to infer haplotypes. Compared to other approaches that do not explicitly use rare variants we obtain significant gains in phasing accuracy, less variation over phasing runs and improvements in speed. For example, using a reference panel of 7420 haplotypes from the UK10K project, we are able to reduce switch error rates by up to 50% when phasing samples sequenced at high-coverage. In addition, a single step rephasing of the UK10K panel, using rare variant information, has a downstream impact on phasing performance. These results represent a proof of concept that rare variant sharing patterns can be utilized to phase large high-coverage sequencing studies such as the 100 000 Genomes Project dataset.
AVAILABILITY AND IMPLEMENTATION: A webserver that includes an implementation of this new method and allows phasing of high-coverage clinical samples is available at https://phasingserver.stats.ox.ac.uk/ CONTACT: marchini@stats.ox.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2016        PMID: 27153703      PMCID: PMC4920110          DOI: 10.1093/bioinformatics/btw065

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  Compound heterozygosity for loss-of-function lysyl-tRNA synthetase mutations in a patient with peripheral neuropathy.

Authors:  Heather M McLaughlin; Reiko Sakaguchi; Cuiping Liu; Takao Igarashi; Davut Pehlivan; Kristine Chu; Ram Iyer; Pedro Cruz; Praveen F Cherukuri; Nancy F Hansen; James C Mullikin; Leslie G Biesecker; Thomas E Wilson; Victor Ionasescu; Garth Nicholson; Charles Searby; Kevin Talbot; Jeffrey M Vance; Stephan Züchner; Kinga Szigeti; James R Lupski; Ya-Ming Hou; Eric D Green; Anthony Antonellis
Journal:  Am J Hum Genet       Date:  2010-10-08       Impact factor: 11.025

2.  A linear complexity phasing method for thousands of genomes.

Authors:  Olivier Delaneau; Jonathan Marchini; Jean-François Zagury
Journal:  Nat Methods       Date:  2011-12-04       Impact factor: 28.547

3.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

4.  Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies.

Authors:  Brian L Browning; Zhaoxia Yu
Journal:  Am J Hum Genet       Date:  2009-12       Impact factor: 11.025

Review 5.  Haplotype-resolved genome sequencing: experimental methods and applications.

Authors:  Matthew W Snyder; Andrew Adey; Jacob O Kitzman; Jay Shendure
Journal:  Nat Rev Genet       Date:  2015-05-07       Impact factor: 53.242

6.  Cerebral palsy in siblings caused by compound heterozygous mutations in the gene encoding protein C.

Authors:  Choong Y I Fong; Andrew D Mumford; Marcus J Likeman; Philip E Jardine
Journal:  Dev Med Child Neurol       Date:  2010-02-19       Impact factor: 5.449

7.  Detection of sharing by descent, long-range phasing and haplotype imputation.

Authors:  Augustine Kong; Gisli Masson; Michael L Frigge; Arnaldur Gylfason; Pasha Zusmanovich; Gudmar Thorleifsson; Pall I Olason; Andres Ingason; Stacy Steinberg; Thorunn Rafnar; Patrick Sulem; Magali Mouy; Frosti Jonsson; Unnur Thorsteinsdottir; Daniel F Gudbjartsson; Hreinn Stefansson; Kari Stefansson
Journal:  Nat Genet       Date:  2008-09       Impact factor: 38.330

8.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

9.  Demography and the age of rare variants.

Authors:  Iain Mathieson; Gil McVean
Journal:  PLoS Genet       Date:  2014-08-07       Impact factor: 5.917

10.  The UK10K project identifies rare variants in health and disease.

Authors:  Klaudia Walter; Josine L Min; Jie Huang; Lucy Crooks; Yasin Memari; Shane McCarthy; John R B Perry; ChangJiang Xu; Marta Futema; Daniel Lawson; Valentina Iotchkova; Stephan Schiffels; Audrey E Hendricks; Petr Danecek; Rui Li; James Floyd; Louise V Wain; Inês Barroso; Steve E Humphries; Matthew E Hurles; Eleftheria Zeggini; Jeffrey C Barrett; Vincent Plagnol; J Brent Richards; Celia M T Greenwood; Nicholas J Timpson; Richard Durbin; Nicole Soranzo
Journal:  Nature       Date:  2015-09-14       Impact factor: 49.962

View more
  9 in total

1.  Genotyping, the Usefulness of Imputation to Increase SNP Density, and Imputation Methods and Tools.

Authors:  Florence Phocas
Journal:  Methods Mol Biol       Date:  2022

Review 2.  Genetic analysis of deep phenotyping projects in common disorders.

Authors:  Elliot S Gershon; Godfrey Pearlson; Matcheri S Keshavan; Carol Tamminga; Brett Clementz; Peter F Buckley; Ney Alliey-Rodriguez; Chunyu Liu; John A Sweeney; Sarah Keedy; Shashwath A Meda; Neeraj Tandon; Rebecca Shafee; Jeffrey R Bishop; Elena I Ivleva
Journal:  Schizophr Res       Date:  2017-10-20       Impact factor: 4.939

3.  Haplotype estimation for biobank-scale data sets.

Authors:  Jared O'Connell; Kevin Sharp; Nick Shrine; Louise Wain; Ian Hall; Martin Tobin; Jean-Francois Zagury; Olivier Delaneau; Jonathan Marchini
Journal:  Nat Genet       Date:  2016-06-06       Impact factor: 38.330

4.  BCFtools/csq: haplotype-aware variant consequences.

Authors:  Petr Danecek; Shane A McCarthy
Journal:  Bioinformatics       Date:  2017-07-01       Impact factor: 6.937

5.  Impact of pre- and post-variant filtration strategies on imputation.

Authors:  Céline Charon; Rodrigue Allodji; Vincent Meyer; Jean-François Deleuze
Journal:  Sci Rep       Date:  2021-03-18       Impact factor: 4.379

6.  A reference panel of 64,976 haplotypes for genotype imputation.

Authors:  Shane McCarthy; Sayantan Das; Warren Kretzschmar; Olivier Delaneau; Andrew R Wood; Alexander Teumer; Hyun Min Kang; Christian Fuchsberger; Petr Danecek; Kevin Sharp; Yang Luo; Carlo Sidore; Alan Kwong; Nicholas Timpson; Seppo Koskinen; Scott Vrieze; Laura J Scott; He Zhang; Anubha Mahajan; Jan Veldink; Ulrike Peters; Carlos Pato; Cornelia M van Duijn; Christopher E Gillies; Ilaria Gandin; Massimo Mezzavilla; Arthur Gilly; Massimiliano Cocca; Michela Traglia; Andrea Angius; Jeffrey C Barrett; Dorrett Boomsma; Kari Branham; Gerome Breen; Chad M Brummett; Fabio Busonero; Harry Campbell; Andrew Chan; Sai Chen; Emily Chew; Francis S Collins; Laura J Corbin; George Davey Smith; George Dedoussis; Marcus Dorr; Aliki-Eleni Farmaki; Luigi Ferrucci; Lukas Forer; Ross M Fraser; Stacey Gabriel; Shawn Levy; Leif Groop; Tabitha Harrison; Andrew Hattersley; Oddgeir L Holmen; Kristian Hveem; Matthias Kretzler; James C Lee; Matt McGue; Thomas Meitinger; David Melzer; Josine L Min; Karen L Mohlke; John B Vincent; Matthias Nauck; Deborah Nickerson; Aarno Palotie; Michele Pato; Nicola Pirastu; Melvin McInnis; J Brent Richards; Cinzia Sala; Veikko Salomaa; David Schlessinger; Sebastian Schoenherr; P Eline Slagboom; Kerrin Small; Timothy Spector; Dwight Stambolian; Marcus Tuke; Jaakko Tuomilehto; Leonard H Van den Berg; Wouter Van Rheenen; Uwe Volker; Cisca Wijmenga; Daniela Toniolo; Eleftheria Zeggini; Paolo Gasparini; Matthew G Sampson; James F Wilson; Timothy Frayling; Paul I W de Bakker; Morris A Swertz; Steven McCarroll; Charles Kooperberg; Annelot Dekker; David Altshuler; Cristen Willer; William Iacono; Samuli Ripatti; Nicole Soranzo; Klaudia Walter; Anand Swaroop; Francesco Cucca; Carl A Anderson; Richard M Myers; Michael Boehnke; Mark I McCarthy; Richard Durbin
Journal:  Nat Genet       Date:  2016-08-22       Impact factor: 38.330

7.  Haplotype-resolved germline and somatic alterations in renal medullary carcinomas.

Authors:  William C Hahn; Matthew Meyerson; Andrew L Hong; Kar-Tong Tan; Hyunji Kim; Jian Carrot-Zhang; Yuxiang Zhang; Won Jun Kim; Guillaume Kugener; Jeremiah A Wala; Thomas P Howard; Yueh-Yun Chi; Rameen Beroukhim; Heng Li; Gavin Ha; Seth L Alper; Elizabeth J Perlman; Elizabeth A Mullen
Journal:  Genome Med       Date:  2021-07-14       Impact factor: 11.117

8.  Reference-based phasing using the Haplotype Reference Consortium panel.

Authors:  Po-Ru Loh; Petr Danecek; Pier Francesco Palamara; Christian Fuchsberger; Yakir A Reshef; Hilary K Finucane; Sebastian Schoenherr; Lukas Forer; Shane McCarthy; Goncalo R Abecasis; Richard Durbin; Alkes L Price
Journal:  Nat Genet       Date:  2016-10-03       Impact factor: 38.330

9.  The Necessity of Diploid Genome Sequencing to Unravel the Genetic Component of Complex Phenotypes.

Authors:  Fernando Aleman
Journal:  Front Genet       Date:  2017-10-11       Impact factor: 4.599

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.