Literature DB >> 34478634

Fast two-stage phasing of large-scale sequence data.

Brian L Browning1, Xiaowen Tian2, Ying Zhou3, Sharon R Browning4.   

Abstract

Haplotype phasing is the estimation of haplotypes from genotype data. We present a fast, accurate, and memory-efficient haplotype phasing method that scales to large-scale SNP array and sequence data. The method uses marker windowing and composite reference haplotypes to reduce memory usage and computation time. It incorporates a progressive phasing algorithm that identifies confidently phased heterozygotes in each iteration and fixes the phase of these heterozygotes in subsequent iterations. For data with many low-frequency variants, such as whole-genome sequence data, the method employs a two-stage phasing algorithm that phases high-frequency markers via progressive phasing in the first stage and phases low-frequency markers via genotype imputation in the second stage. This haplotype phasing method is implemented in the open-source Beagle 5.2 software package. We compare Beagle 5.2 and SHAPEIT 4.2.1 by using expanding subsets of 485,301 UK Biobank samples and 38,387 TOPMed samples. Both methods have very similar accuracy and computation time for UK Biobank SNP array data. However, for TOPMed sequence data, Beagle is more than 20 times faster than SHAPEIT, achieves similar accuracy, and scales to larger sample sizes.
Copyright © 2021 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  TOPMed; UK Biobank; genotype phasing; haplotype phasing; phasing

Mesh:

Year:  2021        PMID: 34478634      PMCID: PMC8551421          DOI: 10.1016/j.ajhg.2021.08.005

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  40 in total

1.  A whole genome long-range haplotype (WGLRH) test for detecting imprints of positive selection in human populations.

Authors:  Chun Zhang; Dione K Bailey; Tarif Awad; Guoying Liu; Guoliang Xing; Manqiu Cao; Venu Valmeekam; Jacques Retief; Hajime Matsuzaki; Margaret Taub; Mark Seielstad; Giulia C Kennedy
Journal:  Bioinformatics       Date:  2006-07-15       Impact factor: 6.937

2.  RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference.

Authors:  Brian K Maples; Simon Gravel; Eimear E Kenny; Carlos D Bustamante
Journal:  Am J Hum Genet       Date:  2013-08-01       Impact factor: 11.025

3.  A Fast and Simple Method for Detecting Identity-by-Descent Segments in Large-Scale Data.

Authors:  Ying Zhou; Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2020-03-12       Impact factor: 11.025

4.  Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel.

Authors:  Olivier Delaneau; Jonathan Marchini
Journal:  Nat Commun       Date:  2014-06-13       Impact factor: 14.919

5.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

6.  Haplotype estimation for biobank-scale data sets.

Authors:  Jared O'Connell; Kevin Sharp; Nick Shrine; Louise Wain; Ian Hall; Martin Tobin; Jean-Francois Zagury; Olivier Delaneau; Jonathan Marchini
Journal:  Nat Genet       Date:  2016-06-06       Impact factor: 38.330

7.  Fine-Scale Inference of Ancestry Segments Without Prior Knowledge of Admixing Groups.

Authors:  Michael Salter-Townshend; Simon Myers
Journal:  Genetics       Date:  2019-05-23       Impact factor: 4.562

8.  Reference-based phasing using the Haplotype Reference Consortium panel.

Authors:  Po-Ru Loh; Petr Danecek; Pier Francesco Palamara; Christian Fuchsberger; Yakir A Reshef; Hilary K Finucane; Sebastian Schoenherr; Lukas Forer; Shane McCarthy; Goncalo R Abecasis; Richard Durbin; Alkes L Price
Journal:  Nat Genet       Date:  2016-10-03       Impact factor: 38.330

9.  The UK Biobank resource with deep phenotyping and genomic data.

Authors:  Clare Bycroft; Colin Freeman; Desislava Petkova; Gavin Band; Lloyd T Elliott; Kevin Sharp; Allan Motyer; Damjan Vukcevic; Olivier Delaneau; Jared O'Connell; Adrian Cortes; Samantha Welsh; Alan Young; Mark Effingham; Gil McVean; Stephen Leslie; Naomi Allen; Peter Donnelly; Jonathan Marchini
Journal:  Nature       Date:  2018-10-10       Impact factor: 49.962

10.  Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.

Authors:  Daniel Taliun; Daniel N Harris; Michael D Kessler; Jedidiah Carlson; Zachary A Szpiech; Raul Torres; Sarah A Gagliano Taliun; André Corvelo; Stephanie M Gogarten; Hyun Min Kang; Achilleas N Pitsillides; Jonathon LeFaive; Seung-Been Lee; Xiaowen Tian; Brian L Browning; Sayantan Das; Anne-Katrin Emde; Wayne E Clarke; Douglas P Loesch; Amol C Shetty; Thomas W Blackwell; Albert V Smith; Quenna Wong; Xiaoming Liu; Matthew P Conomos; Dean M Bobo; François Aguet; Christine Albert; Alvaro Alonso; Kristin G Ardlie; Dan E Arking; Stella Aslibekyan; Paul L Auer; John Barnard; R Graham Barr; Lucas Barwick; Lewis C Becker; Rebecca L Beer; Emelia J Benjamin; Lawrence F Bielak; John Blangero; Michael Boehnke; Donald W Bowden; Jennifer A Brody; Esteban G Burchard; Brian E Cade; James F Casella; Brandon Chalazan; Daniel I Chasman; Yii-Der Ida Chen; Michael H Cho; Seung Hoan Choi; Mina K Chung; Clary B Clish; Adolfo Correa; Joanne E Curran; Brian Custer; Dawood Darbar; Michelle Daya; Mariza de Andrade; Dawn L DeMeo; Susan K Dutcher; Patrick T Ellinor; Leslie S Emery; Celeste Eng; Diane Fatkin; Tasha Fingerlin; Lukas Forer; Myriam Fornage; Nora Franceschini; Christian Fuchsberger; Stephanie M Fullerton; Soren Germer; Mark T Gladwin; Daniel J Gottlieb; Xiuqing Guo; Michael E Hall; Jiang He; Nancy L Heard-Costa; Susan R Heckbert; Marguerite R Irvin; Jill M Johnsen; Andrew D Johnson; Robert Kaplan; Sharon L R Kardia; Tanika Kelly; Shannon Kelly; Eimear E Kenny; Douglas P Kiel; Robert Klemmer; Barbara A Konkle; Charles Kooperberg; Anna Köttgen; Leslie A Lange; Jessica Lasky-Su; Daniel Levy; Xihong Lin; Keng-Han Lin; Chunyu Liu; Ruth J F Loos; Lori Garman; Robert Gerszten; Steven A Lubitz; Kathryn L Lunetta; Angel C Y Mak; Ani Manichaikul; Alisa K Manning; Rasika A Mathias; David D McManus; Stephen T McGarvey; James B Meigs; Deborah A Meyers; Julie L Mikulla; Mollie A Minear; Braxton D Mitchell; Sanghamitra Mohanty; May E Montasser; Courtney Montgomery; Alanna C Morrison; Joanne M Murabito; Andrea Natale; Pradeep Natarajan; Sarah C Nelson; Kari E North; Jeffrey R O'Connell; Nicholette D Palmer; Nathan Pankratz; Gina M Peloso; Patricia A Peyser; Jacob Pleiness; Wendy S Post; Bruce M Psaty; D C Rao; Susan Redline; Alexander P Reiner; Dan Roden; Jerome I Rotter; Ingo Ruczinski; Chloé Sarnowski; Sebastian Schoenherr; David A Schwartz; Jeong-Sun Seo; Sudha Seshadri; Vivien A Sheehan; Wayne H Sheu; M Benjamin Shoemaker; Nicholas L Smith; Jennifer A Smith; Nona Sotoodehnia; Adrienne M Stilp; Weihong Tang; Kent D Taylor; Marilyn Telen; Timothy A Thornton; Russell P Tracy; David J Van Den Berg; Ramachandran S Vasan; Karine A Viaud-Martinez; Scott Vrieze; Daniel E Weeks; Bruce S Weir; Scott T Weiss; Lu-Chen Weng; Cristen J Willer; Yingze Zhang; Xutong Zhao; Donna K Arnett; Allison E Ashley-Koch; Kathleen C Barnes; Eric Boerwinkle; Stacey Gabriel; Richard Gibbs; Kenneth M Rice; Stephen S Rich; Edwin K Silverman; Pankaj Qasba; Weiniu Gan; George J Papanicolaou; Deborah A Nickerson; Sharon R Browning; Michael C Zody; Sebastian Zöllner; James G Wilson; L Adrienne Cupples; Cathy C Laurie; Cashell E Jaquish; Ryan D Hernandez; Timothy D O'Connor; Gonçalo R Abecasis
Journal:  Nature       Date:  2021-02-10       Impact factor: 69.504

View more
  27 in total

1.  Profiling of Fusarium head blight resistance QTL haplotypes through molecular markers, genotyping-by-sequencing, and machine learning.

Authors:  Zachary J Winn; Jeanette Lyerly; Brian Ward; Gina Brown-Guedira; Richard E Boyles; Mohamed Mergoum; Jerry Johnson; Stephen Harrison; Ali Babar; Richard E Mason; Russell Sutton; J Paul Murphy
Journal:  Theor Appl Genet       Date:  2022-07-24       Impact factor: 5.574

2.  Genomic-environmental associations in wild cranberry (Vaccinium macrocarpon Ait.).

Authors:  Jeffrey L Neyhart; Michael B Kantar; Juan Zalapa; Nicholi Vorsa
Journal:  G3 (Bethesda)       Date:  2022-09-30       Impact factor: 3.542

3.  Identification and genomic characterization of major effect bacterial blight resistance locus (BB-13) in Upland cotton (Gossypium hirsutum L.).

Authors:  S Anjan Gowda; Navin Shrestha; Taylor M Harris; Vasu Kuraparthy; Anne Z Phillips; Hui Fang; Shilpa Sood; Kuang Zhang; Fred Bourland; Rebecca Bart
Journal:  Theor Appl Genet       Date:  2022-10-08       Impact factor: 5.574

4.  Karyotype variation, spontaneous genome rearrangements affecting chemical insensitivity, and expression level polymorphisms in the plant pathogen Phytophthora infestans revealed using its first chromosome-scale assembly.

Authors:  Michael E H Matson; Qihua Liang; Stefano Lonardi; Howard S Judelson
Journal:  PLoS Pathog       Date:  2022-10-10       Impact factor: 7.464

5.  Small EPIDERMAL PATTERNING FACTOR-LIKE2 peptides regulate awn development in rice.

Authors:  Luling Xiong; Yingyong Huang; Zupei Liu; Chen Li; Hang Yu; Muhammad Qasim Shahid; Yanhui Lin; Xiaoyi Qiao; Junyi Xiao; Julie E Gray; Jing Jin
Journal:  Plant Physiol       Date:  2022-08-29       Impact factor: 8.005

6.  Genotype error biases trio-based estimates of haplotype phase accuracy.

Authors:  Brian L Browning; Sharon R Browning
Journal:  Am J Hum Genet       Date:  2022-06-02       Impact factor: 11.043

7.  The emergence of supergenes from inversions in Atlantic salmon.

Authors:  Kristina Stenløkk; Marie Saitou; Live Rud-Johansen; Torfinn Nome; Michel Moser; Mariann Árnyasi; Matthew Kent; Nicola Jane Barson; Sigbjørn Lien
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2022-06-13       Impact factor: 6.671

8.  Novel Fusarium wilt resistance genes uncovered in natural and cultivated strawberry populations are found on three non-homoeologous chromosomes.

Authors:  Dominique D A Pincot; Mitchell J Feldmann; Michael A Hardigan; Mishi V Vachev; Peter M Henry; Thomas R Gordon; Marta Bjornson; Alan Rodriguez; Nicolas Cobo; Randi A Famula; Glenn S Cole; Gitta L Coaker; Steven J Knapp
Journal:  Theor Appl Genet       Date:  2022-05-18       Impact factor: 5.574

9.  Between a Rock and a Hard Polytomy: Phylogenomics of the Rock-Dwelling Mbuna Cichlids of Lake Malaŵi.

Authors:  Mark D Scherz; Paul Masonick; Axel Meyer; C Darrin Hulsey
Journal:  Syst Biol       Date:  2022-04-19       Impact factor: 9.160

10.  Population genomics of Puccinia graminis f.sp. tritici highlights the role of admixture in the origin of virulent wheat rust races.

Authors:  Yuanwen Guo; Bliss Betzen; Andres Salcedo; Fei He; Robert L Bowden; John P Fellers; Katherine W Jordan; Alina Akhunova; Mathew N Rouse; Les J Szabo; Eduard Akhunov
Journal:  Nat Commun       Date:  2022-10-21       Impact factor: 17.694

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.