Literature DB >> 28176766

A fast and accurate method for detection of IBD shared haplotypes in genome-wide SNP data.

Douglas W Bjelland1, Uday Lingala1, Piyush S Patel1, Matt Jones2, Matthew C Keller1,2.   

Abstract

Identical by descent (IBD) segments are used to understand a number of fundamental issues in genetics. IBD segments are typically detected using long stretches of identical alleles between haplotypes in phased, whole-genome SNP data. Phase or SNP call errors in genomic data can degrade accuracy of IBD detection and lead to false-positive/negative calls and to under/overextension of true IBD segments. Furthermore, the number of comparisons increases quadratically with sample size, requiring high computational efficiency. We developed a new IBD segment detection program, FISHR (Find IBD Shared Haplotypes Rapidly), in an attempt to accurately detect IBD segments and to better estimate their endpoints using an algorithm that is fast enough to be deployed on very large whole-genome SNP data sets. We compared the performance of FISHR to three leading IBD segment detection programs: GERMLINE, refined IBD, and HaploScore. Using simulated and real genomic sequence data, we show that FISHR is slightly more accurate than all programs at detecting long (>3 cm) IBD segments but slightly less accurate than refined IBD at detecting short (~1 cm) IBD segments. More centrally, FISHR outperforms all programs in determining the true endpoints of IBD segments, which is crucial for several applications of IBD information. FISHR takes two to three times longer than GERMLINE to run, whereas both GERMLINE and FISHR were orders of magnitude faster than refined IBD and HaploScore. Overall, FISHR provides accurate IBD detection in unrelated individuals and is computationally efficient enough to be utilized on large SNP data sets >60 000 individuals.

Mesh:

Year:  2017        PMID: 28176766      PMCID: PMC5437913          DOI: 10.1038/ejhg.2017.6

Source DB:  PubMed          Journal:  Eur J Hum Genet        ISSN: 1018-4813            Impact factor:   4.246


  20 in total

1.  A linear complexity phasing method for thousands of genomes.

Authors:  Olivier Delaneau; Jonathan Marchini; Jean-François Zagury
Journal:  Nat Methods       Date:  2011-12-04       Impact factor: 28.547

2.  Whole genome genotyping technologies on the BeadArray platform.

Authors:  Frank J Steemers; Kevin L Gunderson
Journal:  Biotechnol J       Date:  2007-01       Impact factor: 4.677

3.  A fast, powerful method for detecting identity by descent.

Authors:  Brian L Browning; Sharon R Browning
Journal:  Am J Hum Genet       Date:  2011-02-11       Impact factor: 11.025

4.  Genome-wide mapping of IBD segments in an Ashkenazi PD cohort identifies associated haplotypes.

Authors:  Vladimir Vacic; Laurie J Ozelius; Lorraine N Clark; Anat Bar-Shira; Mali Gana-Weisz; Tanya Gurevich; Alexander Gusev; Merav Kedmi; Eimear E Kenny; Xinmin Liu; Helen Mejia-Santana; Anat Mirelman; Deborah Raymond; Rachel Saunders-Pullman; Robert J Desnick; Gil Atzmon; Edward R Burns; Harry Ostrer; Hakon Hakonarson; Aviv Bergman; Nir Barzilai; Ariel Darvasi; Inga Peter; Saurav Guha; Todd Lencz; Nir Giladi; Karen Marder; Itsik Pe'er; Susan B Bressman; Avi Orr-Urtreger
Journal:  Hum Mol Genet       Date:  2014-05-19       Impact factor: 6.150

5.  HLA type inference via haplotypes identical by descent.

Authors:  Manu N Setty; Alexander Gusev; Itsik Pe'er
Journal:  J Comput Biol       Date:  2011-03       Impact factor: 1.479

6.  Detection of sharing by descent, long-range phasing and haplotype imputation.

Authors:  Augustine Kong; Gisli Masson; Michael L Frigge; Arnaldur Gylfason; Pasha Zusmanovich; Gudmar Thorleifsson; Pall I Olason; Andres Ingason; Stacy Steinberg; Thorunn Rafnar; Patrick Sulem; Magali Mouy; Frosti Jonsson; Unnur Thorsteinsdottir; Daniel F Gudbjartsson; Hreinn Stefansson; Kari Stefansson
Journal:  Nat Genet       Date:  2008-09       Impact factor: 38.330

7.  Technical reproducibility of genotyping SNP arrays used in genome-wide association studies.

Authors:  Huixiao Hong; Lei Xu; Jie Liu; Wendell D Jones; Zhenqiang Su; Baitang Ning; Roger Perkins; Weigong Ge; Kelci Miclaus; Li Zhang; Kyunghee Park; Bridgett Green; Tao Han; Hong Fang; Christophe G Lambert; Silvia C Vega; Simon M Lin; Nadereh Jafari; Wendy Czika; Russell D Wolfinger; Federico Goodsaid; Weida Tong; Leming Shi
Journal:  PLoS One       Date:  2012-09-07       Impact factor: 3.240

8.  A genotype calling algorithm for the Illumina BeadArray platform.

Authors:  Yik Y Teo; Michael Inouye; Kerrin S Small; Rhian Gwilliam; Panagiotis Deloukas; Dominic P Kwiatkowski; Taane G Clark
Journal:  Bioinformatics       Date:  2007-09-10       Impact factor: 6.937

9.  Designing genome-wide association studies: sample size, power, imputation, and the choice of genotyping chip.

Authors:  Chris C A Spencer; Zhan Su; Peter Donnelly; Jonathan Marchini
Journal:  PLoS Genet       Date:  2009-05-15       Impact factor: 5.917

10.  The UK10K project identifies rare variants in health and disease.

Authors:  Klaudia Walter; Josine L Min; Jie Huang; Lucy Crooks; Yasin Memari; Shane McCarthy; John R B Perry; ChangJiang Xu; Marta Futema; Daniel Lawson; Valentina Iotchkova; Stephan Schiffels; Audrey E Hendricks; Petr Danecek; Rui Li; James Floyd; Louise V Wain; Inês Barroso; Steve E Humphries; Matthew E Hurles; Eleftheria Zeggini; Jeffrey C Barrett; Vincent Plagnol; J Brent Richards; Celia M T Greenwood; Nicholas J Timpson; Richard Durbin; Nicole Soranzo
Journal:  Nature       Date:  2015-09-14       Impact factor: 49.962

View more
  7 in total

1.  Rapid, Phase-free Detection of Long Identity-by-Descent Segments Enables Effective Relationship Classification.

Authors:  Daniel N Seidman; Sushila A Shenoy; Minsoo Kim; Ramya Babu; Ian G Woods; Thomas D Dyer; Donna M Lehman; Joanne E Curran; Ravindranath Duggirala; John Blangero; Amy L Williams
Journal:  Am J Hum Genet       Date:  2020-03-19       Impact factor: 11.025

2.  False discovery rate control in genome-wide association studies with population structure.

Authors:  Matteo Sesia; Stephen Bates; Emmanuel Candès; Jonathan Marchini; Chiara Sabatti
Journal:  Proc Natl Acad Sci U S A       Date:  2021-10-05       Impact factor: 11.205

3.  Breakpoint mapping and haplotype analysis of translocation t(1;12)(q43;q21.1) in two apparently independent families with vascular phenotypes.

Authors:  Tiia Maria Luukkonen; Mana M Mehrjouy; Minna Pöyhönen; Anna-Kaisa Anttonen; Päivi Lahermo; Pekka Ellonen; Lars Paulin; Niels Tommerup; Aarno Palotie; Teppo Varilo
Journal:  Mol Genet Genomic Med       Date:  2017-11-23       Impact factor: 2.183

4.  NARD: whole-genome reference panel of 1779 Northeast Asians improves imputation accuracy of rare and low-frequency variants.

Authors:  Seong-Keun Yoo; Chang-Uk Kim; Hie Lim Kim; Sungjae Kim; Jong-Yeon Shin; Namcheol Kim; Joshua Sung Woo Yang; Kwok-Wai Lo; Belong Cho; Fumihiko Matsuda; Stephan C Schuster; Changhoon Kim; Jong-Il Kim; Jeong-Sun Seo
Journal:  Genome Med       Date:  2019-10-22       Impact factor: 11.117

5.  Novel Approach for Parallelizing Pairwise Comparison Problems as Applied to Detecting Segments Identical By Decent in Whole-Genome Data.

Authors:  Emmanuel Sapin; Matthew C Keller
Journal:  Bioinformatics       Date:  2021-03-10       Impact factor: 6.931

6.  Rapid detection of identity-by-descent tracts for mega-scale datasets.

Authors:  Ruhollah Shemirani; Gillian M Belbin; Christy L Avery; Eimear E Kenny; Christopher R Gignoux; José Luis Ambite
Journal:  Nat Commun       Date:  2021-06-10       Impact factor: 14.919

7.  Attacks on genetic privacy via uploads to genealogical databases.

Authors:  Michael D Edge; Graham Coop
Journal:  Elife       Date:  2020-01-07       Impact factor: 8.713

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.