Literature DB >> 29028986

A new haplotype block detection method for dense genome sequencing data based on interval graph modeling of clusters of highly correlated SNPs.

Sun Ah Kim1, Chang-Sung Cho2, Suh-Ryung Kim2, Shelley B Bull3,4, Yun Joo Yoo2,5.   

Abstract

Motivation: Linkage disequilibrium (LD) block construction is required for research in population genetics and genetic epidemiology, including specification of sets of single nucleotide polymorphisms (SNPs) for analysis of multi-SNP based association and identification of haplotype blocks in high density sequencing data. Existing methods based on a narrow sense definition do not allow intermediate regions of low LD between strongly associated SNP pairs and tend to split high density SNP data into small blocks having high between-block correlation.
Results: We present Big-LD, a block partition method based on interval graph modeling of LD bins which are clusters of strong pairwise LD SNPs, not necessarily physically consecutive. Big-LD uses an agglomerative approach that starts by identifying small communities of SNPs, i.e. the SNPs in each LD bin region, and proceeds by merging these communities. We determine the number of blocks using a method to find maximum-weight independent set. Big-LD produces larger LD blocks compared to existing methods such as MATILDE, Haploview, MIG ++, or S-MIG ++ and the LD blocks better agree with recombination hotspot locations determined by sperm-typing experiments. The observed average runtime of Big-LD for 13 288 240 non-monomorphic SNPs from 1000 Genomes Project autosome data (286 East Asians) is about 5.83 h, which is a significant improvement over the existing methods. Availability and implementation: Source code and documentation are available for download at http://github.com/sunnyeesl/BigLD. Contact: yyoo@snu.ac.kr. Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2018        PMID: 29028986      PMCID: PMC5860363          DOI: 10.1093/bioinformatics/btx609

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  38 in total

1.  Genetic epidemiology of single-nucleotide polymorphisms.

Authors:  A Collins; C Lonjou; N E Morton
Journal:  Proc Natl Acad Sci U S A       Date:  1999-12-21       Impact factor: 11.205

2.  A dynamic programming algorithm for haplotype block partitioning.

Authors:  Kui Zhang; Minghua Deng; Ting Chen; Michael S Waterman; Fengzhu Sun
Journal:  Proc Natl Acad Sci U S A       Date:  2002-05-28       Impact factor: 11.205

3.  The future of association studies: gene-based analysis and replication.

Authors:  Benjamin M Neale; Pak C Sham
Journal:  Am J Hum Genet       Date:  2004-07-22       Impact factor: 11.025

4.  Powerful SNP-set analysis for case-control genome-wide association studies.

Authors:  Michael C Wu; Peter Kraft; Michael P Epstein; Deanne M Taylor; Stephen J Chanock; David J Hunter; Xihong Lin
Journal:  Am J Hum Genet       Date:  2010-06-11       Impact factor: 11.025

5.  The fine-scale structure of recombination rate variation in the human genome.

Authors:  Gilean A T McVean; Simon R Myers; Sarah Hunt; Panos Deloukas; David R Bentley; Peter Donnelly
Journal:  Science       Date:  2004-04-23       Impact factor: 47.728

6.  Haplotype structure, LD blocks, and uneven recombination within the LRP5 gene.

Authors:  Rebecca C J Twells; Charles A Mein; Michael S Phillips; J Fred Hess; Riitta Veijola; Matthew Gilbey; Matthew Bright; Michael Metzker; Benedicte A Lie; Amanda Kingsnorth; Edward Gregory; Yusuke Nakagawa; Hywel Snook; William Y S Wang; Jennifer Masters; Gillian Johnson; Iain Eaves; Joanna M M Howson; David Clayton; Heather J Cordell; Sarah Nutland; Helen Rance; Philippa Carr; John A Todd
Journal:  Genome Res       Date:  2003-05       Impact factor: 9.043

Review 7.  Linkage disequilibrium--understanding the evolutionary past and mapping the medical future.

Authors:  Montgomery Slatkin
Journal:  Nat Rev Genet       Date:  2008-06       Impact factor: 53.242

8.  Pathway-based analysis using reduced gene subsets in genome-wide association studies.

Authors:  Jingyuan Zhao; Simone Gupta; Mark Seielstad; Jianjun Liu; Anbupalam Thalamuthu
Journal:  BMC Bioinformatics       Date:  2011-01-12       Impact factor: 3.169

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  Clique-Based Clustering of Correlated SNPs in a Gene Can Improve Performance of Gene-Based Multi-Bin Linear Combination Test.

Authors:  Yun Joo Yoo; Sun Ah Kim; Shelley B Bull
Journal:  Biomed Res Int       Date:  2015-08-04       Impact factor: 3.411

View more
  12 in total

1.  A major-effect genetic locus, ApRVII, controlling resistance against both adapted and non-adapted aphid biotypes in pea.

Authors:  Marie-Laure Pilet-Nayel; Jean-Christophe Simon; Akiko Sugio; Rémi Ollivier; Isabelle Glory; Romuald Cloteau; Jean-François Le Gallic; Gaëtan Denis; Stéphanie Morlière; Henri Miteul; Jean-Philippe Rivière; Angélique Lesné; Anthony Klein; Grégoire Aubert; Jonathan Kreplak; Judith Burstin
Journal:  Theor Appl Genet       Date:  2022-02-22       Impact factor: 5.699

Review 2.  Non-coding regulatory elements: Potential roles in disease and the case of epilepsy.

Authors:  Susanna Pagni; James D Mills; Adam Frankish; Jonathan M Mudge; Sanjay M Sisodiya
Journal:  Neuropathol Appl Neurobiol       Date:  2021-12-16       Impact factor: 6.250

3.  Genome-wide association mapping for resistance to bacterial blight and bacterial leaf streak in rice.

Authors:  Nan Jiang; Jun Fu; Qin Zeng; Yi Liang; Yanlong Shi; Zhouwei Li; Youlun Xiao; Zhizhou He; Yuntian Wu; Yu Long; Kai Wang; Yuanzhu Yang; Xionglun Liu; Junhua Peng
Journal:  Planta       Date:  2021-04-08       Impact factor: 4.116

4.  SCN1A overexpression, associated with a genomic region marked by a risk variant for a common epilepsy, raises seizure susceptibility.

Authors:  Katri Silvennoinen; Kinga Gawel; Despina Tsortouktzidis; Albert J Becker; Camila V Esguerra; Sanjay M Sisodiya; Julika Pitsch; Saud Alhusaini; Karen M J van Loo; Richard Picardo; Zuzanna Michalak; Susanna Pagni; Helena Martins Custodio; James Mills; Christopher D Whelan; Greig I de Zubicaray; Katie L McMahon; Wietske van der Ent; Karolina J Kirstein-Smardzewska; Ettore Tiraboschi; Jonathan M Mudge; Adam Frankish; Maria Thom; Margaret J Wright; Paul M Thompson; Susanne Schoch
Journal:  Acta Neuropathol       Date:  2022-05-12       Impact factor: 15.887

Review 5.  5Gs for crop genetic improvement.

Authors:  Rajeev K Varshney; Pallavi Sinha; Vikas K Singh; Arvind Kumar; Qifa Zhang; Jeffrey L Bennetzen
Journal:  Curr Opin Plant Biol       Date:  2020-01-28       Impact factor: 7.834

6.  HaploBlocker: Creation of Subgroup-Specific Haplotype Blocks and Libraries.

Authors:  Torsten Pook; Martin Schlather; Gustavo de Los Campos; Manfred Mayer; Chris Carolin Schoen; Henner Simianer
Journal:  Genetics       Date:  2019-05-31       Impact factor: 4.562

7.  gpart: human genome partitioning and visualization of high-density SNP data by identifying haplotype blocks.

Authors:  Sun Ah Kim; Myriam Brossard; Delnaz Roshandel; Andrew D Paterson; Shelley B Bull; Yun Joo Yoo
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

8.  Integration of Alzheimer's disease genetics and myeloid genomics identifies disease risk regulatory elements and genes.

Authors:  Edoardo Marcora; Alison M Goate; Gloriia Novikova; Manav Kapoor; Julia Tcw; Edsel M Abud; Anastasia G Efthymiou; Steven X Chen; Haoxiang Cheng; John F Fullard; Jaroslav Bendl; Yiyuan Liu; Panos Roussos; Johan Lm Björkegren; Yunlong Liu; Wayne W Poon; Ke Hao
Journal:  Nat Commun       Date:  2021-03-12       Impact factor: 14.919

9.  Haplotype-Based Single-Step GWAS for Yearling Temperament in American Angus Cattle.

Authors:  Andre C Araujo; Paulo L S Carneiro; Amanda B Alvarenga; Hinayah R Oliveira; Stephen P Miller; Kelli Retallick; Luiz F Brito
Journal:  Genes (Basel)       Date:  2021-12-22       Impact factor: 4.096

10.  A forward genetics approach integrating genome-wide association study and expression quantitative trait locus mapping to dissect leaf development in maize (Zea mays).

Authors:  Mara Miculan; Hilde Nelissen; Manel Ben Hassen; Fabio Marroni; Dirk Inzé; Mario Enrico Pè; Matteo Dell'Acqua
Journal:  Plant J       Date:  2021-07-08       Impact factor: 6.417

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.