A dynamic Bayesian Markov model for phasing and characterizing haplotypes in next-generation sequencing.

Yu Zhang

Abstract

MOTIVATION: Next-generation sequencing (NGS) technologies have enabled whole-genome discovery and analysis of genetic variants in many species of interest. Individuals are often sequenced at low coverage for detecting novel variants, phasing haplotypes and inferring population structures. Although several tools have been developed for SNP and genotype calling in NGS data, haplotype phasing is often done separately on the called genotypes.
RESULTS: We propose a dynamic Bayesian Markov model (DBM) for simultaneous genotype calling and haplotype phasing in low-coverage NGS data of unrelated individuals. Our method is fully probabilistic and produces consistent inference of genotypes, haplotypes and recombination probabilities. Using data from the 1000 Genomes Project, we demonstrate that DBM not only yields more accurate results than some popular methods, but also provides novel characterization of haplotype structures at the individual level for visualization, interpretation and comparison in downstream analysis. DBM is a powerful and flexible tool that can be applied to many sequencing studies. Its statistical framework can also be extended to accommodate broader scopes of data.
AVAILABILITY AND IMPLEMENTATION: http://stat.psu.edu/~yuzhang/software/dbm.tar.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
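
ILLUSTRATION: To make concrete why genotype calls from low-coverage data are uncertain on their own, the following is a minimal sketch of per-site genotype calling from read counts. It is not the DBM described in the abstract: it assumes a simple binomial read model, a Hardy-Weinberg prior, and a fixed sequencing error rate, and the function name and parameters are hypothetical choices made for this example only.

```python
from math import comb


def genotype_posteriors(ref_reads, alt_reads, alt_freq, error_rate=0.01):
    """Posterior over genotypes carrying 0, 1 or 2 copies of the alternate allele.

    Illustrative assumptions (not taken from the paper): reads are independent,
    each read is wrong with probability `error_rate`, and genotypes follow
    Hardy-Weinberg proportions given the population frequency `alt_freq`.
    """
    n = ref_reads + alt_reads
    # Probability that one read shows the alternate allele, per genotype.
    p_alt = [error_rate, 0.5, 1.0 - error_rate]
    # Hardy-Weinberg prior on the three genotypes.
    q = alt_freq
    prior = [(1 - q) ** 2, 2 * q * (1 - q), q ** 2]
    # Binomial likelihood of the observed read counts under each genotype.
    like = [comb(n, alt_reads) * p ** alt_reads * (1 - p) ** ref_reads
            for p in p_alt]
    joint = [l * pr for l, pr in zip(like, prior)]
    total = sum(joint)
    return [j / total for j in joint]


# Two reads, both showing the alternate allele, at a site with frequency 0.2.
print(genotype_posteriors(ref_reads=0, alt_reads=2, alt_freq=0.2))
```

Under these assumptions, two alternate reads and no reference reads still leave the heterozygous genotype more probable than the homozygous alternate one, because the prior outweighs so little read evidence. Borrowing information from neighbouring sites through haplotype structure is one way to reduce that uncertainty, which is the general motivation for calling and phasing jointly.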

Year:  2013        PMID: 23407359      PMCID: PMC3656686          DOI: 10.1093/bioinformatics/btt065

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


