Literature DB >> 21121036

Haplotype phasing by multi-assembly of shared haplotypes: phase-dependent interactions between rare variants.

Bjarni V Halldórsson1, Derek Aguiar, Sorin Istrail.   

Abstract

In this paper we propose algorithmic strategies, Lander-Waterman-like statistical estimates, and genome-wide software for haplotype phasing by multi-assembly of shared haplotypes. Specifically, we consider four types of results which together provide a comprehensive workflow of GWAS data sets: (1) statistics of multi-assembly of shared haplotypes (2) graph theoretic algorithms for haplotype assembly based on conflict graphs of sequencing reads (3) inference of pedigree structure through haplotype sharing via tract finding algorithms and (4) multi-assembly of shared haplotypes of cases, controls, and trios. The input for the workflows that we consider are any of the combination of: (A) genotype data (B) next generation sequencing (NGS) (C) pedigree information. (1) We present Lander-Waterman-like statistics for NGS projects for the multi-assembly of shared haplotypes. Results are presented in Sec. 2. (2) In Sec. 3, we present algorithmic strategies for haplotype assembly using NGS, NGS + genotype data, and NGS + pedigree information. (3) This work builds on algorithms presented in Halldórsson et al. and are part of the same library of tools co-developed for GWAS workflows. (4) Section 3.3.1 contains algorithmic strategies for multi-assembly of GWAS data. We present algorithms for assembling large data sets and for determining and using shared haplotypes to more reliably assemble and phase the data. Workflows 1-4 provide a set of rigorous algorithms which have the potential to identify phase-dependent interactions between rare variants in linkage equilibrium which are associated with cases. They build on our extensive work on haplotype phasing, haplotype assembly, and whole genome assembly comparison.

Entities:  

Mesh:

Year:  2011        PMID: 21121036     DOI: 10.1142/9789814335058_0010

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  4 in total

1.  HapCompass: a fast cycle basis algorithm for accurate haplotype assembly of sequence data.

Authors:  Derek Aguiar; Sorin Istrail
Journal:  J Comput Biol       Date:  2012-06       Impact factor: 1.479

2.  Tumor haplotype assembly algorithms for cancer genomics.

Authors:  Derek Aguiar; Wendy S W Wong; Sorin Istrail
Journal:  Pac Symp Biocomput       Date:  2014

3.  Evaluating allopolyploid origins in strawberries (Fragaria) using haplotypes generated from target capture sequencing.

Authors:  Olga K Kamneva; John Syring; Aaron Liston; Noah A Rosenberg
Journal:  BMC Evol Biol       Date:  2017-08-04       Impact factor: 3.260

4.  Haplotype assembly in polyploid genomes and identical by descent shared tracts.

Authors:  Derek Aguiar; Sorin Istrail
Journal:  Bioinformatics       Date:  2013-07-01       Impact factor: 6.937

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.