Literature DB >> 15961486

Efficient computation of close lower and upper bounds on the minimum number of recombinations in biological sequence evolution.

Yun S Song1, Yufeng Wu, Dan Gusfield.   

Abstract

MOTIVATION: We are interested in studying the evolution of DNA single nucleotide polymorphism sequences which have undergone (meiotic) recombination. For a given set of sequences, computing the minimum number of recombinations needed to explain the sequences (with one mutation per site) is a standard question of interest, but it has been shown to be NP-hard, and previous algorithms that compute it exactly work either only on very small datasets or on problems with special structure.
RESULTS: In this paper, we present efficient, practical methods for computing both upper and lower bounds on the minimum number of needed recombinations, and for constructing evolutionary histories that explain the input sequences. We study in detail the efficiency and accuracy of these algorithms on both simulated and real data sets. The algorithms produce very close upper and lower bounds, which match exactly in a surprisingly wide range of data. Thus, with the use of new, very effective lower bounding methods and an efficient algorithm for computing upper bounds, this approach allows the efficient, exact computation of the minimum number of needed recombinations, with high frequency in a large range of data. When upper and lower bounds match, evolutionary histories found by our algorithm correspond to the most parsimonious histories. AVAILABILITY: HapBound and SHRUB, programs implementing the new algorithms discussed in this paper, are available at http://wwwcsif.cs.ucdavis.edu/~gusfield/lu.html

Mesh:

Substances:

Year:  2005        PMID: 15961486     DOI: 10.1093/bioinformatics/bti1033

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

1.  Estimating the contribution of mutation, recombination and gene conversion in the generation of haplotypic diversity.

Authors:  Peter L Morrell; Donna M Toleno; Karen E Lundy; Michael T Clegg
Journal:  Genetics       Date:  2006-04-19       Impact factor: 4.562

2.  Algorithms to distinguish the role of gene-conversion from single-crossover recombination in the derivation of SNP sequences in populations.

Authors:  Yun S Song; Zhihong Ding; Dan Gusfield; Charles H Langley; Yufeng Wu
Journal:  J Comput Biol       Date:  2007-12       Impact factor: 1.479

3.  A decomposition theory for phylogenetic networks and incompatible characters.

Authors:  Dan Gusfield; Vikas Bansal; Vineet Bafna; Yun S Song
Journal:  J Comput Biol       Date:  2007-12       Impact factor: 1.479

4.  Inferring genome-wide mosaic structure.

Authors:  Qi Zhang; Wei Wang; Leonard McMillan; Fernando Pardo-Manuel De Villena; David Threadgill
Journal:  Pac Symp Biocomput       Date:  2009

5.  Genome-wide compatible SNP intervals and their properties.

Authors:  Jeremy Wang; Fernando Pardo-Manual de Villena; Kyle J Moore; Wei Wang; Qi Zhang; Leonard McMillan
Journal:  ACM Int Conf Bioinform Comput Biol (2010)       Date:  2010-08

6.  Genome-wide inference of ancestral recombination graphs.

Authors:  Matthew D Rasmussen; Melissa J Hubisz; Ilan Gronau; Adam Siepel
Journal:  PLoS Genet       Date:  2014-05-15       Impact factor: 5.917

7.  A survey of combinatorial methods for phylogenetic networks.

Authors:  Daniel H Huson; Celine Scornavacca
Journal:  Genome Biol Evol       Date:  2010-11-15       Impact factor: 3.416

8.  Inference of Ancestral Recombination Graphs through Topological Data Analysis.

Authors:  Pablo G Cámara; Arnold J Levine; Raúl Rabadán
Journal:  PLoS Comput Biol       Date:  2016-08-17       Impact factor: 4.475

9.  Recombinational landscape and population genomics of Caenorhabditis elegans.

Authors:  Matthew V Rockman; Leonid Kruglyak
Journal:  PLoS Genet       Date:  2009-03-13       Impact factor: 5.917

10.  Algorithms to estimate the lower bounds of recombination with or without recurrent mutations.

Authors:  Xiaoming Liu; Yun-Xin Fu
Journal:  BMC Genomics       Date:  2008       Impact factor: 3.969

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.