Literature DB >> 28293275

Approximating the DCJ distance of balanced genomes in linear time.

Diego P Rubert1, Pedro Feijão2, Marília Dias Vieira Braga2, Jens Stoye2, Fábio Henrique Viduani Martinez1.   

Abstract

BACKGROUND: Rearrangements are large-scale mutations in genomes, responsible for complex changes and structural variations. Most rearrangements that modify the organization of a genome can be represented by the double cut and join (DCJ) operation. Given two balanced genomes, i.e., two genomes that have exactly the same number of occurrences of each gene in each genome, we are interested in the problem of computing the rearrangement distance between them, i.e., finding the minimum number of DCJ operations that transform one genome into the other. This problem is known to be NP-hard.
RESULTS: We propose a linear time approximation algorithm with approximation factor O(k) for the DCJ distance problem, where k is the maximum number of occurrences of any gene in the input genomes. Our algorithm works for linear and circular unichromosomal balanced genomes and uses as an intermediate step an O(k)-approximation for the minimum common string partition problem, which is closely related to the DCJ distance problem.
CONCLUSIONS: Experiments on simulated data sets show that our approximation algorithm is very competitive both in efficiency and in quality of the solutions.

Entities:  

Keywords:  Approximation algorithms; Comparative genomics; Double cut and join (DCJ); Genome rearrangements

Year:  2017        PMID: 28293275      PMCID: PMC5345319          DOI: 10.1186/s13015-017-0095-y

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.405


  5 in total

1.  The solution space of sorting by DCJ.

Authors:  Marília D V Braga; Jens Stoye
Journal:  J Comput Biol       Date:  2010-09       Impact factor: 1.479

2.  Efficient sorting of genomic permutations by translocation, inversion and block interchange.

Authors:  Sophia Yancopoulos; Oliver Attie; Richard Friedberg
Journal:  Bioinformatics       Date:  2005-06-09       Impact factor: 6.937

3.  Efficient tools for computing the number of breakpoints and the number of adjacencies between two genomes with duplicate genes.

Authors:  Sébastien Angibaud; Guillaume Fertin; Irena Rusu; Annelyse Thévenin; Stéphane Vialette
Journal:  J Comput Biol       Date:  2008-10       Impact factor: 1.479

4.  An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.

Authors:  Mingfu Shao; Yu Lin; Bernard M E Moret
Journal:  J Comput Biol       Date:  2014-12-17       Impact factor: 1.479

5.  Inapproximability of (1,2)-exemplar distance.

Authors:  Laurent Bulteau; Minghui Jiang
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2013 Nov-Dec       Impact factor: 3.710

  5 in total
  2 in total

1.  Computing the family-free DCJ similarity.

Authors:  Diego P Rubert; Edna A Hoshino; Marília D V Braga; Jens Stoye; Fábio V Martinez
Journal:  BMC Bioinformatics       Date:  2018-05-08       Impact factor: 3.169

2.  The distance and median problems in the single-cut-or-join model with single-gene duplications.

Authors:  Aniket C Mane; Manuel Lafond; Pedro C Feijao; Cedric Chauve
Journal:  Algorithms Mol Biol       Date:  2020-05-04       Impact factor: 1.405

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.