Literature DB >> 19803734

DCJ path formulation for genome transformations which include insertions, deletions, and duplications.

Sophia Yancopoulos1, Richard Friedberg.   

Abstract

We extend the double cut and join operation (DCJ) paradigm to perform genome rearrangements on pairs of genomes having unequal gene content and/or multiple copies by permitting genes in one genome which are completely or partially unmatched in the other. The existence of unmatched gene ends introduces new kinds of paths in the adjacency graph, since some paths can now terminate internal to a chromosome and not on telomeres. We introduce "ghost adjacencies" to supply the missing gene ends in the genome not containing them. Ghosts enable us to close paths that were due to incomplete matching, just as null points enable us to close even paths terminating in telomeres. We define generalized DCJ operations on the generalized adjacency graph, and give a prescription for calculating the DCJ distance for the expanded repertoire of operations, which includes insertions, deletions, and duplications. For the case of insertions and deletions, with linear as well as circular chromosomes, we suggest permitting a "nugh" (half ghost, half null), which can shorten the distance. We give algorithms for the optimal closure, with and without nughs, and give the resulting distance formula in terms of paths. For certain simplest cases, we calculate the number of optimal ways to close the graph.

Entities:  

Mesh:

Year:  2009        PMID: 19803734     DOI: 10.1089/cmb.2009.0092

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  11 in total

1.  Natural family-free genomic distance.

Authors:  Diego P Rubert; Fábio V Martinez; Marília D V Braga
Journal:  Algorithms Mol Biol       Date:  2021-05-10       Impact factor: 1.405

2.  On the weight of indels in genomic distances.

Authors:  Marília D V Braga; Raphael Machado; Leonardo C Ribeiro; Jens Stoye
Journal:  BMC Bioinformatics       Date:  2011-10-05       Impact factor: 3.169

3.  The rise and fall of breakpoint reuse depending on genome resolution.

Authors:  Oliver Attie; Aaron E Darling; Sophia Yancopoulos
Journal:  BMC Bioinformatics       Date:  2011-10-05       Impact factor: 3.169

4.  Genomic distance under gene substitutions.

Authors:  Marília D V Braga; Raphael Machado; Leonardo C Ribeiro; Jens Stoye
Journal:  BMC Bioinformatics       Date:  2011-10-05       Impact factor: 3.169

5.  On the inversion-indel distance.

Authors:  Eyla Willing; Simone Zaccaria; Marília D V Braga; Jens Stoye
Journal:  BMC Bioinformatics       Date:  2013-10-15       Impact factor: 3.169

6.  Chromosome structures: reduction of certain problems with unequal gene content and gene paralogs to integer linear programming.

Authors:  Vassily Lyubetsky; Roman Gershgorin; Konstantin Gorbunov
Journal:  BMC Bioinformatics       Date:  2017-12-06       Impact factor: 3.169

7.  DCJ-Indel sorting revisited.

Authors:  Phillip Ec Compeau
Journal:  Algorithms Mol Biol       Date:  2013-03-01       Impact factor: 1.405

8.  DCJ-indel and DCJ-substitution distances with distinct operation costs.

Authors:  Poly H da Silva; Raphael Machado; Simone Dantas; Marília Dv Braga
Journal:  Algorithms Mol Biol       Date:  2013-07-23       Impact factor: 1.405

9.  A unifying model of genome evolution under parsimony.

Authors:  Benedict Paten; Daniel R Zerbino; Glenn Hickey; David Haussler
Journal:  BMC Bioinformatics       Date:  2014-06-19       Impact factor: 3.169

10.  Representing and decomposing genomic structural variants as balanced integer flows on sequence graphs.

Authors:  Daniel R Zerbino; Tracy Ballinger; Benedict Paten; Glenn Hickey; David Haussler
Journal:  BMC Bioinformatics       Date:  2016-09-29       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.