
The footprint sorting problem.

Claudia Fried, Wim Hordijk, Sonja J Prohaska, Claus R Stadler, Peter F Stadler.

Abstract

Phylogenetic footprints are short pieces of noncoding DNA sequence in the vicinity of a gene that are conserved between evolutionarily distant species. A seemingly simple problem is to sort footprints by their order along the genomes. It is complicated by the fact that not all footprints are collinear: they may cross each other. The problem thus becomes the identification of the crossing footprints, the sorting of the remaining collinear cliques, and finally the insertion of the noncollinear ones at "reasonable" positions. We show that solving the footprint sorting problem requires the solution of the "Minimum Weight Vertex Feedback Set Problem", which is known to be NP-complete and APX-hard. Nevertheless, good approximations can be obtained for data sets of interest. The remaining steps of the sorting process are straightforward: computation of the transitive closure of an acyclic graph, linear extension of the resulting partial order, and finally sorting with respect to the linear extension. Alternatively, the footprint sorting problem can be rephrased as a combinatorial optimization problem for which approximate solutions can be obtained by means of general-purpose heuristics. Footprint sortings obtained with different methods can be compared using a version of multiple sequence alignment that allows the identification of unambiguously ordered sublists. As an application, we show that the rat genome has a slightly increased insertion/deletion rate in comparison to the mouse genome.
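
The pipeline outlined in the abstract (remove crossing footprints, then linearly extend the remaining partial order) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the input layout (a dict mapping each footprint name to its position per species), the `weights` argument, the function names, and in particular the greedy cycle-breaking rule, which is a simple stand-in heuristic rather than the authors' approximation method. The sketch relies on `graphlib` from the standard library (Python 3.9 or later), skips the explicit transitive closure because a topological sort does not need it, and does not attempt the final reinsertion of the removed footprints.

```python
from graphlib import TopologicalSorter, CycleError


def precedence_graph(footprints):
    """Map each footprint to the set of footprints that precede it
    in at least one species where both are present (hypothetical layout:
    {name: {species: position}})."""
    preds = {name: set() for name in footprints}
    for a, pos_a in footprints.items():
        for b, pos_b in footprints.items():
            shared = pos_a.keys() & pos_b.keys()
            if a != b and any(pos_a[s] < pos_b[s] for s in shared):
                preds[b].add(a)
    return preds


def sort_footprints(footprints, weights):
    """Greedy stand-in for the feedback vertex set step: while the graph
    has a cycle, drop the lowest-weight footprint on the reported cycle.
    Returns (linear order of the kept footprints, removed footprints)."""
    preds = precedence_graph(footprints)
    removed = []
    while True:
        try:
            # A topological order is a linear extension of the partial order.
            return list(TopologicalSorter(preds).static_order()), removed
        except CycleError as err:
            cycle = err.args[1]  # nodes on one detected cycle
            victim = min(cycle, key=lambda v: weights[v])
            removed.append(victim)
            del preds[victim]
            for p in preds.values():
                p.discard(victim)


# Toy data: X lies after C in species 1 but between A and B in species 2,
# so it crosses B and C.
toy = {
    "A": {"sp1": 1, "sp2": 1},
    "B": {"sp1": 2, "sp2": 3},
    "C": {"sp1": 3, "sp2": 4},
    "X": {"sp1": 4, "sp2": 2},
}
toy_weights = {"A": 3, "B": 3, "C": 3, "X": 1}
order, removed = sort_footprints(toy, toy_weights)
print(order, removed)  # ['A', 'B', 'C'] ['X']
```

In the toy data every cycle passes through X, so the greedy rule removes exactly that footprint. On real data this rule gives no approximation guarantee, which is consistent with the hardness result stated in the abstract.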


Year:  2004        PMID: 15032508     DOI: 10.1021/ci030411+

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  3 in total

1.  SynBlast: assisting the analysis of conserved synteny information.

Authors:  Jörg Lehmann; Peter F Stadler; Sonja J Prohaska
Journal:  BMC Bioinformatics       Date:  2008-08-24       Impact factor: 3.169

2.  Orthologs, turn-over, and remolding of tRNAs in primates and fruit flies.

Authors:  Cristian A Velandia-Huerto; Sarah J Berkemer; Anne Hoffmann; Nancy Retzlaff; Liliana C Romero Marroquín; Maribel Hernández-Rosales; Peter F Stadler; Clara I Bermúdez-Santana
Journal:  BMC Genomics       Date:  2016-08-11       Impact factor: 3.969

3.  Coordinate systems for supergenomes.

Authors:  Fabian Gärtner; Christian Höner Zu Siederdissen; Lydia Müller; Peter F Stadler
Journal:  Algorithms Mol Biol       Date:  2018-09-24       Impact factor: 1.405

