Literature DB >> 28559921

The gene family-free median of three.

Daniel Doerr1,2, Metin Balaban1, Pedro Feijão2, Cedric Chauve3.   

Abstract

BACKGROUND: The gene family-free framework for comparative genomics aims at providing methods for gene order analysis that do not require prior gene family assignment, but work directly on a sequence similarity graph. We study two problems related to the breakpoint median of three genomes, which asks for the construction of a fourth genome that minimizes the sum of breakpoint distances to the input genomes.
METHODS: We present a model for constructing a median of three genomes in this family-free setting, based on maximizing an objective function that generalizes the classical breakpoint distance by integrating sequence similarity in the score of a gene adjacency. We study its computational complexity and we describe an integer linear program (ILP) for its exact solution. We further discuss a related problem called family-free adjacencies for k genomes for the special case of [Formula: see text] and present an ILP for its solution. However, for this problem, the computation of exact solutions remains intractable for sufficiently large instances. We then proceed to describe a heuristic method, FFAdj-AM, which performs well in practice.
RESULTS: The developed methods compute accurate positional orthologs for genomes comparable in size of bacterial genomes on simulated data and genomic data acquired from the OMA orthology database. In particular, FFAdj-AM performs equally or better when compared to the well-established gene family prediction tool MultiMSOAR.
CONCLUSIONS: We study the computational complexity of a new family-free model and present algorithms for its solution. With FFAdj-AM, we propose an appealing alternative to established tools for identifying higher confidence positional orthologs.

Entities:  

Keywords:  Breakpoint median; Family-free genome comparison; Positional orthology

Year:  2017        PMID: 28559921      PMCID: PMC5446766          DOI: 10.1186/s13015-017-0106-z

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.405


  13 in total

Review 1.  Homology a personal view on some of the problems.

Authors:  W M Fitch
Journal:  Trends Genet       Date:  2000-05       Impact factor: 11.639

2.  Multichromosomal median and halving problems under different genomic distances.

Authors:  Eric Tannier; Chunfang Zheng; David Sankoff
Journal:  BMC Bioinformatics       Date:  2009-04-22       Impact factor: 3.169

3.  Gene family assignment-free comparative genomics.

Authors:  Daniel Doerr; Annelyse Thévenin; Jens Stoye
Journal:  BMC Bioinformatics       Date:  2012-12-19       Impact factor: 3.169

Review 4.  Functional and evolutionary implications of gene orthology.

Authors:  Toni Gabaldón; Eugene V Koonin
Journal:  Nat Rev Genet       Date:  2013-04-04       Impact factor: 53.242

5.  MultiMSOAR 2.0: an accurate tool to identify ortholog groups among multiple genomes.

Authors:  Guanqun Shi; Meng-Chih Peng; Tao Jiang
Journal:  PLoS One       Date:  2011-06-21       Impact factor: 3.240

Review 6.  Positional orthology: putting genomic evolutionary relationships into context.

Authors:  Colin N Dewey
Journal:  Brief Bioinform       Date:  2011-06-24       Impact factor: 11.622

7.  Proteinortho: detection of (co-)orthologs in large-scale analysis.

Authors:  Marcus Lechner; Sven Findeiss; Lydia Steiner; Manja Marz; Peter F Stadler; Sonja J Prohaska
Journal:  BMC Bioinformatics       Date:  2011-04-28       Impact factor: 3.169

8.  ALF--a simulation framework for genome evolution.

Authors:  Daniel A Dalquen; Maria Anisimova; Gaston H Gonnet; Christophe Dessimoz
Journal:  Mol Biol Evol       Date:  2011-12-08       Impact factor: 16.240

9.  Orthology detection combining clustering and synteny for very large datasets.

Authors:  Marcus Lechner; Maribel Hernandez-Rosales; Daniel Doerr; Nicolas Wieseke; Annelyse Thévenin; Jens Stoye; Roland K Hartmann; Sonja J Prohaska; Peter F Stadler
Journal:  PLoS One       Date:  2014-08-19       Impact factor: 3.240

10.  Metrics for GO based protein semantic similarity: a systematic evaluation.

Authors:  Catia Pesquita; Daniel Faria; Hugo Bastos; António E N Ferreira; André O Falcão; Francisco M Couto
Journal:  BMC Bioinformatics       Date:  2008-04-29       Impact factor: 3.169

View more
  1 in total

1.  The distance and median problems in the single-cut-or-join model with single-gene duplications.

Authors:  Aniket C Mane; Manuel Lafond; Pedro C Feijao; Cedric Chauve
Journal:  Algorithms Mol Biol       Date:  2020-05-04       Impact factor: 1.405

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.