The gene family-free median of three.

Daniel Doerr¹,², Metin Balaban¹, Pedro Feijão², Cedric Chauve³.

Abstract

BACKGROUND: The gene family-free framework for comparative genomics aims at providing methods for gene order analysis that do not require prior gene family assignment, but work directly on a sequence similarity graph. We study two problems related to the breakpoint median of three genomes, which asks for the construction of a fourth genome that minimizes the sum of breakpoint distances to the input genomes.
METHODS: We present a model for constructing a median of three genomes in this family-free setting, based on maximizing an objective function that generalizes the classical breakpoint distance by integrating sequence similarity in the score of a gene adjacency. We study its computational complexity and we describe an integer linear program (ILP) for its exact solution. We further discuss a related problem called family-free adjacencies for k genomes for the special case of [Formula: see text] and present an ILP for its solution. However, for this problem, the computation of exact solutions remains intractable for sufficiently large instances. We then proceed to describe a heuristic method, FFAdj-AM, which performs well in practice.
RESULTS: On simulated data and on genomic data acquired from the OMA orthology database, the developed methods compute accurate positional orthologs for genomes comparable in size to bacterial genomes. In particular, FFAdj-AM performs as well as or better than the well-established gene family prediction tool MultiMSOAR.
CONCLUSIONS: We study the computational complexity of a new family-free model and present algorithms for its solution. With FFAdj-AM, we propose an appealing alternative to established tools for identifying higher-confidence positional orthologs.
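
The classical breakpoint median that the abstract takes as its starting point can be illustrated with a minimal sketch. It is not the paper's ILP and not FFAdj-AM: it only computes the classical breakpoint distance and the sum-of-distances objective for a candidate median. It assumes circular, unichromosomal genomes that share exactly the same genes, encoded as lists of signed integers; the function names are hypothetical.

```python
from typing import FrozenSet, List, Set, Tuple

# Assumption: a gene is a signed integer; the sign gives its orientation.
Extremity = Tuple[int, str]          # (gene id, "h" for head or "t" for tail)
Adjacency = FrozenSet[Extremity]


def adjacencies(genome: List[int]) -> Set[Adjacency]:
    """Return the adjacencies of a circular genome (assumed circular)."""
    adjs: Set[Adjacency] = set()
    n = len(genome)
    for i in range(n):
        a, b = genome[i], genome[(i + 1) % n]
        # A forward gene is read tail -> head, a reversed gene head -> tail.
        right_of_a = (abs(a), "h" if a > 0 else "t")
        left_of_b = (abs(b), "t" if b > 0 else "h")
        adjs.add(frozenset((right_of_a, left_of_b)))
    return adjs


def breakpoint_distance(g: List[int], h: List[int]) -> int:
    """Number of genes minus the number of adjacencies shared by g and h."""
    return len(g) - len(adjacencies(g) & adjacencies(h))


def median_score(candidate: List[int], genomes: List[List[int]]) -> int:
    """Sum of breakpoint distances from a candidate median to the inputs."""
    return sum(breakpoint_distance(candidate, g) for g in genomes)


# Three toy input genomes over the genes 1..4 (illustrative data only).
g1 = [1, 2, 3, 4]
g2 = [1, 2, -3, 4]
g3 = [1, 3, 2, 4]
```

With these toy genomes, `median_score(g1, [g1, g2, g3])` sums the distances from `g1` to each input; a median is a genome that minimizes this sum. The family-free model described above instead maximizes a score in which adjacencies are weighted by sequence similarity, which this sketch does not attempt.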

Keywords: Breakpoint median; Family-free genome comparison; Positional orthology

Year: 2017 PMID: 28559921 PMCID: PMC5446766 DOI: 10.1186/s13015-017-0106-z

Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
