Literature DB >> 25571793

Identifying gene clusters by discovering common intervals in indeterminate strings.

Daniel Doerr, Jens Stoye, Sebastian Böcker, Katharina Jahn.   

Abstract

BACKGROUND: Comparative analyses of chromosomal gene orders are successfully used to predict gene clusters in bacterial and fungal genomes. Present models for detecting sets of co-localized genes in chromosomal sequences require prior knowledge of gene family assignments of genes in the dataset of interest. These families are often computationally predicted on the basis of sequence similarity or higher order features of gene products. Errors introduced in this process amplify in subsequent gene order analyses and thus may deteriorate gene cluster prediction.
RESULTS: In this work, we present a new dynamic model and efficient computational approaches for gene cluster prediction suitable in scenarios ranging from traditional gene family-based gene cluster prediction, via multiple conflicting gene family annotations, to gene family-free analysis, in which gene clusters are predicted solely on the basis of a pairwise similarity measure of the genes of different genomes. We evaluate our gene family-free model against a gene family-based model on a dataset of 93 bacterial genomes.
CONCLUSIONS: Our model is able to detect gene clusters that would be also detected with well-established gene family-based approaches. Moreover, we show that it is able to detect conserved regions which are missed by gene family-based methods due to wrong or deficient gene family assignments.

Entities:  

Mesh:

Year:  2014        PMID: 25571793      PMCID: PMC4274641          DOI: 10.1186/1471-2164-15-S6-S2

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


  21 in total

Review 1.  Domains, motifs and clusters in the protein universe.

Authors:  Jinfeng Liu; Burkhard Rost
Journal:  Curr Opin Chem Biol       Date:  2003-02       Impact factor: 8.822

2.  Efficient computation of approximate gene clusters based on reference occurrences.

Authors:  Katharina Jahn
Journal:  J Comput Biol       Date:  2011-09       Impact factor: 1.479

3.  Genome-wide comparative gene family classification.

Authors:  Christian Frech; Nansheng Chen
Journal:  PLoS One       Date:  2010-10-15       Impact factor: 3.240

4.  OrthoDB: the hierarchical catalog of eukaryotic orthologs in 2011.

Authors:  Robert M Waterhouse; Evgeny M Zdobnov; Fredrik Tegenfeldt; Jia Li; Evgenia V Kriventseva
Journal:  Nucleic Acids Res       Date:  2010-10-23       Impact factor: 16.971

5.  MultiMSOAR 2.0: an accurate tool to identify ortholog groups among multiple genomes.

Authors:  Guanqun Shi; Meng-Chih Peng; Tao Jiang
Journal:  PLoS One       Date:  2011-06-21       Impact factor: 3.240

6.  Proteinortho: detection of (co-)orthologs in large-scale analysis.

Authors:  Marcus Lechner; Sven Findeiss; Lydia Steiner; Manja Marz; Peter F Stadler; Sonja J Prohaska
Journal:  BMC Bioinformatics       Date:  2011-04-28       Impact factor: 3.169

7.  eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges.

Authors:  Sean Powell; Damian Szklarczyk; Kalliopi Trachana; Alexander Roth; Michael Kuhn; Jean Muller; Roland Arnold; Thomas Rattei; Ivica Letunic; Tobias Doerks; Lars J Jensen; Christian von Mering; Peer Bork
Journal:  Nucleic Acids Res       Date:  2011-11-16       Impact factor: 16.971

8.  Evolution of gene order conservation in prokaryotes.

Authors:  J Tamames
Journal:  Genome Biol       Date:  2001-06-01       Impact factor: 13.583

9.  InParanoid 7: new algorithms and tools for eukaryotic orthology analysis.

Authors:  Gabriel Ostlund; Thomas Schmitt; Kristoffer Forslund; Tina Köstler; David N Messina; Sanjit Roopra; Oliver Frings; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2009-11-05       Impact factor: 16.971

10.  RegulonDB v8.0: omics data sets, evolutionary conservation, regulatory phrases, cross-validated gold standards and more.

Authors:  Heladia Salgado; Martin Peralta-Gil; Socorro Gama-Castro; Alberto Santos-Zavaleta; Luis Muñiz-Rascado; Jair S García-Sotelo; Verena Weiss; Hilda Solano-Lira; Irma Martínez-Flores; Alejandra Medina-Rivera; Gerardo Salgado-Osorio; Shirley Alquicira-Hernández; Kevin Alquicira-Hernández; Alejandra López-Fuentes; Liliana Porrón-Sotelo; Araceli M Huerta; César Bonavides-Martínez; Yalbi I Balderas-Martínez; Lucia Pannier; Maricela Olvera; Aurora Labastida; Verónica Jiménez-Jacinto; Leticia Vega-Alvarado; Victor Del Moral-Chávez; Alfredo Hernández-Alvarez; Enrique Morett; Julio Collado-Vides
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

View more
  1 in total

1.  Finding approximate gene clusters with Gecko 3.

Authors:  Sascha Winter; Katharina Jahn; Stefanie Wehner; Leon Kuchenbecker; Manja Marz; Jens Stoye; Sebastian Böcker
Journal:  Nucleic Acids Res       Date:  2016-09-26       Impact factor: 16.971

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.