Literature DB >> 12952885

OrthoMCL: identification of ortholog groups for eukaryotic genomes.

Li Li1, Christian J Stoeckert, David S Roos.   

Abstract

The identification of orthologous groups is useful for genome annotation, studies on gene/protein evolution, comparative genomics, and the identification of taxonomically restricted sequences. Methods successfully exploited for prokaryotic genome analysis have proved difficult to apply to eukaryotes, however, as larger genomes may contain multiple paralogous genes, and sequence information is often incomplete. OrthoMCL provides a scalable method for constructing orthologous groups across multiple eukaryotic taxa, using a Markov Cluster algorithm to group (putative) orthologs and paralogs. This method performs similarly to the INPARANOID algorithm when applied to two genomes, but can be extended to cluster orthologs from multiple species. OrthoMCL clusters are coherent with groups identified by EGO, but improved recognition of "recent" paralogs permits overlapping EGO groups representing the same gene to be merged. Comparison with previously assigned EC annotations suggests a high degree of reliability, implying utility for automated eukaryotic genome annotation. OrthoMCL has been applied to the proteome data set from seven publicly available genomes (human, fly, worm, yeast, Arabidopsis, the malaria parasite Plasmodium falciparum, and Escherichia coli). A Web interface allows queries based on individual genes or user-defined phylogenetic patterns (http://www.cbil.upenn.edu/gene-family). Analysis of clusters incorporating P. falciparum genes identifies numerous enzymes that were incompletely annotated in first-pass annotation of the parasite genome.

Entities:  

Mesh:

Year:  2003        PMID: 12952885      PMCID: PMC403725          DOI: 10.1101/gr.1224503

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  33 in total

Review 1.  Searching for drug targets in microbial genomes.

Authors:  M Y Galperin; E V Koonin
Journal:  Curr Opin Biotechnol       Date:  1999-12       Impact factor: 9.740

2.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

Review 3.  Homology a personal view on some of the problems.

Authors:  W M Fitch
Journal:  Trends Genet       Date:  2000-05       Impact factor: 11.639

4.  The TIGR gene indices: reconstruction and representation of expressed gene sequences.

Authors:  J Quackenbush; F Liang; I Holt; G Pertea; J Upton
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

5.  The COG database: a tool for genome-scale analysis of protein functions and evolution.

Authors:  R L Tatusov; M Y Galperin; D A Natale; E V Koonin
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

6.  Comparative genomics of the eukaryotes.

Authors:  G M Rubin; M D Yandell; J R Wortman; G L Gabor Miklos; C R Nelson; I K Hariharan; M E Fortini; P W Li; R Apweiler; W Fleischmann; J M Cherry; S Henikoff; M P Skupski; S Misra; M Ashburner; E Birney; M S Boguski; T Brody; P Brokstein; S E Celniker; S A Chervitz; D Coates; A Cravchik; A Gabrielian; R F Galle; W M Gelbart; R A George; L S Goldstein; F Gong; P Guan; N L Harris; B A Hay; R A Hoskins; J Li; Z Li; R O Hynes; S J Jones; P M Kuehl; B Lemaitre; J T Littleton; D K Morrison; C Mungall; P H O'Farrell; O K Pickeral; C Shue; L B Vosshall; J Zhang; Q Zhao; X H Zheng; S Lewis
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

7.  Human and nematode orthologs--lessons from the analysis of 1800 human genes and the proteome of Caenorhabditis elegans.

Authors:  S J Wheelan; M S Boguski; L Duret; W Makałowski
Journal:  Gene       Date:  1999-09-30       Impact factor: 3.688

8.  The COG database: new developments in phylogenetic classification of proteins from complete genomes.

Authors:  R L Tatusov; D A Natale; I V Garkavtsev; T A Tatusova; U T Shankavaram; B S Rao; B Kiryutin; M Y Galperin; N D Fedorova; E V Koonin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

9.  The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species.

Authors:  J Quackenbush; J Cho; D Lee; F Liang; I Holt; S Karamycheva; B Parvizi; G Pertea; R Sultana; J White
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

Review 10.  Comparison of the complete protein sets of worm and yeast: orthology and divergence.

Authors:  S A Chervitz; L Aravind; G Sherlock; C A Ball; E V Koonin; S S Dwight; M A Harris; K Dolinski; S Mohr; T Smith; S Weng; J M Cherry; D Botstein
Journal:  Science       Date:  1998-12-11       Impact factor: 47.728

View more
  2000 in total

1.  Extensive transcriptional response associated with seasonal plasticity of butterfly wing patterns.

Authors:  Emily V Daniels; Rabi Murad; Ali Mortazavi; Robert D Reed
Journal:  Mol Ecol       Date:  2014-12-04       Impact factor: 6.185

2.  In silico work flow for scaffold hopping in Leishmania.

Authors:  Barnali Waugh; Ambarnil Ghosh; Dhananjay Bhattacharyya; Nanda Ghoshal; Rahul Banerjee
Journal:  BMC Res Notes       Date:  2014-11-17

3.  The Iccare web server: an attempt to merge sequence and mapping information for plant and animal species.

Authors:  Cédric Muller; Mathieu Denis; Laurent Gentzbittel; Thomas Faraut
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

4.  Known and novel post-transcriptional regulatory sequences are conserved across plant families.

Authors:  Justin N Vaughn; Sally R Ellingson; Flavio Mignone; Albrecht von Arnim
Journal:  RNA       Date:  2012-01-11       Impact factor: 4.942

5.  Dissecting plant genomes with the PLAZA comparative genomics platform.

Authors:  Michiel Van Bel; Sebastian Proost; Elisabeth Wischnitzki; Sara Movahedi; Christopher Scheerlinck; Yves Van de Peer; Klaas Vandepoele
Journal:  Plant Physiol       Date:  2011-12-23       Impact factor: 8.340

6.  Genome sequence for "Candidatus Mycoplasma haemominutum," a low-pathogenicity hemoplasma species.

Authors:  Emily N Barker; Alistair C Darby; Chris R Helps; Iain R Peters; Margaret A Hughes; Alan D Radford; Marilisa Novacco; Felicitas S Boretti; Regina Hofmann-Lehmann; Séverine Tasker
Journal:  J Bacteriol       Date:  2012-02       Impact factor: 3.490

7.  Two Rumex species from contrasting hydrological niches regulate flooding tolerance through distinct mechanisms.

Authors:  Hans van Veen; Angelika Mustroph; Gregory A Barding; Marleen Vergeer-van Eijk; Rob A M Welschen-Evertman; Ole Pedersen; Eric J W Visser; Cynthia K Larive; Ronald Pierik; Julia Bailey-Serres; Laurentius A C J Voesenek; Rashmi Sasidharan
Journal:  Plant Cell       Date:  2013-11-27       Impact factor: 11.277

8.  Cultivable, Host-Specific Bacteroidetes Symbionts Exhibit Diverse Polysaccharolytic Strategies.

Authors:  Arturo Vera-Ponce de León; Benjamin C Jahnes; Jun Duan; Lennel A Camuy-Vélez; Zakee L Sabree
Journal:  Appl Environ Microbiol       Date:  2020-04-01       Impact factor: 4.792

9.  Repeated replacement of an intrabacterial symbiont in the tripartite nested mealybug symbiosis.

Authors:  Filip Husnik; John P McCutcheon
Journal:  Proc Natl Acad Sci U S A       Date:  2016-08-29       Impact factor: 11.205

10.  PanOCT: automated clustering of orthologs using conserved gene neighborhood for pan-genomic analysis of bacterial strains and closely related species.

Authors:  Derrick E Fouts; Lauren Brinkac; Erin Beck; Jason Inman; Granger Sutton
Journal:  Nucleic Acids Res       Date:  2012-08-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.