| Literature DB >> 21113020 |
Adrian M Altenhoff1, Adrian Schneider, Gaston H Gonnet, Christophe Dessimoz.
Abstract
OMA (Orthologous MAtrix) is a database that identifies orthologs among publicly available, complete genomes. Initiated in 2004, the project is at its 11th release. It now includes 1000 genomes, making it one of the largest resources of its kind. Here, we describe recent developments in terms of species covered; the algorithmic pipeline--in particular regarding the treatment of alternative splicing, and new features of the web (OMA Browser) and programming interface (SOAP API). In the second part, we review the various representations provided by OMA and their typical applications. The database is publicly accessible at http://omabrowser.org.Entities:
Mesh:
Year: 2010 PMID: 21113020 PMCID: PMC3013747 DOI: 10.1093/nar/gkq1238
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Growth of the OMA database since its first release.
Figure 2.Visualization of the 1000 genomes included in the 10th release of OMA. Lineage assignments are based on NCBI Taxonomy database (24). Note that the radial scale (mean number of genes/genome) is not cumulative, i.e. it refers to the height of individual “band sectors”.
Figure 3.The four types of ortholog grouping provided in OMA. (a) protein-centric view reports orthologs to a gene of reference; (b) genome pair view lists all orthologs between two species; (c) OMA group view displays sets of genes in which all pairs of genes are orthologs and (d) hierarchical groups, groups of genes that descend from a single common ancestral gene within a given taxonomic range.
Typical applications of orthologs and their most suitable representation of orthology
| Application | Appropriate orthology type | Comments |
|---|---|---|
| Propagate functional annotations between two genomes | Genome pair view, filtered to 1:1 orthologs | According to current models, 1:1 orthologs are likely to have a similar function. |
| Identify all orthologs of a gene in a given set of organisms | Protein-centric view | This includes in-paralogs in the target genomes lineages with respect to the speciation event with the query genome |
| Allign two whole genomes | Genome pair view | Orthologous genes are anchor points. |
| Align protein–protein interaction networks between two genomes | Genome pair view | Orthologs can be used to restrict the alignment search space. Thus, only conserved parts are used as anchors. |
| Identify all genes in vertebrates that descended from the ancestral β-hemoglobin | Hierarchical group of human β-hemoglobin with ‘Vertebrata’ as taxonomic range | This also retrieves γ-hemoglobin and ε-hemoglobin, which are thought to have diverged within the vertebrates ( |
| Identify orthologs to infer a species tree | OMA groups with high-coverage of the species of interest | Since, barring classification error, all sequence pairs within an OMA group are orthologs, the corresponding gene tree is expected to follow the species tree. |