| Literature DB >> 25428351 |
Evgenia V Kriventseva1, Fredrik Tegenfeldt2, Tom J Petty2, Robert M Waterhouse2, Felipe A Simão2, Igor A Pozdnyakov2, Panagiotis Ioannidis2, Evgeny M Zdobnov3.
Abstract
Orthology, refining the concept of homology, is the cornerstone of evolutionary comparative studies. With the ever-increasing availability of genomic data, inference of orthology has become instrumental for generating hypotheses about gene functions crucial to many studies. This update of the OrthoDB hierarchical catalog of orthologs (http://www.orthodb.org) covers 3027 complete genomes, including the most comprehensive set of 87 arthropods, 61 vertebrates, 227 fungi and 2627 bacteria (sampling the most complete and representative genomes from over 11,000 available). In addition to the most extensive integration of functional annotations from UniProt, InterPro, GO, OMIM, model organism phenotypes and COG functional categories, OrthoDB uniquely provides evolutionary annotations including rates of ortholog sequence divergence, copy-number profiles, sibling groups and gene architectures. We re-designed the entirety of the OrthoDB website from the underlying technology to the user interface, enabling the user to specify species of interest and to select the relevant orthology level by the NCBI taxonomy. The text searches allow use of complex logic with various identifiers of genes, proteins, domains, ontologies or annotation keywords and phrases. Gene copy-number profiles can also be queried. This release comes with the freely available underlying ortholog clustering pipeline (http://www.orthodb.org/software).Entities:
Mesh:
Year: 2014 PMID: 25428351 PMCID: PMC4383991 DOI: 10.1093/nar/gku1220
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Organism coverage of the major resources providing orthology
| Number of genomes | |||||
|---|---|---|---|---|---|
| Database | Total | Bacteria | Eukaryotes | Orthology levels | Availability |
| OrthoDB.v8 | 3028 | 2627 | 401 | 270 | GUI, data, software |
| eggNOG.v4 | 3686 | 2031a | 238 | 107 | GUI, data |
| KEGG-OC | 3098 | 2675 | 256 | n.a. | GUI |
aUsed to define orthologous groups.
Figure 1.OrthoDB web user interface. The orthologous group centric results panel is on the left and the query-building panel is on the right.
Comparative performance of available orthology calling methods versus RefOGs (32)
| RefOGs | |||||||
|---|---|---|---|---|---|---|---|
| Method | Num. of OGs (RefOGs=67) | RefOGs with F1 ≥85% | RefOGs with Presicion ≥85% | RefOGs with Recall ≥85% | Sum: Exact, Akin | Sum: Fused(events), Split(events) | Sum: Complex, Missed |
| OrthoDB v8 (2014) | 112 | 51 | 67 | 46 | 43: 30, 13 | 45: 0(0), 20(45) | 4: 4, 0 |
| OrthoDB v5* (2010) | 156 | 42 | 67 | 34 | 33: 24, 9 | 89: 0(0), 30(89) | 4: 4, 0 |
| OrthoMCL (2.0.8) | 124 | 45 | 64 | 49 | 40: 30, 10 | 51: 2(1), 20(58) | 5: 4, 1 |
| COGsoft (4.2.3) | 164 | 29 | 66 | 19 | 19: 12, 7 | 64: 0(0), 28(64) | 20: 19, 1 |
| OMA (0.99t) | 224 | 20 | 66 | 13 | 12: 8, 4 | 134: 0(0), 31(134) | 24: 23, 1 |
* Used in prior benchmarking (32).
F1 is a harmonic mean of precision and recall (http://en.wikipedia.org/wiki/Sensitivity_and_specificity). RefOG events are defined as follows: ‘Exact’–having 100% of both precision and recall; ‘Akin’–having precision and recall >85% (i.e. up to 1 ‘wrong’ gene for 37% of RefOGs and up to 2 ‘wrong’ genes for another 20% of RefOGs); ‘Fused’–counting fusing events when more than one RefOG represented one method cluster with RefOG recall >85% and summed method cluster precision >85%; ‘Split’–defined symmetrically to Fused when one RefOG is represented by more than one method cluster; ‘Complex’–when the matches can not be classified into another category; ‘Missed’–when a RefOG recall <50%.
Concordance on ‘Variation of Information’ between the methods and RefOGs (lower values indicate more similar classifications)
| Reference | OrthoDB.v8 | OrthoDB.v5 | OrthoMCL | COGsoft | OMA | |
|---|---|---|---|---|---|---|
| Reference | 0 | 7.7 | 12.5 | 10.3 | 17.3 | 20.6 |
| OrthoDB.v8 | 7.7 | 0 | 6 | 7.7 | 12.1 | 15.4 |
| OrthoDB.v5 | 12.5 | 6 | 0 | 7.5 | 9.1 | 11.5 |
| OrthoMCL | 10.3 | 7.7 | 7.5 | 0 | 9.9 | 13.9 |
| COGsoft | 17.3 | 12.1 | 0 | 9.9 | 0 | 10.4 |
| OMA | 20.6 | 15.4 | 9.9 | 13.9 | 10.4 | 0 |