| Literature DB >> 35561192 |
Emma Persson1, Erik L L Sonnhammer1.
Abstract
SUMMARY: Predicting orthologs, genes in different species having shared ancestry, is an important task in bioinformatics. Orthology prediction tools are required to make accurate and fast predictions, in order to analyze large amounts of data within a feasible time frame. InParanoid is a well-known algorithm for orthology analysis, shown to perform well in benchmarks, but having the major limitation of long runtimes on large datasets. Here, we present an update to the InParanoid algorithm that can use the faster tool DIAMOND instead of BLAST for the homolog search step. We show that it reduces the runtime by 94%, while still obtaining similar performance in the Quest for Orthologs benchmark.Entities:
Mesh:
Year: 2022 PMID: 35561192 PMCID: PMC9113356 DOI: 10.1093/bioinformatics/btac194
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931
Ortholog prediction methods InParanoid-DIAMOND, InParanoid-BLAST, the three variants of SonicParanoid, Proteinortho, OrthoFinder default and with MSA (using DIAMOND for similarity search), their runtime in hours on the Quest for Orthologs reference proteomes on a 48-thread node, the runtime in CPU hours and the number of appearances on the Pareto frontier out of the 11 Orthology benchmark service tests
| Ortholog prediction method | Runtime (hours) | Runtime (CPU hours) | Appearances on Pareto frontier |
|---|---|---|---|
| InParanoid-DIAMOND (very-sensitive) | 10.49 | 135.8 | 6 |
| InParanoid-BLAST | 166.5 | 5940.1 | 5 |
| SonicParanoid default | 2.44 | 100.8 | 2 |
| SonicParanoid sensitive | 7.23 | 325.4 | 4 |
| SonicParanoid mostSensitive | 15.68 | 725.4 | 7 |
| Proteinortho 6 | 1.69 | 74.2 | 4 |
| OrthoFinder 2 default | 2.75 | 97.8 | 7 |
| OrthoFinder 2 default+MSA | 22.07 | 749.2 | 5 |