| Literature DB >> 33312711 |
Ehsan Tabari1, Zhengchang Su1.
Abstract
BACKGROUND: Finding orthologous genes among multiple sequenced genomes is a primary step in comparative genomics studies. With the number of sequenced genomes increasing exponentially, comparative genomics becomes more powerful than ever for genomic analysis. However, the very large number of genomes in need of analysis makes conventional orthology prediction methods incapable of this task. Thus, an ultrafast tool is urgently needed.Entities:
Keywords: Algorithms; Orthologous Genes; Sequence alignment; Software
Year: 2017 PMID: 33312711 PMCID: PMC7731588 DOI: 10.1186/s41044-016-0019-8
Source DB: PubMed Journal: Big Data Anal ISSN: 2058-6345
Fig. 1Flowchart of PorthoMCL. Original OrthoMCL steps are shown in white, and PorthoMCL steps are in grey shades. Black boxes are the externalapplications that PorthoMCL requires
Comparison of runtimes of OrthoMCL and PorthoMCL for different number of genomes
| Genomes | Proteins | BLAST Hits | OrthoMCL | PorthoMCL | Speedup |
|---|---|---|---|---|---|
| 10 | 19,240 | 298,647 | 0:00:18 | 0:00:11 | 164 % |
| 29,912 | 637,091 | 0:01:07 | 0:00:21 | 319% | |
| 30,111 | 656,689 | 0:01:16 | 0:00:23 | 330 % | |
| 32,962 | 721,997 | 0:01:12 | 0:00:24 | 300 % | |
| 50 | 126,020 | 5,771,483 | 0:15:55 | 0:05:55 | 269 % |
| 127,724 | 6,363,917 | 0:27:53 | 0:06:08 | 455 % | |
| 133,974 | 6,418,035 | 0:08:29 | 0:06:15 | 136 % | |
| 138,258 | 7,008,798 | 0:24:06 | 0:06:18 | 383 % | |
| 100 | 252,109 | 18,326,608 | 1:02:58 | 0:31:49 | 198 % |
| 500 | 1,327,716 | 283,850,847 | - | 17:38:55 | ∞ |