| Literature DB >> 28655294 |
Michael P Dunne1, Steven Kelly2.
Abstract
Entities:
Year: 2017 PMID: 28655294 PMCID: PMC5488338 DOI: 10.1186/s12864-017-3849-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Species Set A, fungal species used for algorithm validation
| Species Name | Source | Strain | Taxonomy ID | References |
|---|---|---|---|---|
|
| JGIa |
| 284,811 | [12] |
|
| JGI |
| 284,592 | [13] [14] |
|
| JGI |
| 284,590 | [13] |
|
| SGDb |
| 559,292 | [24] |
|
| JGI |
| 284,591 | [13] |
aJoint Genome Institute; bSaccharomyces Genome Database
Recovery of removed genes in A. thaliana, averaged over five runs
| 10% annotations removed | 90% annotations removed | |||||||
|---|---|---|---|---|---|---|---|---|
| OrthoFiller | de novo | OrthoFiller | de novo | |||||
| No. genes removed | 2410 | 2410 | 21,683 | 21,683 | ||||
| Total genes founda | 1233 | (37.5) | 13,184 | (426.7) | 11,480 | (96.2) | 42,504 | (223.5) |
| Found genes which overlap removed genesa | 1106 | (37.5) | 4918 | (130.4) | 11,343 | (89.0) | 35,609 | (149.5) |
| Total recovered genesa | 1035 | (31.72) | 2268 | (16.4) | 10,380 | (59.7) | 20,430 | (33.0) |
| Number of split genesa | 67 | (5.4) | 1213 | (23.6) | 944 | (34.4) | 7451 | (37.7) |
| Mean pF-score of found genesa | 0.75 | (0.01) | 0.45 | (<0.01) | 0.70 | (<0.01) | 0.55 | (<0.01) |
| Mean oF-score of found genesa | 0.94 | (<0.01) | 0.51 | (<0.01) | 0.89 | (<0.01) | 0.64 | (<0.01) |
| High-quality found genes (pF-score ≥ 0.95)a | 432.0 | (29.3) | 640.8 | (27.4) | 3419.8 | (21.3) | 9079.6 | (99.5) |
| Lower-quality found genes (pF-score < 0.95)a | 674.6 | (32.1) | 4277.2 | (147.8) | 7923.8 | (99.5) | 26,529.8 | (201.3) |
| Mean pF-score of lower-quality genesa | 0.61 | (0.02) | 0.31 | (<0.01) | 0.57 | (<0.01) | 0.33 | (<0.01) |
| % of lower-quality genes with oF-score < 0.95a | 34.6 | (1.9) | 81.1 | (0.9) | 43.4 | (0.6) | 73.9 | (0.1) |
aNumbers shown are rounded mean values from 5 disjoint removed subsets of genes, with standard deviations bracketed