| Literature DB >> 24659104 |
Horacio Caniza1, Alfonso E Romero1, Samuel Heron1, Haixuan Yang1, Alessandra Devoto1, Marco Frasca1, Marco Mesiti1, Giorgio Valentini1, Alberto Paccanaro1.
Abstract
SUMMARY: We present GOssTo, the Gene Ontology semantic similarity Tool, a user-friendly software system for calculating semantic similarities between gene products according to the Gene Ontology. GOssTo is bundled with six semantic similarity measures, including both term- and graph-based measures, and has extension capabilities to allow the user to add new similarities. Importantly, for any measure, GOssTo can also calculate the Random Walk Contribution that has been shown to greatly improve the accuracy of similarity measures. GOssTo is very fast, easy to use, and it allows the calculation of similarities on a genomic scale in a few minutes on a regular desktop machine. CONTACT: alberto@cs.rhul.ac.uk AVAILABILITY: GOssTo is available both as a stand-alone application running on GNU/Linux, Windows and MacOS from www.paccanarolab.org/gossto and as a web application from www.paccanarolab.org/gosstoweb. The stand-alone application features a simple and concise command line interface for easy integration into high-throughput data processing pipelines.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24659104 PMCID: PMC4103586 DOI: 10.1093/bioinformatics/btu144
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Time, in minutes, required for calculating semantic similarities for a few model organisms
| Organism | Number of GO terms | Number of annotated genes | Time term-wise | Time gene-wise |
|---|---|---|---|---|
| Arabidopsis | 6610 | 9703 | 3 m 48 s | 43 m 35 s |
| Rat | 9422 | 5270 | 58 m 19 s | 29 m 54 s |
| Mouse | 12961 | 15020 | 24 m 35 s | 689 m 26 s |
| Fly | 7304 | 8235 | 4 m 56 s | 47 m 46 s |
| Yeast | 7077 | 4898 | 4 m 0 s | 23 m 55 s |
| Worm | 4467 | 4370 | 1 m 29 s | 5 m 1 s |
Note: For each organism: number of unique GO terms appearing in the GO annotation; number of annotated genes; time (in minutes and seconds) required for calculating the Resnik semantic similarity including the Random Walk Contribution term- and gene-wise. Calculations used GO experimental evidence codes (EXP, IDA, IPI, IMP, IGI, IEP, TAS) and is_a and part_of GO relationships. Data downloaded in February 2014. Experiments run on AMD Opteron 6128 HE.