| Literature DB >> 24067102 |
Gaston K Mazandu1, Nicola J Mulder.
Abstract
BACKGROUND: The use of Gene Ontology (GO) data in protein analyses have largely contributed to the improved outcomes of these analyses. Several GO semantic similarity measures have been proposed in recent years and provide tools that allow the integration of biological knowledge embedded in the GO structure into different biological analyses. There is a need for a unified tool that provides the scientific community with the opportunity to explore these different GO similarity measure approaches and their biological applications.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24067102 PMCID: PMC3849277 DOI: 10.1186/1471-2105-14-284
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Different GO term semantic similarity approaches and functional similarity measures implemented in DaGO-Fun
| | | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| | | ||||||||
| | |||||||||
| Annotation-based | | x | x | x | | | | | |
| | XGraSM | | | | | x | x | x | x |
| | Resnik | | | | | x | x | x | x |
| | Lin | | | | | x | x | x | x |
| | Li et al. | | | | | x | x | x | x |
| Relevance | x | x | x | x | |||||
| Topology-based | | | | | x | | | | |
| | Zhang et al | | | | | | x | | |
| | Wang et al. | | | | | | x | | |
| GO-universal | x | ||||||||
The letter 'x’ indicates that the relevant approach is implemented in DaGO-Fun with the corresponding functional similarity measure.
Figure 1Flowchart of all GO measures implemented in DaGO-Fun. The solid line indicates that the performance of a given measure has already been assessed and the dashed line stands for measures or approaches that have yet to be evaluated.
Figure 2The DaGO-Fun system architecture. The user selects the application and enters the input (GO Ids, Protein Accessions or Gene names, GO Id pairs and protein or gene name pairs). The application is processed from the DaGO-Fun system and results are displayed in a comprehensive format for visualization.
Figure 3Application example of querying IT-GOM and an output summary. The left figure shows the DaGO-Fun interface providing the query form with user input data and the figure on the right displays the results table of protein similarity scores produced by the selected algorithm.
Results obtained after running the GOSP-FIT for specific GO Ids and using different GO term semantic similarity approaches
| | | | | | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| GO:0044255 | 4 | Cellular lipid metabolic
process | 154 | 1590 | 1907 | 225 | 1652 | 1286 | 0.00e+00 | 0.00e+00 |
| GO:0071236 | 6 | Cellular response to
antibiotic | 123 | 307 | 545 | 277 | 739 | 588 | 2.42e-14 | 9.68e-14 |
| GO:0051409 | 3 | Response to nitrosative
stress | 91 | 226 | 426 | 243 | 435 | 418 | 1.07e-14 | 4.26e-14 |
| GO:0052099 | 6 | Acquisition by symbiont of
nutrients | 47 | 128 | 463 | 2 | 418 | 269 | 2.75e-14 | 1.10e-13 |
| from host via siderophores | ||||||||||
GO-universal (GA), Wang et al. (WA), Zhang et al. (ZA), Resnik (RA), Lin (LA) and Li et al. (LLA).
Figure 4Clustering results obtained by running the hierarchical clustering program. Clustering results obtained by running the hierarchical clustering program using different similarity metrics under the DaGO-Fun tool. Protein label is colored according to the process in which the protein is involved. Magenta for proteins involved in GO:00051409, blue for GO:0071236, green for GO:0044255 and red for GO:0052099. (1) Using GO-universal approach. (2) Using Wang et al. approach. (3) Using Resnik approach. (4) Using Li et al. approach.
Running the GOSS-FEAT for specific GO Ids and using different GO term semantic similarity approaches
| GO-Universal | GO:0051409 | Response to nitrosative
stress | 3 | 91 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0006979 | Response to oxidative stress | 3 | 90 | 18 | 8.08e-13 | 5.17e-11 |
| | GO:0052572 | Response to host immune
response | 7 | 92 | 8 | 1.16e-07 | 7.45e-06 |
| Wang et al. | GO:0006979 | Response to oxidative stress | 3 | 226 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0046677 | Response to antibiotic | 4 | 164 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0001666 | Response to hypoxia | 5 | 163 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0006974 | Response to DNA damage
stimulus | 5 | 419 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0009432 | SOS response | 5 | 316 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0034605 | Cellular response to heat | 5 | 351 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0071500 | Cellular response to nitrosative
stress | 5 | 365 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0075136 | Response to host | 5 | 293 | 12 | 9.10e-08 | 5.82e-06 |
| Resnik | GO:0009432 | SOS response | 5 | 294 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0034605 | Cellular response to heat | 5 | 296 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0071500 | Cellular response to nitrosative
stress | 5 | 294 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0009267 | Cellular response to
starvation | 6 | 294 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0071456 | Cellular response to hypoxia | 7 | 370 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0071732 | Cellular response to nitric
oxide | 7 | 369 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0006284 | Base-excision repair | 8 | 361 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0006289 | Nucleotide-excision repair | 8 | 361 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0006307 | DNA dealkylation involved in DNA
repair | 9 | 424 | 18 | 0.00e+00 | 0.00e+00 |
| | GO:0052059 | Evasion or tolerance by symbiont of
host- | 11 | 319 | 18 | 1.63e-12 | 1.04e-10 |
| | | produced reactive oxygen
species | | | | | |
| | GO:0052060 | Evasion or tolerance by symbiont of
host- | 11 | 319 | 18 | 1.63e-12 | 1.04e-10 |
| | | produced nitric oxide | | | | | |
| GO:0051701 | Interaction with host | 4 | 35 | 6 | 1.05e-07 | 6.74e-06 |
IC-based GO semantic similarity tools and functional similarity measures (FSM) they support
| | | |||
|---|---|---|---|---|
| G-SESAME | Web | Topology-based | Wang et al. | ABM |
| | | Annotation-based | Classical Resnik, Lin | Average |
| | | | and Jiang & Conrath | |
| ProteInOn | Web | Annotation-based | Classical Resnik, Lin | BMA and |
| | | | and Jiang & Conrath | SimGIC |
| | | | GraSM | |
| FuSSiMeg | Web | Annotation-based | Classical Resnik, Lin | Max |
| | | | and Jiang & Conrath | |
| | | | GraSM | |
| FunSimMat | Web | Annotation-based | Classical Resnik, Lin | ABM |
| | | | and Jiang & Conrath | |
| | | | SimRel (enhancement) | |
| SemSim | R | Topology-based | Wang et al. Method | ABM |
| | | Annotation-based | Classical Resnik, Lin | Average |
| | | | and Jiang & Conrath | |
| | | | SimRel (enhancement) | |
| csbl.go | R | Annotation-based | Classical Resnik, Lin | SimGIC |
| | | | and Jiang & Conrath | Average |
| GraSM and SimRel | ||||