| Literature DB >> 17181854 |
Francisco M Couto1, Mário J Silva, Vivian Lee, Emily Dimmer, Evelyn Camon, Rolf Apweiler, Harald Kirsch, Dietrich Rebholz-Schuhmann.
Abstract
BACKGROUND: Annotation of proteins with gene ontology (GO) terms is ongoing work and a complex task. Manual GO annotation is precise and precious, but it is time-consuming. Therefore, instead of curated annotations most of the proteins come with uncurated annotations, which have been generated automatically. Text-mining systems that use literature for automatic annotation have been proposed but they do not satisfy the high quality expectations of curators.Entities:
Year: 2006 PMID: 17181854 PMCID: PMC1769513 DOI: 10.1186/1747-5333-1-19
Source DB: PubMed Journal: J Biomed Discov Collab ISSN: 1747-5333
Figure 1List of documents related with a given protein. The list is sorted by the most similar term extracted from each document. The curator can use the Extract option to see the extracted terms together with the evidence text. By default GOAnnotator uses only the abstract, but the curator can use the AddText option to replace or insert text.
Figure 2GO terms extracted. For each uncurated annotation, GOAnnotator shows the similar GO terms extracted from a sentence of the selected document. If any of the sentences provides correct evidence for the uncurated annotation, or if the evidence supports a GO term similar to that present in the uncurated annotation, the curator can use the Add option to store the annotation together with the document reference, the evidence codes and any comments.
Distribution of the GO terms from the selected uncurated annotations through the different aspects of GO.
| GO Aspect | GO Terms |
| molecular function | 54 |
| biological process | 18 |
| cellular component | 6 |
| total | 78 |
Evaluation of the evidence text substantiating uncurated annotations provided by the GOAnnotator.
| Evidence Evaluation | Extracted Annotations |
| correct | 83 |
| incorrect | 6 |
| total | 89 |
Comparison between the extracted GO terms with correct evidence text and the GO terms from the uncurated annotations.
| GO Terms | Extracted Annotations |
| exact | 65 |
| same lineage | 15 |
| different lineage | 3 |
| total | 83 |