| Literature DB >> 20428317 |
Meng-Ru Ho1, Chun-houh Chen, Wen-chang Lin.
Abstract
The accumulation of complete genomic sequences enhances the need for functional annotation. Associating existing functional annotation of orthologs can speed up the annotation process and even examine the existing annotation. However, current protein sequence-based ortholog databases provide ambiguous and incomplete orthology in eukaryotes. It is because that isoforms, derived by alternative splicing (AS), often share higher sequence similarity to interfere the sequence-based identification. Gene-Oriented Ortholog Database (GOOD) employs genomic locations of transcripts to cluster AS-derived isoforms prior to ortholog delineation to eliminate the interference from AS. From the gene-oriented presentation, isoforms can be clearly associated to their genes to provide comprehensive ortholog information and further be discriminated from paralogs. Aside from, displaying clusters of isoforms between orthologous genes can present the evolution variation at the transcription level. Based on orthology, GOOD additionally comprises functional annotation from the Gene Ontology (GO) database. However, there exist redundant annotations, both parent and child terms assigned to the same gene, in the GO database. It is difficult to precisely draw the numerical comparison of term counts between orthologous genes annotated with redundant terms. Instead of the description only, GOOD further provides the GO graphs to reveal hierarchical-like relationships among divergent functionalities. Therefore, the redundancy of GO terms can be examined, and the context among compared terms is more comprehensive. In sum, GOOD can improve the interpretation in the molecular function from experiments in the model organism and provide clear comparative genomic annotation across organisms. Database URL: http://goods.ibms.sinica.edu.tw/goods/Entities:
Mesh:
Substances:
Year: 2010 PMID: 20428317 PMCID: PMC2860896 DOI: 10.1093/database/baq002
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Snapshots of the GOOD web interface. Panels A and B are the two ways, browse and search functions, for users to select a genomic locus on the website. Users can browse according to genomic positions to look into a specific genomic locus. Or they can achieve the same purpose by searching text of a gene name or a NCBI accession number. Panel C demonstrates the simultaneous display of transcripts and GO annotation between orthologous genomic loci, GTF2IRD1. Transcripts are limited to NCBI reference sequence database, and GO terms are arranged with respect to three ontologies. Users can further click GO terms to see their topology. There are two graphs of GO terms shown in panel D.
| Input: The queried GO term (Q), | |
| Relationship table from GO (term2term: T2T), | |
| Roots (cellular component: CC; | |
| biological process: BP; | |
| molecular function: MF) | |
| Output: The relationship graph of the queried GO term | |
| Step 1 | Recursively query the parents of Q according to T2T, until the obtained parent belonging to {CC, BP, MF} |
| Step 2 | Construct and list all the possible paths from Q to {CC, BP, MF} |
| Step 3 | Find out the longest path (LP) from all the possible paths |
| Step 4 | Horizontally depict LP on the center of a graph |
| Step 5 | Find out the longest common path to LP from the remainder |
| Case 1: Upscore > Downscore | |
| Attach the path onto the existing paths | |
| Case 2: Upscore < Downscore | |
| Attach the path under the existing paths | |
| Case 3: Upscore = Downscore | |
| Attach the path to the side that contains fewer paths | |
| Until all paths are plotted | |