Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Gotrees: predicting go associations from protein domain composition using decision trees.

Literature DB >> 15759620

Gotrees: predicting go associations from protein domain composition using decision trees.

Abstract

The Gene Ontology (GO) offers a comprehensive and standardized way to describe a protein's biological role. Proteins are annotated with GO terms based on direct or indirect experimental evidence. Term assignments are also inferred from homology and literature mining. Regardless of the type of evidence used, GO assignments are manually curated or electronic. Unfortunately, manual curation cannot keep pace with the data, available from publications and various large experimental datasets. Automated literature-based annotation methods have been developed in order to speed up the annotation. However, they only apply to proteins that have been experimentally investigated or have close homologs with sufficient and consistent annotation. One of the homology-based electronic methods for GO annotation is provided by the InterPro database. The InterPro2GO/PFAM2GO associates individual protein domains with GO terms and thus can be used to annotate the less studied proteins. However, protein classification via a single functional domain demands stringency to avoid large number of false positives. This work broadens the basic approach. We model proteins via their entire functional domain content and train individual decision tree classifiers for each GO term using known protein assignments. We demonstrate that our approach is sensitive, specific and precise, as well as fairly robust to sparse data. We have found that our method is more sensitive when compared to the InterPro2GO performance and suffers only some precision decrease. In comparison to the InterPro2GO we have improved the sensitivity by 22%, 27% and 50% for Molecular Function, Biological Process and Cellular GO terms respectively.

Mesh：

Substances：
Proteins

Year: 2005 PMID： 15759620

Source DB: PubMed Journal: Pac Symp Biocomput ISSN： 2335-6928

Keyword Cloud
Cited

10 in total

Gotrees: predicting go associations from protein domain composition using decision trees.

1. Expanding the landscape of chromatin modification (CM)-related functional domains and genes in human.

2. Domain architecture conservation in orthologs.

3. Genome comparison using Gene Ontology (GO) with statistical testing.

4. Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks.

5. Computational prediction of protein function based on weighted mapping of domains and GO terms.

6. Metagenomic insights into the RDX-degrading potential of the ovine rumen microbiome.

7. Predicting gene function using hierarchical multi-label decision tree ensembles.

8. Protein function prediction using domain families.

9. Protein domain recurrence and order can enhance prediction of protein functions.

10. Exploring inconsistencies in genome-wide protein function annotations: a machine learning approach.