Automatic prediction of protein domains from sequence information using a hybrid learning system.

Abstract

MOTIVATION: We describe a novel method for detecting the domain structure of a protein from sequence information alone. The method is based on analyzing multiple sequence alignments that are derived from a database search. Multiple measures are defined to quantify the domain information content of each position along the sequence and are combined into a single predictor using a neural network. The output is further smoothed and post-processed using a probabilistic model to predict the most likely transition positions between domains.
RESULTS: The method was assessed using the domain definitions in SCOP and CATH for proteins of known structure and was compared with several other existing methods. Our method performs well both in terms of accuracy and sensitivity. It improves significantly over the best methods available, even some of the semi-manual ones, while being fully automatic. Our method can also be used to suggest and verify domain partitions based on structural data. A few examples of predicted domain definitions and alternative partitions, as suggested by our method, are also discussed.
AVAILABILITY: An online domain-prediction server is available at http://biozon.org/tools/domains/
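
Illustrative note: the abstract outlines a pipeline in which per-position scores are smoothed and then post-processed into domain transition positions. The sketch below is not the authors' method. It assumes the per-position scores are already available as a list of floats, and it replaces the neural network and the probabilistic model with two simple stand-ins: a moving average and threshold-based peak picking. The function names and the `window`, `threshold`, and `min_gap` parameters are hypothetical.

```python
from typing import List


def smooth(scores: List[float], window: int = 5) -> List[float]:
    """Centered moving average; the window shrinks at the sequence ends."""
    half = window // 2
    out = []
    for i in range(len(scores)):
        lo = max(0, i - half)
        hi = min(len(scores), i + half + 1)
        out.append(sum(scores[lo:hi]) / (hi - lo))
    return out


def candidate_boundaries(
    scores: List[float],
    window: int = 5,
    threshold: float = 0.5,
    min_gap: int = 3,
) -> List[int]:
    """Return positions where the smoothed score peaks above the threshold.

    This is a stand-in for the probabilistic post-processing mentioned in
    the abstract, not a reimplementation of it.
    """
    s = smooth(scores, window)
    peaks: List[int] = []
    for i in range(1, len(s) - 1):
        is_peak = s[i] >= threshold and s[i] >= s[i - 1] and s[i] > s[i + 1]
        # Keep a peak only if it is far enough from the previous one.
        if is_peak and (not peaks or i - peaks[-1] >= min_gap):
            peaks.append(i)
    return peaks


if __name__ == "__main__":
    # Toy scores (assumed data): a short high-scoring stretch in a flat profile.
    toy = [0.0] * 10 + [1.0] * 3 + [0.0] * 10
    print(candidate_boundaries(toy, window=3))
```

With real data, the input would be whatever per-position boundary score a trained predictor produces; the smoothing window and threshold would need tuning against known domain definitions such as those in SCOP or CATH.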

Substances: Proteins

Year: 2004
PMID: 14962932
DOI: 10.1093/bioinformatics/bth086

Source DB: PubMed
Journal: Bioinformatics
ISSN: 1367-4803
Impact factor: 6.937

Cited by

23 in total

1. DDOMAIN: Dividing structures into domains using a normalized domain-domain interaction profile.

2. Computer-aided NMR assay for detecting natively folded structural domains.

3. DomSVR: domain boundary prediction with support vector regression from sequence information alone.

4. HangOut: generating clean PSI-BLAST profiles for domains with long insertions.

5. OPUS-Dom: applying the folding-based method VECFOLD to determine protein domain boundaries.

6. DoBo: Protein domain boundary prediction by integrating evolutionary signals and machine learning.

7. Prediction of protein domain with mRMR feature selection and analysis.

8. Improved general regression network for protein domain boundary prediction.

9. A novel method of predicting protein disordered regions based on sequence features.

10. DomHR: accurately identifying domain boundaries in proteins using a hinge region strategy.