Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Domain architecture comparison for multidomain homology identification.

Literature DB >> 17572026

Domain architecture comparison for multidomain homology identification.

Abstract

Homology identification is the first step for many genomic studies. Current methods, based on sequence comparison, can result in a substantial number of mis-assignments due to the similarity of homologous domains in otherwise unrelated sequences. Here we propose methods to detect homologs through explicit comparison of protein domain content. We developed several schemes for scoring the homology of a pair of protein sequences based on methods used in the field of information retrieval. We evaluate the proposed methods and methods used in the literature using a benchmark of fifteen sequence families of known evolutionary history. The results of these studies demonstrate the effectiveness of comparing domain architectures using these similarity measures. We also demonstrate the importance of both weighting promiscuous domains and of compensating for the statistical effect of having a large number of domains in a protein. Using logistic regression, we demonstrate the benefit of combining similarity measures based on domain content with sequence similarity measures.

Mesh：

Substances：
Proteins

Year: 2007 PMID： 17572026 DOI： 10.1089/cmb.2007.A009

Source DB: PubMed Journal: J Comput Biol ISSN： 1066-5277 Impact factor: 1.479

Keyword Cloud
Cited

19 in total

1. Cocos: Constructing multi-domain protein phylogenies.

Authors: Max Homilius; John Wiedenhoeft; Sebastian Thieme; Christoph Standfuß; Ivan Kel; Roland Krause
Journal: PLoS Curr Date: 2011-06-09

2. Identifying gene clusters by discovering common intervals in indeterminate strings.

Authors: Daniel Doerr; Jens Stoye; Sebastian Böcker; Katharina Jahn
Journal: BMC Genomics Date: 2014-10-17 Impact factor: 3.969

3. Domain architecture conservation in orthologs.

Authors: Kristoffer Forslund; Isabella Pekkari; Erik L L Sonnhammer
Journal: BMC Bioinformatics Date: 2011-08-05 Impact factor: 3.169

4. DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture.

Authors: Byungwook Lee; Doheon Lee
Journal: Nucleic Acids Res Date: 2008-04-14 Impact factor: 16.971

5. Domain similarity based orthology detection.

Authors: Tristan Bitard-Feildel; Carsten Kemena; Jenny M Greenwood; Erich Bornberg-Bauer
Journal: BMC Bioinformatics Date: 2015-05-13 Impact factor: 3.169

6. An alignment-free domain architecture similarity search (ADASS) algorithm for inferring homology between multi-domain proteins.

Authors: Divya P Syamaladevi; Adwait Joshi; Ramanathan Sowdhamini
Journal: Bioinformation Date: 2013-06-08

Domain architecture comparison for multidomain homology identification.

1. Cocos: Constructing multi-domain protein phylogenies.

2. Identifying gene clusters by discovering common intervals in indeterminate strings.

3. Domain architecture conservation in orthologs.

4. DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture.

5. Domain similarity based orthology detection.

6. An alignment-free domain architecture similarity search (ADASS) algorithm for inferring homology between multi-domain proteins.

7. Protein comparison at the domain architecture level.

8. Family classification without domain chaining.

9. Protein domain recurrence and order can enhance prediction of protein functions.

10. Sequence similarity network reveals common ancestry of multidomain proteins.