Literature DB >> 17572026

Domain architecture comparison for multidomain homology identification.

N Song1, R D Sedgewick, D Durand.   

Abstract

Homology identification is the first step for many genomic studies. Current methods, based on sequence comparison, can result in a substantial number of mis-assignments due to the similarity of homologous domains in otherwise unrelated sequences. Here we propose methods to detect homologs through explicit comparison of protein domain content. We developed several schemes for scoring the homology of a pair of protein sequences based on methods used in the field of information retrieval. We evaluate the proposed methods and methods used in the literature using a benchmark of fifteen sequence families of known evolutionary history. The results of these studies demonstrate the effectiveness of comparing domain architectures using these similarity measures. We also demonstrate the importance of both weighting promiscuous domains and of compensating for the statistical effect of having a large number of domains in a protein. Using logistic regression, we demonstrate the benefit of combining similarity measures based on domain content with sequence similarity measures.

Mesh:

Substances:

Year:  2007        PMID: 17572026     DOI: 10.1089/cmb.2007.A009

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  19 in total

1.  Cocos: Constructing multi-domain protein phylogenies.

Authors:  Max Homilius; John Wiedenhoeft; Sebastian Thieme; Christoph Standfuß; Ivan Kel; Roland Krause
Journal:  PLoS Curr       Date:  2011-06-09

2.  Identifying gene clusters by discovering common intervals in indeterminate strings.

Authors:  Daniel Doerr; Jens Stoye; Sebastian Böcker; Katharina Jahn
Journal:  BMC Genomics       Date:  2014-10-17       Impact factor: 3.969

3.  Domain architecture conservation in orthologs.

Authors:  Kristoffer Forslund; Isabella Pekkari; Erik L L Sonnhammer
Journal:  BMC Bioinformatics       Date:  2011-08-05       Impact factor: 3.169

4.  DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture.

Authors:  Byungwook Lee; Doheon Lee
Journal:  Nucleic Acids Res       Date:  2008-04-14       Impact factor: 16.971

5.  Domain similarity based orthology detection.

Authors:  Tristan Bitard-Feildel; Carsten Kemena; Jenny M Greenwood; Erich Bornberg-Bauer
Journal:  BMC Bioinformatics       Date:  2015-05-13       Impact factor: 3.169

6.  An alignment-free domain architecture similarity search (ADASS) algorithm for inferring homology between multi-domain proteins.

Authors:  Divya P Syamaladevi; Adwait Joshi; Ramanathan Sowdhamini
Journal:  Bioinformation       Date:  2013-06-08

7.  Protein comparison at the domain architecture level.

Authors:  Byungwook Lee; Doheon Lee
Journal:  BMC Bioinformatics       Date:  2009-12-03       Impact factor: 3.169

8.  Family classification without domain chaining.

Authors:  Jacob M Joseph; Dannie Durand
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

9.  Protein domain recurrence and order can enhance prediction of protein functions.

Authors:  Mario Abdel Messih; Meghana Chitale; Vladimir B Bajic; Daisuke Kihara; Xin Gao
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

10.  Sequence similarity network reveals common ancestry of multidomain proteins.

Authors:  Nan Song; Jacob M Joseph; George B Davis; Dannie Durand
Journal:  PLoS Comput Biol       Date:  2008-05-16       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.