Literature DB >> 26038555

Using homology relations within a database markedly boosts protein sequence similarity search.

Jing Tong1, Ruslan I Sadreyev2, Jimin Pei3, Lisa N Kinch3, Nick V Grishin4.   

Abstract

Inference of homology from protein sequences provides an essential tool for analyzing protein structure, function, and evolution. Current sequence-based homology search methods are still unable to detect many similarities evident from protein spatial structures. In computer science a search engine can be improved by considering networks of known relationships within the search database. Here, we apply this idea to protein-sequence-based homology search and show that it dramatically enhances the search accuracy. Our new method, COMPADRE (COmparison of Multiple Protein sequence Alignments using Database RElationships) assesses the relationship between the query sequence and a hit in the database by considering the similarity between the query and hit's known homologs. This approach increases detection quality, boosting the precision rate from 18% to 83% at half-coverage of all database homologs. The increased precision rate allows detection of a large fraction of protein structural relationships, thus providing structure and function predictions for previously uncharacterized proteins. Our results suggest that this general approach is applicable to a wide variety of methods for detection of biological similarities. The web server is available at prodata.swmed.edu/compadre.

Keywords:  homology detection; homology network; protein modeling; remote sequence similarity search; similarity score

Mesh:

Substances:

Year:  2015        PMID: 26038555      PMCID: PMC4460465          DOI: 10.1073/pnas.1424324112

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  29 in total

1.  Comparison of sequence profiles. Strategies for structural predictions using sequence information.

Authors:  L Rychlewski; L Jaroszewski; W Li; A Godzik
Journal:  Protein Sci       Date:  2000-02       Impact factor: 6.725

2.  HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment.

Authors:  Michael Remmert; Andreas Biegert; Andreas Hauser; Johannes Söding
Journal:  Nat Methods       Date:  2011-12-25       Impact factor: 28.547

3.  Structure of N-terminal domain of ZAP indicates how a zinc-finger protein recognizes complex RNA.

Authors:  Shoudeng Chen; Yihui Xu; Kuo Zhang; Xinlu Wang; Jian Sun; Guangxia Gao; Yingfang Liu
Journal:  Nat Struct Mol Biol       Date:  2012-03-11       Impact factor: 15.369

Review 4.  Next-generation sequencing platforms.

Authors:  Elaine R Mardis
Journal:  Annu Rev Anal Chem (Palo Alto Calif)       Date:  2013       Impact factor: 10.745

5.  Zinc-finger antiviral protein inhibits HIV-1 infection by selectively targeting multiply spliced viral mRNAs for degradation.

Authors:  Yiping Zhu; Guifang Chen; Fengxiang Lv; Xinlu Wang; Xin Ji; Yihui Xu; Jing Sun; Li Wu; Yong-Tang Zheng; Guangxia Gao
Journal:  Proc Natl Acad Sci U S A       Date:  2011-08-29       Impact factor: 11.205

6.  CASP10 results compared to those of previous CASP experiments.

Authors:  Andriy Kryshtafovych; Krzysztof Fidelis; John Moult
Journal:  Proteins       Date:  2013-12-17

7.  Challenging the state of the art in protein structure prediction: Highlights of experimental target structures for the 10th Critical Assessment of Techniques for Protein Structure Prediction Experiment CASP10.

Authors:  Andriy Kryshtafovych; John Moult; Patrick Bales; J Fernando Bazan; Marco Biasini; Alex Burgin; Chen Chen; Frank V Cochran; Timothy K Craig; Rhiju Das; Deborah Fass; Carmela Garcia-Doval; Osnat Herzberg; Donald Lorimer; Hartmut Luecke; Xiaolei Ma; Daniel C Nelson; Mark J van Raaij; Forest Rohwer; Anca Segall; Victor Seguritan; Kornelius Zeth; Torsten Schwede
Journal:  Proteins       Date:  2014-02

8.  Assessment of template-based protein structure predictions in CASP10.

Authors:  Yuanpeng J Huang; Binchen Mao; James M Aramini; Gaetano T Montelione
Journal:  Proteins       Date:  2014-02

9.  Detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison.

Authors:  Mindaugas Margelevicius; Ceslovas Venclovas
Journal:  BMC Bioinformatics       Date:  2010-02-17       Impact factor: 3.169

10.  PROCAIN: protein profile comparison with assisting information.

Authors:  Yong Wang; Ruslan I Sadreyev; Nick V Grishin
Journal:  Nucleic Acids Res       Date:  2009-04-07       Impact factor: 16.971

View more
  3 in total

1.  ECOD: new developments in the evolutionary classification of domains.

Authors:  R Dustin Schaeffer; Yuxing Liao; Hua Cheng; Nick V Grishin
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

2.  AnABlast: a new in silico strategy for the genome-wide search of novel genes and fossil regions.

Authors:  Juan Jimenez; Caia D S Duncan; María Gallardo; Juan Mata; Antonio J Perez-Pulido
Journal:  DNA Res       Date:  2015-10-21       Impact factor: 4.458

3.  UBTOR/KIAA1024 regulates neurite outgrowth and neoplasia through mTOR signaling.

Authors:  Hefei Zhang; Quan Zhang; Ge Gao; Xinjian Wang; Tiantian Wang; Zhitao Kong; Guoxiang Wang; Cuizhen Zhang; Yun Wang; Gang Peng
Journal:  PLoS Genet       Date:  2018-08-06       Impact factor: 5.917

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.