Literature DB >> 25717198

A new method to improve network topological similarity search: applied to fold recognition.

John Lhota1, Ruth Hauptman1, Thomas Hart1, Clara Ng1, Lei Xie2.   

Abstract

MOTIVATION: Similarity search is the foundation of bioinformatics. It plays a key role in establishing structural, functional and evolutionary relationships between biological sequences. Although the power of the similarity search has increased steadily in recent years, a high percentage of sequences remain uncharacterized in the protein universe. Thus, new similarity search strategies are needed to efficiently and reliably infer the structure and function of new sequences. The existing paradigm for studying protein sequence, structure, function and evolution has been established based on the assumption that the protein universe is discrete and hierarchical. Cumulative evidence suggests that the protein universe is continuous. As a result, conventional sequence homology search methods may be not able to detect novel structural, functional and evolutionary relationships between proteins from weak and noisy sequence signals. To overcome the limitations in existing similarity search methods, we propose a new algorithmic framework-Enrichment of Network Topological Similarity (ENTS)-to improve the performance of large scale similarity searches in bioinformatics.
RESULTS: We apply ENTS to a challenging unsolved problem: protein fold recognition. Our rigorous benchmark studies demonstrate that ENTS considerably outperforms state-of-the-art methods. As the concept of ENTS can be applied to any similarity metric, it may provide a general framework for similarity search on any set of biological entities, given their representation as a network.
AVAILABILITY AND IMPLEMENTATION: Source code freely available upon request CONTACT: : lxie@iscb.org.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2015        PMID: 25717198      PMCID: PMC4481851          DOI: 10.1093/bioinformatics/btv125

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  53 in total

1.  Protein ranking: from local to global structure in the protein similarity network.

Authors:  Jason Weston; Andre Elisseeff; Dengyong Zhou; Christina S Leslie; William Stafford Noble
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-15       Impact factor: 11.205

2.  Nature of the protein universe.

Authors:  Michael Levitt
Journal:  Proc Natl Acad Sci U S A       Date:  2009-06-18       Impact factor: 11.205

3.  The continuity of protein structure space is an intrinsic property of proteins.

Authors:  Jeffrey Skolnick; Adrian K Arakaki; Seung Yup Lee; Michal Brylinski
Journal:  Proc Natl Acad Sci U S A       Date:  2009-09-01       Impact factor: 11.205

Review 4.  Profile hidden Markov models.

Authors:  S R Eddy
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

Review 5.  Structural trees for protein superfamilies.

Authors:  A V Efimov
Journal:  Proteins       Date:  1997-06

Review 6.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

7.  CATH--a hierarchic classification of protein domain structures.

Authors:  C A Orengo; A D Michie; S Jones; D T Jones; M B Swindells; J M Thornton
Journal:  Structure       Date:  1997-08-15       Impact factor: 5.006

8.  SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Authors:  A G Murzin; S E Brenner; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1995-04-07       Impact factor: 5.469

9.  Comparative protein modelling by satisfaction of spatial restraints.

Authors:  A Sali; T L Blundell
Journal:  J Mol Biol       Date:  1993-12-05       Impact factor: 5.469

Review 10.  Discrete-continuous duality of protein structure space.

Authors:  Ruslan I Sadreyev; Bong-Hyun Kim; Nick V Grishin
Journal:  Curr Opin Struct Biol       Date:  2009-05-29       Impact factor: 6.809

View more
  6 in total

1.  Protein-fold recognition using an improved single-source K diverse shortest paths algorithm.

Authors:  John Lhota; Lei Xie
Journal:  Proteins       Date:  2016-02-04

Review 2.  Providing data science support for systems pharmacology and its implications to drug discovery.

Authors:  Thomas Hart; Lei Xie
Journal:  Expert Opin Drug Discov       Date:  2016-01-09       Impact factor: 6.098

Review 3.  The language of the protein universe.

Authors:  Andrea Scaiewicz; Michael Levitt
Journal:  Curr Opin Genet Dev       Date:  2015-11-03       Impact factor: 5.578

4.  ANTENNA, a Multi-Rank, Multi-Layered Recommender System for Inferring Reliable Drug-Gene-Disease Associations: Repurposing Diazoxide as a Targeted Anti-Cancer Therapy.

Authors:  Annie Wang; Hansaim Lim; Shu-Yuan Cheng; Lei Xie
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2018-03-16       Impact factor: 3.710

5.  CMsearch: simultaneous exploration of protein sequence space and structure space improves not only protein homology detection but also protein structure prediction.

Authors:  Xuefeng Cui; Zhiwu Lu; Sheng Wang; Jim Jing-Yan Wang; Xin Gao
Journal:  Bioinformatics       Date:  2016-06-15       Impact factor: 6.937

6.  Improved genome-scale multi-target virtual screening via a novel collaborative filtering approach to cold-start problem.

Authors:  Hansaim Lim; Paul Gray; Lei Xie; Aleksandar Poleksic
Journal:  Sci Rep       Date:  2016-12-13       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.