Literature DB >> 35507261

Profiles of Natural and Designed Protein-Like Sequences Effectively Bridge Protein Sequence Gaps: Implications in Distant Homology Detection.

Gayatri Kumar1, Narayanaswamy Srinivasan2, Sankaran Sandhya3,4.   

Abstract

Sequence-based approaches are fundamental to guide experimental investigations in obtaining structural and/or functional insights into uncharacterized protein families. Powerful profile-based sequence search methods rely on a sequence space continuum to identify non-trivial relationships through homology detection. The computational design of protein-like sequences that serve as "artificial linkers" is useful in identifying relationships between distant members of a structural fold. Such sequences act as intermediates and guide homology searches between distantly related proteins. Here, we describe an approach that represents natural intermediate sequences and designed protein-like sequences as HMM (Hidden Markov Models) profiles, to improve the sensitivity of existing search methods. Searches made within the "Profile database" were shown to recognize the parent structural fold for 90% of the search queries at query coverage better than 60%. For 1040 protein families with no available structure, fold associations were made through searches in the database of natural and designed sequence profiles. Most of the associations were made with the Alpha-alpha superhelix, Transmembrane beta-barrels, TIM barrel, and Immunoglobulin-like beta-sandwich folds. For 11 domain families of unknown functions, we provide confident fold associations using the profiles of designed sequences and a consensus from other fold recognition methods. For two DUFs (Domain families of Unknown Functions), we performed detailed functional annotation through comparisons with characterized templates of families of known function.
© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.

Entities:  

Keywords:  Fold recognition; Functional annotation; Homology; Protein design; Protein domain; Sequence evolution

Mesh:

Substances:

Year:  2022        PMID: 35507261     DOI: 10.1007/978-1-0716-2095-3_5

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  42 in total

1.  GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-04-09       Impact factor: 5.469

2.  FFAS-3D: improving fold recognition by including optimized structural features and template re-ranking.

Authors:  Dong Xu; Lukasz Jaroszewski; Zhanwen Li; Adam Godzik
Journal:  Bioinformatics       Date:  2013-10-15       Impact factor: 6.937

3.  Successful protein fold recognition by optimal sequence threading validated by rigorous blind testing.

Authors:  D T Jones; R T Miller; J M Thornton
Journal:  Proteins       Date:  1995-11

4.  Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition.

Authors:  Wisam Ibrahim; Mohammad Saniee Abadeh
Journal:  J Theor Biol       Date:  2017-03-27       Impact factor: 2.691

5.  I-TASSER-MR: automated molecular replacement for distant-homology proteins using iterative fragment assembly and progressive sequence truncation.

Authors:  Yan Wang; Jouko Virtanen; Zhidong Xue; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

6.  Protein sequences classification by means of feature extraction with substitution matrices.

Authors:  Rabie Saidi; Mondher Maddouri; Engelbert Mephu Nguifo
Journal:  BMC Bioinformatics       Date:  2010-04-08       Impact factor: 3.169

7.  Improving protein fold recognition by extracting fold-specific features from predicted residue-residue contacts.

Authors:  Jianwei Zhu; Haicang Zhang; Shuai Cheng Li; Chao Wang; Lupeng Kong; Shiwei Sun; Wei-Mou Zheng; Dongbo Bu
Journal:  Bioinformatics       Date:  2017-12-01       Impact factor: 6.937

8.  The Phyre2 web portal for protein modeling, prediction and analysis.

Authors:  Lawrence A Kelley; Stefans Mezulis; Christopher M Yates; Mark N Wass; Michael J E Sternberg
Journal:  Nat Protoc       Date:  2015-05-07       Impact factor: 13.491

9.  LOMETS: a local meta-threading-server for protein structure prediction.

Authors:  Sitao Wu; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2007-05-03       Impact factor: 16.971

10.  ORION: a web server for protein fold recognition and structure prediction using evolutionary hybrid profiles.

Authors:  Yassine Ghouzam; Guillaume Postic; Pierre-Edouard Guerin; Alexandre G de Brevern; Jean-Christophe Gelly
Journal:  Sci Rep       Date:  2016-06-20       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.