Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Profiles of Natural and Designed Protein-Like Sequences Effectively Bridge Protein Sequence Gaps: Implications in Distant Homology Detection.

Literature DB >> 35507261

Profiles of Natural and Designed Protein-Like Sequences Effectively Bridge Protein Sequence Gaps: Implications in Distant Homology Detection.

Gayatri Kumar¹, Narayanaswamy Srinivasan², Sankaran Sandhya^3,4.

Abstract

Sequence-based approaches are fundamental to guide experimental investigations in obtaining structural and/or functional insights into uncharacterized protein families. Powerful profile-based sequence search methods rely on a sequence space continuum to identify non-trivial relationships through homology detection. The computational design of protein-like sequences that serve as "artificial linkers" is useful in identifying relationships between distant members of a structural fold. Such sequences act as intermediates and guide homology searches between distantly related proteins. Here, we describe an approach that represents natural intermediate sequences and designed protein-like sequences as HMM (Hidden Markov Models) profiles, to improve the sensitivity of existing search methods. Searches made within the "Profile database" were shown to recognize the parent structural fold for 90% of the search queries at query coverage better than 60%. For 1040 protein families with no available structure, fold associations were made through searches in the database of natural and designed sequence profiles. Most of the associations were made with the Alpha-alpha superhelix, Transmembrane beta-barrels, TIM barrel, and Immunoglobulin-like beta-sandwich folds. For 11 domain families of unknown functions, we provide confident fold associations using the profiles of designed sequences and a consensus from other fold recognition methods. For two DUFs (Domain families of Unknown Functions), we performed detailed functional annotation through comparisons with characterized templates of families of known function.

Entities: Chemical

Keywords: Fold recognition; Functional annotation; Homology; Protein design; Protein domain; Sequence evolution

Mesh：

Substances：
Proteins

Year: 2022 PMID： 35507261 DOI： 10.1007/978-1-0716-2095-3_5

Source DB: PubMed Journal: Methods Mol Biol ISSN： 1064-3745

Keyword Cloud
References

42 in total

1. GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences.

Authors: D T Jones
Journal: J Mol Biol Date: 1999-04-09 Impact factor: 5.469

2. FFAS-3D: improving fold recognition by including optimized structural features and template re-ranking.

Authors: Dong Xu; Lukasz Jaroszewski; Zhanwen Li; Adam Godzik
Journal: Bioinformatics Date: 2013-10-15 Impact factor: 6.937

3. Successful protein fold recognition by optimal sequence threading validated by rigorous blind testing.

Authors: D T Jones; R T Miller; J M Thornton
Journal: Proteins Date: 1995-11

4. Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition.

Authors: Wisam Ibrahim; Mohammad Saniee Abadeh
Journal: J Theor Biol Date: 2017-03-27 Impact factor: 2.691

5. I-TASSER-MR: automated molecular replacement for distant-homology proteins using iterative fragment assembly and progressive sequence truncation.

Authors: Yan Wang; Jouko Virtanen; Zhidong Xue; Yang Zhang
Journal: Nucleic Acids Res Date: 2017-07-03 Impact factor: 16.971

Profiles of Natural and Designed Protein-Like Sequences Effectively Bridge Protein Sequence Gaps: Implications in Distant Homology Detection.

1. GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences.

2. FFAS-3D: improving fold recognition by including optimized structural features and template re-ranking.

3. Successful protein fold recognition by optimal sequence threading validated by rigorous blind testing.

4. Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition.

5. I-TASSER-MR: automated molecular replacement for distant-homology proteins using iterative fragment assembly and progressive sequence truncation.

6. Protein sequences classification by means of feature extraction with substitution matrices.

7. Improving protein fold recognition by extracting fold-specific features from predicted residue-residue contacts.

8. The Phyre2 web portal for protein modeling, prediction and analysis.

9. LOMETS: a local meta-threading-server for protein structure prediction.

10. ORION: a web server for protein fold recognition and structure prediction using evolutionary hybrid profiles.