Literature DB >> 30500873

ConDo: protein domain boundary prediction using coevolutionary information.

Seung Hwan Hong1, Keehyoung Joo2, Jooyoung Lee1,2.   

Abstract

MOTIVATION: Domain boundary prediction is one of the most important problems in the study of protein structure and function. Many sequence-based domain boundary prediction methods are either template-based or machine learning (ML) based. ML-based methods often perform poorly due to their use of only local (i.e. short-range) features. These conventional features such as sequence profiles, secondary structures and solvent accessibilities are typically restricted to be within 20 residues of the domain boundary candidate.
RESULTS: To address the performance of ML-based methods, we developed a new protein domain boundary prediction method (ConDo) that utilizes novel long-range features such as coevolutionary information in addition to the aforementioned local window features as inputs for ML. Toward this purpose, two types of coevolutionary information were extracted from multiple sequence alignment using direct coupling analysis: (i) partially aligned sequences, and (ii) correlated mutation information. Both the partially aligned sequence information and the modularity of residue-residue couplings possess long-range correlation information.
AVAILABILITY AND IMPLEMENTATION: https://github.com/gicsaw/ConDo.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30500873     DOI: 10.1093/bioinformatics/bty973

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  FUpred: detecting protein domains through deep-learning-based contact map prediction.

Authors:  Wei Zheng; Xiaogen Zhou; Qiqige Wuyun; Robin Pearce; Yang Li; Yang Zhang
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

Review 2.  I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction.

Authors:  Xiaogen Zhou; Wei Zheng; Yang Li; Robin Pearce; Chengxin Zhang; Eric W Bell; Guijun Zhang; Yang Zhang
Journal:  Nat Protoc       Date:  2022-08-05       Impact factor: 17.021

3.  Multi-head attention-based U-Nets for predicting protein domain boundaries using 1D sequence features and 2D distance maps.

Authors:  Sajid Mahmud; Zhiye Guo; Farhan Quadir; Jian Liu; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2022-07-19       Impact factor: 3.307

4.  High-Performance Deep Learning Toolbox for Genome-Scale Prediction of Protein Structure and Function.

Authors:  Mu Gao; Peik Lund-Andersen; Alex Morehead; Sajid Mahmud; Chen Chen; Xiao Chen; Nabin Giri; Raj S Roy; Farhan Quadir; T Chad Effler; Ryan Prout; Subil Abraham; Wael Elwasif; N Quentin Haas; Jeffrey Skolnick; Jianlin Cheng; Ada Sedova
Journal:  Workshop Mach Learn HPC Environ       Date:  2021-12-27

5.  LOMETS3: integrating deep learning and profile alignment for advanced protein template recognition and function annotation.

Authors:  Wei Zheng; Qiqige Wuyun; Xiaogen Zhou; Yang Li; Peter L Freddolino; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2022-04-14       Impact factor: 19.160

6.  4mCpred-EL: An Ensemble Learning Framework for Identification of DNA N4-methylcytosine Sites in the Mouse Genome.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Da Yeon Lee; Leyi Wei; Gwang Lee
Journal:  Cells       Date:  2019-10-28       Impact factor: 6.600

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.