Literature DB >> 31197318

Sequence alignment using machine learning for accurate template-based protein structure prediction.

Shuichiro Makigaki1, Takashi Ishida1.   

Abstract

MOTIVATION: Template-based modeling, the process of predicting the tertiary structure of a protein by using homologous protein structures, is useful if good templates can be found. Although modern homology detection methods can find remote homologs with high sensitivity, the accuracy of template-based models generated from homology-detection-based alignments is often lower than that from ideal alignments.
RESULTS: In this study, we propose a new method that generates pairwise sequence alignments for more accurate template-based modeling. The proposed method trains a machine learning model using the structural alignment of known homologs. It is difficult to directly predict sequence alignments using machine learning. Thus, when calculating sequence alignments, instead of a fixed substitution matrix, this method dynamically predicts a substitution score from the trained model. We evaluate our method by carefully splitting the training and test datasets and comparing the predicted structure's accuracy with that of state-of-the-art methods. Our method generates more accurate tertiary structure models than those produced from alignments obtained by other methods.
AVAILABILITY AND IMPLEMENTATION: https://github.com/shuichiro-makigaki/exmachina. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2020        PMID: 31197318     DOI: 10.1093/bioinformatics/btz483

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  CRFalign: A Sequence-Structure Alignment of Proteins Based on a Combination of HMM-HMM Comparison and Conditional Random Fields.

Authors:  Sung Jong Lee; Keehyoung Joo; Sangjin Sim; Juyong Lee; In-Ho Lee; Jooyoung Lee
Journal:  Molecules       Date:  2022-06-09       Impact factor: 4.927

2.  Sequence Alignment Using Machine Learning for Accurate Template-based Protein Structure Prediction.

Authors:  Shuichiro Makigaki; Takashi Ishida
Journal:  Bio Protoc       Date:  2020-05-05
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.