Literature DB >> 34472594

mCNN-ETC: identifying electron transporters and their functional families by using multiple windows scanning techniques in convolutional neural networks with evolutionary information of protein sequences.

Quang-Thai Ho1, Nguyen Quoc Khanh Le2, Yu-Yen Ou3.   

Abstract

In the past decade, convolutional neural networks (CNNs) have been used as powerful tools by scientists to solve visual data tasks. However, many efforts of convolutional neural networks in solving protein function prediction and extracting useful information from protein sequences have certain limitations. In this research, we propose a new method to improve the weaknesses of the previous method. mCNN-ETC is a deep learning model which can transform the protein evolutionary information into image-like data composed of 20 channels, which correspond to the 20 amino acids in the protein sequence. We constructed CNN layers with different scanning windows in parallel to enhance the useful pattern detection ability of the proposed model. Then we filtered specific patterns through the 1-max pooling layer before inputting them into the prediction layer. This research attempts to solve a basic problem in biology in terms of application: predicting electron transporters and classifying their corresponding complexes. The performance result reached an accuracy of 97.41%, which was nearly 6% higher than its predecessor. We have also published a web server on http://bio219.bioinfo.yzu.edu.tw, which can be used for research purposes free of charge.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  convolutional neural network; deep learning; electron transport chain; five complexes; motif scanning; position-specific scoring matrix

Mesh:

Substances:

Year:  2022        PMID: 34472594     DOI: 10.1093/bib/bbab352

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  2 in total

1.  SortPred: The first machine learning based predictor to identify bacterial sortases and their classes using sequence-derived information.

Authors:  Adeel Malik; Sathiyamoorthy Subramaniyam; Chang-Bae Kim; Balachandran Manavalan
Journal:  Comput Struct Biotechnol J       Date:  2021-12-14       Impact factor: 7.271

2.  Identifying SNARE Proteins Using an Alignment-Free Method Based on Multiscan Convolutional Neural Network and PSSM Profiles.

Authors:  Quang-Hien Kha; Quang-Thai Ho; Nguyen Quoc Khanh Le
Journal:  J Chem Inf Model       Date:  2022-09-27       Impact factor: 6.162

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.