Literature DB >> 26476414

Prediction of recombinant protein overexpression in Escherichia coli using a machine learning based model (RPOLP).

Narjeskhatoon Habibi1, Alireza Norouzi2, Siti Z Mohd Hashim3, Mohd Shahir Shamsir4, Razip Samian5.   

Abstract

Recombinant protein overexpression, an important biotechnological process, is ruled by complex biological rules which are mostly unknown, is in need of an intelligent algorithm so as to avoid resource-intensive lab-based trial and error experiments in order to determine the expression level of the recombinant protein. The purpose of this study is to propose a predictive model to estimate the level of recombinant protein overexpression for the first time in the literature using a machine learning approach based on the sequence, expression vector, and expression host. The expression host was confined to Escherichia coli which is the most popular bacterial host to overexpress recombinant proteins. To provide a handle to the problem, the overexpression level was categorized as low, medium and high. A set of features which were likely to affect the overexpression level was generated based on the known facts (e.g. gene length) and knowledge gathered from related literature. Then, a representative sub-set of features generated in the previous objective was determined using feature selection techniques. Finally a predictive model was developed using random forest classifier which was able to adequately classify the multi-class imbalanced small dataset constructed. The result showed that the predictive model provided a promising accuracy of 80% on average, in estimating the overexpression level of a recombinant protein.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  E. coli; Feature selection; Machine learning; Prediction; Protein overexpression level; Recombinant protein

Mesh:

Substances:

Year:  2015        PMID: 26476414     DOI: 10.1016/j.compbiomed.2015.09.015

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  1 in total

1.  PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in Escherichia coli.

Authors:  Kulandai Arockia Rajesh Packiam; Chien Wei Ooi; Fuyi Li; Shutao Mei; Beng Ti Tey; Huey Fang Ong; Jiangning Song; Ramakrishnan Nagasundara Ramanan
Journal:  Comput Struct Biotechnol J       Date:  2022-06-03       Impact factor: 6.155

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.