Literature DB >> 31348610

SecProMTB: Support Vector Machine-Based Classifier for Secretory Proteins Using Imbalanced Data Sets Applied to Mycobacterium tuberculosis.

Chaolu Meng1,2, Leyi Wei1, Quan Zou1,3,4.   

Abstract

Secretory proteins of Mycobacterium tuberculosis have created more concern, given their dominant immunogenicity and role in pathogenesis. In view of expensive and time-consuming traditional biochemical experiments, an advanced support vector machine model named SecProMTB is constructed in this study and the proteins are identified by a bioinformatic approach. First, an improved pseudo-amino acid composition (PseAAC) algorithm is used to extract features from all entities. Second, a novel imbalanced-data strategy is proposed and adopted to divide the original data set into train set and test set. Third, to overcome the overfitting problem, feature-ranking algorithms are applied with an increment feature selection. Finally, the model is trained and optimized. Consequently, a model is obtained with an area under the curve of 0.862 and average accuracy of 86% in the independent test. For the convenience of users, SecProMTB and related data are openly accessible at http://server.malab.cn/SecProMTB/index.jsp.
© 2019 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Keywords:  imbalanced-data strategy; improved PseAAC; secretory proteins of Mycobacterium tuberculosis; support vector machine

Mesh:

Substances:

Year:  2019        PMID: 31348610     DOI: 10.1002/pmic.201900007

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  10 in total

1.  Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN Algorithms.

Authors:  Fei Yang; Jiazhi Du; Jiying Lang; Weigang Lu; Lei Liu; Changlong Jin; Qinma Kang
Journal:  Biomed Res Int       Date:  2020-06-21       Impact factor: 3.411

2.  PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method.

Authors:  Jun Wang; Huiwen Zheng; Yang Yang; Wanyue Xiao; Taigang Liu
Journal:  Biomed Res Int       Date:  2020-04-13       Impact factor: 3.411

3.  Predicting Endoplasmic Reticulum Resident Proteins Using Auto-Cross Covariance Transformation With a U-Shaped Residue Weight-Transfer Function.

Authors:  Yang-Yang Miao; Wei Zhao; Guang-Ping Li; Yang Gao; Pu-Feng Du
Journal:  Front Genet       Date:  2019-12-20       Impact factor: 4.599

4.  A SNARE Protein Identification Method Based on iLearnPlus to Efficiently Solve the Data Imbalance Problem.

Authors:  Dong Ma; Zhihua Chen; Zhanpeng He; Xueqin Huang
Journal:  Front Genet       Date:  2022-01-28       Impact factor: 4.599

5.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

6.  ACP-DA: Improving the Prediction of Anticancer Peptides Using Data Augmentation.

Authors:  Xian-Gan Chen; Wen Zhang; Xiaofei Yang; Chenhong Li; Hengling Chen
Journal:  Front Genet       Date:  2021-06-30       Impact factor: 4.599

7.  STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity.

Authors:  Xiangeng Wang; Xiaolei Zhu; Mingzhi Ye; Yanjing Wang; Cheng-Dong Li; Yi Xiong; Dong-Qing Wei
Journal:  Front Bioeng Biotechnol       Date:  2019-11-06

8.  PSBP-SVM: A Machine Learning-Based Computational Identifier for Predicting Polystyrene Binding Peptides.

Authors:  Chaolu Meng; Yang Hu; Ying Zhang; Fei Guo
Journal:  Front Bioeng Biotechnol       Date:  2020-03-31

9.  Early Diagnosis of Hepatocellular Carcinoma Using Machine Learning Method.

Authors:  Zi-Mei Zhang; Jiu-Xin Tan; Fang Wang; Fu-Ying Dao; Zhao-Yue Zhang; Hao Lin
Journal:  Front Bioeng Biotechnol       Date:  2020-03-27

10.  Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions.

Authors:  Yixiao Zhai; Yu Chen; Zhixia Teng; Yuming Zhao
Journal:  Front Cell Dev Biol       Date:  2020-10-29
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.