Literature DB >> 33490085

ApoPred: Identification of Apolipoproteins and Their Subfamilies With Multifarious Features.

Ting Liu1, Jia-Mao Chen1, Dan Zhang2, Qian Zhang1, Bowen Peng3, Lei Xu4, Hua Tang1,5.   

Abstract

Apolipoprotein is a group of plasma proteins that are associated with a variety of diseases, such as hyperlipidemia, atherosclerosis, Alzheimer's disease, and diabetes. In order to investigate the function of apolipoproteins and to develop effective targets for related diseases, it is necessary to accurately identify and classify apolipoproteins. Although it is possible to identify apolipoproteins accurately through biochemical experiments, they are expensive and time-consuming. This work aims to establish a high-efficiency and high-accuracy prediction model for recognition of apolipoproteins and their subfamilies. We firstly constructed a high-quality benchmark dataset including 270 apolipoproteins and 535 non-apolipoproteins. Based on the dataset, pseudo-amino acid composition (PseAAC) and composition of k-spaced amino acid pairs (CKSAAP) were used as input vectors. To improve the prediction accuracy and eliminate redundant information, analysis of variance (ANOVA) was used to rank the features. And the incremental feature selection was utilized to obtain the best feature subset. Support vector machine (SVM) was proposed to construct the classification model, which could produce the accuracy of 97.27%, sensitivity of 96.30%, and specificity of 97.76% for discriminating apolipoprotein from non-apolipoprotein in 10-fold cross-validation. In addition, the same process was repeated to generate a new model for predicting apolipoprotein subfamilies. The new model could achieve an overall accuracy of 95.93% in 10-fold cross-validation. According to our proposed model, a convenient webserver called ApoPred was established, which can be freely accessed at http://tang-biolab.com/server/ApoPred/service.html. We expect that this work will contribute to apolipoprotein function research and drug development in relevant diseases.
Copyright © 2021 Liu, Chen, Zhang, Zhang, Peng, Xu and Tang.

Entities:  

Keywords:  apolipoprotein; identification; machine learning; multiple features; subfamily-classification

Year:  2021        PMID: 33490085      PMCID: PMC7820372          DOI: 10.3389/fcell.2020.621144

Source DB:  PubMed          Journal:  Front Cell Dev Biol        ISSN: 2296-634X


  64 in total

Review 1.  Survey of Machine Learning Techniques in Drug Discovery.

Authors:  Natalie Stephenson; Emily Shane; Jessica Chase; Jason Rowland; David Ries; Nicola Justice; Jie Zhang; Leong Chan; Renzhi Cao
Journal:  Curr Drug Metab       Date:  2019       Impact factor: 3.731

2.  DNA sequence models of genome-wide Drosophila melanogaster Polycomb binding sites improve generalization to independent Polycomb Response Elements.

Authors:  Bjørn André Bredesen; Marc Rehmsmeier
Journal:  Nucleic Acids Res       Date:  2019-09-05       Impact factor: 16.971

3.  MNDR v2.0: an updated resource of ncRNA-disease associations in mammals.

Authors:  Tianyu Cui; Lin Zhang; Yan Huang; Ying Yi; Puwen Tan; Yue Zhao; Yongfei Hu; Liyan Xu; Enmin Li; Dong Wang
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

4.  Apolipoprotein D subcellular distribution pattern in neuronal cells during oxidative stress.

Authors:  Eva Martínez-Pinilla; Ana Navarro; Cristina Ordóñez; Eva del Valle; Jorge Tolivia
Journal:  Acta Histochem       Date:  2015-05-04       Impact factor: 2.479

5.  Modulation of sphingosine-1-phosphate and apolipoprotein M levels in the plasma, liver and kidneys in streptozotocin-induced diabetic mice.

Authors:  Takahiro Nojiri; Makoto Kurano; Yasunori Tokuhara; Shigeo Ohkubo; Masumi Hara; Hitoshi Ikeda; Kazuhisa Tsukamoto; Yutaka Yatomi
Journal:  J Diabetes Investig       Date:  2014-04-21       Impact factor: 4.232

6.  mACPpred: A Support Vector Machine-Based Meta-Predictor for Identification of Anticancer Peptides.

Authors:  Vinothini Boopathi; Sathiyamoorthy Subramaniyam; Adeel Malik; Gwang Lee; Balachandran Manavalan; Deok-Chun Yang
Journal:  Int J Mol Sci       Date:  2019-04-22       Impact factor: 5.923

7.  Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Leyi Wei; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-04-30

8.  DephosSite: a machine learning approach for discovering phosphotase-specific dephosphorylation sites.

Authors:  Xiaofeng Wang; Renxiang Yan; Jiangning Song
Journal:  Sci Rep       Date:  2016-03-22       Impact factor: 4.379

9.  AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Leyi Wei; Gwang Lee
Journal:  Comput Struct Biotechnol J       Date:  2019-07-03       Impact factor: 7.271

10.  Deep Learning to Predict Protein Backbone Structure from High-Resolution Cryo-EM Density Maps.

Authors:  Dong Si; Spencer A Moritz; Jonas Pfab; Jie Hou; Renzhi Cao; Liguo Wang; Tianqi Wu; Jianlin Cheng
Journal:  Sci Rep       Date:  2020-03-09       Impact factor: 4.379

View more
  1 in total

Review 1.  AOPM: Application of Antioxidant Protein Classification Model in Predicting the Composition of Antioxidant Drugs.

Authors:  Yixiao Zhai; Jingyu Zhang; Tianjiao Zhang; Yue Gong; Zixiao Zhang; Dandan Zhang; Yuming Zhao
Journal:  Front Pharmacol       Date:  2022-01-18       Impact factor: 5.810

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.