Literature DB >> 25123433

PECM: prediction of extracellular matrix proteins using the concept of Chou's pseudo amino acid composition.

Jian Zhang1, Pingping Sun2, Xiaowei Zhao1, Zhiqiang Ma3.   

Abstract

The extracellular matrix proteins (ECMs) are widely found in the tissues of multicellular organisms. They consist of various secreted proteins, mainly polysaccharides and glycoproteins. The ECMs involve the exchange of materials and information between resident cells and the external environment. Accurate identification of ECMs is a significant step in understanding the evolution of cancer as well as promises wide range of potential applications in therapeutic targets or diagnostic markers. In this paper, an accurate computational method named PECM is proposed for identifying ECMs. Here, we explore various sequence-derived discriminative features including evolutionary information, predicted secondary structure, and physicochemical properties. Rather than simply combining the features which may bring information redundancy and unwanted noises, we use Fisher-Markov selector and incremental feature selection approach to search the optimal feature subsets. Then, we train our model by the technique of support vector machine (SVM). PECM achieves good prediction performance with the ACC scores about 86% and 90% on testing and independent datasets, which are competitive with the state-of-the-art ECMs prediction tools. A web-server named PECM which implements the proposed approach is freely available at http://59.73.198.144:8088/PECM/.
Copyright © 2014 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Extracellular matrix proteins; Fisher–Markov selector; Incremental feature selection; Support vector machine

Mesh:

Substances:

Year:  2014        PMID: 25123433     DOI: 10.1016/j.jtbi.2014.08.002

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  6 in total

1.  Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors:  Bin Liu; Junjie Chen; Xiaolong Wang
Journal:  Mol Genet Genomics       Date:  2015-04-21       Impact factor: 3.291

Review 2.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

3.  Prediction of protein solvent accessibility using PSO-SVR with multiple sequence-derived features and weighted sliding window scheme.

Authors:  Jian Zhang; Wenhan Chen; Pingping Sun; Xiaowei Zhao; Zhiqiang Ma
Journal:  BioData Min       Date:  2015-01-31       Impact factor: 2.522

4.  Predicting Presynaptic and Postsynaptic Neurotoxins by Developing Feature Selection Technique.

Authors:  Hua Tang; Yunchun Yang; Chunmei Zhang; Rong Chen; Po Huang; Chenggang Duan; Ping Zou
Journal:  Biomed Res Int       Date:  2017-02-12       Impact factor: 3.411

5.  A Multifeatures Fusion and Discrete Firefly Optimization Method for Prediction of Protein Tyrosine Sulfation Residues.

Authors:  Song Guo; Chunhua Liu; Peng Zhou; Yanling Li
Journal:  Biomed Res Int       Date:  2016-03-10       Impact factor: 3.411

6.  Identification of DNA-binding proteins using multi-features fusion and binary firefly optimization algorithm.

Authors:  Jian Zhang; Bo Gao; Haiting Chai; Zhiqiang Ma; Guifu Yang
Journal:  BMC Bioinformatics       Date:  2016-08-26       Impact factor: 3.169

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.