Literature DB >> 23929869

Normalized feature vectors: a novel alignment-free sequence comparison method based on the numbers of adjacent amino acids.

De-Shuang Huang1, Hong-Jie Yu.   

Abstract

Based on all kinds of adjacent amino acids (AAA), we map each protein primary sequence into a 400 by ((L-1)) matrix (M). In addition, we further derive a normalized 400-tuple mathematical descriptors (D), which is extracted from the primary protein sequences via singular values decomposition (SVD) of the matrix. The obtained 400-D normalized feature vectors (NFVs) further facilitate our quantitative analysis of protein sequences. Using the normalized representation of the primary protein sequences, we analyze the similarity for different sequences upon two data sets: 1) ND5 sequences from nine species and 2) transferrin sequences of 24 vertebrates. We also compared the results in this study with those from other related works. These two experiments illustrate that our proposed NFV-AAA approach does perform well in the field of similarity analysis of sequence.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23929869     DOI: 10.1109/TCBB.2013.10

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  25 in total

1.  Identifying anticancer peptides by using a generalized chaos game representation.

Authors:  Li Ge; Jiaguo Liu; Yusen Zhang; Matthias Dehmer
Journal:  J Math Biol       Date:  2018-10-05       Impact factor: 2.259

2.  Incorporating EBO-HSIC with SVM for Gene Selection Associated with Cervical Cancer Classification.

Authors:  S Geeitha; M Thangamani
Journal:  J Med Syst       Date:  2018-10-06       Impact factor: 4.460

3.  Predicting protein-protein interactions from primary protein sequences using a novel multi-scale local feature representation scheme and the random forest.

Authors:  Zhu-Hong You; Keith C C Chan; Pengwei Hu
Journal:  PLoS One       Date:  2015-05-06       Impact factor: 3.240

4.  Prediction of protein-protein interactions from amino acid sequences using a novel multi-scale continuous and discontinuous feature set.

Authors:  Zhu-Hong You; Lin Zhu; Chun-Hou Zheng; Hong-Jie Yu; Su-Ping Deng; Zhen Ji
Journal:  BMC Bioinformatics       Date:  2014-12-03       Impact factor: 3.169

5.  PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites.

Authors:  Yong Gao; Weilin Hao; Jing Gu; Diwei Liu; Chao Fan; Zhigang Chen; Lei Deng
Journal:  J Biol Res (Thessalon)       Date:  2016-07-04       Impact factor: 1.889

6.  CIPPN: computational identification of protein pupylation sites by using neural network.

Authors:  Wenzheng Bao; Zhu-Hong You; De-Shuang Huang
Journal:  Oncotarget       Date:  2017-11-06

7.  Protein Sequence Comparison Based on Physicochemical Properties and the Position-Feature Energy Matrix.

Authors:  Lulu Yu; Yusen Zhang; Ivan Gutman; Yongtang Shi; Matthias Dehmer
Journal:  Sci Rep       Date:  2017-04-10       Impact factor: 4.379

8.  An Enhanced Grey Wolf Optimization Based Feature Selection Wrapped Kernel Extreme Learning Machine for Medical Diagnosis.

Authors:  Qiang Li; Huiling Chen; Hui Huang; Xuehua Zhao; ZhenNao Cai; Changfei Tong; Wenbin Liu; Xin Tian
Journal:  Comput Math Methods Med       Date:  2017-01-26       Impact factor: 2.238

9.  Discovery of Bladder Cancer-related Genes Using Integrative Heterogeneous Network Modeling of Multi-omics Data.

Authors:  Chen Peng; Ao Li; Minghui Wang
Journal:  Sci Rep       Date:  2017-11-15       Impact factor: 4.379

10.  Improving protein-protein interactions prediction accuracy using protein evolutionary information and relevance vector machine model.

Authors:  Ji-Yong An; Fan-Rong Meng; Zhu-Hong You; Xing Chen; Gui-Ying Yan; Ji-Pu Hu
Journal:  Protein Sci       Date:  2016-08-09       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.