Predicting membrane protein types by the LLDA algorithm.

Tong Wang, Jie Yang, Hong-Bin Shen, Kuo-Chen Chou.

Abstract

Membrane proteins are generally classified into the following eight types: (1) type I transmembrane, (2) type II, (3) type III, (4) type IV, (5) multipass transmembrane, (6) lipid-chain-anchored membrane, (7) GPI-anchored membrane, and (8) peripheral membrane (Chou and Shen, Biochem. Biophys. Res. Commun. 2007, 360: 339-345). Knowing the type of an uncharacterized membrane protein often provides useful clues to its biological function and to how it interacts with other molecules in a biological system. With the explosion of protein sequences generated in the post-genomic age, an automated method for predicting membrane protein type is urgently needed. The PsePSSM (Pseudo Position-Specific Score Matrix) descriptor was recently proposed by Chou and Shen (Biochem. Biophys. Res. Commun. 2007, 360: 339-345) to represent a protein sample. Its advantage is that it combines evolutionary information with sequence-correlation information. However, incorporating all of these effects into a single descriptor can produce a very high-dimensional vector, the so-called "high dimension disaster". To overcome this problem, Chou and Shen adopted a fusion approach. Here, a completely different approach, LLDA (Local Linear Discriminant Analysis), is introduced to extract the key features from the high-dimensional PsePSSM space. The resulting dimension-reduced descriptor vector is a compact representation of the original high-dimensional vector. Our jackknife and independent dataset test results indicate that the LLDA approach is very promising for complicated problems in biological systems, such as predicting membrane protein type.
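To make the dimensionality issue concrete, the following is a minimal sketch of how a PsePSSM-style vector might be assembled. It is not taken from this abstract. It assumes the commonly described form of the descriptor: the per-column means of a normalized PSSM (the evolutionary part), followed by, for each lag, the mean squared difference between scores of residues that many positions apart (the sequence-correlation part). The function name `pse_pssm` and the parameter `max_lag` are hypothetical, and the LLDA reduction step itself is not shown.

```python
from typing import List


def pse_pssm(pssm: List[List[float]], max_lag: int) -> List[float]:
    """Build a PsePSSM-style vector from an L x W score matrix.

    Assumption: W is 20 for a real PSSM (one column per amino acid),
    and the scores have already been normalized. The output holds
    W column means followed by W correlation terms for each lag
    1..max_lag, i.e. W * (1 + max_lag) values in total.
    """
    length = len(pssm)
    if length == 0:
        raise ValueError("PSSM must have at least one row")
    width = len(pssm[0])
    if any(len(row) != width for row in pssm):
        raise ValueError("all PSSM rows must have the same length")
    if not 0 <= max_lag < length:
        raise ValueError("max_lag must be in the range 0..L-1")

    # Evolutionary part: mean score of each column over all positions.
    features = [sum(row[j] for row in pssm) / length for j in range(width)]

    # Sequence-correlation part: for each lag, the mean squared
    # difference between scores of residues `lag` positions apart.
    for lag in range(1, max_lag + 1):
        pairs = length - lag
        for j in range(width):
            total = sum(
                (pssm[i][j] - pssm[i + lag][j]) ** 2 for i in range(pairs)
            )
            features.append(total / pairs)
    return features


if __name__ == "__main__":
    # Toy 3 x 2 matrix (a real PSSM would have 20 columns).
    toy = [[0.0, 1.0], [1.0, 1.0], [0.0, 3.0]]
    print(len(pse_pssm(toy, max_lag=1)))
```

Under these assumptions, a 20-column PSSM yields 20 × (1 + `max_lag`) features, so the vector grows linearly with the number of lags considered. This growth is what motivates a dimension-reduction step before classification.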

Year:  2008        PMID: 18991767     DOI: 10.2174/092986608785849308

Source DB:  PubMed          Journal:  Protein Pept Lett        ISSN: 0929-8665            Impact factor:   1.890

