| Literature DB >> 27443605 |
Hua Tang1, Ping Zou1, Chunmei Zhang1, Rong Chen1, Wei Chen2,3, Hao Lin2.
Abstract
Apolipoprotein is a kind of protein which can transport the lipids through the lymphatic and circulatory systems. The abnormal expression level of apolipoprotein always causes angiocardiopathy. Thus, correct recognition of apolipoprotein from proteomic data is very crucial to the comprehension of cardiovascular system and drug design. This study is to develop a computational model to predict apolipoproteins. In the model, the apolipoproteins and non-apolipoproteins were collected to form benchmark dataset. On the basis of the dataset, we extracted the g-gap dipeptide composition information from residue sequences to formulate protein samples. To exclude redundant information or noise, the analysis of various (ANOVA)-based feature selection technique was proposed to find out the best feature subset. The support vector machine (SVM) was selected as discrimination algorithm. Results show that 96.2% of sensitivity and 99.3% of specificity were achieved in five-fold cross-validation. These findings open new perspectives to improve apolipoproteins prediction by considering the specific dipeptides. We expect that these findings will help to improve drug development in anti-angiocardiopathy disease.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27443605 PMCID: PMC4957217 DOI: 10.1038/srep30441
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1A plot showing the IFS procedure for discriminating apolipoproteins from non-apolipoproteins.
When the top 229 6-gap dipeptides were used to perform prediction, the overall success rate reaches an IFS peak of 98.4% in five-fold cross-validation.
Figure 2A heat map or chromaticity diagram for the F-scores of the 400 6-gap dipeptides.
The blue boxes indicate that the features are enriched in apolipoproteins, while the red boxes indicate that the features are enriched in non-apolipoproteins.
Figure 3A semi-screenshot to show the top page of the ApoliPred webserver.
Its website address is http://lin.uestc.edu.cn/server/apoliPred.