| Literature DB >> 25141912 |
Arun Sharma, Deepak Singla, Mamoon Rashid, Gajendra Pal Singh Raghava1.
Abstract
BACKGROUND: In past, a number of peptides have been reported to possess highly diverse properties ranging from cell penetrating, tumor homing, anticancer, anti-hypertensive, antiviral to antimicrobials. Owing to their excellent specificity, low-toxicity, rich chemical diversity and availability from natural sources, FDA has successfully approved a number of peptide-based drugs and several are in various stages of drug development. Though peptides are proven good drug candidates, their usage is still hindered mainly because of their high susceptibility towards proteases degradation. We have developed an in silico method to predict the half-life of peptides in intestine-like environment and to design better peptides having optimized physicochemical properties and half-life.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25141912 PMCID: PMC4150950 DOI: 10.1186/1471-2105-15-282
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Physicochemical properties and amino acid composition of top 20 peptides having longest half-life (stable peptides) and top 20 peptides having shortest half-life (unstable peptides). (A) physicochemical properties of 16mer peptides having short and long half-life; (B) physicochemical properties of 10mer peptides having short and long half-life; (C) average amino acid composition 16mer peptides having short and long half-life; (D) shows average amino acid composition of 10mer peptides having short and long half-life.
The performance of SVM based models developed on HL10 dataset using different features of peptide sequences
| Input feature | Residues in peptides | Total attributes | R | R 2 | MAE |
|---|---|---|---|---|---|
|
| All residues | 20 | 0.57 | 0.32 | 1.87 |
| 5 N-terminal | 0.39 | 0.12 | 2.12 | ||
| 5 C-terminal | 0.32 | 0.09 | 1.99 | ||
|
| All residues | 200 | 0.22 | 0.02 | 1.87 |
| 5 N-terminal | 100 | 0.06 | -0.06 | 1.27 | |
| 5 C-terminal | 100 | 0.34 | 0.12 | 2.01 | |
|
|
|
|
|
|
|
|
| All residues | 8000 | 0.69 | 0.47 | 1.38 |
The performance of Weka models developed using selected features on HL10 dataset
| Total attributes | Techniques | Selected attributes | Method | R | R 2 | MAE |
|---|---|---|---|---|---|---|
| 20 | KStar | D, G, P, R | Amino acid composition | 0.61 | 0.35 | 1.42 |
|
|
|
|
|
|
|
|
| 8000 | IBk | AAH, AGR, AMP, ARE, ASV, DSI, EEK, ELY, ESK, FCI, FGD, FSL, FSS, FYC, GDS, GFG, GLF, GSI, GTS, ILP, INF, INK, IRN, ITK, KIL, KIS, KLP, LVL, MVL, PGF, PVQ, SGL, SIE, SLR, SVL, VFK, VLF, VYL | Tripeptide composition | 0.73 | 0.35 | 1.39 |
The performance SVM based models developed on HL16 dataset using composition and binary pattern
| Input features | Residues in peptides | Total attributes | R | R 2 | MAE |
|---|---|---|---|---|---|
|
|
|
|
|
|
|
| 5 N-terminal | 0.57 | 0.32 | 0.23 | ||
| 10 N-terminal | 0.77 | 0.60 | 0.24 | ||
| 5 C-terminal | 0.37 | 0.13 | 0.27 | ||
| 10 C-terminal | 0.81 | 0.65 | 0.24 | ||
|
| All residues | 320 | 0.13 | 0.01 | 0.18 |
| 5 N-terminal | 100 | 0.17 | 0.02 | 0.20 | |
| 5 C-terminal | 100 | 0.03 | -0.002 | 0.18 | |
| 10 N-terminal | 200 | 0.22 | 0.02 | 0.18 | |
| 10 C-terminal | 200 | 0.09 | -0.002 | 0.18 | |
|
| All residues | 400 | 0.90 | 0.39 | 0.23 |
|
| All residues | 8000 | 0.90 | 0.31 | 0.24 |
The performance of various models (Weka) using selected features on HL16 dataset
| Total attributes | Techniques | Selected attributes | Method | R | R 2 | MAE |
|---|---|---|---|---|---|---|
| 20 | DecisionTable | C, D, G, R | Amino acid composition | 0.97 | 0.93 | 0.07 |
|
|
|
|
|
|
|
|
| 8000 | DecisionTable | AQC, EAQ, FGD, GFG, QCG | Tripeptide composition | 0.98 | 0.96 | 0.06 |