| Literature DB >> 30832671 |
Javad Zahiri1, Babak Khorsand2, Ali Akbar Yousefi3, Mohammadjavad Kargar3, Ramin Shirali Hossein Zade4, Ghasem Mahdevar5.
Abstract
BACKGROUND: Angiogenesis inhibition research is a cutting edge area in angiogenesis-dependent disease therapy, especially in cancer therapy. Recently, studies on anti-angiogenic peptides have provided promising results in the field of cancer treatment.Entities:
Keywords: Angiogenesis; Anti-angiogenic; Cancer; Cancer treatment; Machine learning; Peptide
Year: 2019 PMID: 30832671 PMCID: PMC6399940 DOI: 10.1186/s12967-019-1813-7
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Distribution of the features used to encode each peptide
| Feature type | No. of features |
|---|---|
| PseAAC (λ = 6) | 28 |
| k-mer composition | |
| k = 2 | 400 |
| k = 3 | 8000 |
| k = 4 | 160,000 |
| k-mer composition (reduced alphabeta) | |
| k = 2 | 64 |
| k = 3 | 512 |
| k = 4 | 4096 |
| Physico-chemical profile | 1910 |
| Atomic profile | 80 |
| Total | 175,062 |
aTo compute k-mer composition features, the reduced amino acid alphabet proposed by Zahiri et al. was applied: the 20 alphabet of amino acids was reduced to a new alphabet with size 8 according to 544 physicochemical and biochemical indices that extracted from AAIndex database (C1 = {A, E}, C2 = {I, L, F, M, V}, C3 = {N, D, T, S}, C4 = {G}, C5 = {P}, C6 = {R, K, Q, H}, C7 = {Y, W}, C8 = {C}). We computed k-mer composition for k = 2, 3, 4 for each peptide
Fig. 1Schematic representation of the proposed method (AntAngioCOOL) for anti-angiogenic peptide prediction
Fig. 2Prediction performance of the three selected classifiers among 227 classifiers to be included in AntAngioCOOL package in the test dataset
Fig. 3Feature importance according to their contribution in final selected features (informative features). a Square chart of the distribution of different feature types in the final important features. b Inner layer of this sunburst chart shows the distribution of different feature types in the primary extracted features (in log scale). The outer layer shows the proportion of each feature type selected as informative features