| Literature DB >> 31045538 |
Zhongting Jiang1, Dong Wang1,2,3, Peng Wu1, Yuehui Chen1, Huijie Shang1, Luyao Wang1, Huichun Xie3.
Abstract
BACKGROUND: For a protein to execute its function, ensuring its correct subcellular localization is essential. In addition to biological experiments, bioinformatics is widely used to predict and determine the subcellular localization of proteins. However, single-feature extraction methods cannot effectively handle the huge amount of data and multisite localization of proteins. Thus, we developed a pseudo amino acid composition (PseAAC) method and an entropy density technique to extract feature fusion information from subcellular multisite proteins.Entities:
Keywords: Pseudo amino acid composition (PseAAC); entropy density; multi-label k-nearest neighbors (ML-KNN); subcellular localization of multisite proteins; wML-KNN
Year: 2019 PMID: 31045538 PMCID: PMC6598103 DOI: 10.3233/THC-199018
Source DB: PubMed Journal: Technol Health Care ISSN: 0928-7329 Impact factor: 1.285
Figure 1.Flowchart of the prediction algorithm.
Results after applying entropy density to different algorithms
| Evaluation criterion | Algorithm | |
|---|---|---|
| ML-KNN | wML-KNN | |
| Hamming loss | 0.1783 | 0.0918 |
| One-error | 0.3576 | 0.1855 |
| Coverage | 0.6099 | 0.2333 |
| Average recall | 0.7844 | 0.9018 |
“” indicates that larger values provided better results, whereas “” indicates that smaller values provided better results.
Results after applying PseAAC to different algorithms
| Evaluation criterion | Algorithm | |
|---|---|---|
| ML-KNN | wML-KNN | |
| Hamming loss | 0.1769 | 0.0798 |
| One-error | 0.3595 | 0.1587 |
| Coverage | 0.5946 | 0.2084 |
| Average Recall | 0.7871 | 0.9149 |
“” indicates that larger values provided better results, whereas “” indicates that smaller values provided better results.
Comparative results of two different algorithms adopting the entropy density method
| Algorithm | ML-KNN | wML-KNN |
| Absolute-true | 62.72% | 81.26% |
Comparative results of two different algorithms adopting PseAAC
| Algorithm | ML-KNN | wML-KNN |
| Absolute-true | 62.52% | 83.37% |