| Literature DB >> 30196077 |
Faisal Javed1, Maqsood Hayat2.
Abstract
The emergence of numerous genome projects has made the experimental classification of the protein localization almost impossible due to the exponential increase in the number of protein samples. However, most of the applications are merely developed for single-plex and completely ignored the presence of one protein at two or more locations in a cell. In this regard, few attempts were carried out to target Multi-label protein localizations; consequently, undesirable accuracies are achieved. This paper presents a novel approach, in which a discrete feature extraction method is fused with physicochemical properties of amino acids by using Chou's general form of Pseudo Amino Acid Composition. The technique is tested on two benchmark datasets namely: Gpos-mploc and Virus-mPLoc. The empirical results demonstrated that the proposed method yields better results via two examined classifiers i.e. ML-KNN and Rank-SVM. It is established that the proposed model has improved values in all performance measures considered for the comparison.Entities:
Keywords: ML-KNN; Pseudo Amino Acid Composition; Rank-SVM; SMOTE; Split Amino Acid Composition
Mesh:
Substances:
Year: 2018 PMID: 30196077 DOI: 10.1016/j.ygeno.2018.09.004
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736