| Literature DB >> 32818637 |
Chunyan Ao1, Wenyang Zhou2, Lin Gao3, Benzhi Dong4, Liang Yu5.
Abstract
Natural antioxidant proteins are mainly found in plants and animals, which interact to eliminate excessive free radicals and protect cells and DNA from damage, prevent and treat some diseases. Therefore, accurate identification of antioxidant proteins is important for the development of new drugs and research of related diseases. This article proposes novel method based on the combination of random forest and hybrid features that can accurately predict antioxidant proteins. Four single feature extraction methods (188D, profile-based Auto-cross covariance (ACC-PSSM), N-gram, and g-gap) and hybrid feature representation methods were used to feature extraction. Three feature selection methods (MRMD, t-SNE, and the optimal feature set selection) were adopted to determine the optimal features. The new hybrid feature vectors derived by combining 188D with the other three features all have indicators ranging from 0.9550 to 0.9990. The novel method showed better performance compared with the other methods.Keywords: Antioxidant protein; Hybrid feature representation methods; MRMD; Random forest
Year: 2020 PMID: 32818637 DOI: 10.1016/j.ygeno.2020.08.016
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736