| Literature DB >> 27061661 |
Safdar Ali1, Abdul Majid2, Syed Gibran Javed3, Mohsin Sattar4.
Abstract
Early prediction of breast cancer is important for effective treatment and survival. We developed an effective Cost-Sensitive Classifier with GentleBoost Ensemble (Can-CSC-GBE) for the classification of breast cancer using protein amino acid features. In this work, first, discriminant information of the protein sequences related to breast tissue is extracted. Then, the physicochemical properties hydrophobicity and hydrophilicity of amino acids are employed to generate molecule descriptors in different feature spaces. For comparison, we obtained results by combining Cost-Sensitive learning with conventional ensemble of AdaBoostM1 and Bagging. The proposed Can-CSC-GBE system has effectively reduced the misclassification costs and thereby improved the overall classification performance. Our novel approach has highlighted promising results as compared to the state-of-the-art ensemble approaches.Entities:
Keywords: Breast cancer; Cost-sensitive classifier; GentleBoost ensemble; Imbalanced data; Tissue protein
Mesh:
Substances:
Year: 2016 PMID: 27061661 DOI: 10.1016/j.compbiomed.2016.04.002
Source DB: PubMed Journal: Comput Biol Med ISSN: 0010-4825 Impact factor: 4.589