| Literature DB >> 26120567 |
Niloofar Yousefi Moteghaed1, Keivan Maghooli1, Shiva Pirhadi1, Masoud Garshasbi2.
Abstract
The improvement of high-through-put gene profiling based microarrays technology has provided monitoring the expression value of thousands of genes simultaneously. Detailed examination of changes in expression levels of genes can help physicians to have efficient diagnosing, classification of tumors and cancer's types as well as effective treatments. Finding genes that can classify the group of cancers correctly based on hybrid optimization algorithms is the main purpose of this paper. In this paper, a hybrid particle swarm optimization and genetic algorithm method are used for gene selection and also artificial neural network (ANN) is adopted as the classifier. In this work, we have improved the ability of the algorithm for the classification problem by finding small group of biomarkers and also best parameters of the classifier. The proposed approach is tested on three benchmark gene expression data sets: Blood (acute myeloid leukemia, acute lymphoblastic leukemia), colon and breast datasets. We used 10-fold cross-validation to achieve accuracy and also decision tree algorithm to find the relation between the biomarkers for biological point of view. To test the ability of the trained ANN models to categorize the cancers, we analyzed additional blinded samples that were not previously used for the training procedure. Experimental results show that the proposed method can reduce the dimension of the data set and confirm the most informative gene subset and improve classification accuracy with best parameters based on datasets.Entities:
Keywords: Artificial neural network; cancer classification; gene expression; genetic algorithm; particle swarm optimization algorithm
Year: 2015 PMID: 26120567 PMCID: PMC4460670
Source DB: PubMed Journal: J Med Signals Sens ISSN: 2228-7477
Datasets which used for classification problems for testing the efficiency of proposed method
Parameters in PSOGA
A sample chromosome of PSOGA/ANN population
Figure 1Hybrid algorithm flowchart (particle swarm optimization/genetic algorithm/artificial neural network)
The result of applying hybrid algorithm (PSO/GA) to ANN classifier with t-test preprocessing on cancer databases
Figure 2Occurrence frequency of genes by hybrid particle swarm optimization/genetic algorithm/artificial neural network algorithm with 10-fold cross validation. Figures from left to right are: (a) For breast cancer (b) colon cancer and (c) blood cancer type acute lymphoblastic leukemia and acute myeloid leukemia
Discovered biomarkers for all groups by PSO/GA/ANN
Figure 3Heat maps view on three cancer data show the difference behavior of genes in two classes of data. (a-c) The result for breast and colon cancer data and leukemia cancer in types acute lymphoblastic leukemia and acute myeloid leukemia, respectively
Summarizes results and comparison with literatures
Extracted rules by decision tree on 4 cancer database