
Hybrid genetic algorithm-neural network: feature extraction for unpreprocessed microarray data.

Dong Ling Tong, Amanda C Schierz.

Abstract

OBJECTIVE: Suitable techniques for microarray analysis have been widely researched, particularly for the study of marker genes expressed in a specific type of cancer. Most machine learning methods applied to significant gene selection focus on the classification ability of the method rather than its selection ability. These methods also require the microarray data to be preprocessed before analysis. The objective of this study is to develop a hybrid genetic algorithm-neural network (GANN) model that emphasises feature selection and can operate on unpreprocessed microarray data.
METHOD: The GANN is a hybrid model in which the fitness value of the genetic algorithm (GA) is based on the number of samples correctly labelled by a standard feedforward artificial neural network (ANN). The model is evaluated on two benchmark microarray datasets with different array platforms and different numbers of classes: a 2-class oligonucleotide microarray dataset for acute leukaemia and a 4-class complementary DNA (cDNA) microarray dataset for small round blue cell tumours (SRBCTs). The underlying concept of the GANN algorithm is to select highly informative genes by co-evolving the GA fitness function and the ANN weights at the same time.
RESULTS: The novel GANN selected approximately 50% of the same genes as the original studies. This may indicate that these common genes are more biologically significant than other genes in the datasets. The remaining 50% of the significant genes identified were used to build predictive models and, for both datasets, the models based on the set of genes extracted by the GANN method produced more accurate results. The results also suggest that the GANN method can not only detect genes that are exclusively associated with a single cancer type but also explore genes that are differentially expressed in multiple cancer types.
CONCLUSIONS: The results show that the GANN model successfully extracted statistically significant genes from the unpreprocessed microarray data, as well as known biologically significant genes. We also show that assessing the biological significance of genes based on classification accuracy may be misleading. Although the GANN's set of extra genes proves to be more statistically significant than those selected by other methods, a biological assessment of these genes is highly recommended to confirm their functionality.
Copyright © 2011 Elsevier B.V. All rights reserved.
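
The METHOD paragraph describes a GA whose fitness is the number of samples correctly labelled by a feedforward ANN. The sketch below illustrates that wrapper idea only and is not the authors' implementation: the synthetic data, the function names (make_data, ann_correct, gann_select), the fixed-size gene subsets, the truncation selection, and the short gradient-descent training of the ANN for each chromosome are all assumptions made for illustration. In particular, the abstract's co-evolution of fitness and ANN weights is replaced here by plain per-chromosome training.

```python
import math
import random


def make_data(rng, n_samples=24, n_genes=12, informative=(0, 1)):
    """Synthetic two-class expression matrix (hypothetical data);
    only the `informative` genes carry class signal."""
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
        for g in informative:
            row[g] += 3.0 if label else -3.0
        X.append(row)
        y.append(label)
    return X, y


def ann_correct(X, y, genes, rng, hidden=3, epochs=30, lr=0.1):
    """Train a one-hidden-layer feedforward net on the chosen genes and
    return how many samples it labels correctly (used as GA fitness)."""
    n_in = len(genes)
    w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)] for _ in range(hidden)]
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(hidden + 1)]

    def forward(row):
        x = [row[g] for g in genes] + [1.0]  # selected genes plus bias input
        h = [math.tanh(sum(w * v for w, v in zip(ws, x))) for ws in w1] + [1.0]
        z = sum(w * v for w, v in zip(w2, h))
        return x, h, 1.0 / (1.0 + math.exp(-z))  # sigmoid output

    for _ in range(epochs):
        for row, t in zip(X, y):
            x, h, out = forward(row)
            d_out = out - t  # cross-entropy gradient at the output unit
            for j in range(hidden):  # update hidden layer before w2 changes
                d_h = d_out * w2[j] * (1.0 - h[j] ** 2)
                for i in range(n_in + 1):
                    w1[j][i] -= lr * d_h * x[i]
            for j in range(hidden + 1):
                w2[j] -= lr * d_out * h[j]
    return sum(int((forward(row)[2] > 0.5) == bool(t)) for row, t in zip(X, y))


def gann_select(X, y, k=2, pop_size=12, generations=15, seed=0):
    """Evolve gene subsets of size k; fitness = samples the ANN gets right."""
    rng = random.Random(seed)
    n_genes = len(X[0])
    pop = [rng.sample(range(n_genes), k) for _ in range(pop_size)]
    best, best_fit = None, -1
    for _ in range(generations):
        scored = sorted(((ann_correct(X, y, c, rng), c) for c in pop),
                        key=lambda s: s[0], reverse=True)
        if scored[0][0] > best_fit:
            best_fit, best = scored[0][0], sorted(scored[0][1])
        parents = [c for _, c in scored[: pop_size // 2]]  # truncation selection
        pop = [list(p) for p in parents]
        while len(pop) < pop_size:
            a, b = rng.sample(parents, 2)
            pool = list(dict.fromkeys(a + b))  # crossover: draw from both parents
            child = rng.sample(pool, k)
            if rng.random() < 0.3:  # mutation: swap in a random gene
                new = rng.randrange(n_genes)
                if new not in child:
                    child[rng.randrange(k)] = new
            pop.append(child)
    return best, best_fit


if __name__ == "__main__":
    X, y = make_data(random.Random(1))
    genes, fitness = gann_select(X, y)
    print("selected genes:", genes, "correctly labelled:", fitness, "of", len(y))
```

Because fitness here is measured on the same samples used to train the network, it is a training-set score; a held-out split or cross-validation would be needed before reading anything into the selected genes.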

Year: 2011   PMID: 21775110   DOI: 10.1016/j.artmed.2011.06.008

Source DB: PubMed   Journal: Artif Intell Med   ISSN: 0933-3657   Impact factor: 5.326

