Literature DB >> 15302085

Data mining and genetic algorithm based gene/SNP selection.

Shital C Shah1, Andrew Kusiak.   

Abstract

OBJECTIVE: Genomic studies provide large volumes of data with the number of single nucleotide polymorphisms (SNPs) ranging into thousands. The analysis of SNPs permits determining relationships between genotypic and phenotypic information as well as the identification of SNPs related to a disease. The growing wealth of information and advances in biology call for the development of approaches for discovery of new knowledge. One such area is the identification of gene/SNP patterns impacting cure/drug development for various diseases.
METHODS: A new approach for predicting drug effectiveness is presented. The approach is based on data mining and genetic algorithms. A global search mechanism, weighted decision tree, decision-tree-based wrapper, a correlation-based heuristic, and the identification of intersecting feature sets are employed for selecting significant genes.
RESULTS: The feature selection approach has resulted in 85% reduction of number of features. The relative increase in cross-validation accuracy and specificity for the significant gene/SNP set was 10% and 3.2%, respectively.
CONCLUSION: The feature selection approach was successfully applied to data sets for drug and placebo subjects. The number of features has been significantly reduced while the quality of knowledge was enhanced. The feature set intersection approach provided the most significant genes/SNPs. The results reported in the paper discuss associations among SNPs resulting in patient-specific treatment protocols.

Entities:  

Mesh:

Year:  2004        PMID: 15302085     DOI: 10.1016/j.artmed.2004.04.002

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  12 in total

1.  Mapping genes that predict treatment outcome in admixed populations.

Authors:  T M Baye; R A Wilke
Journal:  Pharmacogenomics J       Date:  2010-10-05       Impact factor: 3.550

2.  Identifying differences in protein expression levels by spectral counting and feature selection.

Authors:  P C Carvalho; J Hewel; V C Barbosa; J R Yates
Journal:  Genet Mol Res       Date:  2008-04-15

3.  Next Generation Statistical Genetics: Modeling, Penalization, and Optimization in High-Dimensional Data.

Authors:  Kenneth Lange; Jeanette C Papp; Janet S Sinsheimer; Eric M Sobel
Journal:  Annu Rev Stat Appl       Date:  2014-01-01       Impact factor: 5.810

4.  Cuckoo search epistasis: a new method for exploring significant genetic interactions.

Authors:  M Aflakparast; H Salimi; A Gerami; M-P Dubé; S Visweswaran; A Masoudi-Nejad
Journal:  Heredity (Edinb)       Date:  2014-02-19       Impact factor: 3.821

5.  AncestrySNPminer: a bioinformatics tool to retrieve and develop ancestry informative SNP panels.

Authors:  Sushil Amirisetty; Gurjit K Khurana Hershey; Tesfaye M Baye
Journal:  Genomics       Date:  2012-05-11       Impact factor: 5.736

6.  Comparison of measures of marker informativeness for ancestry and admixture mapping.

Authors:  Lili Ding; Howard Wiener; Tilahun Abebe; Mekbib Altaye; Rodney C P Go; Carolyn Kercsmar; Greg Grabowski; Lisa J Martin; Gurjit K Khurana Hershey; Ranajit Chakorborty; Tesfaye M Baye
Journal:  BMC Genomics       Date:  2011-12-20       Impact factor: 3.969

Review 7.  Mapping asthma-associated variants in admixed populations.

Authors:  Tesfaye B Mersha
Journal:  Front Genet       Date:  2015-09-29       Impact factor: 4.599

8.  FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm.

Authors:  Shouheng Tuo; Junying Zhang; Xiguo Yuan; Yuanyuan Zhang; Zhaowen Liu
Journal:  PLoS One       Date:  2016-03-25       Impact factor: 3.240

9.  Applications of random forest feature selection for fine-scale genetic population assignment.

Authors:  Emma V A Sylvester; Paul Bentzen; Ian R Bradbury; Marie Clément; Jon Pearce; John Horne; Robert G Beiko
Journal:  Evol Appl       Date:  2017-09-14       Impact factor: 5.183

Review 10.  Big data in IBD: big progress for clinical practice.

Authors:  Nasim Sadat Seyed Tabib; Matthew Madgwick; Padhmanand Sudhakar; Bram Verstockt; Tamas Korcsmaros; Séverine Vermeire
Journal:  Gut       Date:  2020-02-28       Impact factor: 23.059

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.