Literature DB >> 25431544

An Analysis Pipeline with Statistical and Visualization-Guided Knowledge Discovery for Michigan-Style Learning Classifier Systems.

Ryan J Urbanowicz1, Ambrose Granizo-Mackenzie1, Jason H Moore1.   

Abstract

Michigan-style learning classifier systems (M-LCSs) represent an adaptive and powerful class of evolutionary algorithms which distribute the learned solution over a sizable population of rules. However their application to complex real world data mining problems, such as genetic association studies, has been limited. Traditional knowledge discovery strategies for M-LCS rule populations involve sorting and manual rule inspection. While this approach may be sufficient for simpler problems, the confounding influence of noise and the need to discriminate between predictive and non-predictive attributes calls for additional strategies. Additionally, tests of significance must be adapted to M-LCS analyses in order to make them a viable option within fields that require such analyses to assess confidence. In this work we introduce an M-LCS analysis pipeline that combines uniquely applied visualizations with objective statistical evaluation for the identification of predictive attributes, and reliable rule generalizations in noisy single-step data mining problems. This work considers an alternative paradigm for knowledge discovery in M-LCSs, shifting the focus from individual rules to a global, population-wide perspective. We demonstrate the efficacy of this pipeline applied to the identification of epistasis (i.e., attribute interaction) and heterogeneity in noisy simulated genetic association data.

Entities:  

Year:  2012        PMID: 25431544      PMCID: PMC4244006          DOI: 10.1109/MCI.2012.2215124

Source DB:  PubMed          Journal:  IEEE Comput Intell Mag        ISSN: 1556-603X            Impact factor:   11.356


  8 in total

1.  Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity.

Authors:  Marylyn D Ritchie; Lance W Hahn; Jason H Moore
Journal:  Genet Epidemiol       Date:  2003-02       Impact factor: 2.135

Review 2.  Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans.

Authors:  Heather J Cordell
Journal:  Hum Mol Genet       Date:  2002-10-01       Impact factor: 6.150

3.  Accuracy-based learning classifier systems: models, analysis and applications to classification tasks.

Authors:  Ester Bernadó-Mansilla; Josep M Garrell-Guiu
Journal:  Evol Comput       Date:  2003       Impact factor: 3.277

Review 4.  Genetics, statistics and human disease: analytical retooling for complexity.

Authors:  Tricia A Thornton-Wells; Jason H Moore; Jonathan L Haines
Journal:  Trends Genet       Date:  2004-12       Impact factor: 11.639

5.  A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility.

Authors:  Jason H Moore; Joshua C Gilbert; Chia-Ti Tsai; Fu-Tien Chiang; Todd Holden; Nate Barney; Bill C White
Journal:  J Theor Biol       Date:  2006-02-02       Impact factor: 2.691

6.  Problems with genome-wide association studies.

Authors:  Daniel Shriner; Laura K Vaughan; Miguel A Padilla; Hemant K Tiwari
Journal:  Science       Date:  2007-06-29       Impact factor: 47.728

7.  Human microbiome visualization using 3D technology.

Authors:  Jason H Moore; Richard Cowper Sal Lari; Douglas Hill; Patricia L Hibberd; Juliette C Madan
Journal:  Pac Symp Biocomput       Date:  2011

Review 8.  Missing heritability and strategies for finding the underlying causes of complex disease.

Authors:  Evan E Eichler; Jonathan Flint; Greg Gibson; Augustine Kong; Suzanne M Leal; Jason H Moore; Joseph H Nadeau
Journal:  Nat Rev Genet       Date:  2010-06       Impact factor: 53.242

  8 in total
  9 in total

1.  Hard Data Analytics Problems Make for Better Data Analysis Algorithms: Bioinformatics as an Example.

Authors:  Jaume Bacardit; Paweł Widera; Nicola Lazzarini; Natalio Krasnogor
Journal:  Big Data       Date:  2014-09-01       Impact factor: 2.128

2.  A Multi-Core Parallelization Strategy for Statistical Significance Testing in Learning Classifier Systems.

Authors:  James Rudd; Jason H Moore; Ryan J Urbanowicz
Journal:  Evol Intell       Date:  2013-11

Review 3.  Machine learning applications in genetics and genomics.

Authors:  Maxwell W Libbrecht; William Stafford Noble
Journal:  Nat Rev Genet       Date:  2015-05-07       Impact factor: 53.242

4.  ExSTraCS 2.0: Description and Evaluation of a Scalable Learning Classifier System.

Authors:  Ryan J Urbanowicz; Jason H Moore
Journal:  Evol Intell       Date:  2015-04-03

5.  Predicting the difficulty of pure, strict, epistatic models: metrics for simulated model selection.

Authors:  Ryan J Urbanowicz; Jeff Kiralis; Jonathan M Fisher; Jason H Moore
Journal:  BioData Min       Date:  2012-09-26       Impact factor: 2.522

6.  Analysis of mass spectrometry data from the secretome of an explant model of articular cartilage exposed to pro-inflammatory and anti-inflammatory stimuli using machine learning.

Authors:  Anna L Swan; Kirsty L Hillier; Julia R Smith; David Allaway; Susan Liddell; Jaume Bacardit; Ali Mobasheri
Journal:  BMC Musculoskelet Disord       Date:  2013-12-13       Impact factor: 2.362

7.  Functional networks inference from rule-based machine learning models.

Authors:  Nicola Lazzarini; Paweł Widera; Stuart Williamson; Rakesh Heer; Natalio Krasnogor; Jaume Bacardit
Journal:  BioData Min       Date:  2016-09-05       Impact factor: 2.522

8.  Machine Learning Methods as a Tool for Predicting Risk of Illness Applying Next-Generation Sequencing Data.

Authors:  Patrick Murigu Kamau Njage; Clementine Henri; Pimlapas Leekitcharoenphon; Michel-Yves Mistou; Rene S Hendriksen; Tine Hald
Journal:  Risk Anal       Date:  2018-11-21       Impact factor: 4.000

9.  Role of genetic heterogeneity and epistasis in bladder cancer susceptibility and outcome: a learning classifier system approach.

Authors:  Ryan John Urbanowicz; Angeline S Andrew; Margaret Rita Karagas; Jason H Moore
Journal:  J Am Med Inform Assoc       Date:  2013-02-26       Impact factor: 4.497

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.