
HITON: a novel Markov Blanket algorithm for optimal variable selection.

C F Aliferis, I Tsamardinos, A Statnikov.

Abstract

We introduce HITON, a novel, sound, sample-efficient, and highly scalable algorithm for variable selection in classification, regression, and prediction. The algorithm works by inducing the Markov Blanket of the variable to be classified or predicted. We evaluated it empirically on a wide variety of biomedical tasks with different characteristics: (i) bioactivity prediction for drug discovery, (ii) clinical diagnosis of arrhythmias, (iii) bibliographic text categorization, (iv) lung cancer diagnosis from gene expression array data, and (v) proteomics-based prostate cancer detection. State-of-the-art algorithms for each domain were selected for baseline comparison.
RESULTS: (1) HITON reduces the number of variables in the prediction models by three orders of magnitude relative to the original variable set while improving or maintaining accuracy. (2) In the selected tasks and datasets, HITON outperforms the baseline algorithms by selecting variable sets more than two orders of magnitude smaller than those of the baselines.
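
To make the term concrete: in a Bayesian network, the Markov Blanket of a target variable consists of its parents, its children, and the other parents of its children. The sketch below only illustrates that definition on a known graph; it is not the HITON algorithm, which induces the blanket from data. The function name `markov_blanket`, the dictionary layout, and the toy network are all hypothetical and chosen for illustration.

```python
def markov_blanket(parents, target):
    """Return the Markov Blanket of `target` in a DAG.

    `parents` maps each node to the set of its parents (an assumed,
    illustrative representation; the graph is taken as already known).
    """
    direct_parents = set(parents.get(target, ()))
    # Children are the nodes that list the target among their parents.
    children = {node for node, ps in parents.items() if target in ps}
    # Spouses are the other parents of those children.
    spouses = {p for child in children for p in parents[child]} - {target}
    return direct_parents | children | spouses


# Hypothetical toy network, invented for this example only.
toy_dag = {
    "smoking": set(),
    "gene": set(),
    "asbestos": set(),
    "lung_cancer": {"smoking", "gene"},
    "cough": {"lung_cancer", "asbestos"},
    "fatigue": {"cough"},
}

blanket = markov_blanket(toy_dag, "lung_cancer")
print(sorted(blanket))  # ['asbestos', 'cough', 'gene', 'smoking']
```

Here `fatigue` is excluded because it relates to `lung_cancer` only through `cough`, which is the sense in which a Markov Blanket can be far smaller than the full variable set.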

Year:  2003        PMID: 14728126      PMCID: PMC1480117     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076

