Literature DB >> 18999186

Improving classification performance with discretization on biomedical datasets.

Jonathan L Lustgarten1, Vanathi Gopalakrishnan, Himanshu Grover, Shyam Visweswaran.   

Abstract

Discretization acts as a variable selection method in addition to transforming the continuous values of the variable to discrete ones. Machine learning algorithms such as Support Vector Machines and Random Forests have been used for classification in high-dimensional genomic and proteomic data due to their robustness to the dimensionality of the data. We show that discretization can help improve significantly the classification performance of these algorithms as well as algorithms like Naïve Bayes that are sensitive to the dimensionality of the data.

Entities:  

Mesh:

Year:  2008        PMID: 18999186      PMCID: PMC2656082     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  5 in total

1.  Prediction of cancer outcome with microarrays: a multiple random validation strategy.

Authors:  Stefan Michiels; Serge Koscielny; Catherine Hill
Journal:  Lancet       Date:  2005 Feb 5-11       Impact factor: 79.321

2.  caGEDA: a web application for the integrated analysis of global gene expression patterns in cancer.

Authors:  Satish Patel; James Lyons-Weiler
Journal:  Appl Bioinformatics       Date:  2004

3.  Feature Selection for Classification of SELDI-TOF-MS Proteomic Profiles.

Authors:  Milos Hauskrecht; Richard Pelikan; David E Malehorn; William L Bigbee; Michael T Lotze; Herbert J Zeh; David C Whitcomb; James Lyons-Weiler
Journal:  Appl Bioinformatics       Date:  2005

4.  Serum proteomic patterns for detection of prostate cancer.

Authors:  Emanuel F Petricoin; David K Ornstein; Cloud P Paweletz; Ali Ardekani; Paul S Hackett; Ben A Hitt; Alfredo Velassco; Christian Trucco; Laura Wiegand; Kamillah Wood; Charles B Simone; Peter J Levine; W Marston Linehan; Michael R Emmert-Buck; Seth M Steinberg; Elise C Kohn; Lance A Liotta
Journal:  J Natl Cancer Inst       Date:  2002-10-16       Impact factor: 13.506

5.  Are random forests better than support vector machines for microarray-based cancer classification?

Authors:  Alexander Statnikov; Constantin F Aliferis
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11
  5 in total
  16 in total

1.  Discretization of continuous features in clinical datasets.

Authors:  David M Maslove; Tanya Podchiyska; Henry J Lowe
Journal:  J Am Med Inform Assoc       Date:  2012-10-11       Impact factor: 4.497

2.  Transfer learning of classification rules for biomarker discovery and verification from molecular profiling studies.

Authors:  Philip Ganchev; David Malehorn; William L Bigbee; Vanathi Gopalakrishnan
Journal:  J Biomed Inform       Date:  2011-05-06       Impact factor: 6.317

3.  A machine learning-based framework to identify type 2 diabetes through electronic health records.

Authors:  Tao Zheng; Wei Xie; Liling Xu; Xiaoying He; Ya Zhang; Mingrong You; Gong Yang; You Chen
Journal:  Int J Med Inform       Date:  2016-10-01       Impact factor: 4.046

4.  Prediction of In-hospital Mortality in Emergency Department Patients With Sepsis: A Local Big Data-Driven, Machine Learning Approach.

Authors:  R Andrew Taylor; Joseph R Pare; Arjun K Venkatesh; Hani Mowafi; Edward R Melnick; William Fleischman; M Kennedy Hall
Journal:  Acad Emerg Med       Date:  2016-02-13       Impact factor: 3.451

5.  Application of an efficient Bayesian discretization method to biomedical data.

Authors:  Jonathan L Lustgarten; Shyam Visweswaran; Vanathi Gopalakrishnan; Gregory F Cooper
Journal:  BMC Bioinformatics       Date:  2011-07-28       Impact factor: 3.169

6.  A noninvasive method for coronary artery diseases diagnosis using a clinically-interpretable fuzzy rule-based system.

Authors:  Hamid Reza Marateb; Sobhan Goudarzi
Journal:  J Res Med Sci       Date:  2015-03       Impact factor: 1.852

7.  Identification of host-microbe interaction factors in the genomes of soft rot-associated pathogens Dickeya dadantii 3937 and Pectobacterium carotovorum WPP14 with supervised machine learning.

Authors:  Bing Ma; Amy O Charkowski; Jeremy D Glasner; Nicole T Perna
Journal:  BMC Genomics       Date:  2014-06-21       Impact factor: 3.969

8.  Neural hypernetwork approach for pulmonary embolism diagnosis.

Authors:  Matteo Rucco; David Sousa-Rodrigues; Emanuela Merelli; Jeffrey H Johnson; Lorenzo Falsetti; Cinzia Nitti; Aldo Salvi
Journal:  BMC Res Notes       Date:  2015-10-29

9.  Context Relevant Prediction Model for COPD Domain Using Bayesian Belief Network.

Authors:  Hamid Mcheick; Lokman Saleh; Hicham Ajami; Hafedh Mili
Journal:  Sensors (Basel)       Date:  2017-06-23       Impact factor: 3.576

10.  A three-step approach for the derivation and validation of high-performing predictive models using an operational dataset: congestive heart failure readmission case study.

Authors:  Samir E AbdelRahman; Mingyuan Zhang; Bruce E Bray; Kensaku Kawamoto
Journal:  BMC Med Inform Decis Mak       Date:  2014-05-27       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.