Literature DB >> 27532062

omniClassifier: a Desktop Grid Computing System for Big Data Prediction Modeling.

John H Phan1, Sonal Kothari1, May D Wang1.   

Abstract

Robust prediction models are important for numerous science, engineering, and biomedical applications. However, best-practice procedures for optimizing prediction models can be computationally complex, especially when choosing models from among hundreds or thousands of parameter choices. Computational complexity has further increased with the growth of data in these fields, concurrent with the era of "Big Data". Grid computing is a potential solution to the computational challenges of Big Data. Desktop grid computing, which uses idle CPU cycles of commodity desktop machines, coupled with commercial cloud computing resources can enable research labs to gain easier and more cost effective access to vast computing resources. We have developed omniClassifier, a multi-purpose prediction modeling application that provides researchers with a tool for conducting machine learning research within the guidelines of recommended best-practices. omniClassifier is implemented as a desktop grid computing system using the Berkeley Open Infrastructure for Network Computing (BOINC) middleware. In addition to describing implementation details, we use various gene expression datasets to demonstrate the potential scalability of omniClassifier for efficient and robust Big Data prediction modeling. A prototype of omniClassifier can be accessed at http://omniclassifier.bme.gatech.edu/.

Entities:  

Keywords:  Prediction modeling; big data; desktop grid computing; nested cross validation

Year:  2014        PMID: 27532062      PMCID: PMC4983434          DOI: 10.1145/2649387.2649439

Source DB:  PubMed          Journal:  ACM BCB


  27 in total

1.  An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival.

Authors:  Lance D Miller; Johanna Smeds; Joshy George; Vinsensius B Vega; Liza Vergara; Alexander Ploner; Yudi Pawitan; Per Hall; Sigrid Klaar; Edison T Liu; Jonas Bergh
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-02       Impact factor: 11.205

2.  Experimental trial for diagnosis of pancreatic ductal carcinoma based on gene expression profiles of pancreatic ductal cells.

Authors:  Madoka Ishikawa; Koji Yoshida; Yoshihiro Yamashita; Jun Ota; Shuji Takada; Hiroyuki Kisanuki; Koji Koinuma; Young Lim Choi; Ruri Kaneda; Toshiyasu Iwao; Kiichi Tamada; Kentaro Sugano; Hiroyuki Mano
Journal:  Cancer Sci       Date:  2005-07       Impact factor: 6.716

3.  Combined gene expression analysis of whole-tissue and microdissected pancreatic ductal adenocarcinoma identifies genes specifically overexpressed in tumor epithelia.

Authors:  Liviu Badea; Vlad Herlea; Simona Olimpia Dima; Traian Dumitrascu; Irinel Popescu
Journal:  Hepatogastroenterology       Date:  2008 Nov-Dec

4.  FKBP51 affects cancer cell response to chemotherapy by negatively regulating Akt.

Authors:  Huadong Pei; Liang Li; Brooke L Fridley; Gregory D Jenkins; Krishna R Kalari; Wilma Lingle; Gloria Petersen; Zhenkun Lou; Liewei Wang
Journal:  Cancer Cell       Date:  2009-09-08       Impact factor: 31.743

5.  Gene expression correlates of clinical prostate cancer behavior.

Authors:  Dinesh Singh; Phillip G Febbo; Kenneth Ross; Donald G Jackson; Judith Manola; Christine Ladd; Pablo Tamayo; Andrew A Renshaw; Anthony V D'Amico; Jerome P Richie; Eric S Lander; Massimo Loda; Philip W Kantoff; Todd R Golub; William R Sellers
Journal:  Cancer Cell       Date:  2002-03       Impact factor: 31.743

6.  PARAMO: a PARAllel predictive MOdeling platform for healthcare analytic research using electronic health records.

Authors:  Kenney Ng; Amol Ghoting; Steven R Steinhubl; Walter F Stewart; Bradley Malin; Jimeng Sun
Journal:  J Biomed Inform       Date:  2013-12-25       Impact factor: 6.317

7.  Win percentage: a novel measure for assessing the suitability of machine classifiers for biological problems.

Authors:  R Mitchell Parry; John H Phan; May D Wang
Journal:  BMC Bioinformatics       Date:  2012-03-21       Impact factor: 3.169

8.  Bias in error estimation when using cross-validation for model selection.

Authors:  Sudhir Varma; Richard Simon
Journal:  BMC Bioinformatics       Date:  2006-02-23       Impact factor: 3.169

9.  High-resolution DNA copy number and gene expression analyses distinguish chromophobe renal cell carcinomas and renal oncocytomas.

Authors:  Maria V Yusenko; Roland P Kuiper; Tamas Boethe; Börje Ljungberg; Ad Geurts van Kessel; Gyula Kovacs
Journal:  BMC Cancer       Date:  2009-05-18       Impact factor: 4.430

10.  Activation of Wnt signalling in stroma from pancreatic cancer identified by gene expression profiling.

Authors:  Christian Pilarsky; Ole Ammerpohl; Bence Sipos; Edgar Dahl; Arndt Hartmann; Axel Wellmann; Till Braunschweig; Matthias Löhr; Ralf Jesenofsky; Ralf Jesnowski; Helmut Friess; Moritz Nicolas Wente; Glen Kristiansen; Beatrix Jahnke; Axel Denz; Felix Rückert; Hans K Schackert; Günter Klöppel; Holger Kalthoff; Hans Detlev Saeger; Robert Grützmann
Journal:  J Cell Mol Med       Date:  2008-02-24       Impact factor: 5.310

View more
  2 in total

1.  Discovery of Lipidome Alterations Following Traumatic Brain Injury via High-Resolution Metabolomics.

Authors:  Scott R Hogan; John H Phan; Melissa Alvarado-Velez; May Dongmei Wang; Ravi V Bellamkonda; Facundo M Fernández; Michelle C LaPlaca
Journal:  J Proteome Res       Date:  2018-04-27       Impact factor: 4.466

2.  DNA methylation signatures as biomarkers of prior environmental exposures.

Authors:  Christine Ladd-Acosta; M Daniele Fallin
Journal:  Curr Epidemiol Rep       Date:  2019-02-01
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.