Literature DB >> 24883047

Parallel classification and feature selection in microarray data using SPRINT.

Lawrence Mitchell1, Terence M Sloan1, Muriel Mewissen2, Peter Ghazal2, Thorsten Forster2, Michal Piotrowski1, Arthur Trew1.   

Abstract

The statistical language R is favoured by many biostatisticians for processing microarray data. In recent times, the quantity of data that can be obtained in experiments has risen significantly, making previously fast analyses time consuming or even not possible at all with the existing software infrastructure. High performance computing (HPC) systems offer a solution to these problems but at the expense of increased complexity for the end user. The Simple Parallel R Interface is a library for R that aims to reduce the complexity of using HPC systems by providing biostatisticians with drop-in parallelised replacements of existing R functions. In this paper we describe parallel implementations of two popular techniques: exploratory clustering analyses using the random forest classifier and feature selection through identification of differentially expressed genes using the rank product method.

Entities:  

Keywords:  Genomics; HPC; Parallel programming

Year:  2014        PMID: 24883047      PMCID: PMC4038771          DOI: 10.1002/cpe.2928

Source DB:  PubMed          Journal:  Concurr Comput        ISSN: 1532-0626            Impact factor:   1.536


  15 in total

1.  RankProd: a bioconductor package for detecting differentially expressed genes in meta-analysis.

Authors:  Fangxin Hong; Rainer Breitling; Connor W McEntee; Ben S Wittner; Jennifer L Nemhauser; Joanne Chory
Journal:  Bioinformatics       Date:  2006-09-18       Impact factor: 6.937

2.  Data mining, neural nets, trees--problems 2 and 3 of Genetic Analysis Workshop 15.

Authors:  Andreas Ziegler; Anita L DeStefano; Inke R König; Claire Bardel; Dumitru Brinza; Shelley Bull; Zhaohui Cai; Beate Glaser; Wei Jiang; Kristine E Lee; Chuang Xing Li; Jing Li; Xin Li; Paul Majoram; Yan Meng; Kristin K Nicodemus; Alexander Platt; Daniel F Schwarz; Weilang Shi; Yin Yao Shugart; Hans H Stassen; Yan V Sun; Sungho Won; Wenyi Wang; Grace Wahba; Usumah A Zagaar; Zhenming Zhao
Journal:  Genet Epidemiol       Date:  2007       Impact factor: 2.135

3.  Comments on the rank product method for analyzing replicated experiments.

Authors:  James A Koziol
Journal:  FEBS Lett       Date:  2010-01-20       Impact factor: 4.124

4.  Translational bioinformatics in the cloud: an affordable alternative.

Authors:  Joel T Dudley; Yannick Pouliot; Rong Chen; Alexander A Morgan; Atul J Butte
Journal:  Genome Med       Date:  2010-08-06       Impact factor: 11.117

5.  Performance of random forest when SNPs are in linkage disequilibrium.

Authors:  Yan A Meng; Yi Yu; L Adrienne Cupples; Lindsay A Farrer; Kathryn L Lunetta
Journal:  BMC Bioinformatics       Date:  2009-03-05       Impact factor: 3.169

6.  Optimization of a parallel permutation testing function for the SPRINT R package.

Authors:  Savvas Petrou; Terence M Sloan; Muriel Mewissen; Thorsten Forster; Michal Piotrowski; Bartosz Dobrzelecki; Peter Ghazal; Arthur Trew; Jon Hill
Journal:  Concurr Comput       Date:  2011-06-23       Impact factor: 1.536

7.  R/parallel--speeding up bioinformatics analysis with R.

Authors:  Gonzalo Vera; Ritsert C Jansen; Remo L Suppi
Journal:  BMC Bioinformatics       Date:  2008-09-22       Impact factor: 3.169

8.  SPRINT: a new parallel framework for R.

Authors:  Jon Hill; Matthew Hambley; Thorsten Forster; Muriel Mewissen; Terence M Sloan; Florian Scharinger; Arthur Trew; Peter Ghazal
Journal:  BMC Bioinformatics       Date:  2008-12-29       Impact factor: 3.169

9.  Computational cluster validation for microarray data analysis: experimental assessment of Clest, Consensus Clustering, Figure of Merit, Gap Statistics and Model Explorer.

Authors:  Raffaele Giancarlo; Davide Scaturro; Filippo Utro
Journal:  BMC Bioinformatics       Date:  2008-10-29       Impact factor: 3.169

10.  Picking single-nucleotide polymorphisms in forests.

Authors:  Daniel F Schwarz; Silke Szymczak; Andreas Ziegler; Inke R König
Journal:  BMC Proc       Date:  2007-12-18
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.