Literature DB >> 15492032

A high productivity/low maintenance approach to high-performance computation for biomedicine: four case studies.

Nicholas Carriero1, Michael V Osier, Kei-Hoi Cheung, Perry L Miller, Mark Gerstein, Hongyu Zhao, Baolin Wu, Scott Rifkin, Joseph Chang, Heping Zhang, Kevin White, Kenneth Williams, Martin Schultz.   

Abstract

The rapid advances in high-throughput biotechnologies such as DNA microarrays and mass spectrometry have generated vast amounts of data ranging from gene expression to proteomics data. The large size and complexity involved in analyzing such data demand a significant amount of computing power. High-performance computation (HPC) is an attractive and increasingly affordable approach to help meet this challenge. There is a spectrum of techniques that can be used to achieve computational speedup with varying degrees of impact in terms of how drastic a change is required to allow the software to run on an HPC platform. This paper describes a high- productivity/low-maintenance (HP/LM) approach to HPC that is based on establishing a collaborative relationship between the bioinformaticist and HPC expert that respects the former's codes and minimizes the latter's efforts. The goal of this approach is to make it easy for bioinformatics researchers to continue to make iterative refinements to their programs, while still being able to take advantage of HPC. The paper describes our experience applying these HP/LM techniques in four bioinformatics case studies: (1) genome-wide sequence comparison using Blast, (2) identification of biomarkers based on statistical analysis of large mass spectrometry data sets, (3) complex genetic analysis involving ordinal phenotypes, (4) large-scale assessment of the effect of possible errors in analyzing microarray data. The case studies illustrate how the HP/LM approach can be applied to a range of representative bioinformatics applications and how the approach can lead to significant speedup of computationally intensive bioinformatics applications, while making only modest modifications to the programs themselves.

Entities:  

Mesh:

Year:  2004        PMID: 15492032      PMCID: PMC543832          DOI: 10.1197/jamia.M1571

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  14 in total

Review 1.  DNA microarrays: their use and misuse.

Authors:  Xinmin Li; Weikuan Gu; Subburaman Mohan; David J Baylink
Journal:  Microcirculation       Date:  2002-01       Impact factor: 2.628

Review 2.  Protein microarray technology.

Authors:  Markus F Templin; Dieter Stoll; Monika Schrenk; Petra C Traub; Christian F Vöhringer; Thomas O Joos
Journal:  Trends Biotechnol       Date:  2002-04       Impact factor: 19.536

3.  Evolution of gene expression in the Drosophila melanogaster subgroup.

Authors:  Scott A Rifkin; Junhyong Kim; Kevin P White
Journal:  Nat Genet       Date:  2003-01-27       Impact factor: 38.330

4.  Accelerating comparative genomics using parallel computing.

Authors:  Chintalapati Janaki; Rajendra R Joshi
Journal:  In Silico Biol       Date:  2003

Review 5.  Experiments using microarray technology: limitations and standard operating procedures.

Authors:  T Forster; D Roy; P Ghazal
Journal:  J Endocrinol       Date:  2003-08       Impact factor: 4.286

6.  A "polyORFomic" analysis of prokaryote genomes using disabled-homology filtering reveals conserved but undiscovered short ORFs.

Authors:  Paul M Harrison; Nicholas Carriero; Yang Liu; Mark Gerstein
Journal:  J Mol Biol       Date:  2003-11-07       Impact factor: 5.469

Review 7.  Two-dimensional gel electrophoresis; better than a poke in the ICAT?

Authors:  Wayne F Patton; Birte Schulenberg; Thomas H Steinberg
Journal:  Curr Opin Biotechnol       Date:  2002-08       Impact factor: 9.740

8.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

9.  Parallelization of general-linkage analysis problems.

Authors:  S Dwarkadas; A A Schäffer; R W Cottingham; A L Cox; P Keleher; W Zwaenepoel
Journal:  Hum Hered       Date:  1994 May-Jun       Impact factor: 0.444

10.  Use of proteomic patterns in serum to identify ovarian cancer.

Authors:  Emanuel F Petricoin; Ali M Ardekani; Ben A Hitt; Peter J Levine; Vincent A Fusaro; Seth M Steinberg; Gordon B Mills; Charles Simone; David A Fishman; Elise C Kohn; Lance A Liotta
Journal:  Lancet       Date:  2002-02-16       Impact factor: 79.321

View more
  2 in total

1.  X!!Tandem, an improved method for running X!tandem in parallel on collections of commodity computers.

Authors:  Robert D Bjornson; Nicholas J Carriero; Christopher Colangelo; Mark Shifman; Kei-Hoi Cheung; Perry L Miller; Kenneth Williams
Journal:  J Proteome Res       Date:  2007-09-29       Impact factor: 4.466

2.  Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies.

Authors:  Tahsin Kurc; Xin Qi; Daihou Wang; Fusheng Wang; George Teodoro; Lee Cooper; Michael Nalisnik; Lin Yang; Joel Saltz; David J Foran
Journal:  BMC Bioinformatics       Date:  2015-12-01       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.