
High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms.

George Teodoro, Tony Pan, Tahsin M Kurc, Jun Kong, Lee A D Cooper, Norbert Podhorszki, Scott Klasky, Joel H Saltz.

Abstract

Analysis of large pathology image datasets offers significant opportunities for the investigation of disease morphology, but the resource requirements of analysis pipelines limit the scale of such studies. Motivated by a brain cancer study, we propose and evaluate a parallel image analysis application pipeline for high throughput computation of large datasets of high resolution pathology tissue images on distributed CPU-GPU platforms. To achieve efficient execution on these hybrid systems, we have built runtime support that allows us to express the cancer image analysis application as a hierarchical data processing pipeline. The application is implemented as a coarse-grain pipeline of stages, where each stage may be further partitioned into another pipeline of fine-grain operations. The fine-grain operations are efficiently managed and scheduled for computation on CPUs and GPUs using performance aware scheduling techniques along with several optimizations, including architecture aware process placement, data locality conscious task assignment, data prefetching, and asynchronous data copy. These optimizations are employed to maximize the utilization of the aggregate computing power of CPUs and GPUs and minimize data copy overheads. Our experimental evaluation shows that the cooperative use of CPUs and GPUs achieves significant improvements on top of GPU-only versions (up to 1.6×) and that the execution of the application as a set of fine-grain operations provides more opportunities for runtime optimizations and attains better performance than coarser-grain, monolithic implementations used in other works. An implementation of the cancer image analysis pipeline using the runtime support was able to process an image dataset consisting of 36,848 4Kx4K-pixel image tiles (about 1.8TB uncompressed) in less than 4 minutes (150 tiles/second) on 100 nodes of a state-of-the-art hybrid cluster system.
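The size and throughput figures quoted at the end of the abstract can be sanity-checked with a little arithmetic. The sketch below is only an illustration: it assumes that "4Kx4K" means 4096 x 4096 pixels, that each pixel occupies 3 bytes (8-bit RGB), and that "less than 4 minutes" can be treated as a 240-second upper bound. None of these assumptions is stated in the abstract, and the function names are hypothetical.

```python
# Back-of-the-envelope check of the dataset size and throughput in the abstract.
# Assumptions (not stated in the abstract): 4096 x 4096 pixel tiles, 3 bytes per
# pixel (8-bit RGB), and a 240-second upper bound for "less than 4 minutes".

TILES = 36_848          # from the abstract
TILE_SIDE_PX = 4096     # assumed meaning of "4Kx4K"
BYTES_PER_PIXEL = 3     # assumed 8-bit RGB
NODES = 100             # from the abstract
MAX_SECONDS = 4 * 60    # upper bound taken from "less than 4 minutes"


def dataset_bytes(tiles=TILES, side=TILE_SIDE_PX, bpp=BYTES_PER_PIXEL):
    """Uncompressed dataset size in bytes under the assumptions above."""
    return tiles * side * side * bpp


def min_throughput(tiles=TILES, seconds=MAX_SECONDS):
    """Lower bound on aggregate tiles per second if the run takes at most `seconds`."""
    return tiles / seconds


if __name__ == "__main__":
    rate = min_throughput()
    print(f"uncompressed size: {dataset_bytes() / 1e12:.2f} TB")
    print(f"aggregate throughput: at least {rate:.1f} tiles/s")
    print(f"per-node throughput: at least {rate / NODES:.2f} tiles/s")
```

Under these assumptions the dataset comes to roughly 1.85 TB (decimal), in line with the "about 1.8TB" in the abstract, and a run of under 240 seconds implies slightly more than 153 tiles per second in aggregate, or about 1.5 tiles per second per node. The quoted 150 tiles/second therefore appears to be a rounded figure.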


Keywords:  CPU-GPU platforms; GPGPU; Image Segmentation Pipelines

Year:  2013        PMID: 25419546      PMCID: PMC4240318          DOI: 10.1109/IPDPS.2013.11

Source DB:  PubMed          Journal:  IPDPS


  10 in total

1.  Safe "cloudification" of large images through picker APIs.

Authors:  Erich Bremer; Tahsin Kurc; Yi Gao; Joel Saltz; Jonas S Almeida
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

2.  Feature-based Analysis of Large-scale Spatio-Temporal Sensor Data on Hybrid Architectures.

Authors:  Joel Saltz; George Teodoro; Tony Pan; Lee Cooper; Jun Kong; Scott Klasky; Tahsin Kurc
Journal:  Int J High Perform Comput Appl       Date:  2013-06-09       Impact factor: 1.942

3.  Region Templates: Data Representation and Management for High-Throughput Image Analysis.

Authors:  George Teodoro; Tony Pan; Tahsin Kurc; Jun Kong; Lee Cooper; Scott Klasky; Joel Saltz
Journal:  Parallel Comput       Date:  2014-12-01       Impact factor: 0.986

4.  Comparative Performance Analysis of Intel Xeon Phi, GPU, and CPU: A Case Study from Microscopy Image Analysis.

Authors:  George Teodoro; Tahsin Kurc; Jun Kong; Lee Cooper; Joel Saltz
Journal:  IEEE Trans Parallel Distrib Syst       Date:  2014-05       Impact factor: 2.687

5.  MaReIA: A Cloud MapReduce Based High Performance Whole Slide Image Analysis Framework.

Authors:  Hoang Vo; Jun Kong; Dejun Teng; Yanhui Liang; Ablimit Aji; George Teodoro; Fusheng Wang
Journal:  Distrib Parallel Databases       Date:  2018-07-30       Impact factor: 1.500

6.  Application Performance Analysis and Efficient Execution on Systems with multi-core CPUs, GPUs and MICs: A Case Study with Microscopy Image Analysis.

Authors:  George Teodoro; Tahsin Kurc; Guilherme Andrade; Jun Kong; Renato Ferreira; Joel Saltz
Journal:  Int J High Perform Comput Appl       Date:  2015-07-27       Impact factor: 1.942

7.  High-Performance Computational Analysis of Glioblastoma Pathology Images with Database Support Identifies Molecular and Survival Correlates.

Authors:  Jun Kong; Fusheng Wang; George Teodoro; Lee Cooper; Carlos S Moreno; Tahsin Kurc; Tony Pan; Joel Saltz; Daniel Brat
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2013-12

8.  Efficient Execution of Microscopy Image Analysis on CPU, GPU, and MIC Equipped Cluster Systems.

Authors:  G Andrade; R Ferreira; George Teodoro; Leonardo Rocha; Joel H Saltz; Tahsin Kurc
Journal:  Proc Symp Comput Archit High Perform Comput       Date:  2014-10

9.  A Framework for 3D Vessel Analysis using Whole Slide Images of Liver Tissue Sections.

Authors:  Yanhui Liang; Fusheng Wang; Darren Treanor; Derek Magee; Nick Roberts; George Teodoro; Yangyang Zhu; Jun Kong
Journal:  Int J Comput Biol Drug Des       Date:  2016

10.  Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies.

Authors:  Tahsin Kurc; Xin Qi; Daihou Wang; Fusheng Wang; George Teodoro; Lee Cooper; Michael Nalisnik; Lin Yang; Joel Saltz; David J Foran
Journal:  BMC Bioinformatics       Date:  2015-12-01       Impact factor: 3.169

