Literature DB >> 28884169

Cloud Engineering Principles and Technology Enablers for Medical Image Processing-as-a-Service.

Shunxing Bao1, Andrew J Plassard1, Bennett A Landman1, Aniruddha Gokhale1.   

Abstract

Traditional in-house, laboratory-based medical imaging studies use hierarchical data structures (e.g., NFS file stores) or databases (e.g., COINS, XNAT) for storage and retrieval. The resulting performance from these approaches is, however, impeded by standard network switches since they can saturate network bandwidth during transfer from storage to processing nodes for even moderate-sized studies. To that end, a cloud-based "medical image processing-as-a-service" offers promise in utilizing the ecosystem of Apache Hadoop, which is a flexible framework providing distributed, scalable, fault tolerant storage and parallel computational modules, and HBase, which is a NoSQL database built atop Hadoop's distributed file system. Despite this promise, HBase's load distribution strategy of region split and merge is detrimental to the hierarchical organization of imaging data (e.g., project, subject, session, scan, slice). This paper makes two contributions to address these concerns by describing key cloud engineering principles and technology enhancements we made to the Apache Hadoop ecosystem for medical imaging applications. First, we propose a row-key design for HBase, which is a necessary step that is driven by the hierarchical organization of imaging data. Second, we propose a novel data allocation policy within HBase to strongly enforce collocation of hierarchically related imaging data. The proposed enhancements accelerate data processing by minimizing network usage and localizing processing to machines where the data already exist. Moreover, our approach is amenable to the traditional scan, subject, and project-level analysis procedures, and is compatible with standard command line/scriptable image processing software. Experimental results for an illustrative sample of imaging data reveals that our new HBase policy results in a three-fold time improvement in conversion of classic DICOM to NiFTI file formats when compared with the default HBase region split policy, and nearly a six-fold improvement over a commonly available network file system (NFS) approach even for relatively small file sets. Moreover, file access latency is lower than network attached storage.

Entities:  

Keywords:  Grid computing; HBase; Hadoop; Medical imaging

Year:  2017        PMID: 28884169      PMCID: PMC5584067          DOI: 10.1109/IC2E.2017.23

Source DB:  PubMed          Journal:  Proc IEEE Int Conf Cloud Eng


  6 in total

1.  Bridging clinical information systems and grid middleware: a Medical Data Manager.

Authors:  Johan Montagnat; Daniel Jouvenot; Christophe Pera; Akos Frohner; Peter Kunszt; Birger Koblitz; Nuno Santos; Cal Loomis
Journal:  Stud Health Technol Inform       Date:  2006

Review 2.  Gene expression omnibus: microarray data storage, submission, retrieval, and analysis.

Authors:  Tanya Barrett; Ron Edgar
Journal:  Methods Enzymol       Date:  2006       Impact factor: 1.600

3.  Theoretical and Empirical Comparison of Big Data Image Processing with Apache Hadoop and Sun Grid Engine.

Authors:  Shunxing Bao; Frederick D Weitendorf; Andrew J Plassard; Yuankai Huo; Aniruddha Gokhale; Bennett A Landman
Journal:  Proc SPIE Int Soc Opt Eng       Date:  2017-03-13

4.  Cloud based toolbox for image analysis, processing and reconstruction tasks.

Authors:  Tomasz Bednarz; Dadong Wang; Yulia Arzhaeva; Ryan Lagerstrom; Pascal Vallotton; Neil Burdett; Alex Khassapov; Piotr Szul; Shiping Chen; Changming Sun; Luke Domanski; Darren Thompson; Timur Gureyev; John A Taylor
Journal:  Adv Exp Med Biol       Date:  2015       Impact factor: 2.622

5.  Mapping brain activity at scale with cluster computing.

Authors:  Jeremy Freeman; Nikita Vladimirov; Takashi Kawashima; Yu Mu; Nicholas J Sofroniew; Davis V Bennett; Joshua Rosen; Chao-Tsung Yang; Loren L Looger; Misha B Ahrens
Journal:  Nat Methods       Date:  2014-07-27       Impact factor: 28.547

6.  High dimensional biological data retrieval optimization with NoSQL technology.

Authors:  Shicai Wang; Ioannis Pandis; Chao Wu; Sijin He; David Johnson; Ibrahim Emam; Florian Guitton; Yike Guo
Journal:  BMC Genomics       Date:  2014-11-13       Impact factor: 3.969

  6 in total
  2 in total

1.  A Data Colocation Grid Framework for Big Data Medical Image Processing: Backend Design.

Authors:  Shunxing Bao; Yuankai Huo; Prasanna Parvathaneni; Andrew J Plassard; Camilo Bermudez; Yuang Yao; Ilwoo Lyu; Aniruddha Gokhale; Bennett A Landman
Journal:  Proc SPIE Int Soc Opt Eng       Date:  2018-03

Review 2.  Towards Portable Large-Scale Image Processing with High-Performance Computing.

Authors:  Yuankai Huo; Justin Blaber; Stephen M Damon; Brian D Boyd; Shunxing Bao; Prasanna Parvathaneni; Camilo Bermudez Noguera; Shikha Chaganti; Vishwesh Nath; Jasmine M Greer; Ilwoo Lyu; William R French; Allen T Newton; Baxter P Rogers; Bennett A Landman
Journal:  J Digit Imaging       Date:  2018-06       Impact factor: 4.056

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.