Literature DB >> 27447310

Near Real-Time Processing of Proteomics Data Using Hadoop.

Chris Hillman1, Yasmeen Ahmad2, Mark Whitehorn1, Andy Cobley1.   

Abstract

This article presents a near real-time processing solution using MapReduce and Hadoop. The solution is aimed at some of the data management and processing challenges facing the life sciences community. Research into genes and their product proteins generates huge volumes of data that must be extensively preprocessed before any biological insight can be gained. In order to carry out this processing in a timely manner, we have investigated the use of techniques from the big data field. These are applied specifically to process data resulting from mass spectrometers in the course of proteomic experiments. Here we present methods of handling the raw data in Hadoop, and then we investigate a process for preprocessing the data using Java code and the MapReduce framework to identify 2D and 3D peaks.

Year:  2014        PMID: 27447310     DOI: 10.1089/big.2013.0036

Source DB:  PubMed          Journal:  Big Data        ISSN: 2167-6461            Impact factor:   2.128


  1 in total

1.  Developing Predictive or Prognostic Biomarkers for Charged Particle Radiotherapy.

Authors:  Michael D Story; Jing Wang
Journal:  Int J Part Ther       Date:  2018
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.