| Literature DB >> 23722089 |
Hagen Meckel1, Christian Stephan, Christian Bunse, Michael Krafzik, Christopher Reher, Michael Kohl, Helmut Erich Meyer, Martin Eisenacher.
Abstract
Proteomics methods, especially high-throughput mass spectrometry analysis have been continually developed and improved over the years. The analysis of complex biological samples produces large volumes of raw data. Data storage and recovery management pose substantial challenges to biomedical or proteomic facilities regarding backup and archiving concepts as well as hardware requirements. In this article we describe differences between the terms backup and archive with regard to manual and automatic approaches. We also introduce different storage concepts and technologies from transportable media to professional solutions such as redundant array of independent disks (RAID) systems, network attached storages (NAS) and storage area network (SAN). Moreover, we present a software solution, which we developed for the purpose of long-term preservation of large mass spectrometry raw data files on an object storage device (OSD) archiving system. Finally, advantages, disadvantages, and experiences from routine operations of the presented concepts and technologies are evaluated and discussed. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan.Keywords: Bioinformatics infrastructure; CD; CIFS; CPU; DVD; Data archive; Data backup; FC; GUI; GZ; HEP; HTML; Information technology; JOOQ; LTO; MSDX; NAS; NFS; NGS; OS; RAID; RAM; SAN; SCSP; SDK; SMB; SWT; Storage attached network; Storage redundancy; TCP/IP; USB; UUID; VMM; WORM; central processor unit; common internet file system; compact disk; digital versatile disk; fiber channel; graphical user interface; gzip; high-energy physics; hypertext markup language; iRODS; iSCSI; integrated Rule-Orientated Data System; internet small computer system interface; java object orientated query; linear tape open; mass spectrometry DX client; network attached storage; network file system; next-generation sequencing; operating system; random-access memory; redundant array of independent disks; server messages block; simple content storage protocol; software development kit; standard widget toolkit; storage area network; transmission control protocol/internet protocol; universal serial bus; universally unique identifier; virtual machine manager; write once read many
Mesh:
Substances:
Year: 2013 PMID: 23722089 DOI: 10.1016/j.bbapap.2013.05.018
Source DB: PubMed Journal: Biochim Biophys Acta ISSN: 0006-3002