| Literature DB >> 30563850 |
Joshua Klein1, Joseph Zaia2,3.
Abstract
mzML and mzIdentML are commonly used, powerful tools for representing mass spectrometry data and derived identification information. These formats are complex, requiring non-trivial logic to translate data into the appropriate representation. Most published implementations are tightly coupled to data structures. The most complete implementations are written in compiled languages that cannot expose the complete flexibility of the implementation to external programs or bindings. To our knowledge, there are no complete implementations for mzML or mzIdentML available to scripting languages like Python or R. We present psims, a library written in Python for writing mzML and mzIdentML. The library allows writing either XML format using built-in Python data structures. It includes a controlled vocabulary resolution system to simplify the encoding process and an identity tracking system to manage entity relationships. The source code is available at https://github.com/mobiusklein/psims, and through the Python Package Index as psims, licensed under the Apache 2 common license.Keywords: Bioinformatics; Bioinformatics software; Data standards; Mass Spectrometry; Quality control and metrics; mzIdentML; mzML
Mesh:
Year: 2018 PMID: 30563850 PMCID: PMC6398200 DOI: 10.1074/mcp.RP118.001070
Source DB: PubMed Journal: Mol Cell Proteomics ISSN: 1535-9476 Impact factor: 5.911