| Literature DB >> 16336652 |
Florian Grosse-Coosmann1, Andreas M Boehm, Albert Sickmann.
Abstract
BACKGROUND: Mascot is a commonly used protein identification program for MS as well as for tandem MS data. When analyzing huge shotgun proteomics datasets with Mascot's native tools, limits of computing resources are easily reached. Up to now no application has been available as open source that is capable of converting the full content of Mascot result files from the original MIME format into a database-compatible tabular format, allowing direct import into database management systems and efficient handling of huge datasets analyzed by Mascot.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16336652 PMCID: PMC1325259 DOI: 10.1186/1471-2105-6-290
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1An example of the MIME format of Mascot™ result files is shown in this figure. Wrapped lines are indented. Some lines are removed due to space savings, marked by [...]. The original example file contains 322 lines. Cross-reference links are marked in red.
The command line options of mres2x. Parameters for setting the Mascot™'s username, changing line break characters as well as debugging mode exist, too. The usage of mres2x is: mres2x -d destination -o type [-rvfpSuU] filemask_of_input_files, where the last parameter defines the input file(s) including the path and can even be a single file. The input must be in original Mascot™ format, not HTML. The files from the file mask must be in the same directory if the output format is not tab. In case of tab format output, the destination must be a single file, otherwise a folder. mrex2x explicitly expands input file masks. A description of the parameters also can be found in the file Overview.html (see additional file 1) included in the source code package.
| Describes the output format. Supported types are: | ||
| Use CR LF instead of LF as linefeed in data blocks. Some OS need special line feed characters in text files. | ||
| Increase verbosity mode by one per occurrence of -v. A maximum of two -v is allowed. | ||
| Overwrite files/allow usage of non-empty directories. Usually, the destination directory must be empty. | ||
| Preserves files on unsuccessful program termination. Useful for debugging purposes. | ||
| Show message indicator even if stderr is a terminal. | ||
| Set the username to name, if no entry is present and if the tab output format is selected. | ||
| Set the username to name in all cases if the tab output format is selected. This allows changing the username in the result files of Mascot™. |
Figure 2An example of the output of mres2x in tabular format, one record per line. The lines begin with a prefix, indicating the line type. Lower case letters indicate description lines; the corresponding data has upper case letters as prefix which may be directly followed by running numbers. The format is semicolon separated. Chapters are marked and commented in red. The format is described in the file Tabformat.html (see Additional file 2), included in the source code package.