| Literature DB >> 16448014 |
Yonghua Han1, Bin Ma, Kaizhong Zhang.
Abstract
For the identification of novel proteins using MS/MS, de novo sequencing software computes one or several possible amino acid sequences (called sequence tags) for each MS/MS spectrum. Those tags are then used to match, accounting amino acid mutations, the sequences in a protein database. If the de novo sequencing gives correct tags, the homologs of the proteins can be identified by this approach and software such as MS-BLAST is available for the matching. However, de novo sequencing very often gives only partially correct tags. The most common error is that a segment of amino acids is replaced by another segment with approximately the same masses. We developed a new efficient algorithm to match sequence tags with errors to database sequences for the purpose of protein and peptide identification. A software package, SPIDER, was developed and made available on Internet for free public use. This paper describes the algorithms and features of the SPIDER software.Mesh:
Substances:
Year: 2004 PMID: 16448014 DOI: 10.1109/csb.2004.1332434
Source DB: PubMed Journal: Proc IEEE Comput Syst Bioinform Conf ISSN: 1551-7497