| Literature DB >> 28248257 |
Kira Vyatkina1,2.
Abstract
De novo sequencing of tandem (MS/MS) mass spectra represents the only way to determine the sequence of proteins from organisms with unknown genomes, or the ones not directly inscribed in a genome-such as antibodies, or novel splice variants. Top-down mass spectrometry provides new opportunities for analyzing such proteins; however, retrieving a complete protein sequence from top-down MS/MS spectra still remains a distant goal. In this paper, we review the state-of-the-art on this subject, and enhance our previously developed Twister algorithm for de novo sequencing of peptides from top-down MS/MS spectra to derive longer sequence fragments of a target protein.Entities:
Keywords: de novo sequencing; tag convolution; top-down mass spectrometry
Year: 2017 PMID: 28248257 PMCID: PMC5372227 DOI: 10.3390/proteomes5010006
Source DB: PubMed Journal: Proteomes ISSN: 2227-7382
Figure 1The alignment of the gapped strings against the respective protein sequence: (a) for the light chain of alemtuzumab; (b) for ubiquitin (a contaminant from the CAH2 sample). The matching fragments (up to the I/L substitution) of the gapped strings and protein sequences are marked in red; the spurious amino acids of the gapped strings are shown in gray. The gap sizes (in Da), along with the corresponding fragments of the protein sequences, are depicted in green. The zero-size gap between two adjacent fragments of ubiquitin that appeared as part of the input is indicated with the green vertical bar. The gap in the second gapped string for the light chain of alemtuzumab, labeled with the green asterisk, comprises Da, and thus, approximately equals the mass of its corresponding dimer RP. All the other gaps accurately match the respective theoretical estimates as well.