| Literature DB >> 28968100 |
Yunxiang Dai1, Michael R Shortreed1, Mark Scalf1, Brian L Frey1, Anthony J Cesnik1, Stefan Solntsev1, Leah V Schaffer1, Lloyd M Smith1,2.
Abstract
A proteoform family is a group of related molecular forms of a protein (proteoforms) derived from the same gene. We have previously described a strategy to identify proteoforms and elucidate proteoform families in complex mixtures of intact proteins. The strategy is based upon measurements of two properties for each proteoform: (i) the accurate proteoform intact-mass, measured by liquid chromatography/mass spectrometry (LC-MS), and (ii) the number of lysine residues in each proteoform, determined using an isotopic labeling approach. These measured properties are then compared with those extracted from a catalog of theoretical proteoforms containing protein sequences and localized post-translational modifications (PTMs) for the organism under study. A match between the measured properties and those in the catalog constitutes an identification of the proteoform. In the present study, this strategy is extended by utilizing a global PTM discovery database and is applied to the widely studied model organism Escherichia coli, providing the most comprehensive elucidation of E. coli proteoforms and proteoform families to date.Entities:
Keywords: E. coli; NeuCode; PTM; database search; intact-mass; proteoform; proteoform family
Mesh:
Substances:
Year: 2017 PMID: 28968100 PMCID: PMC5679780 DOI: 10.1021/acs.jproteome.7b00516
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 4.466