| Literature DB >> 19519948 |
Allen D Taylor1, William S Hancock, Marina Hincapie, Naoyuki Taniguchi, Samir M Hanash.
Abstract
Advances in mass spectrometry have had a great impact on the field of proteomics. A major challenge of proteomic analysis has been the elucidation of glycan modifications of proteins in complex proteomes. Glycosylation is the most structurally elaborate and diverse type of protein post-translational modification and, because of this, proteomics and glycomics have largely developed independently. However, given that such a large proportion of proteins contain glycan modifications, and that these may be important for their function or may produce biologically relevant protein variation, a convergence of the fields of glycomics and proteomics would be highly desirable. Here we review the current status of glycoproteomic efforts, focusing on the identification of glycoproteins as cancer biomarkers.Entities:
Year: 2009 PMID: 19519948 PMCID: PMC2703866 DOI: 10.1186/gm57
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Identification of mucin proteins from plasma proteomic analysis
| IPI | Gene | Description | Length (aa) | MW | N |
|---|---|---|---|---|---|
| IPI00552461 | Cd164 sialomucin-like 2 | 171 | 17,874 | 2 | |
| IPI00452565 | Isoform 1 of endomucin precursor | 261 | 27,452 | 2 | |
| IPI00452566 | Isoform 2 of endomucin precursor | 178 | 18,848 | 1 | |
| IPI00400813 | Isoform 2 of putative EGF-like module-containing mucin-like hormone receptor-like 4 precursor | 474 | 52,857 | 1 | |
| IPI00289931 | Isoform 1 of mucosal addressin cell adhesion molecule 1 precursor | 406 | 42,691 | 24 | |
| IPI00397424 | Mucosal vascular addressin cell adhesion molecule 1 isoform a precursor | 382 | 40,155 | 1 | |
| IPI00016334 | Isoform 1 of cell surface glycoprotein muc18 precursor | 646 | 71,607 | 69 | |
| IPI00445227 | Isoform 2 of cell surface glycoprotein muc18 precursor | 527 | 57,605 | 43 | |
| IPI00013955 | Isoform 1 of mucin-1 precursor | 1,255 | 122,072 | 9 | |
| IPI00103552 | Mucin-16 | 22,152 | 2353428 | 18 | |
| IPI00646572 | Uncharacterized protein muc16 | 14,495 | 1518170 | 61 | |
| IPI00384775 | Small intestinal mucin muc3 (fragment) | 648 | 65,520 | 1 | |
| IPI00470673 | Isoform 1 of mucin-17 precursor | 4,493 | 451,723 | 1 | |
| IPI00737886 | Similar to submaxillary apomucin | 1,509 | 158,995 | 8 | |
| IPI00829796 | Muc19 | 246 | 23,202 | 1 | |
| IPI00027201 | Mucin-2 precursor | 5,179 | 540,300 | 20 | |
| IPI00178316 | Isoform 12 of mucin-4 precursor | 1,125 | 125,170 | 1 | |
| IPI00181286 | 52 kDa protein | 475 | 51,806 | 2 | |
| IPI00103397 | Mucin-5ac precursor (fragment) | 5,030 | 526,608 | 18 | |
| IPI00386327 | Mucin-5ac (fragment) | 1,233 | 130,073 | 2 | |
| IPI00479313 | Gastric mucin (fragment) | 1,373 | 147,788 | 1 | |
| IPI00744035 | 591 kDa protein | 5,709 | 591,191 | 3 | |
| IPI00869170 | Mucin 5ac, oligomeric mucus/gel-forming | 6,207 | 648,803 | 1 | |
| IPI00012165 | Predicted: mucin 5, subtype b, tracheobronchial | 5,708 | 591,045 | 15 | |
| IPI00747373 | Mucin-5b precursor | 5,703 | 590,499 | 1 | |
| IPI00401776 | Mucin 6, gastric | 2,439 | 257,051 | 23 | |
| IPI00030366 | Mucin | 274 | 29,038 | 51 | |
| IPI00180687 | Isoform 3 of mucin and cadherin-like protein precursor | 487 | 53,233 | 3 | |
| IPI00301869 | Isoform 2 of mucin and cadherin-like protein precursor | 651 | 69,541 | 1 | |
| IPI00029147 | Mucin (fragment) | 171 | 17,863 | 1 | |
| IPI00376207 | Similar to muc19 | 598 | 61,206 | 1 | |
| IPI00398026 | Predicted: similar to mucin-4 | 270 | 27,866 | 1 | |
| IPI00455879 | Predicted: similar to mucin 1 precursor (muc-1) | 879 | 93,058 | 8 | |
| IPI00457080 | Predicted: similar to submaxillary apomucin | 7,202 | 694,615 | 66 |
Abbreviations: aa, amino acids; IPI, International Protein Index [55]; MW, molecular weight; N, number of experiments. Descriptions are taken from annotations in the IPI database.
Figure 1Distribution of mucin 16 peptides identified by mass spectrometry along part of the sequence of this large protein. Red rectangles, peptides identified; black bars, potential sites of glycosylation; green sequence, repeat domains that are heavily glycosylated. Note the lack of identified peptides spanning glycosylation sites.
Figure 2Glycoproteomic flow chart reflecting the integration of traditional proteomics methods with glycomic techniques. This addresses the need to identify a glycoprotein (1), identify a glycosylation site (2), and determine the structure of the glycan (3). The ability to successfully characterize glycoproteins requires an emphasis on purification, enrichment, and fractionation strategies at both the protein and peptide level.
Figure 3Identification of EGFR glycoforms. (a) The extracellular-domain sequence of EGFR (629 amino acids) is shown at the top. Black bars, peptides that are predicted to be generated by tryptic digestion; green boxes, peptides with known N-linked glycosylation sites; red boxes, peptides identified in a series of four LC-MS/MS experiments to identify proteins in reference pools of plasma from healthy subjects and patients with cancer. A manual inspection for known glycosylated peptides from EGFR was then carried out, leading to the identification of the peptide containing the Asn328 glycosylation (green box in experiment 4). (b) EGFR from the A431 cell line was subjected to MS analysis. Ten N-linked glycosylation sites were identified, and the main glycan proposed to be at each site is shown [41]. Curly brackets indicate that the monomer (sialic acid) is present in one of these positions (or two in the case of the glycan attached to Asn544). (c) The proposed glycan structure for one EGFR peptide differs from that of the corresponding peptide identified in A431 cells. This structure was determined following LC-MS/MS analysis of human plasma from a mixture of control individuals and patients with lung cancer (SL Wu and BL Karger, personal communication). Reproduced with permission from [40].