| Literature DB >> 23667664 |
Kun He1, Xin-Yu Wen, Ai-Ling Li, Tao Li, Jie Wang, Hong-Xia Wang, Na Wang.
Abstract
The emergence of mass spectrometry (MS)-based signatures as biomarkers has generated considerable enthusiasm among oncologists. However, variations in normal individuals also exist, and a better understanding of serum peptide patterns of healthy individuals will be important for further exploring disease-specific serum peptide patterns. Following development of a serum peptide pattern platform, we analyzed 500 serum samples obtained from healthy individuals. Samples from breast (n = 84), lung (n = 70), and rectal (n = 30) cancer patients were also examined. Extensive data analysis revealed negligible contributions of age to serum peptide patterns except in healthy individuals between 20-30 and 60+ years of age. Gender-related variations in the serum patterns of healthy individuals were only observed in 20-30 year-old individuals. Our results revealed substantial variation in individual peptide profiles, but 65 peptides were detected at a 20% higher frequency in the healthy population. A peptide profile was developed for each type of cancer, containing 10 discriminating peptides not prevalent in healthy individuals. Sequence identification of 111 signature peptides revealed that they fell into several tight clusters and most were exopeptidase products of serum resident proteins. We have obtained a MS-based serum peptide profile for healthy individuals, providing a reference for observing the occurrence of cancer-specific peptides.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23667664 PMCID: PMC3648468 DOI: 10.1371/journal.pone.0063724
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Peptide profiles in healthy subjects.
(A) Study overview. The diagram shows the design of the healthy serum peptidome demographic study, and the statistical analysis approach used for gender and different age groups. (B) Example of within-run reproducibility of mass spectra. Serum from a pooled healthy sample (10 individuals) was prepared with copper-chelated magnetic bead extraction. Six in one day randomly selected spectra obtained by MALDI-TOF MS and pseudo gel spectra used in the analysis. The analysis m/z range is from 1 KD to 10 KD. (C) Age distribution of 500 healthy subjects. The colored fields indicate the five age groups used throughout this study. The black horizontal lines represent medians for each cohort. (D) Distribution of 65 peptides with their molecular weight ranges.
Figure 2Occurrence frequency of 65 peptides in the sera of healthy individuals and patients with three types of cancer.
m/z (mass:charge ratio) indicates protonated average molecule weight (M+H) in linear MALDI-TOF-MS. Red: zero frequency in cancers; Green: <20% frequency; Blue: 20–50% frequency; Grey: >50%.
Figure 3Peptides with >20% frequency in the sera of cancer patients and <20% frequency in healthy individuals.
Red asterisk: occurrence in three cancers with >20% frequency; Blue asterisk: occurrence in two of the three cancers with >20% frequency.
Figure 4PLS analysis on the age and gender-related variations of the serum peptide profiles of healthy individuals.
(A) 3D scatter plots of PLS analysis on the serum peptide profiles derived from male (blue) and female (red) donors with different age ranges. (B) 3D scatter plots of PLS analysis on age-related variation of serum peptide profiles.
Identification of serum peptides.
| Protein name | No. | Mmon | Mavg+H | Sequence |
| Fibrinogen a | 1 | 1535.69 | 1537.58 | ADSGEGDFLAEGGGVR |
| 2 | 1464.65 | 1466.50 | DSGEGDFLAEGGGVR | |
| 3 | 1349.62 | 1351.42 | SGEGDFLAEGGGVR | |
| 4 | 1262.59 | 1264.34 | GEGDFLAEGGGVR | |
| 5 | 1205.57 | 1207.29 | EGDFLAEGGGVR | |
| 6 | 1076.53 | 1078.17 | GDFLAEGGGVR | |
| 7 | 1019.50 | 1021.12 | DFLAEGGGVR | |
| 8 | 1308.55 | 1310.32 | DSGEGDFLAEGGGV | |
| 9 | 1193.52 | 1195.23 | SGEGDFLAEGGGV | |
| 10 | 920.42 | 921.98 | GDFLAEGGGV | |
| 11 | 863.40 | 864.93 | DFLAEGGGV | |
| 12 | 748.38 | 749.84 | FLAEGGGV | |
| 13 | 649.31 | 650.71 | FLAEGGG | |
| 14 | 707.31 | 708.75 | DFLAEGG | |
| 15 | 1389.62 | 1391.44 | NRGDSTFESKSY | |
| 16 | 2552.09 | 2554.60 | SSSYSKQFTSSTSYNRGDSTFES | |
| 17 | 2767.22 | 2769.85 | SSSYSKQFTSSTSYNRGDSTFESKS | |
| 18 | 2930.28 | 2933.03 | SSSYSKQFTSSTSYNRGDSTFESKSY | |
| 19 | 3189.42 | 3192.40 | SSSYSKQFTSSTSYNRGDSTFESKSYKM | |
| 20 | 4783.09 | 4786.97 | SSSYSKQFTSSTSYNRGDSTFESKSYKMADEAGSEADHGTHSTK | |
| 21 | 4996.21 | 5000.21 | SSSYSKQFTSSTSYNRGDSTFESKSYKMADEAGSEADHGTHSTKRG | |
| 22 | 5333.35 | 5337.55 | SSSYSKQFTSSTSYNRGDSTFESKSYKMADEAGSEADHEGTHSTKRGHA | |
| 23 | 5900.70 | 5905.24 | SSSYSKQFTSSTSYNRGDSTFESKSYKMADEAGSEADHEGTHSTKRGHAKSRPV | |
| 24 | 3522.53 | 3525.67 | GDSTFESKSYKMADEAGSEADHEGTHSTKRGHA | |
| 25 | 3238.52 | 3241.49 | SYKMADEAGSEADHEGTHSTKRGHAKSRPV | |
| 26 | 2671.17 | 2673.80 | SYKMADEAGSEADHEGTHSTKRGHA | |
| 27 | 2249.95 | 2252.34 | SYKMADEAGSEADHEGTHSTK | |
| 28 | 2121.85 | 2124.16 | SYKMADEAGSEADHEGTHST | |
| 29 | 2988.42 | 2991.23 | KMADEAGSEADHEGTHSTKRGHAKSRPV | |
| 30 | 2874.34 | 2877.08 | MADEAGSEADHCH3EGTHSTKRGHAKSRPV | |
| 31 | 2860.33 | 2863.06 | MADEAGSEADHEGTHSTKRGHAKSRPV | |
| 32 | 2761.26 | 2763.93 | MADEAGSEADHEGTHSTKRGHAKSRP | |
| 33 | 2292.98 | 2295.37 | MADEAGSEADHEGTHSTKRGHA | |
| 34 | 2729.29 | 2731.86 | ADEAGSEADHEGTHSTKRGHAKSRPV | |
| 35 | 2161.94 | 2164.17 | ADEAGSEADHEGTHSTKRGHA | |
| 36 | 2658.25 | 2660.78 | DEAGSEADHEGTHSTKRGHAKSRPV | |
| 37 | 2559.18 | 2561.65 | DEAGSEADHEGTHSTKRGHAKSRP | |
| 38 | 2090.90 | 2093.09 | DEAGSEADHEGTHSTKRGHA | |
| 39 | 2019.86 | 2022.01 | DEAGSEADHEGTHSTKRGH | |
| 40 | 1882.80 | 1884.87 | DEAGSEADHEGTHSTKRG | |
| 41 | 2543.22 | 2545.69 | EAGSEADHEGTHSTKRGHAKSRPV | |
| 42 | 2343.14 | 2345.50 | GSEADHEGTHSTKRGHAKSRPV | |
| 43 | 2815.32 | 2817.99 | GSESGIFTNTKESSSHHPGIAEFPSRG | |
| C3f | 44 | 2020.10 | 2022.32 | SSKITHRIHWESASLLR |
| 45 | 1305.69 | 1307.50 | SKITHRIHWE | |
| 46 | 1776.96 | 1779.05 | SKITHRIHWESASLL | |
| 47 | 1689.93 | 1691.97 | KITHRIHWESASLL | |
| 48 | 1448.75 | 1450.64 | THRIHWESASLL | |
| 49 | 977.48 | 979.09 | THRIHWE | |
| 50 | 1347.70 | 1349.54 | HRIHWESASLL | |
| 51 | 1054.54 | 1056.21 | IHWESASLL | |
| 52 | 941.46 | 943.05 | HWESASLL | |
| 53 | 955.48 | 957.07 | HCH3WESASLL | |
| 54 | 828.38 | 829.89 | HWESASL | |
| C3 precursor | 55 | 667.37 | 668.77 | ASHLGLA |
| 56 | 1854.85 | 1856.94 | SEETKENEGFTVTAEGK | |
| C4a | 57 | 1895.02 | 1897.15 | RNGFKSHALQLNNRQI |
| 58 | 1497.78 | 1499.67 | NGFKSHALQLNNR | |
| 59 | 895.45 | 896.98 | SHALQLNN | |
| 60 | 3199.79 | 3202.72 | GLEEELQFSLGSKINVKVGGNSKGTLKVLR | |
| 61 | 2703.44 | 2706.07 | GLEEELQFSLGSKINVKVGGNSKGTL | |
| 62 | 2304.20 | 2306.58 | GLEEELQFSLGSKINVKVGGNS | |
| 63 | 2377.20 | 2379.64 | DDPDAPLQPVTPLQLFEGRRN | |
| ITIH4 | 64 | 3271.63 | 3274.73 | MNFRPGVLSSRQLGLPGPPDVPDHAAYHPF |
| 65 | 2723.38 | 2726.07 | PGVLSSRQLGLPGPPDVPDHAAYHPF | |
| 66 | 2626.33 | 2628.95 | GVLSSRQLGLPGPPDVPDHAAYHPF | |
| 67 | 2357.16 | 2359.61 | SSRQLGLPGPPDVPDHAAYHPF | |
| 68 | 2183.09 | 2185.45 | QLGLPGPPDVPDHAAYHPFR | |
| 69 | 2026.99 | 2029.26 | QLGLPGPPDVPDHAAYHPF | |
| 70 | 1785.85 | 1787.97 | GLPGPPDVPDHAAYHPF | |
| 71 | 1170.57 | 1172.28 | GLPGPPDVPDHA | |
| 72 | 1099.53 | 1101.20 | GLPGPPDVPDH | |
| 73 | 1000.46 | 1002.07 | PGPPDVPDHA | |
| 74 | 903.41 | 904.96 | GPPDVPDHA | |
| 75 | 832.37 | 833.88 | GPPDVPDH | |
| 76 | 3155.62 | 3158.59 | NVHSGSTFFKYYLQGAKIPKPEASFSPR | |
| 77 | 1388.71 | 1390.60 | GSEMVVAGKLQDR | |
| apoA-I | 78 | 3374.74 | 3377.81 | AELQEGARQKLHELQEKLSPLGEEMCH3RDRA |
| 79 | 3181.73 | 3184.70 | QGLLPVLESFKVSFLSALEEYTKKLNTQ | |
| 80 | 2052.07 | 2054.31 | ATEHLSTLSEKAKPALEDL | |
| apoA-IV | 81 | 2507.35 | 2509.83 | ISASAEELRQRLAPLAEDVRGNL |
| 82 | 2754.36 | 2756.99 | GNTEGLQKSLAELGGHLDQQVEEFR | |
| Kininogen HMW | 83 | 2208.05 | 2210.34 | KHNLGHGHKHERDQGHGHQ |
| 84 | 1942.90 | 1945.03 | NLGHGHKHERDQGHGHQ | |
| 85 | 2126.01 | 2128.28 | GHGLGHGHEQQHGLGHGHKF | |
| Transthyretin | 86 | 2450.20 | 2452.69 | ALGISPFHEHAEVVFTANDSGPR |
| 87 | 3156.61 | 3159.52 | DSGPRRYTIAALLSPYSYSTTAVVTNPKE | |
| 88 | 2040.04 | 2042.29 | ALLSPYSYSTTAVVTNPKE | |
| 89 | 1395.69 | 1397.53 | SYSTTAVVTNPKE | |
| 90 | 1409.70 | 1411.55 | SYSTTAVVTNPKCH3E | |
| Antichymotrypsin | 91 | 4622.49 | 4626.49 | SALVETRTIVRFNRPFLMIIVPTDTQNIFFMSKVTNPKQA |
| 92 | 4464.42 | 4468.34 | LVETRTIVRFNRPFLMIIVPTDTQNIFFMSKVTNPKQA | |
| 93 | 4351.33 | 4355.18 | VETRTIVRFNRPFLMIIVPTDTQNIFFMSKVTNPKQA | |
| 94 | 2735.44 | 2738.29 | LMIIVPTDTQNIFFMSKVTNPKQA | |
| 95 | 2491.31 | 2493.93 | IIVPTDTQNIFFMSKVTNPKQA | |
| α-1 antitrypsin | 96 | 4772.55 | 4776.77 | LEAIPMSIPPEVKFNKPFVFLMIDQNTKSPLFMGKVVNPTQK |
| 97 | 4118.21 | 4121.95 | SIPPEVKFNKPFVFLMIDQNTKSPLFMGKVVNPTQK | |
| 98 | 2488.32 | 2491.00 | LMIDQNTKSPLFMGKVVNPTQK | |
| proapolipoprotein | 99 | 1365.71 | 1367.54 | LEEYTKKLNTQ |
| Cytokeratin | 100 | 1811.85 | 1813.88 | SRSGGGGGGGLGSGGSIRSSY |
| Protein c inhibitor | 101 | 3888.15 | 3891.63 | SARLNSQRLVFNRPFLMFIVDNNILFLGKVNRP |
| 102 | 2195.18 | 2197.62 | SARLNSQRLVFNRPFLMF | |
| 103 | 2048.11 | 2050.44 | SARLNSQRLVFNRPFLM | |
| prealbumin | 104 | 1437.83 | 1439.70 | VKVLDAVRGSPAIN |
| 105 | 785.43 | 786.90 | VVTNPKE | |
| serum albumin | 106 | 1295.64 | 1297.42 | DAHKSEVAHRF |
| 107 | 1538.76 | 1540.68 | DAHKSEVAHRFKD | |
| SAA | 108 | 1524.78 | 1526.74 | PNHFRPAGLPEKY |
| SP40 | 109 | 4266.29 | 4269.96 | PITVTVPVEVSRKNPKFMETVAEKALQEYRKKHREE |
| apolipoprotein CI | 110 | 1461.86 | 1463.76 | FQKVKEKLKIDS |
| C1-inhibitor | 111 | 1063.52 | 1065.24 | MGRVYDPRA |
Mmon is the monoisotopic molecular weight. Mavg+H is protonated average molecule weight as listed under m/z in Figure 2 and 3.
Figure 5NanoLC-FT-ICR MS/MS identification of a peptide, which is a fragment of protein Fibrinogen α, with sequence SSSYSKQFTSSTSYNRGDSTFESKSY and molecular weight 2930.2842 Da.
(A) Base peak ion chromatogram of serum peptides. The arrow indicates the peak including the triple-charged peptide at m/z 977.7690 (B) Mass spectrum of the ion chromatography peak indicated in (A), the diamond indicates the precursor ion at m/z 977.7690. The insert is a magnified view of the precursor ion showing a triple-charged ion at high mass accuracy (0.3 ppm). (C) Tandem mass spectrum of the triple-charged peptide at m/z 977.7690 with amino acid sequence deduced from the y ion series. Note that y ions originate at the C terminus and the sequence therefore reads backwards.