| Literature DB >> 18793429 |
Susann Schenk1, Gary J Schoenhals, Gustavo de Souza, Matthias Mann.
Abstract
BACKGROUND: The immense diagnostic potential of human plasma has prompted great interest and effort in cataloging its contents, exemplified by the Human Proteome Organization (HUPO) Plasma Proteome Project (PPP) pilot project. Due to challenges in obtaining a reliable blood plasma protein list, HUPO later re-analysed their own original dataset with a more stringent statistical treatment that resulted in a much reduced list of high confidence (at least 95%) proteins compared with their original findings. In order to facilitate the discovery of novel biomarkers in the future and to realize the full diagnostic potential of blood plasma, we feel that there is still a need for an ultra-high confidence reference list (at least 99% confidence) of blood plasma proteins.Entities:
Year: 2008 PMID: 18793429 PMCID: PMC2563020 DOI: 10.1186/1755-8794-1-41
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Plasma experiments performed.
| Experiment | Designated name | Plasma treatment | [Protein] | Plasma | MS | LC-MS run | Comment |
| Plasma_01 | 01_MS2 | none | 450 | 1-D PAGE | MS/MS2 | 100 | |
| Plasma_04 | 04_MS2_prec | none | 750 | 1-D PAGE | MS/MS2 | 100 | Precursor selection |
| Plasma_05 | 05 | none | 750 | 1-D PAGE | MS/MS2/MS3 | 100 | |
| Plasma_06 | 06_Alb_depl | Albumin-depletion | 1800 | 1-D PAGE | MS/MS2/MS3 | 100 | |
| Plasma_10 | 10_Alb_depl _NL | Albumin depletion, | 1800 | 1-D PAGE | MS/MS2/MS3 | 100 | Neutral loss |
| Plasma_08 | 08_OGE | none | 1800 | OGE | MS/MS2/MS3 | 140 | |
| Plasma_09 | 09_OGE_6_depl | Depletion of albumin, | 650 | OGE | MS/MS2/MS3 | 140 | |
| Plasma_11 | 11_Orbitrap | Depletion of albumin, | 300 | 1-D PAGE | MS/MS2 | 140 |
A summary of the differences in the experimental protocols between plasma experiments is shown. Each experiment has been assigned a meaningful "designated name" that is used in the text and in the accompanying figures and tables. Note that all experiments include MS3 measurements with the exception of 01_MS2, 04_MS2_prec and 11_Orbitrap. The type of treatment performed on the plasma samples, if any, is indicated, along with the amount of material applied to the gel, the separation method employed, and the method used for MS data collection.
Figure 1Histogram showing the number of proteins identified . The number of validated, non-redundant peptides used to identify each protein was calculated and the proteins with identical numbers of peptides were plotted in the same group, indicated on the X-axis. For example, there are a total of 70 proteins that were identified with a single peptide. The inset depicts in detail the number of proteins that were identified with 1–20 peptides. Proteins identified with more than 20 peptides were categorized into groups as indicated.
Proteins identified with more than 100 non-redundant, validated peptides.
| Protein name | Number of distinct, validated peptides |
| Apolipoprotein B-100 [Precursor] | 505 |
| Complement C3 [Precursor] variant | 235 |
| Complement C3 [Precursor] | 165 |
| Alpha-2-macroglobulin [Precursor] | 164 |
| Complement component C4B, C4B1 | 164 |
| Fibronectin [Precursor], isoform 1, 3, 5, 7, 8, 9, or 10 | 143 |
| Talin-1 | 135 |
| Filamin A, alpha (actin binding protein 280) | 117 |
| Fibronectin 1, isoforms 3, 4 or 5, or CRA isoforms h, j, n or m | 115 |
| Fibronectin [Precursor], isoform 8 | 107 |
| Complement component 4A | 105 |
| Complement C5 [Precursor] | 105 |
After combining the experimental data for all 8 treatments described here, the number of validated, non-redundant peptides was calculated for each protein and the proteins having 100 or more such peptides appear in this list. The list is arranged in descending order beginning with the protein having the largest number of distinct, validated peptides.
Figure 2Histogram depicting the number of validated, non-redundant peptides versus the MW of the identified proteins. The number of validated, non-redundant peptides used to identify each protein was calculated and this number was plotted as a function of the molecular weight of that particular protein. The MW range (X-axis) was truncated at 550 kDa, resulting in the loss of one protein. Likewise, the number of validated, non-redundant peptides (Y-axis) was truncated at 250 peptides, resulting in the loss of an additional protein.
Figure 3Histogram showing the molecular weight distribution of the calculated masses of 697 observed plasma proteins. The calculated protein masses for all proteins from all experiments were categorized into molecular weight groups as indicated so that the number of proteins falling into each molecular weight group is proportional to the height of each bar in the histogram.
Figure 4Pie chart representation of all validated proteins which were categorized as GO cellular component. Of the 697 plasma proteins identified, 540 of these fell into the 'GO cellular component' category. Of the 540 'GO cellular component' proteins, 392 (65%) were classified as 'cellular' and 208 (35%) were classified as 'extracellular'. 44 of the proteins from the 'extracellular' category were classified as 'extracellular matrix' proteins. Note that because some of the proteins have been reported more than once, the total number of proteins reported for the two categories shown is actually higher than the total number of proteins for the 'GO cellular component'. We have therefore normalized the sum of the cellular and extracellular components to 100%.
Figure 5Pie chart representation of the 21 most highly abundant plasma proteins from our set of 697 proteins across all experiments. 10 of the 21 most highly abundant plasma proteins from our analysis are depicted in the pie chart. The small table to the right of the pie chart contains the next 11 most abundant proteins. The size of each pie piece is proportional to the number of unique peptides that we sequenced for that particular protein, relative to the total number of peptides for all 21 proteins depicted.
Plasma protein concentration ranges of selected proteins.
| Plasma protein | Known protein concentrations (g/ml) | Distinct peptides | Mass (Da) |
| Serum albumin | 3.5 – 5.2 × 10-2 | 94 | 69367 |
| Fibrinogen (alpha chain) | 2.0 – 4.0 × 10-3 | 96 | 94973 |
| Alpha-2-macroglobulin (male adult) | 0.9 – 4.0 × 10-3 | 164 | 163278 |
| Alpha-1-antitrypsin | 7.8 – 20 × 10-4 | 44 | 46737 |
| Haptoglobin | 3.0 – 22 × 10-4 | 52 | 45205 |
| Transthyretin, Thyroxine-binding prealbumin | 2.8 – 3.5 × 10-4 | 19 | 15887 |
| Ceruloplasmin | 1.5 – 6.0 × 10-4 | 95 | 122205 |
| Prothrombin | 1.0 × 10-4 | 53 | 70037 |
| Fletcher factor (Plasma kallikrein precursor) | 5.0 × 10-5 | 37 | 71370 |
| Complement component C6 | 4.8 – 6.4 × 10-5 | 61 | 104844 |
| Complement component C9 | 4.7 – 6.9 × 10-5 | 32 | 63174 |
| Hageman factor (Coagulation factor XII) | 2.9 × 10-5 | 27 | 67818 |
| Complement C1r component | 2.5 – 3.8 × 10-5 | 45 | 80174 |
| Properdin (Factor P) | 2.4 – 3.2 × 10-5 | 2 | 51276 |
| Complement C2 | 2.2 – 3.4 × 10-5 | 45 | 83268 |
| Von Willebrand factor | 7 × 10-6 | 55 | 309299 |
| Stuart factor (Coagulation factor X) | 5.0 × 10-6 | 18 | 54732 |
| Christmas factor (Coagulation factor IX) | 4.0 × 10-6 | 16 | 51748 |
| Transferrin soluble receptor (adult), Serotransferrin | 0.8 – 1.8 × 10-6 | 89 | 77050 |
| Proconvertin (Coagulation factor VII) | 1.0 × 10-6 | * | 53043 |
| Mannose-binding protein C (MBP) | 0.3 – 4.1 × 10-6 | 14 | 26144 |
| Beta-2-microglobulin | 8.0 – 24 × 10-7 | 8 | 13715 |
| Antihemophilic factor (Coagulation factor VIII) | 1.0 × 10-7 | --- | 267009 |
| C-reactive protein, splice isoform 1 | 6.8 – 820 × 10-8 | 8 | 25039 |
| Insulin-like growth factor II | 9.9 – 50 × 10-8 | 5 | 20140 |
| Myoglobin | 6.0 – 85 × 10-9 | 4 | 17053 |
| Prolactin (male), awake | 1.0 – 7.0 × 10-9 | --- | 25876 |
| Insulin | 2.0 – 8.4 × 10-10 | --- | 11981 |
Included in the table are proteins identified in our high confidence set of plasma proteins that represent a wide range of protein molecular weights and which have known plasma protein concentrations. Also included are some proteins that are known to be of higher abundance, but which were not detected in our experiments, and these were included to demonstrate that fact. The protein name, known plasma protein concentration, the number of unique peptides identifying the protein, the summed Mascot score and the MW is shown and the table has been sorted in descending order with the protein possessing the highest known protein concentration listed first.
--- indicates that this entry was not identified in this study.
* indicates that this entry was identified but not validated.
Selected proteins and their possible involvement in diseases.
| Accession | MW | Protein name | Function | Disease |
| Q15848 | 26414 | Adiponectin | Hematopoiesis, immune system; fat metabolism | Adiponectin deficiency; obesity |
| P37840 | 14460 | Alpha-synuclein, isoform 1, 2, or 3 | Regulation of dopamine release and transport. | Defects in SNCA cause autosomal |
| P02741-1 | 25039 | C-reactive protein, splice | Enhances host defense. | Inflammation, heart disease |
| P06703 | 10180 | Calcyclin (Prolactin receptor | Preferentially expressed during quiescent fibroblast | It is inducible by growth factors |
| P31944 | 27680 | Caspase-14 | Apoptosis. | |
| P07339 | 44552 | Cathepsin D | Acid protease active in intracellular | Disease pathogenesis: |
| P81605 | 11284 | Dermcidin | Neuron survival; phosphatase and antimicrobial | |
| Q99497 | 19891 | DJ-1 protein (Oncogene DJ1) | Androgen receptor-dependent transcription | Early-onset Parkinson disease 7 |
| P23142-1 | 77261 | Fibulin-1, splice isoform 1 or D | Cell adhesion/migration, organization of ECM, | human breast cancer; |
| P23142-4 | 74462 | Fibulin-1, splice isoform 4 or C | human breast cancer; does not | |
| O75636-1 | 32903 | Ficolin 3, splice isoform 1 | Lectin activity. | Systemic lupus erythematosus |
| P16930 | 46374 | Fumarylacetoacetase | Not found | Defects in FAH are the cause of |
| Q7M4S4 | 2046 | Granulocyte inhibitory protein | ||
| P01344-1 | 20140 | Insulin-like growth factor II, | Growth-promoting activity; fetal development. | |
| P05362 | 57826 | Intercellular adhesion molecule-1 | ICAM proteins are ligands for the leukocyte | |
| P13473 | 44961 | Lysosome-associated membrane | Lysosomal maintenance; intracellular signal | Implicated in tumor cell metastasis. |
| P02144 | 17053 | Myoglobin | Reserve O2 supply, O2 movement within muscles. | |
| P58546 | 12764 | Myotrophin | Cerebellar morphogenesis. | Seems to be associated with |
| P22392 | 17401 | NM23-LV (contains Nucleoside | Nucleoside diphosphate kinase B is a | Reduced amounts of Nucleoside |
| P15531 | 17149 | Nucleoside diphosphate kinase A | Synthesis of nucleoside triphosphates other than | Neuroblastoma. |
| P10720 | 10845 | Platelet factor 4 variant | Inhibitor of angiogenesis, endothelial cell | |
| P01133 | 133946 | Pro-epidermal growth factor | Growth of epidermal and epithelial tissues. | |
| P27918 | 51276 | Properdin (Factor P) | Alternate complement pathway; binds C3- and C5- | Properdin deficiency (PFD); higher |
| P61019 | 23546 | Ras-related protein Rab-2A | Protein transport; endoplasmic reticulum to Golgi | |
| P51149 | 23490 | Ras-related protein Rab-7 | Protein transport. Vesicular traffic. | Charcot-Marie-Tooth disease type |
| Q12913 | 145927 | Receptor-type tyrosine-protein | Mechanism of contact inhibition of cell growth. | Cancers of colon, lung, and breast. |
| P48594 | 44854 | Squamous cell carcinoma antigen | Protease inhibitor; host immune response | Seems to also be secreted in |
| Q15582 | 74681 | Transforming growth factor-beta | Cell-collagen interactions; endochondral bone | Corneal dystrophy Groenouw type |
| P07911 | 69761 | Uromodulin | Not known. Possible regulation of cytokines. | Familial juvenile hyperuricemic |
We have provided a short selection of proteins that we identified in our experiments and that have important biological functions and/or may have some role in a disease process. The table includes a description of the protein function, the disease involved, the primary accession number of the protein, and the molecular weight and name of the protein and has been arranged alphabetically according to protein name.
Figure 6Venn diagram representations of comparisons between pairs of individual plasma experiments. Individual experiments are represented by circles, with compared experiment pairs being depicted by overlapping sets of two circles. The number of proteins identified in both members of a compared set of experiments is given in the intersection region of the circles. The number of proteins that are unique to an individual experiment is shown outside of the intersection region, along with the total number of proteins identified for that individual experiment, shown in parentheses. The plasma experiment represented by each circle is represented at the top of the respective circle, outside of the intersection region (panel A, for example, shows a comparison of experiments 01_MS2, designated "01'', and 04_MS2_prec, designated "04''). The experimental conditions used in each experiment can be found in Table 1. Panel F shows a comparison between all 7 experiments performed on the FT and the last experiment which was performed on the Orbitrap.
Figure 7Histograms showing the number of proteins identified with a given number of unique peptides for each individual experiment. The number of validated, non-redundant peptides was calculated for each protein identified within the context of each individual experiment, and proteins having identical numbers of peptides were grouped together and plotted as indicated. The Y-axes (number of proteins) for the experiments indicated in panels C and D and in A, B, E, F and G have been standardized in order to facilitate cross-experiment comparison.
Figure 8Histograms depicting the molecular weight distribution of the identified proteins in each of the 8 experiments. The proteins from each individual experiment were grouped according to their calculated molecular weight and plotted as indicated. All Y-axis scales except for the ones in panels D and H have been standardized to facilitate cross-experiment comparison.
Figure 9Histograms showing the molecular weight distribution of shared proteins. The calculated molecular weights of the proteins determined to be in common between each set of indicated experiments were categorized and plotted. Panel A depicts a comparison of the proteins found to be in common between the FT (seven independent experiments) and the Orbitrap. Panel B shows the calculated molecular weights from the 56 proteins identified in all experimental approaches.
List of the 56 proteins found in all 8 experiments.
| Primary accession | IPI number (version 3.25) | Protein | Protein name |
| P43652 | IPI00019943 | 69069 | Afamin [Precursor] |
| P02763 | IPI00022429 | 23512 | Alpha-1-acid glycoprotein 1 precursor |
| P19652 | IPI00020091 | 23603 | Alpha-1-acid glycoprotein 2 [Precursor] |
| P01011 | IPI00431656/IPI00550991/IPI00411920 | 47651 | Alpha-1-antichymotrypsin [Precursor] |
| P04217 | IPI00022895/IPI00644018 | 54273 | Alpha-1B-glycoprotein [Precursor] |
| P02765 | IPI00022431 | 39325 | Alpha-2-HS-glycoprotein [Precursor] |
| P01023 | IPI00478003 | 163278 | Alpha-2-macroglobulin [Precursor] |
| P02760 | IPI00022426 | 39000 | AMBP protein [Precursor] |
| P01008 | IPI00032179 | 52603 | Antithrombin-III [Precursor] |
| P02647 | IPI00021841 | 30778 | Apolipoprotein A-I [Precursor] |
| P02652 | IPI00021854/IPI00382587 | 11175 | Apolipoprotein A-II [Precursor] |
| ENSP00000350425 | IPI00304273 | 45399 | Apolipoprotein A-IV [Precursor] (Apo-AIV) |
| P04114 | IPI00022229 | 515563 | Apolipoprotein B-100 [Precursor] |
| P02655 | IPI00021856 | 11284 | Apolipoprotein C-II precursor |
| P02656 | IPI00021857/IPI00657670 | 10852 | Apolipoprotein C-III [Precursor] |
| P05090 | IPI00006662 | 21276 | Apolipoprotein D [Precursor] |
| P02649 | IPI00021842 | 36154 | Apolipoprotein E [Precursor] |
| O14791-2 | IPI00514475/IPI00186903 | 45871 | Apolipoprotein-L1 [Precursor], splice isoform 2 |
| P04003 | IPI00021727 | 67033 | C4b-binding protein alpha chain [Precursor] |
| P00450 | IPI00017601 | 122205 | Ceruloplasmin [Precursor] |
| P10909 | IPI00291262 | 52495 | Clusterin [Precursor] |
| P00748 | IPI00019581 | 67818 | Coagulation factor XII [Precursor] (Hageman factor) |
| P02747 | IPI00022394 | 25774 | Complement C1q subcomponent, C chain [Precursor] |
| P09871 | IPI00017696 | 76685 | Complement C1s subcomponent [Precursor] |
| P06681 | IPI00303963 | 83268 | Complement C2 [Precursor] |
| P01031 | IPI00032291/IPI00169407 | 188331 | Complement C5 [Precursor] |
| P13671 | IPI00009920 | 104844 | Complement component C6 [Precursor] |
| P00751-1 | IPI00019591 | 85533 | Complement factor B [Precursor], Splice isoform 1 |
| P08603-1 | IPI00029739 | 139071 | Complement factor H [Precursor], splice isoform 1 |
| P02671-1 | IPI00021885 | 94973 | Fibrinogen alpha chain [Precursor] (Fibrinogen alpha/alpha-E chain precursor), Splice isoform Alpha-E |
| P02675 | IPI00298497 | 55928 | Fibrinogen beta chain [Precursor] |
| P02679-2 | IPI00021891/IPI00167009/IPI00219713 | 49496 | Fibrinogen gamma chain [Precursor], splice isoform 2 (isoform Gamma-A) |
| P06396 | IPI00026314/IPI00377087 | 85698 | Gelsolin [Precursor], plasma (Actin-depolymerizing factor) |
| P00738 | IPI00478493/IPI00641737/IPI00431645 | 45205 | Haptoglobin [Precursor] |
| P02790 | IPI00022488 | 51676 | Hemopexin [Precursor] |
| P05546 | IPI00292950 | 57071 | Heparin cofactor II [Precursor] |
| P04196 | IPI00022371 | 59579 | Histidine-rich glycoprotein [Precursor] |
| Q3B7H5 | IPI00028413 | 99849 | Inter-alpha (Globulin) inhibitor H3 – Homo sapiens (Human). |
| P19827 | IPI00292530/IPI00383338 | 101389 | Inter-alpha-trypsin inhibitor heavy chain H1 [Precursor] (ITI heavy chain H1) (Inter-alpha-inhibitor heavy chain 1) (Inter-alpha-trypsin inhibitor complex component III) (Serum-derived hyaluronan-associated protein) (SHAP) |
| P19823 | IPI00305461/IPI00289083 | 106436 | Inter-alpha-trypsin inhibitor heavy chain H2 [Precursor] |
| NP_000412 | IPI00009865 | 58827 | keratin 10 [Homo sapiens] |
| NP_000217 | IPI00019359 | 62064 | Keratin, type I cytoskeletal 9 (Keratin-9) |
| NP_006112 | IPI00220327 | 66067 | Keratin, type II cytoskeletal 1 (Keratin-1, Cytokeratin 1; hair alpha protein) |
| P03952 | IPI00654888 | 71370 | Plasma kallikrein [Precursor] (Fletcher factor) |
| P05155 | IPI00291866 | 55154 | Plasma protease C1 inhibitor precursor |
| P02753 | IPI00022420 | 23010 | Plasma retinol-binding protein [Precursor] |
| P00747 | IPI00019580 | 90569 | Plasminogen [Precursor] |
| P00734 | IPI00019568/IPI00006618 | 70037 | Prothrombin [Precursor] (EC 3.4.21.5) |
| P02768 | IPI00022434/IPI00745872 | 69367 | Serum albumin [Precursor], splice isoform 1 |
| P02743 | IPI00022391 | 25387 | Serum amyloid P-component [Precursor] |
| Q9Y490 | IPI00298994 | 269767 | Talin-1 |
| P07996 | IPI00296099 | 129413 | Thrombospondin-1 [Precursor] |
| P02766 | IPI00022432 | 15887 | Transthyretin [Precursor] |
| ENSP00000273951 | IPI00555812/IPI00742696 | 52918 | Vitamin D-binding protein precursor |
| P04004 | IPI00298971 | 54306 | Vitronectin [Precursor] |
| P04275 | IPI00023014 | 309299 | Von Willebrand factor [Precursor] |
The table provides the primary accession number, IPI number(s), protein annotation and the molecular weight of all 56 proteins that we found to be in common among our eight different experiments and is sorted alphabetically according to protein name.
Figure 10GoMiner analysis of proteins found to be in common with this study and the HUPO study. Panel A depicts a pie chart representation of a GoMiner analysis of the 242 proteins found to be in common between ours and the HUPO data sets. 208 of these were categorized by GoMiner to be "GO cellular component" as indicated. 98 of these proteins were categorized as cellular and 146 were categorized as extracellular, with 18 of the extracellular category being further classified as extracellular matrix proteins. Due to redundancy within the cellular and extracellular categories, the sum of the two categories was normalized to 100% for the purpose of calculating percentages. Panel B shows a histogram representing the molecular weight distribution of the 242 proteins found to be in common between ours and the HUPO data sets. The protein molecular weights were categorized as indicated in the panel before plotting.