| Literature DB >> 18439290 |
Sudipto Saha1, Scott H Harrison, Changyu Shen, Haixu Tang, Predrag Radivojac, Randy J Arnold, Xiang Zhang, Jake Yue Chen.
Abstract
BACKGROUND: With the introduction of increasingly powerful mass spectrometry (MS) techniques for clinical research, several recent large-scale MS proteomics studies have sought to characterize the entire human plasma proteome with a general objective for identifying thousands of proteins leaked from tissues in the circulating blood. Understanding the basic constituents, diversity, and variability of the human plasma proteome is essential to the development of sensitive molecular diagnosis and treatment monitoring solutions for future biomedical applications. Biomedical researchers today, however, do not have an integrated online resource in which they can search for plasma proteins collected from different mass spectrometry platforms, experimental protocols, and search software for healthy individuals. The lack of such a resource for comparisons has made it difficult to interpret proteomics profile changes in patients' plasma and to design protein biomarker discovery experiments. DESCRIPTION: To aid future protein biomarker studies of disease and health from human plasma, we developed an online database, HIP2 (Healthy Human Individual's Integrated Plasma Proteome). The current version contains 12,787 protein entries linked to 86,831 peptide entries identified using different MS platforms.Entities:
Year: 2008 PMID: 18439290 PMCID: PMC2396660 DOI: 10.1186/1755-8794-1-12
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1Relational data model schema for the table. Protein, peptide evidence, MS Experiment and sample.
Summary of HIP2 database. The numbers of peptides and proteins represent unique entries that are the union of multiple subjects, possibly from different ethnic groups.
| Source | Platform | Peptides | Proteins | Search Software |
| ESI-MS/MS_DECA | 712 | 348 | SEQUEST | |
| ESI-MS/MS_DECAXP | 5796 | 2149 | SEQUEST | |
| ESI-MS/MS_LCQ | 1818 | 427 | SEQUEST/SONAR | |
| ESI-MS/MS_QSTAR | 309 | 137 | SEQUEST | |
| ESI-MS/MS_QTOF | 5078 | 573 | MASCOT | |
| ESI-MS/MS_QTRAP | 195 | 51 | MASCOT | |
| MALDI_MS/MS_QSTAR | 384 | 60 | SEQUEST/MASCOT | |
| IMS_MS/MS_TOF | 35781 | 9,087 | MASCOT | |
| ESI-MS/MS_DECA | 260 | 110 | SEQUEST | |
| ESI-MS/MS_DECAXP | 317 | 159 | SEQUEST | |
| ESI-MS/MS_LCQ | 1101 | 263 | SEQUEST | |
| ESI-MS/MS_QSTAR | 14 | 14 | SEQUEST | |
| ESI-MS/MS_QTOF | 728 | 215 | SEQUEST | |
| LC_MS/MS* | 1153 | 250 | SEQUEST | |
| 2DEMS & LC_MS/MS* | ---- | 928 | SEQUEST |
*Not defined
Figure 2The overall design of the HIP. 1 = Query of protein "X" observed in plasma proteome; 2 = Query of peptide "Y" observed in plasma proteome; 3 = Output result of "X" protein page ; 4 = Link to external database; 5 = Link to experimental page of protein "X"; 6 = Link to the peptide page of the protein "X"; 7 = Output result of peptide "Y"; 8 = Iterative query from peptide page to search for other proteins associated with the peptide "Y".
Figure 6Snapshot of the peptide query search.
Figure 3Protein identification. Number of proteins (Y axis) versus the number of distinct peptides used for the protein identification (X axis).
Figure 4Total proteins categorized by the number of platforms that identify them. Numbers in the legend refer to the number of platforms.
Figure 5Overlapping of plasma proteins identified from different sources. C = Clemmer's group; H = HUPO PPP ; L = Leigh Anderson's group; P = PepAtlas data source.