| Literature DB >> 28473704 |
Sherry Bhalla1, Ruchi Verma1, Harpreet Kaur1, Rajesh Kumar1, Salman Sadullah Usmani1, Suresh Sharma2, Gajendra P S Raghava3.
Abstract
CancerPDF (Cancer Peptidome Database of bioFluids) is a comprehensive database of endogenous peptides detected in the human biofluids. The peptidome patterns reflect the synthesis, processing and degradation of proteins in the tissue environment and therefore can act as a gold mine to probe the peptide-based cancer biomarkers. Although an extensive data on cancer peptidome has been generated in the recent years, lack of a comprehensive resource restrains the facility to query the growing community knowledge. We have developed the cancer peptidome resource named CancerPDF, to collect and compile all the endogenous peptides isolated from human biofluids in various cancer profiling studies. CancerPDF has 14,367 entries with 9,692 unique peptide sequences corresponding to 2,230 unique precursor proteins from 56 high-throughput studies for ~27 cancer conditions. We have provided an interactive interface to query the endogenous peptides along with the primary information such as m/z, precursor protein, the type of cancer and its regulation status in cancer. To add-on, many web-based tools have been incorporated, which comprise of search, browse and similarity identification modules. We consider that the CancerPDF will be an invaluable resource to unwind the potential of peptidome-based cancer biomarkers. The CancerPDF is available at the web address http://crdd.osdd.net/raghava/cancerpdf/ .Entities:
Mesh:
Substances:
Year: 2017 PMID: 28473704 PMCID: PMC5431423 DOI: 10.1038/s41598-017-01633-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Architecture of CancerPDF database.
Figure 2Distribution of peptides according to length (A), mass range (B), cancer tissue types (C) and biofluids (D) in CancerPDF database.
Distribution of CancerPDF entries across key cancer types and major body fluids.
| Biofluid | Serum | Plasma | Urine | Others | Total |
|---|---|---|---|---|---|
| Cancer | |||||
| Ovary | 67 | 0 | 4368 | 777 | 5212 |
| Bladder | 80 | 0 | 1515 | 0 | 1595 |
| Melanoma | 1539 | 0 | 0 | 0 | 1539 |
| Colorectal | 1186 | 44 | 3 | 0 | 1233 |
| Multiple myeloma | 0 | 1083 | 0 | 0 | 1083 |
| Lung | 836 | 0 | 0 | 0 | 836 |
| Pancreas | 66 | 690 | 0 | 0 | 756 |
| Breast | 419 | 8 | 0 | 5 | 432 |
| Gastric | 335 | 0 | 0 | 1 | 336 |
| Thyroid | 103 | 0 | 0 | 0 | 103 |
| Renal | 36 | 0 | 62 | 0 | 98 |
| Others | 81 | 9 | 7 | 19 | 116 |
| Total | 4748 | 1834 | 5955 | 802 | 13339 |
Top ten proteins with maximum numbers of reported peptides in CancerPDF.
|
| Number of Unique peptides | Number of Studies | Number of Cancer conditions |
|---|---|---|---|
| FIBA_HUMAN | 727 | 25 | 21 |
| CO3_HUMAN | 296 | 16 | 15 |
| APOA1_HUMAN | 266 | 14 | 13 |
| CO1A1_HUMAN | 232 | 4 | 3 |
| A4_HUMAN | 223 | 12 | 12 |
| A1AT_HUMAN | 204 | 8 | 7 |
| H4_HUMAN | 200 | 20 | 16 |
| APOA4_HUMAN | 199 | 9 | 9 |
| ITIH4_HUMAN | 182 | 18 | 14 |
| ALBU_HUMAN | 168 | 11 | 11 |
Top fifteen unique peptides associated with different cancers. Each number in the cell represents the number of studies associated with each cancer.
|
| Prostate Cancer | Bladder Cancer | Breast Cancer | NSCLC | Lung adeno- carcinoma | Renal Cell carcinoma | Colorectal carcinoma | Metastatic thyroid carcinomas | Ovarian Cancer | ESCC | Cervical Cancer |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||
| ADSGEGDFLAEGGGVR | 1 | 3 | 1 | 1 | 1 | — | — | — | 1 | — | — |
| SGEGDFLAEGGGVR | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | — | — |
| RPPGFSPFR | 1 | 2 | 3 | — | — | — | 1 | — | — | — | — |
| DSGEGDFLAEGGGVR | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | — | — | — |
| SKITHRIHWESASLL | 1 | 1 | 2 | 1 | — | — | 1 | 1 | — | — | — |
| MNFRPGVLSSRQLGLPGPPDVPDHAAYHPF | 1 | 1 | 5 | — | — | — | — | — | — | 1 | — |
| SSKITHRIHWESASLL | 1 | 1 | 2 | 1 | — | — | — | 1 | — | — | — |
| RPPGFSPF | 1 | 1 | 2 | 1 | — | — | 1 | — | — | — | 1 |
| KITHRIHWESASLL | 1 | 1 | 2 | 1 | — | — | — | 1 | — | — | — |
| GEGDFLAEGGGVR | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | — | — | — |
| DEAGSEADHEGTHSTKRGHAKSRPV | 1 | 3 | 4 | — | — | — | — | — | — | — | — |
| SSSYSKQFTSSTSYNRGDSTFESKSYKM | 1 | 1 | 2 | — | 1 | — | — | 1 | — | — | 1 |
| NGFKSHALQLNNRQIR | 1 | 1 | 2 | — | — | — | 1 | — | — | — | 1 |
| DFLAEGGGVR | 1 | 1 | 1 | — | 1 | — | 1 | 1 | — | — | 1 |
| THRIHWESASLL | 1 | 1 | 2 | — | — | — | — | 1 | — | — | — |