| Literature DB >> 30345029 |
Nicholas Borcherding1,2,3,4, Nicholas L Bormann5, Andrew P Voigt4, Weizhou Zhang1,2,3,4.
Abstract
Reverse-phase protein arrays (RPPAs) are a highthroughput approach to protein quantification utilizing antibody-based micro-to-nano scale dot blot. Within the Cancer Genome Atlas (TCGA), RPPAs were used to quantify over 200 proteins in 8,167 tumor and metastatic samples. Protein-level data has particular advantages in assessing putative prognostic or therapeutic targets in tumors. However, many of the available pipelines do not allow for the partitioning of clinical and RPPA information to make meaningful conclusions. We developed a cloud-based application, TRGAted to enable researchers to better examine patient survival based on single or multiple proteins across 31 cancer types in the TCGA. TRGAted contains up-to-date overall survival, disease-specific survival, disease-free interval and progression-free interval information. Furthermore, survival information for primary tumor samples can be stratified based on gender, age, tumor stage, histological type, and subtype, allowing for highly adaptive and intuitive user experience. The code and processed data are open sourced and available on github and contains a tutorial built into the application for assisting users.Entities:
Keywords: Bioinformatics; Cancer Proteomics; Survival Analysis; TCGA
Mesh:
Year: 2018 PMID: 30345029 PMCID: PMC6173115 DOI: 10.12688/f1000research.15789.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Diagram of the implementation of TRGAted.
Each file communicates within the R Shiny framework. On the user side (left, blue), users select pertinent cancer type, protein of interest, and clinical variables into the CSS-enabled user interface. This information is received by the server file enabling the subsequent run in R. On the server side (right, orange), the specific cancer type from the database, R packages, and functions are retrieved and executed. After execution, the server file provides both tabular and graphical output (purple) to the user interface.
Survival information and protein summary available in TRGAted.
| Cancer Type | Samples | OS | DSS | DFI | PFI | Proteins |
|---|---|---|---|---|---|---|
| Adrenocortical carcinoma (ACC) | 46 | 46 | 46 | 28 | 46 | 221 |
| Bladder Urothelial Carcinoma (BLCA) | 344 | 344 | 330 | 153 | 344 | 223 |
| Breast invasive carcinoma (BRCA) | 901 | 873 | 855 | 750 | 873 | 224 |
| Cervical squamous cell carcinoma
| 171 | 171 | 168 | 112 | 171 | 220 |
| Cholangiocarcinoma (CHOL) | 30 | 30 | 29 | 21 | 30 | 219 |
| Colon adenocarcinoma (COAD) | 358 | 325 | 311 | 126 | 325 | 223 |
| Diffuse Large B-cell Lymphoma
| 33 | 33 | 33 | 19 | 33 | 219 |
| Esophageal carcinoma (ESCA) | 126 | 126 | 124 | 76 | 126 | 220 |
| Glioblastoma multiforme (GBM) | 205 | 136 | 123 | 0 | 136 | 223 |
| Head and Neck squamous cell
| 346 | 346 | 326 | 85 | 346 | 239 |
| Kidney Chromophobe (KICH) | 63 | 63 | 63 | 27 | 63 | 220 |
| Kidney renal clear cell carcinoma
| 445 | 444 | 434 | 72 | 444 | 233 |
| Kidney renal papillary cell carcinoma
| 208 | 207 | 205 | 127 | 207 | 221 |
| Lower Grade Glioma (LGG) | 427 | 426 | 420 | 114 | 426 | 220 |
| Liver hepatocellular carcinoma (LIHC) | 184 | 184 | 177 | 145 | 184 | 220 |
| Lung adenocarcinoma (LUAD) | 362 | 361 | 327 | 203 | 361 | 239 |
| Lung squamous cell carcinoma
| 325 | 325 | 295 | 210 | 325 | 239 |
| Mesothelioma (MESO) | 61 | 61 | 45 | 10 | 61 | 220 |
| Ovarian serous cystadenocarcinoma
| 411 | 405 | 377 | 199 | 407 | 224 |
| Pancreatic adenocarcinoma (PAAD) | 105 | 105 | 99 | 40 | 105 | 221 |
| Pheochromocytoma and
| 81 | 79 | 79 | 71 | 79 | 220 |
| Prostate adenocarcinoma (PRAD) | 351 | 351 | 350 | 233 | 351 | 220 |
| Rectum adenocarcinoma (READ) | 130 | 126 | 120 | 31 | 126 | 223 |
| Sarcoma (SARC) | 221 | 221 | 215 | 125 | 22 | 220 |
| Skin Cutaneous Melanoma (SKCM) | 354 | 349 | 346 | 0 | 349 | 223 |
| Stomach adenocarcinoma (STAD) | 392 | 357 | 334 | 207 | 357 | 220 |
| Testicular Germ Cell Tumors (TGCT) | 118 | 104 | 104 | 79 | 104 | 219 |
| Thyroid carcinoma (THCA) | 374 | 372 | 366 | 268 | 372 | 219 |
| Thymoma (THYM) | 90 | 90 | 90 | 9 | 90 | 219 |
| Uterine Corpus Endometrial
| 404 | 404 | 403 | 325 | 404 | 223 |
| Uterine Carcinosarcoma (UCS) | 48 | 48 | 46 | 22 | 48 | 220 |
OS, overall survival; DSS, disease-specific survival; DFI, disease-free interval; PFI, progression-free interval.
Figure 2. Generating survival curves.
The interface shows an example of an overall survival curve for the RAD50 protein in the basal subtype of breast cancer using the optimal cutpoint ( A). Disease-specific survival, disease-free interval, and progression-free interval can also be selected ( B). The cutpoint can be varied to separate samples based on protein level into quartiles, tertiles, medians or separating into two groups based on the lowest p-value ( C).
Figure 3. Visualizing all proteins across a single cancer type.
The interface shows an example of the visualization of Cox hazard ratio of each protein across the basal subtype of breast cancer ( A). Good prognostic markers appear on the left in blue, while poor prognostic markers are on the right in red. The natural log transformation allows the graph to be centered at 0 and makes the visualization of good prognostic markers easier. Labeling for proteins can be adjusted to include more or less protein. Proportional comparisons for protein using the optimal cutpoint function is available as well ( B).
Figure 4. Visualizing all proteins across a single cancer type.
The interface shows an example of the visualization of Cox hazard ratio of for RAD50 across all 31 cancer types ( A). This feature is similar to the Across Cancer tab with the ability to adjust labels and log-transform the Cox hazard ratios. Additionally, the hazard ratios for significant cancer types can be visualized using a bar chart ( B).