| Literature DB >> 31114875 |
Zefang Tang1, Boxi Kang1, Chenwei Li2, Tianxiang Chen1, Zemin Zhang1,2,3.
Abstract
Introduced in 2017, the GEPIA (Gene Expression Profiling Interactive Analysis) web server has been a valuable and highly cited resource for gene expression analysis based on tumor and normal samples from the TCGA and the GTEx databases. Here, we present GEPIA2, an updated and enhanced version to provide insights with higher resolution and more functionalities. Featuring 198 619 isoforms and 84 cancer subtypes, GEPIA2 has extended gene expression quantification from the gene level to the transcript level, and supports analysis of a specific cancer subtype, and comparison between subtypes. In addition, GEPIA2 has adopted new analysis techniques of gene signature quantification inspired by single-cell sequencing studies, and provides customized analysis where users can upload their own RNA-seq data and compare them with TCGA and GTEx samples. We also offer an API for batch process and easy retrieval of the analysis results. The updated web server is publicly accessible at http://gepia2.cancer-pku.cn/.Entities:
Year: 2019 PMID: 31114875 PMCID: PMC6602440 DOI: 10.1093/nar/gkz430
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Schema describing the features of public data and uploaded data analyses in GEPIA2.
Figure 2.Examples of GEPIA2 outputs. (A) Users can check the prognostic impacts of gene or isoform expression level based on the survival heat map. The heat map shows the hazard ratios in logarithmic scale (log10) for different genes. The red and blue blocks denote higher and lower risks, respectively. The rectangles with frames mean the significant unfavorable and favorable results in prognostic analyses. (B) The violin-plots show the expression level (log2(TPM + 1)) of each isoform in a certain gene while the bar-plot panel present the isoform usage (from 0% to 100%) distribution. (C) GEPIA2 provides differential signature score analysis based on the box plots. The signature score is calculated by mean value of log2(TPM + 1) of each gene in Th1-like signature gene set. The orange box indicates the tumor samples while the blue one represents the normal tissues. The Th1-like signature score level in stomach tumors is significantly higher than normal tissues. (D) The overall survival analyses based on the cancer type and cancer subtype showed the significant prognostic impact of MAPK6 in Lumina A breast cancer subtype (left, P = 0.022) but not the whole breast cancer (right, P = 0.15).
Table describing the new features and improvements in the GEPIA2 visualization tools
| Gene | Isoform | Gene signature | Cancer subtype | Python package | ||
|---|---|---|---|---|---|---|
| General | ✓ | ✓ | ||||
| Differential genes | ✓ | ✓ | ||||
| Expression DIY | Gene expression profile | ✓ | ✓ | |||
| Box plots | ✓ | ✓ | ✓ | ✓ | ✓ | |
| Pathological stage plot | ✓ | ✓ | ||||
| Multiple gene comparison | ✓ | |||||
| Survival analysis | Survival plots | ✓ | ✓ | ✓ | ✓ | ✓ |
| Most differential survival genes | ✓ | ✓ | ||||
| Survival Map | ✓ | ✓ | ✓ | |||
| Isoform details | Isoform usage | ✓ | ||||
| Isoform structure | ✓ | |||||
| Correlation analysis | ✓ | ✓ | ✓ | ✓ | ||
| Similar genes detection | ✓ | ✓ | ✓ | ✓ | ||
| Dimensionality reduction | ✓ | |||||