Literature DB >> 34323947

UCSCXenaShiny: An R/CRAN Package for Interactive Analysis of UCSC Xena Data.

Shixiang Wang1,2,3, Yi Xiong4,5,6, Longfei Zhao7, Kai Gu8, Yin Li9, Fei Zhao3,10, Jianfeng Li11,12, Mingjie Wang11, Haitao Wang13, Ziyu Tao1, Tao Wu1, Yichao Zheng7, Xuejun Li4,5, Xue-Song Liu1.   

Abstract

SUMMARY: UCSC Xena platform provides huge amounts of processed cancer omics data from large cancer research projects (e.g. TCGA, CCLE and PCAWG) or individual research groups and enables unprecedented research opportunities. However, a graphical user interface (GUI) based tool for interactively analyzing UCSC Xena data and generating elegant plots is still lacking, especially for cancer researchers and clinicians with limited programming experience. Here, we present UCSCXenaShiny, an R Shiny package for quickly searching, downloading, exploring, analyzing and visualizing data from UCSC Xena data hubs. This tool could effectively promote the practical use of public data, and can serve as an important complement to the current Xena genomics explorer. AVAILABILITY: UCSCXenaShiny is an open source R package under GPLv3 license and it is freely available at https://github.com/openbiox/UCSCXenaShiny or https://cran.r-project.org/package=UCSCXenaShiny. The docker image is available at https://hub.docker.com/r/shixiangwang/ucscxenashiny. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Year:  2021        PMID: 34323947      PMCID: PMC8723150          DOI: 10.1093/bioinformatics/btab561

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Over the past decade, large research programs including TCGA (The Cancer Genome Atlas) (Weinstein ), ICGC (International Cancer Genome Consortium) (Zhang ), PCAWG (Pan-cancer analysis of whole genomes) (Campbell ), GTEx (Genotype-Tissue Expression) (Ardlie ), CCLE (Cancer Cell Line Encyclopedia) (Barretina ) have generated large amounts of molecular data characterizing the different omics landscapes (including genomics, transcriptomics, proteomics and epigenomics) of thousands of tumors. The data have been uniformly preprocessed, curated and stored in data hubs of UCSC Xena (https://xenabrowser.net/datapages/) along with many public cancer datasets from individual research groups, providing unprecedented opportunities for either simple or systematic exploration of cancer behaviors and mechanisms at multiple molecular layers in individual cancer type or across cancer types (Goldman ). Despite the fact that UCSC Xena provides a functional genomics explorer (https://xenabrowser.net/) to allow users to explore and analyze its multi-omics and clinical/phenotype data, it is still difficult for cancer researchers to rapidly explore all available UCSC Xena datasets, find what they need in their research, and download or analyze the corresponding data. Besides, the analysis features and visualization quality provided by UCSC Xena platform have room for improvement. Advanced functionalities for analyzing different molecular profiles from specified data hubs including TCGA, CCLE and PCAWG, and generating publication-ready result plots are still lacking. In 2019, we developed UCSCXenaTools, an open-source R package for retrieving metadata and data from more than one thousand public UCSC Xena datasets (Wang and Liu, 2019). However, this package lacks analysis and visualization capabilities, and only provides a low-level application program interface (API) for accessing data. Thus, it is not suitable for cancer researchers with limited programming experience. Here, we are motivated to present UCSCXenaShiny, an R/CRAN package containing a web application based on the R Shiny framework (https://shiny.rstudio.com/) for quickly searching, retrieving, exploring, analyzing and visualizing data from UCSC Xena data hubs. This tool could effectively promote the practical use of UCSC Xena public data, and serve as an important complement to the functionality of current Xena functional genome explorer.

2 Tool description

UCSCXenaShiny uses both the R package interface (i.e. R functions) and the Shiny application interface to allow the user to efficiently retrieve and analyze data from UCSC Xena data hubs. The architecture of UCSCXenaShiny can be classified into three layers (Fig. 1). The first layer retrieves data from UCSC Xena data hubs and is built on the top of UCSCXenaTools (Wang and Liu, 2019). The second layer is implemented as an R package, it provides almost all core data and analysis features as built-in datasets and public functions (i.e. API) of the R package (Supplementary Table S1). The third layer is implemented as an R Shiny application and provides a graphical user interface for interactive exploration and analysis of UCSC Xena data. A demo of this Shiny is deployed at https://shiny.hiplot.com.cn/ucsc-xena-shiny/ for public use. UCSCXenaShiny has more functionalities compared with other UCSC Xena related tools (including UCSC Xena browser, UCSCXenaTools and xenaPython) (Supplementary Table S2).
Fig. 1.

The architecture diagram of UCSCXenaShiny

The architecture diagram of UCSCXenaShiny

2.1 R function interface

The public functions of UCSCXenaShiny can be divided into three categories based on their functionalities: (i) data retrieval; (ii) analysis and visualization of pan-cancer studies; (iii) advanced analysis and visualization (Supplementary Table S1).

2.2 Shiny interface

Its Shiny web application is the highlight of the UCSCXenaShiny software. It is a web-based software to provide interactive data retrieval, analysis and visualization for users. Similar to the R function interface, there are three core web pages: ‘Repository’, ‘General Analysis’ and ‘Quick PanCan Analysis’. The ‘Repository’ page contains a clickable table of UCSC Xena datasets, dataset filter widgets based on data hubs, cohorts, data types, keywords, etc., and corresponding action buttons (Supplementary Fig. S1). The current ‘General Analysis’ page contains four common analysis modules for exploring relationships between continuous variables, value difference between sample groups and survival curve difference between sample groups (Supplementary Fig. S2). The ‘Quick PanCan Analysis’ page contains several analysis modules for well-known pan-cancer data including TCGA, GTEx (Supplementary Fig. S3) and CCLE (Supplementary Fig. S4). For example: (i) comparisons of molecular profiles among samples, such as mRNA expression between TCGA tumor and normal control tissue (Supplementary Fig. S3A) or across different types of cancer (Maag, 2018) (Supplementary Fig. S3B) or CCLE cell lines (Supplementary Fig. S4A); (ii) association analysis between two molecular profiles with TCGA (Supplementary Fig. S3C) or CCLE (Supplementary Fig. S4B) data; (iii) association studies between a certain molecular profile and tumor/immune features, such as TMB (tumor mutational burden)/MSI (microsatellite instability)/stemness (Supplementary Fig. S3D) and immune gene signatures (Li ; Thorsson ) (Supplementary Fig. S3E); (iv) Kaplan–Meier survival analysis among samples with different levels of a molecular profile (Supplementary Fig. S3F); (v) association analysis between survival hazard ratio and a molecular profile with the Cox model across TCGA cancer types (Supplementary Fig. S3G); (vi) association analysis between mRNA expression of a gene (list) and cell line drug response (Supplementary Fig. S4C); (vii) exploring drug response differences between samples with different gene expression levels (Supplementary Fig. S4D).

3 Implementation

UCSCXenaShiny has been developed with R version ≥3.5 and Shiny following a modular and robust design of both R package and Shiny application. Continuous integration tests with CRAN R package is done automatically after each code commit to help test functionality and detect program bugs in a timely manner. Instructions on how to install, use UCSCXenaShiny and run the Shiny application are presented in the public GitHub repository (https://github.com/openbiox/UCSCXenaShiny). A detailed manual of built-in data and public R functions are organized and described in the package reference page (https://openbiox.github.io/UCSCXenaShiny/reference/index.html). Instructions including texts and videos on how to use functionalities of the Shiny application are documented in the help page of the Shiny application. Tooltips are adopted to help users to understand and customize the parameter setting. The Shiny application also shows data table behind each result plot and provides data download buttons to facilitate the archiving of data and result reproducibility.

4 Conclusion

In recent years, several bioinformatics platforms or tools, such as cBioPortal (Cerami ), Genomic Data Commons (GDC) data portal (Grossman ), ICGC Data Portal (Zhang ), CVCDAP (Guan ) and UCSC Xena (Goldman ) have been constructed for the analysis and visualization of cancer genomics data (Supplementary Table S3). UCSCXenaShiny works as a UCSC Xena client, cBioportal, ICGC data portal, GDC data portal are independent data portals. Compared with these other data portals, UCSC Xena platform is featured with a comprehensive collection of public cancer genome datasets, and combined analysis between public and researchers’ own data (Goldman ). However, UCSC Xena only provided limited number of analysis tools. For efficient cancer genome data download, integration, exploration and visualization, we built UCSCXenaShiny to allow a wide range of users to perform interactive analysis of UCSC Xena data by either programming or graphical interface operation. Since its release, UCSCXenaShiny has been downloaded for more than 10 000 times around the world (according to the API for CRAN package download counts, from the RStudio CRAN mirror, https://cranlogs.r-pkg.org/). We believe that UCSCXenaShiny could effectively promote the practical use of public cancer data and serve as an important complement to the functionality of current Xena functional genome explorer. Click here for additional data file.
  13 in total

1.  The International Cancer Genome Consortium Data Portal.

Authors:  Junjun Zhang; Rosita Bajari; Dusan Andric; Francois Gerthoffert; Alexandru Lepsa; Hardeep Nahal-Bose; Lincoln D Stein; Vincent Ferretti
Journal:  Nat Biotechnol       Date:  2019-04       Impact factor: 54.908

2.  The Cancer Genome Atlas Pan-Cancer analysis project.

Authors:  John N Weinstein; Eric A Collisson; Gordon B Mills; Kenna R Mills Shaw; Brad A Ozenberger; Kyle Ellrott; Ilya Shmulevich; Chris Sander; Joshua M Stuart
Journal:  Nat Genet       Date:  2013-10       Impact factor: 38.330

3.  Toward a Shared Vision for Cancer Genomic Data.

Authors:  Robert L Grossman; Allison P Heath; Vincent Ferretti; Harold E Varmus; Douglas R Lowy; Warren A Kibbe; Louis M Staudt
Journal:  N Engl J Med       Date:  2016-09-22       Impact factor: 91.245

4.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data.

Authors:  Ethan Cerami; Jianjiong Gao; Ugur Dogrusoz; Benjamin E Gross; Selcuk Onur Sumer; Bülent Arman Aksoy; Anders Jacobsen; Caitlin J Byrne; Michael L Heuer; Erik Larsson; Yevgeniy Antipin; Boris Reva; Arthur P Goldberg; Chris Sander; Nikolaus Schultz
Journal:  Cancer Discov       Date:  2012-05       Impact factor: 39.397

5.  Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans.

Authors: 
Journal:  Science       Date:  2015-05-07       Impact factor: 47.728

6.  International Cancer Genome Consortium Data Portal--a one-stop shop for cancer genomics data.

Authors:  Junjun Zhang; Joachim Baran; A Cros; Jonathan M Guberman; Syed Haider; Jack Hsu; Yong Liang; Elena Rivkin; Jianxin Wang; Brett Whitty; Marie Wong-Erasmus; Long Yao; Arek Kasprzyk
Journal:  Database (Oxford)       Date:  2011-09-19       Impact factor: 3.451

7.  The Immune Landscape of Cancer.

Authors:  Vésteinn Thorsson; David L Gibbs; Scott D Brown; Denise Wolf; Dante S Bortone; Tai-Hsien Ou Yang; Eduard Porta-Pardo; Galen F Gao; Christopher L Plaisier; James A Eddy; Elad Ziv; Aedin C Culhane; Evan O Paull; I K Ashok Sivakumar; Andrew J Gentles; Raunaq Malhotra; Farshad Farshidfar; Antonio Colaprico; Joel S Parker; Lisle E Mose; Nam Sy Vo; Jianfang Liu; Yuexin Liu; Janet Rader; Varsha Dhankani; Sheila M Reynolds; Reanne Bowlby; Andrea Califano; Andrew D Cherniack; Dimitris Anastassiou; Davide Bedognetti; Younes Mokrab; Aaron M Newman; Arvind Rao; Ken Chen; Alexander Krasnitz; Hai Hu; Tathiane M Malta; Houtan Noushmehr; Chandra Sekhar Pedamallu; Susan Bullman; Akinyemi I Ojesina; Andrew Lamb; Wanding Zhou; Hui Shen; Toni K Choueiri; John N Weinstein; Justin Guinney; Joel Saltz; Robert A Holt; Charles S Rabkin; Alexander J Lazar; Jonathan S Serody; Elizabeth G Demicco; Mary L Disis; Benjamin G Vincent; Ilya Shmulevich
Journal:  Immunity       Date:  2018-04-05       Impact factor: 43.474

8.  TIMER2.0 for analysis of tumor-infiltrating immune cells.

Authors:  Taiwen Li; Jingxin Fu; Zexian Zeng; David Cohen; Jing Li; Qianming Chen; Bo Li; X Shirley Liu
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

9.  CVCDAP: an integrated platform for molecular and clinical analysis of cancer virtual cohorts.

Authors:  Xiaoqing Guan; Meng Cai; Yang Du; Ence Yang; Jiafu Ji; Jianmin Wu
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

10.  gganatogram: An R package for modular visualisation of anatograms and tissues based on ggplot2.

Authors:  Jesper L V Maag
Journal:  F1000Res       Date:  2018-09-28
View more
  12 in total

1.  A functional reference map of the RNF8 interactome in cancer.

Authors:  Chuanyang Liu; Jingyu Kuang; Yuxuan Wang; Ting Duan; Lu Min; Chenyu Lu; Tianyi Zhang; Ruifen Chen; Ying Wu; Lingyun Zhu
Journal:  Biol Direct       Date:  2022-07-13       Impact factor: 7.173

2.  Evidence of Omics, Immune Infiltration, and Pharmacogenomics for BATF in a Pan-Cancer Cohort.

Authors:  Chenguang Jia; Yihui Ma; Mengyang Wang; Wen Liu; Feng Tang; Jincao Chen
Journal:  Front Mol Biosci       Date:  2022-04-29

3.  Comprehensive analysis of expression profiles and prognosis of TRIM genes in human kidney clear cell carcinoma.

Authors:  Junwen Shen; Rongjiang Wang; Yu Chen; Zhihai Fang; Jianer Tang; Jianxiang Yao; Jianguo Gao; Wenxia Zhou; Xiongnong Chen
Journal:  Aging (Albany NY)       Date:  2022-05-26       Impact factor: 5.955

4.  MDIG, a 2‑oxoglutarate‑dependent oxygenase, acts as an oncogene and predicts the prognosis of multiple types of cancer.

Authors:  Feng Geng; Wei Yang; Dandan Song; Haijia Hou; Bing Han; Yecheng Chen; Hongwen Zhao
Journal:  Int J Oncol       Date:  2022-05-18       Impact factor: 5.884

5.  Prognostic Value of Long Noncoding RNA DLEU2 and Its Relationship with Immune Infiltration in Kidney Renal Clear Cell Carcinoma and Liver Hepatocellular Carcinoma.

Authors:  Shengqiang Fu; Binbin Gong; Siyuan Wang; Qiang Chen; Yifu Liu; Changshui Zhuang; Zhilong Li; Zhicheng Zhang; Ming Ma; Ting Sun
Journal:  Int J Gen Med       Date:  2021-11-11

6.  A Pan-Cancer Analysis Reveals the Prognostic and Immunotherapeutic Value of ALKBH7.

Authors:  Kaijie Chen; Dongjie Shen; Lin Tan; Donglin Lai; Yuru Han; Yonggang Gu; Changlian Lu; Xuefeng Gu
Journal:  Front Genet       Date:  2022-02-11       Impact factor: 4.599

7.  Extrachromosomal DNA formation enables tumor immune escape potentially through regulating antigen presentation gene expression.

Authors:  Tao Wu; Chenxu Wu; Xiangyu Zhao; Guangshuai Wang; Wei Ning; Ziyu Tao; Fuxiang Chen; Xue-Song Liu
Journal:  Sci Rep       Date:  2022-03-04       Impact factor: 4.379

8.  A novel mechanism for A-to-I RNA-edited AZIN1 in promoting tumor angiogenesis in colorectal cancer.

Authors:  Yan Wei; Haowan Zhang; Qiaohui Feng; Shumin Wang; Youcheng Shao; Jie Wu; Ge Jin; Weiwei Lin; Xinxin Peng; Xiaoyan Xu
Journal:  Cell Death Dis       Date:  2022-04-02       Impact factor: 8.469

9.  Altered expression of ACOX2 in non-small cell lung cancer.

Authors:  Jane S Y Sui; Petra Martin; Anna Keogh; Pierre Murchan; Lisa Ryan; Siobhan Nicholson; Sinead Cuffe; Pilib Ó Broin; Stephen P Finn; Gerard J Fitzmaurice; Ronan Ryan; Vincent Young; Steven G Gray
Journal:  BMC Pulm Med       Date:  2022-08-23       Impact factor: 3.320

10.  Immunological and prognostic analysis of PSENEN in low-grade gliomas: An immune infiltration-related prognostic biomarker.

Authors:  Kaijie Chen; Beibei Liang; Wenhao Ma; Guoqing Wan; Bing Chen; Changlian Lu; Yuzhou Luo; Xuefeng Gu
Journal:  Front Mol Neurosci       Date:  2022-07-28       Impact factor: 6.261

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.