UCSC Cell Browser: Visualize Your Single-Cell Data.

Matthew L Speir, Aparna Bhaduri, Nikolay S Markov, Pablo Moreno, Tomasz J Nowakowski, Irene Papatheodorou, Alex A Pollen, Brian J Raney, Lucas Seninge, W James Kent, Maximilian Haeussler.

Abstract

SUMMARY: As the use of single-cell technologies has grown, so has the need for tools to explore these large, complicated datasets. The UCSC Cell Browser is a tool that allows scientists to visualize gene expression and metadata annotation distribution throughout a single-cell dataset or multiple datasets.
AVAILABILITY AND IMPLEMENTATION: We provide the UCSC Cell Browser as a free website where scientists can explore a growing collection of single-cell datasets, and as a freely available Python package with which scientists can create stable, self-contained visualizations for their own single-cell datasets. Learn more at https://cells.ucsc.edu.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2021. Published by Oxford University Press.

Year:  2021        PMID: 34244710      PMCID: PMC8652023          DOI: 10.1093/bioinformatics/btab503

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


1 Background

Single-cell RNA-seq assays allow gene expression to be explored in unprecedented detail, whether for surveying cellular diversity in organs (Soraggi et al.) or for characterizing cellular states in development (de Soysa et al.) and disease (Kathiriya et al.). As a result, the number of publications using a single-cell RNA-seq assay has grown exponentially since 2010 (Supplementary Fig. S1). This growth has created a need for interactive tools that allow scientists to explore these complex datasets at a high level before diving deeper into computational analysis with collaborators, and to share them with the scientific community after publication.

Analysis of single-cell datasets typically begins with metadata and an expression matrix, which are then taken through a few standard steps: (i) normalization, (ii) dimensionality reduction, (iii) clustering, (iv) marker gene identification and cluster labeling and (v) visualization and sharing. Expression matrices are often normalized, trimmed or batch-corrected. They are then mapped from many dimensions to just two, as x, y coordinates, by dimensionality reduction algorithms like tSNE (van der Maaten and Hinton, 2008) and UMAP (Dorrity et al.). These algorithms attempt to retain much of the original cell similarity structure. The expression matrix is also fed to a clustering algorithm in an effort to find groups of similar cells. Clusters are usually annotated manually using known cell type markers. In most research groups, cluster annotation happens by visualizing well-known marker genes on dimensionality reduction plots in the UCSC Cell Browser, Seurat, Scanpy or one of many other tools (see below).
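Two of the standard steps listed above, normalization (i) and marker gene identification (iv), can be illustrated with a minimal sketch in plain Python. This is not the method used by Seurat, Scanpy or the UCSC Cell Browser: the scaling factor of 10,000, the mean-difference marker score, the function names and the toy counts are all assumptions chosen for illustration, and the cluster labels are taken as given rather than computed.

```python
import math


def normalize(counts, scale=10_000):
    """Library-size normalize each cell, then apply log(1 + x).

    counts maps cell ID -> {gene: raw count}. The scale factor is an
    illustrative assumption, not a value taken from the paper.
    """
    normalized = {}
    for cell, genes in counts.items():
        total = sum(genes.values())
        normalized[cell] = {
            gene: math.log1p(value / total * scale) if total else 0.0
            for gene, value in genes.items()
        }
    return normalized


def rank_markers(expr, clusters, cluster):
    """Rank genes by mean expression inside a cluster minus the mean outside it."""
    inside = [c for c in expr if clusters[c] == cluster]
    outside = [c for c in expr if clusters[c] != cluster]
    genes = sorted({g for c in expr for g in expr[c]})

    def mean(cells, gene):
        if not cells:
            return 0.0
        return sum(expr[c].get(gene, 0.0) for c in cells) / len(cells)

    scores = {g: mean(inside, g) - mean(outside, g) for g in genes}
    return sorted(scores, key=scores.get, reverse=True)


# Toy raw counts (hypothetical values; gene names serve only as examples).
raw = {
    "cell1": {"HES1": 9, "GAPDH": 1},
    "cell2": {"HES1": 8, "GAPDH": 2},
    "cell3": {"HES1": 1, "GAPDH": 9},
    "cell4": {"HES1": 0, "GAPDH": 10},
}
# Cluster labels are assumed to come from an earlier clustering step.
clusters = {"cell1": "A", "cell2": "A", "cell3": "B", "cell4": "B"}

expr = normalize(raw)
top_markers = rank_markers(expr, clusters, "A")
print(top_markers[0])  # HES1
```

In practice these steps are carried out with dedicated packages that use more robust statistics; the sketch only shows the shape of the data flowing from raw counts to a ranked marker list.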

2 Features

The UCSC Cell Browser allows scientists to visualize the output of these single-cell analysis methods. Its primary display is a two-dimensional scatter plot, most commonly the output of tSNE or UMAP dimensionality reductions (Fig. 1, upper panel center). As a single dataset may have several sets of x, y coordinates, scientists can switch between different layouts. Scientists can pan and zoom along the plane, similar to navigating within geographic map viewers. Cells can be colored based on provided annotations (e.g. cell type, age) or by gene expression using several built-in color palettes. The display can be split into two panes, offering a side-by-side view that allows comparison between metadata attributes or genes (View > Split; Fig. 1, upper panel). Cells can be selected either by visually selecting groups or by combining one or more metadata-based filters (Edit > Find Cells). The identifiers of selected cells can be exported for use in other analyses (Edit > Export Cells).
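The logic behind combining metadata filters and exporting the matching cell identifiers can be sketched in a few lines of Python. This is only an illustration of the idea, not the browser's implementation: the function name find_cells, the column name cellId, the age values and the cell identifiers are hypothetical, while the field WGCNACluster and the value RG-div1 are borrowed from the example dataset shown in Fig. 1.

```python
import csv
import io


def find_cells(meta_rows, **filters):
    """Return IDs of cells whose metadata match every field=value filter."""
    return [
        row["cellId"]
        for row in meta_rows
        if all(row.get(field) == value for field, value in filters.items())
    ]


# Hypothetical tab-separated metadata table, one row per cell.
meta_tsv = (
    "cellId\tWGCNACluster\tage\n"
    "c1\tRG-div1\tGW16\n"
    "c2\tRG-div1\tGW18\n"
    "c3\tother\tGW16\n"
    "c4\tRG-div1\tGW16\n"
)
meta_rows = list(csv.DictReader(io.StringIO(meta_tsv), delimiter="\t"))

# Combine two metadata-based filters; a cell must satisfy both.
selected = find_cells(meta_rows, WGCNACluster="RG-div1", age="GW16")

# Export one identifier per line, ready to be read by another tool.
export_text = "\n".join(selected) + "\n"
print(export_text, end="")
```

A list of identifiers in this form can be used to subset the expression matrix in a downstream analysis package.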
Fig. 1.

The UCSC Cell Browser interface showing a dataset focused on human cortex development. In the center of the screen, the primary layout view has been split into two vertical panes. The left pane shows the cells in this dataset colored by their metadata value for the field “WGCNACluster” (e.g. RG-div1). The right pane shows the same cells colored by their expression of the gene HES1, with cells of higher expression colored dark red. On the far right side of the screen, a legend shows how the colors are associated with expression bins; below this is a violin plot of the expression of this gene in a set of selected cells (those outlined in black in the center of the screen) versus all other cells in the dataset. On the far left side of the screen is a list of the metadata annotation fields available for this dataset. Each one can be clicked to color the plot by the values in that field. Below the two vertical panes, a heatmap shows 16 different genes that the authors of this dataset consider important “dataset genes”.

In addition to a two-dimensional scatter plot, the tool provides other ways to explore single-cell data. Datasets are typically accompanied by a curated list of ‘dataset genes’ for coloring the scatter plot. When enabled, the heatmap view shows the expression of these dataset genes across labeled clusters (View > Heatmap; Fig. 1, lower panel). After cells have been selected, histograms can show the distribution of their metadata values, and violin plots can compare the expression of a gene in the selection against all other cells in the dataset or against user-defined ‘background’ cells. The UCSC Cell Browser can be used to display any high-dimensional data, such as single-cell ATAC-seq data (Supplementary Fig. S2) or bulk RNA-seq datasets, like those from the UCSC Treehouse Childhood Cancer group (https://treehouse.cells.ucsc.edu).
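The kind of table that underlies a cluster-by-gene heatmap can be sketched as follows. The text does not state which summary statistic the heatmap view uses, so the per-cluster mean here is an assumption, as are the function name cluster_means, the gene SOX2, the cluster label "other" and all expression values.

```python
from collections import defaultdict


def cluster_means(expr, clusters, genes):
    """Mean expression of each gene within each cluster.

    expr maps cell ID -> {gene: value}; clusters maps cell ID -> label.
    Returns {cluster label: [mean for each gene, in the order of genes]}.
    """
    cells_by_cluster = defaultdict(list)
    for cell, label in clusters.items():
        cells_by_cluster[label].append(cell)
    return {
        label: [
            sum(expr[c].get(g, 0.0) for c in cells) / len(cells)
            for g in genes
        ]
        for label, cells in sorted(cells_by_cluster.items())
    }


# Hypothetical normalized expression values for three cells.
expr = {
    "c1": {"HES1": 4.0, "SOX2": 2.0},
    "c2": {"HES1": 2.0, "SOX2": 0.0},
    "c3": {"HES1": 0.0, "SOX2": 6.0},
}
clusters = {"c1": "RG-div1", "c2": "RG-div1", "c3": "other"}
dataset_genes = ["HES1", "SOX2"]

heatmap = cluster_means(expr, clusters, dataset_genes)
for label, row in heatmap.items():
    print(label, row)
```

Each row of the result corresponds to one labeled cluster and each column to one dataset gene, which is the layout a heatmap renderer would color.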

3 Comparison to other available tools

Data from single-cell experiments can be visualized using a number of tools in addition to the visualization options available through the analysis packages Scanpy (Wolf et al.) and Seurat (Butler et al.; Stuart et al.). A recent paper (Çakır et al.) compares and contrasts 13 different solutions, including the UCSC Cell Browser. Features that set ours apart from the others are the ability to host many datasets on a single instance arranged as a hierarchy, a simple installation procedure that requires no special server infrastructure (e.g. Flask, Shiny), and built-in converters for many data formats.

4 Setting up an instance

A cell browser can be built from plain text files (expression matrix, metadata annotations, layout coordinates), but we also provide utilities to import data from Seurat, Scanpy, Cellranger (Zheng et al.), Loom (http://loompy.org/) and other file formats. Both Seurat and Scanpy provide functions to export data and build a Cell Browser instance. Installation instructions and other documentation are available at https://cellbrowser.readthedocs.io/. The UCSC Cell Browser can also be used as a Galaxy (https://galaxyproject.org/; Afgan et al.) tool for data analyzed in, or imported into, an instance where it has been installed. The module is available at the public Human Cell Atlas (HCA) Galaxy instance at https://humancellatlas.usegalaxy.eu/, where it can be used to visualize reanalyzed data from the HCA (Regev et al.) and the Single Cell Expression Atlas (Papatheodorou et al.), as part of the SCiAp setup (Moreno et al.). The Bioconda (Grüning et al.) module has been installed more than 3.4k times and the Galaxy module, https://toolshed.g2.bx.psu.edu/view/ebi-gxa/ucsc_cell_browser, has been cloned to ∼185 Galaxy instances around the world.
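The three plain text inputs mentioned above (expression matrix, metadata annotations, layout coordinates) can be sketched with a short Python script that writes toy versions of each, together with a small configuration file. The file names, column headers, matrix orientation (genes as rows, cells as columns) and configuration keys below are assumptions modeled on the project documentation and should be checked against https://cellbrowser.readthedocs.io/ before use; all values are invented.

```python
import csv
import tempfile
from pathlib import Path


def write_tsv(path, header, rows):
    """Write a header line and data rows as a tab-separated file."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
        writer.writerow(header)
        writer.writerows(rows)


# Work in a fresh temporary directory so nothing is overwritten.
out_dir = Path(tempfile.mkdtemp(prefix="cellbrowser_demo_"))

# Expression matrix: one row per gene, one column per cell (assumed layout).
write_tsv(out_dir / "exprMatrix.tsv", ["gene", "c1", "c2", "c3"],
          [["HES1", 4, 2, 0], ["SOX2", 2, 0, 6]])

# Metadata annotations: one row per cell, first column is the cell ID.
write_tsv(out_dir / "meta.tsv", ["cellId", "cluster"],
          [["c1", "RG-div1"], ["c2", "RG-div1"], ["c3", "other"]])

# Layout coordinates: x, y position of each cell in one 2D embedding.
write_tsv(out_dir / "umap.coords.tsv", ["cellId", "x", "y"],
          [["c1", 0.1, 0.2], ["c2", 0.3, 0.1], ["c3", 2.5, 1.9]])

# Configuration tying the files together (key names are assumptions).
conf_lines = [
    'name = "demo-dataset"',
    'shortLabel = "Demo dataset"',
    'exprMatrix = "exprMatrix.tsv"',
    'meta = "meta.tsv"',
    'coords = [{"file": "umap.coords.tsv", "shortLabel": "UMAP"}]',
    'clusterField = "cluster"',
    'labelField = "cluster"',
]
(out_dir / "cellbrowser.conf").write_text("\n".join(conf_lines) + "\n")

print(sorted(p.name for p in out_dir.iterdir()))
```

With the Python package installed, a directory of this kind would then be passed to the package's build command; the exact invocation and the full list of supported settings are described in the documentation.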

5 Recent developments and future work

The website currently has 378 single-cell datasets arranged into 136 top-level projects, over 90 of which were added in the last 12 months (Supplementary Table S1). Our ATAC-seq support has been expanded in the past few months, with the ability to search genes and select nearby peaks for coloring (Supplementary Fig. S2). We have also added the ability to associate large microscopy images with a dataset (Supplementary Fig. S3). During the next few months, we plan to further improve ATAC-seq support and custom annotations. Over the next year, we intend to add support for running analysis algorithms on selected cells on the fly.

Funding

This work was supported by the National Human Genome Research Institute [5U41HG002371 to M.H., W.J.K., M.L.S., B.J.R., 1U41HG010972 to M.H., 5R01HG010329 to W.J.K.]; National Institutes of Health [U01MH114825 to W.J.K., K99 NS111731 to A.B., RF1MH121268 to T.J.N.]; National Institute of Mental Health [DP2MH122400 to A.A.P.]; Silicon Valley Community Foundation [2017-171531(5022) to M.H., W.J.K., M.L.S.]; California Institute for Regenerative Medicine [GC1R-06673-C to M.H., W.J.K., M.L.S., GC1R-06673-B to L.S.]; University of California Office of the President Emergency COVID-19 Research Seed Funding [R00RG2456 to M.H.]; Chan Zuckerberg Initiative Foundation [CZF2019-002438 to N.S.M., 2018-183498 to P.M., I.P., 2018-182800 to L.S.]; Simons Foundation [SFARI 491371 to T.J.N.]; Brain and Behavior Research Foundation [NARSAD Young Investigator Grant to T.J.N.]; and gifts from Schmidt Futures and the William K. Bowes Jr Foundation to T.J.N.

Conflict of Interest: none declared.

Data availability

No new data were generated or analysed in support of this research.
References

1.  Bioconda: sustainable and comprehensive software distribution for the life sciences.

Authors:  Björn Grüning; Ryan Dale; Andreas Sjödin; Brad A Chapman; Jillian Rowe; Christopher H Tomkins-Tinch; Renan Valieris; Johannes Köster
Journal:  Nat Methods       Date:  2018-07       Impact factor: 28.547

2.  Integrating single-cell transcriptomic data across different conditions, technologies, and species.

Authors:  Andrew Butler; Paul Hoffman; Peter Smibert; Efthymia Papalexi; Rahul Satija
Journal:  Nat Biotechnol       Date:  2018-04-02       Impact factor: 54.908

3.  Evaluating genetic causes of azoospermia: What can we learn from a complex cellular structure and single-cell transcriptomics of the human testis?

Authors:  Samuele Soraggi; Meritxell Riera; Ewa Rajpert-De Meyts; Mikkel H Schierup; Kristian Almstrup
Journal:  Hum Genet       Date:  2020-01-16       Impact factor: 4.132

4.  User-friendly, scalable tools and workflows for single-cell RNA-seq analysis.

Authors:  Pablo Moreno; Ni Huang; Jonathan R Manning; Suhaib Mohammed; Andrey Solovyev; Krzysztof Polanski; Wendi Bacon; Ruben Chazarra; Carlos Talavera-López; Maria A Doyle; Guilhem Marnier; Björn Grüning; Helena Rasche; Nancy George; Silvie Korena Fexova; Mohamed Alibi; Zhichao Miao; Yasset Perez-Riverol; Maximilian Haeussler; Alvis Brazma; Sarah Teichmann; Kerstin B Meyer; Irene Papatheodorou
Journal:  Nat Methods       Date:  2021-04       Impact factor: 28.547

5.  Modeling Human TBX5 Haploinsufficiency Predicts Regulatory Networks for Congenital Heart Disease.

Authors:  Irfan S Kathiriya; Kavitha S Rao; Giovanni Iacono; W Patrick Devine; Andrew P Blair; Swetansu K Hota; Michael H Lai; Bayardo I Garay; Reuben Thomas; Henry Z Gong; Lauren K Wasson; Piyush Goyal; Tatyana Sukonnik; Kevin M Hu; Gunes A Akgun; Laure D Bernard; Brynn N Akerberg; Fei Gu; Kai Li; Matthew L Speir; Maximilian Haeussler; William T Pu; Joshua M Stuart; Christine E Seidman; J G Seidman; Holger Heyn; Benoit G Bruneau
Journal:  Dev Cell       Date:  2020-12-14       Impact factor: 12.270

6.  SCANPY: large-scale single-cell gene expression data analysis.

Authors:  F Alexander Wolf; Philipp Angerer; Fabian J Theis
Journal:  Genome Biol       Date:  2018-02-06       Impact factor: 13.583

7.  The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update.

Authors:  Enis Afgan; Dannon Baker; Bérénice Batut; Marius van den Beek; Dave Bouvier; Martin Cech; John Chilton; Dave Clements; Nate Coraor; Björn A Grüning; Aysam Guerler; Jennifer Hillman-Jackson; Saskia Hiltemann; Vahid Jalili; Helena Rasche; Nicola Soranzo; Jeremy Goecks; James Taylor; Anton Nekrutenko; Daniel Blankenberg
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

8.  The Human Cell Atlas.

Authors:  Aviv Regev; Sarah A Teichmann; Eric S Lander; Ido Amit; Christophe Benoist; Ewan Birney; Bernd Bodenmiller; Peter Campbell; Piero Carninci; Menna Clatworthy; Hans Clevers; Bart Deplancke; Ian Dunham; James Eberwine; Roland Eils; Wolfgang Enard; Andrew Farmer; Lars Fugger; Berthold Göttgens; Nir Hacohen; Muzlifah Haniffa; Martin Hemberg; Seung Kim; Paul Klenerman; Arnold Kriegstein; Ed Lein; Sten Linnarsson; Emma Lundberg; Joakim Lundeberg; Partha Majumder; John C Marioni; Miriam Merad; Musa Mhlanga; Martijn Nawijn; Mihai Netea; Garry Nolan; Dana Pe'er; Anthony Phillipakis; Chris P Ponting; Stephen Quake; Wolf Reik; Orit Rozenblatt-Rosen; Joshua Sanes; Rahul Satija; Ton N Schumacher; Alex Shalek; Ehud Shapiro; Padmanee Sharma; Jay W Shin; Oliver Stegle; Michael Stratton; Michael J T Stubbington; Fabian J Theis; Matthias Uhlen; Alexander van Oudenaarden; Allon Wagner; Fiona Watt; Jonathan Weissman; Barbara Wold; Ramnik Xavier; Nir Yosef
Journal:  Elife       Date:  2017-12-05       Impact factor: 8.140

9.  Dimensionality reduction by UMAP to visualize physical and genetic interactions.

Authors:  Michael W Dorrity; Lauren M Saunders; Christine Queitsch; Stanley Fields; Cole Trapnell
Journal:  Nat Commun       Date:  2020-03-24       Impact factor: 14.919
