Literature DB >> 29228172

SPRING: a kinetic interface for visualizing high dimensional single-cell expression data.

Caleb Weinreb1, Samuel Wolock1, Allon M Klein1.   

Abstract

Motivation: Single-cell gene expression profiling technologies can map the cell states in a tissue or organism. As these technologies become more common, there is a need for computational tools to explore the data they produce. In particular, visualizing continuous gene expression topologies can be improved, since current tools tend to fragment gene expression continua or capture only limited features of complex population topologies.
Results: Force-directed layouts of k-nearest-neighbor graphs can visualize continuous gene expression topologies in a manner that preserves high-dimensional relationships and captures complex population topologies. We describe SPRING, a pipeline for data filtering, normalization and visualization using force-directed layouts and show that it reveals more detailed biological relationships than existing approaches when applied to branching gene expression trajectories from hematopoietic progenitor cells and cells of the upper airway epithelium. Visualizations from SPRING are also more reproducible than those of stochastic visualization methods such as tSNE, a state-of-the-art tool. We provide SPRING as an interactive web-tool with an easy to use GUI. Availability and implementation: https://kleintools.hms.harvard.edu/tools/spring.html, https://github.com/AllonKleinLab/SPRING/. Contact: calebsw@gmail.com or allon_klein@hms.harvard.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29228172      PMCID: PMC6030950          DOI: 10.1093/bioinformatics/btx792

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Recent advances in single-cell RNA sequencing (scSeq) have made it possible to catalog the expression of every gene in every cell from a given sample with reasonable accuracy. There is now a need for computational tools to explore and visualize this high-dimensional data, and in particular to capture the continuous trajectories of cell in gene expression space. K-nearest-neighbor (knn) graphs have proven useful for analyzing continuous cell topologies (Bendall ; Setty ; Xu and Su, 2015), and one study proposed the use of knn graphs for visualization and data clustering (Islam ). In a knn graph, each cell is a node that extends edges to the k other nodes with most similar gene expression. We have found that interactively exploring graph topology, overlaid with gene expression or other annotations, provides a powerful approach to uncover biological processes emerging from data. However, at present there are no publicly available tools for interactive visualization of scSeq data in a graph format. Here, we present a user-friendly web tool called SPRING. To use the tool, users must supply a table of gene expression measurements for single-cells and can optionally upload additional annotations. SPRING builds a knn graph from this data and displays the graph using a force-directed layout algorithm that renders as a real-time simulation in an interactive viewing window. We include a set of features for open-ended data exploration, including interactive discovery of marker genes; gene expression comparisons between different sub-populations and selection tools for isolating sub-populations of interest. SPRING is compatible with all major web browsers and does not require technical knowledge to operate.

2 Materials and methods

To generate the knn graph, SPRING performs the following transformations to the inputted gene expression matrix. All parameters labeled ‘X’ in this section can be adjusted using an interactive web form. (1) Filter all cells with fewer than X reads; (2) cell normalization so that every cell has the same total reads; (3) filter genes with ¡ X mean expression or < X coefficient of variation; (4) Z-score normalize expression values for each gene; (5) perform principal components analysis, keep the top X principal components and (6) compute a distance matrix and output a knn-graph with k = X. One can also conceive of other choices for each step of filtering, normalization, dimensionality reduction and distance metric used. SPRING is demonstrated in two examples in Figure 1. The underlying datasets are being published in separate research papers (in submission), and will be available at https://kleintools.hms.harvard.edu/tools/spring.html.
Fig. 1.

(A) SPRING depicts the dynamic trajectories of hematopoietic progenitor cells as they differentiate from stem cells (HSCs; black circle) into each of seven lineages (colored arms; lineage identities are described in a separate publication, in submission). In contrast, tSNE (B) and diffusion map (C) visualizations of the same data show disconnected clusters of cells or do not capture the full complexity of the data in two dimensions. (D) SPRING and tSNE plots of upper airway epithelium cells from three human donors highlight the reproducibility of SPING visualizations. Cells in (A–D) are colored by marker gene scores. Detailed methodology for producing all plots is available in the Supplementary Material

(A) SPRING depicts the dynamic trajectories of hematopoietic progenitor cells as they differentiate from stem cells (HSCs; black circle) into each of seven lineages (colored arms; lineage identities are described in a separate publication, in submission). In contrast, tSNE (B) and diffusion map (C) visualizations of the same data show disconnected clusters of cells or do not capture the full complexity of the data in two dimensions. (D) SPRING and tSNE plots of upper airway epithelium cells from three human donors highlight the reproducibility of SPING visualizations. Cells in (A–D) are colored by marker gene scores. Detailed methodology for producing all plots is available in the Supplementary Material The SPRING GUI is currently configured for datasets up to 10 000 cells and becomes very slow for larger datasets because of poor scalability of the graph rendering method and the computational burden of computing the force layout. In principle, these can be improved, for example by using the ForceAtlas2 algorithm (Jacomy ). In the meantime, large datasets can be accommodated by coarse-graining cells. A procedure to do so is described in the Supplementary Material and shown for an example dataset in Supplementary Figure S5. We provide code for coarse-graining on the github page.

3 Advantages over existing methods

3.1 Continuous expression topologies

In contrast to the commonly used method tSNE (Amir ), SPRING captures the long-distance relationships between cells and can, therefore, visualize continuous expression topologies. For example, SPRING accurately maps the branching topology of hematopoietic progenitor cells as they differentiate along seven lineages (Fig. 1A). Though a diffusion map (Haghverdi ) visualization (Fig. 1C) can usually capture continuous gene expression trajectories, it often requires more than two diffusion components to distinguish all lineages, preventing a full representation of the data complexity in a single two dimensional plot.

3.2 Graph invariance

One drawback of tSNE is that it is stochastic and, therefore, not perfectly reproducible. In contrast, graph construction in SPRING is non-stochastic and, therefore, yields consistent topologies between runs and replicates. In addition, manual interaction with the kinetic SPRING interface allows users to bring plots from separate replicates into register with one other (Fig. 1D).

4 Conclusion

Single-cell gene expression profiling is becoming a common tool to dissect cellular heterogeneity and characterize dynamic processes such as differentiation. Interactive visualization tools can help researchers exploit this data more fully. Our easy-to-use web tool, SPRING, provides a simple interface for open-ended investigation of gene expression topology.

Funding

C.W. and S.W. are supported by NIH training grant [5T32GM080177-07] and A.M.K. by NIH Grant 1R33CA212697, a Burroughs-Wellcome Career Award at the Scientific Interface, and by an Edward J Mallinckrodt Foundation Fellowship. Conflict of Interest: none declared. Click here for additional data file.
  7 in total

1.  Single-cell trajectory detection uncovers progression and regulatory coordination in human B cell development.

Authors:  Sean C Bendall; Kara L Davis; El-Ad David Amir; Michelle D Tadmor; Erin F Simonds; Tiffany J Chen; Daniel K Shenfeld; Garry P Nolan; Dana Pe'er
Journal:  Cell       Date:  2014-04-24       Impact factor: 41.582

2.  Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq.

Authors:  Saiful Islam; Una Kjällquist; Annalena Moliner; Pawel Zajac; Jian-Bing Fan; Peter Lönnerberg; Sten Linnarsson
Journal:  Genome Res       Date:  2011-05-04       Impact factor: 9.043

3.  Diffusion maps for high-dimensional single-cell analysis of differentiation data.

Authors:  Laleh Haghverdi; Florian Buettner; Fabian J Theis
Journal:  Bioinformatics       Date:  2015-05-21       Impact factor: 6.937

4.  Identification of cell types from single-cell transcriptomes using a novel clustering method.

Authors:  Chen Xu; Zhengchang Su
Journal:  Bioinformatics       Date:  2015-02-11       Impact factor: 6.937

5.  viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia.

Authors:  El-ad David Amir; Kara L Davis; Michelle D Tadmor; Erin F Simonds; Jacob H Levine; Sean C Bendall; Daniel K Shenfeld; Smita Krishnaswamy; Garry P Nolan; Dana Pe'er
Journal:  Nat Biotechnol       Date:  2013-05-19       Impact factor: 54.908

6.  ForceAtlas2, a continuous graph layout algorithm for handy network visualization designed for the Gephi software.

Authors:  Mathieu Jacomy; Tommaso Venturini; Sebastien Heymann; Mathieu Bastian
Journal:  PLoS One       Date:  2014-06-10       Impact factor: 3.240

7.  Wishbone identifies bifurcating developmental trajectories from single-cell data.

Authors:  Manu Setty; Michelle D Tadmor; Shlomit Reich-Zeliger; Omer Angel; Tomer Meir Salame; Pooja Kathail; Kristy Choi; Sean Bendall; Nir Friedman; Dana Pe'er
Journal:  Nat Biotechnol       Date:  2016-05-02       Impact factor: 54.908

  7 in total
  107 in total

1.  The dynamics of gene expression in vertebrate embryogenesis at single-cell resolution.

Authors:  James A Briggs; Caleb Weinreb; Daniel E Wagner; Sean Megason; Leonid Peshkin; Marc W Kirschner; Allon M Klein
Journal:  Science       Date:  2018-04-26       Impact factor: 47.728

2.  Lung megakaryocytes display distinct transcriptional and phenotypic properties.

Authors:  Anthony K Yeung; Carlos Villacorta-Martin; Stephanie Hon; Jason R Rock; George J Murphy
Journal:  Blood Adv       Date:  2020-12-22

3.  Single-Cell Proteomics Reveal that Quantitative Changes in Co-expressed Lineage-Specific Transcription Factors Determine Cell Fate.

Authors:  Carmen G Palii; Qian Cheng; Mark A Gillespie; Paul Shannon; Michalina Mazurczyk; Giorgio Napolitani; Nathan D Price; Jeffrey A Ranish; Edward Morrissey; Douglas R Higgs; Marjorie Brand
Journal:  Cell Stem Cell       Date:  2019-03-14       Impact factor: 24.633

4.  scRCMF: Identification of Cell Subpopulations and Transition States From Single-Cell Transcriptomes.

Authors:  Xiaoying Zheng; Suoqin Jin; Qing Nie; Xiufen Zou
Journal:  IEEE Trans Biomed Eng       Date:  2019-08-23       Impact factor: 4.538

5.  Single-Cell Transcriptomics of Human and Mouse Lung Cancers Reveals Conserved Myeloid Populations across Individuals and Species.

Authors:  Rapolas Zilionis; Camilla Engblom; Christina Pfirschke; Virginia Savova; David Zemmour; Hatice D Saatcioglu; Indira Krishnan; Giorgia Maroni; Claire V Meyerovitz; Clara M Kerwin; Sun Choi; William G Richards; Assunta De Rienzo; Daniel G Tenen; Raphael Bueno; Elena Levantini; Mikael J Pittet; Allon M Klein
Journal:  Immunity       Date:  2019-04-09       Impact factor: 31.745

6.  Mapping Distinct Bone Marrow Niche Populations and Their Differentiation Paths.

Authors:  Samuel L Wolock; Indira Krishnan; Danielle E Tenen; Victoria Matkins; Virginia Camacho; Sweta Patel; Puneet Agarwal; Ravi Bhatia; Daniel G Tenen; Allon M Klein; Robert S Welner
Journal:  Cell Rep       Date:  2019-07-09       Impact factor: 9.423

7.  A large pool of actively cycling progenitors orchestrates self-renewal and injury repair of an ectodermal appendage.

Authors:  Amnon Sharir; Pauline Marangoni; Rapolas Zilionis; Mian Wan; Tomas Wald; Jimmy K Hu; Kyogo Kawaguchi; David Castillo-Azofeifa; Leo Epstein; Kyle Harrington; Pierfrancesco Pagella; Thimios Mitsiadis; Christian W Siebel; Allon M Klein; Ophir D Klein
Journal:  Nat Cell Biol       Date:  2019-09-02       Impact factor: 28.824

8.  Single cell transcriptome dynamics from pluripotency to FLK1+ mesoderm.

Authors:  Haiyong Zhao; Kyunghee Choi
Journal:  Development       Date:  2019-12-05       Impact factor: 6.868

Review 9.  Revolutionizing Cancer Immunology: The Power of Next-Generation Sequencing Technologies.

Authors:  Meromit Singer; Ana C Anderson
Journal:  Cancer Immunol Res       Date:  2019-02       Impact factor: 11.151

10.  Cellular Heterogeneity and Lineage Restriction during Mouse Digit Tip Regeneration at Single-Cell Resolution.

Authors:  Gemma L Johnson; Erick J Masias; Jessica A Lehoczky
Journal:  Dev Cell       Date:  2020-02-24       Impact factor: 12.270

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.