| Literature DB >> 35536286 |
Andrew Jiang1, Klaus Lehnert1, Linya You2, Russell G Snell1.
Abstract
Here we present ICARUS, a web server to enable users without experience in R to undertake single cell RNA-seq analysis. The focal point of ICARUS is its intuitive tutorial-style user interface, designed to guide logical navigation through the multitude of pre-processing, analysis and visualization steps. ICARUS is easily accessible through a dedicated web server (https://launch.icarus-scrnaseq.cloud.edu.au/) and avoids installation of software on the user's computer. Notable features include the facility to apply quality control thresholds and adjust dimensionality reduction and cell clustering parameters. Data is visualized through 2D/3D UMAP and t-SNE plots and may be curated to remove potential confounders such as cell cycle heterogeneity. ICARUS offers flexible differential expression analysis with user-defined cell groups and gene set enrichment analysis to identify likely affected biological pathways. Eleven organisms including human, dog, mouse, rat, zebrafish, fruit fly, nematode, yeast, cattle, chicken and pig are currently supported. Visualization of multimodal data including those generated by CITE-seq and the 10X Genomics Multiome kit is included. ICARUS incorporates a function to save the current state of analysis avoiding computationally intensive steps during repeat analysis. The complete analysis of a typical single cell RNA-seq dataset by inexperienced users may be achieved in 1-2 h.Entities:
Year: 2022 PMID: 35536286 PMCID: PMC9252722 DOI: 10.1093/nar/gkac322
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Summary of R packages used for each ICARUS step
| Step in ICARUS | Main command | R packages | Reference |
|---|---|---|---|
| Quality control | - | Seurat | ( |
| Integration of second dataset | Seurat::FindIntegrationAnchors | Seurat, Harmony | ( |
| Dimensionality reduction | Seurat::FindVariableFeatures | Seurat | ( |
| Clustering | Seurat::FindNeighbours | Seurat | ( |
| Data correction | Seurat::CellCycleScoring | Seurat | ( |
| Labelling clusters | SingleR::SingleR | SingleR, Celldex | ( |
| Multimodal analysis | - | Seurat | ( |
| Differential expression analysis | Seurat::FindMarkers | Seurat | ( |
| Pathway analysis | ClusterProfiler::gseGO | ClusterProfiler, ReactomePA | ( |
Figure 1.Flow chart of pre-processing, processing and analysis steps performed in ICARUS.
Example datasets available in ICARUS
| Dataset | Available from… |
|---|---|
| 2,700 Human peripheral blood mononuclear cells (Seurat guided clustering tutorial dataset) | 10x genomics |
| 500 human peripheral blood mononuclear cells | 10x genomics |
| 5,000 cells from a combined cortex, hippocampus and subventricular zone of an E18 mouse | 10x genomics |
| 8,617 cord blood mononuclear cells (Seurat multimodal data tutorial), this dataset also contains CITE-seq data for 11 surface proteins. | NCBI GEO |
| 10k Human PBMCs Multiplexed, 2 CMOs (CellPlex) | 10x genomics |
Figure 2.Comparison of cell clustering and annotation. ICARUS employs the Seurat clustering algorithm. Clustering outcome comparison between Seurat clustering tutorial vignette (A) and ICARUS (B) on a dataset of 2700 Human PBMCs. Cell cluster labelling in ICARUS by comparison of cluster marker genes to the Database of Immune Cell Expression.
Figure 3.User selected differential expression analyses and gene set enrichment. (A) ICARUS provides the user the option to interactively select customised cell groups using a lasso select function for differential expression tests. (B) Extended visualizations of enriched terms (gene set enrichment analysis) including a dot plot ordered by gene ratio, a gene concept network showcasing genes involved with enriched terms and an enrichment map consisting of a network of enriched terms with edges connecting overlapping gene sets. Enriched terms may also be visualised individually as gene pathways (B cell activation pathway shown in figure).