| Literature DB >> 36263428 |
Giulia Carangelo1, Alberto Magi2, Roberto Semeraro3.
Abstract
Single cell RNA sequencing (scRNA-seq) is today a common and powerful technology in biomedical research settings, allowing to profile the whole transcriptome of a very large number of individual cells and reveal the heterogeneity of complex clinical samples. Traditionally, cells have been classified by their morphology or by expression of certain proteins in functionally distinct settings. The advent of next generation sequencing (NGS) technologies paved the way for the detection and quantitative analysis of cellular content. In this context, transcriptome quantification techniques made their advent, starting from the bulk RNA sequencing, unable to dissect the heterogeneity of a sample, and moving to the first single cell techniques capable of analyzing a small number of cells (1-100), arriving at the current single cell techniques able to generate hundreds of thousands of cells. As experimental protocols have improved rapidly, computational workflows for processing the data have also been refined, opening up to novel methods capable of scaling computational times more favorably with the dataset size and making scRNA-seq much better suited for biomedical research. In this perspective, we will highlight the key technological and computational developments which have enabled the analysis of this growing data, making the scRNA-seq a handy tool in clinical applications.Entities:
Keywords: RNA sequencing; biomedical applications; single cell; spatial transcriptomics; technological evolution; transcriptomics
Year: 2022 PMID: 36263428 PMCID: PMC9575985 DOI: 10.3389/fgene.2022.994069
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Noteworthy technologies that have allowed to profile large numbers of cells in parallel. Starting from manual isolation methods, a jump to ∼100 cells was enabled by sample multiplexing and than the development of integrated fluidic circuits increased these numbers to an order of magnitude. Next, the introduction of nanodroplet technologies increased throughput even further to hundreds of thousands of cells, as for in situ barcoding which favoured the development of spatial methods.
Raw data processing tools.
| Name | Alignment | QC | Count | CC | PL | References | |
|---|---|---|---|---|---|---|---|
| Pipelines | CellRanger | x | x | x | x | R/Python |
|
| UMI-tools | x | x | x | x | Python |
| |
| scPipe | x | x | x | x | C++/R |
| |
| zUMIs | x | x | x | x | R/Perl |
| |
| dropEst | x | x | x | x | C++ |
| |
| Optimus | x | x | x | x | Python/C++ | ||
| Tools | STAR | x | x | x | x | C/C++ |
|
| HISAT2 | x | - | - | - | C/C++ |
| |
| kallisto | - | - | x | - | C/C++ |
| |
| FastQC | - | x | - | - | Java | ||
| HTSeq | - | x | x | - | Python |
| |
| featureCount | - | - | x | - | C |
| |
| EmptyDrops | - | - | - | x | R |
|
QC, quality check; CC, cell calling; PL, programming language.
FIGURE 2Overview of the workflow. The count matrix undergo preprocessing and expression analysis. Boxes are ordered according data analysis flow.
Analysis tools.
| Preprocessing | Expression analysis | PL | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Name | QC | N | BC | DR | V | C | DE | TI | References | |||
| Pipelines | CellRanger | x | x | x | x | x | x | x | - | R/Python |
| |
| Scanpy | x | x | x | x | x | x | x | x | Python |
| ||
| Seurat | x | x | x | x | x | x | x | - | R |
| ||
| SCell | x | x | x | x | x | x | x | x | Matlab |
| ||
| scater | x | x | x | x | x | x | x | x | R |
| ||
| Pagoda2 | x | x | x | x | x | x | x | - | R |
| ||
| Tools | Doublet Finder | x | - | - | - | - | - | - | - | R |
| |
| Scrublet | x | - | - | - | - | - | - | - | Python |
| ||
| scds | x | - | - | - | - | - | - | - | R |
| ||
| scran | x | x | - | - | - | - | - | - | R |
| ||
| SCnorm | - | x | - | - | - | - | - | - | R | |||
| bioinfokit | - | x | - | - | - | - | - | - | R |
| ||
| ComBat | - | - | x | - | - | - | - | - | R |
| ||
| mnnCorrect | - | - | x | - | - | - | - | - | R |
| ||
| Harmony | - | - | x | - | - | - | - | - | R |
| ||
| BBKNN | - | - | x | - | - | - | - | - | Python |
| ||
| SAUCIE | - | - | x | x | x | x | - | - | Python |
| ||
| scVI | - | - | x | x | - | - | x | - | Python |
| ||
| PCA | - | - | - | x | - | - | - | - | Python |
| ||
| t-SNE | - | - | - | x | x | - | - | - | Python/R |
| ||
| UMAP | - | - | - | x | x | - | - | - | Python/R |
| ||
| Louvain | - | - | - | - | - | x | - | - | Python/R |
| ||
| Leiden | - | - | - | - | - | x | - | - | Python/R |
| ||
| MAST | - | - | - | - | - | - | x | - | R |
| ||
| scCODE | - | - | - | - | - | - | x | - | R |
| ||
| Slingshot | - | - | - | - | - | - | - | x | R |
| ||
| DPT | - | - | - | - | - | - | - | x | Python |
| ||
| Whishbone | - | x | - | x | x | - | x | x | Python |
| ||
| Monocle2 | - | x | x | x | x | x | x | x | R |
| ||
| Monocle3 | - | x | x | x | x | x | x | x | R |
| ||
| velocyto | - | x | x | x | x | x | x | x | Python/R |
| ||
| scVelo | - | x | x | x | x | x | x | x | Python |
| ||
QC, quality check; N, normalization; BC, batch correction; DR, dimensionality reduction; V, visualization; C, clustering; DE, differential expression; TI, trajectory inference; PL, programming language.