| Literature DB >> 32099593 |
Wei Xiong Wen1,2, Adam J Mead1,3, Supat Thongjuea2,3.
Abstract
Alternative splicing of RNAs generates isoform diversity, resulting in different proteins that are necessary for maintaining cellular function and identity. The discovery of alternative splicing has been revolutionized by next-generation transcriptomic sequencing mainly using bulk RNA-sequencing, which has unravelled RNA splicing and mis-splicing of normal cells under steady-state and stress conditions. Single-cell RNA-sequencing studies have focused on gene-level expression analysis and revealed gene expression signatures distinguishable between different cellular types. Single-cell alternative splicing is an emerging area of research with the promise to reveal transcriptomic dynamics invisible to bulk- and gene-level analysis. In this review, we will discuss the technological advances for single-cell alternative splicing analysis, computational strategies for isoform detection and quantitation in single cells, and current applications of single-cell alternative splicing analysis and its potential future contributions to personalized medicine.Entities:
Keywords: Alternative splicing; Isoform; Next-generation sequencing; Percent spliced-in; Single-cell transcriptome analysis
Year: 2020 PMID: 32099593 PMCID: PMC7033300 DOI: 10.1016/j.csbj.2020.01.009
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Summary of computational approaches for detection and quantification of alternative splicing events in single cells.
| Computational method | Software/Statistical method | Read aligner | No. of cells | Cell type(s) | Sample origin | Library preparation | Sequencing platform | Isoform variant analysed | Reference |
|---|---|---|---|---|---|---|---|---|---|
| Developed for bulk short-read RNA-sequencing | MISO | Bowtie | 12 | LNCaP, PC3, T24 | Cell line | Smart-Seq | Genome Analyzer IIx, 150 bp PE | CE | |
| MISO | Bowtie | 18 | BMDCs | Mouse | Smart-Seq | HiSeq 2000, 100 bp PE | CE | ||
| MISO | STAR | 34 | ESCs | Human | Smart-Seq | HiSeq 2000, 100 bp SE | CE | ||
| MISO | Bowtie | 18 | BMDCs | Mouse | Smart-Seq | HiSeq 2000, 100 bp PE | CE | ||
| VAST-TOOLS | TopHat | 66 | Spermatogenic cells | Mouse | Smart-Seq | HiSeq 4000, 150 bp PE | CE, RI, A5SS, A3SS | ||
| bam2ssj | Bowtie | 10 | GM12878 | Cell line | Smart-Seq | HiSeq 2000, 100 bp SE | SJ | ||
| IPSA | TopHat | 40 | HeLa S3 | Cell line | MIRALCS | HiSeq 2000, 150 bp PE, 50 bp SE | SJ | ||
| Developed for single-cell short-read RNA-sequencing | Custom pipeline | ABI whole transcriptome software tool | 1 | Blastomere | Mouse | Modified single-cell cDNA amplification for microarray | SOLiD sequencer, 50 bp PE | SJ | |
| Custom pipeline | ABI whole transcriptome software tool | 33 | ESCs, ICM, Epiblast, ICM outgrowth cells | Mouse | Modified single-cell cDNA amplification for microarray | SOLiD sequencer, 50 bp PE | SJ | ||
| Custom pipeline | Bowtie | 85 | ESCs, embryonic fibroblast | Mouse | STRT | Genome Analyzer IIx, 150 bp PE | SJ | ||
| SingleSplice | GSNAP/GMAP | 182 | ESCs | Mouse | Smart-Seq | HiSeq 2000, 100 bp, PE | Isoform switching | ||
| ISOP | Bowtie | 384 | MDA-MB-231 | Cell line | Smart-Seq | HiSeq 2000, 100 bp PE | Isoform switching | ||
| Logistic regression | Bowtie | 182 | Primary myoblasts | Human | Smart-Seq | HiSeq 2500, 100 bp PE | Isoform switching | ||
| BRIE | HISAT | 40 | ESCs | Mouse | Smart-Seq | HiSeq 2500, bp, SE | CE | ||
| BRIE | STAR | 93 | iPSCs | Human | scM&T-seq | HiSeq 2500 | CE | ||
| BRIE | STAR | 2208 | Oligodendrocytes | Mouse | Smart-Seq | HiSeq X Ten, 50 bp, SE | CE | ||
| BRIE | STAR | 242 | Epithelial breast cancer cells | Human | Smart-Seq | HiSeq 2500, 100 bp PE | CE | ||
| BRIE | Bowtie | 82 | Macrophages | Mouse | Smart-Seq | NextSeq 500, PE | CE | ||
| Expedition | STAR | 63 | iPSCs | Human | Smart-Seq | HiSeq 2000, 100 bp PE | CE, MXE | ||
| Developed for single-cell long-read RNA-sequencing | Custom pipeline | GMAP | 2 | VLMCs | Mouse | STRT | PacBio SMRT | CE, A5SS, A3SS, TSS, TTS | |
| Custom pipeline | STARlong | 6627 | Cerebellar cells | Mouse | Smart-Seq | PacBio SMRT | SJ, CE, TSS, TES | ||
| Mandalorion | STAR | 7 | B1a cells | Mouse | Smart-Seq | ONT MinION, 2D | CE, RI, A5SS, A3SS, TSS, TES | ||
| Mandalorion | STAR | 12 | OHCs | Mouse | Smart-Seq | ONT MinION, 1D | CE | ||
| Mandalorion | Minimap2 | 96 | B cells | Human | R2C2 | ONT MinION, 1D | CE, RI, TSS, TES | ||
| IgBLAST, BLASTN | Minimap2 | 6027 | Lymph node cells | Human | Droplet-based (10x Genomics) | ONT MinION, 1D | CE, RI, A5SS, A3SS, TSS, TES | ||
A3SS: Alternative 3′ splice site; A5SS: Alternative 5′ splice site; ABI: Applied Biosystems; BMDCs: Bone-marrow-derived dendritic cells; CE: Cassette exon; ESCs: Embryonic stem cells; ICM: Inner cell mass; IPSA: Integrative Pipeline for Splicing Analyses; iPSCs: Induced pluripotent stem cells; ISOP: ISOform-Patterns; MIRALCS: Microwell full-length mRNA amplification and library construction system; MISO: Mixture of Isoforms; MN: Motor neurons; MXE: Mutually exclusive exons; NPC: Neural progenitor cells; OHC: Outer hair cells; ONT: Oxford Nanopore Technology; PacBio SMRT: Pacific Biosciences Single Molecule Real Time; PE: Paired-end; R2C2: Rolling Circle Amplification to Concatemeric Consensus; RI: Retained-intron; scM&T-seq: Single-cell methylation and transcriptome sequencing; SE: Single-end; SJ: Splice junction; STRT: Single-cell tagged reverse transcription; TES: Transcription end site; TSS: Transcription start site; VLMCs: Vascular and leptomeningeal cells.
Day 3 Oct4+Sox2+Nanog+, day 5 Oct4−−Sox2+Nanog+, day 5 Oct4−−Sox2−−Nanog−− outgrowth cells.
Fig. 1Unique insights gained through single-cell alternative splicing analysis using short- and long-read RNA-sequencing. For simplicity, two isoforms with one alternative spliced exon (blue and red cells) and one complex isoform with two alternative spliced exons (green cells) illustrated here. Top-left panel: Bulk short-read RNA-sequencing is unable to delineate cell of origin for alternative splicing event. Top-right panel: Single-cell short-read RNA-sequencing is able to delineate cell of origin for each alternative splicing event. With the exception of cells with coordinated alternative splicing event (green) where it will be inferred that there are two isolated alternative splicing events. Bottom-left panel: Bulk long-read RNA-sequencing is able to distinguish isolated and coordinated alternative splicing events but is unable to assign the events to the cell of origin. Bottom-right panel: Single-cell long-read RNA-sequencing is able to distinguish isolated and coordinated alternative splicing events as well as assign the events to the cell of origin. Solid black box represents constitutive exons. Blue and red boxes represent alternatively spliced exons. Solid lines connecting two exons represent no alternative splicing events (no exon-skipping). Dotted lines connecting two exons represent alternative splicing events (exon-skipping). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2The comparison of different visualization methods of alternative splicing events in 69 motor neurons (MNs) and 63 induced-pluripotent stem cells (iPSCs) from Song et al. using short-read RNA-sequencing data [43]. (A-C) Mutually exclusive exon 9 and 10 of PKM gene. Alternative splicing event validated using smFISH previously. iPSCs almost exclusively express exon 10 while MNs predominantly express exon 9. (A) IGV display of coverage distribution of mutually exclusive exon 9 and 10 together with flanking constitutive exons from 3 MN (red) and 2 iPSC (blue) bulk samples show inconsistency in relative coverage for exon 9 and 10 across MN samples. Specifically, MN sample 1 and 2 show higher exon 9 coverage whereas MN sample 3 show higher exon 10 coverage. (B) IGV display of coverage distribution of mutually exclusive exon 9 and 10 together with flanking constitutive exons from 5 MN (red) and 5 iPSC (blue) representative cells show inconsistency in relative coverage for exon 9 and 10 across MN cells. Specifically, MN sample 1, 3, and 5 show higher exon 9 expression, MN sample 4 show higher exon 10 expression, whereas MN sample 2 had no detectable coverage across both exon 9 and 10. (C) VALERIE display of PSI values for all MN (red) and iPSC (blue) cells in heatmap annotated with mutually exclusive exon 9 (orange) and 10 (yellow) and flanking constitutive exons (black and grey). MN exon 9 with higher PSI compared to iPSC where MN exon 10 with lower PSI compared to iPSC. Differences in PSI for mutually exclusive exon 9 and 10 between MN and iPSC cell groups are statistically significant. On the other hand, there is no statistical difference in PSI values of both constitutive exons between MN and iPSC cell groups. (D-F) Exon 6 skipping of RPS24 gene. Alternative splicing event validated using sc-qPCR. MNs express higher levels of exon 6 compared to iPSCs. (D) IGV display of coverage distribution of alternative spliced exon 6 together with flanking constitutive exons from 3 MN (red) and 2 iPSC (blue) bulk samples show consistently higher coverage of exon 6 in MN compared to iPSC. (E) IGV display of coverage distribution of alternative spliced exon 6 together with flanking constitutive exons from 5 MN (red) and 5 iPSC (blue) representative cells show consistently higher coverage of exon 6 in MN compared to iPSC. (F) VALERIE display of PSI values for all MN (red) and iPSC (blue) cells in heatmap annotated with alternatively spliced exon 6 (orange) and flanking constitutive exons (black and grey). MN exon 6 with higher PSI compared to iPSC. Differences in PSI for alternatively spliced exon 6 between MN and iPSC cell groups are statistically significant. On the other hand, there is no statistical difference in PSI values for both constitutive exons between MN and iPSC cell groups. VALERIE standardizes the display of base position in 5′-to-3′ direction, focuses on informative exonic regions by excluding long intronic sequences with no splicing events, and displays PSI values rather than coverage information. Two-sided t-test used as statistical test for comparing PSI values at each genomic coordinate. IGV: Integrative Genome Browser. VALERIE: Visualizing alternative splicing events in single-cell ribonucleic acid (RNA)-sequencing experiments. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)