| Literature DB >> 30050112 |
Johannes W Bagnoli1, Christoph Ziegenhain1,2, Aleksandar Janjic1, Lucas E Wange1, Beate Vieth1, Swati Parekh1,3, Johanna Geuder1, Ines Hellmann1, Wolfgang Enard4.
Abstract
Single-cell RNA sequencing (scRNA-seq) has emerged as a central genome-wide method to characterize cellular identities and processes. Consequently, improving its sensitivity, flexibility, and cost-efficiency can advance many research questions. Among the flexible plate-based methods, single-cell RNA barcoding and sequencing (SCRB-seq) is highly sensitive and efficient. Here, we systematically evaluate experimental conditions of this protocol and find that adding polyethylene glycol considerably increases sensitivity by enhancing cDNA synthesis. Furthermore, using Terra polymerase increases efficiency due to a more even cDNA amplification that requires less sequencing of libraries. We combined these and other improvements to develop a scRNA-seq library protocol we call molecular crowding SCRB-seq (mcSCRB-seq), which we show to be one of the most sensitive, efficient, and flexible scRNA-seq methods to date.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30050112 PMCID: PMC6062574 DOI: 10.1038/s41467-018-05347-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1mcSCRB-seq workflow and the effect of molecular crowding. a Overview of the mcSCRB-seq protocol workflow. Single cells are isolated via FACS in multiwell plates containing lysis buffer, barcoded oligo-dT primers, and Proteinase K. Reverse transcription and template switching are carried out in the presence of 7.5% PEG 8000 to induce molecular crowding conditions. After pooling the barcoded cDNA with magnetic SPRI beads, PCR amplification using Terra polymerase is performed. b cDNA yield dependent on the absence (gray) or presence (blue) of 7.5% PEG 8000 during reverse transcription and template switching. Shown are three independent reactions for each input concentration of total standardized RNA (UHRR) and the resulting linear model fit. c Number of genes detected (>=1 exonic read) per replicate in RNA-seq libraries, generated from 10 pg of UHRR using four protocol variants (see Supplementary Table 1) at a sequencing depth of one million raw reads. Each dot represents a replicate (n = 8) and each box represents the median and first and third quartiles per method with the whiskers indicating the most extreme data point, which is no more than 1.5 times the length of the box away from the box
Fig. 2Comparison of mcSCRB-seq to SCRB-seq and other protocols. a Number of UMIs detected in libraries generated from 249 single mESCs using SCRB-seq or mcSCRB-seq when downsampled to different numbers of raw sequence reads. Each box represents the median and first and third quartiles per cell, sequencing depth and method. Whiskers indicate the most extreme data point that is no more than 1.5 times the length of the box away from the box. b The true positive rate of mcSCRB-seq and SCRB-seq estimated by power simulations using the powsimR package[22]. The empirical mean–variance distribution of the 10,904 genes that were detected in at least 10 cells in either mcSCRB-seq or SCRB-seq (500,000 reads) was used to simulate read counts when 10% of the genes are differentially expressed. Boxplots represent the median and first and third quartiles of 25 simulations with whiskers indicating the most extreme data point hat is no more than 1.5 times the length of the box away from the box. The dashed line indicates a true positive rate of 0.8. The matching plot for the false discovery rate is shown in Supplementary Fig. 11d. c Sensitivity of mcSCRB-seq and other protocols, calculated as the number of ERCC molecules needed to reach a 50% detection probability as calculated in Svensson et al[5]. Per-cell distributions are shown using violin plots with vertical lines and numbers indicating the median per protocol
Fig. 3mcSCRB-seq distinguishes cell types of peripheral blood mononuclear cells. a PBMCs were obtained from a healthy male donor and FACS sorted into four 96-well plates. Using the mcSCRB-seq protocol, sequencing libraries were generated. b tSNE projection of PBMC cells (n = 349) that were grouped into five clusters using the Seurat package[24]. Colors denote cluster identity. c tSNE projection of PBMC cells (n = 349) where each cell is colored according to its expression level of various marker genes for the indicated cell types. Expression levels were log-normalized using the Seurat package. d Marker gene expression from c was summarized as the mean log-normalized expression level per cell. B-cell markers: CD79A, CD74, MS4A1, HLA-DRA; Monocyte markers: LYZ, PSAP, FCN1, CD14, FCGR3A; NK-cell markers: GNLY, NKG7, GZMA, GZMB; T-cell markers: CD3E, CD3D, TRAC, CCR7