| Literature DB >> 35637419 |
Vincent Hahaut1,2, Dinko Pavlinic1,2, Walter Carbone3, Sven Schuierer3, Pierre Balmer1,2, Mathieu Quinodoz1,2, Magdalena Renner1,2, Guglielmo Roma3, Cameron S Cowan1,2, Simone Picelli4,5.
Abstract
We present FLASH-seq (FS), a full-length single-cell RNA sequencing (scRNA-seq) method with increased sensitivity and reduced hands-on time compared to Smart-seq3. The entire FS protocol can be performed in ~4.5 hours, is simple to automate and can be easily miniaturized to decrease resource consumption. The FS protocol can also use unique molecular identifiers (UMIs) for molecule counting while displaying reduced strand-invasion artifacts. FS will be especially useful for characterizing gene expression at high resolution across multiple samples.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35637419 PMCID: PMC9546769 DOI: 10.1038/s41587-022-01312-3
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 68.164
Fig. 1Overview of the FS and FS low-amplification (FS-LA) protocols.
a, Estimated protocol duration to process a 96-well plate of HEK293T cells for the full-length scRNA-seq protocols used in this study. Steps are color-coded. QCs include concentration and size distribution measurements. SSsc, SMART-Seq Single Cell Kit (Takara). b, Number of genes detected in HEK293T cells processed with SS2 (n = 80), SS3 (n = 51) and FS (n = 105) at two read thresholds, with reads downsampled to 500,000 (= 500K) raw reads. c, Mean ± standard deviation (s.d.) gene-body coverage in HEK293T cells. d, Cell-to-cell Kendall’s tau correlations among gene expression in SS2 (n = 80), SS3 (n = 51) and FS (n = 105) using only genes expressed in all three methods (ngenes = 20,042). e, FS-LA workflow. The number of required PCR cycles is a function of the cell RNA content. f, Number of genes detected in HEK293T cells processed with FS (nFS-19c = 56) or FS-LA (nFS-LA-12c = 31, nFS-LA-10c = 32, nFS-LA-8c = 31, nFS-LA-6c = 24, nFS-LA-4c = 32) using 250 K downsampled raw reads. Gene detection threshold was set to >0 or >5 reads. g, Top panel shows the percentage of read tags mapped to exonic (= CDS exons), intronic or intergenic features in HEK293T cells processed with FS or FS-LA, measured using ReSQC. Bottom panel shows mapping statistics with the percentage of uniquely mapped, multimapped or unmapped reads for FS and FS-LA. h, Mean ± s.d. number of detected genes (>0 reads) per cell type in hPBMC samples. Only points supported by two or more cells are displayed. Some cells had insufficient coverage to be represented at each point. The difference between the number of genes in FS and FS-LA was evaluated at 125 K reads for each cell type using a Wilcoxon rank-sum test (two-sided, Bonferroni correction, adjusted P value). No statistically significant differences (NS) were observed (P > 0.05). MAIT, mucosal-associated invariant T cell; mono, monocytes; NK, natural killer; TCM, central memory T cell; TEM, effector memory T cell; gdT, gamma-delta T-cell; pDC, plasmacytoid dendritic cell; cDC2, conventional dendritic cell 2. A two-sided Dunn’s test was used, and Bonferroni corrected, adjusted P values are shown for b, d and f. Box plots in f and g show the median (center), 25th/75th percentile (lower/upper hinges), 1.5× interquartile range (whiskers) and outliers (points).
Fig. 2Strand-invasion events are mitigated in FS-UMI.
a, Strand-switching reaction occurs at the end of cDNA (top); strand-invasion events are increased when the UMI and riboguanosine sequences are adjacent (middle) but are mitigated by a spacer sequence (bottom). b, Relationship between the number of genes detected in HEK293T cells using UMI and internal reads at several downsampled sequencing depths (nSS3 = 85, nSS3_Hagemann-J. = 101, nFS-CTAAC_STRT-dT = 76, nFS-CAGCA_STRT-dT = 16). Blue line denotes the mean number of genes detected in FS (nFS-5μl = 85, 75 K reads). Purple/pink lines denote the mean number of genes detected in SS3 (Hagemann-Jensen et al.[5], UMI = pink, internal = purple, 75K reads). Numbers of genes detected at 75 K reads were compared between cells from[5] and the other conditions for both read types (two-sided Wilcoxon rank-sum test, Bonferroni adjusted P value, colored by read type). c, Nucleotide distribution in the 6-bp adjacent to the read start using 500 K randomly selected UMIreads (HEK293T). d, Mean ± s.d. deduplicated UMI reads (%) harboring a match between the UMI and the upstream sequence within 20 bp, with 0 to 3consecutive 5′ mismatches (nSS3 = 2,089,581, nSS3_Hagemann-J. = 13,511,157, nFS-UMI-TSO_STRT-dT = 1,404,599, nFS-CTAAC_STRT-dT = 9,964,516, nFS-CAGCA_STRT-dT = 1,189,676,UMI reads). Colored by oligo-dT/TSO combination. e, Retinal organoid and experimental design. Retinal and nonretinal parts are highlighted. f, Genes detected (>0 reads) in representative cell types using 10x Genomics and FS-UMI (UMI, internal or both reads). Dunn’s test (two-sided, Bonferroni adjusted P value). Bipolar ON, ON-center bipolar cells; bipolar OFF, OFF-center bipolar cells. g, Gene diversity. Defined by resampling ten cells per cell type 100 times and calculating the number of genes expressed with >0 reads (both UMI/internal reads) in more than two cells. h, Mean ± s.d. detected SNPs at various downsampled read depths (nFS_retinal_organoids = 1,281). For each cell, exome-sequencing SNPs in transcribed regions (sequencing depth (= DP) > 2) as are used as reference. Totals SNPs, detected in FS-UMI (green, DP > 2) or in the reference (pink). True positive SNPs, detected in FS-UMI and the reference (orange, DP > 2; purple, DP > 2 & variant quality (= QUAL) > 20). False negative SNPs, reference SNPs not detected in FS-UMI (dark green). False positive SNPs, detected in FS-UMI but not present in the reference (yellow, DP > 2; brown, DP > 2 & QUAL > 20). True positive SNPs (%), percentage of true-positive SNPs detected among the reference SNPs. Box plots in f and g show the median (center), 25th/75th percentile (lower/upper hinges), 1.5× interquartile range (whiskers) and outliers (points).