| Literature DB >> 30256981 |
Navpreet Ranu1, Alexandra-Chloé Villani2,3,4, Nir Hacohen2,3,4, Paul C Blainey1,2.
Abstract
Transcriptional profiling of thousands of single cells in parallel by RNA-seq is now routine. However, due to reliance on pooled library preparation, targeting analysis to particular cells of interest is difficult. Here, we present a multiplexed PCR method for targeted sequencing of select cells from pooled single-cell sequence libraries. We demonstrated this molecular enrichment method on multiple cell types within pooled single-cell RNA-seq libraries produced from primary human blood cells. We show how molecular enrichment can be combined with FACS to efficiently target ultra-rare cell types, such as the recently identified AXL+SIGLEC6+ dendritic cell (AS DC) subset, in order to reduce the required sequencing effort to profile single cells by 100-fold. Our results demonstrate that DNA barcodes identifying cells within pooled sequencing libraries can be used as targets to enrich for specific molecules of interest, for example reads from a set of target cells.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30256981 PMCID: PMC6326790 DOI: 10.1093/nar/gky856
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Targeted enrichment of single cells within a pooled RNA sequence library. (A) Workflow showing enrichment based on single cell barcodes on the 5′ end of sequence library molecules. Target cells (barcodes) of interest are identified based on shallow sequencing of the original pooled library. PCR with barcode-specific primers is used to create a new sequence library enriched for reads from the target cells. (B) Example enrichment plot for a single target cell from a multiplex-enrichment reaction. The original library was deeply sequenced as a control to identify gene expression profiles in the target cell. Enrichment fold is the fold-difference in overall sequencing effort to detect 50% of the maximum detectable number of genes. (C) Distribution of enrichment-fold values for 65 targeted cells amplified in multiplex PCR enrichments. (D) the pairwise correlation of gene expression profiles before and after PCR enrichment for CD19+ cells (top) and HLA-DR+ cells (bottom) libraries. The upper dashed line and shaded region in each plot represent the mean ± two standard deviations of Bootstrap replicates of the original gene expression profiles against themselves (which represents the best correlation achievable given the read sampling, UMI sampling, and distribution of expression levels across genes in these specific cells). Red points show the correlation for targeted cells (post-enrichment profiles versus pre-enrichment profiles for the same cell). Gray box plots show distribution of correlation coefficients for control (non-target) cells existing in the library (post-enrichment profiles of the subject control cell versus pre-enrichment profiles of all cells). The dotted line shows the mean correlation for the cell barcodes that had at least 6 mismatches at the 3′ end. Control comparisons are shown as a function of the number of mismatches (Hamming distance) between the six most 3′ base pairs of the 16-base pair subject control cell barcode and the six most 3′ base pairs of the 16-base pair barcode of compared targeted cells.
Figure 2.Single-cell expression profile before and after enrichment. Reduced dimensionality representations of 19 cells from CD19+ cells (A) and 46 cells from HLA-DR+ cells (B) showing the position of targeted cells based on the expression profiles from the original deep sequenced library (closed circles) and the enriched library (open circles), where each color represents one cell/barcode. The gray data points show all cells within the two original deeply sequenced libraries and make visible the major clusters of cells with related expression profiles. Principal components analysis (PCA) and t-SNE, a nonlinear dimensionality reduction approach, are used to represent the high-dimensional datasets with corresponding color schemes. (C) AS dendritic cell signature analysis. Bar plots at top show the fraction of cells with at least one UMI count for the corresponding gene (number of cells above the dashed gray line in bottom panel, with black for all non-target cells, and red for target cells in the original library, middle bar, and red for target cells in the enriched library, right bar). Bottom panel: the expression of the classifier genes for all non-target cells in the original library (black points, left group for each gene), target cells in the original library (colored points, middle group), and enriched target cells (colored points, right). The same color is used for each targeted cell across the different classifier genes to facilitate comparison. The total number of cells in the HLA-DR+ cells library was 2397 cells and nine putative AS DC cells were targeted for enrichment.