| Literature DB >> 28679620 |
Lili Wang1,2, Jean Fan3, Joshua M Francis1,4, George Georghiou5, Sarah Hergert1, Shuqiang Li1,4, Rutendo Gambe1, Chensheng W Zhou1,6, Chunxiao Yang7, Sheng Xiao2,8, Paola Dal Cin2,8, Michaela Bowden1,6, Dylan Kotliar7, Sachet A Shukla1, Jennifer R Brown1,2,9, Donna Neuberg10, Dario R Alessi5, Cheng-Zhong Zhang1,3,4,10, Peter V Kharchenko3, Kenneth J Livak11, Catherine J Wu1,2,4,9.
Abstract
Intra-tumoral genetic heterogeneity has been characterized across cancers by genome sequencing of bulk tumors, including chronic lymphocytic leukemia (CLL). In order to more accurately identify subclones, define phylogenetic relationships, and probe genotype-phenotype relationships, we developed methods for targeted mutation detection in DNA and RNA isolated from thousands of single cells from five CLL samples. By clearly resolving phylogenic relationships, we uncovered mutated LCP1 and WNK1 as novel CLL drivers, supported by functional evidence demonstrating their impact on CLL pathways. Integrative analysis of somatic mutations with transcriptional states prompts the idea that convergent evolution generates phenotypically similar cells in distinct genetic branches, thus creating a cohesive expression profile in each CLL sample despite the presence of genetic heterogeneity. Our study highlights the potential for single-cell RNA-based targeted analysis to sensitively determine transcriptional and mutational profiles of individual cancer cells, leading to increased understanding of driving events in malignancy.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28679620 PMCID: PMC5538547 DOI: 10.1101/gr.217331.116
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Detection of somatic alterations and gene expression patterns in single CLL cells. (A) Workflow of DNA and RNA analysis at the single-cell level. Viable leukemia cells from CLL patients were flow sorted either into 96-well plates or processed initially as bulk populations. DNA (left) and RNA (right) plate-based approaches were used for phylogeny reconstruction and integration of genotype and phenotype, respectively. Bulk cells (middle) were applied to C1 integrated fluidic circuits (IFCs) for single-cell capture and cDNA library generation (see Methods). Sequencing libraries were generated and sequenced on an Illumina HiSeq system. (B) Number of single-cell DNA-based detection assays designed (top) and the cancer cell fraction (CCF) of all the alterations (bottom panel) for five CLL samples. Each point is an alteration with specific alterations indicated by colors as noted. (C) Correlation between mutation and chromosomal abnormalities detected by single-cell DNA analysis and CCF inferred from bulk tumor whole-exome sequencing (WES).
Patient characteristics of CLL samples
Figure 2.Reconstruction of tumor phylogeny in CLL from single-cell DNA analysis. (A–C) Detection and clustering of somatic mutations and chromosomal deletions (rows) for single viable CD19+CD5+ cells (columns) from CLL003 (A), CLL146 (B), CLL005 (C; for analysis of CLL096 and CLL032, see Supplemental Fig. S4A,B). (Left) Blue indicates the presence of mutations (sSNVs) or chromosomal deletion; beige, the absence of sSNVs; red, the absence of chromosomal deletion. (Middle) Clonal architecture for CLL003, CLL146, and CLL005 derived from single-cell DNA analysis. (Right) FISH hybridization (A,B) and karyotyping images (C) as validation of the single-cell chromosomal deletion analysis with percentage of positive FISH cells enumerated from 100 cells. Sensitivity of probes used in the study was confirmed by hybridization with PBMC from a normal donor (Supplemental Fig. S4C).
Figure 3.CLL transcriptional heterogeneity revealed by single-cell transcriptome sequencing. Pathway and gene set overdispersion analysis (PAGODA) was used to identify transcriptionally defined subpopulations for CLL005 (A) and CLL146 (B) (for analysis of CLL096 and CLL032, see Supplemental Fig. S6A,B). Based on gene sets defined by MSigDB annotations, significantly overdispersed pathways group cells into coherent and distinct aspects of transcriptional heterogeneity. Aspect scores (Cell PC score) are oriented so that high values generally correspond to increased expression of associated gene sets. Also shown are expression patterns of select genes driving each aspect of transcriptional heterogeneity along with their loading contributions to the aspect scores (left). Mutation information inferred from single-cell transcriptome sequencing is also shown at the bottom. CCF for each mutation derived from bulk tumor WES and the number of reads (log10 transformed) from single-cell transcriptome sequencing are also indicated.
Figure 4.Establishment of a targeted RNA-based approach to perform integrated, targeted, and multiplexed detection of somatic mutations and gene expression in single cells. (A) Number of genes detected from 96 genes using cell numbers ranging from one to 40 cells per well by the targeted approach illustrated in Figure 1A, right panel. (B) Correlation between gene expression derived from a bulk RNA-seq and single-cell targeted approach in CLL005 (analyses of other individual CLL samples and in aggregates in Supplemental Fig. S7). (C) Gene expression of a set of 96 genes in single cells distinguishes normal (CD19+) and CLL-B cells by principal component analysis (PCA). Single cells were derived from one normal healthy donor and two CLL patients. (D, top) SuperSelective primer design (Vargas et al. 2016) was used for mutation detection. The primer contains a long 5′-anchor sequence that binds strongly to template strands, a short 3′-foot sequence that includes an interrogating nucleotide complementary to the corresponding nucleotide in a mutant template (but mismatches the corresponding nucleotide in a wild-type template), and a linking bridge sequence. (Bottom) Schema of a successful targeted mutation detection assay, in which distinct paired assays (wild-type and mutant allele) were designed for detection of a single mutation. (E) Heat map of seven patient-specific mutation detection assays (with seven matched wild-type assays) performed on cDNA derived from 20 single cells from normal CD19+ B cells or two CLL-B cells, with detection measured as the number of cycles to achieve the detection threshold (Cq). (F) Examples for mutation calls in single cells with or without mutations LCP1 and WNK1. Cells were called wild-type (WT; blue), mutant (red), or unclear (gray).
Figure 5.Integrated analysis of DNA- and RNA-level information by simultaneous detection of mutation, deletion, and gene expression in single cells. (A) Schema of deletion call based on expression of heterozygous SNPs. Dashed line indicates deleted region. A deletion was called when a cell expresses only one allele in the deleted region while expressing similar levels of both alleles for SNPs outside the deleted region. (B) Examples for deletion calls in CLL005 single cells for subclonal deletions in Chromosomes 11 and 13. Each point represents a cell. Orange indicates cells with no detectable expression from the deleted allele are inferred to harbor the deletion; blue, cells with expression from both alleles or the deleted allele are inferred to lack the deletion. (C) Mutation and chromosomal deletion frequency detected from single-cell RNA from five CLL samples show high correlation with CCF (r2 = 0.921) and with mutation frequency per single-cell DNA analysis (r2 = 0.923). (D) Mutation and chromosomal deletion detection in single-cell RNA from sample CLL005 enables reconstruction of phylogeny. (E,F) Genes within the chromosomal deletion regions exhibit significantly lower expression based on a one-sided Wilcoxon rank-sum test in cells inferred to harbor the deletions (cluster 1; orange) compared with cells not harboring the deletions (cluster 2; blue). Housekeeping genes are not significantly differentially expressed among single cells from the two clusters. (G) PCA of single cells from CLL005 or CLL146 based on gene expression for genes within chromosomal deletion regions leads to separation of cells by genetic subpopulation. Linear discriminant analysis achieves elevated ROC AUCs of 0.774 for CLL005 and 0.771 for CLL146. (H) PCA of single cells from CLL005 or CLL146 based on gene expression for genes driving aspects of transcriptional heterogeneity (23 genes for CLL005, 33 for CLL146) fails to separate cells by genetic subpopulation. Linear discriminant analysis does not perform substantially better than random, achieving ROC AUC of 0.516 for CLL005 and 0.595 for CLL146.
Figure 6.Mutations in LCP1 and WNK1 increase cell fitness and favor cell growth and survival. (A) Schema of the nonsense mutation in LCP1 found in CLL005. (B) LCP1-mutant cells have altered DNA response upon gamma-irradiation. HEK293 cells stably expressing wild-type, mutant, or control constructs were treated with 10 Gy. Levels of gamma-H2AX and phosphorylated forms of ATM, ATR, DNA-PK, and GAPDH were assessed using immunoblot. (C) Both wild-type and mutant LCP1 physically interact with ATM protein. HEK293 cells stably expressing wild-type, mutant, or control constructs were transiently transfected with an ATM-expressing construct (with N-terminal His tag). Forty-eight hours after transfection, cells were either treated or not treated with irradiation. Immunoprecipitations were performed on protein lysates using anti-His beads or anti-GFP as well as isotype control antibodies followed by immunoblot against GFP or ATM protein. Shown are immunoprecipitates from untreated cells. Irradiation does not change the physical association between these proteins (Supplemental Fig. S8). (D) Cells with mutant LCP1 have greater overall survival after gamma-irradiation compared with cells with wild-type LCP1 with or without ATM inhibitor treatment. The survival rate was calculated by normalization to each group of nonirradiated cells using colony assays performed on six-well plates. Mean ± SD; n = 3. (E) Crystal structure of WNK1 kinase domain (PDB entry 4Q2A). Val403 is located underneath the activation loop of WNK1. Mutation of Val403 to Phe potentially disrupts activation loop folding and affects kinase activity. (F) Cells with mutant WNK1 demonstrate a faster progression from G1 to S phase. HEK293 cells were induced to express either wild-type or mutant WNK1 for 48 h. After synchronization by serum deprivation for 24 h, cell cycle was assessed by EdU assay 24 h after return to complete media. The mean percentage of cells (±SD; n = 3) in different phases of the cell cycle is shown.