| Literature DB >> 33184288 |
Sonia Brockway1,2,3,4, Geng Wang1,2,3, Jasen M Jackson1,2,3, David R Amici1,2,3,5, Seesha R Takagishi1,2,3, Matthew R Clutter3,6,7, Elizabeth T Bartom1,2, Marc L Mendillo8,9,10.
Abstract
Chemical-genetic interaction profiling in model organisms has proven powerful in providing insights into compound mechanism of action and gene function. However, identifying chemical-genetic interactions in mammalian systems has been limited to low-throughput or computational methods. Here, we develop Quantitative and Multiplexed Analysis of Phenotype by Sequencing (QMAP-Seq), which leverages next-generation sequencing for pooled high-throughput chemical-genetic profiling. We apply QMAP-Seq to investigate how cellular stress response factors affect therapeutic response in cancer. Using minimal automation, we treat pools of 60 cell types-comprising 12 genetic perturbations in five cell lines-with 1440 compound-dose combinations, generating 86,400 chemical-genetic measurements. QMAP-Seq produces precise and accurate quantitative measures of acute drug response comparable to gold standard assays, but with increased throughput at lower cost. Moreover, QMAP-Seq reveals clinically actionable drug vulnerabilities and functional relationships involving these stress response factors, many of which are activated in cancer. Thus, QMAP-Seq provides a broadly accessible and scalable strategy for chemical-genetic profiling in mammalian cells.Entities:
Mesh:
Year: 2020 PMID: 33184288 PMCID: PMC7661543 DOI: 10.1038/s41467-020-19553-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1QMAP-Seq generates precise and accurate quantitative measures of drug response.
a Experimental workflow for QMAP-Seq with one cell line. b Schematic of QMAP-Seq library preparation using unique sets of i5/i7 indexed primers followed by next-generation sequencing of amplicons. c Standard curves generated from five uniquely barcoded 293T cell spike-in standards introduced at known cell numbers. Data are represented as mean number of sequencing reads across the six or eight DMSO samples on a plate ± standard deviation. d Scatterplot of interpolated cell number for two biologically independent replicates. Statistical significance of Pearson correlation was determined using a two-tailed test (n = 288 compound-dose combinations). e Live-cell imaging of MDA-MB-231 sgPool cells 72 h after treatment with YM155 or Ganetespib. Images are representative of two biologically independent replicates. Scale bar = 100 μm. Source data are provided as a Source data file. f Top: Dose–response curves for MDA-MB-231 sgPool cells as measured using live-cell imaging 72 h after treatment with YM155 or Ganetespib. Bottom: Dose–response curves for 12 genetic perturbations of MDA-MB-231 cells as measured using QMAP-Seq 72 h after treatment with YM155 or Ganetespib. Each data point represents one of two biologically independent replicates. The shaded region indicates the area under the curve (AUC) for sgPool. g Scatterplot of the dose–response curve AUC for sgPool as determined using live-cell imaging versus QMAP-Seq. Statistical significance of Pearson correlation was determined using a two-tailed test (n = 89 compounds). Source data are available in the Source data file.
Fig. 2Expanding QMAP-Seq to multiple cell lines.
a Experimental workflow for QMAP-Seq with five cell lines. b Pie chart showing breakdown of 180 compounds by stage of development. c Pie chart showing breakdown of 180 compounds by pathway. d Schematic of competition experiment to optimize the starting representation of the five co-cultured breast cancer cell lines. Original cell line pools were prepared by mixing equal numbers of one cell line expressing ZsGreen with each of the other four cell lines expressing dTomato. Flow cytometry analysis measured the percentage of GFP positive cells in each pool over time. e Heat maps displaying the percentage of GFP positive cells at various time points as measured using flow cytometry. Left: Original cell line pools that started with 20% of each cell line on Day 0. Right: Optimized cell line pools predicted to contain 20% of each cell line on Day 7. Source data and gating strategy are provided as a Source data file. f Representation of five breast cancer cell lines as measured by counting the number of sequencing reads for each cell line barcode across the 96 DMSO samples. g Representation of sgRNAs as measured by counting the number of sequencing reads for each sgRNA barcode relative to the total number of sequencing reads for that cell line across the 96 DMSO samples. h Standard deviation of the interpolated cell number for each cell line-sgRNA pair across the 96 DMSO samples. Dotted line indicates threshold for excluding cell line-sgRNA pairs with high variability. Source data are available in the Source data file.
Fig. 3Identification and validation of cell line–gene–drug interactions.
a Heat maps displaying the relative cell number for each cell line-sgRNA pair 72 h after treatment with 4-Hydroxytamoxifen (4-OHT), Lapatinib, or YM155 as measured using QMAP-Seq. Data are represented as mean of two biologically independent replicates. Asterisks denote positive controls. b Scatterplot of the dose–response curve AUC as determined using QMAP-Seq with one cell line versus QMAP-Seq with multiple cell lines. Common compounds for MDA-MB-231 cells are shown. Statistical significance of Pearson correlation was determined using a two-tailed test (n = 220 compound-sgRNA combinations). c Volcano plot depicting cell line–gene–drug interactions. Magnitude was determined by calculating the difference in mean AUC between sgRNA and sgNT for every cell line-compound combination. Statistical significance was determined using an unpaired, two-tailed t test (n = 2 biologically independent replicates). Blue dots indicate interactions where the knockout confers greater sensitivity than sgNT. Red dots indicate interactions where the knockout confers greater resistance than sgNT. d Pathways targeted by compounds involved in the top 60 sensitivity interactions (blue circles) or top 124 resistance interactions (red circles) that were significantly enriched compared to expected pathway representation. Statistical significance of pathway enrichment was determined using a one-tailed binomial test to compare observed distribution with expected distribution (n = 180 compounds). e List of the top 12 conditions that confer compound sensitivity or resistance. f Top: Dose–response curves for four of the top chemical–genetic interactions as measured using QMAP-Seq. Each data point represents one of two biologically independent replicates. Bottom: Dose–response curves as measured using Resazurin Cell Viability Kit. Each data point represents one of three biologically independent replicates. For clarity, individual proteostasis factor knockout curves are partioned across four panels; sgNT is same in all cases.
Fig. 4QMAP-Seq enables proteostasis network mapping in breast cancer.
a Chemical–genetic interaction map displaying the highest confidence interactions identified by QMAP-Seq (absolute AUC difference >60 and P < 0.05). Genes are represented as oval nodes, and compounds are represented as rectangular nodes. Chemical–genetic interactions where the knockout confers compound sensitivity (i.e., synthetic lethal interactions) are represented as blue edges. Chemical–genetic interactions where the knockout confers compound resistance (i.e., synthetic rescue interactions) are represented as red edges. b Spearman correlation matrix of gene–gene correlations based on AUC differences across all cell lines and compound contexts (n = 488 cell line-compound combinations). c Distribution of gene–gene Spearman correlations across all cell line and compound contexts. Known and previously unknown genetic relationships are labeled.