| Literature DB >> 35474898 |
Christian Kramme1,2, Alexandru M Plesa1,2, Helen H Wang1,2, Bennett Wolf1,2, Merrick Pierson Smela1,2, Xiaoge Guo1,2, Richie E Kohman1,2, Pranam Chatterjee1,2, George M Church1,2.
Abstract
With the recent advancements in genome editing, next-generation sequencing (NGS), and scalable cloning techniques, scientists can now conduct genetic screens at unprecedented levels of scale and precision. With such a multitude of technologies, there is a need for a simple yet comprehensive pipeline to enable systematic mammalian genetic screening. In this study, we develop unique algorithms for target identification and a toxin-less Gateway cloning tool, termed MegaGate, for library cloning which, when combined with existing genetic perturbation methods and NGS-coupled readouts, enable versatile engineering of relevant mammalian cell lines. Our integrated pipeline for sequencing-based target ascertainment and modular perturbation screening (STAMPScreen) can thus be utilized for a host of cell state engineering applications.Entities:
Keywords: CRISPR; gene regulatory networks; genetic screening; stem cells; transcription factors
Year: 2021 PMID: 35474898 PMCID: PMC9017118 DOI: 10.1016/j.crmeth.2021.100082
Source DB: PubMed Journal: Cell Rep Methods ISSN: 2667-2375
Figure 1STAMPScreen schematic workflow
Schematic representation of the STAMPScreen pipeline, highlighting in silico target ascertainment, screening tool selection, library cloning, and NGS-coupled screening readout. STAMPScreen generates data that feed into iterative cycles of the workflow.
Figure 2In silico target ascertainment
(A) DEG network target identification pipeline. DGEA was used to determine significant changes between the starting and target cell state. Using publicly available protein interaction networks, DEGs are scored based on their connectivity and differential expression levels.
(B) Validation of the DEG network analysis method applied to the fibroblast aging phenotype. The list of all DEGs and top 100 ranked lists based on common metrics and our network score were tested for enrichment of known aging genes from the GenAge database.
(C) Graph theory-based TF discovery pipeline. Gene regulatory networks are inferred from time-series RNA sequencing data. PageRank with a standard residual probability of 0.85 was utilized to rank TFs by centrality score.
(D) Prediction of central TFs in known differentiation protocols using graph theory-based TF discovery pipeline. Experimentally validated TFs are indicated, demonstrating predictive capability of the pipeline.
Figure 3Gene perturbation tool evaluation
Systematic comparison of CRISPRa, CRISPRi, and cDNA function were performed in the hiPSC line PGP1. Analysis was performed using qRT-PCR with duplicates and the ΔΔCq method was utilized to determine relative expression to a no plasmid control duplicate. For CRISPRa and cDNA (A and C), cells were harvested 48 h post-transfection; for CRISPRi (B), cells were harvested at 72 h post-transfection. Induction using dCas9-VPR, SAM, and SunTag was performed on 47 gene targets in hiPSCs, cDNA induction was performed on 17 genes targets, and CRISPRi repression was performed on 12 gene targets. Significance by Mann-Whitney test. The pink line shows median values, dashed black lines are 95 CI. For systematic comparison of cDNA induction using the PB-TA-ERP2, XLone, cT3G, cERP2, and cERP2-cT3G vectors expressing sfGFP (D). Cells were harvested for flow cytometry after 24 h of doxycycline induction and screens were performed in duplicate with and without doxycycline. Geometric mean fluorescent intensity of all live singlet cells was plotted for each condition. For demonstration of dual expression vectors (E) a constitutive mCherry cDNA was expressed along with a sgRNA targeting a Tet promoter. This vector was co-nucleofected with dCas9-KRAB-MeCP2 into an hiPSC line harboring a Tet-sfGFP under induction. Cells were harvested 96 h later for flow cytometry. Relative expression was calculated as MFI compared with no plasmid control. The right panel shows a similar setup but with a sfGFP cDNA vector under Tet promoter and an sgRNA targeting an integrated tdTomato construct. The plasmid was co-nucleofected with dCas9-VPR and cells were harvested 48 h later for flow cytometry, relative expression was calculated as MFI compared with no plasmid control. Calculated p values are represented as follows: ∗∗∗p ≤ 0.001, ∗∗∗∗p ≤ 0.0001.
Figure 4MegaGate, a toxin-less cDNA cloning method
(A and B) (A) Schematic representation of the MegaGate cloning reaction. (B) Percent ORF capture (pink) measured as the ratio of input genes captured in expression vectors in a single MegaGate cloning reaction for single genes and pooled groups. Number of barcodes captured per gene (teal) for the single genes and pools was determined via NGS alignment of destination vector amplicons.
(C–E) Cloning efficiency as a function of ORF length. Cloning efficiency was measured as the relative abundance of the gene in the expression vector pool divided by the relative abundance of the gene in the pDONOR pool, as measured by NGS counts. Genes are arrayed by length on the x axis.
Figure 5NGS-coupled readouts
(A) PiggyBac integration copy number was measured using qRT-PCR on genomic DNA. Integrant-specific primers and a single copy gene RPP30 were used. A 2 × ΔCq measurement was used to determine copy number for n = 2 biological replicates. Error bars are standard deviation.
(B) Barcode enrichment analysis (Bar-seq) was performed on hiPSC pools with 54 barcoded gene insertions and barcoded GFP insertions. RNA was harvested after FACS 72 h post-induction, and barcodes were amplified for NGS. Relative abundance was measured as the relative read count of each gene barcode compared with the total read count of all barcodes.
(C) Two barcoded genes, GFP and ZGLP1, were integrated into hiPSCs and induced for 72 h in duplicate. RNA was harvested and RNA-seq was performed. DEseq2 was utilized to calculate log2fc and p values and an exact match kmer was used to identify the gene barcode.
(D) Forty barcoded genes were integrated into hiPSCs and induced for 3 days and RNA was harvested and converted to cDNA. A primer pool for all 40 genes was used to amplify the 40 targets for NGS as well as their barcodes in the DOX OFF and DOX ON pools. Relative gene expression was calculated as relative read counts normalized to GAPDH for each pool. Calculated p values are represented as follows: ∗∗∗∗p ≤ 0.0001.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| NEB 5-Alpha | New England Biolabs | C2987 |
| LR Clonase II | Thermo Fisher | 11791020 |
| BP Clonase II | Thermo Fisher | 11789020 |
| I-SceI (5U/ ul) | New England Biolabs | R0694 |
| I-CeuI (5U/ ul) | New England Biolabs | R0699 |
| 10X CutSmart Buffer | New England Biolabs | B7204 |
| T5 Exonuclease | New England Biolabs | M0663 |
| Gibson Assembly Master Mix | New England Biolabs | E2611 |
| BsaI-HFV2 | New England Biolabs | R3733 |
| BsmBI-V2 | New England Biolabs | R0739 |
| SapI | New England Biolabs | R0569 |
| ProNex Size-Selective Purification System | Promega | NG2001 |
| KAPA SYBR Fast universal 2X qPCR Master Mix | KAPA Biosystems | KK4601 |
| Powerup SYBR Green Master Mix | Applied Biosystems | A25741 |
| RNAeasy Plus Mini Kit | Qiagen | 74034 |
| DNAeasy blood and tissue lysis kit | Qiagen | 69504 |
| SuperScript IV First Strand Synthesis Kit | Invitrogen | 18091050 |
| KAPA RNA HyperPrep kit with Riboerase | KAPA Biosystems | KR1351 |
| Qubit dsDNA HS Assay Kit | Invitrogen | Q32851 |
| Q5 High Fidelity 2X Mastermix | New England Biolabs | M0492 |
| Qiagen Plasmid Plus Midi Kit | Qiagen | 12941 |
| QIAprep Spin Miniprep Kit | Qiagen | 27104 |
| LunaScript RT SuperMix Kit | New England Biolabs | E3010 |
| KAPA Unique Dual-Indexed Adapter Kit | KAPA Biosystems | KK8727 |
| Monarch DNA Gel Extraction Kit | New England Biolabs | T1020 |
| Raw MegaGate pooled cloning NGS data | This Paper | PRJNA753802 |
| Raw data from BAR-Seq and TAR-Seq NGS | This Paper | PRJNA753802 |
| Raw data from barcoded ZGLP1 and GFP RNA-Seq | This Paper | |
| Analyzed data for DEG network Aging genes benchmarking | ||
| Analyzed data for GRN network Neural stem cell bench marking | PRJNA596331 | |
| Analyzed data for GRN network Myoblasts bench marking | ||
| Analyzed data for GRN network Melanocyte bench marking | PRJNA492994 | |
| PGP1 hiPSC | Personal Genome Project | PGP1 hiPSCs |
| F3 hiPSC | ATCC | BXS0116 |
| M1 HDFn | ATCC | PCS-201-010 |
| M2 HDFa | ATCC | PCS-201-012 |
| Copy Number Assessment Primers ( | This Paper- | N/A |
| qPCR Primers for CRISPRa/I assessment, BAR-Seq and TAR-Seq ( | This Paper- | N/A |
| ORF-BC_Rev: TCTTATCATGTCTGGATCGCGG (For identifying gene-barcode pairs in | This Paper- | N/A |
| Custom Illumina I5 index primers (i501-i508) (For NGS indexing in | This Paper- | N/A |
| Custom Illumina I7 index primers (i701-i708) (For NGS indexing in | This Paper- | N/A |
| MegaDestination : PB-cT3G-ERP2-MegaGate | This Paper | Addgene Deposit 80028 |
| MegaDestination : PB-cT3G-cERP2-MegaGate | This Paper | Addgene Deposit 80028 |
| MegaDestination : PB-cT3G-cERP2-MegaGate-hU6 | This Paper | Addgene Deposit 80028 |
| MegaDestination : PB-cT3G-cERP2-MegaGate-IRES2-mTagBFP2 | This Paper | Addgene Deposit 80028 |
| MegaDestination : PB-cT3G-cERP2-MegaGate-IRES2-mCherry | This Paper | Addgene Deposit 80028 |
| MegaDestination : PB-cT3G-cERP2-MegaGate-IRES2-mNeonGreen | This Paper | Addgene Deposit 80028 |
| MegaDestination : EF1a- MegaGate | This Paper | Addgene Deposit 80028 |
| MegaDestination : EF1a- MegaGate-hU6 | This Paper | Addgene Deposit 80028 |
| dCas9-KRAB | Addgene 110820 | |
| dCas9-KRAB-MeCP2 | Addgene 110821 | |
| SP-dCas9-VPR | Addgene 63798 | |
| PB-SAM | Addgene 102559 | |
| dCas9-SunTag (2 vector system) | Addgene 60903 and 60904 | |
| PB-CA | Addgene 20960 | |
| PB-TA-ERP2 | Addgene 80477 | |
| XLone-GFP | Addgene 96930 | |
| MegaCassette (Sequence is found and annotated in MegaDestination vectors. Contains AttR1, I-SceI, I-CeuI, AttR2): | This Paper | N/A |
| STAMPScreen Target Ascertainment algorithms | This paper | |
| Geneious Prime 2019.2.3 | Biomatters Ltd. | N/A |
| DESeq2 v1.32.0 | N/A | |
| GraphPad Prism v8.3.1 for MacOS | Graph Pad software | N/A |
| STAR v2.5 | N/A | |
| BBMap v38.90 | N/A | |
| FlowJo v10.8 | Becton Dickinson & Company | N/A |
| samtools v1.3.1 | N/A | |
| Bowtie2 v2.1.0 | N/A | |
| Cutadapt v1.12 | N/A | |