Literature DB >> 24681693

Transcriptional profiling of cells sorted by RNA abundance.

Sandy Klemm1, Stefan Semrau2, Kay Wiebrands3,4, Dylan Mooijman3,4, Dina A Faddah5,6, Rudolf Jaenisch5,6, Alexander van Oudenaarden2,3,4,5,7.   

Abstract

We have developed a quantitative technique for sorting cells on the basis of endogenous RNA abundance, with a molecular resolution of 10-20 transcripts. We demonstrate efficient and unbiased RNA extraction from transcriptionally sorted cells and report a high-fidelity transcriptome measurement of mouse induced pluripotent stem cells (iPSCs) isolated from a heterogeneous reprogramming culture. This method is broadly applicable to profiling transcriptionally distinct cellular states without requiring antibodies or transgenic fluorescent proteins.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24681693      PMCID: PMC4174458          DOI: 10.1038/nmeth.2910

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


A common challenge in biology is to identify and isolate transcriptionally distinct subpopulations within a single tissue or cell type. While a variety of techniques have been developed to discriminate among these alternative expression modes, the most widely used methods require either transgenic integration of fluorescent protein reporters or the availability of specific antibodies1–3. These approaches, however, are precluded for biological systems that are refractory to genetic manipulation (e.g., primary human tissue) and for processes in which RNA – rather than protein – is the key discriminative marker (e.g., non-coding RNA). Recently, flow cytometry has been used to sort cells using a spectrum of fluorescent labeling techniques in whicholigonucleotide probes are hybridized to either DNA or RNA target sequences4–7. The principle limitation of these methods has been that RNA extracted from hybridized material is often highly degraded8,9. Although fragmented RNA can be reverse transcribed and analyzed by quantitative PCR (RT-qPCR)10,11, full-length RNA is required for an unbiased representation of the transcriptome. While RNA degradation is partially mitigated by labeling RNA in live cells7,12, extended hybridization in ex-vivo culture may obscure the molecular state of primary tissue. Given these limitations, we have developed a method for RNA labeling in cross-linked cells which permits full-length RNA isolation and unbiased transcriptional profiling. The proposed RNA cell sorting technique uses flow cytometry to measure the fluorescence of individual cells labeled with a single molecule RNA FISH (smFISH) probe library13,14. As a proof-of-principle experiment, green fluorescent protein (GFP) transcripts were fluorescently labeled in cells which expressed the transgene under doxycycline control15 (Supplementary Fig. 1a-c). To assess the sorting potential of the labeled RNA signal, single cell RNA fluorescence distributions were measured by flow cytometry, revealing a clear separation of high and low induction profiles (Fig. 1a and Supplementary Fig. 1b,c). Furthermore, the measured mRNA fluorescence was found to scale linearly with mRNA and protein abundance across a broad range of induction levels (Fig. 1b). We further confirmed the linearity of the labeled RNA fluorescence signal for a panel of endogenous genes by comparing the mean flow cytometry signal intensity with the average number of RNA molecules in iPSCs quantified by single cell transcript counting14 (Fig. 1c).
Figure 1

A quantitative single cell measurement of transcription

(a) Single cell distribution of GFP mRNA fluorescence for indicated doxycycline induction levels. (b) Linear and unbiased scaling of GFP mRNA fluorescence with RT-qPCR and GFP protein fluorescence. (c) Linear scaling of mean mRNAfluorescence with single cell transcript quantification by classical (microscopy) smFISH for a panel endogenous genes in iPS-2i cells (see Online Methods). (d) Sox2 mRNA signal (red) and background (black) with shaded 95% quantile (gray) and molecular resolution R reported for the p = 5% significance level (see Online Methods). Recapitulation of first (25%), second (50%), and third (75%) quartile sorting by classical smFISH transcript counting (inset).

We then asked whether the observed RNA fluorescence signal provides an accurate measurement of single cell transcript levels. For this purpose, we measured both GFP protein and labeled mRNA fluorescence in single cells and found a strong correlation (ρ = 0.77, Supplementary Fig. 1d), which is consistent with the expectation that mRNA and protein are broadly correlated in single cells. Additionally this correlation between mRNA and protein was confirmed across a broad range of GFP induction levels (Supplementary Fig. 1e). Next, we tested the single cell precision of the proposed RNA measurement by differentially labeling the 3′ and 5′ ends of a single transcriptional target (Oct4-IRES-GFP fusion mRNA) and found that the labels were strongly correlated at the single cell level (ρ = 0.90, Supplementary Fig. 2a). We then measured a mixture of cells labeled with differentially coupled probes for the gene Oct4 and confirmed that the two labels were strongly anti-correlated (ρ = −0.81, Supplementary Fig. 2b). Having established measurements of both positive and negative correlation, we hypothesized that doxycycline induced GFP expression would in principle be uncorrelated with every endogenous mRNA species in the genome. We confirmed this prediction across all measured GFP induction levels for the gene Oct4 (ρ = 0.1, Supplementary Fig. 2c). Finally, we sorted cellsabove and below each quartile of the Sox2 RNA fluorescence distribution. Single cell transcript counting in each fraction quantitatively recapitulated the flow cytometry quartile measurements (Fig. 1d). Taken together, these data suggest that flow cytometry can be used to quantitatively measure the abundance of mRNA in single cells hybridized with a complementary smFISH library. To assess the resolution of the labeled RNA signal, we measured the expected number of transcripts required for a cell to be statistically resolved from the background (see Online Methods). An estimated resolution of 57 transcripts was measured for a library of 20 nucleotide probes hybridized to the Sox2 gene in derived iPSCs (Supplementary Fig. 4a). Under more stringent hybridization conditions, a 30 nucleotide library improved the resolution to 24 transcripts (Supplementary Fig. 4b). This estimate was confirmed by sorting cells and directly measuring the difference in the number of transcripts required for a pair of cells to be resolved with 95% confidence (Supplementary Fig. 4c,d). The sorting resolution was further improved by measuring RNA abundance and sorting in specific cell cycle phases (Supplementary Fig. 5a-b). For G1 cells, the molecular resolution was estimated as 30 molecules for the 20 nucleotide probe library and 13 molecules for the 30 nucleotide library (Fig. 1d and Supplementary Fig. 5c,d). In order to measure the transcriptome of sorted subpopulations, RNA is extracted from hybridized cells by cross-link reversal (see Online Methods). Under standard hybridization conditions14, the molecular integrity of the extracted RNA is attenuated in a time and temperature dependent manner, due to enzymatic RNA degradation as well as hydrolysis mediated RNA fragmentation (Supplementary Fig. 6a). We have addressed this by developing an RNA preserving hybridization buffer (RPHB) which facilitates efficient isolation of full-length RNA following hybridization (Supplementary Fig. 6b). RPHB employs a nearly saturating salt concentration to eliminate enzymatic RNA degradation by precipitating proteins, and includes a high concentration of the chelating agent EDTA, which inhibits RNA degradation by sequestration of metal ions involved in RNA hydrolysis. We tested RPHB by extracting RNA from live and RPHB hybridized mouse embryonic stem cells (mESCs) and comparing relative expression levels by RT-qPCR and microarrays (Fig. 2a,b), establishing that RNA extracted from RPHB hybridized material is quantitatively unbiased. Furthermore, we found that the error distribution between technical microarray replicates is identical for live and hybridized samples (Supplementary Fig. 6c). We then examined the genome-wide expression fold change between NIH-3T3 fibroblasts and J1 mES cells. A comparison of fold change measurements for both live and hybridized samples revealed a strong correlation (ρ = 0.94) over the full dynamic range of the microarray (Fig. 2c). Finally, we isolated mES cells and fibroblasts by RNA FACS from an artificial mixture of these cells types; subpopulation transcriptome measurements on the sorted fractions recapitulated therespective cell type signatures (Supplementary Fig. 6d and Supplementary Fig. 7a,b).
Figure 2

RNA extraction and transcriptional sorting

(a) Hybridized and live mES cell expression (normalized to Gapdh) for a panel of genetic markers measure by RT-qPCR (black) and microarray (red). (b) Genome-wide microarray expression measurements for live and hybridized cells. (c) NIH-3T3 and J1 mESC expression fold change measurements for hybridized and live samples.

One of the motivating applications for RNA based sorting has been to transcriptionally profile induced pluripotent stem cells during the process of cellular reprogramming16. Following disruption of the somatic state during reprogramming, individual cells stochastically reactivate the pluripotency machinery at widely different rates17 and contribute to a diverse collection of co-existing subpopulations18. While iPS cells define a subpopulation which is independent of ectopic reprogramming factors, incompletely reprogrammed cells are dependent on sustained reprogramming factor expression in order to be competitively maintained in culture. To interrogate these subpopulations, we isolated cells which had reactivated the endogenous Sox2 locus – a definitive marker for reprogramming18,19 – from a background of partially reprogrammed cells. The Sox2+ and Sox2− transcriptomes were then compared with iPS cells derived by reprogramming factor withdrawal. Secondary mouse embryonic fibroblasts (2° MEFs, see Online Methods) were reprogrammed by exposure to doxycycline induced expression of Oct4, Sox2, Klf4, and c-Myc (OSKM) for 32 consecutive days (Fig. 3a), at which point iPS cells were passaged in co-culture with MEFs (iPSMEF) and in feeder-free 2i media (iPS-2i) as OSKM independent colonies (Fig. 3a). The distribution of endogenous Sox2 expression – measured by flow cytometry using a smFISH probe library designed against the non-coding 3′UTR of Sox2 – was bimodal (Fig. 3a) for day 32 OSKM+ cells, reflecting an underlying diversity of reprogramming depth among individual cells. The upper and lower Sox2 expression tails were sorted and transcriptionally profiled in comparison with OSKM independent iPSCs, revealing an unambiguous pluripotency signature for the positive fraction (Fig. 3b). While many somatic marks were repressed in both Sox2+ and Sox2− cells, a broad spectrum of iPSC specific genes were differentially upregulated in Sox2+ cells, including transcription factors, RNA binding proteins, chromatin regulators, and cell surface markers. Reciprocally, Sox2− cells expressed a class of differentiation associated genes which were repressed in iPS cells. The striking similarity between Sox2+ and iPS cells suggests that cells within the Sox2+ subpopulation are reprogrammed and give rise to stable iPS cells under OSKM withdrawal. By leveraging a non-coding transcriptional element (Sox2 3 ′ U TR), these experiments illustrate the flexibility of RNA FACS and suggest the potential for a broader subpopulation analysis of cellular reprogramming.
Figure 3

Isolation and transcriptional profiling of iPS cells

(a) Secondary MEF reprogramming; isolation of iPS cells by OSKM withdrawal in MEF co-culture (iPS-MEF) and 2i media (iPS-2i); RNA FACS for reprogrammed Sox2+ (red) and non-reprogrammed Sox2− (black) cells (25 μm scale bars). The bimodal Sox2 3′UTR signal (black) relative to the non-specific background signal (gray) reflects transcriptional and phenotypic heterogeneity in day 32 OSKM+ cells. (b) Hierarchical clustering of genes differentially expressed between Sox2+ and Sox2− cells (top 5% shown).

Single cell heterogeneity in gene expression is a common phenomenon for a variety of developmental and homeostatic processes. The principle focus of this work has been to develop a fluorescent measure of RNA in single cells, which facilitates high resolution sorting as well asefficient and unbiased RNA isolation. This technique extends flow cytometry to a new class of applications based on direct quantification of RNA.

ONLINE METHODS

Molecular resolution

The molecular resolution is defined as the expected number of mRNA molecules required for a cell to be statistically resolved from the background. The resolution depends on the measured signal, the single cell distribution of mRNA, and the background fluorescence. The transcript distribution was measured by counting smFISH labeled mRNA in single cells as previously described14. The background fluorescence – which accounts for both cellular autofluorescence and non-specific probe binding – was estimated by hybridizing a mock sample with a 1:20 mixture of fluorescently labeled and unlabeled probe libraries (yielding a 95% attenuation of the specific signal). The resolution R was calculated for the significance level p using the mock signal as a null model (see Supplementary Note).

Fluorescence in-situ hybridization buffers and probes

Oligonucleotide libraries with 20 nucleotide probes were designed and fluorescently labeled as previously described14. Probes for the 30 nucleotide Sox2 library were similarly designed such that the predicted melting temperature of individual probes deviates from the median by no more than 5°C. See Supplementary Data for all probe libraries used in this study. The following buffers were used in this study: (RPH buffer) 300 mM Sodium chloride, 30 mM Sodium citrate, 2.1 M Ammonium sulfate, 10 mM EDTA, 1 mg/ml E. Coli tRNA, 500 μg/ml BSA, 25% (40%) formamide for 20 (30) nucleotide probe library; (Wash buffer) 25% (40%) formamide for 20 (30) nucleotide probe library, 2x SSC; (Flow buffer) 2x SSC; (Sorting buffer) 50% RNALater Solution (Ambion), 2x SSC; (Reverse cross-linking buffer) 100 mM NaCl, 10 mM pH 8.0 Tris, 1 mM EDTA, 0.5% SDS, 500 μg/ml Proteinase K. The dependence of RNA degradation on hybridization conditions (Supplementary Fig. 4a) was measured using the previously reported (classical) smFISH hybridization buffer14.

RNA fluorescence in-situ hybridization

Cells were fixed in 4% paraformaldehyde for 5 minutes, centrifuged at 1000 RCF for 5 minutes, and washed in 70% ethanol (EtOH). Following overnight ethanol permeabilization (70% EtOH) at 4°C, cells were resuspended in RPH buffer with labeled probes (0.5-1 ng/μl) and incubated at 30°C for 12 hours. An equal volume of wash buffer was then added and mixed thoroughly with the sample. Cells were then pelleted by centrifugation and resuspended in wash buffer for 30 minutes at 30°C. After repeating the previous wash step, cells were resuspended in flow buffer and maintained at 4°C in preparation for sorting.

Flow cytometry, FACS and RNA extraction

Cells were sorted by FACS into 4°C sorting buffer using a BD Biosciences Aria II flow cytometer. Unless otherwise noted, cell size was controlled for by co-staining cells with Hoechst 33342 (Invitrogen) and selecting for diploid DNA content. FACS sorted cells were centrifuged as before and resuspended in reverse cross-linking buffer at 50°C for 1 hour. Total RNA was isolated by phenol-chloroform (Trizol, Invitrogen) using the manufacturerfs protocol.

Microarrays

Microarray assays were performed by the Massachusetts Institute of Technology (MIT) BioMicro Center. NIH-3T3 fibroblasts and mESCs were assayed using Eukaryotic Exon 1.0 ST arrays from Agilent, while Mouse 430A 2.0 Affymetrix chips were used for MEF and iPSCs.

Cell lines and media

Embryonic and induced pluripotent stem cells were grown as indicated: (1) co-cultured with irradiated MEF cells (Global Stem) or (2) cultured in 2i conditions with both glycogen synthase kinase 3β (Stemgent) and mitogen-activated protein kinase kinase inhibitors (Stemgent). ES and iPS cells were grown with leukemia inhibitory factor (103 units/ml, Millipore) and 15% (10% for E14 cells) heat-inactivated fetal bovine serum (Hyclone) together with Knockout DMEM (Gibco), L-glutamine (Gibco), MEM non-essential amino acids (Gibco), penicillin-streptomycin (Gibco) and β-mercaptoethanol (Sigma). NIH-3T3 fibroblast and primary MEF cells were cultured without inhibitors in 10% serum. E14 mES cells were used for the post-hybridization RNA extraction controls and J1 mES cells (2i media) were used for the NIH-3T3 & mESC sorting experiments. KH2:GFP cells15 – with constitutive R26-M2rtTA expression and a tetracycline-inducible eGFP construct targeted to the ColA1 locus – were used for the titrated GFP induction experiments.

Secondary somatic cell generation and reprogramming

Secondary mouse embryonic fibroblasts (2° MEFs) were isolated from chimeric embryos as previously described20, providing doxycycline inducible Oct4, Sox2, Klf4, and c-Myc expression; Neomycin selection at the Nanog locus; and a fluorescent transcriptional reporter for Oct4. The parental pluripotent stem cells used to generate the secondary line were derived by replacing an Oct4 allele with an Oct4-IRES-GFP sequence in Nanog-Neo iPS cells20. Secondary MEFs were plated at optimal density20 and passaged after 48 hours. Doxycycline (2μg/ml, Stemgent) was added 24 hours after replating, marking the start of reprogramming. Neomycin selection (Stemgent G418, 1μg/ml) was applied beginning at day 6 and maintained throughout the reprogramming timecourse. Reprogrammed iPS cells were stabilized in co-culture with MEFs (iPSMEF) and in feeder-free conditions (iPS-2i).
  19 in total

1.  Telomere length dynamics in human lymphocyte subpopulations measured by flow cytometry.

Authors:  N Rufer; W Dragowska; G Thornbury; E Roosnek; P M Lansdorp
Journal:  Nat Biotechnol       Date:  1998-08       Impact factor: 54.908

2.  A late transition in somatic cell reprogramming requires regulators distinct from the pluripotency network.

Authors:  Azadeh Golipour; Laurent David; Yu Liu; Gowtham Jayakumaran; Calley L Hirsch; Dan Trcka; Jeffrey L Wrana
Journal:  Cell Stem Cell       Date:  2012-12-07       Impact factor: 24.633

3.  Green fluorescent protein as a marker for gene expression.

Authors:  M Chalfie; Y Tu; G Euskirchen; W W Ward; D C Prasher
Journal:  Science       Date:  1994-02-11       Impact factor: 47.728

4.  Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors.

Authors:  Kazutoshi Takahashi; Shinya Yamanaka
Journal:  Cell       Date:  2006-08-10       Impact factor: 41.582

5.  Efficient method to generate single-copy transgenic mice by site-specific integration in embryonic stem cells.

Authors:  Caroline Beard; Konrad Hochedlinger; Kathrin Plath; Anton Wutz; Rudolf Jaenisch
Journal:  Genesis       Date:  2006-01       Impact factor: 2.487

6.  A drug-inducible transgenic system for direct reprogramming of multiple somatic cell types.

Authors:  Marius Wernig; Christopher J Lengner; Jacob Hanna; Michael A Lodato; Eveline Steine; Ruth Foreman; Judith Staerk; Styliani Markoulaki; Rudolf Jaenisch
Journal:  Nat Biotechnol       Date:  2008-07-01       Impact factor: 54.908

7.  Stochastic patterning in the mouse pre-implantation embryo.

Authors:  Jens-Erik Dietrich; Takashi Hiiragi
Journal:  Development       Date:  2007-10-31       Impact factor: 6.868

8.  Imaging individual mRNA molecules using multiple singly labeled probes.

Authors:  Arjun Raj; Patrick van den Bogaard; Scott A Rifkin; Alexander van Oudenaarden; Sanjay Tyagi
Journal:  Nat Methods       Date:  2008-09-21       Impact factor: 28.547

9.  Direct cell reprogramming is a stochastic process amenable to acceleration.

Authors:  Jacob Hanna; Krishanu Saha; Bernardo Pando; Jeroen van Zon; Christopher J Lengner; Menno P Creyghton; Alexander van Oudenaarden; Rudolf Jaenisch
Journal:  Nature       Date:  2009-11-08       Impact factor: 49.962

10.  Simultaneous detection of mRNA and protein stem cell markers in live cells.

Authors:  Won Jong Rhee; Gang Bao
Journal:  BMC Biotechnol       Date:  2009-04-02       Impact factor: 2.563

View more
  20 in total

Review 1.  Variation in transcriptome size: are we getting the message?

Authors:  Jeremy E Coate; Jeff J Doyle
Journal:  Chromosoma       Date:  2014-11-26       Impact factor: 4.316

2.  Transcriptome Analysis of Mycobacteria-Specific CD4+ T Cells Identified by Activation-Induced Expression of CD154.

Authors:  Shajo Kunnath-Velayudhan; Michael F Goldberg; Neeraj K Saini; Christopher T Johndrow; Tony W Ng; Alison J Johnson; Jiayong Xu; John Chan; William R Jacobs; Steven A Porcelli
Journal:  J Immunol       Date:  2017-08-18       Impact factor: 5.422

3.  Single-cell transcriptional landscapes reveal HIV-1-driven aberrant host gene transcription as a potential therapeutic target.

Authors:  Runxia Liu; Yang-Hui Jimmy Yeh; Ales Varabyou; Jack A Collora; Scott Sherrill-Mix; C Conover Talbot; Sameet Mehta; Kristen Albrecht; Haiping Hao; Hao Zhang; Ross A Pollack; Subul A Beg; Rachela M Calvi; Jianfei Hu; Christine M Durand; Richard F Ambinder; Rebecca Hoh; Steven G Deeks; Jennifer Chiarella; Serena Spudich; Daniel C Douek; Frederic D Bushman; Mihaela Pertea; Ya-Chi Ho
Journal:  Sci Transl Med       Date:  2020-05-13       Impact factor: 17.956

4.  A microfluidic approach to parallelized transcriptional profiling of single cells.

Authors:  Hao Sun; Timothy Olsen; Jing Zhu; Jianguo Tao; Brian Ponnaiya; Sally A Amundson; David J Brenner; Qiao Lin
Journal:  Microfluid Nanofluidics       Date:  2015-10-14       Impact factor: 2.529

5.  Simultaneous, Single-Cell Measurement of Messenger RNA, Cell Surface Proteins, and Intracellular Proteins.

Authors:  Kah Teong Soh; Joseph D Tario; Sean Colligan; Orla Maguire; Dalin Pan; Hans Minderman; Paul K Wallace
Journal:  Curr Protoc Cytom       Date:  2016-01-06

6.  Isolation of intact RNA from murine CD4+ T cells after intracellular cytokine staining and fluorescence-activated cell sorting.

Authors:  Shajo Kunnath-Velayudhan; Steven A Porcelli
Journal:  J Immunol Methods       Date:  2018-02-17       Impact factor: 2.303

Review 7.  Cholesterol signaling in single cells: lessons from STAR and sm-FISH.

Authors:  Colin R Jefcoate; Jinwoo Lee
Journal:  J Mol Endocrinol       Date:  2018-05       Impact factor: 5.098

8.  The developmental potential of iPSCs is greatly influenced by reprogramming factor selection.

Authors:  Yosef Buganim; Styliani Markoulaki; Niek van Wietmarschen; Heather Hoke; Tao Wu; Kibibi Ganz; Batool Akhtar-Zaidi; Yupeng He; Brian J Abraham; David Porubsky; Elisabeth Kulenkampff; Dina A Faddah; Linyu Shi; Qing Gao; Sovan Sarkar; Malkiel Cohen; Johanna Goldmann; Joseph R Nery; Matthew D Schultz; Joseph R Ecker; Andrew Xiao; Richard A Young; Peter M Lansdorp; Rudolf Jaenisch
Journal:  Cell Stem Cell       Date:  2014-09-04       Impact factor: 24.633

Review 9.  Cerebral cortical neuron diversity and development at single-cell resolution.

Authors:  Matthew B Johnson; Christopher A Walsh
Journal:  Curr Opin Neurobiol       Date:  2016-11-23       Impact factor: 6.627

10.  Probe-Seq: Method for RNA Sequencing of Specific Cell Types from Animal Tissue.

Authors:  Ryoji Amamoto; Mauricio D Garcia; Emma R West; Jiho Choi; Sylvain W Lapan; Elizabeth A Lane; Norbert Perrimon; Constance L Cepko
Journal:  Bio Protoc       Date:  2020-09-20
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.