Rajeevkumar Raveendran Nair1, Stefan Blankvoort1, Maria Jose Lagartos1, Cliff Kentros2. 1. Kavli Institute for Systems Neuroscience and Centre for Neural Computation, NTNU, Norway. 2. Kavli Institute for Systems Neuroscience and Centre for Neural Computation, NTNU, Norway; Institute of Neuroscience, University of Oregon, Eugene OR, USA. Electronic address: clifford.kentros@ntnu.no.
Abstract
Although a variety of remarkable molecular tools for studying neural circuits have recently been developed, the ability to deploy them in particular neuronal subtypes is limited by the fact that native promoters are almost never specific enough. We recently showed that one can generate transgenic mice with anatomical specificity surpassing that of native promoters by combining enhancers uniquely active in particular brain regions with a heterologous minimal promoter, an approach we call EDGE (Enhancer-Driven Gene Expression). Here we extend this strategy to the generation of viral (rAAV) vectors, showing that some EDGE rAAVs can recapitulate the specificity of the corresponding transgenic lines in wild-type animals, even of another species. This approach thus holds the promise of enabling circuit-specific manipulations in wild-type animals, not only enhancing our understanding of brain function, but perhaps one day even providing novel therapeutic avenues to approach disorders of the brain.
Although a variety of remarkable molecular tools for studying neural circuits have recently been developed, the ability to deploy them in particular neuronal subtypes is limited by the fact that native promoters are almost never specific enough. We recently showed that one can generate transgenic mice with anatomical specificity surpassing that of native promoters by combining enhancers uniquely active in particular brain regions with a heterologous minimal promoter, an approach we call EDGE (Enhancer-Driven Gene Expression). Here we extend this strategy to the generation of viral (rAAV) vectors, showing that some EDGE rAAVs can recapitulate the specificity of the corresponding transgenic lines in wild-type animals, even of another species. This approach thus holds the promise of enabling circuit-specific manipulations in wild-type animals, not only enhancing our understanding of brain function, but perhaps one day even providing novel therapeutic avenues to approach disorders of the brain.
The mammalian brain is the most complex biological structure known, with innumerable distinct cell types differing in cytoarchitecture, electrophysiological properties, gene expression, and connectivity (Luo et al., 2008, Zeng and Sanes, 2017). Understanding brain function requires understanding neural circuits at the level of specificity at which they operate. Recent years have seen the development of truly revolutionary molecular tools that allow neuroscientists to elucidate precise neural connectivity (Callaway and Luo, 2015) and monitor (Chen et al., 2013) and manipulate (Boyden et al., 2005, Roth, 2016, Sternson and Roth, 2014) neural activity. However, optimal use of these tools to examine the functional circuitry of the brain requires the ability to deliver them specifically to particular elements of neural circuits (i.e., neuronal cell types), rather than as a nonspecific bolus affecting all of the neurons in a brain area. The use of molecular genetics is the only method by which one can perform truly cell-type specific manipulations, as evidenced by a variety of studies using transgenic animals expressing transgenes from neuronal promoters (genomic regions just upstream of the transcriptional start site) (Kanter et al., 2017, Miao et al., 2017). However, such approaches are limited by the fact that, because individual genes are expressed in a variety of cell types in the brain, promoters are not specific to a single neuronal cell type. Although estimates vary (ENCODE Project Consortium, 2012), there are at least an order of magnitude more cis-regulatory elements (i.e., enhancers and repressors, distal genomic regions that help regulate where and when promoters transcribe DNA) than promoters, suggesting that enhancers may be more specific. This led us to take an approach to the generation of molecular genetic tools that we call Enhancer-Driven Gene Expression (EDGE), based on identifying the cis-regulatory elements uniquely active in particular brain regions and combining them with a heterologous minimal promoter. When we used this strategy to make transgenic mice, they were indeed significantly more specific than the presumed parent gene, often driving expression primarily in particular sets of neurons in the brain region they were derived from (Blankvoort et al., 2018).However, although transgenic animals are powerful tools for the analysis of neural circuits, they have limitations. They are costly in both time and resources, can be subject to insertional effects (Matthaei, 2007, Feng et al., 2000), and are most practical in a limited number of species. Moreover, although they are often excellent models of disease, transgenic technologies are far from therapeutic applications. Recombinant adeno-associated viral vectors (rAAVs) can overcome many of the above issues. They can be made relatively quickly, generally do not insert into the genome or replicate, and can be used in a variety of species (Watakabe et al., 2015) including humans and therefore have clinical potential as well (Bouard et al., 2009, Dias et al., 2018, Kotterman and Schaffer, 2014, Mendell et al., 2017). However, efforts to generate cell-specific viral vectors by capsid modifications (Koerber et al., 2008, Koerber et al., 2009, Klimczak et al., 2009) or using promoters (Delzor et al., 2012, Kugler et al., 2003, Shevtsova et al., 2005) have been largely unsuccessful to date to address a particular cell type, with a few notable exceptions (Dimidschstein et al., 2016, Hartl et al., 2017), and even those are likely to have multiple subclasses. This is in large part because the relatively small payload size of rAAVs puts most native promoters out of reach. However, most enhancers are much smaller than promoters, raising the intriguing possibility of targeting specific neuronal cell types in any species by adapting EDGE to viral vectors, provided the background expression of the viral backbone and promoter can be minimized. Toward this end, we present results demonstrating enhancer-based viral vectors that specifically express in particular neurons of the entorhinal cortex (EC) in two different species of wild-type animals.
Results
Optimization of rAAV Design for Enhancer-Driven Gene Expression
Because one can obtain some degree of apparent specificity with rAAVs by means other than transcriptional regulation, we took steps to ensure that any observed specificity comes from the enhancer element used. Most notably, AAV serotypes exhibit distinct tropisms for different cell types: for instance, AAV8 is most efficient for oligodendrocytes and astrocytes (Aschauer et al., 2013, Hutson et al., 2012) and AAV 1, 2, 5, 7, 8, 9 prefer neurons (Aschauer et al., 2013, Castle et al., 2016, Davidson et al., 2000, During et al., 2003) (although they are by no means exclusive to them), whereas rAAV9 appears well suited for cortical neurons (Aschauer et al., 2013) and a variety of AAVs with engineered capsids show specific tropisms (Deverman et al., 2016, Tervo et al., 2016). We therefore used a single serotype (AAV2/1) with a wide tropism for neurons (Hauck et al., 2003) for the vast majority of our efforts toward engineering rAAVs transcriptionally specific to particular subtypes of neurons. We selected AAV 2/1, a chimera between capsid-1 (less efficient neuronal transduction [Castle et al., 2016]) and capsid-2 (vast tropism [Wang et al., 2003]) because of its broad transduction efficiency (Hauck et al., 2003) and to prepare viruses with high purity (During et al., 2003, Mcclure et al., 2011) via heparin columns (see Transparent Methods).Because injections of small volumes of rAAVs can appear specific because of the specific parcellation around the injection site, we used a medial entorhinal cortex (MEC) enhancer (MEC13-53) known to be specific to a particular subset of neurons in the entorhinal cortex (Blankvoort et al., 2018) in transgenic animals so we knew what to look for. Figure 1A shows the expression pattern obtained from crossing one of the MEC13-53 tTA driver lines to a payload line expressing the helper transgenes for the ΔG-rabies monosynaptic tracing system (Blankvoort et al., 2018). Expression in this cross was limited to Reelin-positive (RE+), Calbindin-negative (CB-) excitatory projection neurons in layer (LII) of the EC (Kitamura et al., 2014, Varga et al., 2010, Witter et al., 2017). Finally, we injected the same large (400 nL in mice, as opposed to the ∼50-nL injections typically used with nonspecific rAAVs) volume of each virus into multiple animals using the same EC coordinates and compared only green fluorescent protein (GFP)-expressing rAAVs of similar titer (see Table S1 and Transparent Methods). For the purposes of comparison, Figure 1B shows the widespread strong expression throughout the various layers of the entorhinal cortex (as well as subiculum and parasubiculum) resulting from injecting a control AAV with a relatively (it has been shown to prefer neurons) nonspecific cytomegalovirus promoter (CMV-rAAV) of the same serotype and similar titer.
Figure 1
Optimization of rAAV Constructs for Enhancer-Dependent Gene Expression
(A) Transgene expression in a MEC13-53 tTA X tetO-TVAG transgenic cross visualized by anti-2A immunostaining is restricted to RE + LII projection neurons in EC (Blankvoort et al., 2018). Since this is a different antibody, this is purely a qualitative comparison.
(B) Injection of a nonspecific (CMV-rAAV) virus into the EC shows broad label throughout the entire region, including all layers of EC, as well as subiculum (S) and parasubiculum (PaS).
(C) The same construct without a minimal promoter shows weak nonspecific expression throughout the region that would not be visible at normal image settings (see Figure S1).
(D) Changing the orientation of the expression cassette leads to a marked reduction in nonspecific expression of MEC13-53 rAAV (see inset in [C] and [D], note that most of the LIII label in [D] is not cellular, unlike in [C], and when it is, it is very light, i.e., from baseline transcription). All murine injections were 400 nL. NB: images were differentially modified to best visualize the GFP expression pattern in each panel; comparisons of these images with the same post-acquisition settings are shown in Figure S1 (see Transparent Methods). Note that all label above background auto-fluorescence was treated as positive, even though there were two markedly distinct intensities of label. See also related Figures S1 and S2. Schematics of the viral designs are depicted on top of the corresponding image. ITR, inverted terminal repeat; W, woodchuck hepatitis virus post-transcriptional regulatory element; pA, human growth hormone polyadenylation signal; E, enhancer; G, Green fluorescent protein; C, cytomegalovirus promoter; CMV*, mutated minimal cytomegalovirus promoter. Scale bar, 100 μm.
Optimization of rAAV Constructs for Enhancer-Dependent Gene Expression(A) Transgene expression in a MEC13-53 tTA X tetO-TVAG transgenic cross visualized by anti-2A immunostaining is restricted to RE + LII projection neurons in EC (Blankvoort et al., 2018). Since this is a different antibody, this is purely a qualitative comparison.(B) Injection of a nonspecific (CMV-rAAV) virus into the EC shows broad label throughout the entire region, including all layers of EC, as well as subiculum (S) and parasubiculum (PaS).(C) The same construct without a minimal promoter shows weak nonspecific expression throughout the region that would not be visible at normal image settings (see Figure S1).(D) Changing the orientation of the expression cassette leads to a marked reduction in nonspecific expression of MEC13-53 rAAV (see inset in [C] and [D], note that most of the LIII label in [D] is not cellular, unlike in [C], and when it is, it is very light, i.e., from baseline transcription). All murine injections were 400 nL. NB: images were differentially modified to best visualize the GFP expression pattern in each panel; comparisons of these images with the same post-acquisition settings are shown in Figure S1 (see Transparent Methods). Note that all label above background auto-fluorescence was treated as positive, even though there were two markedly distinct intensities of label. See also related Figures S1 and S2. Schematics of the viral designs are depicted on top of the corresponding image. ITR, inverted terminal repeat; W, woodchuck hepatitis virus post-transcriptional regulatory element; pA, human growth hormone polyadenylation signal; E, enhancer; G, Green fluorescent protein; C, cytomegalovirus promoter; CMV*, mutated minimal cytomegalovirus promoter. Scale bar, 100 μm.The initial step in obtaining viruses capable of driving expression as specific as the EDGE transgenic animals in wild-type brains is to find a minimal viral promoter that is capable of robust expression only when paired with a heterologous enhancer. This is complicated by the fact that the viral inverted terminal repeats (ITRs) themselves have transcriptional activity (Carter et al., 1993, Flotte et al., 1993, Haberman et al., 2000), as can be seen by the very weak (but still above autofluorescence) nonspecific expression obtained from a viral construct with neither a promoter nor an enhancer (Figure 1C). Note that the expression levels in Figure 1C are far below those seen with the other viruses: each panel in Figure 1 has been differentially post-acquisition processed to aid visualization, the “background” expression seen in Figure 1C would otherwise be imperceptible (see Figure S1 for comparison of each image with the same processing). To minimize this issue, we reversed the orientation of the expression cassette relative to the ITRs such that the sense strand was under the influence of the 3′ ITR, which we attenuated by putting WPRE (Zufferey et al., 1999) between the 3′ITR and the enhancer (see schematics in 1C, D). The substantial reduction in background expression enabled us to recapitulate MEC LII-specific expression in a wild-type mouse (Figure 1D) with a mutated minimal CMV promoter (CMV∗) (Loew et al., 2010). Roughly similar results varying in amount and specificity were obtained with other minimal promoters (Figure S2), but we selected CMV∗ for all subsequent experiments (and hereafter simply refer to the enhancer) as it was the smallest one that worked well. The specificity of the expression of this virus as compared with a nonspecific CMV-rAAV virus is quantified in Figure 2. Although still clearly far more specific than the CMV-rAAV, the quantification of MEC13-53 rAAV does not seem as specific as it looks in the figure panels because in our counts we did not distinguish between weak “background” label (such as that seen in Figures 1C and S1 without a promoter) and the strong specific labeling (see below).
Figure 2
MEC13-53 EDGE rAAVs Recapitulates the Cell-Type Specificity Seen in the MEC13-53 EDGE Transgenic Crosses in WT Mice and Rats
Equal amount of MEC13-53 rAAV was injected into MEC of wild-type mice (A–D) and rats (E–H). Insets show anti-GFP (top), marker (middle), and overlay (bottom) of box in main panel. Sections of MEC13-53 rAAV injections counterstained with anti-RE antibody (red, [A] and [E]) and anti-CB antibody (red, [B] and [F]); with a CB + cluster (asterisks) in the insets in (B) and (F). Note the extensive co-localization of the RE stain with the GFP, the sharp delineation of the entorhinal/parasubicular boundary by both labels (arrows, [A] and [E]), and the exclusion of viral label from the CB clusters (asterisks, [B] and [F]).
(C) Proportion of GFP-expressing cells in different parahippocampal regions for both MEC13-53 and nonspecific CMV-rAAV. Each point is a section, note the large number of sections where 100% of the cells are in LII and 0% in other regions exclusively in the MEC13-53 rAAVs compared with the controls (for pictures of control injections see Figures 1B, S4, and S5). A total of 13,096 and 8,540 GFP + cells were counted from three mice injected with CMV-rAAV and seven mice with MEC13-53 rAAV, respectively; data represented as mean ± SEM. In (G), 7,191 and 2,831 GFP + cells from sections were counted from MEC13-53 rAAV and CMV-rAAV, respectively, from three rats. Quantitation of results are shown in (D) (for mice) and (H) (for rats), showing overlap of GFP with cell-marker Reelin stain (green) in LII MEC of mice (96%) and rats (complete overlap). About 4% overlap of GFP with Calbindin (red) was observed in mice and <2% overlap in rats, with number of cells counted in MECLII region. MEC-LII GFP + cells were counted from separate RE and CB immunostained sections from seven mice and three rats injected with MEC13-53 rAAV; data represented as mean ± SEM. See also related Figures S3–S6. Scale bar, 100 μm; all images were processed identically.
MEC13-53 EDGE rAAVs Recapitulates the Cell-Type Specificity Seen in the MEC13-53 EDGE Transgenic Crosses in WT Mice and RatsEqual amount of MEC13-53 rAAV was injected into MEC of wild-type mice (A–D) and rats (E–H). Insets show anti-GFP (top), marker (middle), and overlay (bottom) of box in main panel. Sections of MEC13-53 rAAV injections counterstained with anti-RE antibody (red, [A] and [E]) and anti-CB antibody (red, [B] and [F]); with a CB + cluster (asterisks) in the insets in (B) and (F). Note the extensive co-localization of the RE stain with the GFP, the sharp delineation of the entorhinal/parasubicular boundary by both labels (arrows, [A] and [E]), and the exclusion of viral label from the CB clusters (asterisks, [B] and [F]).(C) Proportion of GFP-expressing cells in different parahippocampal regions for both MEC13-53 and nonspecific CMV-rAAV. Each point is a section, note the large number of sections where 100% of the cells are in LII and 0% in other regions exclusively in the MEC13-53 rAAVs compared with the controls (for pictures of control injections see Figures 1B, S4, and S5). A total of 13,096 and 8,540 GFP + cells were counted from three mice injected with CMV-rAAV and seven mice with MEC13-53 rAAV, respectively; data represented as mean ± SEM. In (G), 7,191 and 2,831 GFP + cells from sections were counted from MEC13-53 rAAV and CMV-rAAV, respectively, from three rats. Quantitation of results are shown in (D) (for mice) and (H) (for rats), showing overlap of GFP with cell-marker Reelin stain (green) in LII MEC of mice (96%) and rats (complete overlap). About 4% overlap of GFP with Calbindin (red) was observed in mice and <2% overlap in rats, with number of cells counted in MECLII region. MEC-LII GFP + cells were counted from separate RE and CB immunostained sections from seven mice and three rats injected with MEC13-53 rAAV; data represented as mean ± SEM. See also related Figures S3–S6. Scale bar, 100 μm; all images were processed identically.
MEC13-53 EDGE rAAVs Express Specifically in Layer II Stellate Cells in Wild-Type Mice and Rats
The neuron-specific stain NeuN (Boccara et al., 2015) confirms the robust LII-specific expression of the MEC13-53 rAAV (Figures S3A and S3C) in neurons (100% of labeled cells were NeuN+, data not shown). Weak, “background” GFP expression was observed in other layers as well in both this virus (Figure S3A, inset) and in the rAAV backbone (i.e., the same virus lacking the enhancer, Figure S3B, inset), which in contrast did not strongly label any cells. Within LII of MEC there are two major classes of excitatory principal neurons, RE + stellate cells and CB + pyramidal cells (Rowland et al., 2018, Witter et al., 2017), with RE label providing a sharp boundary between MEC and parasubiculum (Varga et al., 2010, Witter et al., 2017) (see arrows in Figures 2A and 2E inset). We therefore performed immunohistochemical analysis comparing these markers with viral GFP and found that, for the MEC13-53 rAAV, 96% (2,300/2,406) of GFP + cells in layer II were RE+ (Figures 2A and 2D), whereas 4% (74/1,668) were CB+ (Figures 2B and 2D). In contrast, for injections of roughly equal amounts of the ubiquitous CMV-rAAV, only 34% (319/929) of GFP + LII cells were RE +, whereas 10.5% (142/1,353) were CB+. Thus, the MEC13-53 rAAV drives transgene expression specifically in a particular subset of excitatory neurons in EC of wild-type mice, i.e., RE + EC LII neurons (stellate cells in MEC), avoiding the adjacent CB + pyramidal cells, like the transgenic lines based on the same enhancer.Although this nicely illustrates the specificity of this EDGE rAAV, perhaps the greatest utility of EDGE rAAVs is that, because enhancers are highly conserved (Cotney et al., 2013) and can be obtained from any tissue sample, they have the potential to work across species. As seen in Figures 2E–2G, S3C, and S4C, the MEC13-53 rAAV derived from mouse EC is, if anything, more specific in the rat. Figures 2E and S3C shows GFP expression almost exclusively in MEC LII (as quantified in Figures 2G and S6B), whereas the few labeled neurons in the virus with no enhancer have no layer specificity (Figure S3D), just as in mouse (Figure S3B). Similarly, 100% (2,332/2,332) of MEC LII GFP + neurons in rats injected with MEC13-53 rAAVs were RE+ (Figures 2E and 2H), whereas only 1.4% (25/1,799) were CB+ (Figures 2F and 2H), even though the two excitatory subtypes are intermingled (Witter et al., 2017). This, and the presence of LII-specific label throughout the dorsoventral and medio-lateral axes of the MEC (Figure S4C), provides compelling evidence for cellular specificity. Note that, with the nonspecific CMV-rAAV, 35% (189/518) of GFP + LII cells were RE +, whereas 46% (285/613) were CB+ (Figure S5). It is interesting to note that, although these two markers are largely mutually exclusive, there are reports of a very small subpopulation of RE + neurons that are also CB+ (Fuchs et al., 2016, Varga et al., 2010), so the single-digits label with the MEC13-53 virus may be those cells. Clearly, though, the two rAAVs with the same serotype have very different expression patterns, both in terms of layer and cellular specificity.
Systemic Administration of Blood-Brain Barrier Crossing MEC13-53 EDGE Recapitulates MEC Layer II Stellate Cell Expression
Although we are mainly interested in developing tools to be used in analysis of the EC, it is interesting to ask whether this enhancer would express in other brain regions if it were systemically administered. We therefore packaged the MEC13-53 EDGE enhancer (shown with the 2/1 serotype in Figures 1 and 2) into the blood-brain barrier crossing PHP (Deverman et al., 2016) serotype and performed noninvasive intravenous injections via the tail vein. Systemic injections of MEC13-53 EDGE PHP resulted in much sparser GFP + cells overall, but they are also mostly confined to layer II of MEC throughout the caudal forebrain (Figures 3A and S7). However, we also noticed sparse expression of the transgene in regions other than MEC, typically also in brain regions we would sometimes see transgene expression in MEC13-53 transgenic lines (Figures 3B and 3C, Table S2). Curiously, we did not see expression in LII of the piriform cortex, the major site of non-EC expression in the MEC13-53 transgenic lines, possibly due to the particular tropism of the PHP capsid. Furthermore, we confirmed that these GFP + cells in MEC are RE+ (Figure 3D). These results suggest that EDGE rAAV can retain its particular cell-type specificity, even when assembled in a serotype with a different innate tropism.
Figure 3
Recapitulation of LII MEC Specificity of MEC13-53 Using a BBB-Crossing rAAV Serotype
(A and B) (A) Representative image of the GFP + neurons in horizontal brain section from a mouse injected with 1012 particles of MEC13-53 rAAV PHP, intravenously into tail vein. The boxes in (A) are zoomed in (B).
(C) MEC13-53 transgene expression in same regions as in (B) are in the MEC13-53 tTA X tetO-TVAG transgenic cross.
(D) Sections of MEC13-53 rAAV PHP injected brain counterstained with anti-RE antibody. Insets show anti-GFP (top), Reelin (middle), and overlay (bottom) of box in main panel. Label is throughout the layers of EC and sparsely in other regions (arrow heads, [D]). Note the extensive co-localization of the RE stain with the GFP (arrows).
See also Figure S7. Scale bar, 100 μm. See also related Table S2.
Recapitulation of LII MEC Specificity of MEC13-53 Using a BBB-Crossing rAAV Serotype(A and B) (A) Representative image of the GFP + neurons in horizontal brain section from a mouse injected with 1012 particles of MEC13-53 rAAV PHP, intravenously into tail vein. The boxes in (A) are zoomed in (B).(C) MEC13-53 transgene expression in same regions as in (B) are in the MEC13-53 tTA X tetO-TVAG transgenic cross.(D) Sections of MEC13-53 rAAV PHP injected brain counterstained with anti-RE antibody. Insets show anti-GFP (top), Reelin (middle), and overlay (bottom) of box in main panel. Label is throughout the layers of EC and sparsely in other regions (arrow heads, [D]). Note the extensive co-localization of the RE stain with the GFP (arrows).See also Figure S7. Scale bar, 100 μm. See also related Table S2.
EDGE rAAVs Recapitulate the Expression Pattern of Their Respective Transgenic Lines
To examine whether this is a general strategy, we created EDGE rAAVs with several other enhancers with known specificity (Blankvoort et al., 2018). Although not all enhancers that worked as transgenic lines worked in rAAVs, roughly half (Figure 4, left column) did indeed appear to recapitulate the specificity (or relative lack thereof, 4A, B) of the corresponding EDGE lines (Figure 4, right column). The MEC13-104 rAAV (Figure 4A) recapitulates the relatively sparse labeling of a subset of LIII neurons (arrows) seen in the MEC13-104 line (Figure 4B), whereas the converse is true for the mainly LIII-specific LEC13-8 (compare 4C with 4D) line. Thus, the relative densities of the layer-specific label appear to be enhancer specific, suggesting that the minority of cells that strongly express outside of their primary layer may not be “noise.” Ongoing experiments explore the functional distinctions between the cells labeled by the various enhancers, which may label distinct subsets of what has been considered a single neuronal cell type, e.g., stellate cells.
Figure 4
EDGE rAAVs Recapitulate the Distinct Layer-Specific Expression Patterns Seen in EDGE Transgenic Mice
Comparison of expression patterns obtained by injection of EDGE rAAVs (left column) with those seen in transgenic crosses made with the same enhancers (right column). Wild-type mice were injected with 400 nL of EDGE rAAVs (A) MEC13-104 and (C) LEC13-8. Transgene expression in the corresponding EDGE transgenic crosses ([B], MEC13-104 tTA X tetO-TVAG) and ([D], LEC13-8 tTA X tetO-HM3) visualized by ISH on horizontal sections using the respective transgene probes. The sparse expression of the transgene in minor layers is indicated by arrows both in EDGE transgenics and viruses. Scale bar, 100 μm; all sections are horizontal, and all rAAV figures underwent the same image processing. See also related Table S1 and Data S1.
EDGE rAAVs Recapitulate the Distinct Layer-Specific Expression Patterns Seen in EDGE TransgenicMiceComparison of expression patterns obtained by injection of EDGE rAAVs (left column) with those seen in transgenic crosses made with the same enhancers (right column). Wild-type mice were injected with 400 nL of EDGE rAAVs (A) MEC13-104 and (C) LEC13-8. Transgene expression in the corresponding EDGE transgenic crosses ([B], MEC13-104 tTA X tetO-TVAG) and ([D], LEC13-8 tTA X tetO-HM3) visualized by ISH on horizontal sections using the respective transgene probes. The sparse expression of the transgene in minor layers is indicated by arrows both in EDGE transgenics and viruses. Scale bar, 100 μm; all sections are horizontal, and all rAAV figures underwent the same image processing. See also related Table S1 and Data S1.
Discussion
Our prior work showed that identification of cis-regulatory elements uniquely active in finely dissected cortical subregions allows one to generate genetic tools specific to cells in that subregion, an approach we call EDGE (Blankvoort et al., 2018). Here we show that one can use the same approach to make rAAVs with similar specificity in both mouse and rat, provided the vector and minimal promoter's innate transcriptional activity is minimized. This clearly cross-validates the initial identification of enhancers in our prior work (Blankvoort et al., 2018): although transgenic lines might show highly specific expression patterns purely due to insertional effects (although not the same pattern in multiple founders, as we saw), rAAVs typically do not insert into the genome (Mccarty et al., 2004), so cannot show such effects. In other words, although the precise functional significance of the enhancers presented here remains unknown, they clearly are “true” enhancers, reflecting some genetic subgroup of excitatory neurons in the entorhinal cortex of wild-type mice and rats. Taken together, these data lead to two very interesting conclusions: (1) given that the numbers of enhancers may run into the millions (as opposed to ∼44,000 promoters) (ENCODE Project Consortium, 2012), they may provide access to the ever-growing number of neuronal cell types than promoters, which may be far greater than generally assumed (Zeisel et al., 2015, Cembrowski et al., 2016, Tasic et al., 2018, Saunders et al., 2018); and (2) although we do not do so here, one could conceivably take this approach toward generating neuronal subtype-specific transgene expression in species other than the traditional genetic models of mouse, zebrafish, fly, and worm, because one can do the required epigenomic analyses on any tissue sample.There has been a lot of effort over the years toward making cell-type-specific viral vectors, but even in those cases when a minimal native promoter is useful (i.e., when a single marker defines the cells, e.g., TH-AAV [Gompf et al., 2015] and CaMKII-AAV [Nathanson et al., 2009]) the AAVs are not fully restricted to cells expressing the gene. We have previously shown that using single, uniquely active enhancers can lead to far greater specificity than that of native parent promoters (Blankvoort et al., 2018), at least in transgenesis. That enhancers drive expression similarly in both transgenic lines and viruses is not a particularly surprising result. It has been known for decades that enhancers drive cell-specific expression (Grosveld et al., 1987, Noonan and Mccallion, 2010, Shen et al., 2016) in a variety of species. Enhancers for the six homeobox genes related to the fly distal-less gene (Cohen and Jurgens, 1989) (Dll in fly, Dlx in vertebrates) have been shown to play a crucial role in morphogenesis in many species (Anderson et al., 1997, Ghanem et al., 2003, Miyoshi et al., 2010, Panganiban and Rubenstein, 2002, Zerucha et al., 2000). One such enhancer in the Dlx 5/6 gene cluster has been shown to be critical to the development of interneurons in particular (Stenman et al., 2003), and a recent paper (Dimidschstein et al., 2016) used this enhancer element in a viral vector to obtain interneuron-specific expression in a variety of species, nicely showing that enhancers can be used to drive expression in viral vectors. However, as is true for most genetically defined enhancers active early in development, Dlx5/6 drives expression across broad classes of neurons (e.g., interneurons in general) throughout the brain, rather than to particular interneuronal subclasses and/or subregions.More recently, several groups have begun to incorporate cis-regulatory elements into their strategies for creating viral vectors specific to neuronal subtypes. Such efforts are likely furthest along in the retina, where Juttner and colleagues (2019) created a broad rAAV resource targeting subtypes of retinal neurons using strategies based on genes of interest (GOIs) identified in a priori transcriptomal analysis (Siegert et al., 2012) and epigenetic analysis (Hartl et al., 2017) of known retinal cell types. Although most of these constructs are simply the minimal promoters of the GOIs, some also are based on the local epigenetic landscape, using strategies such as conservation, methylation patterns, and transcription factor binding sites to identify likely cis-elements for GOIs. Although the results in retina can be quite impressive, little is known how specific such vectors would be in the rest of the brain. As for the brain, Hrvatin et al., 2019 recently published an interesting screening strategy called PESCA (Paralleled Enhancer Single Cell Assay), in which multiple rAAVs containing barcoded putative enhancers (they use the term Gene Regulatory Elements, or GREs) are screened via single-cell transcriptomics (scRNAseq) rather than the more traditional one-at-a-time anatomical techniques shown here. Although scRNAseq does not always reflect actual viral expression, this technique nevertheless promises to greatly increase the throughput involved in first-pass screening of rAAVs. In a very interesting study, Graybuck and colleagues compare scRNAseq data to the epigenetic single-cell Assay for Transposase-Accessible Chromatin with Sequencing (scATACseq) data from layer-specific transgenic mice. Hits that co-register in both the transcriptomic and epigenetic clusters are then cloned into PHP.B Cre-rAAVs and systemically (retro-orbitally) injected into a Cre-reporter mouse for anatomical characterization.The overwhelming similarity of these various approaches is the idea that individual cis-regulatory elements may be more specific than promoters. Each strategy has two stages: identifying likely cis-regulatory elements and then making and screening the resulting rAAVs. The major difference clearly comes at the identification stage: each of these other GRE (as opposed to promoter) -based approaches has been based on a priori knowledge of the transcriptomics of whatever cell type one is looking for, often even taking advantage of transgenic animals, whereas EDGE simply looks for regionally specific chromatin marks in reproducibly dissected bulk tissue. The advantage of the former is resolution: by a “deep dive” into subtypes of what we had originally thought were cell types, one both gets at the scale of neuronal diversity and immediately puts the cell types in context, whereas with pure differential screens of bulk tissue such as EDGE you really do not know what cell types you will get, you just know that they are more or less specific to your tissue of interest. However, the flexibility (one simply needs ChIP of an ROI), ease of doing EDGE in other species, and ability to discover truly new cell types counterbalance this disadvantage. A more purely technical difference is between bulk ChIPseq and ATACseq. Although the latter can be done with much less tissue (even single cells), the former's use of particular histone marks may provide greater specificity for active enhancers rather than other forms of open chromatin. At the screening level, systemic viral injections (Graybuck, 2019) with AAV serotypes that cross the blood-brain barrier clearly give you the best idea of where a particular enhancer can express throughout the brain. We regret that we are as of yet unable to obtain permission to perform retro-orbital AAV injections from our local regulators, so our systemic injections were with a less effective technique (tail vein), lowering the effective titer. If PESCA (Hrvatin et al., 2019) can reliably be done on bulk tissue, however, it may end up as a better screen for our purposes. All in all, there are advantages to each approach that make them largely complementary, suggesting that combinations of these techniques and comparisons between the resulting datasets (ChIP versus ATAC, bulk versus single cell) may well end up being the best overall approach.Thus, the most important aspect of these and other papers is not that enhancers can work in viral vectors, it is illustrating the promise of applying modern genomic techniques to the study of the precise neural circuitry of the vertebrate brain. The striking diversity of enhancers found in these tiny subregions of cortex (numbers comparable with those found for entire organs) may indicate a similar diversity of neuronal cell types in the brain. However, the relationship between enhancers and cell types remains unclear. Indeed, the expression patterns we obtain are arguably more specific than our current understanding of neuronal cell type (Luo et al., 2008, Zeng and Sanes, 2017). For instance, stellate cells are a generally accepted excitatory neuronal cell type of the medial entorhinal cortex (Rowland et al., 2016, Varga et al., 2010, Witter et al., 2017). However, we show that distinct enhancers drive expression in EC LII stellate cells to different degrees in both transgenics and rAAVs. The question becomes whether these enhancer-driven expression patterns reflect functionally distinct stellate cells, or states of stellate cells, or just random subsets of the same indivisible cell type. In the specific case of stellate cells, a recent paper used optogenetic tagging to show that stellate cells of the MEC exhibit a variety of quite distinct receptive field properties (i.e., they can be grid cells or spatial cells or border cells), suggesting that there are many functional subtypes of stellate cells (Rowland et al., 2018). More generally, the relationship between differential enhancer usage and neuronal cell types is a highly non-trivial question, not least because there is not even complete agreement even as to how to define neuronal cell types (although there are notable exceptions) (Cembrowski et al., 2016, Tasic et al., 2018, Tremblay et al., 2016), let alone how many there are. There are several other interesting explanations for differential enhancer usage beyond cell type; for instance, it could dictate distinct states of a single cell type. In support of this, neural activity drastically changes the chromatin landscape of the brain, including which enhancers are active (Gallegos et al., 2018, Malik et al., 2014). It will likely take years of anatomical, molecular, and physiological characterization of these tools to disentangle such questions, so for our current purposes the most important consideration is that these enhancer-based molecular genetic tools remain true to type, as appears to largely be the case, comparing the virus to the transgenic.It should be noted, however, that specificity is almost never absolute, especially with viral vectors. Although we obtain neuronal subtype-specific results with large injections into the entorhinal cortex (Figures 2 and S4), it is likely that any cell type in other brain regions that express the transcription factor(s) appropriate for a particular enhancer would be labeled as well, as can be seen with the systemic injections shown in Figure 3. Thus, we do not claim that the rAAVs shown here are necessarily 100% regionally specific; indeed, it is hard to imagine that a particular enhancer is only used once in development. Rather, we demonstrate clear cell-type specificity when the MEC13-53 rAAV is injected into a particular brain region, which is nevertheless good enough for the study of neural circuitry. Moreover, many more cells are infected than show strong GFP label, and there is a baseline level of transcription from other elements in the viral construct (i.e., the minimal promoter and the ITRs). This implies that superfection of enough rAAVs could lead to discernible nonspecific transgene expression in any cell regardless of the promoter, something that is shown most clearly by making viruses containing no exogenous promoter whatsoever (Figure 1C). Viral expression is thus not all-or-nothing, but the difference between background and enhancer-driven expression levels can be quite marked (Figure S1). This background expression inherent to rAAVs can be quite problematic when a little bit of expression can have a large effect. This is true when expressing enzymes such as recombinases or when complementing replication-competent viruses (e.g., pseudotyped ΔG-rabies [Weible et al., 2010]) but is likely not an issue with transgenes whose effects vary roughly linearly with their expression levels, such as the chemogenetic (Sternson and Roth, 2014) and/or optogenetic tools (Boyden et al., 2005) used to study neural circuits.Thus, identification of the active enhancers of a mere four cortical subregions of the mouse brain has led to a variety of transgenic, and now viral tools for circuit analysis that appear to work across species, at least in rodents. Since, in principle, one can do this on any reasonably well-annotated genome, one could conceivably develop tools for anatomically specific “circuit-breaking” tools in any species, even our own. Thus, not only will circuit-specific tools greatly facilitate our understanding of normal and pathological brain function, but they could also in time possibly provide circuit-specific therapeutic avenues. For example, it has been known for decades that preclinical stages of Alzheimer's disease (AD) are characterized by neuronal loss and accumulation of neurofibrillary tangles in the superficial layers of trans-entorhinal cortex (Braak and Braak, 1991), a region roughly equivalent to rodent MEC layer II. In addition, intracellular amyloid-β is found specifically in MEC layer II RE + neurons in humanAD pathology and rodent disease models (Kobro-Flatmoen et al., 2016). Given the emerging consensus that AD may progress trans-synaptically (De Calignon et al., 2012, Spires-Jones and Hyman, 2014), it is conceivable that one could use something like a MEC13-53 rAAV to deliver therapeutic agents directly to the presumed pre-α cells, thereby stopping AD before it starts. More generally, it is possible that the reason that many neurological and neuropsychiatric disorders are resistant to drug therapy is that they are imbalances in particular neural circuits, not diseases of the entire brain. A drug having tropism for multiple circuits (as most do) would then by definition produce unwanted side effects: it may do the right thing in the right circuit, but it does the wrong thing to normal circuits. Results like those presented here allow the hope that investigators may one day be able to design interventions with the specificity required to treat the complex diseases of the brain.
Limitations of the Study
Although we think that we have made a substantive contribution toward the generation of circuit-specific tools that could be used outside of traditional genetic models, we freely acknowledge the limitations of our data. Although it is indeed true that active enhancers can be identified in any tissue sample of reasonable size from any species and used to make EDGE-rAAVs in ways similar to that presented here, we have only showed the same specificity for stellate cells in two rodent species—larger animals such as primates pose significant challenges with viral vectors. In addition, although we can see remarkable cellular specificity when EDGE rAAVs are injected into the region they were designed for, systemic administration suggests that the enhancer may also express in other cell types if injected in other regions. Regardless, we feel that these are quite useful tools for the analysis of neural circuits.
Methods
All methods can be found in the accompanying Transparent Methods supplemental file.
Authors: Noël Ghanem; Olga Jarinova; Angel Amores; Qiaoming Long; Gary Hatch; Byung Keon Park; John L R Rubenstein; Marc Ekker Journal: Genome Res Date: 2003-04 Impact factor: 9.043
Authors: Arpiar Saunders; Evan Z Macosko; Alec Wysoker; Melissa Goldman; Fenna M Krienen; Heather de Rivera; Elizabeth Bien; Matthew Baum; Laura Bortolin; Shuyu Wang; Aleksandrina Goeva; James Nemesh; Nolan Kamitaki; Sara Brumbaugh; David Kulp; Steven A McCarroll Journal: Cell Date: 2018-08-09 Impact factor: 41.582
Authors: Sinisa Hrvatin; Christopher P Tzeng; M Aurel Nagy; Hume Stroud; Charalampia Koutsioumpa; Oren F Wilcox; Elena G Assad; Jonathan Green; Christopher D Harvey; Eric C Griffith; Michael E Greenberg Journal: Elife Date: 2019-09-23 Impact factor: 8.140
Authors: Alyssa J Lawler; Easwaran Ramamurthy; Ashley R Brown; Naomi Shin; Yeonju Kim; Noelle Toong; Irene M Kaplow; Morgan Wirthlin; Xiaoyu Zhang; BaDoi N Phan; Grant A Fox; Kirsten Wade; Jing He; Bilge Esin Ozturk; Leah C Byrne; William R Stauffer; Kenneth N Fish; Andreas R Pfenning Journal: Elife Date: 2022-05-16 Impact factor: 8.713
Authors: R Irene Jacobsen; Rajeevkumar R Nair; Horst A Obenhaus; Flavio Donato; Torstein Slettmoen; May-Britt Moser; Edvard I Moser Journal: Cell Rep Methods Date: 2022-05-23
Authors: Lucas T Graybuck; Tanya L Daigle; Adriana E Sedeño-Cortés; Miranda Walker; Brian Kalmbach; Garreck H Lenz; Elyse Morin; Thuc Nghi Nguyen; Emma Garren; Jacqueline L Bendrick; Tae Kyung Kim; Thomas Zhou; Marty Mortrud; Shenqin Yao; La' Akea Siverts; Rachael Larsen; Bryan B Gore; Eric R Szelenyi; Cameron Trader; Pooja Balaram; Cindy T J van Velthoven; Megan Chiang; John K Mich; Nick Dee; Jeff Goldy; Ali H Cetin; Kimberly Smith; Sharon W Way; Luke Esposito; Zizhen Yao; Viviana Gradinaru; Susan M Sunkin; Ed Lein; Boaz P Levi; Jonathan T Ting; Hongkui Zeng; Bosiljka Tasic Journal: Neuron Date: 2021-03-30 Impact factor: 17.173
Authors: John K Mich; Lucas T Graybuck; Erik E Hess; Joseph T Mahoney; Yoshiko Kojima; Yi Ding; Saroja Somasundaram; Jeremy A Miller; Brian E Kalmbach; Cristina Radaelli; Bryan B Gore; Natalie Weed; Victoria Omstead; Yemeserach Bishaw; Nadiya V Shapovalova; Refugio A Martinez; Olivia Fong; Shenqin Yao; Marty Mortrud; Peter Chong; Luke Loftus; Darren Bertagnolli; Jeff Goldy; Tamara Casper; Nick Dee; Ximena Opitz-Araya; Ali Cetin; Kimberly A Smith; Ryder P Gwinn; Charles Cobbs; Andrew L Ko; Jeffrey G Ojemann; C Dirk Keene; Daniel L Silbergeld; Susan M Sunkin; Viviana Gradinaru; Gregory D Horwitz; Hongkui Zeng; Bosiljka Tasic; Ed S Lein; Jonathan T Ting; Boaz P Levi Journal: Cell Rep Date: 2021-03-30 Impact factor: 9.423
Authors: Adriana Galvan; Terri L Petkau; Austin M Hill; Andrea J Korecki; Ge Lu; Diane Choi; Kazi Rahman; Elizabeth M Simpson; Blair R Leavitt; Yoland Smith Journal: Hum Gene Ther Date: 2021-06-01 Impact factor: 4.793
Authors: Anna N Rubin; Ruchi Malik; Kathleen K A Cho; Kenneth J Lim; Susan Lindtner; Sarah E Robinson Schwartz; Daniel Vogt; Vikaas S Sohal; John L R Rubenstein Journal: eNeuro Date: 2020-12-17
Authors: Anastasia A Borodinova; Pavel M Balaban; Ilya B Bezprozvanny; Alla B Salmina; Olga L Vlasova Journal: Cells Date: 2021-06-25 Impact factor: 6.600