| Literature DB >> 28120936 |
Jessie C Jacobsen1, Serkan Erdin2,3, Colby Chiang2,4, Carrie Hanscom2, Renee R Handley1, Douglas D Barker2, Alex Stortchevoi2, Ian Blumenthal2, Suzanne J Reid1, Russell G Snell1, Marcy E MacDonald2,3,5, A Jennifer Morton6, Carl Ernst7, James F Gusella2,3,8, Michael E Talkowski2,3,5,9.
Abstract
Integration of exogenous DNA into a host genome represents an important route to generate animal and cellular models for exploration into human disease and therapeutic development. In most models, little is known concerning structural integrity of the transgene, precise site of integration, or its impact on the host genome. We previously used whole-genome and targeted sequencing approaches to reconstruct transgene structure and integration sites in models of Huntington's disease, revealing complex structural rearrangements that can result from transgenesis. Here, we demonstrate in the R6/2 mouse, a widely used Huntington's disease model, that integration of a rearranged transgene with coincident deletion of 5,444 bp of host genome within the gene Gm12695 has striking molecular consequences. Gm12695, the function of which is unknown, is normally expressed at negligible levels in mouse brain, but transgene integration has resulted in cortical expression of a partial fragment (exons 8-11) 3' to the transgene integration site in R6/2. This transcript shows significant expression among the extensive network of differentially expressed genes associated with this model, including synaptic transmission, cell signalling and transcription. These data illustrate the value of sequence-level resolution of transgene insertions and transcription analysis to inform phenotypic characterization of transgenic models utilized in therapeutic research.Entities:
Mesh:
Year: 2017 PMID: 28120936 PMCID: PMC5264158 DOI: 10.1038/srep41120
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Transgene rearrangement and insertion into the mouse genome.
(a) The original structure of the R6 transgene (top schematic, drawn to scale) and its extensive resultant rearrangement following integration into chromosome 4 of the R6/2 mouse (bottom schematic) are shown. The R6/2 integrated transgene contained fragments derived from at least three copies of the original R6 transgene, including complex inversions as well as head-to-tail and tail-to-head concatenations. Additionally, a 168 bp fragment of bacterial sequence was integrated upstream of the transgene HTT 5′ genomic sequence2. (b) Integration of a 1.9 kb HTT transgene fragment into intron 7 of mouse locus Gm12695 and concomitant 5.4 kb deletion of intron 7 at the integration site (demarcated by dotted lines). PhyloP38 analysis of mammalian conservation spanning the gene is provided, revealing the deleted intronic segment to be among the most conserved regions in the gene, including its coding sequence.
Figure 2Dysregulation of Gm12695 expression in R6/2 brain.
Expression of Gm12695 mRNA sequences in (a) cortex, (b) hippocampus and (c) striatum of four independent R6/2 transgenic mice expressing either 52, 111, 404 or 714 CAG repeats (which display differing phenotypic severity), and four age and sex-matched littermates that do not carry the transgene, was measured using quantitative PCR. All brain regions in transgenic R6/2 mice showed dramatically increased expression of Gm12695 sequences above low or negligible expression levels in wild-type. Data are represented as mean ± STDEV.
Figure 3Identification of a partial Gm12695 transcript spliced downstream of a 3′ segment of R6/2 transgene.
Integrated Genome Viewer display of combined RNA sequencing data from all three R6/2 libraries. The Sashimi plot depicts splice junctions as arcs between the R6/2 transgene and exon 8, and between other exons of the Gm12695 transcript on the sense strand (reverse strand). Numbers on the arcs illustrate the reads covering splice junctions. Maximum read depth coverage is 89. The bottom track illustrates a novel transcript predicted by Trinity (predicted transcript) as a result of transgene integration, the known Gm12695 transcript and the R6/2 transgene at the integration site. The chromosomal coordinates are derived from Supplementary Table S1.