Literature DB >> 25158091

Lentivector integration sites in ependymal cells from a model of metachromatic leukodystrophy: non-B DNA as a new factor influencing integration.

Robert G McAllister1, Jiahui Liu2, Matthew W Woods1, Sean K Tom1, C Anthony Rupar3, Stephen D Barr1.   

Abstract

The blood-brain barrier controls the passage of molecules from the blood into the central nervous system (CNS) and is a major challenge for treatment of neurological diseases. Metachromatic leukodystrophy is a neurodegenerative lysosomal storage disease caused by loss of arylsulfatase A (ARSA) activity. Gene therapy via intraventricular injection of a lentiviral vector is a potential approach to rapidly and permanently deliver therapeutic levels of ARSA to the CNS. We present the distribution of integration sites of a lentiviral vector encoding human ARSA (LV-ARSA) in murine brain choroid plexus and ependymal cells, administered via a single intracranial injection into the CNS. LV-ARSA did not exhibit a strong preference for integration in or near actively transcribed genes, but exhibited a strong preference for integration in or near satellite DNA. We identified several genomic hotspots for LV-ARSA integration and identified a consensus target site sequence characterized by two G-quadruplex-forming motifs flanking the integration site. In addition, our analysis identified several other non-B DNA motifs as new factors that potentially influence lentivirus integration, including human immunodeficiency virus type-1 in human cells. Together, our data demonstrate a clinically favorable integration site profile in the murine brain and identify non-B DNA as a potential new host factor that influences lentiviral integration in murine and human cells.

Entities:  

Year:  2014        PMID: 25158091      PMCID: PMC4221599          DOI: 10.1038/mtna.2014.39

Source DB:  PubMed          Journal:  Mol Ther Nucleic Acids        ISSN: 2162-2531            Impact factor:   10.183


Introduction

The blood–brain barrier controls the passage of molecules from the blood into the central nervous system (CNS) and is a major challenge for the treatment of neurological diseases. Metachromatic leukodystrophy (MLD) is an autosomal recessively inherited neurodegenerative lysosomal storage disease caused by the loss of arylsulfatase A (ARSA) activity. ARSA is required to catalyze the first step in the degradation pathway of galactosyl-3-sulfate ceramide (sulfatide), a major sphingolipid of myelin. The loss of ARSA activity results in the accumulation of sulfatide in glial cells and neurons, followed by severe demyelination and neurodegeneration.[1,2] Visceral manifestations of some lysosomal storage disease may be treated by intravenous enzyme replacement therapy, but the blood–brain barrier prevents access to the CNS in MLD and other lysosomal storage diseases with neurological involvement. The efficacy of brain gene therapy to correct ARSA deficiency has been demonstrated in mice and nonhuman primates using direct injection of serotype 5 recombinant adeno-associated vector in the brain.[3,4] However, the requirement of multiple injections in different regions of the brain and low transduction efficiencies warrant the development of improved vector delivery approaches. Recently, restoration of ARSA deficiency has been achieved in mice by lentiviral transduction of hematopoietic stem cells (HSCs) ex vivo, followed by re-infusion of the engineered HSCs.[5,6,7] A similar HSC gene therapy approach was shown to prevent progression to neurodegenerative disease in three presymptomatic patients who were predicted to experience early-onset MLD.[8] The most common presentation of MLD is the rapidly progressive late infantile disease. Most patients are significantly symptomatic by the time of diagnosis and the relatively slow replacement of brain microglia from bone-marrow-derived cells may pose a significant challenge in halting the rapid progression of the disease. In mammals, the choroid plexus is a structure in the ventricles of the brain consisting of modified ependymal cells. The ependymal cell lining (ependyma) is a single-layered, cuboidal-columnar, ciliated epithelium that normally lines the cerebral ventricles and central canal of the spinal cord. The ependymocyte is a fully differentiated cell that remains in a position adjacent to the cerebrospinal fluid (CSF). The coordinated beating of the ependymal cilia creates a current of CSF along the walls of the lateral ventricle that optimizes the dispersion of neural messengers in the CSF.[9,10,11,12] The CSF also provides access to several different regions of the brain via the Virchow–Robbins spaces. Lentiviral transduction of ependymal cells is therefore an attractive approach to rapidly and permanently deliver therapeutic levels of proteins or enzymes to a broad area of the CNS and is less invasive than multiple injections throughout the brain.[13] Among the advantages of using lentiviral vectors for delivery of therapeutic proteins are their high transduction efficiencies, permanence, and ability to transduce fully differentiated cells. The potential for adverse events such as insertional activation of oncogenes or the disruption of gene coding sequences necessitates analysis of lentiviral vector integration sites in the genome of cells targeted by clinical lentiviral gene therapy vectors.[14,15] Here we present an analysis of the distribution of integration sites of a lentiviral vector encoding the human ARSA gene (LV-ARSA) in murine brain ependymal cells, administered via a single intracranial injection into the right lateral ventricle of ARSA−/− and wild-type (WT) mice.

Results

Intraventricular administration of LV-ARSA

LV-ARSA was injected into the right lateral ventricle of 4 ARSA−/− and 4 WT C57Bl6 4-month-old mice of mixed genders (,). Seven days after administration, the brain tissue surrounding the lateral ventricle containing the choroid plexus was dissected from each mouse (). In a separate experiment to visualize transduced cells, ARSA was replaced with green fluorescent protein (GFP) to generate LV-GFP (). LV-GFP was injected into the right lateral ventricle, and immunofluorescence microscopy analysis of tissue sections showed that LV-GFP transduced cells in the choroid plexus and the ependymal cells that lined the ventricle. No substantial GFP fluorescence was observed in cells of the surrounding brain tissue, suggesting that the vector did not pass through or between the ependymal cell layer (). The distribution of GFP positive cells obtained 7 days after injection did not differ 7 months after injection (data not shown). To characterize vector integration sites, genomic DNA was isolated from the dissected tissue and subjected to restriction enzyme digestions and linker-mediated amplification using primers as previously described (see Materials and Methods section and ref. 16). Following amplification, the purified DNA samples were processed using the Nextera XT DNA Sample Preparation kit, which uses an engineered transposome to simultaneously fragment and tag the input DNA with unique adapter and index (“barcodes”) sequences on both ends of the DNA. A limited-cycle PCR reaction was performed to amplify the insert DNA, which was then sequenced using Illumina MiSeq using 2 × 150 base pair chemistry. To help control for potential fragment recovery bias introduced from restriction enzyme digestion, a matched set of random control sites was generated taking into account the distances between experimental sites and the nearest restriction enzyme sequence used in the digestion, as previously described.[16] All samples were independently prepared and sequenced twice by different people with 13 months between preparations.

Integration is enriched in genes and satellite DNA

From two independent sequencing datasets, we obtained a total of 77 million reads across all 8 mice and 434 unique sites (Supplementary Table S1). To analyze integration events in particular genomic features, unique integration sites from all mice were pooled. shows a graphical representation of the frequency of unique integration events with respect to several genomic features (see also Supplementary Table S2). For each genomic feature, favoring or disfavoring of integration into the feature compared with random is represented by green or magenta coloring, respectively. In all, 43.6% of the LV-ARSA integration sites were found in the protein coding region of UCSC and RefSeq genes. These frequencies were slightly enriched compared with the matched random controls by 1.17- and 1.20-fold, respectively (P < 0.008, χ2 test). The single most prevalent feature of mammalian genomes is their repetitive sequences. Approximately 39% of the murine genome consists of interspersed repeats (e.g., LINEs, SINEs, LTR elements, and satellite DNA), compared with ~46% in the human genome. Analysis of LV-ARSA integrations in interspersed repeats revealed that 0.69% of integrants were located in satellite DNA compared with 0.07% for the matched random control sites (10.5-fold increase) (P < 0.0001, χ2 test) (). In contrast, integration sites were not significantly enriched in LINEs, SINEs, endogenous retroviral elements (ERVs), CpG islands, or low-complexity repeats. For comparison, LV-ARSA infection of human 293T and HeLa cells showed that LV-ARSA favored integration into genes, LINEs, SINEs, ERVs, low-complexity repeats, and simple repeats (Supplementary Tables S3 and S4). Integration into satellite DNA was favored in both 293T and HeLa cells; however, significance was only achieved with HeLa cells (P < 0.0001, χ2 test).

LV-ARSA does not favor integration in oncogenes or highly transcribed genes

Insertional oncogenesis by γ-retroviral gene therapy vectors has occurred in previous human gene therapy trials, contributing to the development of leukemia.[14,17,18,19] A major contributing factor to this adverse event is the preferential integration of γ-retroviral vectors at promoters, transcription start sites, and near genes controlling cell growth and proliferation.[20,21,22] Lentiviral vectors have not been previously found to favor such regions and are less likely to contribute to insertional oncogenesis. Consistent with previous findings with lentiviral vectors, LV-ARSA disfavored integration in transcription start sites ( and Supplementary Table S2). Furthermore, none of the LV-ARSA sites were located in, or within 50 kb of, known murine cancer-related genes (the “Cancer Gene List” from http://www.bushmanlab.org/links/genelists).[23] To determine if there was an influence of gene expression on LV-ARSA integration into genes, we examined two published gene expression profiling datasets of murine choroid plexus tissue (GSE27630 and GSE3594).[24,25] Genes were categorized into four different expression bins ranging from low- to high-level gene expression. Genes targeted for integration by LV-ARSA were distributed into the same bins based on their expression levels (,). The distribution of genes targeted by LV-ARSA did not differ significantly from random distribution in either dataset, except for bin 1 (low-level expression) of the GSE3594 dataset where LV-ARSA sites were depleted compared with random (P=0.008, Fisher's exact test). Together, these data indicate that high-level gene expression did not influence LV-ARSA integration site targeting in murine ependymal cells. In contrast, integration of LV-ARSA in human 293T cells showed preference for highly expressed genes (Supplementary Figure S1), consistent with previous observations with HIV-1 integration in human cells.[26]

Regional hotspots for LV-ARSA integration

Regional genomic “hotspots” for retroviral vector integration have been reported in in vitro and in vivo datasets, including in HIV-1-infected individuals.[21,27,28,29] To determine if clustering of integration sites was evident in the dataset, the distribution of lengths of DNA segments between integration sites was compared with the distribution expected with random integration (). Significantly more short intersegment distances were observed with the LV-ARSA sites compared with the random control sites, indicative of clustering (P < 0.0001, Fisher's exact test). Further inspection of the integration sites revealed that 66 genomic positions hosted two or more independent integration events, representing 42.9% of all integration sites (Supplementary Table S5). Thirty-seven of these 66 genomic regions shared the same integration site in two or more mice. Nineteen regions hosted 3 or more integration sites within 20 bases of each other. The genomic region (chr10: 9832769-9832909) located in an intron of the stxbp5 gene was highly favored for integration. Eleven of the 434 (2.5%) independent integration sites were located in this region within 140 bases of each other. Closer inspection of the target DNA sequence for these 11 integration sites revealed that all 11 sites were located in or within 100 bases of a direct repeat sequence located at position chr10:9832707-9832930. For comparison, analysis of 6,294 LV-ARSA integration sites in both 293T and HeLa cells showed that significantly more short intersegment distances were observed with the LV-ARSA sites compared with the random control sites, indicative of clustering (P < 0.0001, χ2 test) (Supplementary Figure S2). From these integration sites, we randomly selected 434 sites to compare the number of regions hosting multiple integration sites with those obtained in the mouse dataset. In contrast with the mouse dataset, no identical integration sites were detected in the human dataset and only 4 integration sites were located within 20 bases of each other (Supplementary Table S6).

LV-ARSA integration target site selection is strongly influenced by flanking G-quadruplex-forming (G-quad) motifs

To investigate if the primary sequence at the integration target sites influenced target site selection, we extracted sequences flanking the LV-ARSA integration sites for further analysis. Forty bases upstream and downstream of each integration site (between −1 and +1) or the random control sites were aligned and the frequencies of A, C, G, and T at each position surrounding the integration sites were calculated. These values were compared with the expected value based on random control sites ( and Supplementary Table S7). As expected, the base composition at each position surrounding the matched random control sites varied little from the expected values[30] (Supplementary Table S7). The difference in frequency of each nucleotide at each position up to 40 bases upstream and downstream of the LV-ARSA integration sites was calculated (). Inspection of the bases surrounding the LV-ARSA integration sites using QGRS Mapper (http://bioinformatics.ramapo.edu/QGRS/analyze.php) revealed two G-quad motifs flanking the integration site at positions −26 to −3 and +22 to +35.[31] Of note, positions +1 to +5 comprise the duplicated target site sequence after integration of lentiviruses.

LV-ARSA integration sites are enriched in or near non-B DNA-forming motifs

G-quad structures are stable DNA secondary structures that can form from motifs containing tracts of tandem guanines. These guanines hydrogen bond in a planar arrangement, forming stacks connected by single-stranded DNA loops.[32] G-quad structures are only one of many (>10) non-B DNA-forming DNA secondary structures.[33] The Database for Integrated Annotations and Analysis of non-B DNA Forming Motifs (Non-B DB) provides the most complete list of alternative DNA structure predictions using the latest genome assemblies.[34,35] We used the Non-B DB to determine if LV-ARSA integration sites are enriched in or near G-quad motifs compared with the matched random control sites. LV-ARSA integration sites near G-quad motifs were enriched 1.28-fold compared with the matched random control sites (P < 0.0001, χ2 test) ( and Supplementary Table S8). We then asked if LV-ARSA integration sites are enriched in or near other non-B DNA-forming motifs. Analysis of integration site distributions revealed significant enrichment in short tandem repeats (1.16-fold; P < 0.0001, χ2 test) and mirror repeats (1.56-fold; P < 0.0001, χ2 test). Integration was also enriched in triplex motifs (1.36-fold), Z-DNA motifs (1.81-fold), direct repeats (1.14-fold), slipped motifs (1.77-fold), cruciform motifs (3.90-fold), and A-phased motifs (1.13-fold), although this enrichment did not reach statistical significance. These data show that LV-ARSA integration sites are highly enriched in and near non-B DNA-forming motifs. We then asked if non-B DNA-forming motifs were also favored in a previous study of integration site analyses in murine neural tissue (brain striatum and eye), SC1 cells, and HC1 cells.[36] Integration was enriched in or near several non-B DNA motifs, most notably Z-DNA motifs, direct repeats, inverted repeats, mirror repeats, slipped motifs, and cruciform motifs ( and Supplementary Table S8). Together, these data suggest that lentiviral integration is enriched in or near non-B DNA motifs in murine cells.

HIV-1 integration sites are enriched in non-B DNA-forming motifs in human cells

Next, we analyzed a variety of HIV-1 integration site datasets in diverse human cell types 293T,[37] Jurkat,[37,38] macrophage,[16] HeLa,[20,39] IMR90 (refs. 40,41), PBMC,[41] HOS,[37] H9 (ref. 20), and SupT1 (refs. 29,42) to determine if integration sites were enriched in non-B DNA-forming motifs (Supplementary Table S9). Integration in or near multiple non-B DNA-forming motifs was highly enriched in each of the datasets ( and Supplementary Table S10). Integration in or near several non-B DNA motifs was strongly favored among the different datasets and preference for specific types of non-B DNA-forming motifs varied for each cell type, possibly indicating cell-type-specific effects.

Discussion

Here we have presented an analysis of the distribution of integration sites of a lentiviral gene therapy vector (LV-ARSA) in murine brain choroid plexus and ependymal cells. LV-ARSA was stereotactically delivered into the right lateral ventricle of ARSA−/− and wild-type C57Bl6 mice and demonstrated a clinically favorable integration site profile. LV-ARSA showed no bias for integration in known murine oncogenes or near transcription start sites of genes, and did not show a bias for integration in transcriptionally active genes. LV-ARSA did show a strong bias for integration in large arrays of tandemly repeating, noncoding DNA such as satellite DNA. Integration site hotspots were identified and target site selection was strongly influenced by flanking G-quad sequence motifs. In addition, we demonstrated that integration in or near several non-B DNA-forming motifs is highly favored by LV-ARSA and HIV-1 in diverse cell types. Previous analyses of integration site distribution of HIV-1 and other lentiviral vectors revealed that transcriptionally active genes are preferred targets for integration in human cells.[26] Many of these datasets were obtained from in vitro- and ex vivo-transduced cells. In the present in vivo study, analysis of lentiviral vector integration site distribution in murine brain ependymal cells revealed little bias for integration in transcriptionally active genes. This result is consistent with a different lentiviral vector integration site dataset in murine eye and brain striatum tissues.[36] In that study, low expression levels of Psip1/LEDGF/p75 were proposed as a potential mechanism for reduced integration into genes in murine eye and brain striatum tissues. In choroid plexus tissue, Psip1/LEDGF/p75 expression level is moderate to high and therefore not likely a major factor for reduced integration into active genes in choroid plexus tissue (data not shown). The lentivector itself is not likely a factor since LV-ARSA favored integration into active genes under similar conditions in human cells, as previously established in the field. LV-ARSA exhibited a strong bias for integration in tandem repeats (e.g., satellite DNA) in murine ependymal cells. Preferred integration into satellite DNA has been previously described for microinjected tandem repeated DNA, latent proviral HIV-1, and murine eye tissue.[36,38,43] In contrast with integration in murine brain ependymal cells, lentiviral vector integration in murine brain striatum was not enriched in satellite DNA.[36] The reasons for this biased integration in satellite DNA in different regions of the brain are unknown. It is possible that the size of the datasets, vector differences, differential expression of host proteins in different regions of the brain, or other unknown factors influence integration site placement.[44,45,46] Earlier in vitro integration site assays have shown that the primary sequence immediately flanking (~5–10 bases) the integration-target DNA has minor influences on site selection, with only weak target site consensus motifs identified.[42,47,48,49,50,51] In the present in vivo study, we looked at up to 40 bases flanking integration sites and identified flanking G-quad sequence motifs as a potential factor that influences site selection by flanking integration sites. Many DNA repeat sequences can exist in at least two distinct conformations. Typically, DNA sequences adopt a right-handed B-form with Watson–Crick base pairing. However, at least 10 non-B DNA conformations exist. These conformations have contorted bond angles or unpaired nucleotides compared with the orthodox right-handed B-form. Non-B DNA conformations are in higher energy states and are believed to be facilitated at specific sequence motifs by the free energy generated from negative supercoiling, which can arise from processes such as transcription, protein binding, and other factors.[52,53,54] Several non-B DNA motifs preferentially act as the recipient of genetic information during gene conversion events. For example, repeating units of the TG dinucleotide is characteristic of the non-B DNA-forming Z-DNA motif.[55] Substrates containing Z-DNA motifs preferentially act as the recipient of genetic information during gene conversion events.[56] Moreover, Z-DNA motifs have also been shown to stimulate homologous recombination up to 20-fold in human cells. Another example is G-quad motifs, which can promote recombination and possibly influence genomic stability and cellular processes such as transcription (reviewed in ref. 32). It is possible that lentiviral integration complexes are recruited to genomic regions that favor genetic recombination such as non-B DNA motifs via direct interactions with the motif itself or via non-B DNA motif-binding proteins. LV-ARSA integration sites were also enriched in or near a variety of other non-B DNA-forming motifs with some integration site bias towards the type of non-B DNA-forming motif. For example, LV-ARSA exhibited a strong bias for integration into all of the non-B DNA-forming sequences except for G-quad repeats and inverted repeats, where integration was instead enriched near G-quad repeats, rather than within them. Integration site bias for certain non-B DNA-forming motifs was also observed in other published datasets with HIV-1 in various cell types, indicating that cell-type-specific factors associated with non-B DNA-forming motifs, or the genomic structures they produce, contribute to this bias. Specific host proteins that bind to specific non-B DNA-forming motifs have been identified; therefore it is conceivable that such factors could recruit preintegration complexes and/or influence integration in or near these motifs. Notably, Psip1/LEDGF/p75, which is a co-activator of general transcription and influences the location of HIV-1 integration, was recently shown to selectively bind negatively supercoiled DNA over unconstrained DNA.[37,57] It will be interesting to learn if non-B DNA plays a role in attracting Psip1/LEDGF/p75, which then recruits HIV-1 preintegration complexes. Although several of the datasets showed that integration was disfavored directly within certain non-B DNA-forming motifs, integration was significantly enriched near these motifs. Host proteins/complexes that bind non-B DNA-forming motifs may recruit lentiviral preintegration complexes, but block integration within these motifs via steric hindrance. As a result, integration near these motifs may be favored instead. Consistent with enrichment of integration sites in or near non-B DNA-forming motifs, we observed several integration site hotspots including a particularly strong integration hotspot in a direct repeat element on chromosome 10. It is unclear why this region was targeted multiple times in one mouse and only a few times in another mouse. Several additional integration sites were detected in this region but were omitted from our analysis since they fell within 10 bases of each other due to potential errors in sequencing typically seen using this sequencing platform. Therefore, it is possible that the number of sites in this region is underrepresented and requires additional characterization. Since DNA repeats can vary in abundance and length in vertebrate genomes, it is also possible that the underlying repeats of non-B DNA-forming elements are polymorphic among the mice and that a polymorphism in this direct repeat in one mouse enhanced LV-ARSA integration events in this region.[34,35,53,58] Clustering of lentiviral integration sites has been observed previously in human datasets (reviewed in ref. 26). We also showed that LV-ARSA integration sites clustered in both mouse and human cells, suggesting that clustering is not species specific. In addition, we identified LV-ARSA clustering of integration sites in the human genes CREB-binding-protein (CREBBP) and DNA (cytosine-5-)methyltransferase-1 (DNMT1), which were previously identified as integration hotspots in human cells.[20,29,41] These data also further support the fact that LV-ARSA integration clustering is not an artifact of the method used. Interestingly, the integration sites in CREBBP and DNMT are all located adjacent to non-B DNA motifs. Analyses of published integration site datasets thus far have shown that no single factor dictates lentiviral integration site targeting/placement and that multiple factors are likely involved. For example, target DNA structure, curvature, flexibility, rigidity in solution, and distortion within the nucleosome core all influence the frequency and placement of integration.[49,59] Host proteins such as Psip1/LEDGF/p75 that can tether HIV-1 preintegration complexes to target DNA also influence integration site placement.[37,60] Conversely, DNA-binding proteins can create regions that are refractory to integration via steric hindrance.[60,61,62,63] In conclusion, our data demonstrate a clinically favorable integration site profile of a lentiviral vector encoding the ARSA transgene in the murine brain, and identify non-B DNA-forming motifs as a potential new host genomic feature that influences the distribution of lentiviral integration sites.

Materials and methods

Lentivector construction. Codon-optimized cDNA of the human arylsulfatase A (ARSA) gene, or the green fluorescent protein (GFP) gene as a control, were independently cloned into a self-inactivating third-generation HIV-based vector downstream of the human EF1-alpha promoter to generate the LV-ARSA or LV-GFP vectors, respectively. LV-ARSA and LV-GFP were generated using the ViraPowerTM Lentiviral Packaging Mix (Life Technologies). 293FT cells at 80% confluency were transfected with pLV-ARSA (or pLV-GFP), pLP1, pLP2, and pLP-VSV-G transfected with TurboFectin8.0 transfection reagent (Origene). Cell supernatant was harvested after 36 hours of transfection, clarified by centrifugation at 500 × g for 10 minutes, and filtered (0.45 μm). Virus-containing medium was concentrated at 31,000 × g for 5 hours. Stereotactical brain injection and isolation of genomic DNA from ependymal cells. Animal experiments were performed in compliance with the guidelines set by the Canadian Council for Animal Care and the policies and procedures approved by the University of Western Ontario Council on Animal Care (protocol 2007-044-12). Four-month-old C57Bl6 WT (two males and one female) and C57Bl6 ARSA−/− (two males and two females) were used in this study. Mice were anesthetized with ketamine and xylazine and placed in a Kopf stereotaxic apparatus. Sterile surgical procedures were followed to expose the injection site located at AP −0.2, ML 1.0, DV 3.0 according to mouse brain stereotaxic coordinates (The Mouse Brain in Stereotactic Coordinates, Second Edition, George Paxinos and Keith B. J. Franklin, Academic Press, 2001). A burr hole was placed followed by an injection of 10 μl of LV-ARSA vector (2.5 × 109 TU/ml) over 20 minutes. The needle was left in place for 5 minutes and then slowly withdrawn. The number of LV-ARSA transducing units was approximated by titering LV-EF1a-GFP vector on HT1080 cells (generated in parallel with LV-ARSA). Seven days postinjection, the region surrounding the lateral ventricle containing the choroid plexus and ependymal cells from each murine was removed by dissection. Subsequently, genomic DNA was isolated from the ependyma using the DNeasy Blood & Tissue Kit according to the manufacturer's instructions (Qiagen). Isolation of integration sites. Genomic DNA from the dissected tissue from each mouse was restriction enzyme digested using MseI and NarI and amplified by ligation-mediated PCR, as previously described step by step.[64] After gel purification of the PCR products, the purified DNA samples were processed using the Nextera XT DNA Sample Preparation kit, which uses an engineered transposome to simultaneously fragment and tag input DNA with unique adapter and index (“barcodes”) sequences on both ends of the DNA. DNA from each murine was barcoded. A limited-cycle PCR reaction was performed to amplify the insert DNA, which was then sequenced using Illumina MiSeq using 2 × 150 bp chemistry at the London Regional Genomics Centre (Robarts Research Institute, Western University, Canada). Bioinformatic analyses. Integration site sequences were judged to be of acceptable quality if (1) the match to the genome began within 3 bp of the 5′-CA-3′ terminus of the viral DNA, (2) the match proximal to the LTR end showed an identity of at least 98%, (3) the match yielded a unique best hit using default parameters in the client-server BLAT ranking, as previously performed,[16] and (4) the integration site did not fall within 10 bases or less of another integration site (due to potential sequencing errors known to arise with this sequencing platform). Bowtie2 was used for aligning sequence reads to the genome and BedTools was used for computing distances between the sites and genomic features.[65,66] An integration site was scored as present in a transcription unit if it was mapped in DNA between the base pairs encoding the 5′ and 3′ ends of the transcribed region as specified in the various gene catalog annotations (http://www.genome.ucsc.edu/cgi-bin/hgGateway). Matched random control integration sites (28,800 in total) were generated by matching each experimentally determined site with 50 random sites in silico that were constructed to be the same number of bases from the restriction site as was the experimental site, as previously described.[16] Mapping of integration sites to non-B DNA motifs was done using the Non-B DB for species Murine 37.1 or Human 37.1 (http://nonb.abcc.ncifcrf.gov/apps/site/default).[34,35] Identification of all common clones between animals was not validated and could have resulted from cross-contamination during DNA processing. As a result, it is possible that the number of unique sites reported in this study is artificially inflated. Figure S1. Analysis of expression levels of genes in human cells targeted for integration by lentiviral vector encoding human arylsulfatase A (LV-ARSA). Figure S2. Analysis of integration site spacing on the murine genome. Integration site clustering was assessed by comparing the spacing between lentiviral vector encoding human arylsulfatase A (LV-ARSA) integration sites to the spacing between the same number of random sites. Table S1. Summary of integration site distribution in murine ependymal cells. Table S2. Summary of LV-ARSA integration site distribution in various murine genomic features. Table S3. Summary of pooled LV-ARSA and LV-GFP integration site distribution in various human genomic features. Table S4. List of pooled LV-ARSA and LV-GFP integration sites in human 293T and HeLa cells. Table S5. Murine genomic regions hosting two or more independent integration sites. Table S6. Human genomic regions in 293T and HeLa cells hosting multiple independent integration events out of 434 randomly selected sites. Table S7. Frequncy of each nucleotide up to 40 bases upstream and downstream of the LV-ARSA integration sites or matched random sites. Table S8. Distribution of integration sites in non-b DNA-forming motifs in murine cells. Table S9. Summary of the human integration site datasets used in this study. Table S10. Distribution of integration sites in non-B DNA-forming motifs in various human cells.
  65 in total

1.  A serious adverse event after successful gene therapy for X-linked severe combined immunodeficiency.

Authors:  Salima Hacein-Bey-Abina; Christof von Kalle; Manfred Schmidt; Françoise Le Deist; Nicolas Wulffraat; Elisabeth McIntyre; Isabelle Radford; Jean-Luc Villeval; Christopher C Fraser; Marina Cavazzana-Calvo; Alain Fischer
Journal:  N Engl J Med       Date:  2003-01-16       Impact factor: 91.245

2.  Weak palindromic consensus sequences are a common feature found at the integration target sites of many retroviruses.

Authors:  Xiaolin Wu; Yuan Li; Bruce Crise; Shawn M Burgess; David J Munroe
Journal:  J Virol       Date:  2005-04       Impact factor: 5.103

3.  Adult mouse brain gene expression patterns bear an embryologic imprint.

Authors:  Matthew A Zapala; Iiris Hovatta; Julie A Ellison; Lisa Wodicka; Jo A Del Rio; Richard Tennant; Wendy Tynan; Ron S Broide; Rob Helton; Barbara S Stoveken; Christopher Winrow; Daniel J Lockhart; John F Reilly; Warren G Young; Floyd E Bloom; David J Lockhart; Carrolee Barlow
Journal:  Proc Natl Acad Sci U S A       Date:  2005-07-07       Impact factor: 11.205

4.  In vitro integration of human immunodeficiency virus type 1 cDNA into targets containing protein-induced bends.

Authors:  Y C Bor; F D Bushman; L E Orgel
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

Review 5.  DNA structures, repeat expansions and human hereditary disorders.

Authors:  Sergei M Mirkin
Journal:  Curr Opin Struct Biol       Date:  2006-05-19       Impact factor: 6.809

6.  LMO2-associated clonal T cell proliferation in two patients after gene therapy for SCID-X1.

Authors:  S Hacein-Bey-Abina; C Von Kalle; M Schmidt; M P McCormack; N Wulffraat; P Leboulch; A Lim; C S Osborne; R Pawliuk; E Morillon; R Sorensen; A Forster; P Fraser; J I Cohen; G de Saint Basile; I Alexander; U Wintergerst; T Frebourg; A Aurias; D Stoppa-Lyonnet; S Romana; I Radford-Weiss; F Gross; F Valensi; E Delabesse; E Macintyre; F Sigaux; J Soulier; L E Leiva; M Wissler; C Prinz; T H Rabbitts; F Le Deist; A Fischer; M Cavazzana-Calvo
Journal:  Science       Date:  2003-10-17       Impact factor: 47.728

7.  The non-B-DNA structure of d(CA/TG)n does not differ from that of Z-DNA.

Authors:  P S Ho
Journal:  Proc Natl Acad Sci U S A       Date:  1994-09-27       Impact factor: 11.205

8.  Human immunodeficiency virus integrase directs integration to sites of severe DNA distortion within the nucleosome core.

Authors:  D Pruss; F D Bushman; A P Wolffe
Journal:  Proc Natl Acad Sci U S A       Date:  1994-06-21       Impact factor: 11.205

9.  Gene therapy of metachromatic leukodystrophy reverses neurological damage and deficits in mice.

Authors:  Alessandra Biffi; Alessia Capotondo; Stefania Fasano; Ubaldo del Carro; Sergio Marchesini; Hisaya Azuma; Maria Chiara Malaguti; Stefano Amadio; Riccardo Brambilla; Markus Grompe; Claudio Bordignon; Angelo Quattrini; Luigi Naldini
Journal:  J Clin Invest       Date:  2006-11       Impact factor: 14.808

10.  DNA conformations and their sequence preferences.

Authors:  Daniel Svozil; Jan Kalina; Marek Omelka; Bohdan Schneider
Journal:  Nucleic Acids Res       Date:  2008-05-13       Impact factor: 16.971

View more
  4 in total

1.  Metachromatic Leukodystrophy (MLD): a Pakistani Family with Novel ARSA Gene Mutation.

Authors:  Muhammad Aiman Shahzad; Saba Khaliq; Ali Amar; Saqib Mahmood
Journal:  J Mol Neurosci       Date:  2017-08-10       Impact factor: 3.444

Review 2.  Promoter considerations in the design of lentiviral vectors for use in treating lysosomal storage diseases.

Authors:  Estera Rintz; Takashi Higuchi; Hiroshi Kobayashi; Deni S Galileo; Grzegorz Wegrzyn; Shunji Tomatsu
Journal:  Mol Ther Methods Clin Dev       Date:  2021-11-24       Impact factor: 6.698

Review 3.  Mammalian Sulfatases: Biochemistry, Disease Manifestation, and Therapy.

Authors:  Ryuichi Mashima; Mahito Nakanishi
Journal:  Int J Mol Sci       Date:  2022-07-24       Impact factor: 6.208

4.  Rapid and Efficient Stable Gene Transfer to Mesenchymal Stromal Cells Using a Modified Foamy Virus Vector.

Authors:  Nathan Paul Sweeney; Cathy Regan; Jiahui Liu; Antonio Galleu; Francesco Dazzi; Dirk Lindemann; Charles Anthony Rupar; Myra Olga McClure
Journal:  Mol Ther       Date:  2016-05-02       Impact factor: 11.454

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.