Akitsu Hotta1,2. 1. Center for iPS Cell Research & Application (CiRA), Kyoto University, Japan. 2. Institute for Integrated Cell-Material Sciences (iCeMS), Kyoto University, Japan.
Abstract
Duchenne muscular dystrophy (DMD) is a severe genetic disorder caused by loss of function of the dystrophin gene on the X chromosome. Gene augmentation of dystrophin is challenging due to the large size of the dystrophin cDNA. Emerging genome editing technologies, such as TALEN and CRISPR-Cas9 systems, open a new erain the restoration of functional dystrophin and are a hallmark of bona fide gene therapy. In this review, we summarize current genome editing approaches, properties of target cell types for ex vivo gene therapy, and perspectives of in vivo gene therapy including genome editing in human zygotes. Although technical challenges, such as efficacy, accuracy, and delivery of the genome editing components, remain to be further improved, yet genome editing technologies offer a new avenue for the gene therapy of DMD.
Duchenne muscular dystrophy (DMD) is a severe genetic disorder caused by loss of function of the dystrophin gene on the X chromosome. Gene augmentation of dystrophin is challenging due to the large size of the dystrophin cDNA. Emerging genome editing technologies, such as TALEN and CRISPR-Cas9 systems, open a new erain the restoration of functional dystrophin and are a hallmark of bona fide gene therapy. In this review, we summarize current genome editing approaches, properties of target cell types for ex vivo gene therapy, and perspectives of in vivo gene therapy including genome editing in human zygotes. Although technical challenges, such as efficacy, accuracy, and delivery of the genome editing components, remain to be further improved, yet genome editing technologies offer a new avenue for the gene therapy of DMD.
Duchenne muscular dystrophy (DMD) is one of the most common and severe forms of genetic muscle disorders and is described by the progressive degeneration of proximal muscular tissue with the characteristic of calfmuscle enlargement. Muscle weakness becomes obvious around 3 to 5 years of age, resulting in compromised running, climbing, or standing (known as Gowers’ sign). Patients at infant stage show a massive elevation of creatine kinase (CK) or creatine phosphokinase (CPK) levels (>10,000 IU/L in comparison to normal levels of <200 IU/L) in the blood, which is an indication of muscle or heart damage, since CK is a muscle specific kinase mediating the production of energy storage material phosphocreatine in muscle. Between ages 7 and 12, DMDpatients gradually start to rely on wheel chairs due to difficulties in walking and a waddling gait. By their teenage years, they may require assistance in activities that involve the use of arms and legs. With time, muscle weakness affects not only proximal muscle tissues, but also cardiac and respiratory function. The development of artificial ventilators in the 1980s have helped improve life expectancy, however, cardiac or respiratory defects remain the main cause of mortality in DMDpatients, with a life expectancy of around the age of 30.
Dystrophin gene
The genetic cause of DMD was first identified by Dr. Louis M. Kunkel’s group in 1986 [1], and the identified cDNA was named “Dystrophin” [2]. The dystrophin gene spans about 2.2 Mbp on the X chromosome and consists of 79 exons for the Dp427 m isoform, whichis mainly expressed in skeletal muscle cells. Dystrophin protein has an actin-binding domain on the N’-terminus and a dystrogly can complex-binding domain on the C’-terminus, and the two are connected by a rod domain which consists of spectrin-like repeats (Fig. 1). The function of dystrophin protein is to physically anchor the cellular skeletal actin fibers and sarcolemma membrane in muscle fibers [3]. Muscle cells without functional dystrophin become vulnerable during physical muscle movements, resulting in spontaneous cellular death which induces inflammation and fibrosis [4].
Fig.1
Structure of the dystrophin gene and protein. (a) Human dystrophin gene is located on chromosome Xp 21.2, spans 2.22 Mb in size, and consists of 79 exons for the muscle isoform Dm 427 m. DMD patients suffer from functional loss of dystrophin protein, mainly caused by premature truncation due to a large deletion around exons 40 to 54. (b) Dystrophin protein translated from the Dm427 m isoform consists of 3685 amino acids and is 427 kDa. The protein can be divided into three functional domains: an actin-binding domain, rod domain, and cysteine (Cys)-rich C’-terminal domain. The actin-binding domain at the N’-terminal is important for binding to cytoskeletal actin fibers, and the Cys-rich C’-terminal domain is critical for the binding to the dystroglycan complex at the sarcolemma membrane. The center rod domain consists of several spectrin-like repeats, which form a three-helix bundle-like structure.
DMD mutation patterns
Due to the large size of the dystrophin gene, mutation patterns found in DMDpatients largely vary. Approximately 60–70% of DMDpatients harbor a large deletion of one or more exons, with many occurring with in exons 44 to 54 [5-9].Notably, a milder form of DMD, Becker muscular dystrophy (BMD), is also caused by mutations in the same dystrophin gene. Complete loss of functional dystrophin protein due to out-of-frame mutations is classified as DMD, while partial loss of functionality (such as smaller protein size) or decreased level of dystrophin protein expression due to in-frame deletions, insertions, or small amino acid substitutions are classified as BMD.BMDpatients suffer from weakened muscles and mobility in their old age, although the disease phenotype is highly heterogeneous. Depending on the type of genetic mutation, functionality of the resulting dystrophin protein ranges from barely functional (similar to DMD) to almost normal [3, 10]. Therefore, the symptoms of BMDpatients vary, and some patients have nearly normal life spans.Considering the function of dystrophin as a structural protein that connects cytoskeletal actin fibers and the sarco lemma membrane, minor amino acid substitutions or truncation at the rod domain is not critical for its function. This is a unique and important property when it comes to designing strategies on how to restore the function of dystrophinprotein for treating DMD.
Antisense oligonucleotide-mediated exon skipping
The deletion mutations found in roughly two thirds of exons are not in multiples of three base pairs, thus resulting shifts of one or two base pairs in the protein reading frame. Notably, if an exon with the sizeof (3n+1) bp is deleted, further translational skipping of another adjacentor neighboring exon with the size of(3n+2) bp would restore the out-of-frame mutation. This approach is known as exon skipping. Although exon skipping causes some amino acid deletion, in general truncations in the rod domain are tolerated, as seen in BMDpatients.To induce exon skipping, several methods have been investigated, including the use of anti sense oligonucleotides to bind to a targeted exon. Several different types of anti sense oligonucleotides have been tested and demonstrated as effective and functional in various animal models [11-16]. Encouragingly, anti sense oligonucleotide analogues are now being investigated in clinical trials, with some showing promising results [17-20]. However, due to the transient nature of antisense oligonucleotide analogues, DMDpatients are required to be repeatedly treated throughout their lifetime (Fig. 2).
Fig.2
Three therapeutic strategies target distinct biological processes. Gene augmentation approaches deliver functional cDNA, such as micro-dystrophin or micro-utrophin cDNA, to compensate the function of dystrophin protein. Retroviral vectors or DNA transposonvectors stably integrate the therapeutic cDNA into chromosomes randomly, whereas HDAv, AAV, HSV, or HAC vectors remain within the nucleus without integration. Due to the large size of the dystrophin or utrophin cDNA, transduction efficiency is one of the biggest obstacles. Genome editing approaches modify the mutated gene specifically, but off-target mutagenesis is a concern. The delivery of programmable nucleases is unexplored in the context of muscle tissue. Exon skipping uses antisense oligonucleotides to modulate the splicing patterns of a particular exon. Systematic delivery is feasible for antisense oligonucleotides, but the effect is transient. Risk of off-target effects or posttranslational suppression of the target gene should also be considered.
Gene augmentation therapy
The concept of gene therapy, which is to treat or cure congenital disorders, emerged through the development of viral vectors in the early 1970s. Since then, the field of gene therapy has significantly expanded with the concept of supplying a healthy gene to replace the function of the mutated gene [21-24]. So far, several genedelivery vectors based on viruses have been developed, such as retroviral vector [25-27], lentiviral vector [28-30], herpes simplex virus (HSV)-based vector [31,32], adenoviral vector (HDAdv) [33,34], adeno-retroviral hybrid vector [35], and adeno-associated virus (AAV) vector [36-40]. Each vector has its own advantages and disadvantages, as listed in Table 1.
Table 1
Advantages and disadvantages of commonly used viral vectors for DMD gene therapy
Vector
Packaging size limit
Genomic integration (stable expression)
Infection into non-dividing cells
Major risk/concern in gene therapy application
Retrovirus
∼8 kb
Yes
No
Leukemia
Lentivirus
∼8 kb
Yes
Yes
Leukemia, recombination with HIV
Herpes simplex virus
∼150 kb
No
Yes
Cytotoxicity and immunogenicity
Adenovirus (helper-dependent)
∼30 kb
No
Yes
Immunogenicity against viral capsid
Adeno-associated virus (AAV)
∼4 kb
No
Yes
Inhibited by neutralizing antibody
In the case of gene therapy for DMD, there are three major challenges: the large size of dystrophin cDNA (14 kbp for the Dp427 m isoform); the transduction of muscle tissues that are surrounded by layers of connective tissues; and the abundance of affected tissue, which consists of 30∼40% of body weight. In general, oversized viral vectors have significantly lower production efficiency, therefore leading to poor transduction efficiency. HSV vectors [31,32] and HDADv vectors [33, 34] have been demonstrated to deliver full-length dystrophin cDNA. However, other vectors require the dystrophin cDNA to be cut down in size by trimming the rod domain (as seen in mild BMDpatients), resulting in mini-dystrophin (6.2 kb) or micro-dystrophin genes (3.6 to 4.2 kb) [41]. As an alter native,the utrophin gene (official gene symbol: UTRN), which shares both structural and functional similarities with the dystrophin gene, was shown to functionally replace defective dystrophin [42, 43].Apart from viral vectors, non-viral vectors have also been investigated for DMD gene therapy. Sleeping Beauty is an engineered DNA transposon originally derived from Tc1/mariner transposons in fish [44] and is found to have high chromosomal integration ability of a foreign gene in human cells. Dr. Rita Perlingeiro’s group demonstrated that Sleeping Beauty transposon vector can transduce the micro-utrophin gene, which is a truncated form of the utrophin gene [45]. To overcome the packaging size limit of a gene transfer vector, HAC (human artificial chromosome) vectors have been investigated to deliver a 2.4 Mb humandystrophin genomic region that includes regulatory elements [46]. Despite the promise of delivering the full dystrophin genomic region, the current delivery efficiency of HAC vectors is low, thus requiring anextensive selection process for HAC vector transduction.
Genome editing technologies for DMD
Due to difficulties in delivering a functional dystrophin gene into myogenic cells, alternative approaches were anticipated. In this regard, the development of engineered nucleases opened a new era of gene therapy for DMD. Artificial nucleases, which are engineered to cut genomic DNA once at the designated target sequence, can induce DNA repair pathways, such as non-homologous end joining (NHEJ) and homologous directed repair (HDR). The NHEJ pathway results in the generation of small deletions or sometimes insertions, whereas the HDR pathway uses a repair template called the donor DNA.
Meganucleases for DMD
The first demonstration of genome editing technology for the dystrophin gene was reported in 2010 by Dr. Jacques Tremblay’s group in collaboration with the biotechnology company Cellectis [47]. They used meganucleases, which are homing end on ucleases that contain a large DNA recognition domain to target 12- to 40-bp DNA sequences. For example, I-SceI mega nuclease derived from the mitochondrial DNA of Saccharomyces cerevisiae binds and cutsan 18-bp recognition sequence (5′-TAGGGATAACAGGGTAAT-3′). This recognition sequence does not exist in the human genome, therefore, it can be used to introduce a specific DNA cleavage only at the target sequence without attacking human endogenous sequences. Similarly I-CreI, a mega nuclease derived from Chlamydomonas reinhardtii, recognizes a 22-bp DNA sequence (5′-CAAAACGTCGTACGACGTTTTG-3′). Cellectis has engineered I-CreI mega nuclease variants to recognize the humanRAG1 gene (I-CreI-RAG1, 5′-TGTTCTCAGGTACCTCAGCCAG-3′) [48]. To demonstrate a proof-of-principle, Dr. Tremblay’s group first inserted a mega nuclease target site (either I-SceI or I-CreI-RAG1 target site) into the middle of the dogdystrophin gene with a frame-shift mutation. They then checked whether over expression of the corresponding mega nuclease (I-SceI or I-CreI-RAG1) could induce a small deletion or insertion at the target site to alter the protein reading frame. Indeed, restored expression of the dogdystrophin protein was detected in 293FT cells, human myoblasts, and mouse muscle fibers. This report was the first demonstration of engineered nucleases restoring out-of-frame mutations of the dystrophin gene solely by the initiation of DNA cleavage. However, due to the lack of a natural targeting site for mega nucleases in the humandystrophin gene and because custom engineering of mega nucleases require laborious screening processes, custom engineering nucleases that can be targeted to an endogenous dystrophin sequence are desired.
ZFN mediated indels for DMD
Zinc finger domain is one of the most abundant DNA binding domains in mammals, and most of itspossible combinations of DNA binding motifs have been identified. Custom conjugation of several zinc finger domain swith a nuclease domain allows scientists to target specific sequences of interest. To target one target site, typically three or four zing finger domains are conjugated with FokI nuclease domain to form zinc finger nuclease (ZFN). FokI domain cleaves DNA only when it is dimerized, so a pair of zinc finger nucleases (ZFNs) is used to cleavage the target sequence and also to increase specificity.Dr. Jacques Tremblay’s group investigated the utility of ZFNs that target endo genous the humandystrophin gene together with several mega nucleases and found that ZFNs can induce small deletions (typically 1–20 bp in size) or small insertions (typically 1–5 bp in size) at the target site of the humandystrophin gene [49]. These results demonstrate that ZFNs are able to restore the reading frame of endo genous dystrophin simply by introducing a small deletion or insertion (indel).
ZFN-mediated knock-in for DMD
Site-specific DNA cleavage not only resulted in NHEJ mediated indels, but also enhances the efficiency of HDR. In humanDMD myoblasts carrying a deletion of exons 45–52, Dr. George Dickson’s group demonstrated that mega nucleases targeting intron 44 of the dystrophin gene facilitated the insertion of cDNA corresponding to the exon 45–52 region in a bulk cell population [50]. This is a promising gene therapy approach for patients who carry deletions of multiple exons. Unfortunately, restored dystrophin protein expression was not detected, presumably due to the low incidence of cDNA knock-in events or the insertion of an artificial exon altering the expression pattern of dystrophin protein. Further enhancement of knock-in events or selection of a collected cell pool is required.While ZFN is a powerful tool to edit a target site of interest, it does have drawbacks. Firstly, construction of an active ZFN pair is challenging, as simple assembly of the corresponding zinc finger domain does not guarantee full DNA binding activity due to cross interactions between adjacent zinc finger domains (known as context dependency). Secondly, even though zinc finger domains that recognize all possible combinations of trinucleotide motifs (43 = 64) have been identified, there are some biases or preferences to certain sequences, such as “GNN”. Therefore, designable target sequences are limited (Fig. 3).
Fig.3
Targetability of programmable nucleases. Programmable nucleases, such as ZFNs, TALENs, and CRISPR-Cas9 systems, have preferences or requirements for their target DNA sequences. The targeting of a particular site of interest can be limited for some nucleases. For example, ZFNs have a sequence preference for “5′-NNMNNMNNC [5 to 7bp spacer] GNNKNNKNN-3′” (where M = C or A, K = G or T, N = A or C or G or T), although this preference depends on the assembly method of the zinc finger domains. TALENs offer much more flexible targetability, as the size of the RVD domain (i.e. typically 15 to 20 repeats) and spacer length (i.e. 14 to 19 bp) can be variable. One major requirement, however, is a “T” 1 bp before the RVD domain. Therefore, the consensus sequence is “5′-TN (×15 ∼20) [14∼19 bp spacer] N (×15 ∼20) A-3′”. CRISPR-Cas9 requires a protospacer adjacent motif (PAM) to recognize its target sequence; e.g. “5′-NGG-3′” for Cas9 derived from Streptococcus pyogenes, and “5′-NNGRRT-3′” for Cas9 derived from Staphylococcus aureus. Each targetable sequence is plotted at around exon 45 of the dystrophin gene. Note that “targetable sequence” only accounts for the patterns matched with the above consensus motifs and does not take other factors, such as the epigenetic status (i.e. DNA CpG methylation), specificity in the human genome, or risk of off-target cleavages, into consideration. Sequence specificity or sequence uniqueness is one of the most important factors for choosing a proper nuclease, as nucleases targeting repeat sequences have high risk for off-target mutagenesis and cellular toxicity. In this regard, we have developed a database to visualize the uniqueness of DNA sequences by stacking a set of unique k-mer sequences (where k = 10 to 16), as indicated in the red histogram on top.
TALEN for DMD
To this end, a more flexible DNA targeting domain is preferred. TALE (Transcriptional Activator-Like Effector) domain, which was originally derived from the plant pathogenic bacteria Xanthomonas, has a unique structure for its DNA binding motif. TALE domain consists of a tandem repeat of 34 a mino acid residues, also known as repeat variable di-residue (RVD), with variations at the 12th and 13th amino acids. Each RVD recognizes a single nucleotide sequence, and there is less context dependency between adjacent RVDs. Therefore, the conjugation of TALE domain with FokI nuclease, which is known as TALEN, provides a much more flexible DNA binding platform for targeting a specific DNA sequence of interest (Fig. 3).Thanks to its greater flexibility in target sites, the TALEN system has high probability at targeting exonic regions, which are typically less than 200 bp in size. By targeting the protein-coding region in exons, the dystrophin coding frame can be modulated when the correct size of a deletion or insertion is introduced. Deletion is solely mediated by programmable nucleases via the NHEJ repair pathway and since a donor template is not required, the gene delivery process can be simplified. This NHEJ-mediated approach is unique in its targeting the rod domain region of the dystrophin gene, as small truncations or amino acid changes can be tolerated or have only minor effects on dystrophin protein function, as seen in BMDpatients. Dr. Charles Gersbach’s group utilized the TALEN system to target exon 51 in fibroblasts or an immortalized myoblast cell line, which was originally derived from a DMDpatient with deletion of exons 48–50 [51]. Successful introduction of a small deletion (i.e. 5 bp) in exon 51 resulted in a frame shift of the protein reading frame and restored dystrophin protein expression in differentiated myoblasts. We tested a similar approach in induced pluripotent stem (iPS) cells derived from a DMDpatient who lacks exon 44 [52]. By targeting exon 45 with TALEN, we successfully introduced a small deletion or insertion (i.e. 1 bp insertion), which restored the proper protein reading frame to express dystrophin protein after myogenic differentiation. However, the frame shift approach is highly stochastic, as the size of the indelby programmable nucleases cannot be controlled. Therefore, out of all the indelclones analyzed, only one-third of cells showed a proper protein reading frame.To enhance the likelihood of reading frame correction, we also tested the possibility of skipping exon 45 by targeting a splicing acceptor site. As consensus sequences of splicing acceptor sites mainly consist of a 2-bp “AG” sequence, we initially thought the disruption of a splicing acceptor site should be more efficient than the frame shifting approach. However, it was difficult to design TALENs or other programmable nuclease swith high specificity, due to the low complexity of the polypyrimidine tract (which is a “C and T” rich region) around the splicing acceptor site. Nevertheless, by targeting around 10 bp downstream from the splicing acceptor site, we managed to show that some NHEJ-mediated deletions can reach the splicing acceptor site and succeeded in skipping exon 45 to restore the proper reading frame of dystrophin protein [52]. A similar approach is also reported in skipping exon 51 by using ZFNs [53].For NHEJ-mediated deletion approaches, the control or prediction of the deletion pattern is important. Interestingly, deep sequencing analysis of deletion patterns identified apredominant deletion pattern (up to 30–40% of the deletion events) harboring microhomology sequences (typically 3–5 bp in size) on both sides of the cleavage site [52, 54]. By using web tools to predict microhomology-mediated end joining patterns [52, 55], we sought to enhance the success rate of the frame shift approach.Thus, by taking advantage of iPS cells and their propensity to self-renew indefinitely, we examined a knock-in approach of missing exon 44 by conjugating antibiotic selection followed by subsequent removal of the selection cassette by the Cre-loxP system. We conducted TALEN-mediated targeting experiments and observed high knock-in events amongst antibiotic resistant clones. Successful expression of full-length dystrophin protein was detected by immunostaining and western blotting upon differentiation into myoblasts. Two rounds of sub cloning is demanding and time consuming, but homologous recombination (HR)-mediated targeted knock-in of a missing exon might be a feasible approach in DMDpatient-derived iPS cells.
CRISPR-Cas9 for DMD
The TALEN system is versatile enough to target almost any DNA sequence, however, construction of TALEN is laborious, especially for newcomers to genome editing experiments. To construct one TALEN, a library of RVD domains is required to conjugate 14 to 20 RVD domains into an expression vector [56-59]. The assembly process normally takes two to three steps of cloning cycles (i.e. two to three weeks) depending on the assembly systems.Mean while, bacterial adaptive immunity systems [60], namely CRISPR (clustered regularly interspaced short palindromic repeats) and Cas (CRISPR associated) proteins [61], have been identified as a DNA cleavage machinery against invasive DNA elements, such as bacteriophages [62-64]. Soon after this finding, Drs. Jennifer Dundna and Emmanuelle Charpentier’s groups recognized that the combination of Cas9 protein and two small RNAs, crRNA and tracrRNA, can induce a DNA double strand break at a site complementary to the crRNA sequence [65]. Furthermore, crRNA and tracrRNA can be conjugated into a single guide RNA (sgRNA) by a tetranucleotide linker loop to simplify the system. This finding has had significant impact on the genome editing field, as only the construction of ansgRNA (∼100 bp in size) is required to determine the target sequence, and the Cas9 protein component is common for any target sequences. Construction of sgRNA is simple and straight forward, as target sequences can be synthesized and inserted into an expression vector driven by a PolIII promoter.One major limitation of the CRISPR system as a genome editing tool is the requirement of a “proto spacer adjacent motif (PAM)” sequence for the target site, as this sequence is essential for the initial step of the target sequence recognition [66]. For example, the most widely used CRISPR-Cas9 system derived from Streptococcus pyogenes requires a “NGG” PAM sequence at the 3’-end of the target sequence, while Staphylococcus aureus derived Cas9 requires a “NNGRRT” PAM sequence [67]. This feature somehow reduces the target ability of CRISPR-sgRNA at given sequences, especially when compared with the TALEN system, however, the CRISPR-Cas9 system is still flexible enough to target roughly every 10∼20 bp of DNA sequence (Fig. 3).Strikingly, it is reported that DNA cleavage activity with the Streptococcus pyogenes-derived CRISPR-Cas9 system is greater than that with the TALEN system in human embryonic stem (ES) cells and iPS cells [68]. In this regard, we tested the CRISPR-Cas9 system to correct dystrophin mutations in patient-derived iPS cells, finding the resulting cleavage activity and HR-mediated knock-in efficiency was comparable with those using the TALEN system [52]. Considering the ease of construction, the CRISPR system is significantly easier to use by non-cloning specialists (Table 2).
Table 2
Advantages and disadvantages of genome editing technologies
Programmable nuclease
DNA recognition motif
Construction procedures
Sequence targetability
Meganuclease
Homing endonuclease LAGLIDADG motifs
Very complicated
Low
ZFNs
Zinc finger domains
Complicated
Relatively low
TALENs
TAL effector RVD domain
Relatively complicated
Very high
CRISPR-Cas9
crRNA (DNA-RNA heteroduplex)
Easy
Relatively high
Another concern about the CRISPR system is its targeting specificity. There are a number of reports on off-target mutagenesis even with several base pair mismatches, especially in immortalized cancer cell lines [69-73]. On the other hand, several other groups reported high specificity and minimum off-target mutation load by whole genome sequencing in human pluripotent stem cells [74, 75]. We tested the risk of off-target mutagenesis by karyotyping, CNV analysis, and exome sequencing, and found minimum incidents of undesired mutagenesis in DMD-iPS cells. Therefore, at least in human pluripotent stem cells, the CRISPR system can be regarded as a precise recognition machinery. Further study is required to shed light on whether immortalized cell lines have abnormal DNA repair pathways similar to cancer, or whether pluripotent stem cells equip a particular DNA repair mechanism to maintain genomic integrity, such as Zscan4 [76] or Filia genes [77].Strategies to correct DMD mutations must considerer the type of mutation in the patient, as genome editing approach is highly custom tailored. Therefore, pathogenic mutation patterns must be carefully examined before deciding on the appropriate therapeutic strategy. To cover as many mutation types as possible, Dr. Gersbach’s group took advantage of the CRISPR-Cas9 system for its ability to target multiple exons simultaneously [78]. It is difficult to target all the mutations known to cause DMD, but this approach greatly expands the range of treatable DMDpatients. Targeting other types of mutations, especially large bulk deletions or duplication of multiple exons, is a future challenge.
Ex vivo gene therapy
Considering the stochastic nature of the deletion patterns as well as the risk of off-target mutagenesis by custom nuclease treatment, it is advantageous to predetermine the edited DNA sequence in isolated cells from patients in advance and then, only after proving the gene correction was successful, transplant the cells back to the patient. This “ex vivo” gene therapy approach is particularly useful for evaluating off-target mutagenesis. It has been shown that myoblast cells can be isolated from patient’s muscle biopsy and immortalized by transduction of a transforming factor, such as TERT (telomerase). Such immortalized myoblasts can be expanded unlimitedly and can form fused myofibers once transplanted back to muscle tissue [79, 80]. Such cells serve as ideal targets for genome editing, however, the risk of introducing transforming factors must be carefully examined.As another cell source to be expanded in vitro are the patient’sown fibro blast cells, which can be isolated and immortalized by transforming genes. Such fibro blast cells can be differentiated into myogenic cells by forced expression of the MyoD (also known as MYOD1) gene, which is a major transcriptional factor of myogenic development [81]. Again, oncogenic risk of the cells due to the introduction of transforming factors must be examined carefully.In comparison to most somatic cells, except for neoplastic cells, ES cells have unlimited self-renewal capacity and grow rapidly while maintaining normal karyotyping. In addition, ES cells remain pluripotent for the differentiation into different cell types even after massive expansion. Establishment of ES cells requires fertilized embryos, however, the invention of iPS cell technology now allows us to establish cells almost identical to ES cells from the patient’s own somatic cells by the transient expression of transcriptional and transforming factors [82, 83]. So far, hundreds of patient-derived iPS cell lines have been established, including DMDpatient-derived iPS cells, which were first reported in 2008 [84].Due to its unlimited self-renewal capacity and differentiation ability into myoblasts, iPS cell holds great promise as a cell source for ex vivo gene therapy. Dr. Oshimura’s group utilized HAC technology to deliver the full-length humandystrophin gene into DMD-iPS cells to restore dystrophin expression [85]. Viral vector-mediated gene delivery into iPS cells is another option [45], however, strong transcriptional silencing activity inpluripotent stem cells may hamper the stable expression of therapeutic transgenes [86]. Therefore, genome editing approaches have advantages over gene augmentation approaches when targeting ES or iPS cells.To materialize iPS cell-mediated ex vivo gene therapy, efficient differentiation into the proper cell type after genomic correction is essential (Fig. 4). Differentiation of patient-derived iPS cells into myoblasts is one promising approach, as differentiated myoblasts have the ability to fuse and form myofibers both in vitro and in vivo [87, 88]. Considering that human adult muscle can regenerate damaged muscle tissue from muscle stem cells called satellite cells, another potential approach would be the differentiation of patient-derived iPS cells into muscle stem cells or myogenic progenitor cells [89-91]. It is yet to be determined which cell type is best for engraftment and long-term therapeutic effect.
Fig.4
Ex vivo gene therapy approaches using iPS cells. A scheme for iPS cell-mediated ex vivo gene therapy approaches for DMD. Skin fibroblasts or monocytes from peripheral blood are reprogrammed to iPS cells by transient expression of the Yamanaka factors. The dystrophin mutation can then be repaired using genome engineering technologies. Such corrected iPS cells can be further differentiated into myoblasts to form myofibers. Either myoblasts or myofibers can be transplanted to patients, but only for transient recovery, as myoblasts or myofibers will eventually die after cellular turnover. An ideal approach would be to differentiate iPS cells into satellite cells, which are muscle stem cells, to gain long-term self-renewal and regeneration capacity in the myofibers. Currently, ex vivo expansion of primary satellite and genome editing is challenging, but progress here could circumvent the use of iPS cells.
Genome editing in zygotes
Apart from ex vivo gene therapy approaches, in vivo gene therapy approaches, where genome editing nucleases are directly injected into in vivo tissues, might be possible. Compared with ex vivo gene therapy, which requires the preparation of both the genome editing nuclease and appropriate target cells, in vivo gene therapy is a simpler procedure, as it only requires the preparation of the genome editing nuclease for therapeutic application.Muscle is one of the most abundant tissues in the human body, and DMD affects the entire muscle mass, therefore, correction of the systematic phenotype is a great challenge. Due to the lack of effective DNA delivery methods into adult tissues, one might consider introducing gene therapy for DMD at the early developmental stage. Dr. Eric Olson’s group reported “prevention” of the muscular dystrophy phenotype in mdx mice by direct injection of CRISPR-Cas9 mRNA, sgRNA and single-strandedoligodeoxynucleotide (ssODN) as a donor template into mouse zygotes [92]. Since totipotent zygotes can give rise to the entire population of cells in our body, gene correction in zygotes is an effective way to correct systemic cells in theory. However, it also brings the ethical debate about germline editing, since these edits will be transmitted to the offspring regardless of the outcome. Current genome editing technology cannot offer 100% correction efficiency and often suffers from several limitations, including mosaicism, random indel patterns, low HR efficiency and off-target mutagenesis. Agroup at Sun Yat-sen University tested the cleavage activity of CRISPR-Cas9 system in human tripronuclear zygotes at the HBB (=β-globin) gene locus [93] and found a surprisingly high off-target mutagenesis rate (up to 7 embryos out of 11 embryos investigated), which deviates from the observations made by many reports of mouse zygotes and human pluripotent stem cell lines, but is consistent with those of cancer cell lines instead. It is not clear from the experimental settings described why this high off-target mutagenesis rate was observed. Possibilities include impaired DNA repair pathways due to the nature of the tripronuclear zygotes, possible problems with the target DNA sequence, contamination of inaccurate sgRNA products during in vitro transcription, or something else. Regardless the conclusions of the report, genome editing procedures are still immature for application into human embryos.Will germ line editing be necessary to treat DMDpatients? To correct the dystrophin mutation, it is first necessary to diagnose where the genetic mutation is located. Recent deep sequencing technologies require only small amounts of DNA template for the identification of genomic mutations from biopsies of pre-implantation embryos. However, the inheritance of DMD follows an X-linked recessive fashion, therefore all female embryos and half of male embryos are not affected by the mutations. This begs the question, will it be necessary to select the affected male embryos and correct the mutated dystrophin gene by genome editing despite the various technical risks described above? Ethically, it would be more appropriate to select the non-affected population of embryos than genetically manipulate the affected one. Of course, the selection of embryos brings up another ethical issue, which is related to the rights of early human embryos. Considering the severity, incurability and lethality of DMD, pre-implantation diagnosis could be justified, and most parents will probably choose to give up on the embryo if DMD is detected. Further discussion will be necessary among scientists, patients, and broadersociety.
Genome editing in vivo
One of the most exciting approaches for DMD gene therapy is in vivo treatment of patient’s muscle tissues by genome editing nucleases. It has been investigated that some serotypes of AAV vector (i.e. AAV serotype 9) can effectively deliver a gene of interest into a wide variety of tissues, including skeletal muscles and cardiac tissues [94, 95].For conventional gene augmentation therapy, consistent and high-level expression of a therapeutic transgene is essential. However, for genome editing technologies, a simple “hit-and-away” strategy is feasible, as transient expression of the genome editing nuclease is sufficient to induce a stable alteration of the genomic DNA sequence.Cas9 protein from Streptococcus pyogenes is relatively large, and its cDNA size is about 4.2 kb. Considering the packaging limit of AAV vector, which is typically 4 to 5 kb including the promoter and termination signal, the use of smaller Cas9 is desirable for efficient in vivo delivery by this vector. Dr. Feng Zhang’s group screened several orthogonal Cas9 sequences and found that Cas9 from Staphylococcus aureus has comparable genome editing activity with S. pyogenes Cas9, but significantly smaller cDNA size (around 3.2 kb). The group demonstrated efficient delivery of S. aureus Cas9 into the mouse liver by tail vein injection of AAV2 ITR vector pseudo-typed with AAV8 capsid [67]. Delivery of S. aureus Cas9 into muscle tissues to correct the dystrophin mutation in mice and/or larger animal models of DMD is the next important step towards DMD genome editing gene therapy. Important to note, targeting muscle satellite cells is an attractive approach by Cas9, but AAV6 vector has poor transduction efficiency of muscle satellite cells compared with proliferating myoblasts [96]. Optimizing the proper gene delivery would significantly progress gene targeting in vivo muscle tissues.
Future challenges
The development of easy and efficient genome editing tools, such as TALENs and CRISPR-Cas9 systems, have opened a new era for DMD gene therapy. The collection of disease mutations in cultured cell or mouse models is the beginning of a long journey for clinical applications. Ex vivo gene therapy approaches require determination of the most applicable cell type based on feasibility of the genome editing, engraftment ability after transplantation, immunogenicity, and functional integration into the myofibers. The application of gene correction into satellite cells might be an ultimate hallmark for ex vivo gene therapy, but cultivation, maintenance and transplantation of such satellite cells are still challenging. In vivo gene therapy approaches may be less demanding in terms of technical requirements, however, current genome editing and gene delivery techniques do not offer sufficiently accurate and precise therapeutic application. Immune responses against the corrected dystrophin protein must also be take into consideration [97]. Overcoming these technical limitations would open the door for safe and effective gene therapy for DMD.
Authors: Annemieke Aartsma-Rus; Ivo Fokkema; Jan Verschuuren; Ieke Ginjaar; Judith van Deutekom; Gert-Jan van Ommen; Johan T den Dunnen Journal: Hum Mutat Date: 2009-03 Impact factor: 4.878
Authors: David G Ousterout; Pablo Perez-Pinera; Pratiksha I Thakore; Ami M Kabadi; Matthew T Brown; Xiaoxia Qin; Olivier Fedrigo; Vincent Mouly; Jacques P Tremblay; Charles A Gersbach Journal: Mol Ther Date: 2013-06-04 Impact factor: 11.454
Authors: María Luisa Guevara-Fujita; Francia Huaman-Dianderas; Daisy Obispo; Rodrigo Sánchez; Victor Barrenechea; Diana Rojas-Málaga; Alejandro Estrada-Cuzcano; Milana Trubnykova; Mario Cornejo-Olivas; Victoria Marca; Bertha Gallardo; Milagros Dueñas-Roque; Ana Protzel; Carlos Castañeda; Hugo Abarca; Luis Celis; Jorge La Serna-Infantes; Ricardo Fujita Journal: Mol Genet Genomic Med Date: 2021-07-29 Impact factor: 2.183