| Literature DB >> 32226034 |
Lei Tan1, Yi Hu1, Yalan Li1, Lingchen Yang1, Xiong Cai2, Wei Liu1, Jiayi He1, Yingxin Wu1, Tanbin Liu1, Naidong Wang3, Yi Yang3, Robert S Adelstein4, Aibing Wang1,4.
Abstract
The generation of genetically modified mouse models derived from gene targeting (GT) in mouse embryonic stem (ES) cells (mESCs) has greatly advanced both basic and clinical research. Our previous finding that gene targeting at the Myh9 exon2 site in mESCs has a pronounced high homologous recombination (HR) efficiency (>90%) has facilitated the generation of a series of nonmuscle myosin II (NM II) related mouse models. Furthermore, the Myh9 gene locus has been well demonstrated to be a new safe harbor for site-specific insertion of other exogenous genes. In the current study, we intend to investigate the molecular biology underlying for this high HR efficiency from other aspects. Our results confirmed some previously characterized properties and revealed some unreported observations: 1) The comparison and analysis of the targeting events occurring at the Myh9 and several widely used loci for targeting transgenesis, including ColA1, HPRT, ROSA26, and the sequences utilized for generating these targeting constructs, indicated that a total length about 6 kb with approximate 50% GC-content of the 5' and 3' homologous arms, may facilitate a better performance in terms of GT efficiency. 2) Despite increasing the length of the homologous arms, shifting the targeting site from the Myh9 exon2, to intron2, or exon3 led to a gradually reduced GT frequency (91.7, 71.8 and 50.0%, respectively). This finding provides the first evidence that the HR frequency may also be associated with the targeting site even in the same locus. Meanwhile, the decreased trend of the GT efficiency at these targeting sites was consistent with the reduced percentage of simple sequence repeat (SSR) and short interspersed nuclear elements (SINEs) in the sequences for generating the targeting constructs, suggesting the potential effects of these DNA elements on GT efficiency; 3) Our series of targeting experiments and analyses with truncated 5' and 3' arms at the Myh9 exon2 site demonstrated that GT efficiency positively correlates with the total length of the homologous arms (R = 0.7256, p<0.01), confirmed that a 2:1 ratio of the length, a 50% GC-content and the higher amount of SINEs for the 5' and 3' arms may benefit for appreciable GT frequency. Though more investigations are required, the Myh9 gene locus appears to be an ideal location for identifying HR-related cis and trans factors, which in turn provide mechanistic insights and also facilitate the practical application of gene editing.Entities:
Year: 2020 PMID: 32226034 PMCID: PMC7105122 DOI: 10.1371/journal.pone.0230126
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparison of the HR efficiency and features of frequently targeted loci/sites.
| Locus/Gene | Total length of homologous arms (kb) | Length of 5’ arm (kb) | Length of 3’ arm (kb) | Ratio of the length of 5’ to 3’ arm | ES cell lines | Positive/Negative selection marker | Targeting efficiency (%) | References |
|---|---|---|---|---|---|---|---|---|
| Myh9 | 5.7 | 4.0 | 1.7 | 2.3:1 | V6.5 | Neomycin/TK | 91.7 | [ |
| ColA1 | 6.0 | 3.3 | 2.7 | 1.2:1 | V6.5 | Neomycin/- | 80.0 | [ |
| HPRT | 9.6 | 3.8 | 5.8 | 1:1.5 | BPES | HAT/- | [ | |
| ROSA26 | 5.4 | 1.1 | 4.3 | 1:3.9 | AK7 | Neomycin/DTA | 34.8 | [ |
| 5.4 | 1.1 | 4.3 | 1:3.9 | HM-1 | Neomycin/TK | Undisclosed | [ |
TK: thymidine kinase gene; HAT: hypoxanthine-aminopterin-thymidine; DTA: diphtheria toxin A.
Comparison of the sequence features of several widely used loci for site-specific insertion of exogenous gene.
| Gene/locus | Length of the 5’ and 3’ homologous arms | G/C content (%) (5’/3’) | SSR (%) (5’/3’) | CpG islands (n) (5’/3’) | Percentage of SINE (%) (5’/3’) | Percentage of LINE (%) (5’/3’) | Other DNA elements (%) (5’/3’) |
|---|---|---|---|---|---|---|---|
| Myh9 | 4.0/1.7 | 0.0/0.0 | 0/0 | 3.1/0.0 | 0.0/0.0 | 0.0/0.0 | |
| ColA1 | 3.3/2.7 | 5.9/2.9 | 0/0 | 0.0/0.0 | 0.0/0.0 | 0.0/0.0 | |
| HPRT | 3.8/5.8 | 43.0/39.3 | 1.9/7.5 | 0/0 | 0.0/0.0 | 4.5/0.0 | 0.0/0.0 |
| ROSA26 | 1.1/4.3 | 68.3/38.7 | 0.0/0.0 | 1 | 0.0/4.1 | 0.0/0.0 | 0.0/2.6 |
GC content represents the percentage of nucleotides in the strand that possesses either cytosine or guanine bases; Simple sequence repeat (SSR) consists of short, tandemly repeated di, tri-, tetra- or penta-nucleotide motifs; CpG island is a short stretch of DNA in which the frequency of the CG sequences is higher than other regions; SINEs denotes short interspersed nuclear elements; LINEs denotes long interspersed nuclear elements;
*CpG island located at the position of 48–810.
Fig 1Gene targeting at various sites of the mouse Myh9 locus.
(A) Diagram of the constructs targeting to different sites of the Myh9 gene locus. Shown are part of the Myh9 gene locus including exons1-4, the distance between them, the down arrows representing the targeted insertion of Neomycin (upper), the constructs targeting to distinct sites of the Myh9 gene locus and the length of the 5’ and 3’ arms of these constructs (middle), as well as the whole sequence region used for the generation of these constructs (lower). The HR efficiency of these targeting constructs is summarized in Table 3. The average percentage of HR efficiency is derived from two repeats. (B) Representative gel image of PCR identification of the GT recombinants at the Myh9 gene exon2 site. (C) Representative gel image of PCR identification of the GT recombinants at the Myh9 gene exon3 site. For representative gel image of PCR identification of the GT recombinants at the Myh9 intron2 site, please see reference [31]. M: DNA Marker; P: Positive control; N: Negative control; PC: positive clone; NC: negative clone.
Gene targeting frequency at distinct targeting sites of the Myh9 locus in mouse ES cells.
| Targeting site | Positive vs Screened clones | Targeting frequency (%) |
|---|---|---|
The bioinformatic analysis of the sequences used to create the constructs targeting to different sites of the Myh9 locus.
| The constructs targeting to the sites of the Myh9 locus | The properties of the sequences used to create the targeting construct | ||||||
|---|---|---|---|---|---|---|---|
| Total length (bp) | GC content (%) | Percentage of simple sequence repeat (SSR) (%) | CpG Island (%) | Percentage of SINEs (%) | Percentage of LINEs (%) | Percentage of other DNA elements (%) | |
Fig 2Gene targeting at the Myh9 gene exon2 site with various length of the homologous arms.
(A) Diagram of a series of truncated targeting constructs. The 5’ homologous arm is left-side truncated from 4, 3, 2 to 1 kb, while the 3’ one is right-side truncated from 1.7, 1.1 to 0.6 kb. Additionally, a left-side truncated 3’ arm with the deletion of the exon2 and beyond (about 0.5 kb) is also generated. Each targeting construct is given a number (No) as indicated at the left, the total length, the length of each 5’ or 3’ arm and the ratio between them are also indicated. Detailed information of these constructs and their HR frequency in mouse ES cells are summarized in Table 5. The average percentage of HR efficiency is derived from two repeats. (B) Shown is representative gel image of PCR identification of the GT recombinants for the targeting construct No4. M: DNA marker; PC: positive clone; NC: negative clone. (C) The graph indicates a downward trend of the gene targeting efficiency with the decreased length of the homologous arms and reflects the correlation of the gene targeting frequency with the total length of the homologous arms, the R2 value shown in the graph is calculated from thirteen points (p<0.01).
Summary of the HR frequency for each targeting construct.
| Targeting Construct No* | Left arm length (kb) | Right arm length (kb) | Positive vs Total clones | Targeting frequency (%) |
|---|---|---|---|---|
Genetic features of the sequences used for creating different 5’ and 3’ truncated homologous arms.
| Truncated arms (kb) | G+C content (%) | SSR | CpG island | Percentage of SINE (%) | Percentage of LINE (%) | Other DNA elements |
|---|---|---|---|---|---|---|
| 50.4 | 0.0 | No | 0.0 | 0.0 | ||
| 50.2 | 0.0 | No | 0.0 | 0.0 | ||
| 51.0 | 0.0 | No | 0.0 | 0.0 | ||
| 5’-1.0 | 51.5 | 0.0 | No | 0.0 | 0.0 | 0.0 |
| 3’-1.7 | 50.9 | 0.0 | No | 0.0 | 0.0 | 0.0 |
| 3’-1.1 | 50.9 | 0.0 | No | 0.0 | 0.0 | 0.0 |
| 3’-0.6 | 54.5 | 0.0 | No | 0.0 | 0.0 | 0.0 |
| 3’-1.2 | 49.2 | 0.0 | No | 0.0 | 0.0 | 0.0 |