| Literature DB >> 32413257 |
Francine J Boonekamp1, Sofia Dashko1, Donna Duiker1, Thies Gehrmann2, Marcel van den Broek1, Maxime den Ridder1, Martin Pabst1, Vincent Robert2, Thomas Abeel3, Eline D Postma1, Jean-Marc Daran1, Pascale Daran-Lapujade1.
Abstract
The construction of powerful cell factories requires intensive and extensive remodelling of microbial genomes. Considering the rapidly increasing number of these synthetic biology endeavors, there is an increasing need for DNA watermarking strategies that enable the discrimination between synthetic and native gene copies. While it is well documented that codon usage can affect translation, and most likely mRNA stability in eukaryotes, remarkably few quantitative studies explore the impact of watermarking on transcription, protein expression, and physiology in the popular model and industrial yeast Saccharomyces cerevisiae. The present study, using S. cerevisiae as eukaryotic paradigm, designed, implemented, and experimentally validated a systematic strategy to watermark DNA with minimal alteration of yeast physiology. The 13 genes encoding proteins involved in the major pathway for sugar utilization (i.e., glycolysis and alcoholic fermentation) were simultaneously watermarked in a yeast strain using the previously published pathway swapping strategy. Carefully swapping codons of these naturally codon optimized, highly expressed genes, did not affect yeast physiology and did not alter transcript abundance, protein abundance, and protein activity besides a mild effect on Gpm1. The markerQuant bioinformatics method could reliably discriminate native from watermarked genes and transcripts. Furthermore, presence of watermarks enabled selective CRISPR/Cas genome editing, specifically targeting the native gene copy while leaving the synthetic, watermarked variant intact. This study offers a validated strategy to simply watermark genes in S. cerevisiae.Entities:
Keywords: DNA and RNA watermarks; Saccharomyces cerevisiae; differential RNA expression analysis; genome engineering; glycolysis; pathway swapping
Mesh:
Substances:
Year: 2020 PMID: 32413257 PMCID: PMC7309318 DOI: 10.1021/acssynbio.0c00045
Source DB: PubMed Journal: ACS Synth Biol ISSN: 2161-5063 Impact factor: 5.110
Figure 1Comparison of two watermarking strategies. (A) First strategy with clustered watermarks. (B) Second strategy with watermarks distributed over the whole coding region. The tables in panel A and B represent the % of sequencing reads that can be captured by the two watermarking strategies, calculated from in silico simulated 100 bp paired-end sequencing reads.
Figure 2Construction of SinLoG (Single Locus Glycolysis) strains IMX1770 and IMX1771 using the glycolysis swapping strategy.[13] (A) A newly designed glycolysis is integrated in the CAN1 locus by simultaneous CRISPR/Cas9-aided editing of CAN1 and in vivo assembly of glycolytic expression cassettes and helper fragments (ARS418, ARS1211 and the selection marker HIS3). The > and < signs next to the gene names indicate the directionality of transcription and letters indicate the synthetic homologous recombination (SHR) sequence which was used for assembly. (B) Subsequently, the Single Locus Glycolysis present in the SGA1 locus was excised by double editing using CRISPR/Cas9 and replaced by the URA3 selection marker. The set of genes integrated in CAN1 is then the sole set of glycolytic genes present in the newly constructed strain and is essential for growth on glucose.
Figure 3Strain construction workflow. The Switchable Yeast Glycolysis (SwYG) strain, IMX1338, served as parental strain to introduce in chromosome V a SinLoG (Single Locus Glycolysis) with native ORFs and standardized promoters/terminators (IMX1747) as well as with watermarked ORFs and standardized promoters/terminators (IMX1717). From both strains the native SinLoG in chromosome IX with variable promoters and terminators was removed (resulting in strain IMX1771 and IMX1770, respectively). After removal of URA3 from strain IMX1771 (native ORFs) the SinLoG with watermarked ORFs was introduced in chromosome IX, resulting in a strain with double glycolysis (IMX2028).
List of Strains Used in This Study
| strain name | genotype | source and brief description |
|---|---|---|
| Strains Characterized in This Study | ||
| IMX1770 | MAT | This study; Prototrophic strain with watermarked single locus glycolysis (WMG strain); Derived from IMX1717 |
| IMX1771 | MAT | This study; Prototrophic strain with native single locus glycolysis (NG strain); Derived from IMX1747 |
| IMX2028 | MATa | This study; Prototrophic strain with native and watermarked single locus glycolysis (DG strain); Derived from IMX1748 |
| Strains Used As Starting Point or Intermediate in the Construction of the above Strains | ||
| CEN.PK113–7D | Control strain;[ | |
| IMX589 | ||
| IMX1338 | MAT | This study; Derived from IMX589; Contains a SinLoG in Chr. IX; Uracil and histidine auxotroph |
| IMX1717 | MAT | This study; Derived from IMX1338; Strain with native (variable prom and term length) and watermarked SinLoG in Chr. IX and V respectively; Histidine auxotroph |
| IMX1747 | MAT | This study; Derived from IMX1338; Strain with native SinLoG with variable prom and term length in Chr. IX and native SinLoG with standardized prom and term length in Chr. V; Histidine auxotroph |
| IMX1748 | MAT | This study; Derived from IMX1771; Strain with watermarked SinLoG in Chr. IX and native SinLoG with standardized prom and term length in Chr. V; Uracil auxotroph |
Figure 4Physiological characterization of strains with native (IMX1771) and watermarked (IMX1770) glycolytic genes during aerobic batch cultures in bioreactors. (A) Left panel, biomass concentration (gram dry biomass per liter), central panel, glucose concentration (mM); right panel, ethanol concentration (mM). Three independent culture replicates are represented for each strain. Shades of blue with square symbols, IMX1770, shades of red with round symbols, IMX1771. (B) Specific enzyme activities of the 12 reactions encoded by the 13 glycolytic enzymes (Pfk1 and Pfk2 form an enzyme complex) of the strains with native (IMX1771, red bars) and watermarked glycolysis (IMX1770, blue bars). Samples were taken in midexponential phase. Bars represent the average and standard deviation of measurements from three independent batch cultures for each strain. Stars indicate enzyme activities that are significantly different between the two strains (Student t test, p-value threshold 0.05, two-tailed test, homoscedastic).
Physiological Characterization in Bioreactor of Yeast Strains with Native and Watermarked Glycolysisa
| IMX1771 | IMX1770 | |
|---|---|---|
| Ysx (gdry weight/gglucose) | 0.12 ± 0.01 | 0.13 ± 0.01 |
| Ys,glycerol (mol/mol) | 0.07 ± 0.01 | 0.05 ± 0.01 |
| Ys,ethanol(mol/mol) | 1.47 ± 0.05 | 1.46 ± 0.02 |
| Ys,acetate(mol/mol) | 0.05 ± 0.00 | 0.06 ± 0.00 |
| μmax (h–1) | 0.32 ± 0.00 | 0.33 ± 0.01 |
| qglucose (mmol/g–1·h–1) | –14.6 ± 0.7 | –14.7 ± 0.9 |
| qglycerol (mmol/g–1·h–1) | 0.96 ± 0.03 | 0.76 ± 0.17 |
| qethanol(mmol/g–1·h–1) | 21.4 ± 0.3 | 21.4 ± 1.7 |
| qacetate(mmol/g–1·h–1) | 0.79 ± 0.05 | 0.88 ± 0.06 |
| 105 ± 2 | 103 ± 3 | |
Physiological characterization of strains with native (IMX1771) and watermarked (IMX1770) glycolytic genes during aerobic batch cultures in bioreactors. Data represent the average and standard deviation of measurements from three independent batch cultures for each strain. Statistical analysis (Student t test, p-value threshold 0.05, two-tailed test, homoscedastic) revealed no significant differences between the two strains.
Figure 5Genome-wide transcriptome analysis of IMX1770 and IMX1771. The x-axis represents the log fold change in expression, and the y-axis represents the −log p-value. Each point represents a transcript. A negative log fold change reflects higher expression in the native strain than in the watermarked strain, and vice versa. The horizontal, dashed line represents the FDR corrected p-value threshold of 0.05, and the vertical dashed lines represent a log fold change threshold of 1. Red points indicate significantly differentially expressed transcripts (FDR-corrected p-value above 0.05 and Log fold change higher than 1).
Figure 6Glycolysis and fermentation transcript levels of S. cerevisiae grown in aerobic batch cultures in bioreactors. (A) Watermarked transcript levels of IMX1770 (blue) and native transcript levels of IMX1771 (red). (B) Watermarked and native transcript levels of IMX2028. Bars represent the average and standard deviation of three independent cultures replicates. Samples were taken in midexponential phase (Table S9). No significant change in expression was found between watermarked and native genes (Student t test, p-value threshold 0.05, two-tailed test, homoscedastic) between IMX1770 and IMX1771 and within IMX2028.
Figure 7Diagnostic PCR for selective editing of native glycolytic genes. Separation of PCR products resulting from outside–outside amplification to identify edited (nonwatermarked) and nonedited (watermarked) loci for PYK1 (A) and TPI1 (B) from transformants of IMX1717 (double SinLoG). (A) Lanes 1–15 show the PCR results of amplification of the PYK1 locus of randomly picked colonies. Successful editing of the locus results in a DNA fragment with a length of 670 bp. No editing of the locus results in a DNA fragment with a length of 2177 bp. Primers 11915 and 4667 were used. Lanes 1, 5, and 15 display bands of both sizes revealing selective editing. (B) Lane 1–15 show the PCR results of amplification of the TPI1 locus of randomly picked colonies. Successful editing of the locus results in a DNA fragment with a length of 378 bp. No editing of the locus results in a DNA fragment with a length of 1125 bp. Primers 3514 and 6406 were used. Lanes 9–11, 13, and 15 display bands of both sizes revealing selective editing. A negative control is indicated with “C-“ (IMX1338, SinLog). In the lanes indicated with “L”, GeneRuler DNA ladder mix was loaded. 1% (w/v) agarose in TAE.