| Literature DB >> 35845313 |
Weiji Zhai1,2, Yanting Duan1,2, Xiaomei Zhang3,4, Guoqiang Xu1,2, Hui Li3,4, Jinsong Shi3,4, Zhenghong Xu1,2, Xiaojuan Zhang1,2.
Abstract
The intrinsic terminator in prokaryotic forms secondary RNA structure and terminates the transcription. However, leaking transcription is common due to varied terminator strength. Besides of the representative hairpin and U-tract structure, detailed sequence and thermodynamic features of terminators were not completely clear, and the effect of terminator on the upstream gene expression was unclearly. Thus, it is still challenging to use terminator to control expression with higher precision. Here, in E. Coli, we firstly determined the effect of the 3'-end sequences including spacer sequences and terminator sequences on the expression of upstream and downstream genes. Secondly, terminator mutation library was constructed, and the thermodynamic and sequence features differing in the termination efficiency were analyzed using the FlowSeq technique. The result showed that under the regulation of terminators, a negative correlation was presented between the expression of upstream and downstream genes (r=-0.60), and the terminators with lower free energy corelated with higher upstream gene expression. Meanwhile, the terminator with longer stem length, more compact loop and perfect U-tract structure was benefit to the transcription termination. Finally, a terminator strength classification model was established, and the verification experiment based on 20 synthetic terminators indicated that the model can distinguish strong and weak terminators to certain extent. The results help to elucidate the role of terminators in gene expression, and the key factors identified are crucial for rational design of terminators, and the model provided a method for terminator strength prediction.Entities:
Keywords: FlowSeq; Free energy; Intrinsic terminator; Machine learning; Structure-activity relationship; Transcription termination
Year: 2022 PMID: 35845313 PMCID: PMC9257418 DOI: 10.1016/j.synbio.2022.06.003
Source DB: PubMed Journal: Synth Syst Biotechnol ISSN: 2405-805X
Fig. 1Schematic diagram of the terminator-probe plasmids. The expression level of upstream gene EGFP and downstream gene mRFP1, with terminators or spacers inserted, were quantified.
Fig. 2Comparison of gene expression that located at upstream and downstream of varied intrinsic terminator sequences and spacer sequences. (A and C) The fluorescence intensity of upstream and downstream genes. ***P < 0.001 indicated significant difference (Welch′s t-test). (B)The correlation analysis between expression of upstream gene and ΔG/L. (D) The left figure showed distribution of fluorescence intensity of up/down stream reporter gene regulated by three terminators and a spacer sequence. The right figure indicated the relative transcriptional levels of downstream gene regulated by terminators and spacer sequence.
Fig. 3Schematic diagram of construction of terminator mutation library and the result of flow cytometric sorting. (A) The design of terminator mutation library and the FlowSeq process to obtain the phenotype and genotype characteristics of terminator variants, including the design of library, transformation and cell sorting. The red letter represented the mutant base and the number represented the nucleotide position. (N: A/T/C/G; B:C/G/T; W: A/T; Y:C/T) (B) The distribution of fluorescence of the up/down-stream reporter gene in each bin. (C) The weighted fluorescence intensity of the upstream and downstream genes regulated by each terminator variant.
Fig. 4Terminators with longer stem and more T bases in the U-tract region were more likely to become strong terminators. (A and C) The sequence conservation analysis at each position of variants in strong terminator and weak terminator group. (B and D) The frequency of each nucleotide in each position in strong and weak terminator groups. The dots with larger size and brighter color indicated the higher frequency. The proportion of 6- and 7-nt stem (E) and the content of T base in the U-tract region (F) in weak and strong terminator group.
Fig. 5Thermodynamic parameters of the terminators between weak and strong terminator groups. ΔG represented the free energy of entire terminator. ΔG, ΔGand ΔG represented the free energy of hairpin structure, loop structure and U-tract of terminator, respectively. ΔG′ reflected the weighted free energy of the 15 bp broad U-tract region included U-tract and its downstream 7 bp. Mann-Whitney test was used for statistical analysis, and P < 0.05 was considered to be significant (*P < 0.05, **p < 0.01, ***p < 0.001).
Fig. 6Schematic diagram of establishing the model.
Fig. 7The results of the terminator strength classification model and the experimental verification. (A) The correlation analysis of relative fluorescence intensity of upstream and downstream reporter genes of 20 synthetic terminators. (B)The average accuracy, precision, recall and f1-score of 5-fold cross validation trained by the XGBoost algorithm. (C and D) The comparison of downstream reporter gene (p = 0.0387, two independent sample T-test) and termination efficiency (p = 0.0109, two independent sample T-test) between terminator groups that classified by the model as strong or weak. P < 0.05 was considered to have significant difference.