Qiang Li1,2, Qian Feng2, Ashley Snouffer2, Biyao Zhang2, Gustavo Rubén Rodríguez2,3, Esther van der Knaap2,4,5. 1. College of Horticulture, Hebei Agricultural University, State Key Laboratory of North China Crop Improvement and Regulation, Key Laboratory of Vegetable Germplasm Innovation and Utilization of Hebei, Collaborative Innovation Center of Vegetable Industry in Hebei, Baoding, China. 2. Center for Applied Genetic Technologies, University of Georgia, Athens, GA, United States. 3. Instituto de Investigaciones en Ciencias Agrarias de Rosario (IICAR-CONICET-UNR), Cátedra de Genética, Facultad de Ciencias Agrarias UNR, Santa Fe, Argentina. 4. Institute of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, GA, United States. 5. Department of Horticulture, University of Georgia, Athens, GA, United States.
Abstract
CRISPR/Cas-mediated genome editing is a powerful approach to accelerate yield enhancement to feed growing populations. Most applications focus on "negative regulators" by targeting coding regions and promoters to create nulls or weak loss-of-function alleles. However, many agriculturally important traits are conferred by gain-of-function alleles. Therefore, creating gain-of-function alleles for "positive regulators" by CRISPR will be of great value for crop improvement. CYP78A family members are the positive regulators of organ weight and size in crops. In this study, we engineered allelic variation by editing tomato KLUH promoter around a single-nucleotide polymorphism (SNP) that is highly associated with fruit weight. The SNP was located in a conserved putative cis-regulatory element (CRE) as detected by the homology-based prediction and the Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq). Twenty-one mutant alleles with various insertion and deletion sizes were generated in the LA1589 background. Five mutant alleles (m2+4bp , m3+1bp , m5-1bp , m13-8bp , and m14-9bp ) showed a consistent increase in fruit weight and a significant decrease in the proportion of small fruits in all experimental evaluations. Notably, m2+4bp and m3+1bp homozygote significantly increase fruit weight by 10.7-15.7 and 8.7-16.3%, respectively. Further analysis of fruit weight based on fruit position on the inflorescence indicated that the five beneficial alleles increase the weight of all fruits along inflorescence. We also found that allele types and transcriptional changes of SlKLUH were poor predictors of the changes in fruit weight. This study not only provides a way of identifying conserved CRE but also highlights enormous potential for CRISPR/Cas-mediated cis-engineering of CYP78A members in yield improvement.
CRISPR/Cas-mediated genome editing is a powerful approach to accelerate yield enhancement to feed growing populations. Most applications focus on "negative regulators" by targeting coding regions and promoters to create nulls or weak loss-of-function alleles. However, many agriculturally important traits are conferred by gain-of-function alleles. Therefore, creating gain-of-function alleles for "positive regulators" by CRISPR will be of great value for crop improvement. CYP78A family members are the positive regulators of organ weight and size in crops. In this study, we engineered allelic variation by editing tomato KLUH promoter around a single-nucleotide polymorphism (SNP) that is highly associated with fruit weight. The SNP was located in a conserved putative cis-regulatory element (CRE) as detected by the homology-based prediction and the Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq). Twenty-one mutant alleles with various insertion and deletion sizes were generated in the LA1589 background. Five mutant alleles (m2+4bp , m3+1bp , m5-1bp , m13-8bp , and m14-9bp ) showed a consistent increase in fruit weight and a significant decrease in the proportion of small fruits in all experimental evaluations. Notably, m2+4bp and m3+1bp homozygote significantly increase fruit weight by 10.7-15.7 and 8.7-16.3%, respectively. Further analysis of fruit weight based on fruit position on the inflorescence indicated that the five beneficial alleles increase the weight of all fruits along inflorescence. We also found that allele types and transcriptional changes of SlKLUH were poor predictors of the changes in fruit weight. This study not only provides a way of identifying conserved CRE but also highlights enormous potential for CRISPR/Cas-mediated cis-engineering of CYP78A members in yield improvement.
As populations grow and the climate changes, demands for increased crop productivity continue across the world (Tilman et al., 2011; Ray et al., 2013). However, the rate of crop yield increase driven by conventional breeding technologies is not on pace to meet the increasing demands for food (Godfray et al., 2010; Ray et al., 2013; Long et al., 2015; Gao, 2018; Willmann, 2018; Hickey et al., 2019; Wolter et al., 2019). Yield and produce size are complex quantitative traits controlled by multiple genes. Despite enormous efforts made in the identification of yield-related genes in various crops, the implementation of these alleles in breeding further yield improvements is limited due to the low levels of genetic diversity stemming from long-term domestication and intensive selection (Zuo and Li, 2014; Kumar et al., 2017). Furthermore, yield and fruit size are regulated by numerous quantitative trait loci (QTLs) with subtle effects, and the identification and introgression of natural mutations are time-consuming and laborious (Doebley et al., 2006; Shi and Lai, 2015; Birchler, 2017). Therefore, the development of innovative technologies toward yield increases is essential to expand phenotypic diversity and accelerate yield enhancement to feed a growing population.The CRISPR/Cas technologies have been successfully applied for crop improvement (Rodriguez-Leal et al., 2017; Gao, 2018; Chen et al., 2019; Hua et al., 2019; Mao et al., 2019; Zhang et al., 2019; Zhu et al., 2020). However, CRISPR/Cas-mediated genome editing has mainly focused on coding regions to produce loss-of-function mutants (Pandiarajan and Grover, 2018; Wolter and Puchta, 2018; Korotkova et al., 2019; Li Q. et al., 2020). While this application widely favors reverse genetic approaches for many domestication traits (Li et al., 2018; Zsogon et al., 2018), loss-of-function mutations often result in developmental defects that would hamper their applications in crop improvement (Xu et al., 2015; Swinnen et al., 2016; Morineau et al., 2017; Li C. et al., 2020). On the other hand, engineering cis-regulatory motifs (cis-engineering) within non-coding regions could result in fine-tuning gene expression and generate phenotypic diversity with less pleiotropic or deleterious effects than loss-of-function alleles (Swinnen et al., 2016; Rodriguez-Leal et al., 2017; Pandiarajan and Grover, 2018; Wolter and Puchta, 2018; Huang et al., 2020; Li C. et al., 2020; Liu et al., 2021).The downregulation of gene expression as the result of the editing of cis-regulatory regions in genes which are negative regulators of a desirable trait has been successful in crop improvement (Duan et al., 2016; Jia et al., 2016; Holme et al., 2017; Peng et al., 2017; Rodriguez-Leal et al., 2017; Hummel et al., 2018; Li et al., 2018; Jia et al., 2019; Korotkova et al., 2019; Oliva et al., 2019; Huang et al., 2020; Li C. et al., 2020; Liu et al., 2021). However, many agriculturally important traits are conferred by dominant gain-of-function mutations (Korotkova et al., 2017; Li et al., 2017; Korotkova et al., 2019). Therefore, the generation of gain-of-function alleles in the promoters of positive regulators of traits could lead to the modulation of gene expression levels and tissue or temporal-specific expression patterns that would have great value for crop improvement.CYP78A family members are recognized as positive regulators of organ weight and size in many crops, such as tomato (Zhang, 2012; Chakrabarti et al., 2013; Li et al., 2021), rice (Nagasawa et al., 2013; Yang et al., 2013; Maeda et al., 2019), wheat (Ma et al., 2015a,b), maize (Sun et al., 2017), soybean (Wang et al., 2015; Zhao et al., 2016), pepper (Chakrabarti et al., 2013), Jatropha curcas (Tian et al., 2016), and sweet cherry (Qi et al., 2017). Tomato KLUH (SlKLUH) underlies the fruit weight locus fw3.2. SlKLUH copy number is positively associated with fruit weight whereas knockout or knockdown SlKLUH often results in smaller fruits as well as other growth defects, including tiny inflorescences and infertile flowers (Chakrabarti et al., 2013; Alonge et al., 2020).Previously, we identified a potential regulatory SNP, named M9 SNP, in the promoter of SlKLUH which is highly associated with fruit weight. Thus, the M9 SNP was proposed to be the causative variant of the fw3.2 locus (Chakrabarti et al., 2013). However, a recent study demonstrated that an ∼50-kbp tandem duplication, rather than the M9 SNP, that includes SlKLUH underlies fw3.2, giving rise to 2- to 3-fold higher expression of SlKLUH and larger fruits. In this study, we found that the four tandem repeats are conserved motifs in the promoter of SlKLUH orthologs detected by homology-based prediction and Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq). The putative conserved motif was edited by CRISPR/Cas9 with a single guide RNA (gRNA) which included the M9 SNP to generate a total of 21 alleles. From these, we produced a series of homozygous transgene-free mutants showing a range of fruit weight variations. Among them, five mutant alleles (m2, m3, m5, m13, and m14) confer increased fruit weight with subtle effects in tomato in three experimental evaluations. Our data indicate that within tomato, cis-engineering of “positive regulators” using CRISPR/Cas9 has the potential for the improvement of quantitative traits. Furthermore, the application of cis-engineering of “positive regulators” to generate beneficial variants and the alleles generated in the conserved motifs in the promoter of SlKLUH identified are likely applicable to diverse crops.
Materials and Methods
Plant Materials and Growth Conditions
LA1589 carries a single copy of fw3.2 and a wild-type (wt) allele of the M9 SNP. Seeds were sown directly in the soil in 72-count 6 pack trays and grown in a growth chamber under 16-h light/8-h dark photoperiod for 5–6 weeks. The 5- to 6-week-old seedlings were transplanted into 2.45 L pots and were cultivated in a greenhouse under 16-h light/8-h dark photoperiod in Athens, GA, United States. All the plants of each experimental evaluation were randomly arranged in the greenhouse.
Guide RNA Design, CRISPR/Cas9 Construct, and Plant Transformation
The gRNA targeting the M9 SNP was designed using the CRISPR-P tool[1] (Lei et al., 2014). The CRISPR/Cas9 construct was assembled using the Golden Gate cloning method as previously described (Wu et al., 2018). Electroporation was used to introduce the final binary vector into Agrobacterium tumefaciens strain LBA4404, which was kindly provided by Dr. Joyce Van Eck, Cornell University. The LBA4404 harboring the binary vector was used for the transformations of LA1589. The genetic transformations of LA1589 were performed as described (Gupta and Van Eck, 2016) at Wayne Parrott’s Laboratory, University of Georgia.
Genotyping Edited Plants and Recovery of Homozygous Progenies
For genotyping of T0 transgenic plants, genomic DNA was extracted from leaves and was used for genotyping by PCR for the presence of the Cas9 using two specific primer pairs (14EP426/14EP427; 14EP438/14EP439) on the Cas9 coding region. The target region was amplified by PCR from the genomic DNA using specific primers flanking the gRNA target sites (17EP42/17EP47). The PCR products were purified and directly sequenced by Sanger sequencing. The mutations in T0 generation were analyzed by decoding sequencing chromatograms (Ma et al., 2015c).T0 transgenic plants were backcrossed to LA1589, and the seeds of the F1 generation were extracted and sown in 288-well plastic flats. Each progeny was genotyped by PCR using primer flanking of the target region (17EP218/EP2460), and PCR products were resolved on 3.5% (w/v) agarose gel. The progenies harboring small indels that cannot be easily identified by PCR assay were further genotyped by Derived Cleaved Amplified Polymorphic Sequences (dCAPS)-BsrBI, dCAPS-AciI, and dCAPS-HaeIII. For Cas9-free F1 plants, they were self-pollinated for the generation of homozygous mutants in the F2 generation. For the F1 plants carrying the Cas9 transgene, they were backcrossed with LA1589, and the Cas9-free homozygotes were obtained in the F3 generation. All the mutant alleles from the selected plants in F1, F2, and F3 generations were confirmed by Sanger sequencing, and the presence of Cas9 was determined by two specific primers in its coding region (Supplementary Table 4).
Phenotyping
The phenotyping was performed on homozygous mutants with three experimental evaluations during 2018 and 2019, each with at least three plants per genotype (Supplementary Table 3). Three stems were kept and trained on a bamboo stick, respectively. We kept six inflorescences per plant with eight fruits on each inflorescence. All the flowers were pollinated by hand. We numbered the fruit position from 1 to 8 according to the proximal to distal positions on each inflorescence, and the inflorescences were numbered on each plant from 1 to 6 according to the harvesting time. The fruit weight was measured individually using a precision balance (VWR 64B). For fruit weight distribution analysis, fruits were grouped into four categories based on the quartiles of the fruit weight from all genotypes in each experimental evaluation.
RNA Extraction and Quantitative Real-Time PCR
Since SlKLUH showed very low expression in developing fruits and very high expression in young flower buds in LA1589 (Chakrabarti et al., 2013), young flower buds at 9–13 days post initiation (dpi) (Supplementary Figure 7) were collected from at least three inflorescences per plant. Total RNA from meristems was then extracted using the TRIzol® Reagent (Thermo Fisher, United States). Total RNA was used for cDNA synthesis with a High-Capacity cDNA Reverse Transcription Kit (Thermo Fisher, United States). Quantitative real-time PCR (qRT-PCR) was performed with gene-specific primers using the SsoAdvanced™ Universal SYBR® Green Supermix (Bio-Rad, United States) reaction system on the CFX96 Real-Time system (Bio-Rad, United States), following manufacturer’s instructions. Clathrin adaptor complexes medium subunit (CAC) gene was used as an internal control (González-Aguilera et al., 2016; Supplementary Table 4).
Statistical Analysis
Statistical analysis was performed in R. The information of the statistical test is given in the respective figures.
Results
Generation of Novel Cis-Regulatory Alleles by CRISPR/Cas9 in LA1589
The M9 SNP is located within the second repeat of four 30-bp tandem repeats (Supplementary Figure 1). Homology-based prediction using Multiple Em for Motif Elicitation (MEME) indicated that the four tandem repeats are conserved motifs in orthologous KLUH promoters (Figure 1A), suggesting that the motifs might be the important regulatory elements of the CYP78A genes. Since active gene regulatory elements are associated with open chromatin (Zhu et al., 2015; Lu et al., 2019; Yan et al., 2020), ATAC-seq was utilized in tomato meristem and leaf tissues (Hendelman et al., 2021). Significant peaks covering the repeats were detected in both tissues, indicating that this region might be an important regulatory region of SlKLUH (Figure 1B). Many genetic changes underlying traits of economic importance reside in cis-regulatory elements (CREs) and cis-engineering mediated by CRISPR/Cas for crop improvement can be utilized to expedite the modification of these CREs (Li Q. et al., 2020; Hendelman et al., 2021). Therefore, we hypothesized that engineering fruit weight variation could be implemented by editing the presumptive CRE of SlKLUH.
FIGURE 1
Identification of putative cis-regulatory element in SlKLUH promoter using homology-based prediction and ATAC-seq. (A) Conserved motif analysis in the promoters of CYP78A members. The unrooted phylogenetic tree was depicted by the MEGA program with the neighbor-joining (NJ) method using full protein sequences of 15 CYP78As from tomato (Sl), pepper (Capana03g000903), potato (PGSC0003DMT400063324), Arabidopsis (At), rice (Os), wheat (Ta), soybean (Gm), maize (Zm), and sweet cherry (Pa). To identify the conserved motifs, the 2-kb upstream region of the start codon of the KLUHs was analyzed in MEME with the following parameters: “nmotifs 3, minw 6, maxw 30”; (B) ATAC-seq indicated that the target region was located in open chromatin. The data were visualized by the Integrative Genomics Viewer. The peaks at the target region were indicated by red shading; (C) A schematic map of the gRNA-targeted site. The protospacer adjacent motif (PAM) site and gRNA were highlighted in pink and cyan, respectively. The four tandem repeats in promoter were indicated by R1, R2, R3, and R4. TSS, transcription start site.
Identification of putative cis-regulatory element in SlKLUH promoter using homology-based prediction and ATAC-seq. (A) Conserved motif analysis in the promoters of CYP78A members. The unrooted phylogenetic tree was depicted by the MEGA program with the neighbor-joining (NJ) method using full protein sequences of 15 CYP78As from tomato (Sl), pepper (Capana03g000903), potato (PGSC0003DMT400063324), Arabidopsis (At), rice (Os), wheat (Ta), soybean (Gm), maize (Zm), and sweet cherry (Pa). To identify the conserved motifs, the 2-kb upstream region of the start codon of the KLUHs was analyzed in MEME with the following parameters: “nmotifs 3, minw 6, maxw 30”; (B) ATAC-seq indicated that the target region was located in open chromatin. The data were visualized by the Integrative Genomics Viewer. The peaks at the target region were indicated by red shading; (C) A schematic map of the gRNA-targeted site. The protospacer adjacent motif (PAM) site and gRNA were highlighted in pink and cyan, respectively. The four tandem repeats in promoter were indicated by R1, R2, R3, and R4. TSS, transcription start site.We tested the hypothesis by targeting the CRE using CRISPR/Cas9 with a single gRNA in the wild relative of the cultivated tomato LA1589 (Solanum pimpinellifolium) (Figure 1C). Seven first-generation transgenic (T0) individuals were obtained (Supplementary Figure 2). The disrupted target sites detected by PCR and Sanger sequencing suggested that two plants were biallelic for m3/m9 (T0–1) or m3/m6 (T0–15), two plants were homozygous for m6 (T0–9) or m16 (T0–11), and other two plants were heterozygous for m3 (T0–12) or chimeric for at least three alleles (T0–10) (Supplementary Figure 2 and Supplementary Table 1).To enrich for SlKLUH promoter mutant alleles covering a range of fruit weight variations, the CRISPR/Cas9-driven mutagenesis approach was utilized (Rodriguez-Leal et al., 2017). A sensitized population of 719 F1 plants was generated by the backcrossing of T0 lines with LA1589 and genotyped by PCR and Restriction Enzyme (PCR/RE) analysis (Supplementary Figure 3). F1 progenies from T0–9 crossed to LA1589 resulted in the identification of three new alleles, namely, m2, m14, and m21, based on indels observed by gel electrophoresis mobility shift after PCR using primers flanking the target region (Figure 2; Supplementary Figure 3; Supplementary Table 2). To identify small indels that are different from m6 or wt, we developed a screen scheme that exploits dCAPS analysis according to the sequences of m6 and wt and the recognition sequences of the three REs (BsrBI, HaeIII, and AciI). The remaining progenies were genotyped by dCAPS-BsrBI, which can digest m6 and wt. The new mutations that do not carry the restriction site for the digestion with BsrBI were further analyzed by dCAPS-HaeIII and/or dCAPS-AciI, by which we identified one plant heterozygous for m8 (Figure 2; Supplementary Figure 4; Supplementary Table 2). With this approach, we identified another 13 new alleles from the progenies of T0–10, T0–12, and T0–15 (Figure 2; Supplementary Figure 4; Supplementary Table 2).
FIGURE 2
CRISPR/Cas9-induced mutations in LA1589. PAM and gRNA are highlighted in pink and cyan, respectively. The four tandem repeats in promoter were indicated by R1, R2, R3, and R4. The black arrow and blue dashed line indicate the position of the M9 SNP in LA1589 and all the 21 mutants, respectively. Inserted and deleted nucleotides were shown in red and gray, respectively. m, mutant allele. The mutant alleles were numbered 1 through 21 according to the length of the alleles. For the subscripts of the mutant alleles, (-) indicates deletions followed by the number of bp deleted, (+) indicates insertions followed by the number of bp inserted.
CRISPR/Cas9-induced mutations in LA1589. PAM and gRNA are highlighted in pink and cyan, respectively. The four tandem repeats in promoter were indicated by R1, R2, R3, and R4. The black arrow and blue dashed line indicate the position of the M9 SNP in LA1589 and all the 21 mutants, respectively. Inserted and deleted nucleotides were shown in red and gray, respectively. m, mutant allele. The mutant alleles were numbered 1 through 21 according to the length of the alleles. For the subscripts of the mutant alleles, (-) indicates deletions followed by the number of bp deleted, (+) indicates insertions followed by the number of bp inserted.Collectively, a total of 21 mutant alleles were created in the LA1589 background (Figure 2 and Supplementary Table 2), including three insertion alleles (1, 4, and 98 bp) and deletions of various sizes (from 1 to 60 bp) (Figure 2). Cas9 often cleaves double-strand DNA at a position of 3-bp upstream of the protospacer adjacent motif (PAM) sequence, and most mutations occurred at the 4th base from the PAM site (Jinek et al., 2012). However, we found that eight alleles (m5, m9, m11, m12, m15, m17, m19, and m21) did not occur right upstream of the predicted double-strand break (DSB) position. These results indicated the high efficiency of the CRISPR/Cas9-driven mutagenesis screen approach creating a collection of novel mutant alleles, including unexpected mutations, in tomatoes. F1 plants carrying novel alleles that were Cas9 positive were backcrossed to LA1589 to segregate away the Cas9 transgene to avoid further edits and minimize off-target effects. The Cas9-free mutants homozygous for these mutant alleles were obtained in F2 and F3 generations and used for further analysis.
Evaluation of Novel Cis-Regulatory Alleles for Fruit Weight Variation
The plants homozygous for the 21 alleles were grown in December 2018 with at least three plants per genotype (Supplementary Table 3). They displayed a continuum of fruit weight variation between 0.92 and 1.14 g (Figure 3A and Supplementary Table 3). Compared to LA1589, plants homozygous for nine alleles (m2, m3, m5, m18, m13, m14, m21, m1, and m17) showed more than 5% variation in fruit weight (Figure 3A). Notably, fruit weight of m2, m3, m5, m18, m13, and m14 homozygotes was greater by 10.68, 8.74, 8.74, 8.74, 7.77, and 7.77% compared to LA1589, respectively. In contrast, fruit weight decreased by 8.74, 9.71, and 10.68% for m21, m1, and m17, respectively (Figure 3A and Supplementary Table 3).
FIGURE 3
The effects of mutant alleles on fruit weight in three experimental evaluations. (A) Fruit weight differences from LA1589. Each data point represents the fruit weight differences of the mutant (in percentage) to the LA1589. Up and down triangle indicates significantly greater or lower than LA1589 (two-tailed Student’s t-test, P < 0.05), respectively, and the circle represents no significant difference of the mutant relative to LA1589; (B) Percentage distribution of the fruit weight according to fruit weight quartile. Fruits were grouped by their weight into four categories based on the quartiles of fruit weight in all genotypes in each replicate. Each quartile is indicated by a different color, and the fruit range within each quartile is shown in the upper part. A quantitative plot was generated using the data from December 2018. Asterisks denote significant difference (*P < 0.05; **P < 0.01; ***P < 0.001) of the proportion of small fruit (gray bar) between mutant and LA1589 as determined by chi-squared test. NS, non-significant difference; -, not evaluated.
The effects of mutant alleles on fruit weight in three experimental evaluations. (A) Fruit weight differences from LA1589. Each data point represents the fruit weight differences of the mutant (in percentage) to the LA1589. Up and down triangle indicates significantly greater or lower than LA1589 (two-tailed Student’s t-test, P < 0.05), respectively, and the circle represents no significant difference of the mutant relative to LA1589; (B) Percentage distribution of the fruit weight according to fruit weight quartile. Fruits were grouped by their weight into four categories based on the quartiles of fruit weight in all genotypes in each replicate. Each quartile is indicated by a different color, and the fruit range within each quartile is shown in the upper part. A quantitative plot was generated using the data from December 2018. Asterisks denote significant difference (*P < 0.05; **P < 0.01; ***P < 0.001) of the proportion of small fruit (gray bar) between mutant and LA1589 as determined by chi-squared test. NS, non-significant difference; -, not evaluated.We then selected eight mutant alleles for further analysis, including five larger-fruited alleles (m2, m3, m5, m13, and m14) and three smaller-fruited alleles with large insertion/deletions (m1, m20, and m21). Notably, statistically significant increases in average fruit weight were observed in plants homozygous for m2 and m3 (Figure 3A and Supplementary Table 3). The increased fruit weight of the homozygotes m5, m13, and m14 was still observed and showed 5% or more variation in fruit weight compared to LA1589 in May 2019 and/or September 2019 (Figure 3A and Supplementary Table 3). Collectively, m2, m3, m5, m13, and m14 homozygous mutants had 14.07, 9.60, 5.09, 6.26, and 7.06% greater fruit weight compared to the LA1589 (Figure 3A and Supplementary Table 3). However, the smaller-fruited alleles with large insertions/deletions, m1, m20, and m21, had inconsistent effects on fruit weight between December 2018 and September 2019 (Figure 3A and Supplementary Table 3), potentially due to environmental effects. In the following experiments, we focused only on the larger fruit mutant alleles (m2, m3, m5, m13, and m14) as they showed consistent effects on fruit weight across all experimental evaluations.To better describe the increased fruit weight in the mutants homozygous for m2, m3, m5, m13, and m14, we performed a fruit weight distribution analysis. Fruits were grouped into four categories based on the quartiles of fruit weight in all genotypes in each experimental evaluation. The mutant alleles displayed a range of quantitative effects on the distribution of fruit weight in each experimental evaluation (Supplementary Figure 5). Importantly, the five large-fruited alleles (m2, m3, m5, m13, and m14) showed a significant decrease in the proportion of the small fruits (gray bar) across all or the majority of the experimental evaluations (Figure 3B and Supplementary Figure 5). The results indicated that the decreased proportion of the small fruits is responsible for the increased fruit weight in the five mutants.
The Effects of the Five Mutant Alleles on Fruit Weight Based on Fruit Position on Inflorescence
Previous studies in the domesticated tomato demonstrated that fruits in the same inflorescence generally differ in size from the larger ones at the proximal position (Beadle, 1937; Bangerth and Ho, 1984) and similar results were also observed in LA1589 (Supplementary Figure 6). Fruit weight showed a decreasing trend from the 1st (proximal) to 8th (distal) fruit in the inflorescence, and the decrease rate of fruit weight was 29.1% in December 2018, 35.0% in May 2019, and 13.8% in September 2019 (Supplementary Figure 6).Given that the five mutant alleles can increase fruit weight and decrease the proportion of small fruits, we hypothesized that this could be achieved by reducing the decreasing trend along with the position of an individual inflorescence or increasing the weight of all fruits on each inflorescence. To explore this, two-way ANOVA was performed using the weight and position along the inflorescence. As expected, the fruit weight was significantly affected by genotype and fruit position (P < 0.05) (Table 1). However, the interaction of genotype and fruit position had no significant effect (P > 0.05) in all experimental evaluations (Table 1 and Figure 4), indicating that the five mutant alleles had no significant effect on the rate of fruit weight decrease along inflorescence. Therefore, the five mutant alleles can increase the weight of all fruits along inflorescence without affecting the decreasing rate.
TABLE 1
Results of two-way ANOVA and Tukey’s honestly significant difference (HSD) test examining the influence of genotype and fruit position along inflorescence on fruit weight in each replication.
Two-way ANOVA
December 2018
May 2019
September 2019
Df
Sum sq
F value
P
Df
Sum sq
F value
P
Df
Sum sq
F value
P
Genotype
5
0.43
11.37
0.00*
3
0.64
32.21
0.00*
5
2.61
53.35
0.00*
Fruit position
7
3.00
56.89
0.00*
7
4.56
97.96
0.00*
7
1.10
15.98
0.00*
Genotype*Fruit position
35
0.15
0.57
0.98
21
0.05
0.37
0.99
35
0.11
0.31
1.00
*Significant difference (P < 0.05). Different color indicates different experimental evaluations.
FIGURE 4
The effects of genotype and fruit position on fruit weight. (A) Linear regression between fruit weight and fruit position. Because the interaction of genotype and fruit position had no significant effect, the slopes of the regression lines are equal; in other words, the regression lines are parallel to each other in each replicate. Each data point is the mean fruit weight of each fruit position in one plant; (B) Schematic diagram shows the increased fruit weight along inflorescence in the mutants (lower panel) compared to LA1589 (upper panel).
Results of two-way ANOVA and Tukey’s honestly significant difference (HSD) test examining the influence of genotype and fruit position along inflorescence on fruit weight in each replication.*Significant difference (P < 0.05). Different color indicates different experimental evaluations.The effects of genotype and fruit position on fruit weight. (A) Linear regression between fruit weight and fruit position. Because the interaction of genotype and fruit position had no significant effect, the slopes of the regression lines are equal; in other words, the regression lines are parallel to each other in each replicate. Each data point is the mean fruit weight of each fruit position in one plant; (B) Schematic diagram shows the increased fruit weight along inflorescence in the mutants (lower panel) compared to LA1589 (upper panel).
Non-linear Relationships Between Allele Types, Expression Levels, and Phenotypic Changes
The effects of promoter mutations on the phenotypic changes are often unpredictable and unexpected due to the complexity of transcriptional controls (Rodriguez-Leal et al., 2017). In this study, eight alleles (m1, m2, m3, m4, m5, m6, m7, and m15) had the unaltered wt M9 SNP (Figure 2), among which m2, m3, and m5 showed an increase in fruit weight, while other five showed no or inconsistent effects on fruit weight in this study (Figure 3 and Supplementary Table 3). These results suggested that the M9 SNP had no or minor effects on tomato fruit weight, which was supported by the finding of the SlKLUH gene duplication underlying fw3.2(Alonge et al., 2020). In addition, although many alleles shared overlapping deletions, they had different phenotypic effects. For example, m13–8 and m14 showed consistent effects on increasing fruit weight, while m20 and m21 had opposite effects on fruit weight between December 2018 and September 2019 (Figure 3A and Supplementary Table 3).Given that the lower expression of SlKLUH results in smaller fruits (Chakrabarti et al., 2013; Alonge et al., 2020), we tested whether the expression levels of SlKLUH were upregulated in the mutants homozygous for the five larger fruit mutant alleles. We evaluated the SlKLUH expression in young flower buds at 9–13 dpi (Supplementary Figure 7). Although increased fruit weight was observed for the five large-fruited mutant alleles compared to LA1589, the SlKLUH expression levels from them are comparable to LA1589 (Figure 5A). Remarkably, there was a low correlation between the SlKLUH expression levels and fruit weights (Figure 5B). Similar findings were also observed for the tomato lc allele that is caused by two SNPs in a 15-bp repressor element downstream of tomato WUSCHEL (SlWUS) (van der Knaap et al., 2014; Rodriguez-Leal et al., 2017). Subtle differences in SlWUS expression level were not captured by RT-PCR, resulting in larger fruit size (Muños et al., 2011; van der Knaap et al., 2014; Rodriguez-Leal et al., 2017). These results support that allele types and transcriptional changes are poor predictors of phenotypic changes as previously noted (Rodriguez-Leal et al., 2017; Hendelman et al., 2021).
FIGURE 5
Non-linear relationships between transcriptional change for SlKLUH and fruit weight variation. (A)
SlKLUH expression and fruit weight analyses of the m2, m3, m5, m13, and m14 homozygotes; (B) The correlation between SlKLUH expression and fruit weight. Each point represents one plant per genotype. Two experimental evaluations were taken, each with four to five plants per genotype. The m13 and m14 homozygotes were only included in one experimental evaluation.
Non-linear relationships between transcriptional change for SlKLUH and fruit weight variation. (A)
SlKLUH expression and fruit weight analyses of the m2, m3, m5, m13, and m14 homozygotes; (B) The correlation between SlKLUH expression and fruit weight. Each point represents one plant per genotype. Two experimental evaluations were taken, each with four to five plants per genotype. The m13 and m14 homozygotes were only included in one experimental evaluation.
Discussion
Expanding genetic diversity is of great importance for fine-tuning quantitative traits. However, the reduced genetic variation in modern crops limits the resources that breeders have available to cause subtle changes in agronomic traits (Birchler, 2017; Xing et al., 2020). CRISPR/Cas-mediated cis-engineering holds great promise to fine-tune quantitative traits that are highly valued in crop improvement such as yield and produce size (Rodriguez-Leal et al., 2017; Pandiarajan and Grover, 2018; Wolter and Puchta, 2018; Li Q. et al., 2020).Mutations in the promoter often result in unexpected transcriptional and phenotypic changes due to the complexity of transcriptional control (Rodriguez-Leal et al., 2017; Shrestha et al., 2018; Cui et al., 2020; Huang et al., 2020). This is especially true for editing promoters with several gRNAs or unknown CREs. Contrary to this, mutating CREs with known functions can generate predictable quantitative variation or traits. One remarkable example is the disruption of the CArG element, a repressor motif downstream of SlWUS, leading to larger fruits with more locules (Rodriguez-Leal et al., 2017; Li et al., 2018). Two recent studies in rice reported that bacterial blight-resistant plants were created through CRISPR/Cas editing of the transcription-activator-like effector (TALe)-binding element (EBE) in the promoter of SUGARS WILL EVENTUALLY BE EXPORTED TRANSPORTERS (SWEET) genes (Oliva et al., 2019; Li C. et al., 2020). In another example in rice, the modification of the GT-1 element that is responsible for salt induction of OsRAV2 confers adaptive salt responses (Duan et al., 2016). These studies highlight the great value for de novo discovery and characterization of CREs for boosting CRISPR/Cas-mediated cis-engineering in crop improvement.The identification of conserved motifs by comparing promoter sequences of orthologous genes from different plants is one of the effective ways of de novo CRE discovery (Li Q. et al., 2020). Initially, no well-known or previously described CREs were identified in the target site using PLACE and PlantCARE. However, three conserved motifs that correspond to the four tandem repeats in the SlKLUH promoter were identified by the comparative analysis of the orthologous KLUH promoters. These data, together with the results of the ATAC-seq, directed us to engineer the specific CRE including the M9 SNP using CRISPR/Cas9. While knockout or constitutive knockdown of SlKLUH leads to growth defects and infertile phenotypes (Chakrabarti et al., 2013; Alonge et al., 2020), all the novel promoter alleles generated had normal growth and fertility in our study, supporting the notion that the modifications of cis-regulatory regions can benefit crop improvement or breeding with no or less deleterious effects (Swinnen et al., 2016; Rodriguez-Leal et al., 2017; Li Q. et al., 2020). Overall, the deletion alleles had no or weak positive effects on fruit weight, whereas the alleles with insertions except for m1 significantly increased fruit weight. Importantly, consistent and reproducible results were observed for the five mutant alleles showing significant changes in fruit weight, indicating that the deletions and insertions may have generated novel CREs in the SlKLUH promoter, resulting in larger fruit without a detrimental impact, especially for the 1- and 4-bp insertions. However, the mechanistic aspects of increased fruit weight are not known. Therefore, m2 and m3 show great potential for breeding by introducing them into elite tomato cultivars using precision genome editing. In addition, the conserved motifs were also identified in the promoter of KLUHs in potato, pepper, soybean, rice, wheat, maize, and sweet cherry (Figure 1), suggesting that our approach could be applicable to engineer fruit or seed size variation with KLUHs in these crops.Promoter editing has revealed complex relationships between allele types, transcriptional changes, and phenotypic changes which remain to be fully elucidated (Rodriguez-Leal et al., 2017). However, they can have positive effects on agronomically important traits. In this study, we observed positive changes in fruit weight in the five large-fruited mutants, especially mutants homozygous for m2 and m3; however, no simple linear relationship between expression level and fruit weight changes was observed (Figures 3, 5). This may be due to the complexity of transcriptional control and the pleiotropic regulation of genes by differing elements within the promoter to control the additional aspects of gene expression, such as spatial-temporal specificity, and has been observed in other studies editing multiple CREs (Rodriguez-Leal et al., 2017; Hendelman et al., 2021; Liu et al., 2021). It is possible that larger fruits are due to changes in SlKLUH expression at developmental stages not sampled or at levels not detectible by qRT-PCR, such as changes in cell types expressing SlKLUH. Therefore, using CRISPR/Cas-mediated promoter engineering to screen for desirable traits at the phenotypic level may be more practical for crop improvement than detecting transcriptional changes. Moreover, the role of the putative CRE harboring the M9 SNP in fruit weight regulation needs to be further investigated.In summary, this study identified a putative conserved CRE by combining homology-based prediction, ATAC-seq, and CRISPR/Cas9, which is applicable to diverse genes and crops. Five alleles, namely, m2, m3, m5, m13, and m14, were created by editing the CRE that showed increased fruit weight and potential for breeding. This study not only provides a way of identifying conserved CRE but also highlights the enormous potential for CRISPR/Cas-mediated cis-engineering of CYP78A members in yield improvement.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.
Author Contributions
QL and EK conceived the project. QL, QF, AS, BZ, and GR performed the research. QL analyzed the data and wrote the draft with revisions from QF, AS, BZ, GR, and EK. All authors contributed to the article and approved the submitted version.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Authors: Michael Alonge; Xingang Wang; Matthias Benoit; Sebastian Soyk; Lara Pereira; Lei Zhang; Hamsini Suresh; Srividya Ramakrishnan; Florian Maumus; Danielle Ciren; Yuval Levy; Tom Hai Harel; Gili Shalev-Schlosser; Ziva Amsellem; Hamid Razifard; Ana L Caicedo; Denise M Tieman; Harry Klee; Melanie Kirsche; Sergey Aganezov; T Rhyker Ranallo-Benavidez; Zachary H Lemmon; Jennifer Kim; Gina Robitaille; Melissa Kramer; Sara Goodwin; W Richard McCombie; Samuel Hutton; Joyce Van Eck; Jesse Gillis; Yuval Eshed; Fritz J Sedlazeck; Esther van der Knaap; Michael C Schatz; Zachary B Lippman Journal: Cell Date: 2020-06-17 Impact factor: 66.850
Authors: Aaron W Hummel; Raj Deepika Chauhan; Tomas Cermak; Andrew M Mutka; Anupama Vijayaraghavan; Adam Boyher; Colby G Starker; Rebecca Bart; Daniel F Voytas; Nigel J Taylor Journal: Plant Biotechnol J Date: 2018-01-22 Impact factor: 9.803