Literature DB >> 30386365

Genome-Scale Analysis of the WRI-Like Family in Gossypium and Functional Characterization of GhWRI1a Controlling Triacylglycerol Content.

Xinshan Zang1, Wenfeng Pei1, Man Wu1, Yanhui Geng1, Nuohan Wang1, Guoyuan Liu1, Jianjiang Ma1, Dan Li1, Yupeng Cui1, Xingli Li1, Jinfa Zhang2, Jiwen Yu1.   

Abstract

Cotton (Gossypium spp.) is the most important natural fiber crop and the source of cottonseed oil, a basic by-product after ginning. AtWRI1 and its orthologs in several other crop species have been previously used to increase triacylglycerols in seeds and vegetative tissues. In the present study, we identified 22, 17, 9, and 11 WRI-like genes in G. hirsutum, G. barbadense, G. arboreum, and G. raimondii, respectively. This gene family was divided into four subgroups, and a more WRI2-like subfamily was identified compared with dicotyledonous Arabidopsis. An analysis of chromosomal distributions revealed that the 22 GhWRI genes were distributed on eight At and eight Dt subgenome chromosomes. Moreover, GhWRI1a was highly expressed in ovules 20-35 days after anthesis and was selected for further functional analysis. Ectopic expression of GhWRI1a rescued the seed phenotype of a wri1-7 mutant and increased the oil content of Arabidopsis seeds. Our comprehensive genome-wide analysis of the cotton WRI-like gene family lays a solid foundation for further studies.

Entities:  

Keywords:  GhWRI1a; WRI-like; cotton; cottonseed oil; expression pattern

Year:  2018        PMID: 30386365      PMCID: PMC6198791          DOI: 10.3389/fpls.2018.01516

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

Cotton, especially upland cotton (Gossypium hirsutum L.), is the main source of renewable textile fibers and is also well known for its oil-rich seeds. After ginning, fuzzy cottonseed is processed into four major products: hulls (26%), linters (9%), oil (16%), and meal (45%), with 4% lost in processing (Liu et al., 2009). The most valuable cottonseed oil is typically composed of approximately 26% palmitic acid (C16:0), 15% oleic acid (C18:1), and 58% linoleic acid (C18:2) (Liu et al., 2002). Because cotton is the world’s sixth largest source of vegetable oil (Liu et al., 2002, 2009), increasing cottonseed oil content through classical breeding techniques and biotechnological approaches is important. Triacylglycerols (TAGs), which accumulate in plant seeds and fruits, are major renewable sources of reduced carbon used as food, industrial feedstocks, and fuel (Bates and Browse, 2012). As reviewed previously (Bates and Browse, 2012), plants use two main pathways to produce diacylglycerol (DAG), the immediate precursor molecule to TAG synthesis. AtWRI1, an AP2 transcription factor, is involved in the regulation of seed storage metabolism in Arabidopsis (Focks and Benning, 1998; Cernac and Benning, 2004). The homozygous atwri1 mutant has a wrinkled seed phenotype and exhibits an 80% reduction in seed oil content compared with wild type (WT) Arabidopsis (Focks and Benning, 1998; Cernac and Benning, 2004). Expression of AtWRI1 cDNA under the control of the cauliflower mosaic virus 35S-promoter has been found to lead to increased seed oil content and the accumulation of TAGs in developing seedlings (Cernac and Benning, 2004). The involvement of orthologous genes of AtWRI1 in the regulation of oil content has also been reported in many other plant species (Liu et al., 2010; Shen et al., 2010; Pouvreau et al., 2011; An and Suh, 2015; Grimberg et al., 2015; Yang et al., 2015; Hofvander et al., 2016; An et al., 2017). For example, in rapeseed, overexpression of WRI1-like (BnWRI1) cDNAs driven by cauliflower mosaic virus 35S-promoter results in 10–40% increased seed oil content and enlarged seed size and mass in 51 transgenic Arabidopsis lines (Liu et al., 2010). In maize, overexpression of ZmWRI1 results in an oil increase without affecting germination, seedling growth, or grain yield in transgenic maize (Shen et al., 2010; Pouvreau et al., 2011). Eighteen putative target genes of ZmWRI1a have been identified by transcriptomic experiments, 12 of which contain the cis-element bound by AtWRI1 in their upstream regions. Interestingly, the higher seed oil content is not accompanied by a reduction of starch in ZmWRI1a transgenic lines, and could be utilized in transgenic breeding (Pouvreau et al., 2011). Recently, expression of CsWRI1A, B, or C has rescued the seed phenotype of the Arabidopsis wri1-3 loss-of-function mutant (An et al., 2017). In cotton, WRI-like genes are found to be participated in the fiber development and oil seed content. Silencing of the expression of WRINKLED1 by TRV-VIGS (tobacco rattle virus triggers virus-induced gene silencing), corresponding to GhWRI1b in our study, has been found to increase fiber length but reduce oil seed content, suggesting the possibility of increasing fiber length by repartitioning carbon flow (Qu et al., 2012). Fiber transcriptome of G. hirsutum producing short-fiber and long-fiber is compared with the transcriptome of extra-long fiber producing G. barbadense, and find that expression pattern of a Wrinkeled1 gene shows close association with fiber length. The authors speculate that Wrinkled1 transcription factor (GenBank accession number: DW505003.1), also corresponding to GhWRI1b in the present study, is involved in the development of extra-long staples in cotton (Qaisar et al., 2017). Moreover, compared with WT, overexpression of GhWRI1 (GenBank accession number: JX270189), corresponding to GhWRI1b in our study, has been observed to increase seed lipid content and decrease protein content in transgenic upland cotton (Liu et al., 2018). Four WRI1-like genes, named AtWRI1-4, are present in Arabidopsis (To et al., 2012). Seed-specific overexpression of AtWRI3 and AtWRI4, but not AtWRI2, can suppress the wrinkled phenotype of wri1-4 and restore normal oil accumulation (To et al., 2012). These results imply that WRI-like family genes play important roles in the developmental regulation of fatty acid and TAG production in plants. In this study, we performed a comprehensive genome-wide analysis to further understand the complexity of WRI-like family genes in cotton. In addition, a transgenic approach was used to clarify the function of GhWRI1a in TAG production.

Materials and Methods

Sequence Retrieval, Multiple Sequence Alignment, and Phylogenetic Analysis

Genome sequences of G. arboreum (A2, BGI_V1.0) (Li et al., 2014), G. raimondii (D5, BGI_V1.0) (Wang et al., 2012), G. hirsutum acc. TM-1 (AD1, NBI_V1.1) (Zhang et al., 2015), and G. barbadense acc.3-79 (AD2, SGI_V1.0) (Yuan et al., 2015) were downloaded from the CottonGen website[1]. AtWRI1, AtWRI2, AtWRI3, and AtWRI4 were acquired from TAIR 10[2]. WRI-like genes in cacao were acquired from the cacao genome database[3]. WRI-like genes in rice were acquired from the rice annotation project database[4]. To identify WRI-like genes from Gossypium, AtWRI1, AtWRI2, AtWRI3, and AtWRI4 protein sequences were used as queries against the above-mentioned cotton genomes. ClustalX version 2.0 (Larkin et al., 2007) was used to perform multiple sequence alignments of all identified WRI-like genes in this study (Supplementary File ). A phylogenetic analysis was carried out using the neighbor-joining algorithm with the pairwise deletion option, Poisson correction model, and uniform rates, with the statistical reliability of the resulting tree evaluated using 1,000 bootstrap replicates (Tamura et al., 2013). The online ExPASy tool[5] was used to calculate the sequence length, theoretical molecular weight (MW), and isoelectric point (pI) of WRI-like proteins.

Chromosomal Location, Gene Duplication, and Gene Loss

MapChart[6] was used to visualize the mapping of WRI-like genes (Voorrips, 2002). Gene duplication events were defined as previously described criteria (Dong et al., 2016; Cui et al., 2017). Gene loss evens were analyzed based on the best match and the syntenic blocks in the CottonGen website[7]. DnaSP software of phylogenetic analysis by the maximum likelihood method was used to calculate Ka and Ks of the duplicated gene pairs.

Genetic Structure Analysis and Protein Domain Detection

GhWRI gene structures were generated using the Gene Structure Display Server (GSDS)[8]. The SMART database[9] was used for detection of GhWRI protein domains.

Expression Pattern Analysis of GhWRI Genes Based on RNA-Seq Data

FPKM values of GhWRI genes were calculated using previously reported RNA-seq data of 22 cotton tissues (SRA accession code: PRJNA248163) (Zhang et al., 2015).

Transgenic Plant Generation and Expression Analysis

The complete coding sequence of GhWRI1a (Supplementary File ) was amplified with gene specific primers. The resulting PCR product was cloned into a digested pBI121 vector with BamH I and Sac I using ClonExpress® II One Step Cloning Kit (Vazyme, Nanjing, China). We used Agrobacterium tumefaciens strain GV3101 containing this binary construct to transform Arabidopsis plants. Transformants were selected on MS medium supplemented with kanamycin (50 mg/L). The progeny of transformants showed an approximately 3:1 segregation of live and dead phenotypes, and homozygous lines in the T3 generation were used for further analysis (Zang et al., 2017). To detect the relative expression level of GhWRI1a in the transgenic Arabidopsis lines, siliques were collected 15 days after anthesis (DPA), frozen immediately in liquid nitrogen, and stored at -80 °C for RNA isolation. Quantitative real-time PCR (qRT-PCR) was performed to determine the expression pattern of GhWRI1a, with the 2-ΔΔCt method (Livak and Schmittgen, 2001) used to quantify the expression level of GhWRI1a relative to the 18S rRNA endogenous control. Each experiment was independently repeated in triplicate. Primers are listed in Supplementary Table .

Generation of CRISPR/Cas9 Transgenic Plant

For AtWRI1 gene editing, two single-guide RNAs (sgRNAs) were designed to target the first and fifth exons, namely Target1 and Target2 (Supplementary Figure ). The two integrated targets were ligated to BsaI-digested pRGEB32-GhU6.9 as previously reported (Wang et al., 2017). This construct was introduced into Agrobacterium tumefaciens strain GV3101, which was used to transform Arabidopsis Col-0 as described above. The resulting CRISPR/Cas9 transgenic lines were genotyped for mutations using a pair of primers spanning the two target sequences (Supplementary Table ). The homozygous T3 generation was used for further analysis.

Oil Content Analysis

We determined total oil content using an NMI20-Analyst nuclear magnetic resonance spectrometer (Niumag, Shanghai, China).

Results

Genome-Wide Identification and Phylogenetic Analysis of WRI-Like Genes in Gossypium

Two diploid cottons, G. arboreum (AA genome) and G. raimondii (DD genome), evolved from a common ancestor (Zhang et al., 2017). The most widely cultivated tetraploid cotton species are G. hirsutum (AADD, AD1 genome) and G. barbadense (AADD, AD2 genome), both of which originated from inter-genomic hybridization of two A- and D-genome progenitor species (Paterson et al., 2012). To identify all WRI-like proteins in AD1, AD2, AA and DD genomes, Arabidopsis WRI1-4 protein sequences (AtWRI1/AT3g54320, AtWRI2/AT2g41710, AtWRI3/AT1g16060, and AtWRI4/AT1g79700) were queried against reference genomes of the above-mentioned four species. All WRI-like candidates were further screened based on the conserved AP2 domain using the SMART database. A total of 59 WRI-like genes were identified: 11 in G. raimondii, 9 in G. arboreum, 22 in G. hirsutum, and 17 in G. barbadense (Table ). WRI-like gene names and identifiers, gene pairs, and predicted properties of WRI-like proteins are listed in Table . Characteristics of WRI-like genes and predicted properties of WRI-like proteins. A phylogenetic tree was constructed to reveal the relationships of WRI-like proteins in Arabidopsis, cacao, rice, and cotton (Figure ). This phylogenetic analysis classified WRI-like genes into WRI1, WRI2, WRI3/WRI4, and WRI2-like subfamilies. In comparison with dicotyledonous Arabidopsis, a more WRI2-like subfamily was identified interestingly. The WRI1 subfamily contained 11 members: 4 from G. hirsutum, 4 from G. barbadense, 1 from G. arboreum, and 2 from G. raimondii. The WRI2 subfamily consisted of seven members: two from G. hirsutum, three from G. barbadense, and one each from G. arboreum and G. raimondii. The WRI3/WRI4 subfamily included 18 members: 6, 6, 3, and 3 from G. hirsutum, G. barbadense, G. arboreum, and G. raimondii, respectively. Finally, the WRI2-like subfamily comprised 23 members: 10, 4, 4, and 5 in G. hirsutum, G. barbadense, G. arboreum, and G. raimondii, respectively. Phylogenetic tree of WRI-like gene family in Gossypium and Arabidopsis. The phylogenetic tree of all WRI-like proteins in four Gossypium species and Arabidopsis was constructed using the neighbor-joining method with 1,000 bootstrap replicates.

Chromosomal Distribution of WRI-Like Genes

The identified WRI-like genes were physically mapped to the chromosomes of cotton using the reference genome sequences (Figure and Table ). In the G. arboretum genome, nine GaWRIs were evenly distributed on seven chromosomes (A01, A04, A05, A07, A09, A10, and A12) (Figure ). One GaWRI gene each was located on chromosomes A01, A05, A07, A09, A10, and A12, and three GaWRI genes were found on chromosome A04. Nine of the 11 GrWRI genes in the G. raimondii genome were uniformly distributed on six chromosomes (D02, D05, D08, D09, D12, and D13), with one each positioned on chromosomes D05 and D09 (Figure ). The other three GrWRI genes were only located on the scaffolds. Among the 22 GhWRI genes identified in the G. hirsutum genome, 11 originated from the eight At subgenome chromosomes (A02, A04, A05, A07, A09, A10, A12, and A13), while 11 were derived from the eight Dt subgenome chromosomes (D03, D04, D05, D07, D09, D10, D12, and D13) (Figure ). Two genes each were located on chromosomes D04, D05, A09, and D09, while chromosome A05 harbored three GhWRIs. Each of the remaining chromosomes contained one GhWRI gene each. Among the 17 GbWRI genes identified in the G. barbadense genome, nine were located on the five At subgenome chromosomes (two on A02, one on A04, two on A05, two on A10, and two on A13), four were mapped to the three Dt subgenome chromosomes (two on D04, one on D05, and one on D10), and four were located on scaffolds (Figure ). Most of WRI-like genes were distributed evenly on the chromosomes (Figure and Table ), which provided a clue to their evolution. Chromosomal localization of WRI-like genes in four Gossypium species. A total of 69 WRI-like genes were mapped onto different chromosomes or scaffolds of G. arboreum (A), G. raimondii (B), G. hirsutum (C), and G. barbadense (D). The scale represents the megabases (Mb).

Analysis of Gene Duplication and Loss of WRI-Like Genes

Large-scale duplication events have occurred during Gossypium evolution progress (Paterson et al., 2012). Gene duplication events, including tandem and segmental duplications, are considered as the major forces for expansion of gene families. In contrast to Arabidopsis, cacao, and rice, the WRI-like genes were expanded in cotton (Figure ). We investigated the possible tandem and segmental duplication events of WRI-like genes in the four cotton species, respectively (Table and Supplementary Table ). Among them, no duplicated gene pairs were found in genome of G. raimondii and G. arboretum. In G. hirsutum, nine duplicated gene pairs were found to be segmental duplication events. In G. barbadense, six duplicated gene pairs were found, containing five segmental duplication events and one tandem event. These results indicated that segmental duplication were the main driving forces of the in the expansion of the WRI-like gene family. Ka and Ks calculations of the WRI-like duplicated gene pairs. During the process of evolution, gene pairs are subject to three alternative fates, i.e., non-functionalization, subfunctionalization, and neofunctionalization (Lynch and Conery, 2000). In this study, the Ka/Ks ratios for 15 duplicated WRI-like gene pairs were calculated (Table ). The Ka/Ks ratios of ten pairs were less than 1, which suggests that these duplicated WRI-like genes have mainly experienced purifying selection pressure. The Ka/Ks ratios of other five pairs were more than 1, indicating positive selection pressure in the progress of evolution. Then, WRI-like gene conservation and loss were analyzed based on the best match and the syntenic blocks in the CottonGen website (Figure and Supplementary Table ). Four homologous WRI-like clusters were ultra-conserved in four cotton species (Figure and Supplementary Table ). Ten homologous WRI-like genes were lost from the At, Dt or both subgenomes of G. barbadense and two were lost from G. arboretum (Figure and Supplementary Table ). Additionally, two genes were only present in G. barbadense (Figure and Supplementary Table ). This indicated that the GbWRIs and GaWRIs experienced a higher frequency of genic sequence losses than GhWRIs and GrWRIs. Gene conservation and loss analysis of the WRI-like genes in Gossypium. (A) Ultra-conserved homologous WRI-like clusters in four cotton species. (B) Homologous WRI-like genes lost from one and/or two cotton species. (C) WRI-like genes only present in G. barbadense. Statistics and scenarios of gene conservation and loss analysis in the four cotton species. The observed genes were indicated as solid lines and balls, and the lost genes were indicated as dotted lines and balls. The number of gene clusters found in the four cotton species is provided below each graphic.

Gene Structure and Protein Domain Analyses of WRI-Likes in G. hirsutum

Generic Feature Format files of the four Gossypium species and a phylogenetic tree of deduced amino acids of GhWRIs were used to analyze the similarity and diversity of their exon–intron structures (Figure ). The AtWRI2 gene contained 10 introns and 11 exons, whereas WRI2 subfamily genes GhWRI2a (Gh_A02G1061) and GhWRI2b (Gh_D03G0620) harbored seven introns and eight exons. AtWRI1, AtWRI3, and AtWRI4 genes contained seven introns and eight exons. In contrast, most GhWRI1, GhWRI3/GhWRI4, and GhWRI2-like family genes fell into two categories: those containing five introns and six exons, and those having six introns and seven exons. GhWRI3b (Gh_D04G0842), GhWRI3c (Gh_A05G0024), GhWRI3d (Gh_A05G0999), GhWRI3f (Gh_D05G1117), GhWRI2-likeb (Gh_A05G3160), GhWRI2-likec (Gh_A07G1973), GhWRI2-liked (Gh_D07G2191), GhWRI2-likef (Gh_A09G0219), GhWRI2-likei (Gh_A12G1529), and GhWRI2-likej (Gh_D12G1652) contained six introns and seven exons. GhWRI1a (Gh_A10G1731), GhWRI1b (Gh_D10G2551), GhWRI3a (Gh_A04G1351), GhWRI3e (Gh_D05G0071), GhWRI2-likee (Gh_A09G0218), and GhWRI2-likeg (Gh_D09G0206) contained five introns and six exons. Four genes had unique intron–exon compositions: GhWRI1c (Gh_A13G0020) with 20 introns and 21 exons, GhWRI1d (Gh_D13G0036) with 24 introns and 25 exons, GhWRI2-likea (Gh_D04G0466) with four introns and five exons, and GhWRI2-likeh (Gh_D09G0207) containing three introns and four exons. Phylogenetic relationships and gene structures of GhWRIs. Phylogeny of 22 GhWRI proteins in G. hirsutum and Arabidopsis. The phylogenetic tree (Left) was constructed using the neighbor-joining (NJ) method with 1,000 bootstrap replicates. The exon–intron structure (Right) of GhWRI genes was drawn using the GSDS server. Black boxes show exons, and lines show introns. The scale bar is shown at the bottom. To better understand the similarity and diversity of GhWRI protein structures, their putative protein domains were predicted using the SMART database. The WRI-like proteins belonged to the AP2-EREPB family of transcription factors (Cernac and Benning, 2004; To et al., 2012). As shown in Figure , most GhWRIs contained two AP2 domains; the exceptions were GhWRI1d, GhWRI2a, GhWRI2b, and GhWRI2-likeh, all having only one each. Interestingly, we also found many other putative protein domains in GhWRI1c and GhWRI1d that need to be further verified. GhWRI protein domain prediction. Potential AP2 domains of GhWRI proteins were identified using the SMART database.

Tissue-Specific Expression Profiles of GhWRI Genes

The expression pattern of a gene can be a direct indication of its involvement in developmental or differential events (Zang et al., 2017). To reveal the tissue-specific expression profiles of the 22 GhWRI genes identified in this study, published TM-1 expression data (Zhang et al., 2015) were used to analyze the transcript profiles of GhWRI genes in 22 cotton tissues (Supplementary Figure ). GhWRI genes from WRI1, WRI2, and WRI3/WRI4 subfamilies were widely detected in different tissues, whereas GhWRI genes from the WRI2-like subfamily exhibited very low expression levels in most tissues. Interestingly, we found GhWRI1a and GhWRI1b (gene pairs from the corresponding At and Dt subgenome) were highly expressed in 20–35 DPA ovules (Figure and Supplementary Figure ). GhWRI1a was thus selected for further functional analysis. Tissue-specific expression profile of GhWRI1a in different tissues of G. hirsutum accession TM-1. The ΔCt value of GhWRI1a in root was set as the control. The data presented are the means ± SD of three replicates.

Ectopic Expression of GhWRI1a Rescued the Seed Phenotype of the wri1-7 Mutant and Increased the Oil Content of Arabidopsis Seeds

To characterize the biological functions of GhWRI1a in regard to oil content, we generated transgenic Arabidopsis plants overexpressing GhWRI1a. qRT-PCR was performed to analyze relative expression levels of GhWRI1a in transgenic Arabidopsis using cDNA from three different transgenic lines and WT as templates (Figure ). GhWRI1a was highly expressed in the transgenic lines. To evaluate the applicability of GhWRI1a in transgenic breeding for oil content, we characterized the phenotypes of GhWRI1a transgenic Arabidopsis at different developmental stages. No visible difference between transgenic and WT plants was observed (data not shown). To determine whether GhWRI1a had increased the oil content, we compared the oil contents of transgenic and WT plants. Significantly increased oil content, 6.96–14.24% higher, was observed in the transgenic plants (Figure ). Improved oil content of GhWRI1a transgenic plants. (A) Relative expression level of GhWRI1a in three transgenic Arabidopsis lines (L1, L2, and L3). The ΔCt value of GhWRI1a in transgenic line L1 was set as the control. The data presented are the means ± SD of three replicates. (B) Seed oil content of GhWRI1a transgenic lines (L1, L2, and L3) and WT. The data presented are the means ± SD of three replicates; ∗P < 0.05; ∗∗P < 0.01 (Student’s t-test). In order to determine whether the GhWRI1a transcription factor is involved in the activation of the whole fatty acid biosynthetic pathway, we created an atwri1 mutant named wri1-7 by the CRISPR method. DNA sequence comparison revealed the presence of a 722 bp deletion and a single adenine (A) insertion from the first to the fifth exon in the wri1-7 mutant (Figure and Supplementary Figure ). Microscopic observation of mature dry seeds of the wri1-7 mutant also revealed a wrinkled phenotype (Figure ), similar to previously reported wri1 mutant seeds (Cernac and Benning, 2004; To et al., 2012). The ability of the overexpression constructs to complement the seed phenotype of the wri1-7 mutant was confirmed by crossing L1, L2, and L3 transgenic plants with the wri1-7 mutant. Over accumulation of GhWRI1a RNA in the transgenic lines was verified by qRT-PCR (Figure ). Microscopic observation of mature dry seeds revealed a reversion to the wrinkled phenotype in wri1-7 seeds overexpressing GhWRI1a (Figure ). An analysis of total oil content of the dry seeds confirmed the ability of GhWRI1a to efficiently activate fatty acid biosynthesis and to thus complement the oil accumulation of wri1-7 seeds (Figure ). Arabidopsis wri1-7 mutant displaying a wrinkled phenotype rescued by overexpression of GhWRI1a. (A) Isolation and molecular characterization of the wri1-7 mutant. (B) Relative expression level of GhWRI1a in three GhWRI1a-complemented lines (C-L1, C-L2, and C-L3). The ΔCt value of GhWRI1a in complemented line C-L1 was set as the control. The data presented are the means ± SD of three replicates. (C) Rescue of the wrinkled phenotype of mature wri1-4 seeds by overexpression of GhWRI1a. Dry seeds from complemented lines were observed by stereoscopic microscopy. Representative seeds are shown. Bar = 0.2 mm. (D) Seed oil content of complemented lines (C-L1, C-L2, and C-L3), wri1-7, and WT. The data presented are the means ± SD of three replicates; ∗∗P < 0.01 (Student’s t-test).

Discussion

Numerous studies have revealed a crucial role for WRI-like genes in TAG biosynthesis, including GhWRI1 corresponding to GhWRI1b (Liu et al., 2018). Nevertheless, the naming of WRI-like family genes in cotton is confusing, and their systematic exposition is incomplete. In this study, we have accomplished the first-ever identification of WRI-like genes in four representative types of cotton, i.e., allotetraploid cotton species G. hirsutum and G. barbadense and their diploid ancestors G. arboreum, and G. raimondii. Our findings provide significant insights into the sequence variation, adaptive evolution, protein domains, expression profiles, co-localization with QTLs and GhWRI1a functions in cotton. Our analysis revealed details of 22 deduced GhWRIs, most of which contain two AP2 domains, with only four GhWRIs (GhWRI1d, GhWRI2a, GhWRI2b, and GhWRI2-likeh) having just one AP2 domain (Figure ). The WRI-like gene family is a branch of the AP2/EREBP (APETALA2/ethylene responsive element binding protein) transcription factor family. The AP2/EREBP family is one of the largest plant transcription factor families and plays an important role in plant growth and development (Okamuro et al., 1997; Riechmann et al., 2000; Zhou et al., 2013). This superfamily, comprising AP2, EREBP, and RAV subfamilies, is defined by the AP2/ERF DNA binding domain. AP2 family proteins contain two repeated AP2/ERF domains, EREBP family proteins have a single AP2/ERF domain, and RAV family proteins possess a B3 DNA-binding domain in addition to a single AP2/ERF domain (Sakuma et al., 2002; Feng et al., 2005; Nakano et al., 2006). AtWRI1, AtWRI2, AtWRI3, and AtWRI4 proteins all belong to the AP2 subfamily (Feng et al., 2005). These proteins generally contain two repeated AP2/ERF domains, the exception is AtWRI2, which was found to possess only one AP2/ERF domain (Supplementary Figure ), consistent with previous reports (Nakano et al., 2006). Consequently, GhWRI1d, GhWRI2a, GhWRI2b, and GhWRI2-likeh, which contain only one AP2 domain, are typical representatives of the AP2 subfamily. The WRI-like genes identified in this study belong to the AP2 subfamily of the AP2/EREBP family. Cottonseed oil accumulates in ovules after 15 DPA. At this stage, most GhWRIs were found to be expressed in our study. GhWRI1a and GhWRI1b had the highest expression levels (Supplementary Figure ), indicating that these two genes play important roles in TAG biosynthesis in developing cotton seeds. In this study, we demonstrated that ectopic expression of GhWRI1a could rescue the seed phenotype of the wri1-7 mutant and increase the oil content of Arabidopsis seeds. In addition, four WRI-like genes were localized in cottonseed oil QTL intervals, which suggests their association with natural variation in cottonseed oil content. We further discovered that GhWRIs were expressed in developing fibers. GhWRI1a and GhWRI1b, in particular, were highly expressed in 25-DPA developing fibers (Figure and Supplementary Figure ), suggesting their additional involvement in fiber development. Other studies of upland cotton have also indicated the involvement of GhWRIs in fiber length (Qu et al., 2012; Qaisar et al., 2017). The regulatory relationship between GhWRIs and fiber development needs to be further verified. In short, we have performed a comprehensive genome-wide analysis of the WRI-like gene family in G. hirsutum, G. barbadense, G. raimondii, and G. arboreum. A total of 69 WRI-like genes grouped into four distinct subfamilies were identified in four sequenced Gossypium species. Our detailed analysis has established a solid foundation for further studies of WRI-like genes in cotton.

Author Contributions

JY and XZ directed the experiments. WP, MW, YG, NW, GL, JM, DL, YC, XL, and JZ participated in the study. XZ conceived the study, performed the experiments and wrote the manuscript. JY and JZ revised the manuscript. All authors read and approved the final manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Table 1

Characteristics of WRI-like genes and predicted properties of WRI-like proteins.

Family nameGene nameGene identifier (NAU)Chromosomal localizationSize (AA)MW (KD)pI
WRI1GhWRI1aGh_A10G1731A1043448.95345.54
GhWRI1bGh_D10G2551D1043549.02645.54
GhWRI1cGh_A13G0020A1377690.30368.00
GhWRI1dGh_D13G0036D131607182.10118.19
GbWRI1aGbscaffold22373.8.1A1043749.34285.54
GbWRI1b.1Gbscaffold14438.14.0scaffold14438_d1043849.41495.69
GbWRI1b.2Gbscaffold14438.14.1scaffold14438_d1028732.52544.47
GbWRI1b.3Gbscaffold14438.14.2scaffold14438_d1034939.47568.52
GbWRI1c.1Gbscaffold18152.14.0A1323025.90514.79
GbWRI1c.2Gbscaffold18152.14.1A1322825.68905.01
GbWRI1dGbscaffold20501.18.0A1339043.47805.91
GaWRI1aCotton_A_24703CA_chr9/A1043749.3835.69
GrWRI1aCotton_D_gene_10029828Chr5/D0243549.11055.40
GrWRI1bCotton_D_gene_10024797Chr13/D1339443.82645.69
WRI2GhWRI2aGh_A02G1061A0243247.87078.59
GhWRI2bGh_D03G0620D0342747.33518.69
GbWRI2aGbscaffold3103.5.0A0242246.84078.67
GbWRI2bGbscaffold1219.3.0A0242246.82578.84
GbWRI2cGbscaffold9581.6.0A0512414.478810.22
GaWRI2aCotton_A_37619CA_chr2/A0142246.82678.67
GrWRI2aCotton_D_gene_10038477Chr4/D0841946.46538.69
WRI3/WRI4GhWRI3aGh_A04G1351A0439144.63197.52
GhWRI3bGh_D04G0842D0436141.19747.25
GhWRI3cGh_A05G0024A0537842.88007.56
GhWRI3dGh_A05G0999A0535139.69357.14
GhWRI3eGh_D05G0071D0537842.80507.86
GhWRI3fGh_D05G1117D0535139.60757.37
GbWRI3aGbscaffold25274.2.0A0427131.12409.77
GbWRI3b.1Gbscaffold1205.6.0D0421324.03844.47
GbWRI3b.2Gbscaffold1205.6.1D0436341.35057.27
GbWRI3c.1Gbscaffold2524.5.0A0535440.06897.14
GbWRI3c.2Gbscaffold2524.5.1A0535139.69357.14
GbWRI3d.1Gbscaffold12660.29.0D0535439.98297.37
GbWRI3d.2Gbscaffold12660.29.1D0535239.69866.95
GbWRI3eGbscaffold22373.8.0A1033137.76654.46
GbWRI3fGbscaffold18379.5.0scaffold1837926129.30035.64
GaWRI3aCotton_A_16267CA_chr12/A0437041.86898.29
GaWRI3bCotton_A_41232CA_chr12/A0436441.46077.00
GaWRI3cCotton_A_17105CA_chr10/A0535139.70367.14
GrWRI3aCotton_D_gene_10016054Chr9/D0537342.18827.91
GrWRI3bCotton_D_gene_10007101Chr9/D0535139.57257.66
GrWRI3cCotton_D_gene_10021087scaffold21136841.93017.00
WRI-likeGhWRI2-likeaGh_D04G0466D0443249.55709.44
GhWRI2-likebGh_A05G3160A0537842.57907.91
GhWRI2-likecGh_A07G1973A0736541.42919.63
GhWRI2-likedGh_D07G2191D0737442.98319.56
GhWRI2-likeeGh_A09G0218A0938744.40538.80
GhWRI2-likefGh_A09G0219A0927831.85999.73
GhWRI2-likegGh_D09G0206D0938744.25308.50
GhWRI2-likehGh_D09G0207D0918020.37539.98
GhWRI2-likeiGh_A12G1529A1239044.11069.96
GhWRI2-likejGh_D12G1652D1238743.43609.84
GbWRI2-likeaGbscaffold17450.12.0D0412414.502810.22
GbWRI2-likebGbscaffold19204.1.0scaffold1920438143.22889.96
GbWRI2-likecGbscaffold1804.8.0scaffold180436541.33009.54
GbWRI2-likedGbscaffold259.12.0scaffold25938743.46209.77
GaWRI2-likeaCotton_A_29003CA_chr12/A0437842.58307.91
GaWRI2-likebCotton_A_19437CA_chr9/A0732036.39949.99
GaWRI2-likecCotton_A_28204CA_chr11/A0938644.10498.52
GaWRI2-likedCotton_A_06134CA_chr6/A1239044.03669.92
GrWRI2-likeaCotton_D_gene_10014405Chr6/D0928232.24749.85
GrWRI2-likebCotton_D_gene_10014404Chr6/D0938744.17808.89
GrWRI2-likecCotton_D_gene_10008870Chr8/D1238743.51719.84
GrWRI2-likedCotton_D_gene_10002133scaffold38436841.83269.51
GrWRI2-likeeCotton_D_gene_10001570scaffold48438644.02768.52
Table 2

Ka and Ks calculations of the WRI-like duplicated gene pairs.

SpeciesGene1Gene2KaKsKa/KsDuplicated type
G. hirsutumGhWRI1aGhWRI1b0.02840.07540.376658Segmental duplication
GhWRI2aGhWRI2b0.99950.91211.095823Segmental duplication
GhWRI3aGhWRI3b1.52931.51281.010907Segmental duplication
GhWRI3cGhWRI3e0.01260.06130.205546Segmental duplication
GhWRI3dGhWRI3f0.00990.04360.227064Segmental duplication
GhWRI2-likeaGhWRI2-likeb2.59912.31471.122867Segmental duplication
GhWRI2-likecGhWRI2-liked0.89330.78831.140432Segmental duplication
GhWRI2-likeeGhWRI2-likeg0.01350.03500.385714Segmental duplication
GhWRI2-likeiGhWRI2-likej0.14200.15020.945406Segmental duplication
G. barbadenseGbWRI1aGbWRI1b.10.02510.06360.394654Segmental duplication
GbWRI2aGbWRI2b0.00520.01020.509804Tandem duplication
GbWRI3aGbWRI3b.10.01800.02480.725806Segmental duplication
GbWRI3c.1GbWRI3d.10.00980.03870.253230Segmental duplication
GbWRI2-likeaGbWRI2c0.01660.03000.553333Segmental duplication
GbWRI2-likebGbWRI2-liked0.13180.13121.004573Segmental duplication
  33 in total

1.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method.

Authors:  K J Livak; T D Schmittgen
Journal:  Methods       Date:  2001-12       Impact factor: 3.608

2.  An annotation update via cDNA sequence analysis and comprehensive profiling of developmental, hormonal or environmental responsiveness of the Arabidopsis AP2/EREBP transcription factor gene family.

Authors:  Jian-Xun Feng; Di Liu; Yi Pan; Wei Gong; Li-Geng Ma; Jing-Chu Luo; Xing Wang Deng; Yu-Xian Zhu
Journal:  Plant Mol Biol       Date:  2005-12       Impact factor: 4.076

3.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors:  Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

4.  Ectopic expression of TaOEP16-2-5B, a wheat plastid outer envelope protein gene, enhances heat and drought stress tolerance in transgenic Arabidopsis plants.

Authors:  Xinshan Zang; Xiaoli Geng; Kelu Liu; Fei Wang; Zhenshan Liu; Liyuan Zhang; Yue Zhao; Xuejun Tian; Zhaorong Hu; Yingyin Yao; Zhongfu Ni; Mingming Xin; Qixin Sun; Huiru Peng
Journal:  Plant Sci       Date:  2017-01-20       Impact factor: 4.729

5.  DNA-binding specificity of the ERF/AP2 domain of Arabidopsis DREBs, transcription factors involved in dehydration- and cold-inducible gene expression.

Authors:  Yoh Sakuma; Qiang Liu; Joseph G Dubouzet; Hiroshi Abe; Kazuo Shinozaki; Kazuko Yamaguchi-Shinozaki
Journal:  Biochem Biophys Res Commun       Date:  2002-01-25       Impact factor: 3.575

6.  High-stearic and High-oleic cottonseed oils produced by hairpin RNA-mediated post-transcriptional gene silencing.

Authors:  Qing Liu; Surinder P Singh; Allan G Green
Journal:  Plant Physiol       Date:  2002-08       Impact factor: 8.340

7.  WRINKLED transcription factors orchestrate tissue-specific regulation of fatty acid biosynthesis in Arabidopsis.

Authors:  Alexandra To; Jérôme Joubès; Guillaume Barthole; Alain Lécureuil; Aurélie Scagnelli; Sophie Jasinski; Loïc Lepiniec; Sébastien Baud
Journal:  Plant Cell       Date:  2012-12-14       Impact factor: 11.277

8.  Ectopic Expression of WRINKLED1 Affects Fatty Acid Homeostasis in Brachypodium distachyon Vegetative Tissues.

Authors:  Yang Yang; Jacob Munz; Cynthia Cass; Agnieszka Zienkiewicz; Que Kong; Wei Ma; John Sedbrook; Christoph Benning
Journal:  Plant Physiol       Date:  2015-09-29       Impact factor: 8.340

9.  Genome sequence of the cultivated cotton Gossypium arboreum.

Authors:  Fuguang Li; Guangyi Fan; Kunbo Wang; Fengming Sun; Youlu Yuan; Guoli Song; Qin Li; Zhiying Ma; Cairui Lu; Changsong Zou; Wenbin Chen; Xinming Liang; Haihong Shang; Weiqing Liu; Chengcheng Shi; Guanghui Xiao; Caiyun Gou; Wuwei Ye; Xun Xu; Xueyan Zhang; Hengling Wei; Zhifang Li; Guiyin Zhang; Junyi Wang; Kun Liu; Russell J Kohel; Richard G Percy; John Z Yu; Yu-Xian Zhu; Jun Wang; Shuxun Yu
Journal:  Nat Genet       Date:  2014-05-18       Impact factor: 38.330

10.  A Genome-Scale Analysis of the PIN Gene Family Reveals Its Functions in Cotton Fiber Development.

Authors:  Yuzhou Zhang; Peng He; Zuoren Yang; Gai Huang; Limin Wang; Chaoyou Pang; Hui Xiao; Peng Zhao; Jianing Yu; Guanghui Xiao
Journal:  Front Plant Sci       Date:  2017-03-30       Impact factor: 5.753

View more
  2 in total

1.  Identification and analysis of oil candidate genes reveals the molecular basis of cottonseed oil accumulation in Gossypium hirsutum L.

Authors:  Zhibin Zhang; Juwu Gong; Zhen Zhang; Wankui Gong; Junwen Li; Yuzhen Shi; Aiying Liu; Qun Ge; Jingtao Pan; Senmiao Fan; Xiaoying Deng; Shaoqi Li; Quanjia Chen; Youlu Yuan; Haihong Shang
Journal:  Theor Appl Genet       Date:  2021-10-29       Impact factor: 5.699

Review 2.  Genetics, Breeding and Genetic Engineering to Improve Cottonseed Oil and Protein: A Review.

Authors:  Man Wu; Wenfeng Pei; Tom Wedegaertner; Jinfa Zhang; Jiwen Yu
Journal:  Front Plant Sci       Date:  2022-03-10       Impact factor: 5.753

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.