Literature DB >> 31510067

Genome-Wide Identification and Expression Profiles of Late Embryogenesis-Abundant (LEA) Genes during Grain Maturation in Wheat (Triticum aestivum L.).

Datong Liu1, Jing Sun2, Dongmei Zhu3, Guofeng Lyu4, Chunmei Zhang5, Jian Liu6, Hui Wang7, Xiao Zhang8, Derong Gao9.   

Abstract

Late embryogenesis-abundant (LEA) genes play important roles in plant growth and development, especially the cellular dehydration tolerance during seed maturation. In order to comprehensively understand the roles of LEA family members in wheat, we carried out a series of analyses based on the latest genome sequence of the bread wheat Chinese Spring. 121 Triticum aestivum L. LEA (TaLEA) genes, classified as 8 groups, were identified and characterized. TaLEA genes are distributed in all chromosomes, most of them with a low number of introns (≤3). Expression profiles showed that most TaLEA genes expressed specifically in grains. By qRT-PCR analysis, we confirmed that 12 genes among them showed high expression levels during late stage grain maturation in two spring wheat cultivars, Yangmai16 and Yangmai15. For most genes, the peak of expression appeared earlier in Yangmai16. Statistical analysis indicated that expression level of 8 genes in Yangmai 16 were significantly higher than Yangmai 15 at 25 days after anthesis. Taken together, our results provide more knowledge for future functional analysis and potential utilization of TaLEA genes in wheat breeding.

Entities:  

Keywords:  genome-wide identification; grain maturation; late embryogenesis-abundant gene; wheat (Triticum aestivum L.)

Mesh:

Substances:

Year:  2019        PMID: 31510067      PMCID: PMC6770980          DOI: 10.3390/genes10090696

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


1. Introduction

The dehydration process of embryos during grain maturation is the terminal event of seed development, correlating with the acquisition of desiccation tolerance and with the specific induction of highly abundant hydrophilic proteins described as Late Embryogenesis Abundant (LEA) [1,2]. This process indicates the “switch” of seed transformation from development to germination, acting as an adaptability strategy of angiosperm to ensure the survival of seeds during storage or the environmental stress of long-term evolution [3,4]. Seeds are classified as orthodox and recalcitrant based on their storage behavior and desiccation tolerance in the drying process [5]. Orthodox seeds undergo mature dehydration on the maternal plants. Their water content is usually low and can be further dried to 1 to 5% moisture without damage [4,6]. LEA was first isolated from cotyledons of cotton at the late embryonic development stage [7]. In the following 30 years, LEA proteins have been detected in a variety of higher plant development seeds [8,9,10,11,12,13]. The LEA family is usually composed of multiple members in plants. Previous study identified at least 13 and 34 LEA homologous genes in maize [14] and rice, respectively [15]. With the development of genome-wide sequencing technology, the LEA gene family has been further elucidated in many other species, such as Solanum lycopersicum (27 LEA genes) [16], Dendrobium officinale (Orchidaceae) (17) [17], Brassica mapus (108) [18], Amaranth (105) [19], and Solanum Tuberosum (74) [13]. Furthermore, LEA genes are not unique to plants. They have been also identified in fungi, bacteria and animals [20]. LEA proteins have small molecular weights from 10 to 30 kDa. They are enriched in glycine and other hydrophilic amino acids, resulting in high hydrophilicity and thermal stability [21]. LEA proteins consist of a non-periodic linear and facultative α-helix which formed the main hydrophobic interaction between monomers without thermodynamically dominant state [8,22]. According to the similarity, LEA family is generally divided into 8 groups, namely LEA1, LEA2, LEA3, LEA4, LEA5, LEA6, dehydrin and SMP (seed maturation protein) [23]. Recently, ASR proteins (abscisic stress ripening) were classified as LEA7 group [24,25], extending LEA family to 9 groups. The division of family members is not consistent in different species [26]. While the sequence conservation implies their conserved role in desiccation tolerance [27]. Moreover, LEA proteins were reported to be widely involved in plant stress response in Arabidopsis, tobacco, grape, and other species [28]. In monocotyledons such as maize, wheat and rice, LEA expression can be induced by varieties of abiotic stresses to improve plant tolerance [29,30,31], which is beyond the scope of this article. Wheat, as an important staple crop in the world, is becoming the focus of global food security issues [32,33]. In China, due to changes of climate and cultivation methods, new requirements for early maturing traits and dehydration characteristics of winter wheat have been put forward recently [34,35]. The moisture content of wheat seeds, like orthodox seeds, decline up to 40% quickly after physiological ripening. The speed of grain dehydration affects the desiccation tolerance and is highly related to the harvest time after maturity. Rapid dehydration after maturity is able to avoid the bad meteorological conditions in late stages of seed maturation, reduce the occurrence of pre-harvest sprouting, and shorten the cost of drying after harvest. Wheat is an allohexaploid species with a large and complex genome consisting of three closely related sub genomes (A, B, and D). The continuous improvement and release of genomic data promoted the identification and analysis of wheat genes genome-widely [36,37,38,39,40,41,42]. The identification and use of the elite gene resource may help us improve the stress tolerance of wheat and solve the problems caused by changing global climate and food security status [42]. In this study, we systematically characterized the LEA genes in wheat based on the latest genome sequences. The genome composition, phylogeny, conserved motifs, chromosome locations, and the expression pattern of TaLEA proteins were systematically analyzed. These results will not only help us to reveal the molecular mechanism of mature dehydration in wheat and other cereal crops, but also provide a theoretical basis for their application to promote crop variety improvement.

2. Materials and Methods

2.1. Identification and Phylogenetic Analysis of Late Embryogenesis-Abundant (LEA) Genes in Wheat Genomes

The nucleotide and protein sequence in the whole-genome of T. aestivum was downloaded from the wheat genome URGI database (https://wheat-urgi.versailles.inra.fr/Seq-Repository). These sequences were first used to construct a local protein database with which to search against known LEA protein sequences collected from A. thaliana (51) [9] and O. sativa (34) [15], through a local protein basic local alignment search (BLASTP) program (https://blast.ncbi.nlm.nih.gov) with an E-value cut-off <10−10 and an identity of 50% as the threshold. The Hidden Markov Model (HMM) which was constructed using HMMER 3.0 program was used to search all the encoding protein sequences of wheat to find all predicted LEA family members. A self-blast of these sequences was performed by comparing of HMMER and BLAST hits to remove the redundancy, and no alternative splice variants were considered. The aligned sequences were then used as candidate LEA family sequences. The obtained candidate sequences were subjected to domain annotation of the target sequence using the software PfamScan and Pfam A databases, and the sequences containing the complete LEA-1 (Pfam ID PF03760.14), LEA-2 (PF03168.12), LEA-3 (PF03242.12), LEA-4 (PF02987.15), LEA-5 (PF00477.16), LEA-6 (PF10714.8), Dehydrin (PF00257.18), and SMP (PF04927.11) domains were identified as the final wheat LEA family sequences. The identified wheat LEA proteins were termed according to the subfamily it belongs and the order of gene ID. The biochemical parameters such as theoretical isoelectric point (PI), molecular weight (MW), amino acid number, instability, and hydrophilicity of the obtained proteins were calculated by the ProtParam (http://web.expasy.org/protparam/). To investigate the evolutionary relationships among these TaLEA genes, multiple sequence alignment was performed using MAFFT (Multiple Alignment using Fast Fourier Transform: http://mafft.cbrc.jp/alignment/software/) with the default parameters. A tree was constructed based on the full-length protein sequences using the Neighbor-Joining (NJ) method with Partial deletion and p-distance model, and a Bootstrap test of 1000 replicates for internal branch reliability. Another two phylogenetic trees for LEA protein family of T. aestivum, O. sativa, and A. thaliana were generated using Maximum-Likehood [ML, by FastTree (an open-source software developed by Morgan N. Price in Adam Arkin’s group at Lawrence Berkeley National Lab, Berkeley, USA) with default parameters] and Maximum-Parsimony (MP, by MEGA7.0 software [43] with the Max No. of Trees to Retain as 100) method to better understand the evolution of LEA genes.

2.2. Structural Characterisation of Wheat LEA

The exon-intron structures of the TaLEA family genes were determined based on alignments of their coding sequences with the corresponding genomic sequences, and a diagram was obtained using GSDS (Gene structure display server: http://gsds.cbi.pku.edu.cn/). To better understand the similarity and diversity of protein motifs, MEME (Multiple Expectation Maximization for Motif Elicitation: http://meme.nbcr.net/meme) was used to identify the conserved motif structures encoded by each group of TaLEA family genes. The parameters as follow: optimum motif widths of 6–50 residues and a maximum of 15 motifs. The schematic diagram of the amino acid motifs for each TaLEA gene was drawn accordingly.

2.3. Distribution of LEA Genes on Wheat Chromosomes

TaLEA were mapped on wheat chromosomes according to the positional information of the TaLEA genes from the wheat genome annotation datebase, and the chromosome physical location map was displayed using MG2C (Map Gene 2 Chromosome V2: http://mg2c.iask.in/mg2c_v2.0/).

2.4. Expression Profile Analysis of Wheat LEA Genes

The publicly available wheat RNA-Seq datasets used for generating gene expression levels were downloaded from the Wheat Expression Browser (http://www.wheat-expression.com), then used to analyze the expression profiles of the identified TaLEA genes. These data are from the studies of developmental time-course of common hexaploid wheat variety Azhurnaya and Chinese Spring. The results were summarized according to high expression level tissues as grains, roots, leaves/shoots, and spikes. The original TPM (transcripts per kilobase million) of each gene plus 0.0001 to prevent an expression value of ‘0’. The heat map was drawn in Log10 (tpm + 0.0001)-transformed expression values according to the standardization. Genes highly expressed in the grain were selected for qRT-PCR detection.

2.5. Plant Materials, Growth Conditions, and Sampling

Based on many years production experience and research, Yangmai16 had a faster grain filling speed, earlier maturity compared with Yangmai15 [35]. Therefore, these two spring wheat varieties Yangmai16 and Yangmai15, which were bred by the author’s unit, were planted and cultivated under the same condition. At the flowering stage, two individual florets at the base of the center ears were marked with a marker pen and the plants were tagged with anthesis dates. Wheat grains were sampled at 10, 15, 20, 25, 30, and 35 DAA (days after anthesis) for extraction of total RNA and further determination.

2.6. Quantitative Real-Time Polymerase Chain Reaction Analysis of TaLEA Proteins

In order to identify the expression profile of TaLEA gene in different wheat cultivars, and to explain the relationship between the LEA protein and maturity, as well as the grain production, high expression level LEA genes were collected for quantitative real-time polymerase chain reaction (qRT-PCR) analysis. Total RNA was extracted from the seeds by Trizol reagent (Biotech, Shanghai), and the first strand of cDNA was synthesized using Transcriptor First Strand cDNA Synthesis Kit (Roche, Penzberg, Germany). The wheat gene Actin was used as an internal reference, and the selected 12 TaLEA genes, including TaDehydrin-1, TaDehydrin-3, TaDehydrin-5, TaDehydrin-7, TaLEA1-1, TaLEA1-3, TaLEA4-13, TaLEA5-1, TaLEA5-3, TaLEA5-4, TaSMP-1 and TaSMP-2, were subjected to qRT-PCR using a fluorescence quantitative reagent Ultra SYBR Mixture (Kang Wei Century, Beijing) on a Roch Light-Cycler 480 fluorescence quantitative PCR machine (Roche, Penzberg, Germany); three independent biological replicates. The primer sequences used are listed in Table S1.

2.7. Determination of the Dry Weight and Moisture Content of Grains

Sampled grains were used for measurements of grain dry weight and moisture content. The fresh grains then dried at 70 °C to constant weight were weighed for the fresh weight (W) and dry weight (W). Moisture content of grains was calculated by (W − W/W × 100%. The processes of grain filling were fitted by Logistic’ equation as described by [44]:f(x) = a/(1 + b × exp(−c × x)) The dehydration progress was fitted by Gompertz equation as described by [45]:f(x) = a × exp(−exp(−b × (x − c)))

3. Results

3.1. Genome-Wide Identification and Phylogenetic Analysis of Wheat LEA Genes

Based on genome-wide blast, a total of 121 non-redundant LEA homologous containing the complete LEA, Dehydrin, and SMP domains were identified from the hexaploid wheat genome, which were considered as the putative wheat LEA genes. By phylogenetic analysis, these 121 genes were divided into 8 groups (groups LEA1–LEA6, Dehydrin, and SMP) (Table 1). The largest group Dehydrin contains 47 members, while the smallest groups LEA3 and LEA6 have only two members. Groups LEA1, LEA2, LEA4, LEA5, and SMP contain 19, 15, 15, 4, and 17 genes, respectively.
Table 1

Description of Late Embryogenesis Abundant (LEA) genes identified from the wheat genome.

Gene_NameGene_IDAmino Acid NumberMWpIInstability IndexAliphatic IndexGRAVY
TaDehydrin-1TraesCS3A02G25460021321,831.766.2535.6346.76−0.83
TaDehydrin-10TraesCS5A02G36980014314,570.81834.5736.36−1.151
TaDehydrin-11TraesCS5A02G36990014014,240.478.013839.86−1.083
TaDehydrin-12TraesCS5A02G42480014915,217.659.3325.9432.21−1.087
TaDehydrin-13TraesCS5B02G37210014314,429.64834.2435.66−1.118
TaDehydrin-14TraesCS5B02G37220013814,218.468.0138.0338.99−1.104
TaDehydrin-15TraesCS5B02G42670015015,176.659.5231.6534.6−1.062
TaDehydrin-16TraesCS5B02G42680015015,221.659.3624.1432−1.063
TaDehydrin-17TraesCS5D02G37920014314,515.737.1133.9736.99−1.123
TaDehydrin-18TraesCS5D02G37930013313,930.28.8140.138.95−1.189
TaDehydrin-19TraesCS5D02G43320015215,345.799.5227.6632.24−1.05
TaDehydrin-2TraesCS3A02G39620027527,016.779.626.3237.13−0.775
TaDehydrin-20TraesCS5D02G43330015415,587.099.5225.931.17−1.024
TaDehydrin-21TraesCS5D02G519300LC26628,166.539.7646.8552.93−0.803
TaDehydrin-22TraesCS6A02G05980015315,511.878.843739.08−1.041
TaDehydrin-23TraesCS6A02G25330026828,823.055.2556.4363.73−0.997
TaDehydrin-24TraesCS6A02G35050022122,048.669.057.5923.48−1.071
TaDehydrin-25TraesCS6A02G35060016216,284.689.2217.1438.02−0.994
TaDehydrin-26TraesCS6A02G35070015215,522.717.1727.5331.51−1.176
TaDehydrin-27TraesCS6A02G35080014314,815.219.2237.636.36−1.012
TaDehydrin-28TraesCS6A02G35090019020,142.8710.7450.4635.63−1.274
TaDehydrin-29TraesCS6B02G27340025927,973.075.257.3962.93−1.054
TaDehydrin-3TraesCS3B02G28660021722,297.386.8735.4747.7−0.856
TaDehydrin-30TraesCS6B02G38320040540,293.096.83-0.2629.98−1.064
TaDehydrin-31TraesCS6B02G38350023123,229.059.222.8323.25−1.074
TaDehydrin-32TraesCS6B02G38360015815,838.179.1323.234.11−1.041
TaDehydrin-33TraesCS6B02G38380016616,704.048.0517.7133.01−1.054
TaDehydrin-34TraesCS6B02G695700LC16216,127.469.1913.9935−0.983
TaDehydrin-35TraesCS6B02G695800LC16216,127.469.1913.9935−0.983
TaDehydrin-36TraesCS6B02G695900LC16216,097.449.1914.9835.62−0.968
TaDehydrin-37TraesCS6D02G23470026228,155.25.1957.3461.83−1.051
TaDehydrin-38TraesCS6D02G33290023123,020.759.056.924.16−1.028
TaDehydrin-39TraesCS6D02G33300016216,195.518.0514.9233.21−1.019
TaDehydrin-4TraesCS3B02G42820027427,193.969.7424.6434.34−0.859
TaDehydrin-40TraesCS6D02G33310014414,512.729.1915.7933.96−1.076
TaDehydrin-41TraesCS6D02G33320015515,7298.8326.8134.06−1.125
TaDehydrin-42TraesCS6D02G33330016016,255.598.0721.6139.81−1.05
TaDehydrin-43TraesCS6D02G33360016716,712.027.1714.331.08−1.062
TaDehydrin-44TraesCSU02G08620015115,288.669.138.1542.19−1.011
TaDehydrin-45TraesCSU02G12220014914,855.026.8633.6140.13−0.947
TaDehydrin-46TraesCSU02G564500LC13012,865.89.0525.2532.38−0.968
TaDehydrin-47TraesCSU02G656800LC13012,865.89.0525.2532.38−0.968
TaDehydrin-5TraesCS3D02G25550021522,243.36.6337.5344.51−0.912
TaDehydrin-6TraesCS3D02G39020027527,156.859.723.9134.58−0.855
TaDehydrin-7TraesCS4A02G25090045543,740.88.84-6.0231.38−0.779
TaDehydrin-8TraesCS4B02G06420045743,891.988.84-3.8432.74−0.741
TaDehydrin-9TraesCS4D02G06310043041,223.149.040.8531.65−0.773
TaLEA1-1TraesCS2A02G44970011011,589.919.432.9439.45−1.056
TaLEA1-10TraesCS5D02G17730014614,589.987.1125.5357.81−0.588
TaLEA1-11TraesCS6B02G24490010210,650.638.0634.5842.55−1.089
TaLEA1-12TraesCS7A02G03010013113,910.819.8739.5354.66−0.852
TaLEA1-13TraesCS7A02G03030011512,294.959.8438.3658−0.886
TaLEA1-14TraesCS7A02G04250017519,508.110.3947.8952.23−1.061
TaLEA1-15TraesCS7A02G43920014714,4839.1330.3457.48−0.453
TaLEA1-16TraesCS7B02G33780014714,522.948.9330.451.5−0.555
TaLEA1-17TraesCS7D02G02630011912,561.29.8433.9456.13−0.845
TaLEA1-18TraesCS7D02G02640011512,306.889.9429.5252.96−0.95
TaLEA1-19TraesCS7D02G42880016916,768.619.5624.2855.86−0.442
TaLEA1-2TraesCS2A02G44980011011,406.599.1915.5936.82−1.047
TaLEA1-3TraesCS2B02G47150011011,481.728.6928.5139.45−0.993
TaLEA1-4TraesCS2B02G47160012012,363.799.5825.1937.83−0.96
TaLEA1-5TraesCS2D02G44920022624,041.9510.151.5345.66−0.828
TaLEA1-6TraesCS2D02G44930011612,059.479.5821.7340.78−0.975
TaLEA1-7TraesCS4A02G45970016918,344.6110.0345.8655.15−0.931
TaLEA1-8TraesCS5A02G17280014614,712.076.7125.0253.15−0.634
TaLEA1-9TraesCS5B02G17020014314,466.796.6524.2754.34−0.645
TaLEA2-1TraesCS1A02G42380016317,826.525.0314.9489.75−0.069
TaLEA2-10TraesCS3D02G09140030533,630.144.6528.8692.98−0.329
TaLEA2-11TraesCS3D02G15860015116,272.664.7814.2493.58−0.068
TaLEA2-12TraesCS3D02G22780018119,544.984.2926.0780.17−0.173
TaLEA2-13TraesCS4A02G34330033336,961.065.0923.5790.66−0.393
TaLEA2-14TraesCS5B02G53140036140,184.794.9628.895.76−0.288
TaLEA2-15TraesCS5D02G52970032636,210.084.928.4491.13−0.406
TaLEA2-2TraesCS1B02G45590016317,748.485.1717.6392.76−0.022
TaLEA2-3TraesCS1D02G43240016317,808.555.0515.3490.98−0.037
TaLEA2-4TraesCS3A02G09150030533,629.194.6524.894.89−0.297
TaLEA2-5TraesCS3A02G15080015116,254.624.7811.3396.16−0.055
TaLEA2-6TraesCS3A02G22560018619,778.214.3728.9876.45−0.165
TaLEA2-7TraesCS3B02G10670030533,625.184.6826.9292.66−0.343
TaLEA2-8TraesCS3B02G240900LC16217,614.165.6823.5890.25−0.199
TaLEA2-9TraesCS3B02G25510018119,458.894.3326.1579.12−0.169
TaLEA3-1TraesCS1B02G24360011512,671.659.1854.9973.91−0.42
TaLEA3-2TraesCS7D02G3893009910,643.2310.1152.8876.26−0.167
TaLEA4-1TraesCS1A02G36400022423,196.178.8122.8837.46−1.118
TaLEA4-10TraesCS1D02G36980021421,965.878.898.4543.74−0.95
TaLEA4-11TraesCS3A02G18870039642,151.235.0640.5745.88−0.962
TaLEA4-12TraesCS3D02G19210040743,198.535.4236.9243−0.963
TaLEA4-13TraesCS4A02G12910035137,340.936.520.447.04−1.007
TaLEA4-14TraesCS4B02G17560036338,561.246.718.1546.86−1.031
TaLEA4-15TraesCS4D02G17750035237,457.16.4420.7649.12−0.97
TaLEA4-2TraesCS1A02G36410016317,097.79.1648.5141.66−1.045
TaLEA4-3TraesCS1B02G38120021221,896.879.0221.7136.75−1.096
TaLEA4-4TraesCS1B02G38140017818,146.728.6323.1642.64−0.954
TaLEA4-5TraesCS1B02G38150016917,5285.9525.2941.95−1.009
TaLEA4-6TraesCS1D02G36920022022,708.72921.6236.82−1.108
TaLEA4-7TraesCS1D02G36930016316,930.396.6226.3739.26−1.026
TaLEA4-8TraesCS1D02G36940018018,788.435.9928.1940.5−1.046
TaLEA4-9TraesCS1D02G36950018018,740.375.9923.3542.67−1.029
TaLEA5-1TraesCS1B02G23740015316,878.385.5737.0633.2−1.539
TaLEA5-2TraesCS1B02G436600LC11312,029.245.5341.9551.06−0.962
TaLEA5-3TraesCS1D02G22580015316,771.275.4739.1833.2−1.503
TaLEA5-4TraesCS3B02G16640012413,694.25.1754.1661.45−1.308
TaLEA6-1TraesCS5A02G25890012112,844.875.4444.6651.74−1.065
TaLEA6-2TraesCS5B02G25770011812,534.65.6742.948.9−1.044
TaSMP-1TraesCS4A02G03060027227,106.724.7235.7970.59−0.336
TaSMP-10TraesCS5B02G39060026627,101.215.1538.2477.56−0.316
TaSMP-11TraesCS5B02G39080018518,756.744.6920.3475.24−0.248
TaSMP-12TraesCS5D02G02650022422,869.265.252.270.36−0.442
TaSMP-13TraesCS5D02G39560026627,028.165.0438.179.4−0.273
TaSMP-14TraesCS5D02G39570018518,795.835.1225.3474.76−0.258
TaSMP-15TraesCS7A02G33310027728,312.944.1823.4670.04−0.32
TaSMP-16TraesCS7B02G24430027528,060.674.1922.7469.49−0.308
TaSMP-17TraesCS7D02G34040027528,179.744.1920.5568.76−0.349
TaSMP-2TraesCS4A02G03070028428,955.834.7836.1170−0.412
TaSMP-3TraesCS4B02G27530027227,037.574.6534.1469.89−0.355
TaSMP-4TraesCS4D02G27390030330,498.735.2340.3173.07−0.305
TaSMP-5TraesCS5A02G02140022022,346.745.3150.5871.64−0.376
TaSMP-6TraesCS5A02G38560026627,030.135.0636.7877.56−0.306
TaSMP-7TraesCS5A02G38570018518,762.785.2121.4774.16−0.255
TaSMP-8TraesCS5B02G01830022522,887.235.3152.9270.93−0.424
TaSMP-9TraesCS5B02G01890022423,131.595.1743.4272.1−0.45

Note: MW, Molecular weight (Da); pI, Isoelectric point; GRAVY, Grand average of hydropathicity.

The physical and chemical parameters of 121 TaLEA proteins were shown according to the analysis using the ProtParam online tool. The wheat LEA protein has a maximum length of 457 amino acid residues (TaDehydrin-8) and the shortest of 99 amino acid residues (TaLEA3-2), with isoelectric points (pI values) ranged from 4.18 (TaSMP-15) to 10.74 (TaDehydrin-28). The molecular weights range from 10.64 kDa (TaLEA3-2) to 43.89 kDa (TaDehydrin-8) with an average of 20.98 kDa. Only 16 proteins have a molecular weight >30 kDa. Analysis of the grand average of hydropathicity (GRAVY) index indicated that all TaLEA proteins were hydrophilic, with all of the index <0. The lipid index reflects the thermal stability of the protein in Table 1. The phylogenetic analysis (Figure 1) revealed two major clades of the TaLEA family. The 47 genes of the TaDehydrin group, the LEA5 group, and part of the LEA1 group were clustered in one clade, while the other six groups were clustered in another one. According to the evolutionary relationship, TaSMP is more closely related to LEA6 and LEA2; LEA1 and LEA4 are more closely related. To classify the TaLEA genes into different subfamilies with higher confidence and to better understand their evolutionary relationship, another two phylogenetic trees were constructed with Arabidopsis and Oryza (Figures S1 and S2). It was indicated that the subfamilies classification of LEA genes in wheat is credible.
Figure 1

Phylogenetic analysis of wheat Late Embryogenesis-abundant (LEA) proteins. The LEA1, LEA2, LEA3, LEA4, LEA5, LEA6, Dehydrin, and seed maturation protein (SMP) groups are presented in blue, pink, light green, bright purple, orange, light blue, purple, and red, respectively.

3.2. Structural Characterization of Wheat LEA

All Gene structure analysis revealed that (Figure 2) wheat LEA genes generally contained few introns, and 24 had no introns. Only two of the 97 intron-containing LEA proteins contained more than one intron. Among the 47 genes of the TaDehydrin group, 46 genes contained only one intron, while the other one (TaDehydrin-21) contained three introns. In the TaLEA1 group, 16 genes had no introns, and the other 3 genes contained only one intron. Among the 15 genes in the TaLEA2 group, 11 were single-intron genes, and the other 4 were intron-free genes. Neither LEA3 nor LEA6 contained introns. There were 14 single-intron genes and 1 double intron gene among the TaLEA4 group. Both of the TaLEA5 and TaSMP groups contained only one intron.
Figure 2

Exon-intron structures of TaLEA genes. Exon-intron are indicated by wide color bar and black line, respectively.

To elucidate the similarity and diversity of protein motifs, the conserved motifs of wheat LEA proteins in each group were analyzed by MEME software. 15 conserved motifs were identified, namely motif1~motif15 (Figure 3). The results indicated that with the exception of the TaLEA3, TaLEA5, and TaLEA6, there were conservative motifs specific to each group (Figure 3). The composition of structural motifs was diverse among different LEA groups, but similar within the same group. Moreover, the motifs encoding LEA domains were relatively conserved, indicating that the functions of TaLEA proteins are intergroup specific. All of the 19 genes in the LEA1 group have motif15, and 12 of them also contain motif13. Among the LEA2 group, 9 members contained 2 motifs and 6 members contained 5 motifs, while each member possessed motif3 and motif4. The TaSMP group contained motif7 and motif10. All LEA4 members had motif13, with 10 of them also having motif11, and 3 of them having motif5 and motif15. There were 27 genes in the Dehydrin group possessing motif1, motif2, motif5, motif6, motif8, and motif9, and all 47 members contained motif1, motif2, and motif5.
Figure 3

Conserved motifs (left) and their distributions (right) of TaLEA genes predicted by the MEME (Multiple Expectation Maximization for Motif Elicitation) online tool. Each motif is represented by a different colored box with corresponding number. The domain of each subfamily is represented by a different colored thick line below.

3.3. Chromosomal Locations of Wheat LEA Genes

Using MapGene2Chrom software to analyze genomic position data, 117 genes were distributed on 21 wheat chromosomes, while the location of the other 4 genes remained unclear (Figure 4). The analysis from the 7 homologous chromosome groups showed that there were 29 LEA genes on the chromosome groups five, representing the most abundant regions, followed by group six with 23, while the minimum was on chromosome groups two, with only 6 genes. Chromosome groups one and three had 18 and 17 genes, respectively. Chromosome groups four and seven had 12 genes. In total, 37, 40, and 40 TaLEA genes were located on the A, B, and D sub-genome, respectively, which were almost evenly distributed.
Figure 4

Distribution of TaLEA gene family members on wheat chromosomes. TaLEA1, TaLEA2, TaLEA3, TaLEA4, TaLEA5, TaLEA6, TaDehydrin and TaSMP subfamily are presented in orange, light green, red, purple, black, dark green, pink, and blue.

According to the chromosomal distribution of the TaLEA gene, there were 43 of the Dehydrin group distributed on homologous chromosome groups three, four, five, and six, except for the four genes were still unclear. Among them, there were 7, 8, 7 on chromosome 6A, 6B, and 6D, and 3, 4, 5 on 5A, 5B, and 5D, respectively, accounting for 72.34% of all Dehydrin members. Such a concentrated distribution deserves further study in the future.

3.4. Expression Profile Analysis of Wheat LEA Genes in Different Tissues

To gain insight about the expression profile of these identified TaLEA genes, the publicly available RNA-Seq data were collected from the Wheat Expression Browser database. The expression analysis was performed on grains, roots, leaves/shoots, and spikes. A heat map was created according to the Log10 (tpm + 0.0001) of 121 TaLEA genes (Figure 5). It was indicated that expression patterns of TaLEA genes in different tissues were quite different. 93 genes highly expressed in grains, and among them there were 35 out of 47 members in the Dehydrin subfamily; 17 out of 19 in LEA1; all members of LEA4, LEA5 and LEA6. Whereas, very low expression levels of these 93 genes were observed in the other three tissues, except for 12 in roots and 8 in spikes were also high. 11 highly expressed genes were only observed in spikes, and among them there were 6 members of LEA2 and two members of the LEA3 subfamily. Almost no gene highly expressed in all four tissues and only several genes had higher expression levels in multiple tissues.
Figure 5

Expression profiling of TaLEA genes in different tissues. RNA-seq date downloaded from Wheat Expression Browser was used to analysis expression pattern. The heat map was drawn in Log10 (tpm + 0.0001)-transformed expression values. Color scale on the right represent relative expression levels: red represents high level and blue represent low level.

3.5. qRT-PCR Analysis of Wheat LEA Genes in the Grain Maturation of Two Different Cultivers

According to the analysis of gene expression in different tissues, 12 TaLEA genes highly expressed in grains were selected to verify the potential role of TaLEA genes in the maturation of seed. Expression levels at different stages post anthesis of Yangmai16 and Yangmai15 were analyzed by qRT-PCR. Results showed that the expression of these TaLEA genes have temporal specificity. On the whole, the expression level of TaLEA genes increased from 15 DAA to 35 DAA (Figure 6) in two genotypes. Interestingly, the expression patterns of TaLEA genes in two genotypes presented different characteristics. In Yangmai16, the maximum expression level of 9 genes except TaDehydrin-3, TaLEA5-1 and TaSMP-1 were observed at 30 DAA, whereas there were 8 genes in Yangmai 15 with maximum expression level at 35 DAA. For most genes, the peak of expression appeared earlier in Yangmai16. Statistical analysis indicated that, at 25 DAA, the expression level of 8 genes in Yangmai 16 was significantly higher than Yangmai 15.
Figure 6

Expression profiles of 12 wheat LEA genes during grain filling and maturation. Purple and light purple columns indicate Yangmai16 and Yangmai 15, respectively. Values represent means ± standard deviation of three replicates. Asterisks reveal the gene significantly higher or lower in Yangmai 16 than in Yangmai 15 by t-test (* p < 0.05, ** p < 0.01).

According to the determination of phenotypic character of maturation, the increase of grain dry weight in Yangmai16 was faster than that in Yangmai15 (Figure 7A). At 30 DAA, the grain filling of yangmai16 grain was almost completed. Statistical analysis indicated that the average grain filling rate was significantly higher in Yangmai 16 (P = 0.0185). The moisture content curve and the average seed dehydration rate indicated that the dehydration of Yangmai 16 was faster (P = 1.69 × 10−4) and started earlier than Yangmai 15, which could be judged by the appearance and color of the seeds (Figure 7B,C).
Figure 7

Phenotypic character of grain maturation in Yangmai 15 and Yangmai 16. (A), comparison of grain dry weight. (B), comparison of grain moisture content. (C), appearance of grains during maturation progress; Bar = 4 cm.

4. Discussion

4.1. Identification and Analysis of Wheat LEA Gene Family

In this study, 121 wheat LEA homologous were identified by genomic analysis. Wheat LEA family is bigger than rice (34), corn (32), soybean (36), potato (74), Arabidopsis (51), and canola (108), but less than upland cotton (242). We speculated that in the process of hybridization and chromosome doubling by three species, the number of genes had also doubled and finally formed an allohexaploid species with a huge genome [46,47,48]. It is also possible that most of the wheat LEA genes were retained during subsequent evolution due to genome replication [49]. Functional differentiation of LEA genes during the evolution process could facilitate the regulation of development and adaptation to the environment. The largest group of the wheat LEA family Dehydrin accounts for about 40% of the total number of family members. However, the largest group was Dehydrin in rice [15] and LEA3 in soybeans [50]. In plum blossom, the largest groups were Dehydrin and LEA2 [10]. LEA2 group in potato had the most members, accounting for 45 of all 74 LEA genes, while the Dehydrin group had only 5 [13]. The difference above indicated that LEA family genes may have evolved independently after the differentiation of these species. Gene structure analysis revealed that wheat LEA family genes generally lack introns. For instance, there were 24 genes that don’t have introns and 95 genes that have only one intron, while only 2 genes contained more than one intron. The phenomenon was also found in other species [13,50,51]. Perhaps it could be explained by the early intron model of protein-coding gene origin [52]. In general, genes that respond to stress contain fewer introns. Introns can have a deleterious effect on gene expression by delaying transcript production [18]. Moreover, introns can extend the length of the nascent transcript, resulting in an additional expense for transcription [53]. The small number of introns may be a result of adaptation to the environment, which takes less time from transcription to translation, allowing rapid gene expression and functional protein production in response to stresses [53]. DREB [54], AP2/ERF [55], and HD-Zip [42], which play important roles in response to diverse stresses, showing similar phenomenon. The expression profiles of TaLEA family members in different tissues suggested that many TaLEA genes have obvious spatial specificity. However, they were still mainly enriched in the spikes and grains. Our study could be helpful for further investigation of LEA genes with spatiotemporal expression specificity and evaluation of their roles in desiccation tolerance.

4.2. Wheat Seed Desiccation and the Roles of TaLEA Genes in Acquirement of Desiccation Tolerance

With the prospect of climate changes that are shaping future agricultural practices, the production of highly vigorous seeds have become a key lever to improve yields and a focus of food security and genetic resources [56]. Because of the long growth duration of common wheat varieties, plants must face various biotic and abiotic stresses in fields, especially during the stage of seed maturity [57]. The stage of seed maturity at harvest, which was also associated with the desiccation tolerance, is a major factor influencing seed vigor and grain quality [58]. Nowadays, faster filling and dehydration of wheat seeds achieved more popularity in China because of the important role for optimizing the new cropping systems. The wheat seeds undergo three development phases: embryo morphogenesis, material accumulation, and mature dehydration. During the last stage, the metabolism is correspondingly weakened, along with the rapid decrease of water content, which then turn on the transition from morphogenesis to germination [59,60,61]. Before and during dehydration, physiological and biochemical changes contribute to the acquisition of desiccation tolerance. The correlation between LEA protein and desiccation tolerance had been confirmed in many orthodox seeds [62,63]. In this study, the spatial expression pattern of TaLEA indicated that some genes display high expression levels mainly in spikes and grains. Two spring wheat variety Yangmai16 and Yangmai15 were then used for quantitative verification. The results indicated that Yangmai16 had certain advantages in time and quantity of expression at 25 DAA, in which the maximum expression of 8 TaLEA genes appeared earlier than that of Yangmai15. It could be a hint of the inherent genetic differences between two varieties. We speculated that the phenotype of earlier and faster dehydration, which may also possess higher desiccation tolerance theoretically, were correlated with the different expression levels of LEA genes in grains. The contribution of LEA to maintain seed desiccation tolerance is possibility due to their thermal stability and high hydrophilicity, acting as a dehydration protectant and substitute of water [64]. During the mature dehydration of seeds, the protein of Dehydrins will be present in the cell like a “space filler”, maintaining the dissolved character of cell fluid to avoid damage to cell architecture [21]. Magnetic Resonance Imaging (MRI) has been used for tracking and locating the distribution regular of water during seed maturation. It was shown that under the action of Dehydrins, water moves against potential, and the sufficient water in the surroundings of the seed could maintain the available flowing water content in the cells [65,66]. LEA were also supposed to regulate the expression of other genes by direct binding, which still lacks evidence. Four questions that need to be considered were raised by Song and Fu in the last century [67]: What are the signals that induce the mature dehydration during seed development? Are these signals from the seed itself (embryo, storage tissue, and seed coat) or the maternal tissues? What are the acceptors in seeds that response to the dehydration signal? And last but not least, what are the subtle effects of LEA proteins in cells and sub-cells leading to desiccation tolerance? Answers to these questions would be valuable for a better understanding of mature dehydration mechanisms and germplasm conservation, as well as the genetic improvement of crops.
  53 in total

1.  Proteomic and morphological analysis of early stages of wheat grain development.

Authors:  Isabelle Nadaud; Christine Girousse; Clément Debiton; Christophe Chambon; Mohamed Fouad Bouzidi; Pierre Martre; Gérard Branlard
Journal:  Proteomics       Date:  2010-08       Impact factor: 3.984

2.  Duplication and DNA segmental loss in the rice genome: implications for diploidization.

Authors:  Xiyin Wang; Xiaoli Shi; Bailin Hao; Song Ge; Jingchu Luo
Journal:  New Phytol       Date:  2005-03       Impact factor: 10.151

3.  Identification and phylogenetic analysis of late embryogenesis abundant proteins family in tomato (Solanum lycopersicum).

Authors:  Jun Cao; Xiang Li
Journal:  Planta       Date:  2014-12-10       Impact factor: 4.116

4.  Expression of a Late Embryogenesis Abundant Protein Gene, HVA1, from Barley Confers Tolerance to Water Deficit and Salt Stress in Transgenic Rice.

Authors:  D. Xu; X. Duan; B. Wang; B. Hong; THD. Ho; R. Wu
Journal:  Plant Physiol       Date:  1996-01       Impact factor: 8.340

5.  Common amino acid sequence domains among the LEA proteins of higher plants.

Authors:  L Dure; M Crouch; J Harada; T H Ho; J Mundy; R Quatrano; T Thomas; Z R Sung
Journal:  Plant Mol Biol       Date:  1989-05       Impact factor: 4.076

6.  Rapidly regulated genes are intron poor.

Authors:  Daniel C Jeffares; Christopher J Penkett; Jürg Bähler
Journal:  Trends Genet       Date:  2008-06-27       Impact factor: 11.639

7.  The Pepper Late Embryogenesis Abundant Protein, CaDIL1, Positively Regulates Drought Tolerance and ABA Signaling.

Authors:  Junsub Lim; Chae Woo Lim; Sung Chul Lee
Journal:  Front Plant Sci       Date:  2018-09-04       Impact factor: 5.753

8.  Conformational plasticity of the intrinsically disordered protein ASR1 modulates its function as a drought stress-responsive gene.

Authors:  Diana E Wetzler; Federico Fuchs Wightman; Hernan A Bucci; Jimena Rinaldi; Julio J Caramelo; Norberto D Iusem; Martiniano M Ricardi
Journal:  PLoS One       Date:  2018-08-23       Impact factor: 3.240

9.  LEA (late embryogenesis abundant) proteins and their encoding genes in Arabidopsis thaliana.

Authors:  Michaela Hundertmark; Dirk K Hincha
Journal:  BMC Genomics       Date:  2008-03-04       Impact factor: 3.969

10.  Genome-wide identification, structural analysis and new insights into late embryogenesis abundant (LEA) gene family formation pattern in Brassica napus.

Authors:  Yu Liang; Ziyi Xiong; Jianxiao Zheng; Dongyang Xu; Zeyang Zhu; Jun Xiang; Jianping Gan; Nadia Raboanatahiry; Yongtai Yin; Maoteng Li
Journal:  Sci Rep       Date:  2016-04-13       Impact factor: 4.379

View more
  3 in total

1.  Genome-Wide Identification and Low Temperature Responsive Pattern of Actin Depolymerizing Factor (ADF) Gene Family in Wheat (Triticum aestivum L.).

Authors:  Ke Xu; Yong Zhao; Sihang Zhao; Haodong Liu; Weiwei Wang; Shuhua Zhang; Xueju Yang
Journal:  Front Plant Sci       Date:  2021-02-24       Impact factor: 5.753

2.  GmTDN1 improves wheat yields by inducing dual tolerance to both drought and low-N stress.

Authors:  Yongbin Zhou; Jun Liu; Jinkao Guo; Yanxia Wang; Hutai Ji; Xiusheng Chu; Kai Xiao; Xueli Qi; Lin Hu; Hui Li; Mengyun Hu; Wensi Tang; Jiji Yan; Huishu Yan; Xinxuan Bai; Linhao Ge; Mingjie Lyu; Jun Chen; Zhaoshi Xu; Ming Chen; Youzhi Ma
Journal:  Plant Biotechnol J       Date:  2022-05-25       Impact factor: 13.263

3.  Genome-wide search and structural and functional analyses for late embryogenesis-abundant (LEA) gene family in poplar.

Authors:  Zihan Cheng; Xuemei Zhang; Wenjing Yao; Kai Zhao; Lin Liu; Gaofeng Fan; Boru Zhou; Tingbo Jiang
Journal:  BMC Plant Biol       Date:  2021-02-24       Impact factor: 4.215

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.