Literature DB >> 27549916

Genome-wide identification, phylogeny and expressional profiles of mitogen activated protein kinase kinase kinase (MAPKKK) gene family in bread wheat (Triticum aestivum L.).

Meng Wang1, Hong Yue1, Kewei Feng1, Pingchuan Deng1, Weining Song2,3, Xiaojun Nie4.   

Abstract

BACKGROUND: Mitogen-activated protein kinase kinase kinases (MAPKKKs) are the important components of MAPK cascades, which play the crucial role in plant growth and development as well as in response to diverse stresses. Although this family has been systematically studied in many plant species, little is known about MAPKKK genes in wheat (Triticum aestivum L.), especially those involved in the regulatory network of stress processes.
RESULTS: In this study, we identified 155 wheat MAPKKK genes through a genome-wide search method based on the latest available wheat genome information, of which 29 belonged to MEKK, 11 to ZIK and 115 to Raf subfamily, respectively. Then, chromosome localization, gene structure and conserved protein motifs and phylogenetic relationship as well as regulatory network of these TaMAPKKKs were systematically investigated and results supported the prediction. Furthermore, a total of 11 homologous groups between A, B and D sub-genome and 24 duplication pairs among them were detected, which contributed to the expansion of wheat MAPKKK gene family. Finally, the expression profiles of these MAPKKKs during development and under different abiotic stresses were investigated using the RNA-seq data. Additionally, 10 tissue-specific and 4 salt-responsive TaMAPKKK genes were selected to validate their expression level through qRT-PCR analysis.
CONCLUSIONS: This study for the first time reported the genome organization, evolutionary features and expression profiles of the wheat MAPKKK gene family, which laid the foundation for further functional analysis of wheat MAPKKK genes, and contributed to better understanding the roles and regulatory mechanism of MAPKKKs in wheat.

Entities:  

Keywords:  Expression profiles; Gene family; MAPKKKs; Wheat

Mesh:

Substances:

Year:  2016        PMID: 27549916      PMCID: PMC4994377          DOI: 10.1186/s12864-016-2993-7

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Mitogen-activated protein kinase (MAPK) cascades play the crucial role in plant growth and development as well as in response to stresses, which are highly conserved in the signal transduction pathway in eukaryote [1]. The MAPK pathway included three main protein kinase members, namely MAPK kinase kinases (MAPKKK or MEKK), MAPK kinases (MKK or MEK) and MAPKs (MPK). They achieved the function through sequentially being phosphorylated. Upstream signals firstly activated the MAPKKKs, which in turn the MAPKKKs activated the MAPKKs and then specific MAPKs were activated by the MAPKKs. Eventually, the activated MAPKs phosphorylated transcription factors, enzymes or other signaling components to modulate the expression of downstream genes to complete signal amplification [2, 3]. It has been demonstrated that MAPK cascades played a vital role in cell division, growth and differentiation [4, 5], hormone response [6], plant immunity [7, 8], biotic and abiotic stress response and so on [9-11]. To date, extensive studies have been conduct to systematically investigate the MAPKKK gene family in many plant species and it is reported that there were 74 putative MAPKKK genes in maize (Zea mays), 75 in rice (O. sativa), 78 in cotton (G. raimondii) and 80 in Arabidopsis (A. thalianna), respectively [12-15]. Wheat is one of the most important crops worldwide, occupying 17 % of cultivated lands and serving as the staple food source for 30 % of the human population all over the world [16, 17]. Genetically, wheat is an allohexaploid species (2n = 6x = 42), which has a complex original and evolutionary history, derived from three diploid donor species through two naturally interspecific hybridization events. The initial hybridization event was occurred between A genome donor (T. urartu, AA; 2n = 14) and B geome donor (Aegilops speltoides, SS; 2n = 14) to produce the allotetraploid (AABB, T. turgidum L) about 0.2 MYa ago, and then the AABB donor crossed with the D genome donor (A. Tauschii Coss) to form the allohexaploid wheat (AABBDD) about 9000 years ago [18]. As a result, wheat possesses a large and complex genome with three homologous genomes (A, B and D) and the size more than 17 Gb, which makes it a huge challenge to conduct genomic study in wheat. But, as the newly formed polyploidy, wheat is considered as an ideal model for chromosome interaction and polyploidization studies in plants [19, 20]. Recently, the draft genome sequencing of hexaploid wheat Chinese Spring (CS) was completed using the chromosome-based strategy, which laid the foundation to identify wheat gene family at the genome-level and also to discern the homologous copies in these three sub-genomes [17]. The retention and dispersion of homologous gene will provide the indispensable information about chromosome interaction during polyploidization [21, 22]. At present, no systematical investigation of MAPKKK gene family has been performed in wheat. In light of the functional significance of this family, an in silico genome-wide search was conducted to identify wheat MAPKKK gene family in this study. Then, the chromosome localization, gene structure, conserved protein domain, phylogenetic relationship as well as expression profiles and regulatory network were systematically analyzed in the putative wheat MAPKKK genes to reveal the evolutionary and functional features of these genes. Our study will provide a basis for further functional analysis of the wheat MAPKKK genes, and will contribute to better understanding the molecular mechanism of MAPKKKs involving in regulating growth and development as well as stress processes in wheat.

Methods

Identification of MAPKKK gene family in wheat

The wheat MAPKKK gene family was identified following the method as described by Rao et al with some modifications [13]. First, all the wheat protein sequences available were downloaded from the Ensemble database (http://plants.ensembl.org/index.html) to construct a local protein database. Then, this database were searched with 304 known MAPKKK gene sequences collected from A.thaliana (80), O. sativa (75), Z. mays (74) and B.distachyon (75) using the local BLASTP program with an e-value of 1e-5 and identity of 50 % as the threshold. Furthermore, all the MAPKKK sequences were aligned and the obtained alignments were used to construct a HMM profile using the hmmbuild tool embedded in HMMER3.0 (http://hmmer.org/download.html), and then the HMM profile were used to search the local protein database using the hmmsearch tool. HMMER and BLAST hits were compared and parsed by manual editing. Furthermore, a self-blast of these sequences was performed to remove the redundancy and the remaining sequences were considered as the putative TaMAPKKK proteins, which then were submitted to the NCBI Batch CD-search database (http://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi) and PFAM databases (http://pfam.xfam.org/) to confirm the presence and integrity of the kinase domain. Finally, all the obtained sequences were verified the existence by BLASTN similarity search against the wheat ESTs deposited in NCBI database. The theoretical pI (isoelectric point) and Mw (molecular weight) of the putative TaMAPKKK were calculated using compute pI/Mw tool online (http://web.expasy.org/compute_pi/). Subcellular localization of each TaMAPKKK cascade kinases were predicted using the TargetP software of the CBS database [23].

Multiple sequence alignments and phylogenetic analysis

Multiple sequence alignments were generated using ClustalW tool [24]. To investigate the evolutionary relationship among MAPKKK proteins, a neighbor-joining (NJ) tree was constructed by MEGA 6.0 software based on the full-length of MAPKKK protein sequences [25]. Bootstrap test method was adopted and the replicate was set to 1000.

Gene structure construction, protein domain and motif analysis

The gene structure information were got from Ensemble plants database (http://plants.ensembl.org/index.html) and displayed by Gene Structure Display Server program (GSDS: http:/gsds.cbi.pku.edu.cn/). The protein domains and motifs in the MAPKKKs were predicted using InterProScan against protein databases (http://www.ebi.ac.uk/interpro/). The schematic representing the structure of all members of TaMAPKKKs was based on the InterProScan analysis.

Chromosomal locations and gene duplication

Genes were mapped on chromosomes by identifying their chromosomal position provided in the wheat genome database. Gene duplication events of MAPKKK genes in wheat were investigated based on the following three criteria: (a) the alignment covered >80 % of the longer gene; (b) the aligned region had an identity >80 %; and (c) only one duplication event was counted for the tightly linked genes [12, 26]. In order to visualize the duplicated regions in the T. aestivum genome, lines were drawn between matching genes using Circos-0.67 program (http://circos.ca/).

Identification of cis-regulatory elements

To investigate the cis-regulatory elements, the upstream regions (2 kbp) of all wheat MAPKKK genes were extracted, which were considered as the proximal promoter regions for the individual wheat MPKKK genes. Then, all the sequences were submitted to PlantCARE database (http://bioinformatics.psb.ugent.be/webtools/Plantcare/html/) to identify the putative cis-acting regulatory elements.

Network interaction analysis

The interaction network which the TaMAPKKK genes involved were investigated based on the orthologous genes between Wheat and Arabidopsis using the AraNet V2 tool (http//www.inetbio.org/aranet/). Then, enrichment analysis was implemented by BiNGO, a cytoscape plugin, for gene ontology analysis and identifying processes and pathways of specific gene sets. Over-represented GO full categories were identified with a significance threshold of 0.01.

The MAPKKK gene expression analysis by RNA-seq data

To study the expression of TaMAPKKK genes in different organs and response to stress, transcriptome sequencing data obtained from WHEAT URGI (https://urgi.versailles.inra.fr/files/RNASeqWheat/) and NCBI Sequence Read Archive (SRA) database were used to investigate the differential expression of TaMAPKKKs. The accession numbers and sample information of the used data were listed in Additional file 1. TopHat and Cufflinks were used to analyze the genes’ expression based on the RNA-seq data [27]. The FPKM value (fragments per kilobase of transcript per million fragments mapped) was calculated for each MAPKKK gene, the log10-transformed (FPKM + 1) values of the 155 TaMAPKKK genes were used for heat map generation. And fold change cutoff of two and p-value < 0.05, q-value < 0.05 were taken as statistically significant threshold [28, 29].

Plant materials, growth conditions, and treatments

The plants of wheat cultivar ‘CS’ were reared in growth chambers at 23 ± 1 °C with a photoperiod of 16 h light/8 h dark. The roots, stems, leaves, spikes (1 d before flowering), and grains (10d after pollination) were collected from flowering plants for tissue expression analysis. One-week-old seedlings which consisted with RNA-seq data were treated by 150 mM NaCl which represented salt treatment, and the seedlings grown under normal condition were used as control. The leaves of seedlings under salt and also control conditions were collected at 0, 6, 12, 24 and 48 h after treatment. All the plant samples from two biological replicates were frozen in liquid nitrogen immediately and stored at −80 °C for RNA isolation.

RNA isolation and qRT-PCR analysis

The total RNA was extracted using Plant RNA Kit reagent (Omega Bio-Tek, USA) according to the manufacturer’s instructions. The RNA integrity was checked by electrophoresis on 1.0 % agarosegels stained with ethidium bromide (EB). The first strand cDNAs were synthesized using a Vazyme Reverse Transcription System (Beijing, China) following the manufacturer’s protocol. Real-time PCR analyses were performed using the primer pairs listed in Additional file 2. Two biological and three technical replicates for each sample were obtained using the real-time PCR system (BIO-RAD CFX96, USA). The β-actin gene was used as internal reference for all the qRT–PCR analysis. Each treatment was repeated three times independently. The expression profile was calculated from the 2–△△CT value [ΔΔCT = (CTtarget/salt – CTactin/salt) – (CTtarget/control – CTactin/control)] [30].

Results and discussion

Genome-wide Identification of MAPKKK Family in Wheat

Availability of the genome sequence made it possible for the first time to identify all the MAPKKK family members in wheat. Using the method as described above, a total of 155 genes with the complete kinase domain were identified as the MAPKKK members in the wheat genome. Since there is no standard nomenclature, the predicted wheat MAPKKK genes were then designated as TaMAPKKK1 to TaMAPKKK155 based on the blast scores. It was notable that wheat possessed the largest MAPKKK gene family among the reported species (Table 1), which may be the result of its allohexaploid genome and complex evolutionary process.
Table 1

Comparison of the gene abundance in three subfamilies of MAPKKK genes in different plant species

SpeciesRafMEKKZIKTotal
Wheat1152911155
Arabidopsis48211180
Rice43221075
Maize4622674
Brachypodium4524675
Tomato40331689
soybean923424150
Grapevine279945
Cucumber31181059
Canola3918966
Comparison of the gene abundance in three subfamilies of MAPKKK genes in different plant species As reported in Arabidopsis and other plant species [12-15], the MAPKKK gene family could be subdivided into Raf, MEKK and ZIK subfamily according to the specific conserved signature motifs contained by these subfamilies, of which Raf had the signature of GTXX (W/Y) MAPE, ZIK of GTPEFMAPE (L/V) Y, and MEKK of G (T/S) PX (W/Y/F) MAPEV [15, 31]. To validate our prediction and subcategorize the identified wheat MAPKKKs, we further investigated the conserved signature motif in these TaMAPKKKs. Results showed that all the putative wheat MAPKKKs possessed at least one of the three conserved signature motifs (Fig. 1). Among them, 29 genes shared the conserved motif G (T/S) PX (W/Y/F) MAPEV, which were categorized into MEKK subfamily, and 11 had the motif GTPEFMAPE (L/V)Y, belonging to ZIK subfamily as well as the remaining 115 genes shared the motif GTXX (W/Y) MAPE, belonging to Raf subfamily. Then, we further named these gene based on the subfamily categories (Table 2). Moreover, the Raf subfamily is found to be the largest subfamily while the ZIK subfamily had the least members in wheat, which was consistent with the composition of MAPKKK genes in other species.
Fig. 1

Protein sequence alignment of TaMAPKKK genes by ClustalW. The highlighted blue boxes showed the conserved signature motif

Table 2

Characteristics of the putative wheat MAPKKK genes

No.MAPKKKsEnsemble Wheat Gene IDSubfamilySubfamily Gene IDAmino acid lengthEST countPIMW (kDa)Subcellular locationLocation
1TaMAPKKK1Traes_2BL_23D01E7F4MEKKTaMEKK117418.4619.5Extracellular PlasmaMembranescaffold_2BL_6949321:447-1269
2TaMAPKKK2Traes_4DS_63F7CF3CETaMEKK2424175.4647.7Cytoplasmicscaffold_4DS_2304216:3-2906
3TaMAPKKK3Traes_4BL_A7AE389EETaMEKK3654206.3372.0Nuclearscaffold_4BL_6901486:6-5409
4TaMAPKKK4Traes_6BL_93505FEAFTaMEKK418606.9520.8Cytoplasmicscaffold_6BL_4252290:2480-4222
5TaMAPKKK5Traes_2AS_6DA49285ETaMEKK5424955.9548.2Cytoplasmicscaffold_2AS_5236692:1-3092
6TaMAPKKK6Traes_4BS_E01B5DAC9TaMEKK6398185.9444.9Cytoplasmic4B:9539577-9542587
7TaMAPKKK7TRAES3BF169900020CFD_gTaMEKK747344.6449.8Chloroplast3B:24030208-24031629
8TaMAPKKK8TRAES3BF036800120CFD_gTaMEKK843115.1346.1Cytoplasmic Chloroplast3B:452802187-452803479
9TaMAPKKK9TRAES3BF036800100CFD_gTaMEKK936654.5538.2Cytoplasmic Chloroplast3B:452828028-452829181
10TaMAPKKK10Traes_4DL_94E10E6EBTaMEKK10659216.4472.5Nuclear4D:19445439-19451009
11TaMAPKKK11Traes_5DL_ADFFAE33DTaMEKK11450365.8451.1Cytoplasmic5D:146319049-146323269
12TaMAPKKK12Traes_4AS_DF85CBD39TaMEKK12710216.5577.7Nuclear4A:60064569-60070396
13TaMAPKKK13Traes_6AL_E854742BBTaMEKK1318607.6720.8Cytoplasmic Extracellular6A:166723325-166725190
14TaMAPKKK14Traes_5AS_9A8A9187CTaMEKK14404225.3245.9Cytoplasmic5A:52959512-52965983
15TaMAPKKK15Traes_5AL_DEDF36AD2TaMEKK15355295.8640.5Cytoplasmic5A:127609658-127614056
16TaMAPKKK16Traes_5BL_35A6B4387TaMEKK16557295.9562.7Cytoplasmic5B:250599335-250602791
17TaMAPKKK17Traes_5AL_4D0919BA1TaMEKK1754995.760.9Nuclearscaffold_5AL_2767817:3993-8685
18TaMAPKKK18Traes_2BL_84B12F4F8TaMEKK181262475.86139.6Nuclearscaffold_2BL_8013221:1461-11089
19TaMAPKKK19Traes_2DL_000136878TaMEKK191267445.69139.8Nuclear2D:137763450-137774947
20TaMAPKKK20Traes_2AL_66079157ATaMEKK201059225.54116.6Nuclear2A:238560833-238569155
21TaMAPKKK21Traes_6AS_E690A27CATaMEKK2154336.8361.2Cytoplasmic6A:131214661-131219615
22TaMAPKKK22Traes_5AL_F9C2BEAF3TaMEKK2260155.466.2Cytoplasmic Nuclear5A:109832378-109839192
23TaMAPKKK23Traes_6DS_185723D1ETaMEKK2348036.5954.6Cytoplasmic Nuclear6D:52694919-52699797
24TaMAPKKK24Traes_5BL_3EFFD8013TaMEKK2454755.7560.4Nuclear5B:45438771-45443053
25TaMAPKKK25Traes_5BL_38DB82ACFTaMEKK2551806.0156.5Cytoplasmic Chloroplast5B:75941978-75943867
26TaMAPKKK26Traes_2DS_122AEE879TaMEKK26130247.79142.3PlasmaMembranescaffold_2DS_5390089:1-10763
27TaMAPKKK27Traes_2BS_8506C57C5TaMEKK27133558.01146.1PlasmaMembranescaffold_2BS_1798276:2-10405
28TaMAPKKK28Traes_2AS_F0521C4F2TaMEKK28133258.09145.9PlasmaMembrane2A:17064310-17075483
29TaMAPKKK29Traes_5DL_243735D6CTaMEKK2961755.8968.0Cytoplasmic Nuclear5D:48513467-48518535
30TaMAPKKK30Traes_5DL_9824E97A8ZIKTaZIK1640175.7170.6Nuclearscaffold_5DL_4596034:10027-17090
31TaMAPKKK31Traes_6DL_F70F83614TaZIK2616134.8668.9Cytoplasmic Nuclearscaffold_6DL_3325277:1-4055
32TaMAPKKK32Traes_2AS_2B84A0A98TaZIK3650335.5672.9Nuclearscaffold_2AS_3354645:196-4869
33TaMAPKKK33Traes_6BL_4A17F7221TaZIK4617134.8969.0Nuclearscaffold_6BL_4289517:41-4156
34TaMAPKKK34Traes_2DS_AA3E486F3TaZIK5321166.6236.2Cytoplasmic Nuclear2D:43089164-43091159
35TaMAPKKK35Traes_2AS_E27D25DA3TaZIK6213136.124.1Cytoplasmic Nuclear2A:69759079-69760992
36TaMAPKKK36Traes_2BS_18264AA5CTaZIK7703325.6178.6Nuclear2B:135976808-135980180
37TaMAPKKK37Traes_2BS_1E887CFE5TaZIK8292136.133.0Cytoplasmic2B:157476501-157478662
38TaMAPKKK38Traes_1DS_34EFDA767TaZIK924335.9127.9Cytoplasmic1D:3919344-3922775
39TaMAPKKK39Traes_6AL_48165ABE5TaZIK10616134.8268.9Cytoplasmic Nuclear6A:166642548-166647057
40TaMAPKKK40Traes_5BL_4002B5518TaZIK11640175.5570.5Nuclear5B:140747940-140754957
41TaMAPKKK41Traes_6DS_D8750EB5ARafTaRaf132638.6536.6Nuclearscaffold_6DS_1052516:1426-2508
42TaMAPKKK42Traes_2BL_4CAF2C184TaRaf214975.0716.5Extracellular2B:344488349-344489312
43TaMAPKKK43Traes_6BL_01E6CE316TaRaf388210699.6Cytoplasmic Nuclear6B:192776834-192783783
44TaMAPKKK44Traes_2DS_DFE006BB6TaRaf4236196.0826.9PlasmaMembrane2D:2355728-2357164
45TaMAPKKK45Traes_3DL_CFCA7AA6BTaRaf5280106.131.7Cytoplasmicscaffold_3DL_6928571:2813-4619
46TaMAPKKK46Traes_2DS_0BFF3B23DTaRaf634246.2638.9Cytoplasmic2D:9025906-9028377
47TaMAPKKK47Traes_7DS_361EC0618TaRaf745405.350.8Cytoplasmic Nuclear7D:151974-158365
48TaMAPKKK48Traes_7DS_A3EB5BFEBTaRaf8272195.8230.8PlasmaMembrane Cytoplasmic7D:15224206-15225510
49TaMAPKKK49Traes_7DS_7A0BEA59BTaRaf9267146.7930.1Cytoplasmic Chloroplast7D:15301325-15302622
50TaMAPKKK50Traes_7DS_D56FBFFD4TaRaf10180124.8619.9PlasmaMembrane7D:19252002-19255310
51TaMAPKKK51Traes_7DS_5A97B2141TaRaf1117755.2520.1Cytoplasmic7D:44647285-44648284
52TaMAPKKK52Traes_7DS_342F25C32TaRaf1238048.5642.8PlasmaMembrane7D:87713571-87717063
53TaMAPKKK53Traes_1BL_C9B36DE76TaRaf13247155.8327.8Cytoplasmic1B:269260712-269261808
54TaMAPKKK54Traes_7DL_F0110933BTaRaf14714176.2879.7Extracellular Cytoplasmic7D:221995565-222000466
55TaMAPKKK55Traes_3DS_0694296CBTaRaf15199336.222.1Cytoplasmic3D:812187-813154
56TaMAPKKK56Traes_3DS_4E61EE6EATaRaf16180164.9420.0PlasmaMembrane3D:2782290-2783296
57TaMAPKKK57Traes_3DS_6801BD0D2TaRaf17279335.2431.3PlasmaMembrane3D:3073536-3075436
58TaMAPKKK58Traes_3DL_B28036C5BTaRaf18284197.0531.7Cytoplasmic3D:56193757-56197452
59TaMAPKKK59Traes_2AS_9219695D6TaRaf1934065.8937.4Cytoplasmic Chloroplast2A:121409421-121412207
60TaMAPKKK60Traes_2AS_79A94F84ATaRaf2022916.4426.1PlasmaMembrane2A:155554112-155555589
61TaMAPKKK61Traes_7DL_705BA7CDDTaRaf2121839.2424.9Mitochondrial Nuclear7D:60185604-60186553
62TaMAPKKK62Traes_4AL_1C557F688TaRaf2225565.928.5PlasmaMembrane Cytoplasmic4A:171143548-171144835
63TaMAPKKK63Traes_4AL_06A8F8B8FTaRaf23287137.1932.5PlasmaMembrane Cytoplasmic4A:183127766-183129049
64TaMAPKKK64Traes_4AL_FEFC21AABTaRaf2470925.2479.4Cytoplasmic4A:211420094-211424697
65TaMAPKKK65Traes_4AL_C217A20A1TaRaf2574135.7982.8PlasmaMembrane Cytoplasmic4A:211772709-211779190
66TaMAPKKK66Traes_1DL_FB90601E7TaRaf2634856.7630.5Cytoplasmic Mitochondrial Nuclear1D:93818790-93820691
67TaMAPKKK67Traes_1DL_F49D0E56ATaRaf27248155.5428.0Cytoplasmic1D:116551471-116552444
68TaMAPKKK68Traes_1DL_A0FB3E1D3TaRaf28193145.1421.7Extracellular Cytoplasmic1D:129495165-129496613
69TaMAPKKK69Traes_2DL_C5A0BDC60TaRaf29271189.3331.0Mitochondrial Nuclear2D:144590634-144593681
70TaMAPKKK70Traes_1DL_56B195A26TaRaf30289257.4931.9Cytoplasmic Nuclear1D:129622264-129624911
71TaMAPKKK71Traes_6AS_006C344A3TaRaf3178665.8990.0Cytoplasmic Nuclear6A:146084-152036
72TaMAPKKK72Traes_3AS_A2CECBF17TaRaf32243306.3426.9Cytoplasmic Nuclear3A:1529045-1530295
73TaMAPKKK73Traes_3AS_769E90DDDTaRaf33268138.1229.9PlasmaMembrane3A:4632011-4633193
74TaMAPKKK74Traes_3AS_5AF26B2FCTaRaf34327106.7236.8PlasmaMembrane3A:5100634-5102019
75TaMAPKKK75Traes_3AS_A542EC6F6TaRaf3530587.2134.3Mitochondrial3A:15435755-15437806
76TaMAPKKK76Traes_3AL_7F6E774BBTaRaf36253115.2728.3Cytoplasmic3A:91931309-91932151
77TaMAPKKK77Traes_3AL_943665768TaRaf37279187.0531.2Cytoplasmic3A:107041859-107044259
78TaMAPKKK78Traes_3AL_60BB7086FTaRaf38183338.4420.6PlasmaMembrane Nuclear3A:178617601-178618324
79TaMAPKKK79Traes_3AL_F384515F5TaRaf39188244.8121.0Extracellular Cytoplasmic3A:180162239-180164198
80TaMAPKKK80Traes_2AS_0C8932B8ETaRaf4033975.5438.8Cytoplasmic Nuclear2A:180067672-180069167
81TaMAPKKK81Traes_5AL_3FE725FD4TaRaf4177526.2888.0Cytoplasmic Nuclear5A:82903861-82912218
82TaMAPKKK82Traes_5AL_A236B0387TaRaf42259115.4929.2Cytoplasmic5A:96483013-96484223
83TaMAPKKK83Traes_5AL_CDD4A02E7TaRaf4329956.3633.8PlasmaMembrane5A:97062318-97064376
84TaMAPKKK84Traes_5AL_13784C39BTaRaf4423365.4626.3PlasmaMembrane5A:97195379-97196530
85TaMAPKKK85Traes_5AL_68C659562TaRaf4527285.230.6PlasmaMembrane5A:99451668-99452790
86TaMAPKKK86Traes_5AL_7B1C0342FTaRaf46339408.1638.0Extracellular PlasmaMembrane5A:105814700-105817645
87TaMAPKKK87Traes_1AS_BEE845715TaRaf47388186.3243.0Cytoplasmic Nuclear1A:100519-103703
88TaMAPKKK88Traes_1AL_C21696173TaRaf48332276.2536.8Nuclear1A:243280434-243282190
89TaMAPKKK89Traes_7AS_51069274FTaRaf49264176.1329.8Cytoplasmic7A:12995054-12996342
90TaMAPKKK90Traes_7AS_81545C211TaRaf5021438.9324.0Cytoplasmic Nuclear7A:27180845-27181770
91TaMAPKKK91Traes_4DS_7D8A5F90BTaRaf5175546.4585.9Cytoplasmic4D:38444291-38457344
92TaMAPKKK92Traes_5DL_3191490FETaRaf52160507.0218.0Cytoplasmic5D:119596889-119599696
93TaMAPKKK93Traes_5BS_0B466F42FTaRaf5327807.5931.8Nuclear5B:4053009-4053978
94TaMAPKKK94Traes_5BS_43731B6ACTaRaf5428586.4130.4Cytoplasmic Chloroplast5B:4123947-4125118
95TaMAPKKK95Traes_5BL_E44E042FDTaRaf5534469.337.6Nuclear5B:106916097-106920463
96TaMAPKKK96Traes_5BL_2DA8896EETaRaf5678428.488.3Cytoplasmic Nuclear5B:178405794-178411614
97TaMAPKKK97Traes_5BL_11A7A1F5CTaRaf5720599.323.2Cytoplasmic5B:206004103-206004989
98TaMAPKKK98Traes_5DL_294C4EDB3TaRaf58387497.5842.2Nuclear5D:148108984-148113098
99TaMAPKKK99Traes_3AS_2A0765E10TaRaf59279298.1331.2PlasmaMembrane3A:671046-672777
100TaMAPKKK100Traes_3AL_82306B917TaRaf6031696.8235.9Cytoplasmic3A:154206856-154208804
101TaMAPKKK101Traes_5DS_53F8C78FATaRaf6119986.0121.1Cytoplasmic5D:10503237-10504290
102TaMAPKKK102Traes_7BL_46880A4FETaRaf622801198.731.5Mitochondrialscaffold_7BL_6485684:8-1478
103TaMAPKKK103Traes_7AL_9AD23808DTaRaf6331426.935.4Cytoplasmic7A:84246015-84251550
104TaMAPKKK104Traes_1DL_0162A6BACTaRaf6424175.9826.8Cytoplasmic Nuclearscaffold_1DL_2275852:3-2035
105TaMAPKKK105Traes_3AS_A0EA6D12CTaRaf6521076.0824.0Cytoplasmic Mitochondrial Nuclearscaffold_3AS_1117810:1-1084
106TaMAPKKK106Traes_4AL_48E7FB1C6TaRaf66197116.1522.5PlasmaMembranescaffold_4AL_7145827:1-952
107TaMAPKKK107Traes_4AL_83D9333FETaRaf6715496.8217.4PlasmaMembranescaffold_4AL_7109061:3-710
108TaMAPKKK108Traes_5DL_62B6846F6TaRaf6819176.321.7PlasmaMembrane Cytoplasmicscaffold_5DL_4605280:630-1568
109TaMAPKKK109Traes_2DS_42A9CC22DTaRaf6925235.2427.9Cytoplasmicscaffold_2DS_838920:50-1605
110TaMAPKKK110Traes_4BL_3626CDB73TaRaf7026515.6128.8Cytoplasmicscaffold_4BL_7036128:2-919
111TaMAPKKK111Traes_3AL_5DC02A5FCTaRaf7130276.1433.3Cytoplasmic Chloroplastscaffold_3AL_1833470:519-2133
112TaMAPKKK112Traes_5DL_0A74AE348TaRaf7229755.7633.5PlasmaMembrane5D:124050225-124051615
113TaMAPKKK113Traes_3AS_C492FCE9ATaRaf7324236.5227.1Nuclearscaffold_3AS_2578257:98-1277
114TaMAPKKK114Traes_4AL_32D968595TaRaf74270176.130.5Cytoplasmicscaffold_4AL_7089761:892-2199
115TaMAPKKK115Traes_3AL_0187ECBACTaRaf7515975.3917.9Cytoplasmic Chloroplastscaffold_3AL_4340950:1-1036
116TaMAPKKK116Traes_1BL_1E2841006TaRaf76267196.2430.2Extracellular Cytoplasmic Nuclearscaffold_1BL_3793082:882-2495
117TaMAPKKK117Traes_3DS_0B1914F50TaRaf7730596.934.3Cytoplasmic Mitochondrialscaffold_3DS_2550735:71-2194
118TaMAPKKK118Traes_5DL_5DAC7A4CFTaRaf7849735.8856.4Cytoplasmicscaffold_5DL_4513923:4360-10186
119TaMAPKKK119Traes_2AL_0E43EBBB6TaRaf79180137.0620.3Mitochondrialscaffold_2AL_6381182:1-1586
120TaMAPKKK120Traes_4AL_9601B9873TaRaf8031446.9634.7Nuclearscaffold_4AL_7096965:1880-5803
121TaMAPKKK121Traes_2DS_964FA3D25TaRaf81245134.6427.1Cytoplasmicscaffold_2DS_5355140:3031-4467
122TaMAPKKK122Traes_2AS_DCD2F10331TaRaf8231196.2334.8Cytoplasmicscaffold_2AS_2039357:2956-4095
123TaMAPKKK123Traes_5DL_A367964F5TaRaf83225108.7925.2Cytoplasmic5D:124089352-124090277
124TaMAPKKK124Traes_2AS_AC9886ABCTaRaf84225128.8825.3Cytoplasmic Nuclearscaffold_2AS_5255912:5418-6352
125TaMAPKKK125Traes_7DS_81C827CE6TaRaf8536346.2740.5PlasmaMembrane Cytoplasmicscaffold_7DS_3862762:1862-7469
126TaMAPKKK126Traes_6BS_511AB47D71TaRaf86339195.5938.1PlasmaMembrane Cytoplasmicscaffold_6BS_3043664:2-1698
127TaMAPKKK127Traes_6DL_7662129ACTaRaf87928555.77104.3Cytoplasmic Nuclearscaffold_6DL_3324907:1786-5987
128TaMAPKKK128Traes_1BL_CDC566E72TaRaf88289257.9732.0Cytoplasmic Nuclearscaffold_1BL_3828880:5213-7383
129TaMAPKKK129Traes_6BL_658AE8589TaRaf8928015.731.6Cytoplasmicscaffold_6BL_4262535:303-3102
130TaMAPKKK130Traes_7AS_0BE0D89ACTaRaf90251145.7928.6PlasmaMembrane Cytoplasmicscaffold_7AS_4255305:1753-2961
131TaMAPKKK131Traes_6BS_EAABDE59ATaRaf91250479.1428.4Extracellular Mitochondrialscaffold_6BS_3021108:276-3989
132TaMAPKKK132Traes_5BL_17A56822ETaRaf9222167.6924.8PlasmaMembrane Cytoplasmicscaffold_5BL_10894314:6618-8227
133TaMAPKKK133Traes_1BS_EA26D2661TaRaf93388186.3242.5Cytoplasmic Nuclearscaffold_1BS_3482116:8155-10572
134TaMAPKKK134Traes_5DL_383D5A71FTaRaf94189115.9421.0PlasmaMembrane Nuclear5D:157768052-157768754
135TaMAPKKK135Traes_2DL_77990F25ATaRaf9531917.1136.4Cytoplasmic Nuclearscaffold_2DL_9829349:7066-8506
136TaMAPKKK136Traes_2BS_C0AED9734TaRaf9621924.7224.5Cytoplasmic Nuclearscaffold_2BS_5191771:1720-2933
137TaMAPKKK137Traes_3DL_73ACAB95CTaRaf9730996.1434.8Cytoplasmic Nuclearscaffold_3DL_6924167:1792-4345
138TaMAPKKK138Traes_7DS_03068057CTaRaf9825907.0729.6Cytoplasmic Nuclearscaffold_7DS_3924816:112-1661
139TaMAPKKK139Traes_3AL_AB54706CATaRaf99381265.6943.1Cytoplasmic Nuclearscaffold_3AL_4360739:391-3058
140TaMAPKKK140Traes_5BS_F1687AA56TaRaf100231309.3327.1Mitochondrialscaffold_5BS_2278981:2727-5793
141TaMAPKKK141Traes_7DS_A46AFAE10TaRaf10191856.62102.6PlasmaMembrane Cytoplasmicscaffold_7DS_3809424:2024-7790
142TaMAPKKK142Traes_2AS_CC27D1C41TaRaf10224887.6427.8Cytoplasmicscaffold_2AS_5226094:20239-21469
143TaMAPKKK143Traes_2AS_AC9886ABC1TaRaf103225128.8825.3Cytoplasmic Nuclearscaffold_2AS_5255913:5418-6352
144TaMAPKKK144Traes_3DL_3D1CAD68FTaRaf104188154.8420.9Cytoplasmicscaffold_3DL_6944830:139-1513
145TaMAPKKK145Traes_2BS_5C64FC44ATaRaf105265116.3329.8Cytoplasmic2B:125675753-125677190
146TaMAPKKK146Traes_4BS_C5AB35B0CTaRaf106203105.8422.6Mitochondrial Chloroplastscaffold_4BS_948180:48-952
147TaMAPKKK147Traes_2AS_E5AB3458CTaRaf10734736.5739.6Nuclearscaffold_2AS_5232094:4234-6292
148TaMAPKKK148Traes_1BS_41E5F1990TaRaf10826966.0930.9Cytoplasmicscaffold_1BS_3451546:6832-8016
149TaMAPKKK149Traes_3B_582DCEA06TaRaf10935287.7439.2Cytoplasmic Mitochondrialscaffold_3B_10637137:56-2229
150TaMAPKKK150TRAES3BF061500080CFD_t1TaRaf110340305.2937.6Cytoplasmic Nuclear3B:1864715-1866712
151TaMAPKKK151TRAES3BF104900080CFD_t1TaRaf111100596.67111.9Nuclear3B:97278846-97291325
152TaMAPKKK152TRAES3BF026200090CFD_t1TaRaf11239696.2443.7Cytoplasmic3B:421410785-421414323
153TaMAPKKK153TRAES3BF086600060CFD_t1TaRaf11330286.2533.4Cytoplasmic Mitochondrial3B:552717475-552718658
154TaMAPKKK154TRAES3BF078400040CFD_t1TaRaf11477535.6787.6PlasmaMembrane Cytoplasmic Nuclear3B:696462241-696470991
155TaMAPKKK155Traes_6BS_5BFDC774ATaRaf11531825.236.1PlasmaMembrane6B:84413110-84414856
Protein sequence alignment of TaMAPKKK genes by ClustalW. The highlighted blue boxes showed the conserved signature motif Characteristics of the putative wheat MAPKKK genes To support the actual existence of these wheat MAPKKKs, we further performed a BLASTN search against the wheat expressed sequence tag (EST) and unigene database using the MAPKKKs as query. Results showed that most of the TaMAPKKKs’ existences were supported by EST hits except 6 MAPKKKs (TaMEKK4, TaMEKK13, TaMEKK25, TaRaf7, TaRaf53 and TaRaf98). We speculated these 6 not-support TaMAPKKKs might not express under any the used conditions or express with very low level that cannot be detected experimentally. Among the supported TaMAPKKK genes, TaRaf62 has the largest hits of ESTs, with the number of 119, followed by TaMEKK5 and TaRaf87 with the number of 95 and 55 ESTs, respectively. Chromosome localization analysis found that the 155 TaMAPKKK genes were unevenly distributed on all the 21 wheat chromosomes, of which chromosome 3A contained the most MAPKKK genes with the number of 15, followed by 2A with the number of 14, then 5B, 5D as well as 7D all with the number of 11, while the chromosome 7B had the least MAPKKK gene, with the number of only 1. Furthermore, the length of putative TaMAPKKK proteins ranged from 149 to 1335 amino acids, with the putative molecular weight (Mw) ranging from 16.5 to 146.1 kDa and theoretical isoelectric point (pI) ranging from 4.55 to 9.33, respectively. The subcellular localization analysis found that a total of 51 TaMAPKKKs localized in nuclear, 42 localized in cytoplasmic and 32 localized in plasma membrane, while the remaining were predicted to be located in chloroplast, mitochondrial and extra-cellular (Table 2).

Phylogenetic and conserved domains analysis of TaMAPKKKs

To further evaluate the phylogenetic relationships of the wheat MAPKKK cascade genes, the full-length protein sequences of the 155 TaMAPKKKs were aligned using ClustalW software and then the phylogenetic tree were constructed using the neighbor joining (NJ) method integrated into MEGA6.0 (Fig. 2a). On the basis of phylogenetic analysis, MAPKKKs in wheat were clustered into three major groups, of which MEKK, Raf and ZIK subfamily members clustered together into one category, respectively. It is found that the bootstrap value of the phylogenetic tree is low, which may due to the low similarity of the full-length protein sequences, suggesting that there are high sequence differentiation in these MAPKKK genes although the conserved motifs were included, which was consistent with the MAPKKKs in maize [12], rice [13] and Brachypodium [15, 32]. The conserved domains and phylogenetic relationship suggested that MAPKKK genes showing the closer phylogenetic relationship may have the similar biological function. To date, there is no report regarding MAPKKK genes in T. aestivum, so searching for MAPKKK family genes and understanding their phylogenetic relationship in T. aestivum is necessary and helpful for their further functional study.
Fig. 2

Phylogenetic relationships (a), gene structures (b) and protein structures (c) of MAPKKK genes in wheat

Phylogenetic relationships (a), gene structures (b) and protein structures (c) of MAPKKK genes in wheat Furthermore, the protein domains of these wheat MAPKKK genes were identified by searching against InterProScan databases (Fig. 2c). Results found that each cluster of the MAPKKKs classified by phylogenetic analysis shared the similar protein structure and domain composition, demonstrating that the protein architecture is remarkably conserved within a specific subfamily of MAPKKKs. Protein kinases have been demonstrated to play the crucial role in mediating process of protein phosphorylation, which widely occurred in most cellular activities [32]. In this study, we found all the TaMAPKKK proteins contained a kinase domain (IPR000719), and most of them had the serine/threonine protein kinase active site (IPR008271) in the central part of the catalytic domain. These features were also found in the MAPKKK proteins of rice and cucumber [13, 33], suggesting the conserved function of MAPKKK genes in plants. Moreover, the ATP-binding site, which is located on the catalytic domain, is the most conserved sequences in the kinase family [33]. We found that most of TaMAPKKKs also contained an ATP-binding site (IPR017441), suggesting that these wheat MAPK cascade kinases use ATP as the ligand in signal transduction pathway. In addition, the TaMAPKKKs also had some other conserved domains, such as concanavalin A-like lectin/glucanase domain (IPR013320), armadillo-like helical (IPR011989), and EF-hand domain (IPR011992). Interestingly, these TaMAPKKKs containing the same protein domains were generally clustered into the same clade in phylogenetic analysis, and showed similar expression patterns in response to multiple stresses, which was consistent with the result of BdMAPKKK genes as reported previously [32]. For example, most TaMAPKKK genes containing concanavalin A-like lectin/glucanase domain were up-regulated by drought stress, while those genes containing armadillo-like helical domain showed to be down-regulated under salt stress. These results indicated that the various protein domains could regulate the TaMAPKKK gene to exhibit specific biological functions. The conserved domains identification and analysis may facilitate the identification of functional units in these kinase genes and accelerate to understand their crucial roles in plant growth and development as well as stresses response [34, 35].

Analyses of gene structures and promoter regions of TaMAPKKKs

Gene structure analysis can provide important information about the gene function, organization and evolution [36]. Thus, the exon/intron structures of TaMAPKKK genes were further analyzed using the available wheat genome annotation information and then were displayed by the Gene Structure Display Server (http://gsds.cbi.pku.edu.cn/) (Fig. 2b). We found the exon/intron structures in the TaMAPKKK genes were relatively conserved within the subfamily but some divergent between different subfamily. The Raf and MEKK subfamily have more sophisticated structure than ZIK subfamily due to the various number of intron. In detail, all the ZIK genes had introns, with the number ranging from 1 to 7. In the MEKK subfamily, 3 gene had no intron, and others had 1 to 22 introns, which was the most highly variable in the number of introns in TaMAPKKKs. In the Raf subfamily, 7 out 115 genes had no intron, and other Raf genes had the intron number ranging from 1 to 14. Interestingly, most gene pairs clustered together by phylogenetic analysis shared the similar exon/intron structure and intron phases in these TaMAPKKK genes, suggesting the evolutionary event may impact not only on the gene function but also on gene structure. It has been revealed that intron gain or loss is the results of selection pressures during evolution in plants, and the genes tend to evolve into diverse exon-intron structures and perform differential functions [37, 38]. Accordingly, the wheat MAPKKK genes were found to have the similar exon-intron structure within same subfamily, while the numbers of introns were varied, even within subfamily, which indicated that gene differentiation have occurred in the wheat MAPKKK to accomplish different biological functions under the selection pressure during the wheat genome formation and evolution. Promoter is the region of the transcription factors (TF) binding site to initiate transcription, which plays a key role in regulating gene spatial and temporal expressions [39]. To further detect the possible biological function and transcription regulation of these TaMAPKKKs, the 2 kb-upstream region of the transcriptional start site of all these genes were extracted and then used to screen for cis-regulatory elements. Results showed that a large number of stress-related and hormone-related cis-elements were found in promoter regions of the wheat MAPKKK genes (Additional file 3), which were similar with the result in Brachypodium, tomato and cucumber [32, 33, 36]. In addition, the abiotic stress-related (a total of 9 drought-stress, 1 salt-stress, 1 heat-stress, 1 cold-stress, 2 wound-stress and 2 disease resistance-related) and hormones signaling transduction-related (6 gibberellins, 4 abscisic acid and 3 ethylene-related) cis-regulatory elements were also found, suggesting that the wheat MAPKKKs may involve in regulating varieties of stress responses and hormone signaling transduction processes.

Genomic distribution and gene duplication of TaMAPKKK gene family

Based on the available wheat genome annotation information, the chromosomal location of the TaMAPKKK genes were further investigated (Fig. 3). A total of 58, 45, and 52 TaMAPKKK genes are distributed in the A, B and D sub-genome, respectively (A > D > B). Initial gene loss may occurred in B genomes following tetraploidy to decrease functional redundancy and define the core wheat genes, with subsequent loss from all three genomes following the formation of the hexaploid around 9000 years ago. The distribution of MAPKKK genes was not random in wheat chromosomes. There were 13, 31, 32, 16, 32, 15 and 16 genes in the group 1 to 7 chromosomes, which show two obvious gradients between group 2, 3, 5 and other four groups. And chromosome 3A had the highest number of MAPKKK genes with the value of 15 genes, whereas chromosome 7B had only one MAPKKK gene. These results indicates that duplication events of MAPKKK gene have likely occurred in wheat 2, 3 and 5 group chromosomes during wheat formation and the evolution of gene families within the different sub-genome is independent, which may associate with gene functions.
Fig. 3

Chromosomal localization and the homologous TaMAPKKK genes in wheat A, B and D sub-genomes. The genes followed by * represent that the gene only anchor to scaffold. Seven homologous groups of wheat chromosomes are displayed in different colors. Duplicated genes of each homo-group are displayed in corresponding color and linked using lines with corresponding color

Chromosomal localization and the homologous TaMAPKKK genes in wheat A, B and D sub-genomes. The genes followed by * represent that the gene only anchor to scaffold. Seven homologous groups of wheat chromosomes are displayed in different colors. Duplicated genes of each homo-group are displayed in corresponding color and linked using lines with corresponding color Gene duplication is frequently observed in plant genomes, arising from polyploidization or through tandem and segmental duplication associated with replication [40]. In our study, a total of 11 homologous gene groups with a copy on each of A, B and D homologous chromosome were found in wheat MAPKKK gene family, and 24 gene pairs with a copy on only 2 of the 3 homologous chromosomes were also identified (Fig. 3 and Additional file 4), while the remaining 74 genes were not found homologs in wheat genome. Previous studies have demonstrated that the fractionation from ploidy caused the loss of some homologous sequences because of some combination of deletion [41]. Our results indicated gene loss may also occur in wheat MAPKKK gene family, resulting in the loss of some homologous copies. The specific retention and dispersion of MAPKKKs in homologous chromosomes provide the invaluable information to better understand the wheat chromosome interaction and polyploidization. Furthermore, these homologous genes are clustered in group 2, 3 and 5 chromosomes, which was consistent with the above chromosome localization analysis, suggesting that group 2, 3 and 5 chromosomes suffered less sequence loss and interaction impact compared to other homologous chromosome groups. Additionally, 25 pairs of duplication genes from different sub-genomes were also identified (Fig. 4 and Additional file 4), including 3 duplication events within the same chromosome and 22 segmental duplication events between different chromosomes, suggesting that the duplication events could play vital roles in the expansion of the MAPK cascade kinase genes in wheat genome. Interestingly, most duplication events occurred between A and D genomes, except the pair of Raf92 and Raf57 occurred on 5B as well as that of Raf13 and Raf88 from 1B. We postulated that the gene family size of the A and B sub-genome have arrived to balance after first hybridization with the long evolutionary process, but the D sub-genome, which was added to form hexaploid wheat recently, appeared to have more interaction with other two sub-genomes. More interestingly, all the 25 pairs of duplication genes belonging to Raf subfamily, which indicates that gene duplication is a main processes responsible for expanding family size and protein functional diversity [42].
Fig. 4

Duplicated MAPKKK genes pairs identified in wheat. Seven homologous groups of wheat chromosomes are displayed in different colors. Duplicated gene pairs are displayed in corresponding color and linked using lines with the corresponding color

Duplicated MAPKKK genes pairs identified in wheat. Seven homologous groups of wheat chromosomes are displayed in different colors. Duplicated gene pairs are displayed in corresponding color and linked using lines with the corresponding color

Regulatory network between TaMAPKKK genes with other wheat genes

MAPKKKs, as the first step of MAPK cascade, function as the pivotal component linking upstream signaling steps to the core MAPK cascade and then promote the corresponding cellular responses, which are activated by a diversity of external stimuli and interact with other genes to form the signaling regulatory network in plants [2, 31]. To understand the interactions between TaMAPKKKs and other wheat genes, the regulatory network of them (Fig. 5) was predicted using the orthology-based method [43]. Results showed 18 MAPKKKs (6 TaMEKKs, 8 TaRafs and 4 TaZIKs) were found to have homology with Arabidopsis genes, and corresponding 509 gene pairs of network interactions were detected with the average of 28.3 gene/TaMAPKKK, suggesting the MAPKKKs were widely involved in the regulatory network and metabolic processes in wheat (Additional files 5 and 6). Among them, 149 genes were interacted by TaZIKs, and 212 genes were interacted by TaRafs, as well as 148 genes interacted by TaMEKKs, respectively. TaMEKK27 showed orthologous to Arabidopsis Fused (FU) gene, with an active kinase domain and the C-terminal ARM/HEAT repeat domain. Previously study has revealed that Arabidopsis Fused kinase termed TIO is essential for cytokinesis in both sporophytic and gametophytic cell types [44]. In this study, TaMEKK27 was found to interact with 38 wheat genes, including SOS6, NACK1 and FZR3, suggesting it was also mainly involved in cell proliferation and cytokinesis. TaRaf1 is found to interact with 10 wheat genes, which is homology with Arabidopsis HT1 gene reported to encode an important protein kinase for regulation of stomatal movements and corresponding to CO2, ABA and light [45]. The predicted upstream target genes of TaRaf1 included SLAC1, FMA and CHX20 as well as MYB and NAC transcription factor, which indicated TaRaf1 might play a vital role in ion homeostasis and stress response in wheat. Furthermore, Gene Ontology (GO) functional enrichment of those genes was performed to understand their potential functions. GO descriptions of those interacted genes were involved in diverse biological process, molecular function and stress response. TaMEKK interacted genes were significantly enriched for cellular process and metabolic process, and TaRaf interacted genes were significantly enriched for cellular process and pathways for stress response, while TaZIK interacted genes were functionally enriched in cellular process and protein modification process pathway (Fig. 6a–c), which indicated that TaMAPKKK genes played the vital role in cellular response to external stimuli, especially TaRaf subfamily genes might be the main adaptors to transduce the stress-related signal.
Fig. 5

The interaction network of TaMAPKKK genes in Wheat according to the orthologs in Arabidopsis

Fig. 6

Functional categories of genes in MEKK (a), Raf (b), and ZIK (c) subfamily. FDR-adjusted P values, **P < 0.01, respectively. Observed, numbers of genes observed in this study; Expected, numbers of genes in this same category in the GO enrichment analysis program

The interaction network of TaMAPKKK genes in Wheat according to the orthologs in Arabidopsis Functional categories of genes in MEKK (a), Raf (b), and ZIK (c) subfamily. FDR-adjusted P values, **P < 0.01, respectively. Observed, numbers of genes observed in this study; Expected, numbers of genes in this same category in the GO enrichment analysis program

Tissue-specific expression patterns of TaMAPKKK genes

Different members of gene families exhibit great disparities in abundance among different tissues to accommodate different physiological processes [46, 47]. To gain insight into the temporal and spatial expression patterns and putative functions of MAPKKK genes in wheat growth and development, the tissue specificity of the 155 TaMAPKKK genes was investigated using available RNA-seq data for five different tissues [48]. Based on the log10-transformed (FPKM + 1) values, we found that the expression levels of the TaMAPKKKs varied significantly in different tissues (Fig. 7). Most MAPKKK genes were found to be expressed in at least one detected organ. All the members in ZIK subfamily were expressed in all of the 5 organs, while a total of 16 Raf genes had too weak expression abundances to be detected in any tissues, which indicated that these genes have undergone functional differentiation and redundancy. Most of MAPKKK genes were much more highly expressed in the root and leaf compared to grain, stem and spike. Furthermore, the tissue-specific expressed MAPKKK genes were identified. A total of 1, 6, 1, 6 and 3 genes were found to be specifically expressed in grain, root, stem, leaf and spike, respectively. Among them, TaRaf112 was predominantly expressed in grain and spike, TaMEKK25 showed preferential expression in stem and leave, and TaRaf12, TaRaf33 as well as TaRaf73 showed preferential expression in root and leave. As shown in Fig. 7 and Additional file 7, most homologous and duplication genes showed similar expression pattern during development. However, it also should be noted that many clustering of expression profiles does not reflect gene similarities, including the copies of one MAPKKK gene from sub-genomes and duplication genes from different sub-genomes. Some of them even show converse expression patterns. For instance, TaRaf71 which located in 3A showed preferential expression patterns in the root, stem, leaf and spike, whereas its homology gene TaRaf113 from 3B was only expressed in the grain. TaMAPKKK23 in 5A was expressed in all tested organs with relatively higher abundance, while its homology TaMAPKKK25 from 5B only slightly expressed in stem and leaf. The divergences in expression profiles between homologous genes revealed that some of them may lose function or acquire new function after polyploidy and duplication in the wheat evolutionary process.
Fig. 7

Hierarchical clustering of the expression profiles of all TaMAPKKK genes in five different organs or tissues (grain, root, stem, leaf and spike). Log10-transformed (FPKM + 1) expression values were used to create the heat map. The red or green colors represent the higher or lower relative abundance of each transcript in each sample

Hierarchical clustering of the expression profiles of all TaMAPKKK genes in five different organs or tissues (grain, root, stem, leaf and spike). Log10-transformed (FPKM + 1) expression values were used to create the heat map. The red or green colors represent the higher or lower relative abundance of each transcript in each sample

Expression patterns of TaMAPKKK genes under abiotic stresses

Extensive studies have revealed that the MAPKKK genes played a crucial role in response to abiotic stresses in plant [10, 49, 50]. In the present study, expression patterns of all TaMAPKKK genes in response to four abiotic (salt, heat, drought, cold) stresses were investigated using RNA-seq data to study the roles of TaMAPKKK genes in the response to abiotic stresses. Overall, all the 155 wheat MAPKKK genes showed differential expression patterns under these conditions and most of them were up-regulated in response to more than one stress (Figs. 8, 9 and 10). Among them, TaMEKK14, TaRaf10, TaRaf34 and TaRaf53 showed specific-expression under salt stress, while TaRaf87 and TaRaf105 specifically expressed under drought stress. Meanwhile, TaRaf36 and TaRaf49 were specifically expressed under cold stress while TaRaf112 were specifically expressed under heat stress. In addition, some down-regulated TaMAPKKKs were also observed. TaMEKK29, TaRaf22, TaRaf41, and TaRaf73 was down-regulated under salt stress (Fig. 8), TaMEKK29 showing down-regulated under heat stress, while TaRaf44, TaRaf72 and TaRaf80 showing down-regulated under heat and drought stress (Fig. 9), as well as TaMEKK13, TaRaf1 and TaZIK10 were down-regulated under cold stress (Fig. 10), respectively. These stress-induced MAPKKK genes provided the valuable information to further reveal the roles of TaMAPKKKs playing in regulating wheat diverse stress processes. Finally, the most of the homologous and duplication gene pairs such as TaRaf110/TaRaf32/TaRaf15, and TaMEKK18/ TaMEKK19/ TaMEKK20 showed the similar expression pattern under these stress treatments, suggesting that these had similar physiological functions. On the other hand, several gene pairs such as TaRaf83/TaRaf42 and TaRaf17/TaRaf74, exhibited different expression patterns under the same stress treatments, suggesting functional differentiation has been occurred in these genes and they involved in regulating different stress signaling pathways.
Fig. 8

Hierarchical clustering of the expression profiles of all 155 TaMAPKKK genes under salt stress treatments. Log10-transformed (FPKM + 1) expression values were used to create the heat map. The red or green colors represent the higher or lower relative abundance of each transcript in each sample. Fold change cutoff of two and p-value < 0.05, q-value < 0.05 were taken as statistically significant

Fig. 9

Hierarchical clustering of the expression profiles of all TaMAPKKK genes under drought and heat stress treatments. Log10-transformed (FPKM + 1) expression values were used to create the heat map. The red or green colors represent the higher or lower relative abundance of each transcript in each sample. Fold change cutoff of two and p-value < 0.05, q-value < 0.05 were taken as statistically significant

Fig. 10

Hierarchical clustering of the expression profiles of all TaMAPKKK genes under cold stress treatments. Log10-transformed (FPKM + 1) expression values were used to create the heat map. The red or green colors represent the higher or lower relative abundance of each transcript in each sample. Fold change cutoff of two and p-value < 0.05, q-value < 0.05 were taken as statistically significant

Hierarchical clustering of the expression profiles of all 155 TaMAPKKK genes under salt stress treatments. Log10-transformed (FPKM + 1) expression values were used to create the heat map. The red or green colors represent the higher or lower relative abundance of each transcript in each sample. Fold change cutoff of two and p-value < 0.05, q-value < 0.05 were taken as statistically significant Hierarchical clustering of the expression profiles of all TaMAPKKK genes under drought and heat stress treatments. Log10-transformed (FPKM + 1) expression values were used to create the heat map. The red or green colors represent the higher or lower relative abundance of each transcript in each sample. Fold change cutoff of two and p-value < 0.05, q-value < 0.05 were taken as statistically significant Hierarchical clustering of the expression profiles of all TaMAPKKK genes under cold stress treatments. Log10-transformed (FPKM + 1) expression values were used to create the heat map. The red or green colors represent the higher or lower relative abundance of each transcript in each sample. Fold change cutoff of two and p-value < 0.05, q-value < 0.05 were taken as statistically significant

Validation of the expression of TaMAPKKKs by qRT-PCR analysis

Gene expression patterns usually provide the important clue for its function. Though expression profiles analysis based on RNA-seq data, the differentially expressed TaMAPKKKs among different tissues and stresses were obtained. To further verify the expression levels of these TaMAPKKKs, 10 differentially expressed genes in tissues and 4 salt-responsive genes were randomly selected to detect their expression levels through qRT-PCR analysis (Fig. 11). Among five tissues, TaMEKK5 was found to be expressed in all tested materials with relatively higher abundance. TaMEKK14, TaMEKK21 and TaMEKK23 were found to show a relatively high expression level in the spike comparing with other four tissues, whereas TaRaf80 exhibited the high abundance in the leaf and TaRaf87 showed high expression levels in root and leaf (Fig. 11a). Under salt stress, TaRaf34 was found to be significantly up-regulated while TaRaf22, TaRaf4 and TaMEKK29 were down-regulated under salt stress condition (Fig. 11b). The qRT-PCR results were highly consistent with that of RNA-seq data, suggesting it is reasonable to use RNA-seq data to assess the expression level of transcripts in wheat and the validated tissues-specific and salt-responsive TaMAKKK provided the candidates for further study of their function in wheat development and stress response.
Fig. 11

Validation of the expression level of TaMAPKKKs by qRT-PCR analysis. a The relative expression levels of the 10 selected TaMAPKKKs in different tissues; b The relative expression levels of the 4 TaMAPKKKs under salt treatment

Validation of the expression level of TaMAPKKKs by qRT-PCR analysis. a The relative expression levels of the 10 selected TaMAPKKKs in different tissues; b The relative expression levels of the 4 TaMAPKKKs under salt treatment

Conclusion

This study for the first time identified and characterized the wheat MAPKKK gene family. Through a genome-wide search using the latest available wheat genome information, a total of 155 putative TaMAPKKKs were obtained, which classified into MEKK, ZIK and Raf 3 subfamilies based on the conserved motif signatures. The gene structure, conserved protein domain as well as phylogenetic relationship of these TaMAPKKKs were systematically analyzed and strongly supported the classification. The homologous genes between wheat A, B and D sub-genome and gene duplication were also investigated, which was found to be the main factors contributing to the expansion of wheat MAPKKK gene families. Furthermore, the expression profiles of wheat MAPKKKs during development and under abiotic stresses were investigated and the tissue-specific or stress-responsive TaMAPKKK genes were identified. Finally, 6 tissue-specific and 4 salt-responsive TaMAPKKK genes were selected to validate their expression level through qRT-PCR analysis, which provided the important candidates for further functional analysis of MAPKKK genes in wheat development and stress response. Our current study systematically investigated the genome organization, evolutionary features, regulatory network and expression profiles of the wheat MAPKKK gene family, which not only lay the foundation for investigating the function of these MAPKKKs, but also facilitate to reveal the regulatory and evolutionary mechanism of MAPK cascade involving in growth and development as well as in response to stresses in wheat.
  48 in total

Review 1.  Complexity, cross talk and integration of plant MAP kinase signalling.

Authors:  Claudia Jonak; László Okrész; László Bögre; Heribert Hirt
Journal:  Curr Opin Plant Biol       Date:  2002-10       Impact factor: 7.834

2.  Genome-wide analysis of the mitogen-activated protein kinase gene family in Solanum lycopersicum.

Authors:  Fuling Kong; Jie Wang; Lin Cheng; Songyu Liu; Jian Wu; Zhen Peng; Gang Lu
Journal:  Gene       Date:  2012-01-28       Impact factor: 3.688

3.  Multiple sequence alignment using ClustalW and ClustalX.

Authors:  Julie D Thompson; Toby J Gibson; Des G Higgins
Journal:  Curr Protoc Bioinformatics       Date:  2002-08

Review 4.  MAPK cascade signalling networks in plant defence.

Authors:  Andrea Pitzschke; Adam Schikora; Heribert Hirt
Journal:  Curr Opin Plant Biol       Date:  2009-07-14       Impact factor: 7.834

5.  Frequent gene movement and pseudogene evolution is common to the large and complex genomes of wheat, barley, and their relatives.

Authors:  Thomas Wicker; Klaus F X Mayer; Heidrun Gundlach; Mihaela Martis; Burkhard Steuernagel; Uwe Scholz; Hana Simková; Marie Kubaláková; Frédéric Choulet; Stefan Taudien; Matthias Platzer; Catherine Feuillet; Tzion Fahima; Hikmet Budak; Jaroslav Dolezel; Beat Keller; Nils Stein
Journal:  Plant Cell       Date:  2011-05-27       Impact factor: 11.277

6.  MAP kinase signalling cascade in Arabidopsis innate immunity.

Authors:  Tsuneaki Asai; Guillaume Tena; Joulia Plotnikova; Matthew R Willmann; Wan-Ling Chiu; Lourdes Gomez-Gomez; Thomas Boller; Frederick M Ausubel; Jen Sheen
Journal:  Nature       Date:  2002-02-28       Impact factor: 49.962

7.  A workshop report on wheat genome sequencing: International Genome Research on Wheat Consortium.

Authors:  Bikram S Gill; Rudi Appels; Anna-Maria Botha-Oberholster; C Robin Buell; Jeffrey L Bennetzen; Boulos Chalhoub; Forrest Chumley; Jan Dvorák; Masaru Iwanaga; Beat Keller; Wanlong Li; W Richard McCombie; Yasunari Ogihara; Francis Quetier; Takuji Sasaki
Journal:  Genetics       Date:  2004-10       Impact factor: 4.562

8.  HINKEL kinesin, ANP MAPKKKs and MKK6/ANQ MAPKK, which phosphorylates and activates MPK4 MAPK, constitute a pathway that is required for cytokinesis in Arabidopsis thaliana.

Authors:  Yuji Takahashi; Takashi Soyano; Ken Kosetsu; Michiko Sasabe; Yasunori Machida
Journal:  Plant Cell Physiol       Date:  2010-08-27       Impact factor: 4.927

9.  Mitogen-activated protein kinase cascades in plants: a new nomenclature.

Authors: 
Journal:  Trends Plant Sci       Date:  2002-07       Impact factor: 18.313

10.  Negative regulation of defense responses in plants by a conserved MAPKK kinase.

Authors:  C A Frye; D Tang; R W Innes
Journal:  Proc Natl Acad Sci U S A       Date:  2001-01-02       Impact factor: 11.205

View more
  44 in total

1.  Genome-wide identification of MAPK cascade genes reveals the GhMAP3K14-GhMKK11-GhMPK31 pathway is involved in the drought response in cotton.

Authors:  Lin Chen; Heng Sun; Fengjiao Wang; Dandan Yue; Xiankun Shen; Weinan Sun; Xianlong Zhang; Xiyan Yang
Journal:  Plant Mol Biol       Date:  2020-03-14       Impact factor: 4.076

2.  A serine-rich effector from the stripe rust pathogen targets a Raf-like kinase to suppress host immunity.

Authors:  Cuiping Wan; Yan Liu; Shuxin Tian; Jia Guo; Xingxuan Bai; Haochuan Zhu; Zhensheng Kang; Jun Guo
Journal:  Plant Physiol       Date:  2022-08-29       Impact factor: 8.005

Review 3.  Thermo-Priming Mediated Cellular Networks for Abiotic Stress Management in Plants.

Authors:  Ambreen Khan; Varisha Khan; Khyati Pandey; Sudhir Kumar Sopory; Neeti Sanan-Mishra
Journal:  Front Plant Sci       Date:  2022-05-13       Impact factor: 6.627

4.  Expression profiling of the mitogen-activated protein kinase gene family reveals their diverse response pattern in two different salt-tolerant Glycyrrhiza species.

Authors:  Aiping Cao; Ling Gao; Fei Wang; Xuechen Tong; Shuangquan Xie; Xifeng Chen; Tianxin Lu; Haitao Shen; Hailiang Liu; Xiang Jin; Hongbin Li
Journal:  Genes Genomics       Date:  2022-02-28       Impact factor: 2.164

5.  Genome-Wide Analysis of Heat Shock Transcription Factors in Ziziphus jujuba Identifies Potential Candidates for Crop Improvement Under Abiotic Stress.

Authors:  Kishor Prabhakar Panzade; Sonam S Kale; Vijay Kapale; Narendra R Chavan
Journal:  Appl Biochem Biotechnol       Date:  2020-11-26       Impact factor: 2.926

6.  Genome-wide analysis of the serine carboxypeptidase-like protein family in Triticum aestivum reveals TaSCPL184-6D is involved in abiotic stress response.

Authors:  Xiaomin Xu; Lili Zhang; Wan Zhao; Liang Fu; Yuxuan Han; Keke Wang; Luyu Yan; Ye Li; Xiao-Hong Zhang; Dong-Hong Min
Journal:  BMC Genomics       Date:  2021-05-15       Impact factor: 3.969

Review 7.  Impact of Post-Translational Modifications of Crop Proteins under Abiotic Stress.

Authors:  Akiko Hashiguchi; Setsuko Komatsu
Journal:  Proteomes       Date:  2016-12-21

8.  Genome-wide identification, classification, evolutionary analysis and gene expression patterns of the protein kinase gene family in wheat and Aegilops tauschii.

Authors:  Jun Yan; Peisen Su; Zhaoran Wei; Eviatar Nevo; Lingrang Kong
Journal:  Plant Mol Biol       Date:  2017-09-16       Impact factor: 4.076

9.  The Pepper Mitogen-Activated Protein Kinase CaMAPK7 Acts as a Positive Regulator in Response to Ralstonia solanacearum Infection.

Authors:  Lanping Shi; Kan Zhang; Linjing Xie; Mingxing Yang; Baixue Xie; Shuilin He; Zhiqin Liu
Journal:  Front Microbiol       Date:  2021-07-06       Impact factor: 5.640

10.  Amino acid transporter (AAT) gene family in foxtail millet (Setaria italica L.): widespread family expansion, functional differentiation, roles in quality formation and response to abiotic stresses.

Authors:  Yang Yang; Yongmao Chai; Jiayi Liu; Jie Zheng; Zhangchen Zhao; Aduragbemi Amo; Chunge Cui; Qiumei Lu; Liang Chen; Yin-Gang Hu
Journal:  BMC Genomics       Date:  2021-07-08       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.