Literature DB >> 22802709

Identification and in silico characterization of soybean trihelix-GT and bHLH transcription factors involved in stress responses.

Marina Borges Osorio1, Lauro Bücker-Neto, Graciela Castilhos, Andreia Carina Turchetto-Zolet, Beatriz Wiebke-Strohm, Maria Helena Bodanese-Zanettini, Márcia Margis-Pinheiro.   

Abstract

Environmental stresses caused by either abiotic or biotic factors greatly affect agriculture. As for soybean [Glycine max (L.) Merril], one of the most important crop species in the world, the situation is not different. In order to deal with these stresses, plants have evolved a variety of sophisticated molecular mechanisms, to which the transcriptional regulation of target-genes by transcription factors is crucial. Even though the involvement of several transcription factor families has been widely reported in stress response, there still is a lot to be uncovered, especially in soybean. Therefore, the objective of this study was to investigate the role of bHLH and trihelix-GT transcription factors in soybean responses to environmental stresses. Gene annotation, data mining for stress response, and phylogenetic analysis of members from both families are presented herein. At least 45 bHLH (from subgroup 25) and 63 trihelix-GT putative genes reside in the soybean genome. Among them, at least 14 bHLH and 11 trihelix-GT seem to be involved in responses to abiotic/biotic stresses. Phylogenetic analysis successfully clustered these with members from other plant species. Nevertheless, bHLH and trihelix-GT genes encompass almost three times more members in soybean than in Arabidopsis or rice, with many of these grouping into new clades with no apparent near orthologs in the other analyzed species. Our results represent an important step towards unraveling the functional roles of plant bHLH and trihelix-GT transcription factors in response to environmental cues.

Entities:  

Keywords:  Glycine max; drought; gene expression; phylogeny; plant-microbe interactions

Year:  2012        PMID: 22802709      PMCID: PMC3392876          DOI: 10.1590/S1415-47572012000200005

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Introduction

Soybean [Glycine max (L.) Merril] is one of the most important crop species in the world. It is widely used for both human and animal consumption due to the high protein and oil contents of its grains. More recently, the potential for using soybean oil in renewable fuel production has also emerged (Programa Nacional de Produção e Uso de Biodiesel). Since it belongs to the Fabaceae family, soybean also takes part in the process of organic nitrogen fertilizer production through its symbiotic association with nitrogen-fixing bacteria (Gepts ). Currently, soybean producers are primarily concerned with losses caused by drought stress, Asian Soybean Rust (ASR, caused by the fungus Phakopsora pachyrhizi) and soybean cyst nematode (SCN, caused by Heterodera glycines) (EMBRAPA, 2007). Furthermore, the genetic variability found in soybean germplasm for those characteristics is restricted, which increases the vulnerability of this species to environmental stresses (Priolli ; Miles ). As sessile organisms, higher plants are continuously exposed to a great variety of environmental stimuli. Because their survival depends on the ability to cope with those stimuli, plants have evolved a variety of sophisticated molecular mechanisms in response to environmental stresses. These generally involve alterations in gene expression, leading to changes in plant physiology, metabolism and developmental activities. Whether caused by abiotic (such as drought, salt and cold) or biotic factors (such as pathogens and insects), environmental stresses have serious adverse effects on agriculture. Therefore, a thorough understanding of the molecular mechanisms involved in plant stress tolerance has become pivotal for the development of new strategies and technologies related to the increasing demand on agricultural production (Rao ; Yoshioda and Shinozaki, 2009). Upon stimuli perception, responses of plants to environmental stresses comprise the activation of a multitude of interconnected signaling pathways (Singh ). The phytohormones abscisic acid (ABA), ethylene (ET), jasmonic acid (JA) and salicylic acid (SA), aside from reactive oxygen species (ROS), are known to act as messenger molecules that trigger specific (but at times overlapping) pathways of this complex network, leading to the accumulation of stress-related gene products (Yoshioda and Shinozaki, 2009). Besides, a great number of studies have highlighted the importance of the transcriptional regulation of target-genes through transcription factors in plant responses to environmental stresses (Zhou ; Chen ; Zhang ). Transcription factors act by binding to cis-elements in the promoter regions of target-genes, thereby activating or repressing their expression. Transcriptional reprogramming is known to result in both spatially and temporally altered expression patterns of stress-related genes. Thus, transcription factors are key players in fine-tuning stress responses at the molecular level (Singh ; Eulgem, 2005). A large part of a plant’s genome is devoted to transcription. With the recent completion of the soybean genome sequencing and assembly, a comparative analysis of putative transcription factor-encoding genes found in both soybean and the model dicot Arabidopsis thaliana can be performed. In the leguminous plant (whose genome is six times larger than that of A. thaliana), over 5,600 transcription factors were identified, these corresponding to about 12% of the predicted protein-coding loci (Schmutz ). In contrast, in the model plant the total number of transcription factors (∼2,300) comprises only up to 7% of the predicted protein-coding loci (Singh ). The overall distribution of these genes among known transcription-factor families is similar among the two genomes, although some families are relatively sparser or more abundant in soybean. Thus, even though the A. thaliana genome often serves general comparisons, differences in biological function between species might occur (Schmutz ). Basic helix-loop-helix (bHLH) proteins constitute one of the largest families of transcription factors. They are found in all three eukaryotic kingdoms and are involved in a myriad of regulatory processes. Members of this family share the bHLH signature domain, which consists of ∼60 amino acids comprising two distinct regions, a basic stretch at the N-terminus consisting of ∼15 amino-acids involved in DNA binding, and a C-terminal region of ∼40 amino-acids composed of two amphipathic α-helices, mainly consisting of hydrophobic residues linked by a variable loop (the “helix-loop-helix” region). This region is responsible for promoting protein-protein interactions through the formation of homo- and hetero-dimeric complexes (Toledo-Ortiz ; Carretero-Paulet ; Pires and Dolan, 2010). The Lc protein from Zea mays, reported as a transcriptional activator in the anthocyanin biosynthetic pathway (Ludwig ), was the first plant bHLH member identified. The involvement of bHLH members in plant developmental processes (Szecsi ; Menand ), light perception (Liu ), iron and phosphate homeostasis (Yi ; Long ; Zheng ), and phytohormone signalling pathways (Abe ; Friedrichsen ; Lorenzo ; Anderson ; Fernandez-Calvo ; Hiruma ; Seo ) has also been reported. In fact, Arabidopsis MYC2 is to date the most extensively characterized plant bHLH transcription factor, and it seems to be a global regulator of hormone signalling. MYC2 has been described as an activator of ABA-mediated drought stress-response (Abe , 2003). It also regulates JA/ET-induced genes, either as an activator in response to wounding, or as a suppressor in pathogen responses (Anderson ; Lorenzo ; Hiruma ). In these cases, the activity of MYC2 is itself subject to regulation by JAZ proteins, in a SCFCOI1 proteosome degradation – dependent pathway (Chini ). Additionally, MYC2 seems to form homo- and heterodimers with two other closely-related bHLH proteins (MYC3 and MYC4), and their interaction is essential for full regulation of JA responses in Arabidopsis (Fernandez-Calvo ). Trihelix-GT factors constitute another family of plant-specific transcription factors. They are characterized by binding specificity for GT-elements present in the promoter region of many plant genes (Hiratsuka ; Nagano ) and are among the first transcription factors identified in plants (McCarty and Chory, 2000). They share one or two trihelix (helix – loop – helix – loop –helix) structures, each consisting of three putative α-helices, which are responsible for binding to DNA (Zhou, 1999). Dimerization of GT factors, or interaction between trihelix-GT and other transcription factors appear to play a major role in the regulatory function of this family (Zhou, 1999). In addition, recent studies demonstrated that post-translational modifications may occur in at least some GT-factors, as shown for Arabidopsis light-responsive GT-1 (Maréchal ; Nagata ). Members of the trihelix-GT family were first described as being involved in the regulation of light-responsive genes (Green , 1988). Nevertheless, further studies in rice and Arabidopsis showed that some GT factors are not light-responsive at the transcriptional level (Dehesh ; Kuhn ). The involvement of this family in seed maturation (Gao ), control of flower morphogenesis (Griffith ; Brewer ; Li ), and response to environmental cues (O’Grady ; Park ; Wang ; Xie ; Fang ) has also been reported. In recent years, a growing number of transcription factors belonging to families, such as AP2, NAC and WRKY, have been connected to the responses of soybean against environmental stresses (Zhang ; Pinheiro, 2009; Zhou, 2008). In addition, the involvement of two soybean trihelix-GT factors [GmGT-2A (Glyma04g39400) and GmGT-2B (Glyma10g30300)] in abiotic stress tolerance has recently been proposed, following heterologous expression in Arabidopsis (Xie, 2009). Nevertheless information regarding soybean bHLH and trihelix-GT members and their roles in this species remains scarce. In the present study we, therefore, aimed at identifying soybean bHLH-and trihelix-GT-encoding genes, as well as investigating their involvement in response to environmental stresses. Given the dimension of the bHLH family in plants (with more than 600 members in Arabidopsis divided into 32 groups), we decided to focus on a single monophyletic group (subfamily 25, Carretero-Paulet ), once we had found some interesting soybean candidates within the LGE Soybean Genome database (Nascimento ) that belong to this group. At least 45 bHLH (from subgroup 25) and 63 trihelix-GT putative genes reside in the soybean genome. Among these, at least 14 bHLH and 11 trihelix-GT seem to be involved in responses to abiotic/biotic stresses. A phylogenetic analysis allowed us to successfully cluster these genes with members of bHLH and trihelix-GT proteins from other plant species. All together, our results represent an important step towards understanding the molecular mechanisms by which soybean responds to environmental cues.

Material and Methods

Sequence identification and annotation

In order to identify putative soybean bHLH sequences, the TAIR (The Arabidopsis Information Resource) gene id from all 17 bHLH proteins belonging to subgroup 25 in Arabidopsis was used to search the soybean database in Phytozome and at JGI (Joint Genome Institute). Soybean peptide homologs for each A. thaliana sequence were identified from a BLASTP search with default parameters in Phytozome and redundant sequences were manually discarded. The protein sequences obtained were scanned for the existence of the bHLH domain using the SMART database. The software MEME (multiple EM for motif elicitation) version 4.4.0 was used for motif identification, using the following parameters: minimum and maximum motif width set to 6 and 50 amino acids, respectively, with any number of motif repetitions. Motif detection was restricted to a maximum of 10. Identified motifs were also compared with conserved compositions already described for bHLH sequences. In addition, the bHLH domain was manually delimited according to plant-specific boundaries, as determined by Toledo-Ortiz and Carretero-Paulet . Classification of soybean sequences in subgroup 25 was accomplished by mismatch counting from the consensus established for A. thaliana (Carretero-Paulet ). Sequences with more than 8 mismatches in conserved positions were discarded. Moreover, no mismatches were allowed at residues H9, E13 and R16 of the basic region, since these are crucial for DNA-binding activity, and a consensus among subgroup 25 sequences. The identification of putative trihelix-GT protein sequences from soybean was accomplished as follows: the conserved trihelix sequence of previously reported soybean genes (O’Grady ; Xie ) along with motifs predicted for this family (Fang ), were blasted (TBLASTN) against the soybean genome in Phytozome. All homologous sequences with an E-value of less than 0.0001 were scanned for the existence of the trihelix domain using SMART (domains with less significant scores than default cut-offs were also analyzed). Motif identification and comparison with conserved trihelix-GT compositions were performed using MEME. Sequences that did not fit these criteria were removed from the analysis. To determine the intron-exon organization of all bHLH and trihelix-GT genes, the full length coding sequences were aligned with the corresponding genomic sequences available on Phytozome. Intron-exon maps of the genes were drawn using Fancy Gene v1.4 software.

Gene expression data mining

Expression profiles of the identified bHLH and trihelix-GT sequences in both biotic and abiotic situations were obtained by mining the LGE Soybean Genome database. A “gene” search was carried out using Phytozome’s gene model codes and each gene had its 5′ and 3′ untranslated regions verified in Gbrowse. Gene expression was confirmed by database searches in NCBI ESTs and LGE superSAGE stress experiments with soybean leaves infected with Asian soybean rust (accession PI 561356, resistant) vs. uninfected leaves, and soybean roots subjected to drought (cultivar BR16, susceptible / cultivar Embrapa-48, tolerant) vs. untreated roots from both cultivars.

Phylogenetic analysis

The phylogenetic analysis of plant trihelix-GT factors was performed using protein sequences from A. thaliana, G. max, Medicago truncatula and Oryza sativa. For plant bHLH transcription factors, protein sequences from A. thaliana, G. max, O. sativa and Physcomitrella patens were used. In both cases, multiple sequence alignments were conducted with full-length protein sequences using the CLUSTALW tool (Thompson ) implemented in MEGA ver. 4.0 (Tamura ). The phylogenetic analysis was performed by two different and independent approaches, viz. the neighbor-joining (NJ) and Bayesian methods. The NJ method was performed within MEGA v4.0. Molecular distances of the aligned sequences were calculated according to the p-distance parameter, with gaps and missing data treated as pairwise deletions. Branch points were tested for significance by bootstrapping with 1000 replications. Bayesian analysis was conducted in MrBayes 3.1.2 software (Huelsenbeck ; Ronquist and Huelsenbeck, 2003) with the mixed amino-acid substitution model + gamma + invariant sites. Two independent runs of 5,000,000 generations each, with two Metropolis-coupled Monte Carlo Markov chains (MCMCMC) were run in parallel, each one starting from a random tree. Markov chains were sampled every 100 generations and the first 25% of the trees were discarded as burn-in. The remaining ones were used to compute the majority rule consensus tree (MrBayes command allcompat), and the posterior probability of clades and branch lengths. The unrooted phylogenetic trees of trihelix-GT and bHLH proteins were visualized and edited using the software FigTree ver. 1.3.1.

Results and Discussion

Identification and analysis of soybean bHLH-encoding genes

In the past few years several phylogenetic studies have emerged as attempts to perform the classification of bHLH proteins in plants (Heim ; Toledo-Ortiz ; Carretero-Paulet ; Pires and Dolan, 2010). Nevertheless, the number of proposed subfamilies varies considerably among these studies. In the present one, the classification suggested by Carretero-Paulet proposing the division of plant bHLH transcription factors into 32 subfamilies was used, since it represents the most recent and comprehensive study, so far. From the BLASTP search at Phytozome, using all 17 Arabidopsis bHLH protein sequences from subgroup 25, 67 non-redundant homolog peptides were identified in the soybean genome. Seven of these were removed from the analysis as they did not contain any bHLH domain. Another 15 sequences were discarded after mismatch counting performed with their aligned domains. Using MEME, two other highly conserved motifs (with E-values of less than 1.7 e−851) were identified among the soybean subgroup 25 sequences. They are formed by residues right adjacent to the bHLH domain and had been previously reported (Heim ; Li ; Carretero-Paulet ; Pires and Dolan, 2010). General characteristics related to the 45 remaining putative soybean bHLH genes are shown in Table 1. Remarkably, members of this subgroup were found spread throughout the 20 soybean chromosomes, with protein sequences ranging from 165 to 691 amino acids. Among the 45 annotated ORFs, 42 presented corresponding ESTs, suggesting that they are expressed genes and not pseudogenes. A complete overview of the gene expression results obtained for this group is presented in Figure 1. Differential expression in at least one of the stress situations/experiments available in LGE database was detected for 14 ORFs, four of these were differentially expressed in more than one situation and three respond to both abiotic and biotic stresses.
Table 1

Annotation of soybean bHLH (subgroup 25) encoding-genes.

Accession number in PhytozomeChromosomeORF (bp)Expression confirmed by EST (GenBank Accession)
Glyma01g046101795BE021678.1
Glyma01g0940011587BU765737.1
Glyma01g394501667AW782148.1
Glyma02g1386021539BI786324.1
Glyma02g161102861AW460021.1
Glyma03g2177031575FK005566.1
Glyma03g2971031203BI427219.1
Glyma03g315103879BW666688.1
Glyma03g3274031446BM732402.1
Glyma04g0140041293CA853113.1
Glyma04g050904855FK457664.1
Glyma04g346604732FG990727.1
Glyma04g3769041041CA937888.1
Glyma05g015905675EV276804.1
Glyma05g350605741BE473364.1
Glyma05g3845051029BF325330.1
Glyma06g0143061173BU551063.1
Glyma06g1742061050FG995242.1
Glyma06g200006810CO978579.1
Glyma07g103107498BE347561.1
Glyma08g012108942FG994001.1
Glyma08g046608528-
Glyma08g4604081761BM885094.1
Glyma09g1438091473CA936197.1
Glyma09g315809906-
Glyma10g0369010852BW657011.1
Glyma10g04890101302BI785116.1
Glyma10g12210101074CO978592.1
Glyma10g28290102076BW675573.1
Glyma10g3043010987FG999826.1
Glyma11g05810111146GR843316.1
Glyma11g12450111263BU082612.1
Glyma12g04670121215BE661807.1
Glyma13g19250131437BQ741548.1
Glyma14g10180141269EV269688.1
Glyma15g33020151428BI699764.1
Glyma16g10620161788FK024158.1
Glyma17g08300171098CX708610.1
Glyma17g1029017690FG993937.1
Glyma17g3401017807-
Glyma18g32560181743BI317112.1
Glyma19g32570191101FG996268.1
Glyma19g3436019879GR826097.1
Glyma20g22280201281BE658194.1
Glyma20g3677020999BE474708.1
Figure 1

Expression pattern of bHLH encoding-genes under drought stress and P. pachyrhizi infection. The expression data were obtained from superSAGE experiments available at www.lge.ibi.unicamp.br/soja/. Blocks indicate up-regulation (red), down-regulation (green), non-significant differences (p > 0.05) but expression detected (blue), and expression not detected (white). Contrasting expression might reflect detection of a single gene by different tags. Drought stress was carried out in roots from Embrapa-48 (tolerant cultivar) and BR 16 (susceptible cultivar). Soybean leaves from PI561356 (resistant genotype) were infected with P. pachyrhizi.

Lately, a growing number of studies accessing the functional role of specific plant bHLH transcription factors have been reported (Friedrichsen ; Szécsi ; Liu ; Chandler ; Todd ; Zheng ). Nevertheless, a deeper (and broader) functional characterization of this family, focusing on the connection of members/subgroups to the biological processes they control, remains to be done. A first step in this direction has been recently taken by Carretero-Paulet and Pires and Dolan (2010), where comprehensive information relating both classification and function of previously characterized plant bHLH transcription factors was assembled. More specifically, information regarding the function of subgroup 25 members is still scarce and concerns Arabidopsis members only. An alternative transcript of At1g59640 (ZCW32/BPE) seems to be involved in the control of petal size, whereas its counterpart is expressed ubiquitously (Szécsi ). Furthermore, At4g34530 (CIB1) and At1g26260 (CIB5) were shown to interact with blue-light receptor CRY2 and promote floral initiation (Liu ). Of most interest for this study, is the redundant role of At1g18400 (BEE1, Brassinosteroid Enhanced Expression1), At4g36540 (BEE2) and At1g73830 (BEE3) in brassinosteroids (BRs)/ABA antagonistic cross-talk during cell elongation (Friedrichsen ). According to these authors, BEE1, 2 and 3 are early-response genes induced by BRs through the BRI1 receptor complex, and their expression is repressed by ABA through a yet unknown ABA receptor. Whether this pathway is also related to the ABA-dependent stress-responsive network, still requires further study. Moreover, Poppenberger have demonstrated that At1g25330 (CESTA), a close homolog of BEE1 and BEE3 (Figure 2), is also involved in BR signaling, possibly by heterodimerization with its closest homologs. Remarkably, it has also been shown that lack of CESTA activity results in the misregulation of genes that are not only BR-responsive but also stress-responsive, such as Arabidopsis ERD5 (Early Responsive to Dehydration 5), TTL4 (Tetratricopetide-Repeat Thioredoxin-Like 4), WRKY18 and a putative LRR-disease resistance protein (Poppenberger ), further suggesting that these pathways might indeed share common features.
Figure 2

Phylogenetic relationships among bHLH subgroup 25 members. The phylogenetic tree shown on the left comprises 89 plant bHLH protein sequences. The Bayesian analysis was conducted using Mr.Bayes v3.1.2, after alignment of full-length bHLH proteins from selected plant species by means of ClustalW. The unrooted cladogram was edited using Fig Tree v1.3.1 software. Nodal support is given by posteriori probability values shown next to the corresponding nodes. The scale bar indicates the estimated number of amino acid substitutions per site. The gray area denotes a specific soybean cluster. Previously reported bHLH genes were identified according to their accession/locus numbers, the other genes were designated according to their locus ID in Phytozome. A. thaliana (At); G. max (Glyma); O. sativa (LOC_Os) and P. patens (Pp). The graph on the right shows gene organization of full-length coding sequences from 89 plant bHLHs. Intron-exon maps were drawn using Fancy Gene v1.4 software, according to sequence data available in Phytozome.

As an attempt to predict gene function of the annotated genes, a comparison of their amino-acid sequences with subgroup 25 bHLH protein sequences from three other model plant species was carried out. Indeed, representative members from diverse taxonomic groups (P. patens, bryophytes; O. sativa, monocotyledonous; and A. thaliana, dicotyledonous) were included in the phylogenetic analysis in order to access the evolutionary features of this subgroup. The results obtained from the phylogenetic analysis proved to be consistent, since the clades formed were highly supported by posteriori probabilities (Figure 2, on left) and bootstrap (data not shown) analyses. Unlike previous phylogenetic reconstructions of the bHLH family that used the bHLH domain only, this study presents a tree reconstructed from full-length protein sequences. This adds accuracy and reliability to the tree resolution, since the short length of the bHLH domain (∼60 amino-acids), along with its extremely high conservation within subgroups may compromise the reliability of the analysis (Amoutzias ). Patterns of intron distribution among bHLH-encoding genes from diverse species were shown to be conserved within subgroups and provide another criterion in phylogenetic analysis (Li ; Carretero-Paulet ). In this study, the overall intron-exon organization of bHLH subfamily 25-encoding genes from soybean and other three species was established (Figure 2, on right). Among 89 sequences, the number of introns ranged from 1 (Pp1s270_17v6) to up to 12 (LOC_Os03g12940), and in many cases, phylogenetically related proteins exhibited a closely related gene structure, corroborating the clustering results. Since it is a basal species among land plants, the moss P. patens was added to this classification in order to help infer about this group’s ancestral state (Rensing ). Notably, all 12 members from P. patens grouped together into a clade, instead of grouping with the other plant species, indicating that the radiation within this subgroup has occurred independently in mosses and vascular plants, after the divergence of these taxonomic groups. The same result was obtained by Carretero-Paulet , even when a different method was applied [maximum likelihood (ML) analysis from bHLH-domain alignments]. Nevertheless, the chance that genes belonging to this subgroup might have independently evolved similar functions in both mosses and vascular plants should not be discarded, as suggested by Menand . In fact, while studying plant bHLH ancestry, Pires and Dolan (2010) concluded that the complex regulatory machinery that may be observed in modern plant lineages actually arose early in plant evolution. The most striking feature that can be inferred from our phylogenetic analysis, which is in accordance with other previously published plant bHLH phylogenies (mentioned above), is the importance of gene duplication during the evolution of this family as a whole. Recurring events of single-gene duplications (“birth-and-death evolution”), combined with domain shuffling seem to rule bHLH evolution and diversification (Morgenstern and Atchley, 1999; Amoutzias ; Nei and Rooney, 2005). Furthermore, whole genome duplication (WGD) events also seem to have had an active effect (as seen in the outer clades in Figure 2, on the left), and this seems to be even more intense in the soybean genome. According to our results, the subgroup in question encompasses almost three times more members in soybean than in Arabidopsis or rice (Table 1), with many of these grouping into new clades with no apparent near orthologs in the other analyzed species (Figure 2, in gray on the left side). Indeed, soybean suffered from two WGD events with an impressive retention of homologous blocks (Schmutz ). Furthermore, specifically in the case of transcription factors (and other genes working in complex networks), duplications resulting from WGD events are vastly overretained, simply because they may be too costly to be removed, thus making functional redundancy a common feature among transcription factors, especially in plant species. Once retained, homologous duplicates might diverge in function or even subfunctionalize (Freeling, 2009), thus providing a source of evolutionary novelty in the form of new regulatory networks (Carretero-Paulet ). With all that in mind, an integrated analysis of both the expression profile (Figure 1) and the phylogeny (Figure 2) presented herein provides a hint at the roles of subgroup 25 bHLH soybean genes. By focusing on soybean-near homologs shown in the tree (Figure 2 on left) we could see that for most of the paralogs whose expression has been detected, a divergent profile seems to prevail. An exception would be the cases of Glyma03g31510 and Glyma19g34360, which were both repressed during drought stress, with a broadly negative response in the latter, as its mRNA levels were down-regulated in both the susceptible and the tolerant cultivars analyzed. Moreover, the transcripts from Glyma19g32570 were up-regulated during ASR infection in the resistant genotype, whereas its counterpart Glyma03g29710 exhibited opposite differential expression. The near paralogs Glyma05g01590 and Glyma17g10290 also seem to be moving in different directions. Whereas the first seems to be up-regulated in response to fungal stress, the latter seems to be broadly down-regulated, in both susceptible and tolerant cultivars submitted to drought, as well as in P. pachyrhizi’s infection. Furthermore, while Glyma15g33020 seems to be positively involved in soybean defense against ASR and during drought stress in tolerant Embrapa-48 cultivar, its nearest paralog (Glyma09g14380) was not differentially expressed in any of the situations assessed, and their near homolog Glyma17g08300 seems to be negatively involved in drought stress responses, since it was down-regulated in the same cultivar. Whether the examples mentioned above reflect functional divergence or subfunctionalization among duplicate homologs still requires further analysis. Even though comparison of soybean genes with their orthologs in other species (such as Arabidopsis) is a tentative approach, and as such needs to be performed carefully. In this context it would be interesting to address the function of BEE orthologs in soybean, so as to determine whether they are similar to their Arabidopsis counterparts, and whether they somehow connected to stress responses. In this respect, special attention should be given to Glyma05g35060, which clustered together with the Arabidopsis BR-responsive genes, and whose transcripts turned out to be up-regulated in Embrapa-48 tolerant cultivar in response to drought.

Identification and analysis of soybean trihelix-GT encoding genes

The first isolated and described soybean GT-factor was GmGT-2 (Glyma02g09060), which binds to an element within the Aux28 promoter, and whose mRNA levels were down-regulated by light in a phytochrome-dependent manner (O’Grady ). In a global approach using massive EST analysis, Tian identified 13 putative trihelix genes in the soybean genome. Two of these [GmGT-2A (Glyma04g39400) and GmGT-2B (Glyma10g30300)] were cloned and had their roles in abiotic stress tolerance described using transgenic Arabidopsis plants (Xie ). The current annotation analysis indicates the occurrence of at least 63 GT-like genes in the soybean genome. 56 of these had their expression confirmed in the NCBI databases (Table 2). Unfortunately, since information available in Phytozome is not yet definitive, full-length cDNAs were not obtained for most sequences, so only gene-models were considered for this analysis. The 63 soybean trihelix-GT genes encode proteins with lengths ranging from 201 to 885 amino acids, distributed across most of the soybean chromosomes, except for chromosomes 5 and 14. There is an average of 3.5 GT-factor-encoding genes per chromosome, with the highest number of 9 genes found in chromosome 10, whereas a single member was detected in chromosomes 12 and 17, respectively. Three genes (Glyma09g19750, Glyma10g34610 and Glyma20g30630) with incorrect gene model predictions were manually curated.
Table 2

Annotation of soybean trihelix-GT encoding-genes.

Accession number in Phytozome (gene)ChromosomeORF (bp)Expression confirmed by EST (GenBank Accession)
Glyma01g297601819BW682708.1
Glyma01g353701834GR826253.1
Glyma02g0905021653FG988995.1
Glyma02g09060 (GmGT-2)21896AF372498.1
Glyma03g187503765DB957166.1
Glyma03g3473031368FK016354.1
Glyma03g075903822-
Glyma03g3496031617BE555145.1
Glyma03g4061031626-
Glyma04g3702042217CO982525.1
Glyma04g39400 (GmGT-2A)41335AI900211.1
Glyma06g1550061494BW678214.1
Glyma06g1798062655EH258249.1
Glyma07g0479071107CO981809.1
Glyma07g0969071083BM731493.1
Glyma07g183207876-
Glyma08g056308942AW351117.1
Glyma08g288808981CO979268.1
Glyma09g016709918FK019218.1
Glyma09g1975091155BE659959.1
Glyma09g3213091014GR829369.1
Glyma09g380509969AI460860.1
Glyma10g36980101335BU765094.1
Glyma10g07490101494GD961953.1
Glyma10g34520101374BE820805.1
Glyma10g36950101350BU549085.1
Glyma10g36960102004BW666798.1
Glyma10g07730101785FG992486.1
Glyma10g30300 (GmGT-2B)101746CA953306.1
Glyma10g34610101017-
Glyma10g4462010978GR827102.1
Glyma11g25570111026CO979922.1
Glyma11g37390111125BI317190.1
Glyma12g3385012924CD415252.1
Glyma13g21350131410CX708572.1
Glyma13g2655013957BI702330.1
Glyma13g3028013939DB955747.1
Glyma13g21370131464CO981764.1
Glyma13g3665013921CA800657.1
Glyma13g41550131221GD834531.1
Glyma13g43650131014EV282528.1
Glyma15g03850151233BF068981.1
Glyma15g0889015603BM085616.1
Glyma15g1259015696-
Glyma15g01730151113GD914877.1
Glyma16g01370161113CA801229.1
Glyma16g1404016801CO980073.1
Glyma16g28240161785FK012336.1
Glyma16g28250161395BQ296282.1
Glyma16g28270161332-
Glyma17g13780172433BQ273464.1
Glyma18g01360 (GmGT-1)181131BG406222.1
Glyma18g4319018879-
Glyma18g5179018990BQ786728.1
Glyma19g37410191359GR845650.1
Glyma19g37660191641BF066376.1
Glyma19g43280191803FK019637.1
Glyma20g30630201338BG726775.1
Glyma20g30640201935BW679178.1
Glyma20g30650201893EH261764.1
Glyma20g32940201572FG988154.1
Glyma20g36680201773BE607585.1
Glyma20g3941020960BI699475.1
Mining the LGE gene expression superSAGE experiments revealed that 11 soybean trihelix-GT genes were differentially expressed in the abiotic/biotic conditions tested (Figure 3). In accordance with our analyses, five trihelix-GT genes were up-regulated under drought in the tolerant cultivar (Embrapa-48), whereas only two genes were down-regulated in this genotype. In the susceptible cultivar (BR16), Glyma10g34520 had its transcript levels increased in response to water deficit and the opposite situation occurred with Glyma10g36950. When plants were infected with P. pachyrhizi, only two genes displayed up-regulation of mRNA levels in response to biotic stress whereas two others seemed to be down-regulated. Interestingly, none of the soybean trihelix-GT previously reported as responsive to stress conditions and particularly to abiotic stress [GmGT-2A (Glyma04g39400) and GmGT-2B (Glyma10g30300)] were detected in the superSAGE experiments herein assessed. Divergence in experimental parameters and genotypes used might explain this unexpected result.
Figure 3

Expression pattern of trihelix-GT encoding-genes under drought stress and P. pachyrhizi infection. The expression data were obtained from superSAGE experiments available at www.lge.ibi.unicamp.br/soja/. Blocks indicate up-regulation (red), down-regulation (green), non-significant differences (p > 0.05) but expression detected (blue), and expression not detected (white). Contrasting expression might reflect detection of a single gene by different tags. Drought stress was carried out in roots from Embrapa-48 (tolerant cultivar) and BR 16 (susceptible cultivar). Soybean leaves from PI561356 (resistant genotype) were infected with P. pachyrhizi.

Transcript levels from Glyma01g35370 and Glyma20g30640 increased when plants were infected with ASR, while the opposite situation occurred with Glyma16g28240 and Glyma17g13780 mRNA levels. A rice GT-factor (OsRML1) was already reported to be upregulated in response to Magnaporthe grisea (Wang ), which corroborates a connection between pathogen attack and trihelix-GT gene regulation. It is also possible that Glyma01g35370 may be involved in plant responses to both abiotic and biotic stresses, since the gene expression profile was modulated during water deficit and P. pachyrhizi infection. The superSAGE experiments suggested that, at least in some cases, the same gene has variable transcript levels in different cultivars and/or in response to different stresses or agents. For example, when water deficit was imposed on soybean plants, Glyma10g36950 was down-regulated in the susceptible (BR16) and the tolerant (Embrapa-48) cultivars, whereas its transcript levels did not change in response to ASR. In another case, Glyma09g38050 was up-regulated in response to drought stress in Embrapa-48, but no differences were detected in BR16. Furthermore, Glyma13g26550 was down-regulated in response to drought stress in the tolerant cultivar, whereas its expression in cultivar BR16 did not exhibit any alterations. In these cases, in addition to differential gene regulation, there may be other factors contributing to distinct regulatory function, such as post-translational modifications or variation in dimerization partners (Zhou, 1999). Modifications in individual cis-regulatory elements on trihelix-GT promoter regions of duplicated genes might lead to the processes of transcriptional neofunctionalization or subfunctionalization (Haberer ), which may explain gene induction or repression without any counterpart response during the same stimuli. This seems to be the case for Glyma03g07590 and its nearest paralog Glyma01g29760, or for Glyma16g28240 and the phylogenetically related Glyma02g09050. Further studies focusing on identifying cis-elements, as well as performing promoter analyses to verify inducible expression patterns may clarify the involvement of duplicated genes in stress-related responses. A previous study regarding the phylogenetic analysis encompassing Arabidopsis and rice GT factors (Fang ) showed that this family could be classified into three subfamilies (α, β and γ), with unique composition of predicted motifs. Unfortunately, these results were not reproduced in our analysis, even when full-length protein sequences (Figure 4) or the trihelix domains alone were aligned (data not shown). An exception occurred with subfamily γ, which had already been described as having low sequence similarity with the other reported GT factors. The introduction of soybean and M. truncatula sequences in the phylogeny might have affected the expected distribution within those subgroups. Besides, we also inserted into our tree the soybean gene AAK69274 described by Fang , which could neither be identified in the soybean genome nor detected in the expression database. According to our analysis, this unexpected result seems to indicate the occurrence of an alternative splicing in Glyma19g37410 or Glyma03g34730, both considered to be phylogenetically closest to the unidentified gene locus.
Figure 4

Bayesian phylogenetic tree of 137 plant trihelix-GT proteins. The Bayesian analysis was conducted using Mr.Bayes v3.1.2 software after alignment of full-length trihelix-GT proteins from selected plant species using ClustalW. The unrooted cladogram was edited using Fig Tree ver. 1.3.1 software. Nodal support is given by posteriori probability values shown next to the corresponding nodes. The scale bar indicates the estimated number of amino acid substitutions per site. The gray area denotes GTγ subfamily described by Fang . Previously reported GT factors were identified according to their accession/locus numbers, the other genes were designated according to their locus ID at Phytozome. A. thaliana (At); G. max (Glyma); Medicago truncatula (Medtr) and O. sativa (LOC_Os).

Hence, when taking into account the full-length protein sequence, the GT-factor family might be divided into two subgroups, in one of these subgroups a branch corresponded to the already described subfamily γ (Figure 4, in gray). Despite the fact that subfamilies α and β were not distinguished, other probabilities supported our tree, especially when inner nodes were observed. When gene organization among Arabidopsis and soybean sequences was compared (Figure 5), the number of introns ranged from zero (twenty three genes) up to 16 (At5g63420 and Glyma06g17980), and some phylogenetically close sequences showed the same gene structure. For example, the Arabidopsis At3g10040 and its soybean ortholog do not have intron, whereas At2g33550 and related members have two introns, with remarkable differences in intron size.
Figure 5

Gene organization of phylogenetically related full-length coding sequences from Arabidopsis and soybean trihelix-GT transcription factors. Intron-exon maps were drawn using Fancy Gene ver. 1.4 software.

As observed for bHLH transcription factors, the soybean GT factor family encompasses almost three times more members than Arabidopsis or rice, a consequence of the WGD events that took place during plant evolution. In several cases, soybean paralogs clustered with one M. truncatula gene, indicating that these paralogs probably derived from a WGD event that occurred after the divergence of the two legume species. Similarly, Schmutz refer to a Glycine-specific WGD event, estimated to have occurred about 13 million years ago. However, the possibility that extra M. truncatula orthologs might arise upon the completion of its genome sequencing should not be discarded. Recently, the OsGTγ subfamily was proposed to participate in the regulation of stress tolerance in rice (Fang ). OsGTγ -1 showed more specific expression pattern than their counterparts OsGTγ-2 and OsGTγ-3, which are supposedly redundant. None of them was responsive to light, but their transcript levels increased in response to salt and cold stresses, whereas OsGTγ-1 was upregulated by ABA and SA stimulus. It is possible that some soybean members of this subfamily may act in response to stressor agents, but more studies are required in order to understand whether the pattern seen in rice GTγ factors also occurs in soybean and M. truncatula. Our analysis, so far, does not indicate their involvement in an abiotic and/or biotic stress response. Moreover, soybean genes previously reported as involved in stress responses (Xie ) together with other genes herein identified are dispersed in different tree branches, indicating that this family is in fact evolutionarily diversified.

Conclusion

The present study identified new members of soybean bHLH and trihelix-GT transcription factor families, some of which seem to be involved in responses to environmental stresses. It also emphasizes the role of duplication events in the expansion and evolution of soybean transcription factor families, indicating that exciting new layers of complexity might exist in this species’ regulatory mechanisms, including biotic and abiotic stress responses.
  63 in total

1.  Regulatory mechanism of plant gene transcription by GT-elements and GT-factors.

Authors: 
Journal:  Trends Plant Sci       Date:  1999-06       Impact factor: 18.313

2.  Trihelix DNA-binding protein with specificities for two distinct cis-elements: both important for light down-regulated and dark-inducible gene expression in higher plants.

Authors:  Y Nagano; T Inaba; H Furuhashi; Y Sasaki
Journal:  J Biol Chem       Date:  2001-04-11       Impact factor: 5.157

3.  Genome-wide classification and evolutionary analysis of the bHLH family of transcription factors in Arabidopsis, poplar, rice, moss, and algae.

Authors:  Lorenzo Carretero-Paulet; Anahit Galstyan; Irma Roig-Villanova; Jaime F Martínez-García; Jose R Bilbao-Castro; David L Robertson
Journal:  Plant Physiol       Date:  2010-05-14       Impact factor: 8.340

4.  Genome-wide analysis of basic/helix-loop-helix transcription factor family in rice and Arabidopsis.

Authors:  Xiaoxing Li; Xuepeng Duan; Haixiong Jiang; Yujin Sun; Yuanping Tang; Zheng Yuan; Jingkang Guo; Wanqi Liang; Liang Chen; Jingyuan Yin; Hong Ma; Jian Wang; Dabing Zhang
Journal:  Plant Physiol       Date:  2006-08       Impact factor: 8.340

5.  Molecular dissection of GT-1 from Arabidopsis.

Authors:  K Hiratsuka; X Wu; H Fukuzawa; N H Chua
Journal:  Plant Cell       Date:  1994-12       Impact factor: 11.277

6.  The Arabidopsis bHLH transcription factors MYC3 and MYC4 are targets of JAZ repressors and act additively with MYC2 in the activation of jasmonate responses.

Authors:  Patricia Fernández-Calvo; Andrea Chini; Gemma Fernández-Barbero; José-Manuel Chico; Selena Gimenez-Ibanez; Jan Geerinck; Dominique Eeckhout; Fabian Schweizer; Marta Godoy; José Manuel Franco-Zorrilla; Laurens Pauwels; Erwin Witters; María Isabel Puga; Javier Paz-Ares; Alain Goossens; Philippe Reymond; Geert De Jaeger; Roberto Solano
Journal:  Plant Cell       Date:  2011-02-18       Impact factor: 11.277

7.  OsbHLH148, a basic helix-loop-helix protein, interacts with OsJAZ proteins in a jasmonate signaling pathway leading to drought tolerance in rice.

Authors:  Ju-Seok Seo; Joungsu Joo; Min-Jeong Kim; Yeon-Ki Kim; Baek Hie Nahm; Sang Ik Song; Jong-Joo Cheong; Jong Seob Lee; Ju-Kon Kim; Yang Do Choi
Journal:  Plant J       Date:  2011-02-18       Impact factor: 6.417

8.  Origin and diversification of basic-helix-loop-helix proteins in plants.

Authors:  Nuno Pires; Liam Dolan
Journal:  Mol Biol Evol       Date:  2009-11-25       Impact factor: 16.240

9.  Identification of a novel iron regulated basic helix-loop-helix protein involved in Fe homeostasis in Oryza sativa.

Authors:  Luqing Zheng; Yinghui Ying; Lu Wang; Fang Wang; James Whelan; Huixia Shou
Journal:  BMC Plant Biol       Date:  2010-08-11       Impact factor: 4.215

10.  A web-based bioinformatics interface applied to the GENOSOJA Project: Databases and pipelines.

Authors:  Leandro Costa do Nascimento; Gustavo Gilson Lacerda Costa; Eliseu Binneck; Gonçalo Amarante Guimarães Pereira; Marcelo Falsarella Carazzolle
Journal:  Genet Mol Biol       Date:  2012-06       Impact factor: 1.771

View more
  15 in total

1.  Genome-wide characterization and expression analysis of common bean bHLH transcription factors in response to excess salt concentration.

Authors:  Musa Kavas; Mehmet Cengiz Baloğlu; Elif Seda Atabay; Ummugulsum Tanman Ziplar; Hayriye Yıldız Daşgan; Turgay Ünver
Journal:  Mol Genet Genomics       Date:  2015-07-21       Impact factor: 3.291

2.  In silico identification of transcription factors in Medicago sativa using available transcriptomic resources.

Authors:  Olga A Postnikova; Jonathan Shao; Lev G Nemchinov
Journal:  Mol Genet Genomics       Date:  2014-02-21       Impact factor: 3.291

3.  Mining whole genomes and transcriptomes of Jatropha (Jatropha curcas) and Castor bean (Ricinus communis) for NBS-LRR genes and defense response associated transcription factors.

Authors:  Archit Sood; Varun Jaiswal; Sree Krishna Chanumolu; Nikhil Malhotra; Tarun Pal; Rajinder Singh Chauhan
Journal:  Mol Biol Rep       Date:  2014-08-09       Impact factor: 2.316

4.  NaCl stress induces CsSAMs gene expression in Cucumis sativus by mediating the binding of CsGT-3b to the GT-1 element within the CsSAMs promoter.

Authors:  Li-Wei Wang; Mei-Wen He; Shi-Rong Guo; Min Zhong; Sheng Shu; Jin Sun
Journal:  Planta       Date:  2017-01-10       Impact factor: 4.116

5.  Comparisons of the Effects of Elevated Vapor Pressure Deficit on Gene Expression in Leaves among Two Fast-Wilting and a Slow-Wilting Soybean.

Authors:  Mura Jyostna Devi; Thomas R Sinclair; Earl Taliercio
Journal:  PLoS One       Date:  2015-10-01       Impact factor: 3.240

6.  The bHLH transcription factor CgbHLH001 is a potential interaction partner of CDPK in halophyte Chenopodium glaucum.

Authors:  Juan Wang; Gang Cheng; Cui Wang; Zhuanzhuan He; Xinxin Lan; Shiyue Zhang; Haiyan Lan
Journal:  Sci Rep       Date:  2017-08-16       Impact factor: 4.379

7.  The Soybean Basic Helix-Loop-Helix Transcription Factor ORG3-Like Enhances Cadmium Tolerance via Increased Iron and Reduced Cadmium Uptake and Transport from Roots to Shoots.

Authors:  Zhaolong Xu; Xiaoqing Liu; Xiaolan He; Ling Xu; Yihong Huang; Hongbo Shao; Dayong Zhang; Boping Tang; Hongxiang Ma
Journal:  Front Plant Sci       Date:  2017-06-28       Impact factor: 5.753

8.  Genome-wide identification and expression analysis of the trihelix transcription factor family in tartary buckwheat (Fagopyrum tataricum).

Authors:  Zhaotang Ma; Moyang Liu; Wenjun Sun; Li Huang; Qi Wu; Tongliang Bu; Chenglei Li; Hui Chen
Journal:  BMC Plant Biol       Date:  2019-08-07       Impact factor: 4.215

Review 9.  Silicon era of carbon-based life: application of genomics and bioinformatics in crop stress research.

Authors:  Man-Wah Li; Xinpeng Qi; Meng Ni; Hon-Ming Lam
Journal:  Int J Mol Sci       Date:  2013-05-29       Impact factor: 5.923

10.  OsASR2 regulates the expression of a defence-related gene, Os2H16, by targeting the GT-1 cis-element.

Authors:  Ning Li; Shutong Wei; Jing Chen; Fangfang Yang; Lingguang Kong; Cuixia Chen; Xinhua Ding; Zhaohui Chu
Journal:  Plant Biotechnol J       Date:  2017-10-10       Impact factor: 9.803

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.