Literature DB >> 22802716

In silico identification of known osmotic stress responsive genes from Arabidopsis in soybean and Medicago.

Nina M Soares-Cavalcanti1, Luis C Belarmino, Ederson A Kido, Ana C Wanderley-Nogueira, João P Bezerra-Neto, Rafaela Cavalcanti-Lira, Valesca Pandolfi, Alexandre L Nepomuceno, Ricardo V Abdelnoor, Leandro C Nascimento, Ana M Benko-Iseppon.   

Abstract

Plants experience various environmental stresses, but tolerance to these adverse conditions is a very complex phenomenon. The present research aimed to evaluate a set of genes involved in osmotic response, comparing soybean and medicago with the well-described Arabidopsis thaliana model plant. Based on 103 Arabidopsis proteins from 27 categories of osmotic stress response, comparative analyses against Genosoja and Medicago truncatula databases allowed the identification of 1,088 soybean and 1,210 Medicago sequences. The analysis showed a high number of sequences and high diversity, comprising genes from all categories in both organisms. Genes with unknown function were among the most representative, followed by transcription factors, ion transport proteins, water channel, plant defense, protein degradation, cellular structure, organization & biogenesis and senescence. An analysis of sequences with unknown function allowed the annotation of 174 soybean and 217 Medicago sequences, most of them concerning transcription factors. However, for about 30% of the sequences no function could be attributed using in silico procedures. The establishment of a gene set involved in osmotic stress responses in soybean and barrel medic will help to better understand the survival mechanisms for this type of stress condition in legumes.

Entities:  

Keywords:  Glycine max; Medicago truncatula; osmotic stress; stress-responsive genes

Year:  2012        PMID: 22802716      PMCID: PMC3392883          DOI: 10.1590/S1415-47572012000200012

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Introduction

In the course of evolution, plants have acquired a myriad of developmental and metabolic strategies to cope with the adverse effects of environmental stresses during vegetative growth and reproduction (Parry ), making stress tolerance a complex phenomenon. Stress perception and the immediate induction of signals that culminate in adaptive responses are key steps leading to plant stress tolerance. Tolerance stress differences between genotypes or different developmental stages of a single genotype may arise from peculiarities in signal perception and transduction mechanisms (Chinnusamy ). Under osmotic stress conditions diverse sets of physiological responses are activated, including metabolic and defense systems used to sustain growth and for survival. The stress-inducible genes are classified into two major groups: one of them protects the plant directly against stresses, whereas the other regulates gene expression and signal transduction (Valliyodan and Nguyen, 2006). Because plant tolerance against osmotic stress is a complex multigenic trait, a demand exists for genome wide analysis, including ‘omics’ approaches suitable for uncovering important gene sets involved in this important process (Hirayama and Shinozaki, 2010). After the ‘sequencing era’, genetic information was then available for several non-model plants, including some legume species, a group that exhibits unique features, such as the ability to carry the nodulation process. Nitrogen fixation mediated by nodule activities abolishes the need for external nitrogen sources from fertilizers, while providing the so-called ‘green manuring’ that enriches the soil. Moreover, some legumes, such as soybean, barrel medic and cowpea, are important economic crops that provide humans with food, livestock for feeding purposes, and industry with raw materials (Graham and Vance, 2003). Soybean is an example of a non-model plant with plentiful transcriptome information available. Among available databases, the Genosoja platform connects public and restricted data, providing 60,747 unigenes (Nascimento , this issue). The identification of candidate genes in soybean and barrel medic will provide additional evidence of the response mechanisms for osmotic stresses in Fabaceae, yielding useful information for crop improvement. As osmotic stress cannot be solved solely via remedial land management, tolerant crops - able to maintain cellular turgor and osmotic balance - may contribute significantly to reduce this economic burden. The key to plant engineering for osmotic tolerance lies in the knowledge of the underlying mechanisms of plant adaptive responses (Hariadi ). In the present work the main categories of osmotic stress genes known from A. thaliana were identified in the soybean (Genosoja Project) and barrel medic (M. Truncatula database) transcriptomes through an in silico approach, in order to contribute to a better understanding of the early molecular adaptation to osmotic (drought and salinity) stress in both leguminous plants.

Materials and Methods

In a previous study based on 7,000 Arabidopsis genes, Seki identified 103 coding genes distributed over 27 functional categories (Table 1) whose expression increased more than five times in response to osmotic stress. The protein sequences of these stress-inducible genes were obtained at the RIKEN Arabidopsis Full-Length Clone Database, and used as query sequences.
Table 1

Functional categories procured and respective seed-sequence number. Abbreviation: TF = Transcription Factor.

Functional category# Seed sequenceFunctional category# Seed sequence
bZIP TF1WRKY TF2
Photosynthesis1Osmoprotectant3
Signaling1ZincFinger TF3
Reproductive development1Detoxification enzyme2
Respiration1Cellular metabolism3
DNA nucleus1DREB and ERF TF2
Ferritin1Ethylene biosynthesis2
LEA protein1Cytochrome P4502
MYB TF1Fatty Acid metabolism4
Homeodomain TF1Heat Shock protein2
Membrane protein2Kinase protein2
Senescence-related1Carbohydrate metabolism6
Degradation protein1Plant defense4
Secondary metabolism1Transport protein ion channel carrier4
Water channel protein1Cellular struct. organiz. and biogenesis5
NAC TF2Unknown protein37
Protein phosphatase2
Total103
After this step, a local bank with the retrieved sequences was generated in order to make searches for similar sequences against the Genosoja platform (Nascimento ) and the M. truncatula database (Quackenbush ) using the tBLASTn algorithm (Altschul ) with a cut-off of 1e−05. The results were annotated in other local databank for further analyses and for comparisons among studied organisms and literature information. In view of the different number of seed sequences per category, the results obtained from each category and organism were normalized. The soybean and Medicago genes with unknown function were submitted to the AutoFACT program (Koski ), and annotated according to the data available in the largest functional annotation databanks (KEGG, COG, PFAM, SMART, nr). This step was performed in order to categorize these sequences and assign function to them, based on a comparative analysis.

Results and Discussion

The stress-inducible gene products were classified into two main groups: (I) those that are at the front line of defense, protecting the plant against adverse conditions and (II) those that regulate genic expression and signal transduction in the stress response (Seki ). The first group included proteins that probably act in the protection of plant cells from dehydration, such as the enzymes required for the biosynthesis of various osmoprotectants, LEA proteins, antifreeze proteins, chaperones and detoxification enzymes. The second group included signaling molecules such as transcription factors and protein kinases, among others (Seki ). Twenty-seven categories of these two groups classified according to Seki were analyzed, resulting in 1,088 (soybean) and 1,210 (Medicago) sequences (Table S1, supplementary material). In both genomes the ‘unknown protein’ category was the most representative (Figure 1), with 268 candidates for soybean and 331 for Medicago, followed by ‘cellular structure organization and biogenesis’, ‘plant defense’ and ‘transport protein ion channel carrier’ categories (Figure 1).
Figure 1

Main categories of Group I stress-inducible genes (protective molecules), indicating the number of orthologs identified in Glycine max and Medicago truncatula.

The highest number of sequences for genes with ‘unknown function’ - a very common category in expression essays regarding osmotic stress response in plants – attracting great interest from researchers, since those genes represent a clear source of new candidates for breeding purposes. Previous studies highlighted the importance of analyzing the role of stress-induced genes, not only for a further understanding of the molecular mechanisms of stress tolerance in higher plants, but also for improving crop performance using gene manipulation (Seki ). Osmotic stress greatly affects cells both at the micro (i.e., membrane structure), and at the macro level (i.e. the physiology of the whole plant), with results that reflect the variety of responses involved in the acquisition of tolerance. At the microcellular level, the activation of genes in the categories ‘cellular structure, organization and biogenesis’ (soybean: 62; Medicago: 66) and ‘transport protein ion channel carrier’ (soybean: 64; Medicago: 60) was observed, showing the importance of the maintenance of cellular structures and of the control of ion exchange with the environment. Furthermore, we observed the activation of genes in the category ‘plant defense’ (soybean: 66; Medicago: 60), indicating the presence of a cross-talk process between pathways, a common mechanism in plants under stressful conditions. In addition to stress-specific adaptive responses, plants also share responses that protect them from more than one type of stress (Seki ; DeFalco ; Nuruzzaman ), a response also observed in cowpea, another Fabaceae member (Kido ). Amongst the candidates of the second group of responses, composed of genes involved in signal transduction and regulation of expression (203 in soybean and 190 in Medicago; Figure 2), the category transcription factor (TF) was the most prevalent, representing up to 80% in soybean and 82% in Medicago (Figure 2). The high number of transcription factors suggests that transcriptional regulation is an important mechanism in the signal transduction triggered by osmotic stresses in both legumes.
Figure 2

Percentage of stress-inducible genes (Group II), including cell signaling factors identified in Glycine max and Medicago truncatula.

A surprising result was the absence of a bZIP representative in the soybean database, while in Medicago this category was represented by three candidates (Figure 3). This transcription factor has been identified in many plants and is known to participate in various responsive pathways, including abiotic stress response.
Figure 3

Graphic representation of transcription factors identified in Glycine max and Medicago truncatula.

Among the transcription factors, the DREB/ERF and Zinc-finger families had the highest number of sequences (Figure 3). This result was expected, since from more than 1,600 transcription factors encoded by A. thaliana, 9% are members of the DREB/ERF-like family (Dietz ). Due to the versatility of functions that the zinc finger family may have, as well as the variety of their structural proteins, the obtained result was expected. According to Takatsuji (1998), plants seem to have adopted preexisting prototype zinc-finger motifs, generating new zinc-finger domains to adapt them to various regulatory processes. The zinc finger domain can be present in a number of transcription factors and play critical roles in interactions with other molecules. Mutations in some of the genes coding for zinc-finger proteins have been found to cause profound developmental aberrations or defective responses to environmental cues (Takatsuji, 1998). Zinc finger proteins are required for key cellular processes including transcriptional regulation, development, pathogen defense, and stress responses (Ciftci-Yilmaz and Mittler, 2008). A recent study of rice showed that the C2H2-type zinc finger family alone was represented by 189 members and demonstrated that at least 26 of them respond to different environmental stresses (Agarwal ). Moreover, Gong , in a study on transcriptional regulation in drought-tolerant tomato genotypes, also identified and characterized the zinc-finger family as the main activated group during the drought response. It is important to note that the number of seed-sequences used in the search was different for each category; the ‘unknown protein’ category, for example, was represented by 37 sequences, while the ‘bZIP transcription factor’ category comprised a single sequence. Thus, it was expected that the more abundant orthologous categories would be those obtained through comparative searches with the categories composed of more query sequences. As for the remainder, after normalizing the results, proportionally the most representative categories (7% each) were: ‘water channel proteins’, ‘protein degradation’ and ‘senescence-related’ (Figure 4). Without doubt, all categories analyzed may contribute to an improvement in osmotic tolerance, although some functions are more relevant than others. Proteins associated with ion channels and water channels are essential in the acquisition of resistance in the presence of soluble salts and water shortages, the former controlling the entry and exit of ions such as Na+, which are toxic in high concentrations, and the latter controlling water loss to the environment. Besides these proteins, those falling into the category ‘protein degradation’ are required for protein turnover and recycling of essential amino acids, while ‘senescence-related’ genes are key components in the abiotic stress response, with genes controlling subcellular changes that lead to tolerance (Seki ).
Figure 4

Number of gene candidates from Group I for Medicago truncatula and Glycine max, after data normalization.

While the normalized results evidenced similar amounts of data in the most representative categories for both organisms, in some categories there were significant variations in the number of sequences between both leguminous species (Figure 4); this difference was even greater than 50% for the categories ‘Reproductive development’ (soybean: 1,395; Medicago: 465), ‘Ferritin’ (soybean: 651; Medicago: 1,392), ‘Respiration’ (soybean: 186; Medicago: 1,302) and ‘Ethylene biosynthesis’ (soybean: 791; Medicago: 1.721). Nevertheless, this variation may be related to the conditions under which the data were generated and deposited, as well as to the number of sequences available in the respective databases. Additionally, species-specific features could be responsible for these variations, to a lesser extent. Regarding the category ‘Unknown Protein’, screened candidates from soybean (268) and Medicago (331) were subjected to the AutoFACT program in order to assign function to these sequences, allowing the recognition of the function of 174 and 217 sequences, respectively. As a result, 42 and 57 G. max and M. truncatula were categorized according to the COG (Cluster of Orthologous Groups) functional database in five categories (Table 2; Figure 5). Within each category, the annotation revealed that they present the same description as the matched sequences deposited in the databank. For example, the ‘Amino acid transport and metabolism’ functional category was represented just by ‘Amino Acid Permease’ sequences (Table 2). Two candidates of Medicago, which were functionally classified into the ‘Carbohydrate transport and metabolism’ category, were also annotated on the KEGG database as involved in the beta-galactosidase pathway (Galactose Metabolism Glycan Structure – degradation), (Table 2).
Table 2

Sequence description annotated according to the COG (Cluster of Orthologous Groups) functional category in Glycine max and Medicago truncatula.

COG functional categorySequence descriptionSequence amount
G. maxM. truncatula
Amino acid transport and metabolismAmino acid permease98
Carbohydrate transport and metabolismBeta-galactosidase02
General function prediction onlyPatatin417
Posttranslational modification, protein turnover, chaperonesDnaJ-like protein1413
Signal transduction mechanismsUniversal Stress Protein (USP) family protein1517
Total4257
Figure 5

Categorization of soybean and Medicago ‘unknown category’ candidates based on COG (Cluster of Orthologous Groups) functional database.

The remaining previously ‘unknown’ sequences were annotated as shown in Table 3. The analysis through AutoFACT allowed a function assignment to 132 and 160 soybean and Medicago sequences, respectively. In general, the highest number of sequences was categorized as transcription factors, essential genes participating in the transcriptional regulation of plants. Although it was possible to record more than 65% of the sequences, 35% of ‘unknown’ soybean and 34% of ‘unknown’ Medicago sequences remained without their putative function identified. These are relevant data to be worked out in future functional studies, since they may represent new genes not yet described and unique to legumes.
Table 3

Description of sequences with unknown function after AutoFACT analysis.

DescriptionG. maxM. truncatula
Amino acid permease74
ATP binding / kinase / protein serine/threonine kinase03
Auxin-responsive GH3 product [Glycine max]28
BTB/POZ domain-containing protein02
Calcium ion binding24
Calmodulin binding1014
CCT_2 domain containing protein45
Copper ion binding / electron transporter41
Cu-binding-like domain containing protein410
Dev_Cell_Death domain containing protein917
DFL1 (DWARF IN LIGHT 1)10
DnaJ-like protein [Phaseolus vulgaris]32
F-box family protein54
Heat shock protein binding34
Herpes_BLLF1 domain containing protein10
Hydroxyproline-rich glycoprotein family protein01
IFRD1; interferon-related developmental regulator 180
Indole-3-acetic acid-amido synthetase GH3.17, putative35
NAC Transcription Factor410
Nucleic acid binding / transcription factor1814
Patatin B2 precursor, putative10
PHI-1 (PHOSPHATE-INDUCED 1)1920
Plastocyanin-like domain-containing protein01
RCI2A (RARE-COLD-INDUCIBLE 2A)02
SMC_N multi-domain protein13
SPX domain-containing protein20
Stress-inducible protein02
Tify domain containing protein812
Triacylglycerol lipase55
Uncharacterized protein family/Unassigned protein/Protein of unknown function94114
Universal stress protein (USP) family protein13
Zinc finger family protein74
In conclusion, even in the absence of libraries restricted to osmotic stress in the Genosoja databank, this study indicated that most of the genes involved in the osmotic stress pathways were expressed by the non-stressed soybean and Medicago libraries at least in a baseline way. The data also revealed that soybean and Medicago are a rich source of stress-responsive candidates, which can be also applied to improve soybean and other legumes. It also highlights the existence of significant diversity for most genes, useful for comparative physiological essays. The obtained data are available for gene-targeted functional evaluation using qRT-PCR, as well as other biotechnological approaches. The molecular differences detected between the compared libraries will permit the identification of important candidates by additional approaches including PCR walking, as previously done for other crops (e.g. Coemans ). The identified candidates are also being monitored in further expression assays carried out in the Genosoja project (considering contrasting combinations of tolerant and susceptible plants under drought stress as compared with their negative control in a time frame) providing a more complete picture of genes involved in osmotic stress response and useful for breeding and biotechnological purposes.
  20 in total

Review 1.  Molecular responses to drought, salinity and frost: common and different paths for plant protection.

Authors:  Motoaki Seki; Ayako Kamei; Kazuko Yamaguchi-Shinozaki; Kazuo Shinozaki
Journal:  Curr Opin Biotechnol       Date:  2003-04       Impact factor: 9.740

2.  The TIGR gene indices: reconstruction and representation of expressed gene sequences.

Authors:  J Quackenbush; F Liang; I Holt; G Pertea; J Upton
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 3.  Legumes: importance and constraints to greater use.

Authors:  Peter H Graham; Carroll P Vance
Journal:  Plant Physiol       Date:  2003-03       Impact factor: 8.340

Review 4.  Understanding regulatory networks and engineering for enhanced drought tolerance in plants.

Authors:  Babu Valliyodan; Henry T Nguyen
Journal:  Curr Opin Plant Biol       Date:  2006-02-17       Impact factor: 7.834

Review 5.  Zinc-finger transcription factors in plants.

Authors:  H Takatsuji
Journal:  Cell Mol Life Sci       Date:  1998-06       Impact factor: 9.261

6.  SuperSAGE combined with PCR walking allows global gene expression profiling of banana (Musa acuminata), a non-model organism.

Authors:  Bert Coemans; Hideo Matsumura; Ryohei Terauchi; Serge Remy; Rony Swennen; László Sági
Journal:  Theor Appl Genet       Date:  2005-10-11       Impact factor: 5.699

Review 7.  Molecular genetic perspectives on cross-talk and specificity in abiotic stress signalling in plants.

Authors:  Viswanathan Chinnusamy; Karen Schumaker; Jian-Kang Zhu
Journal:  J Exp Bot       Date:  2003-12-12       Impact factor: 6.992

8.  Genome-wide identification of C2H2 zinc-finger gene family in rice and their phylogeny and expression analysis.

Authors:  Pinky Agarwal; Rita Arora; Swatismita Ray; Ashok K Singh; Vijay P Singh; Hiroshi Takatsuji; Sanjay Kapoor; Akhilesh K Tyagi
Journal:  Plant Mol Biol       Date:  2007-07-04       Impact factor: 4.076

Review 9.  The zinc finger network of plants.

Authors:  S Ciftci-Yilmaz; R Mittler
Journal:  Cell Mol Life Sci       Date:  2008-04       Impact factor: 9.261

10.  AutoFACT: an automatic functional annotation and classification tool.

Authors:  Liisa B Koski; Michael W Gray; B Franz Lang; Gertraud Burger
Journal:  BMC Bioinformatics       Date:  2005-06-16       Impact factor: 3.169

View more
  3 in total

Review 1.  Abscisic-acid-dependent basic leucine zipper (bZIP) transcription factors in plant abiotic stress.

Authors:  Aditya Banerjee; Aryadeep Roychoudhury
Journal:  Protoplasma       Date:  2015-12-15       Impact factor: 3.356

2.  A toolbox of genes, proteins, metabolites and promoters for improving drought tolerance in soybean includes the metabolite coumestrol and stomatal development genes.

Authors:  Prateek Tripathi; Roel C Rabara; R Neil Reese; Marissa A Miller; Jai S Rohila; Senthil Subramanian; Qingxi J Shen; Dominique Morandi; Heike Bücking; Vladimir Shulaev; Paul J Rushton
Journal:  BMC Genomics       Date:  2016-02-09       Impact factor: 3.969

Review 3.  Virus-induced gene silencing is a versatile tool for unraveling the functional relevance of multiple abiotic-stress-responsive genes in crop plants.

Authors:  Venkategowda Ramegowda; Kirankumar S Mysore; Muthappa Senthil-Kumar
Journal:  Front Plant Sci       Date:  2014-07-08       Impact factor: 5.753

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.