Literature DB >> 28321577

Hiding in plain sight: the F segment and other conserved features of seed plant SKn dehydrins.

G Richard Strimbeck1.   

Abstract

MAIN
CONCLUSION: An 11-residue amino acid sequence, DRGLFDFLGKK, is highly conserved in a subset of dehydrins found across the full spectrum of seed plants and here given the name F-segment. An 11-residue amino acid sequence, DRGLFDFLGKK, is highly conserved in identity and polarity in 130 non-redundant dehydrin sequences representing conifers and all major angiosperm groups. This newly described motif is here given the name F segment based on the pair of hydrophobic F residues at the core of the sequence. The majority of dehydrins previously classified as SKn dehydrins contain one F segment N terminal to the S and K segments and can accordingly be reclassified as FSKn dehydrins. A cysteine-containing variant, GCGMFDFLKK, occurs in a few rosid and asterid taxa. The S segment in this and other dehydrin types also includes previously overlooked conserved features, including a KLHR prefix and charged or G residues within and following the characteristic string of S residues. Secondary structure prediction models indicate that the F segment and S segment prefix may form amphipathic helices that could be involved in membrane or protein binding.

Entities:  

Keywords:  LEA protein; Localization; Membrane binding; Phosphorylation; Sequence conservation

Mesh:

Substances:

Year:  2017        PMID: 28321577      PMCID: PMC5393156          DOI: 10.1007/s00425-017-2679-7

Source DB:  PubMed          Journal:  Planta        ISSN: 0032-0935            Impact factor:   4.116


Introduction

Dehydrins are a family of land plant proteins that may be expressed constitutively at low levels but are often produced de novo or at higher levels in response to drought, low temperature, or other stresses. They have been detected by western blotting or nucleic acid sequencing in all types of land plants, including bryophytes, pterophytes, gymnosperms, and all major groups of angiosperms. The main structural features of dehydrins were first described over 20 years ago (Close 1996), and include the number and modular arrangement of three types of short, distinctive segments in the protein, designated K, Y, and S segments. Using the YSK shorthand nomenclature proposed by Close, most known dehydrins fall into five types: Kn, SKn, KnS, YnKn, and YnSKn,. With a very few exceptions (noted below) the common feature of all dehydrins is one or more repeats of the K segment, a lysine-rich, 15 amino acid sequence with a highly conserved pattern of charged and nonpolar residues. The consensus K segment sequence in angiosperms is EKKDIMGKIKEKLPG, with a generally similar pattern conserved in conifer variants (Jarvis et al. 1996; Perdiguero et al. 2014). The identity or polarity of all K segment residues is highly conserved. The K segment forms an amphipathic α-helix in nonpolar environments and binds membranes (Koag et al. 2003, 2009; Eriksson et al. 2011), suggesting that a primary function of dehydrins is to protect membranes and perhaps proteins against dehydration stress. The Y segment comprises seven residues with the consensus sequence (V/T)DEYGNP. When present, usually one to three closely spaced copies of the Y segment are located towards the N terminal end of the protein relative to the S or K segments. A group of four Cornus sericea dehydrins contain from 14 to 35 recognizable Y segments (Sarnighausen et al. 2004), and there are a few dehydrins with four. Unlike the K and Y segments, as a rule the S segment occurs only once in any one dehydrin. It is recognized as a series of from three to nine consecutive serine residues, often interrupted by a single, usually polar residue, often D or E, after the first S. It is located N-terminal to the first K segment or at the C-terminus. Dehydrins with S segments have been found in all major seed plant taxa and the moss Physcomitrella patens. These segment types are bracketed and separated by unconserved regions, sometimes called phi segments. These can vary in length from a few to over 100 residues, and are rich in glycine and polar residues, so that dehydrins as a group are highly hydrophilic. These basic features are found in the several hundred complete dehydrin sequences in online databases, and have been discussed and reaffirmed in numerous review articles on dehydrins (Close 1996; Allagulova et al. 2003; Rorat 2006; Eriksson and Harryson 2011). There are only a few known exceptions to the five dehydrin groups described by Close (1996), for example a K4SK2 dehydrin in Picea abies (GenBank accession ABS58630.1), an SK3S variant in Stellaria longipes (CAA79709.1), and a Y2KY2KY dehyrdrin in Juglans regia (AGJ94410.1). Also only few proteins with recognizable Y or S segments and sequence similarity to other dehydrins but with truncated or completely lacking K segments are described. These include a group of conifer proteins (Perdiguero et al. 2014), a Y14S protein in C. sericea (Sarnighausen et al. 2004), and complete ORF sequences with terminal S segments but no clearly identifiable K segments in the Prunus persica (XP 007202808) and Solanum lycopersicum (XP 010317810.1) genomes. Perdiguero et al. (2012) recently reported on two additional conserved segments in Pinaceae SKn dehydrins. While compiling a library of dehydrin amino acid sequences, I noticed a short, palindromic sequence in many SKn-type dehydrins: GLFDFLG, located near the N-terminal end of the protein. Further inspection suggested a longer conserved sequence with the preliminary 11 residue consensus DRGLFDFLGKK. Here I report that the majority of SKn dehydrins contain a single copy of this segment, which I have named the F segment, and can thus be designated FSKn dehydrins. I decribe its taxonomic breadth and sequence conservation and variation. I also note additional sequence conservation around the S segment in both FSKn and YnSKn dehydrins.

Materials and methods

To compile a library of dehydrin sequences, I searched the NCBI data for sequences identified as dehydrins and conducted BLAST searches on the angiosperm and conifer consensus K segment sequences. The results were screened to identify and remove sequences lacking core dehydrin characteristics, allelic variants, and highly similar homologs within taxa. Genera such as Pinus, Vitis, and Solanum are overrepresented in NCBI by registration of dozens of allelic variants and nearly identical homologous sequences in closely related taxa. Based on the number and occurrence of K, Y, and S segments, the resulting 395 sequences were classified into the five types described by Close, with the few exceptions noted. It was in this process that I first noticed the F segment. After a BLAST search in the NCBI database for proteins containing variations of the 11-residue sequence DRGLFDFLGKK, I screened the results as above to remove non-dehydrins and redundant variants within taxa. Sequences were aligned using Clustal Omega and the alignments checked and adjusted manually. To explore sequence variation in and around the conserved segment, I isolated and aligned a 32-residue sequence centered on it and tallied the amino acids of each type found at each position. To explore variation in and around the S segment, all S-segment containing dehydrins from the library were extracted. I excerpted a 35-residue segment centered on the consecutive S residues. These excerpts were manually aligned using the first S residue, recognizable variants of a commonly occurring HR prefix, and the first occurrence of a charged residue, usually D or E, following the S residues. Alignment of the latter was forced by inserting blanks in sequences with fewer than the maximum number of nine S residues. Following alignment, I tallied the amino acids at each position for each of the four dehydrin types (including FSKn) that incorporate S segments. The eight-structure class protein secondary structure prediction model SSpro8 (Magnan and Baldi 2014) was run on a selection of full-length FSKn dehydrin sequences to assess the potential for helical or sheet structures in and around the F and S segments.

Results and discussion

Frequency of the F-segment in dehydrins

In the initial screening and classification of 395 minimally redundant dehydrin sequences recovered from NCBI, 93 were classified as SKn type dehydrins based on the presence of S and K but no Y segments (Table 1). I found F segments in 82 of these, indicating that most known SKn type dehydrins can be reclassified as the FSKn type. There were only nine FKn type dehydrins in the sample, and only one with more than one copy of the F segment, an F3SK2 dehydrin in Rhododendron (AGI36547.1). There were no sequences with both Y and F segments. An additional 75 sequences that were incomplete at the N terminal end could potentially be any of a number of types including FKn or FSKn type dehydrins. About 25% of the 72 complete gymnosperm dehydrin sequences in the library contained F segments, but none contained Y segments (Table 1). The absence of Y segments in gymnosperms was noted by Perdiguero et al. (2012, 2014), but not in earlier reviews of dehydrin occurrence, structure, and function (e.g. Close 1996; Allagulova et al. 2003; Rorat 2006; Eriksson and Harryson 2011). There were also no complete gymnosperm SKn or KnS dehydrins in the curated library, so that the available data indicates that in gymnosperms the S segment is always preceded by an F segment.
Table 1

Distribution of dehydrin types in a curated library of 395 minimally redundant dehydrins retrieved from NCBI

Dhn typeAngiospermGymnospermTotal% of total
?Kn + ?SKn 37387519.0
Kn 279369.1
KnS250256.3
SKn 110112.8
FKn 3692.3
FSKn 65178220.8
YnKn 370379.4
YnSKn 113011328.6
Uncommon types5271.8
Total32372395

“?” indicates incomplete N terminal sequences lacking F and Y segments

Distribution of dehydrin types in a curated library of 395 minimally redundant dehydrins retrieved from NCBI “?” indicates incomplete N terminal sequences lacking F and Y segments

Conservation variation in the F segment

The BLAST search of the NCBI database for proteins containing variations of the 11-residue sequence DRGLFDFLGKK yielded an initial library of about 575 complete and partial sequences, about 300 of which were readily recognized as dehydrins based on the presence of one or more recognizable variants of the K segment. After screening out redundant sequences, I arrived at a list of 130 minimally redundant proteins representing a broad spectrum of seed plant groups, including conifers as representative of gymnosperms and a variety of groups within the monocot, caryophyllid, rosid, and asterid clades of angiosperms. 123 of these also contained an S segment. The 11-residue DRGLFDFLGKK sequence specified in the BLAST search is well-conserved across all 130 sequences, with the core 6-residue sequence GLFDFL nearly invariant in identity or polarity in all variants across the taxonomic spectrum (Fig. 1). The G following this core sequence is deleted in about 40% of the sequences, and there is an apparent insertion of a second hydrophobic residue after the conserved F/L in position 8 in Poaceae. In a subset of 12 rosid sequences and two asterid sequences, the initial DR is replaced by GC and the second G is omitted, giving the consensus GCGMFDFLKK in this variant. The conservation of a rare and metabolically expensive cysteine residue suggests some additional functionality associated with this couplet. The terminal K residues, typically followed by three to five more charged residues, can be interpreted as general features of dehydrins but may also contribute to the functionality of the segment. Similarly, a well-conserved E residue at position −3 may contribute some functionality.
Fig. 1

Alignment, conservation, and variation in 32-residue aligned sequence excerpts from selected seed plant taxa centered on the F segment in FSKn dehydrins. The “i” in the numbered residue positions in the top row indicates a presumed insertion in Poaceae. Residues are ordered and colored by Kyte-Doolittle hydrophobicity (Lefranc 2017). The F segment is outlined in black

Alignment, conservation, and variation in 32-residue aligned sequence excerpts from selected seed plant taxa centered on the F segment in FSKn dehydrins. The “i” in the numbered residue positions in the top row indicates a presumed insertion in Poaceae. Residues are ordered and colored by Kyte-Doolittle hydrophobicity (Lefranc 2017). The F segment is outlined in black The F segment seems to have gone unnoticed over the ca. 25 years of investigation into dehydrin structure and function. To give credit where it is due, Perdiguero et al. (2012) noted a 23-residue N-terminal sequence that includes the F segment and appeared to be conserved in conifers and, with considerable indel variation, in a few angiosperms. My analysis indicates that the core F segment is highly conserved in seed plants, and I propose that it should be recognized as a fourth functional segment type in the dehydrin family.

Conserved features of the S segment

The S segment has been previously characterized as a string of three to as many as 13 serine residues, and typically followed by a string of charged K, D, or E residues (Eriksson and Harryson 2011). In KnS dehydrins it is located at the C-terminus of the protein. In other types it is N-terminal to the first K segment and in between it and F or Y segments where these are present, as indicated by nomenclature. Additional conserved features around the S segment emerge on closer examination. In dehydrins with S segments on the N terminal side, the first S in the sequence is typically followed by a D, E, or G residue, which is then followed by two to as many as 12 additional serine residues. In KnS dehydrins, this is mirrored by a highly conserved terminal DSD motif, which, in the 25 KnS sequences in my curated sample, always forms the C-terminus of the protein. In FSKn dehydrins, there is a well-conserved KLHR prefix (Fig. 2), and the C terminal S is followed by 2–7 acidic residues, giving the generalized sequence KLHRS(D/E)S2–12 (D/E)2–7, followed by a string of predominantly G and charged residues. The prefix is somewhat less conserved in YnSKn and SKn dehydrins, where the initial K is not conserved and a polar residue may replace H. KnS dehydrin S segments have a more variable mix of charged, H, and G residues as a prefix.
Fig. 2

Sequence conservation and variation in S segments from 82 FSKn dehydrins. p polar, h hydrophobic, plus or minus sign charged, minus sign acidic, plus sign basic, asterisk no consensus

Sequence conservation and variation in S segments from 82 FSKn dehydrins. p polar, h hydrophobic, plus or minus sign charged, minus sign acidic, plus sign basic, asterisk no consensus

Structural modeling

Predicted structures from SSpro8 (Magnan and Baldi 2014) for a selection of FSKn dehydrins consistently suggest a short helical region centered on the five residue core sequence LFDFL in the consensus F segment and the GCG variant, including those where the D is replaced by a G residue. In a helical wheel projection of this short sequence, the four hydrophobic residues are arrayed on one side of the helix, with the charged D or somewhat hydrophilic G residue opposing them. This observation suggests that the F segment could have amphiphilic membrane or protein binding properties similar to those of the K segment. The same model also consistently predicts helical regions N terminal to the S segment. The conserved pattern of hydrophobic residues in this region (Fig. 2) could result in an amphiphilic structure here as well. All K segments are also consistently helical in the SSpro8 predictions.

Conservation implies function

The high degree of conservation in the K, Y, F, and S segments in these otherwise highly variable and unstructured proteins implies that all four segment types play important roles in the overall function of the protein. Functional studies show that that the K segment binds membranes and therefore suggest that the main function of dehydrins is membrane protection (Koag et al. 2003, 2009). Membrane binding may be modulated by phosphorylation of S and pH-dependent dissociation of H residues (Eriksson et al. 2011), suggesting a function for the conserved H and S residues in the S segment. The functions of the Y segment remain a mystery. Close et al. (1993) found some similarity between the Y segment and a nucleotide binding site in bacterial chaperones, an observation that has been echoed in some reviews (e.g. Allagulova et al. 2003; Rorat 2006). That similarity is not at all obvious on inspection of the sequences in the original report (Martin et al. 1993), and in any case there seems to have been no follow-up on this assertion. Whatever its function, gymnosperms are able to survive in a wide range of environments, including extremely dry or cold conditions, without any apparent need for the Y segment. As noted above, the F segment may form a short, amphipathic helix with membrane or protein binding properties, and the S segment prefix may also have amphipathic properties. Sucrose, raffinose, and various compatible solutes have been hypothesized to stabilize (Carpenter and Crowe 1988) or replace (Clegg 1985) hydration shells around proteins or membranes; perhaps dehydrins play a similar role. Alternatively, binding by K segments or other amphipathic regions could anchor dehydrins to membranes or proteins so that they could act as “molecular spacers” (Strimbeck et al. 2015), preventing close approach and conformational or phase changes associated with repulsive forces under dehydration (Wolfe and Bryant 1999). Protein modification and binding studies (e.g. Koag et al. 2003, 2009; Eriksson et al. 2011) or other protein modification experiments may help clarify the binding and related protective properties of the F segment and other conserved regions of the different dehydrin types.

Author contribution statement

GRS conducted all of the original analyses reported in this article.
  14 in total

Review 1.  Freezing, drying, and/or vitrification of membrane- solute-water systems.

Authors:  J Wolfe; G Bryant
Journal:  Cryobiology       Date:  1999-09       Impact factor: 2.487

2.  IMGT, the international ImMunoGeneTics information system, http://imgt.cines.fr: the reference in immunoinformatics.

Authors:  Marie-Paule Lefranc; Véronique Giudicelli; Chantal Ginestoux; Denys Chaume
Journal:  Stud Health Technol Inform       Date:  2003

3.  A view of plant dehydrins using antibodies specific to the carboxy terminal peptide.

Authors:  T J Close; R D Fenton; F Moonan
Journal:  Plant Mol Biol       Date:  1993-10       Impact factor: 4.076

4.  Tunable membrane binding of the intrinsically disordered dehydrin Lti30, a cold-induced plant stress protein.

Authors:  Sylvia K Eriksson; Michael Kutzer; Jan Procek; Gerhard Gröbner; Pia Harryson
Journal:  Plant Cell       Date:  2011-06-10       Impact factor: 11.277

5.  SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity.

Authors:  Christophe N Magnan; Pierre Baldi
Journal:  Bioinformatics       Date:  2014-05-24       Impact factor: 6.937

6.  The mechanism of cryoprotection of proteins by solutes.

Authors:  J F Carpenter; J H Crowe
Journal:  Cryobiology       Date:  1988-06       Impact factor: 2.487

7.  The K-segment of maize DHN1 mediates binding to anionic phospholipid vesicles and concomitant structural changes.

Authors:  Myong-Chul Koag; Stephan Wilkens; Raymond D Fenton; Josh Resnik; Evanly Vo; Timothy J Close
Journal:  Plant Physiol       Date:  2009-05-13       Impact factor: 8.340

8.  The binding of maize DHN1 to lipid vesicles. Gain of structure and lipid specificity.

Authors:  Myong-Chul Koag; Raymond D Fenton; Stephan Wilkens; Timothy J Close
Journal:  Plant Physiol       Date:  2003-01       Impact factor: 8.340

9.  Novel dehydrins lacking complete K-segments in Pinaceae. The exception rather than the rule.

Authors:  Pedro Perdiguero; Carmen Collada; Alvaro Soto
Journal:  Front Plant Sci       Date:  2014-12-02       Impact factor: 5.753

Review 10.  Extreme low temperature tolerance in woody plants.

Authors:  G Richard Strimbeck; Paul G Schaberg; Carl G Fossdal; Wolfgang P Schröder; Trygve D Kjellsen
Journal:  Front Plant Sci       Date:  2015-10-19       Impact factor: 5.753

View more
  12 in total

1.  Effect of an Intrinsically Disordered Plant Stress Protein on the Properties of Water.

Authors:  Luisa A Ferreira; Alicyia Walczyk Mooradally; Boris Zaslavsky; Vladimir N Uversky; Steffen P Graether
Journal:  Biophys J       Date:  2018-09-22       Impact factor: 4.033

2.  Dissecting the Genomic Diversification of Late Embryogenesis Abundant (LEA) Protein Gene Families in Plants.

Authors:  Mariana Aline Silva Artur; Tao Zhao; Wilco Ligterink; Eric Schranz; Henk W M Hilhorst
Journal:  Genome Biol Evol       Date:  2019-02-01       Impact factor: 3.416

3.  Identification and Characterization of Five Cold Stress-Related Rhododendron Dehydrin Genes: Spotlight on a FSK-Type Dehydrin With Multiple F-Segments.

Authors:  Hui Wei; Yongfu Yang; Michael E Himmel; Melvin P Tucker; Shi-You Ding; Shihui Yang; Rajeev Arora
Journal:  Front Bioeng Biotechnol       Date:  2019-02-21

4.  Evolution of the modular, disordered stress proteins known as dehydrins.

Authors:  Andrew C Riley; Daniel A Ashlock; Steffen P Graether
Journal:  PLoS One       Date:  2019-02-06       Impact factor: 3.240

5.  Comparative Genomics, Evolution, and Drought-Induced Expression of Dehydrin Genes in Model Brachypodium Grasses.

Authors:  Maria Angeles Decena; Sergio Gálvez-Rojas; Federico Agostini; Ruben Sancho; Bruno Contreras-Moreira; David L Des Marais; Pilar Hernandez; Pilar Catalán
Journal:  Plants (Basel)       Date:  2021-12-03

6.  Evolutionary analysis of angiosperm dehydrin gene family reveals three orthologues groups associated to specific protein domains.

Authors:  Alejandra E Melgar; Alicia M Zelada
Journal:  Sci Rep       Date:  2021-12-13       Impact factor: 4.379

7.  The Halophyte Dehydrin Sequence Landscape.

Authors:  Siwar Ghanmi; Steffen P Graether; Moez Hanin
Journal:  Biomolecules       Date:  2022-02-19

Review 8.  ICE-CBF-COR Signaling Cascade and Its Regulation in Plants Responding to Cold Stress.

Authors:  Delight Hwarari; Yuanlin Guan; Baseer Ahmad; Ali Movahedi; Tian Min; Zhaodong Hao; Ye Lu; Jinhui Chen; Liming Yang
Journal:  Int J Mol Sci       Date:  2022-01-28       Impact factor: 5.923

Review 9.  Plant Group II LEA Proteins: Intrinsically Disordered Structure for Multiple Functions in Response to Environmental Stresses.

Authors:  Mughair Abdul Aziz; Miloofer Sabeem; Sangeeta Kutty Mullath; Faical Brini; Khaled Masmoudi
Journal:  Biomolecules       Date:  2021-11-09

10.  Expression, Purification, and Preliminary Protection Study of Dehydrin PicW1 From the Biomass of Picea wilsonii.

Authors:  Junhua Liu; Mei Dai; Jiangtao Li; Yitong Zhang; Yangjie Ren; Jichen Xu; Wei Gao; Sujuan Guo
Journal:  Front Bioeng Biotechnol       Date:  2022-04-05
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.