Literature DB >> 29493034

Crystal structure of secretory abundant heat soluble protein 4 from one of the toughest "water bears" micro-animals Ramazzottius Varieornatus.

Yohta Fukuda1, Tsuyoshi Inoue1.   

Abstract

Though anhydrobiotic tardigrades (micro-animals also known as water bears) possess many genes of secretory abundant heat soluble (SAHS) proteins unique to Tardigrada, their functions are unknown. A previous crystallographic study revealed that a SAHS protein (RvSAHS1) from one of the toughest tardigrades, Ramazzottius varieornatus, has a β-barrel architecture similar to fatty acid binding proteins (FABPs) and two putative ligand binding sites (LBS1 and LBS2) where fatty acids can bind. However, some SAHS proteins such as RvSAHS4 have different sets of amino acid residues at LBS1 and LBS2, implying that they prefer other ligands and have different functions. Here RvSAHS4 was crystallized and analyzed under a condition similar to that for RvSAHS1. There was no electron density corresponding to a fatty acid at LBS1 of RvSAHS4, where a putative fatty acid was observed in RvSAHS1. Instead, LBS2 of RvSAHS4, which was composed of uncharged residues, captured a putative polyethylene glycol molecule. These results suggest that RvSAHS4 mainly uses LBS2 for the binding of uncharged molecules.
© 2018 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.

Entities:  

Keywords:  X-ray crystallography; anhydrobiosis; secretory abundant heat soluble protein; tardigrades

Mesh:

Substances:

Year:  2018        PMID: 29493034      PMCID: PMC5916119          DOI: 10.1002/pro.3393

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


Introduction

Water is indispensable for all living things. Therefore, severe loss of water results in death for almost all organisms. However, some species of tardigrades, or water bears, can survive extremely desiccated conditions by stopping metabolic processes.1, 2 This state is called anhydrobiosis. In association with loss of water, anhydrobiotic tardigrades form structures called “tun”, which show a high tolerance to desiccation. They can restart metabolic processes once their tuns are given water.3, 4 The tun is also tolerant to very high or low temperature,5, 6 exposure to radiation,7, 8, 9 vacuum,10, 11 high pressure,12, 13 and toxic compounds.14, 15 This resistance of tardigrades is further highlighted by an experiment in which two tardigrades in dehydrated states could survive in the vacuum of outer space for 10 days.16 Several research groups have recently launched genomics17, 18, 19, 20, 21, 22 and molecular biological studies23, 24, 25, 26 to reveal the molecular basis for anhydrobiosis of tardigrades. Some proteins discovered in these studies are thought to be keys to anhydrobiosis because they have not been found in phyla other than Tardigrada.23, 25 Secretory abundant heat soluble (SAHS) protein is one of them and is constantly expressed at high levels in Ramazzottius varieornatus.21 This tardigrade can enter an anhydrobiotic state in a shorter time than other species such as Hypsibius dujardini.24 , 27 Although the expression levels of SAHS proteins are low in active H. dujardani, they are significantly increased when the tardigrade undergoes anhydrobiosis.21 These findings suggest that SAHS proteins play an important role in stress tolerances; however, their functions are unknown. Moreover, it has been recently reported that an anhydrobiotic tardigrade, Milnesium tardigradum, does not have SAHS proteins,22 making the role of SAHS proteins more enigmatic. One of SAHS proteins from R. varieornatus (RvSAHS1) is secreted into the culture medium when it is expressed in human cells; therefore, SAHS proteins are thought to protect extracellular components and/or secretory organelles on anhydrobiosis.23 A crystal structure of RvSAHS1 revealed that it has a β‐barrel structure resembling fatty acid binding proteins (FABPs).28 Residues found in the β‐barrel of RvSAHS1 were bulky and hydrophilic, while smaller and/or hydrophobic residues are assembled in FABPs. Some FABPs conserve tyrosine and arginine at the binding site for fatty acids (FAs) and RvSAHS1 also has Arg161 and Tyr163, which are superimposed on conserved tyrosine and arginine in FABPs. Moreover, electron density that can be interpreted as an FA molecule is observed near these residues and this site is designated as ligand binding site 1 (LBS1). The analysis of RvSAHS1 also showed electron density that can be interpreted as acetate at the putative ligand binding site 2 (LBS2). The sequence alignment indicates that residues located around LBS1 and LBS2 are conserved among many SAHS proteins [Fig. 1(A)]. However, several SAHS proteins such as RvSAHS4 have different sets of residues around LBS1 and LBS2, which implies differences in functions. Here we analyzed a crystal structure of RvSAHS4 to compare it with RvSAHS1.
Figure 1

Structure of RvSAHS4. (A) Amino acid sequence alignment. Rv and Hd means R. varieornatus and H. dujardini, respectively. Similar residues are written by red characters and boxed. Red and blue squares under the sequences show residues conserved at LBS1 and LBS2 of SAHS proteins, respectively. A purple square shows the position of K150 in RvSAHS1, which is a part of both LBS1 and LBS2. (B) RvSAHS4 molecules (MolA and MolB) in the asymmetric unit. (C) Comparison of RvSAHS4 with RvSAHS1. MolA and MolB of RvSAHS4 are shown by cyan and yellow cartoons. RvSAHS1 is shown by a white cartoon.

Structure of RvSAHS4. (A) Amino acid sequence alignment. Rv and Hd means R. varieornatus and H. dujardini, respectively. Similar residues are written by red characters and boxed. Red and blue squares under the sequences show residues conserved at LBS1 and LBS2 of SAHS proteins, respectively. A purple square shows the position of K150 in RvSAHS1, which is a part of both LBS1 and LBS2. (B) RvSAHS4 molecules (MolA and MolB) in the asymmetric unit. (C) Comparison of RvSAHS4 with RvSAHS1. MolA and MolB of RvSAHS4 are shown by cyan and yellow cartoons. RvSAHS1 is shown by a white cartoon.

Results and Discussion

The unit cell of the RvSAHS4 crystal contained two RvSAHS4 molecules (MolA and MolB) [Fig. 1(B)]. The RvSAHS4 structure was refined at 1.5 Å resolution. The overall structure of RvSAHS4 shared a typical FABP fold having an antiparallel β‐barrel composed of 10 β‐strands, and a helix‐turn‐helix lid between βA and βB. Because main chain atoms could not form hydrogen bonds between βD and βE, there was a gap as is found in RvSAHS1 and FABPs.28 The N‐terminal region of the adjacent molecule was inserted into this gap in the crystal structure [Fig. 1(B)]. The structure of RvSAHS4 was well superimposed on that of RvSAHS1. Helix αII in the RvSAHS4 structure had only a single turn and shorter than those in FABPs as is observed in RvSAHS1 structures. However, MolB in the RvSAHS4 structure had a 310 helix (η1) following αII and MolA also had a helix‐like structure in the same region [Fig. 1(C)]. Consequently, RvSAHS4 had a longer helical region than that in RvSAHS1. The αII‐η1 and following loop region showed higher diversity in amino acid sequences as compared to other regions [Fig. 1(A)], and tended to have proline, glycine, and tyrosine residues, which often inhibit the formation of α helix.29 B‐factor values showed that the lid region of RvSAHS4 was highly flexible (Fig. S1, Supporting Information), as is observed in RvSAHS1.28 The variety of residues and the flexibility in the lid structure may be related to ligand binding. As was predicted by alignment of amino acid sequences, the interior of RvSAHS4 included residues distinct from those found in RvSAHS1 [Fig. 2(A, B)]. Superimposition of the RvSAHS4 structure on the RvSAHS1 structure revealed that although Arg and Tyr, key residues for the binding of FAs at LBS1 in RvSAHS1, were conserved in RvSAHS4 (Arg164 and Tyr166), Tyr65 positioning on η1 could inhibit the binding of FAs at LBS1 [Fig. 2(B)]. Moreover, residues giving positive charges to LBS1 in RvSAHS1 (His72 and Lys150), which can stabilize negative charges of FA molecules, were replaced by Gln153 and Leu73 in RvSAHS4. While LBS1 in RvSAHS1 possesses an endogenous FA molecule originating from Escherichia coli in the crystal structure, no electron density corresponding to an FA molecule was observed around LBS1 in RvSAHS4. Because RvSAHS4 was expressed and purified by the same methods as those for RvSAHS1 and crystallized under a condition (Materials and Methods section) similar to that for RvSAHS1 [100 mM HEPES pH 7.8, 150 mM MgCl2, 1 mM ZnSO4, 21% v/v polyethylene glycol (PEG) 600, and 5% v/v 1‐butanol or 3% v/v 2‐propanol at 20°C],28 the absence of the FA molecule indicates that LBS1 of RvSAHS4 has lower affinity to FAs than that of RvSAHS1. To use LBS1 in RvSAHS4, a dramatic structural change in the helix‐turn‐helix lid may be needed, by which Tyr65 is moved away from LBS1. Residues located at LBS2 in RvSAHS4 were also distinct from those in RvSAHS1. LBS2 in RvSAHS4 was constituted by Cys120, Val142, Tyr144, and Gln153, which respectively correspond to Tyr117, Asp139, Tyr141, and Lys150 in RvSAHS1 [Fig. 2(B)]. Moreover, Tyr75 and Thr129 were located around LBS2 where Leu74 and Ala126 are, respectively, located in RvSAHS1. Cys120 was specific to RvSAHS4 [Fig. 1(A)] and could form a hydrogen bond with Thr129 [Fig. 2(C)]. Slender electron density was observed near Thr129 and Tyr75. Hydrophobic compounds such as FAs, bile acids, and retinoid, which are ligands for FABP family proteins,30 could not fit well into the electron density. Therefore, we modeled short PEG molecules contained in the crystallization solution (triethylene glycol in MolA and tetraethylene glycol in MolB) [Fig. 2(C), S2, Supporting Information]. Although PEG molecules seemed to be the best assignment, we do not exclude the possibility that there was another compound such as a disordered FA molecule. The terminal oxygen atom of the PEG molecules could form hydrogen bonds with Thr129, Tyr75, and a nearby water molecule. Other oxygen atoms in the PEG molecules could form hydrogen bonds with water molecules in the β‐barrel. Our observation that LBS1 in RvSAHS4 is blocked by Tyr65 and that PEG molecules were found at LBS2 suggests that RvSAHS4 mainly uses LBS2 to capture ligands. Because neither positively nor negatively charged residues were positioned at LBS2, it may be used to capture uncharged ligands such as alcohols and aldehydes. RvSAHS4 contained several water molecules stabilized by hydrogen bonds around the bottom of the β‐barrel [Fig. 2(C), S2, Supporting Information]. In contrast, there are only one or two water molecules around the β‐barrel bottom of RvSAHS1.28 This is because hydrophobic residues in RvSAHS4 (Leu73, Val86, and Phe155) were substituted by residues forming hydrogen bonds with each other in RvSAHS1 (His72, His83, and Tyr152). A hypothesis suggests that RvSAHS1 having the hydrogen bond network being independent of water is stable even under dehydrated conditions.28 In line with this thinking, RvSAHS4 containing more water molecules might be more vulnerable to desiccation than RvSAHS1, implying that roles of SAHS proteins may not be limited to responses to dehydration stresses.
Figure 2

Residues in the β‐barrel of RvSAHS4. (A) LBS1 and LBS2 in RvSAHS4. MolB is shown here. (B) Comparison of LBSs between RvSAHS4 (cyan) and RvSAHS1 (white). Names of residues at LBS1 and LBS2 of RvSAHS4 are boxed by red and blue squares, respectively. A purple square shows the position corresponding to Lys150 in RvSAHS1, which is a part of both LBS1 and LBS2. A putative heptanoic acid molecule found in RvSAHS1 is shown by gray sticks. (C) A tetraethylene glycol molecule in MolB. Water molecules are represented by small red spheres. Carbon atoms of tetraethylene glycol are colored by magenta. 2mF o‐DF c map (1.0σ) and mF o‐DF c omit map (3.0σ) are illustrated by gray and green meshes, respectively. Possible hydrogen bonds are represented by dashed yellow lines.

Residues in the β‐barrel of RvSAHS4. (A) LBS1 and LBS2 in RvSAHS4. MolB is shown here. (B) Comparison of LBSs between RvSAHS4 (cyan) and RvSAHS1 (white). Names of residues at LBS1 and LBS2 of RvSAHS4 are boxed by red and blue squares, respectively. A purple square shows the position corresponding to Lys150 in RvSAHS1, which is a part of both LBS1 and LBS2. A putative heptanoic acid molecule found in RvSAHS1 is shown by gray sticks. (C) A tetraethylene glycol molecule in MolB. Water molecules are represented by small red spheres. Carbon atoms of tetraethylene glycol are colored by magenta. 2mF o‐DF c map (1.0σ) and mF o‐DF c omit map (3.0σ) are illustrated by gray and green meshes, respectively. Possible hydrogen bonds are represented by dashed yellow lines.

Materials and Methods

Protein expression and purification

A synthetic gene of RvSAHS4 (RvSAHS428–171) was purchased from GenScript and subcloned into the pET‐28a(+) vector (Novagen). A 6× His tag and a TEV protease site (H6‐ENLYFQS) was fused at the N‐terminus of the RvSAHS4 sequence. Therefore, the purified protein contained an extra serine residue at its N‐terminus (The full DNA and amino acid sequences are shown in the text of Supporting Information). The resulting plasmid was used to transform E. coli strain BL21 Star(DE3). The cells were cultivated in Luria‐Bertani medium supplemented with 100 µg/mL kanamycin at 37°C to an optical density at 600 nm of 0.6. Expression of RvSAHS4 was induced with 0.8 mM isopropyl β‐d‐1‐thiogalactopyranoside, and culturing continued for further 19 h at 20°C. Collected cells were sonicated in a lysis buffer (20 mM TrisHCl pH 8.0, 200 mM NaCl, and 5 mM imidazole). After the lysate was centrifuged, the supernatant was purified by a HiTrap TALON column (GE Healthcare). Fractions containing RvSAHS4 were collected and dialyzed with TEV protease against a buffer (20 mM TrisHCl pH 8.0, 200 mM NaCl). The dialysate was further purified by a HisTrap Excel column (GE Healthcare) and a HiLoad 16/60 superdex 75 column (GE Healthcare).

Crystallization

Crystallization was performed by the hanging drop vapor‐diffusion method. RvSAHS4 crystals appeared within 1 week in the presence of 17 mg/mL protein, 50 mM HEPES pH 7.8, 1 mM ZnSO4, and 24% v/v PEG 600 at 20°C. The reservoir solution (400 μL) was poured into 0.5 mL sample cups (Sanplatec), and 1 μL protein solution was mixed with 1 μL reservoir solution on a siliconized cover glass plates. A single plate crystal was peeled off from a petal‐like cluster. Before the crystal was frozen by liquid nitrogen, it was soaked in the crystallization solution supplemented by 20% v/v ethylene glycol.

Data collection, processing, structure solution, and refinement

X‐ray diffraction experiments were performed on the BL44XU beamline of SPring‐8, Hyogo, Japan, at 100 K using a MX300‐HE detector (Rayonix). The dataset was processed using HKL2000.31 The phases were determined by the molecular replacement method with a monomer of RvSAHS1 (PDB code ID: 5XNA) as a search model using Phaser.32 Manual model building was performed using Coot.33 The program Refmac534 implemented in the CCP4 suite35 and phenix.refine36 were used for refinement. The final model was checked by MolProbity.37 Data collection and processing statistics are summarized in Table 1. The diffraction dataset has been deposited in Integrated Resource for Reproducibility in Macromolecular Crystallography (https://proteindiffraction.org/). The atomic coordinate and structure factor have been deposited in the Protein Data Bank (PDB code ID: 5Z4G).
Table 1

Data Collection and Refinement Statistics

RvSAHS4
Data collection
Wavelength (Å)0.90000
Space group P1
a, b, c (Å)32.8, 41.9, 53.5
α, β, γ (°)89.97, 80.75, 78.76
Resolution range (Å)50.0–1.50 (1.53–1.50)a
No. of unique reflections42,396 (2,099)
Completeness (%)95.9 (95.5)
Redundancy2.0 (2.0)
I/σ(I)23.1 (2.9)
R merge (%)3.6 (26.0)
R r.i.m. (%)5.1 (36.8)
CC1/2 /CC* b (0.818)/(0.949)
Refinement
Bragg spacings (Å)50.0–1.50 (1.53–1.50)
Completeness (%)95.8
No. of reflections40,319
R work/R free (%)15.9/20.3
No. of non‐H atoms
Protein2543
Water310
Other27
R.m.s. deviations
Bonds (Å)0.007
Angles (°)1.224
Average B factors (Å2)
Overall23.8
Protein23.4
Water34.7
Other30.6
Ramachandran plot
Favored (%)100
Disallowed (%)0
PDB code ID5Z4G

Data in the parenthesis was calculated based on the highest resolution shell.

CC1/2 and CC* are Pearson correlation coefficient of two half data sets and an estimate of the true CC value, respectively. CC* is calculated from the following equation: CC* = [2CC1/2/(1 + CC1/2)]1/2.

Data Collection and Refinement Statistics Data in the parenthesis was calculated based on the highest resolution shell. CC1/2 and CC* are Pearson correlation coefficient of two half data sets and an estimate of the true CC value, respectively. CC* is calculated from the following equation: CC* = [2CC1/2/(1 + CC1/2)]1/2.

Sequence alignment

Amino acid sequence alignment was performed by Clustal Omega.38 The figure of sequence alignment was generated by ESpript.39 Supporting Information Click here for additional data file.
  28 in total

Review 1.  The fatty acid transport function of fatty acid-binding proteins.

Authors:  J Storch; A E Thumser
Journal:  Biochim Biophys Acta       Date:  2000-06-26

Review 2.  Survival in extreme environments - on the current knowledge of adaptations in tardigrades.

Authors:  N Møbjerg; K A Halberg; A Jørgensen; D Persson; M Bjørn; H Ramløv; R M Kristensen
Journal:  Acta Physiol (Oxf)       Date:  2011-03-22       Impact factor: 6.311

3.  Structural insights into a secretory abundant heat-soluble protein from an anhydrobiotic tardigrade, Ramazzottius varieornatus.

Authors:  Yohta Fukuda; Yoshimasa Miura; Eiichi Mizohata; Tsuyoshi Inoue
Journal:  FEBS Lett       Date:  2017-08-08       Impact factor: 4.124

4.  Features and development of Coot.

Authors:  P Emsley; B Lohkamp; W G Scott; K Cowtan
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2010-03-24

5.  Evidence for extensive horizontal gene transfer from the draft genome of a tardigrade.

Authors:  Thomas C Boothby; Jennifer R Tenlen; Frank W Smith; Jeremy R Wang; Kiera A Patanella; Erin Osborne Nishimura; Sophia C Tintori; Qing Li; Corbin D Jones; Mark Yandell; David N Messina; Jarret Glasscock; Bob Goldstein
Journal:  Proc Natl Acad Sci U S A       Date:  2015-11-23       Impact factor: 11.205

6.  No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini.

Authors:  Georgios Koutsovoulos; Sujai Kumar; Dominik R Laetsch; Lewis Stevens; Jennifer Daub; Claire Conlon; Habib Maroon; Fran Thomas; Aziz A Aboobaker; Mark Blaxter
Journal:  Proc Natl Acad Sci U S A       Date:  2016-03-24       Impact factor: 11.205

7.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors:  Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal:  Mol Syst Biol       Date:  2011-10-11       Impact factor: 11.429

8.  Comparative genomics of the tardigrades Hypsibius dujardini and Ramazzottius varieornatus.

Authors:  Yuki Yoshida; Georgios Koutsovoulos; Dominik R Laetsch; Lewis Stevens; Sujai Kumar; Daiki D Horikawa; Kyoko Ishino; Shiori Komine; Takekazu Kunieda; Masaru Tomita; Mark Blaxter; Kazuharu Arakawa
Journal:  PLoS Biol       Date:  2017-07-27       Impact factor: 8.029

9.  MolProbity: all-atom structure validation for macromolecular crystallography.

Authors:  Vincent B Chen; W Bryan Arendall; Jeffrey J Headd; Daniel A Keedy; Robert M Immormino; Gary J Kapral; Laura W Murray; Jane S Richardson; David C Richardson
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2009-12-21

10.  Phaser crystallographic software.

Authors:  Airlie J McCoy; Ralf W Grosse-Kunstleve; Paul D Adams; Martyn D Winn; Laurent C Storoni; Randy J Read
Journal:  J Appl Crystallogr       Date:  2007-07-13       Impact factor: 3.304

View more
  3 in total

1.  Structure of cytochrome b5 unique to tardigrades.

Authors:  Yohta Fukuda; JeeEun Kim; Tsuyoshi Inoue
Journal:  Protein Sci       Date:  2020-08       Impact factor: 6.725

Review 2.  The biology of tardigrade disordered proteins in extreme stress tolerance.

Authors:  Cherie Hesgrove; Thomas C Boothby
Journal:  Cell Commun Signal       Date:  2020-11-04       Impact factor: 5.712

3.  Structural insights into a C2 domain protein specifically found in tardigrades.

Authors:  Yohta Fukuda; Tsuyoshi Inoue
Journal:  Protein Sci       Date:  2020-12-28       Impact factor: 6.993

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.