Ultraviolet-protective compounds, such as mycosporine-like amino acids (MAAs) and related gadusols produced by some bacteria, fungi, algae, and marine invertebrates, are critical for the survival of reef-building corals and other marine organisms exposed to high-solar irradiance. These compounds have also been found in marine fish, where their accumulation is thought to be of dietary or symbiont origin. In this study, we report the unexpected discovery that fish can synthesize gadusol de novo and that the analogous pathways are also present in amphibians, reptiles, and birds. Furthermore, we demonstrate that engineered yeast containing the fish genes can produce and secrete gadusol. The discovery of the gadusol pathway in vertebrates provides a platform for understanding its role in these animals, and the possibility of engineering yeast to efficiently produce a natural sunscreen and antioxidant presents an avenue for its large-scale production for possible use in pharmaceuticals and cosmetics.
Ultraviolet-protective compounds, such as mycosporine-like amino acids (MAAs) and related gadusols produced by some bacteria, fungi, algae, and marine invertebrates, are critical for the survival of reef-building corals and other marine organisms exposed to high-solar irradiance. These compounds have also been found in marine fish, where their accumulation is thought to be of dietary or symbiont origin. In this study, we report the unexpected discovery that fish can synthesize gadusol de novo and that the analogous pathways are also present in amphibians, reptiles, and birds. Furthermore, we demonstrate that engineered yeast containing the fish genes can produce and secrete gadusol. The discovery of the gadusol pathway in vertebrates provides a platform for understanding its role in these animals, and the possibility of engineering yeast to efficiently produce a natural sunscreen and antioxidant presents an avenue for its large-scale production for possible use in pharmaceuticals and cosmetics.
Entities:
Keywords:
2-epi-5-epi-valiolone synthase; E. coli; S. cerevisiae; biochemistry; ecology; gadusol; mycosporine; sunscreen; vertebrates; zebrafish
The sunscreen compounds, mycosporine-like amino acids (MAAs) and related gadusols, commonly found in bacteria, fungi, algae, and marine invertebrates (Shick and Dunlap, 2002; Miyamoto et al., 2014), have been proposed to fulfill a variety of functions, such as sunscreen, antioxidant, stress response, intracellular nitrogen reservoir, and/or optical filter (Gao and Garcia-Pichel, 2011; Bok et al., 2014). Although their formation had long been proposed to originate from the shikimate pathway, more recent bioinformatic and biochemical studies revealed that in cyanobacteria, MAAs are synthesized by desmethyl-4-deoxygadusol synthase (DDGS), a dehydroquinate synthase (DHQS) homolog (Wu et al., 2007; Balskus and Walsh, 2010; Singh et al., 2010; Asamizu et al., 2012). Interestingly, inactivation of the DDGS gene in Anabaena variabilis ATCC 29413 did not abolish the production of MAAs, suggesting that additional pathways exist for the biosynthesis of MAAs (Spence et al., 2012). DDGS converts sedoheptulose 7-phosphate (SH7P) to desmethyl-4-deoxygadusol via a unique sequence of dephosphorylation, aldol condensation, enolization, dehydration, reduction, and tautomerization reactions (Figure 1—figure supplement 1) (Balskus and Walsh, 2010; Asamizu et al., 2012). The product is subsequently converted by a methyltransferase to 4-deoxygadusol, the building block of MAAs.
Figure 1—figure supplement 1.
Biosynthetic pathway to shinorine in Anabaena variabilis (Balskus and Walsh, 2010).
DOI:
http://dx.doi.org/10.7554/eLife.05919.005
4-Deoxygadusol has also been proposed to be the precursor of gadusol (Starcevic et al., 2010; Rosic and Dove, 2011), a related compound initially isolated from cod roe (Gadus morhua L.) (Plack et al., 1981), but also found in roes of other marine fish (Plack et al., 1981; Arbeloa et al., 2010), sea urchin eggs (Chioccara et al., 1986), cysts and nauplii of brine shrimp (Grant et al., 1985), mantis shrimp crystalline cones (Bok et al., 2014), and sponges (Bandaranayake et al., 1997). As genes responsible for the production of 4-deoxygadusol and MAAs are commonly found in bacteria (e.g., cyanobacteria), algae, and other marine microorganisms (Shick and Dunlap, 2002; Miyamoto et al., 2014), the accumulation of these compounds in marine animals has been proposed to be of dietary or symbiont origin (Arbeloa et al., 2010; Gao and Garcia-Pichel, 2011; Loew, 2014). On the other hand, a gene cluster like that in cyanobacteria apparently encoding a four-step DDGS-based pathway for converting SH7P to MAAs was discovered in the genomes of a coral (Acropora digitifera) and sea anemone (Nematostella vectensis), suggesting that these invertebrates can produce MAAs autonomously (Rosic and Dove, 2011; Shinzato et al., 2011).DDGS is a member of the sugarphosphate cyclase (SPC) superfamily (Wu et al., 2007). In addition to DHQS, this superfamily includes four enzymes known for their roles in the biosynthesis of natural products with therapeutic application: 2-epi-5-epi-valiolone synthase (EEVS), 2-epi-valiolone synthase, aminoDHQS, and 2-deoxy-scyllo-inosose synthase (DOIS) (Wu et al., 2007; Mahmud, 2009; Asamizu et al., 2012; Kang et al., 2012). EEVS catalyzes the entry step to the biosynthesis of pseudosugar-containing natural products, such as the antidiabetic drug acarbose and the crop protectant validamycin A, and has so far only been identified and characterized in bacteria (Mahmud, 2009).Recently, we surprisingly also found genes that encode EEVS-like proteins (annotated as ‘PREDICTED: pentafunctional AROM polypeptide-like’) in the genomes of fish, amphibians, reptiles, and birds (Figure 1A, Figure 1—figure supplement 2, Table 1, Figure 1—source data 1). This gene is located in a cluster with another functionally unknown gene (labeled as MT-Ox hereafter) and flanked by a suite of transcription factor genes encoding FRMD4B, MitF, MDFIC, and FoxP1 (Figure 1B). These transcription factors have been known to regulate the expression of genes with essential roles in cell differentiation, proliferation, and survival (Levy et al., 2006; Rice et al., 2012; Garner et al., 2014; Zhao et al., 2015). However, whether they have a direct functional relationship with the EEVS and MT-Ox genes remains an open question. Whereas humans and other mammals also have FRMD4B, MitF, MDFIC, and FoxP1 homologs, they lack the EEVS-like and the MT-Ox genes (Figure 1B). Therefore, the presence of the EEVS-like gene, previously only found in bacteria, in the genomes of fish, and other egg-laying vertebrates is puzzling and evolutionarily intriguing. Here, we report bioinformatics and functional studies of zebrafish (Danio rerio) EEVS-like and MT-Ox genes and the expression of this two-enzyme pathway in yeast, and discuss the evolutionary origin of this pathway in fish, amphibians, reptiles, and birds.
Figure 1.
Bioinformatic analysis of sugar phosphate cyclases in prokaryotes and eukaryotes and biochemical characterization of gadusol biosynthetic enzymes.
(A) Bayesian phylogenetic tree of SPCs. Numbers represent posterior probability. The stramenopile Aureococcus anophagefferens, denoted by the blue star, has EEVS and MT-Ox proteins strikingly similar (over 50% identical) to those in vertebrates. The micro algae Coccomyxa subellipsoidea, denoted by the red star, also has EEVS and MT-Ox. (B) Genetic organizations of EEVS and MT-Ox genes in fish, amphibians, birds, and reptiles. Humans and other mammals lack these genes (indicated in dashed red box). For a complete list of vertebrates whose genomes are known to contain EEVS and MT-Ox genes, see Table 1. FRMD4B, FERM domain-containing protein 4B; MitF, microphtalmia-associated transcription factor; MDFIC, MyoD-family inhibitor domain-containing protein-like; and FoxP1, Forkhead-related transcription factor 1. (C) Biochemical characterization of recombinant LOC100003999 and zgc:113054 proteins. (D) WebLogo (Crooks et al., 2004) images of residue conservation patterns at the three metal ligand positions and two active site fingerprint sites known (Kean et al., 2014) to distinguish bacterial EEVSs from DDGSs. The residue numbers given correspond to the reference proteins D. rerio EEVS, ValA, and A. variabilis DDGS, respectively. WebLogos were based on 126 vertebrate, 63 bacterial EEVS, and 160 bacterial DDGS sequences, respectively, that had BLASTP E-values <10−120 in searches using the reference proteins noted above as queries. Each group was aligned using ProMals (Pei and Grishin, 2007).
DOI:
http://dx.doi.org/10.7554/eLife.05919.003
DOI:
http://dx.doi.org/10.7554/eLife.05919.004
DOI:
http://dx.doi.org/10.7554/eLife.05919.005
Numbers represent bootstrap confidence values. Blue star denotes the stramenopile A. anophagefferens, and red star denotes the micro algae C. subellipsoidea.
DOI:
http://dx.doi.org/10.7554/eLife.05919.006
(A) SDS PAGE of purified recombinant ValA, LOC100003999, and zgc:113054, and E. coli cell free extracts containing the enzymes. (B) TLC analysis of enzymatic products of purified ValA and purified LOC100003999. (C) TLC analysis of EEV and the zgc:113054 product (gadusol) using FeCl3 solution as a spraying agent. Only gadusol, but not EEV, can be detected as a maroon spot.
DOI:
http://dx.doi.org/10.7554/eLife.05919.007
(A) SH7P with E. coli cell free extracts containing LOC100003999. (B) SH7P with purified ValA. (C) Authentic EEV. Samples were converted to their trimethylsilyl derivatives using Tri-Sil HTP (Thermo Scientific).
DOI:
http://dx.doi.org/10.7554/eLife.05919.008
Alignment of the sequence of the only structurally known EEVS–ValA from Streptomyces hygroscopicus (ShEEVS; PDB entry 4P53) with representative related enzymes. The family consensus secondary structure elements shown schematically shown above the sequences and within each sequence the residues involved in secondary structures are color coded: β-strands (yellow), α-helices (teal), 310 helices (blue), and π helices (orange). ValA is shown first, and other sequences in order are DrEEVS (Danio rerio EEVS), CsEEVS (C. subellipsoidea EEVS), AaEEVS (A. anophagefferens EEVS), AvDDGS (A. variabilis DDGS; Ava_3858), AmEVS (Actinosynnema mirum EVS; Amir_2000), PDB entry 1DQS (Aspergillus nidulans DHQS), and PDB entry 2D2X (Bacillus circulans DOIS). The 14 ‘fingerprint’ active site positions identified by Kean et al. (Kean et al., 2014) are indicated by an asterisk (*) below the sequences. Above the sequences are indicated a subset of those residues that ligate the catalytic metal (m) and another subset with notable variation between the family members that have different types of activity (↓). The DrEEVS sequence and those of all known vertebrate homologs (not shown) match ShEEVS at all fourteen active site positions. The putative algal EEVS enzymes, AaEEVS and CsEEVS, sequences match the EEVS residues at the positions thought to be important for distinguishing the EEVS activity, but respectively differ at one and two positions; these positions are the ones that are of unknown importance for EEVS activity.
DOI:
http://dx.doi.org/10.7554/eLife.05919.009
Shown are the 13 active-site residues conserved in all EEVS enzymes (pale purple carbons), the NAD+ (gray carbons), and the Zn2+ atom (silver sphere with black coordination bonds), along with a mesh that delineates the pocket suitable for binding an SH7P substrate. Residue numbers in LOC100003999 identifying twelve of the active site residues are shown; the third metal ligand, H366, is behind the zinc and is not labeled. For the two positions that differ between EEVS and DDGS, the residue type present in DDGS is shown in green in parentheses. Figure made with Pymol (Schrödinger L. The PyMOL Molecular Graphics System, Version 1.3.) using the coordinates of ValA, an EEVS from S. hygroscopicus subsp. jinggangensis 5008 (Kean et al., 2014).
DOI:
http://dx.doi.org/10.7554/eLife.05919.010
Figure 1—figure supplement 2.
Maximum likelihood phylogenetic tree of sugar phosphate cyclases.
Numbers represent bootstrap confidence values. Blue star denotes the stramenopile A. anophagefferens, and red star denotes the micro algae C. subellipsoidea.
DOI:
http://dx.doi.org/10.7554/eLife.05919.006
Table 1.
Genetic analysis of EEVS and MT-Ox genes in some invertebrates and chordates
DOI:
http://dx.doi.org/10.7554/eLife.05919.011
Class
Species
Common name
FRMD4B
MitF
EEVS
MT-Ox
MDFIC
FoxP1
Cnidaria
Acropora digitifera
Coral*
–
–
–
–
–
–
Nematostella vectensis
Starlet sea anemone*
√
√
–
–
–
√
Hydra vulgaris
Common hydra
–
√
–
–
–
√
Invertebrate chordates
Ciona intestinalis
Vase tunicate
√
√
–
–
–
√
Branchiostoma floridae
Florida lancelet
√
√
–
–
–
√
Agnatha
Petromyzon marinus
Lamprey†
‡
√
‡
‡
‡
√
Chondrichthyes
Callorhinchus milii
Australian ghostshark†
√
√
‡
‡
√
√
Osteichthyes (Actinopterygii)
Astyanax mexicanus
Mexican tetra
√
√
√
√
√
√
Danio rerio
Zebra fish
√
√
√
√
√
√
Dicentrarchus labrax
European seabass
√
√
√
√
√
√
Fundulus heteroclitus
Mummichog
√
√
√
√
√
√
Gadus morhua
Atlantic cod
√
√
√
√
√
√
Gasterosteus aculeatus
Three-spined stickleback
√
√
√
√
√
√
Lepisosteus oculatus
Spotted gar
√
√
√
√
√
√
Maylandia zebra
Zebra mbuna
√
√
√
√
√
√
Neolamprologus brichardi
African cichlid
√
√
√
√
√
√
Oncorhynchus mykiss
Rainbow trout
√
√
√
√
√
√
Orechromis niloticus
Nile tilapia
√
√
√
√
√
√
Oryzias latipes
Japanese rice fish
√
√
√
√
√
√
Poecilia formosa
Amazon molly
√
√
√
√
√
√
Pundamilia nyererei
African cichlid
√
√
√
√
√
√
Salmo salar
Atlantic salmon
√
√
√
√
√
√
Takifugu rubripes
Japanese puffer
√
√
–
–
–
√
Tetraodon nigroviridis
Green spotted puffer
√
√
–
–
–
√
Xiphophorus maculatus
Southern platyfish
√
√
√
√
√
√
Sarcopterygii
Latimeria chalumnae
West African Coelacanth
√
√
–
–
√
√
Amphibia
Xenopus tropicalis
Western clawed frog
√
√
√
√
√
√
Reptilia
Alligator mississippiensis
American alligator
√
√
√
√
√
√
Anolis carolinensis
Green anole
√
√
√
√
√
√
Chelonia mydas
Green sea turtle
√
√
√
√
√
√
Chrysemys picta bellii
Western painted turtle
√
√
√
√
√
√
Ophiophagus hannah
King cobra†
√
√
‡
‡
√
√
Pelodiscus sinensis
Chinese softshell turtle
√
√
√
√
√
√
Python bivittatus
Burmese python†
√
√
‡
‡
√
√
Aves
Anas platyrhynchos
Mallard
√
√
√
√
√
√
Columba livia
Rock dove
√
√
√
√
√
√
Falco cherrug
Saker falcon
√
√
√
√
√
√
Ficedula albicollis
Collard flycatcher
√
√
√
√
√
√
Gallus gallus
Chicken
√
√
√
√
√
√
Geospiza fortis
Medium ground-finch
√
√
√
√
√
√
Meleagris gallopavo
North American wild turkey
√
√
√
√
√
√
Melopsittacus undulates
Common pet parakeet
√
√
√
√
√
√
Pseudopodoces humilis
Tibetan ground-tit
√
√
√
√
√
√
Taeniopygia guttata
Zebra finch
√
√
√
√
‡
√
Zonotrichia albicollis
White-throated sparrow
√
√
√
√
√
√
Mammalia
Balaenoptera acutorostrata scammoni
Minke whale
√
√
–
–
–
√
Bos taurus
Cattle
√
√
–
–
–
√
Callithrix jacchus
White-tufted-ear marmoset
√
√
–
–
–
√
Capra hircus
Goat
√
√
–
–
–
√
Ceratotherium simum simum
Southern white rhinoceros
√
√
–
–
–
√
Chinchilla lanigera
Long-tailed chinchilla
√
√
–
–
√
√
Chlorocebus sabaeus
Green monkey
√
√
–
–
√
√
Cricetulus griseus
Chinese hamster
√
√
–
–
√
√
Eptesicus fuscus
Big brown bat
√
√
–
–
–
√
Equus caballus
Horse
√
√
–
–
–
√
Erinaceus europaeus
Western European hedgehog
√
√
–
–
√
√
Felis catus
Cat
√
√
–
–
–
√
Gorilla gorilla
Western gorilla
√
√
–
–
–
√
Heterocephalus glaber
Naked mole-rat
√
√
–
–
√
√
Homo sapiens
Human
√
√
–
–
–
√
Lipotes vexillifer
Yangtze river dolphin
√
√
–
–
–
√
Loxodonta africana
African savanna elephant
√
√
–
–
–
√
Macaca fascicularis
Crab-eating macaque
√
√
–
–
√
√
Mesocricetus auratus
Golden hamster
√
√
–
–
√
√
Monodelphis domestica
Gray short-tailed opossum
√
√
–
–
√
√
Mus musculus
House mouse
√
√
–
–
√
√
Mustela putorius furo
Domestic Ferret
√
√
–
–
–
√
Myotis davidii
Mouse-eared bat
√
√
–
–
–
√
Myotis lucifugus
Little-brown bat
√
√
–
–
√
√
Nomascus leucogenys
Northern white-cheeked gibbon
√
√
–
–
–
√
Odobenus rosmarus divergens
Pacific walrus
√
√
–
–
√
√
Orcinus orca
Killer whale
√
√
–
–
–
√
Ornithorhynchus anatinus
Duck-billed platypus†
√
√
–
–
–
√
Orycteropus afer
Ardvark
√
√
–
–
–
√
Otolemur garnettii
Small-eared galago
√
√
–
–
–
√
Ovis aries
Sheep
√
√
–
–
–
√
Pan troglodytes
Chimpanzee
√
√
–
–
–
√
Papio anubis
Olive baboon
√
√
–
–
–
√
Peromyscus maniculatus bairdii
Prairie deer mouse
√
√
–
–
√
√
Pteropus alecto
Black flying fox
√
√
–
–
–
√
Rattus norvegicus
Norway rat
√
√
–
–
–
√
Sarcophilus harrisii
Tasmanian devil
√
√
–
–
√
√
Trichechus manatus latirostris
Florida manatee
√
√
–
–
–
√
Vicugna pacos
Alpaca
√
√
–
–
–
√
Harbors genes for MAA biosynthesis.
Incomplete genome sequence; no EEVS and MT-Ox genes identified.
The presence of EEVS or MT-Ox genes is unknown due to missing contigs or sequence information.
Bioinformatic analysis of sugar phosphate cyclases in prokaryotes and eukaryotes and biochemical characterization of gadusol biosynthetic enzymes.
(A) Bayesian phylogenetic tree of SPCs. Numbers represent posterior probability. The stramenopile Aureococcus anophagefferens, denoted by the blue star, has EEVS and MT-Ox proteins strikingly similar (over 50% identical) to those in vertebrates. The micro algaeCoccomyxa subellipsoidea, denoted by the red star, also has EEVS and MT-Ox. (B) Genetic organizations of EEVS and MT-Ox genes in fish, amphibians, birds, and reptiles. Humans and other mammals lack these genes (indicated in dashed red box). For a complete list of vertebrates whose genomes are known to contain EEVS and MT-Ox genes, see Table 1. FRMD4B, FERM domain-containing protein 4B; MitF, microphtalmia-associated transcription factor; MDFIC, MyoD-family inhibitor domain-containing protein-like; and FoxP1, Forkhead-related transcription factor 1. (C) Biochemical characterization of recombinant LOC100003999 and zgc:113054 proteins. (D) WebLogo (Crooks et al., 2004) images of residue conservation patterns at the three metal ligand positions and two active site fingerprint sites known (Kean et al., 2014) to distinguish bacterial EEVSs from DDGSs. The residue numbers given correspond to the reference proteins D. rerioEEVS, ValA, and A. variabilisDDGS, respectively. WebLogos were based on 126 vertebrate, 63 bacterial EEVS, and 160 bacterial DDGS sequences, respectively, that had BLASTP E-values <10−120 in searches using the reference proteins noted above as queries. Each group was aligned using ProMals (Pei and Grishin, 2007).DOI:
http://dx.doi.org/10.7554/eLife.05919.003
Amino acid sequences of the Sugar Phosphate Cyclases (SPCs).
DOI:
http://dx.doi.org/10.7554/eLife.05919.004
Biosynthetic pathway to shinorine in Anabaena variabilis (Balskus and Walsh, 2010).
DOI:
http://dx.doi.org/10.7554/eLife.05919.005
Maximum likelihood phylogenetic tree of sugar phosphate cyclases.
Numbers represent bootstrap confidence values. Blue star denotes the stramenopile A. anophagefferens, and red star denotes the micro algaeC. subellipsoidea.DOI:
http://dx.doi.org/10.7554/eLife.05919.006
SDS PAGE of EEVS and MT-Ox proteins and thin-layer chromatography (TLC) analysis of their products.
(A) SDS PAGE of purified recombinant ValA, LOC100003999, and zgc:113054, and E. coli cell free extracts containing the enzymes. (B) TLC analysis of enzymatic products of purified ValA and purified LOC100003999. (C) TLC analysis of EEV and the zgc:113054 product (gadusol) using FeCl3 solution as a spraying agent. Only gadusol, but not EEV, can be detected as a maroon spot.DOI:
http://dx.doi.org/10.7554/eLife.05919.007
GC-MS analysis of LOC100003999 and ValA reaction products.
(A) SH7P with E. coli cell free extracts containing LOC100003999. (B) SH7P with purified ValA. (C) Authentic EEV. Samples were converted to their trimethylsilyl derivatives using Tri-Sil HTP (Thermo Scientific).DOI:
http://dx.doi.org/10.7554/eLife.05919.008
Sequences of the vertebrate clade of EEVS-related proteins have characteristic features of an EEVS.
Alignment of the sequence of the only structurally known EEVS–ValA from Streptomyces hygroscopicus (ShEEVS; PDB entry 4P53) with representative related enzymes. The family consensus secondary structure elements shown schematically shown above the sequences and within each sequence the residues involved in secondary structures are color coded: β-strands (yellow), α-helices (teal), 310 helices (blue), and π helices (orange). ValA is shown first, and other sequences in order are DrEEVS (Danio rerioEEVS), CsEEVS (C. subellipsoideaEEVS), AaEEVS (A. anophagefferensEEVS), AvDDGS (A. variabilisDDGS; Ava_3858), AmEVS (Actinosynnema mirum EVS; Amir_2000), PDB entry 1DQS (Aspergillus nidulansDHQS), and PDB entry 2D2X (Bacillus circulans DOIS). The 14 ‘fingerprint’ active site positions identified by Kean et al. (Kean et al., 2014) are indicated by an asterisk (*) below the sequences. Above the sequences are indicated a subset of those residues that ligate the catalytic metal (m) and another subset with notable variation between the family members that have different types of activity (↓). The DrEEVS sequence and those of all known vertebrate homologs (not shown) match ShEEVS at all fourteen active site positions. The putative algal EEVS enzymes, AaEEVS and CsEEVS, sequences match the EEVS residues at the positions thought to be important for distinguishing the EEVS activity, but respectively differ at one and two positions; these positions are the ones that are of unknown importance for EEVS activity.DOI:
http://dx.doi.org/10.7554/eLife.05919.009
Stereoview for the modeled active site geometry of LOC100003999.
Shown are the 13 active-site residues conserved in all EEVS enzymes (pale purple carbons), the NAD+ (gray carbons), and the Zn2+ atom (silver sphere with black coordination bonds), along with a mesh that delineates the pocket suitable for binding an SH7P substrate. Residue numbers in LOC100003999 identifying twelve of the active site residues are shown; the third metal ligand, H366, is behind the zinc and is not labeled. For the two positions that differ between EEVS and DDGS, the residue type present in DDGS is shown in green in parentheses. Figure made with Pymol (Schrödinger L. The PyMOL Molecular Graphics System, Version 1.3.) using the coordinates of ValA, an EEVS from S. hygroscopicus subsp. jinggangensis 5008 (Kean et al., 2014).DOI:
http://dx.doi.org/10.7554/eLife.05919.010Genetic analysis of EEVS and MT-Ox genes in some invertebrates and chordatesDOI:
http://dx.doi.org/10.7554/eLife.05919.011Harbors genes for MAA biosynthesis.Incomplete genome sequence; no EEVS and MT-Ox genes identified.The presence of EEVS or MT-Ox genes is unknown due to missing contigs or sequence information.
Results and discussion
To investigate the function of the vertebrate EEVS-like genes, the protein encoded by the zebrafishEEVS-like gene (LOC100003999) was expressed in Escherichia coli. Incubation of the recombinant protein with SH7P gave a product, which was confirmed by thin-layer chromatography (TLC) and GC-MS to be EEV (Figure 1C, Figure 1—figure supplements 3, 4), identifying the protein as an EEVS. The best-characterized bacterial EEVS is ValA from the validamycin pathway in Streptomyces hygroscopicus subsp. jinggangensis 5008 (Bai et al., 2006), and the crystal structure of ValA (Protein Data Bank [PDB] entry 4P53) (Kean et al., 2014), allowed identification of a fingerprint set of 14 active-site residues with characteristic variations that could differentiate the various SH7P cyclases. Further supporting the assignment of the LOC100003999-encoded protein as an EEVS, sequence comparisons show that all animal EEVS-like proteins are highly similar (60–72% identity) and also match the sequence of ValA at all 14 fingerprint sites (Figure 1D, Figure 1—figure supplements 5, 6). This firmly establishes the presence of EEVS activity in animals.The second gene, MT-Ox (zgc:113054), is predicted to encode a protein that contains two domains: the N-terminal domain is similar to S-adenosylmethionine (SAM)–dependent methyltransferases, and the C-terminal domain is similar to NAD+-dependent oxidoreductases. Although its function in zebrafish is unknown, the transcription of this gene in larvae is upregulated by light, leading to a prediction of its involvement in circadian clock regulation (Weger et al., 2011). In contrast, we hypothesized that this bifunctional protein is involved in modifying EEV to yield an oxidized and methylated product (Figure 1C). To test this hypothesis, recombinant MT-Ox protein encoded by zgc:113054 was incubated with EEV in the presence of SAM and NAD+. Following incubation, a product with λmax of 294 nm (pH 7) and 270 (pH 2.5) was detected (Figure 2A–B). Further analysis of the product by (−)-ESI-MS (m/z 203 [M-H]−) and 1H NMR confirmed its identity as gadusol (Figure 2—figure supplements 1, 2). We postulate that the conversion of EEV to gadusol by MT-Ox takes place via oxidation of the C-2 or C-3 OH, followed by enolization and methylation of the resulting C-2 OH (Figure 2—figure supplement 3). This new pathway to the UV-absorbing vinylogous acid functional group shared by gadusol and 4-deoxygadusol is distinct from that used in the biosynthesis of the MAAs (Balskus and Walsh, 2010; Spence et al., 2012), and the existence of multiple pathways for generating this scaffold is intriguing and may indicate that this is a privileged chemical scaffold in living organisms.
Figure 2.
Production of gadusol in zebrafish and yeast and its sunscreen activity.
(A–B) HPLC traces and UV absorptions of gadusol produced from Escherichia coli cell-free extract containing EEVS and purified MT-Ox protein at pH 7.0 and 2.5. (C–D) Transcription patterns of EEVS and MT-Ox genes during zebrafish embryonic development. qRT-PCR analysis of mRNA isolated from zebrafish embryos (n = 3) at 12, 24, 48, 72, 96, and 120 hpf. (E) Time course of gadusol production in yeast harboring the zebrafish genes. The yeast was cultured in YNB + 2% glucose supplemented with leucine and lysine at 30°C for 2 days, and growth was monitored as A600 values (control, dotted blue line; gadusol producer, solid red line). Gadusol concentration in the supernatant of 20 ml cultures (n = 3) was monitored as A296 values in 50 mM phosphate buffer, pH 7.0 (dashed green line) corrected for non-gadusol background absorbance in the control supernatant, normalized to A600 value. Gadusol was quantified based on an extinction coefficient of 21,800 M−1 cm−1 in 50 mM phosphate buffer, pH 7. (F–H) Comparative HPLC analysis of gadusol from recombinant enzymatic reaction, zebrafish extract, and yeast extract. (I) Gadusol suppresses the UVB sensitivity of a rad1∆ yeast mutant; and (J) Gadusol increases the UVB tolerance of a wild-type (RAD1) strain. Cells suspended in control supernatant (CS) or gadusol+ supernatant (G+S) were irradiated with UVB and subsequently spotted in 3 µl aliquots (n = 4) onto YEPD plates, which were incubated at 30°C for 24 hr.
DOI:
http://dx.doi.org/10.7554/eLife.05919.012
(A) SH7P incubated with E. coli cell free extracts containing LOC100003999 (zebrafish EEVS) and boiled zgc:113054 (MT-Ox). (B) SH7P incubated with E. coli cell free extracts containing EEVS and purified MT-Ox. (C) SH7P incubated with purified ValA (a bacterial EEVS) and boiled MT-Ox. (D) Synthetic EEV incubated with purified MT-Ox. (E) Deuterated EEV incubated with purified MT-Ox. (F) MS/MS spectrum of gadusol obtained from the EEVS and MT-Ox reaction. (G) Proposed ion species of gadusol observed in F.
DOI:
http://dx.doi.org/10.7554/eLife.05919.013
DOI:
http://dx.doi.org/10.7554/eLife.05919.014
The dotted arrows show the three steps thought to be a part of the MT-Ox catalyzed reaction.
DOI:
http://dx.doi.org/10.7554/eLife.05919.015
DOI:
http://dx.doi.org/10.7554/eLife.05919.016
Figure 2—figure supplement 3.
Proposed mode of formation of gadusol in vertebrates.
The dotted arrows show the three steps thought to be a part of the MT-Ox catalyzed reaction.
DOI:
http://dx.doi.org/10.7554/eLife.05919.015
Production of gadusol in zebrafish and yeast and its sunscreen activity.
(A–B) HPLC traces and UV absorptions of gadusol produced from Escherichia coli cell-free extract containing EEVS and purified MT-Ox protein at pH 7.0 and 2.5. (C–D) Transcription patterns of EEVS and MT-Ox genes during zebrafish embryonic development. qRT-PCR analysis of mRNA isolated from zebrafish embryos (n = 3) at 12, 24, 48, 72, 96, and 120 hpf. (E) Time course of gadusol production in yeast harboring the zebrafish genes. The yeast was cultured in YNB + 2% glucose supplemented with leucine and lysine at 30°C for 2 days, and growth was monitored as A600 values (control, dotted blue line; gadusol producer, solid red line). Gadusol concentration in the supernatant of 20 ml cultures (n = 3) was monitored as A296 values in 50 mM phosphate buffer, pH 7.0 (dashed green line) corrected for non-gadusol background absorbance in the control supernatant, normalized to A600 value. Gadusol was quantified based on an extinction coefficient of 21,800 M−1 cm−1 in 50 mM phosphate buffer, pH 7. (F–H) Comparative HPLC analysis of gadusol from recombinant enzymatic reaction, zebrafish extract, and yeast extract. (I) Gadusol suppresses the UVB sensitivity of a rad1∆ yeast mutant; and (J) Gadusol increases the UVB tolerance of a wild-type (RAD1) strain. Cells suspended in control supernatant (CS) or gadusol+ supernatant (G+S) were irradiated with UVB and subsequently spotted in 3 µl aliquots (n = 4) onto YEPD plates, which were incubated at 30°C for 24 hr.DOI:
http://dx.doi.org/10.7554/eLife.05919.012
(−)-ESI-MS analysis of LOC100003999 and zgc:113054 reaction products.
(A) SH7P incubated with E. coli cell free extracts containing LOC100003999 (zebrafishEEVS) and boiled zgc:113054 (MT-Ox). (B) SH7P incubated with E. coli cell free extracts containing EEVS and purified MT-Ox. (C) SH7P incubated with purified ValA (a bacterial EEVS) and boiled MT-Ox. (D) Synthetic EEV incubated with purified MT-Ox. (E) Deuterated EEV incubated with purified MT-Ox. (F) MS/MS spectrum of gadusol obtained from the EEVS and MT-Ox reaction. (G) Proposed ion species of gadusol observed in F.DOI:
http://dx.doi.org/10.7554/eLife.05919.013
1H NMR spectrum of gadusol obtained from E. coli cell free extracts containing LOC100003999 and zgc:113054 reactions.
DOI:
http://dx.doi.org/10.7554/eLife.05919.014
Proposed mode of formation of gadusol in vertebrates.
The dotted arrows show the three steps thought to be a part of the MT-Ox catalyzed reaction.DOI:
http://dx.doi.org/10.7554/eLife.05919.015
The Pentose Phosphate Pathway (Asamizu et al., 2012) and the shunt pathway to gadusol.
DOI:
http://dx.doi.org/10.7554/eLife.05919.016In zebrafish, both of the LOC100003999 and zgc:113054 genes are expressed during embryonic development. qRT-PCR analysis of mRNA isolated from zebrafish embryos at 12, 24, 48, 72, 96, and 120 hpf showed maximal expression at 72 hpf (Figure 2C–D). To demonstrate de novo synthesis of gadusol in zebrafish, the embryos were collected at 72 hpf, lyophilized and extracted with methanol, and the extract was analyzed by HPLC and ESI-MS (Figure 2G). Our finding of gadusol in the extract unambiguously confirms the ability of zebrafish to synthesize gadusol and amends the current perception that gadusol found in fish and other vertebrates is necessarily of dietary or symbiont origin. However, as MAAs are synthesized via a different pathway and there is no evidence that fish have those biosynthetic enzymes, the accumulation of MAAs in fish would still appear to be of dietary origin (Mason et al., 1998; Zamzow, 2004).To show that the recombinant LOC100003999 and zgc:113054 genes are sufficient for encoding gadusol synthesis, they were cloned into a yeastexpression vector and transferred into a Saccharomyces cerevisiae strain, in which the transaldolase gene TAL1 had been deleted. Yeast possesses a robust pentose–phosphate pathway (Figure 2—figure supplement 4), and by removing the transaldolase enzyme, which normally metabolizes SH7P, and adding EEVS and MT-Ox, we expected to facilitate an effective shunt pathway from SH7P to gadusol. Analysis of the culture broth by HPLC, ESI-MS, and UV spectrophotometry revealed the presence of gadusol (Figure 2H) and its accumulation to ∼20 mg/l after 5 days (Figure 2E). The results not only demonstrate the ability of the engineered yeast to produce and secrete gadusol but also present a new avenue for large-scale production of the compound for possible commercial uses, for example, sunscreen and/or antioxidant (Plack et al., 1981; Schmid et al., 2006; Cardozo et al., 2007; Arbeloa et al., 2010).
Figure 2—figure supplement 4.
The Pentose Phosphate Pathway (Asamizu et al., 2012) and the shunt pathway to gadusol.
DOI:
http://dx.doi.org/10.7554/eLife.05919.016
To test the UV-protective activity of gadusol, a yeastrad1∆ mutant, which is sensitive to UVB, was suspended at approximately 107 cells/mL in the concentrated supernatant from the gadusol-producing yeast strain or from an otherwise isogenic control strain that did not produce gadusol. The gadusol-containing supernatant suppressed the UVB-sensitivity of the rad1∆ mutant (Figure 2I), confirming the UVB-protective activity of gadusol. Analogous experiments with a wild-type strain (RAD1) at higher doses of UVB showed comparable results (Figure 2J), consistent with UVB protective activity.As noted above, the SPCs EEVS, EVS, DDGS, aminoDHQS, and DOIS are all related to DHQS and are widespread in bacteria and fungi, but other than this report, are not known to exist in vertebrates or prevertebrates. We suggest that the vertebrate EEVS and MT-Ox genes were most plausibly acquired via horizontal gene transfer. Interestingly, searches identify the stramenopile Aureococcus anophagefferens and the microalgae Coccomyxa subellipsoidea, as the only non-vertebrate organisms in current databases that harbor a similar bifunctional MT-Ox gene, and both organisms have a predicted EEVS gene adjacent to that of MT-Ox. As algae are known to be active horizontal gene transfer agents (Ni et al., 2012), algae such as these become a plausible place both for the development of this alternate pathway for gadusol production and as a source of the genes found in vertebrates. Further supporting such a relationship, the A. anophagefferensEEVS protein is substantially more similar to the vertebrate EEVSs than it is to bacterial EEVSs (Figure 1A and Figure 1—figure supplement 2, denoted by the blue star).Further bioinformatics studies also showed that the tunicates and lancelets lack the EEVS and MT-Ox genes, suggesting that the gene transfer occurred sometime during the evolution of primitive chordates to bony fishes (Figure 3). While the EEVS and MT-Ox genes are retained in modern ray-finned fish (with the exception of puffer fish), as well as in amphibians, reptiles, and birds, they were lost in mammals, including the egg-laying mammalplatypus, indicating the lack of a direct link between gadusol and the mode of reproduction. The West Africancoelacanth genome (Amemiya et al., 2013) also appears to lack the EEVS and MT-Ox genes (Figure 3, Table 1). This rare ovoviviparous fish lives in caves 100–500 meters deep and feeds at night, and its lack of gadusol production ability may be directly related to its limited exposure to UV light and/or oxidative stress. Other than the well-documented presence of gadusol in fish eggs, where it can serve to protect the roe from UV damage (Arbeloa et al., 2010; Colleter et al., 2014), nothing is known of its role(s) in fish, reptiles, amphibians, and birds. Exploring its function in these organisms will increase our understanding of their physiology and ecology.
Figure 3.
Model timeline for the evolution of the EEVS and MT-Ox genes in vertebrates.
The genes entered an early vertebrate genome as a linked pair (vertical blue arrow) and were retained in the modern ray-finned fish, amphibians, reptiles, and birds as indicated by thick dark cyan arrows. Coelacanths and mammals lost the genes (thick red arrows). No full genome sequence is available for assessing the presence of EEVS and MT-Ox in lungfish. The phylogenetic trees of the EEVS and MT-Ox proteins or mRNA from a selected set of vertebrates can be found in Figure 3—figure supplements 1–3 and Supplementary files 1, 2.
DOI:
http://dx.doi.org/10.7554/eLife.05919.017
The distantly related E. coli protein CglD was used as an out-group. Numbers represent bootstrap confidence values.
DOI:
http://dx.doi.org/10.7554/eLife.05919.018
E. coli K12 dam was used as an out-group. Numbers represent bootstrap confidence values.
DOI:
http://dx.doi.org/10.7554/eLife.05919.019
E. coli cglD was used as an out-group. Numbers represent bootstrap confidence values.
DOI:
http://dx.doi.org/10.7554/eLife.05919.020
Model timeline for the evolution of the EEVS and MT-Ox genes in vertebrates.
The genes entered an early vertebrate genome as a linked pair (vertical blue arrow) and were retained in the modern ray-finned fish, amphibians, reptiles, and birds as indicated by thick dark cyan arrows. Coelacanths and mammals lost the genes (thick red arrows). No full genome sequence is available for assessing the presence of EEVS and MT-Ox in lungfish. The phylogenetic trees of the EEVS and MT-Ox proteins or mRNA from a selected set of vertebrates can be found in Figure 3—figure supplements 1–3 and Supplementary files 1, 2.DOI:
http://dx.doi.org/10.7554/eLife.05919.017
Phylogenetic tree of the EEVS proteins from a selected set of vertebrates.
The distantly related E. coli protein CglD was used as an out-group. Numbers represent bootstrap confidence values.DOI:
http://dx.doi.org/10.7554/eLife.05919.018
Phylogenetic tree of the MT-Ox proteins from a selected set of vertebrates.
E. coli K12 dam was used as an out-group. Numbers represent bootstrap confidence values.DOI:
http://dx.doi.org/10.7554/eLife.05919.019
Maximum likelihood tree of vertebrate EEVS mRNA sequences.
E. coli cglD was used as an out-group. Numbers represent bootstrap confidence values.DOI:
http://dx.doi.org/10.7554/eLife.05919.020
Materials and methods
Molecular phylogenetic analysis
For phylogenetic analysis, full-length amino acid sequences and vertebrate mRNA sequences were analyzed. A reciprocal BLAST hit analysis was performed with the EEVS protein (see Supplementary file 3). Sequences were aligned using MUSCLE. ProtTest was used to determine the best model of protein evolution (LG+G) (Darriba et al., 2011), and MEGA6 was used to determine the best fit nucleic acid evolutionary model (K80+G) (Tamura et al., 2013). RAxML was used for maximum likelihood analysis, and the robustness of the trees was assessed by bootstrap analysis (1000 replicates) (Stamatakis, 2014). Bayesian analysis was performed by MrBayes (version 3.2.3), using a random starting tree, running eight chains for 4,000,000 generations, sampling every 250 trees (Ronquist et al., 2012). The first 5000 trees were discarded as the burnin, with the remaining trees used to calculate posterior probability. RAxML and MrBayes were run on the CIPRES science gateway (Miller et al., 2010). MEGA6 was used for maximum likelihood analysis of vertebrate mRNA sequences with tree robustness assessed by bootstrap (500 replicates). Sources of proteins for the analyses are listed in Supplementary file 1.
Construction of LOC100003999 and zgc:113054 gene expression vectors
The LOC100003999 gene was codon optimized for E. coli and synthesized commercially (GeneScript USA Inc., Piscataway, NJ). The product was cloned into EcoRV site of pUC57-kan vector. The plasmid was digested with BglII and EcoRI and ligated into BamHI and EcoRI site of pRSET-B (Life Technologies, Carlsbad, CA) for the expression of N-terminal hexa-histidine-tagged protein. The zgc:113054 gene was also codon optimized for E. coli and commercially synthesized (GeneScript USA Inc.). The product was cloned into EcoRV site of pUC57-amp vector. The plasmid was digested with BglII and EcoRI and ligated into BamHI and EcoRI site of pRSET-B (Life Technologies) for the expression of N-terminal hexa-histidine-tagged protein.
Expression of valA, LOC100003999, and zgc:113054 genes in E. coli
pRSETB-valA, pRSETB-LOC100003999, and pRSETB-zgc:113054 plasmids were individually used to transform E. coli BL21 GOLD (DE3) pLysS. Transformants were grown overnight at 37°C on LB agar plate containing ampicillin (100 μg/ml) and chloramphenicol (25 μg/ml). A single colony was inoculated into LB medium (2 ml) containing the above antibiotics and cultured at 37°C for 8 hr. The seed culture (1 ml) was transferred into LB medium (100 ml) in a 500-ml flask and grown at 30°C until OD600 reached 0.6. Then, the temperature was reduced to 18°C. After 1-hr adaptation, isopropyl-D-1-thiogalactopyranoside (IPTG) (0.1 mM) was added to induce the N-terminal hexa-histidine-tagged proteins. After further growth for 16 hr, the cells were harvested by centrifugation (5000 rpm, 10 min, 4°C), washed twice with cold water, and stored at −80°C until used.
Purification of recombinant ValA, LOC100003999, and zgc:113054
Cell pellets from a 400-ml culture of E. coli BL21 GOLD (DE3) pLysS containing pRSETB-valA, pRSETB-LOC100003999, or pRSETB-zgc:113054 plasmids were resuspended in 20 ml of B buffer (40 mM Tris-HCl, 300 mM NaCl, 10 mM imidazole, pH 7.5). Cells were disrupted by sonication for 1 min (4 times, 2 min interval) at 13 watts on ice (Probe sonicator, Misonix, Farmingdale, NY). 20 ml of lysate was divided into 2-ml tubes and centrifuged (14,500 rpm, 20 min, 4°C). Soluble fractions were collected and transferred into a 50-ml tube. Ni-NTA (QIAGEN, Valencia, CA) resin (5 ml) was applied into 10-ml vol empty column, and the Ni-NTA resin was equilibrated with B buffer (50 ml, 10 CV). About 20 ml of supernatant from cell lysate was applied to the column (flow rate; 0.8 ml/min). The column was then washed with 100 ml (20 CV) of W buffer (40 mM Tris-HCl, 300 mM NaCl, 20 mM imidazole, pH 7.5) at 0.8 ml/min. The hexa-histidine-tagged proteins were eluted by imidazole addition using a gradient mixer containing 100 ml of W buffer and 100 ml of E buffer (40 mM Tris-HCl, 300 mM NaCl, 300 mM imidazole, pH 7.5). The fractions (150 drops or about 5 ml) were collected and checked by SDS-PAGE (Coomassie Blue staining). Fractions containing pure proteins were combined (25 ml) and dialyzed against 2 l of D buffer (10 mM Tris-HCl, pH 7.5) 3 times (every 3 hr). Dialyzed protein solution was concentrated by ultrafiltration (MWCO 10 K) to 200 μM and flash frozen in liquid N2 prior to storage at −80°C. The yields of the purified proteins were 57 mg/l for ValA, 18 mg/l for LOC100003999, and 79 mg/l for zgc:113054.
LOC100003999 assay conditions
Each reaction mixture (25 μl) contained Tris-HCl buffer (20 mM, pH 7.5), NAD+ (1 mM), CoCl2, or ZnSO4 (0.1 mM), SH7P (4 mM), and purified enzymes (0.12 mM). The mixture was incubated at 30°C for 2 hr. ValA (instead of LOC100003999) was used as a positive control. No enzyme (buffer only) was used as a negative control.
Coupled LOC100003999 and zgc:113054 assay conditions
Each reaction mixture (50 μl) contained potassium phosphate buffer (10 mM, pH 7.4), NAD+ (2 mM), CoCl2 (0.2 mM), SH7P (4 mM), and LOC100003999 cell-free extract (20 μl) was incubated at 30°C. After 6 hr, S-adenosylmethionine (5 mM) and purified zgc:113054 (0.1 mM) were added. The mixture was incubated at 30°C for another 6 hr. ValA was used (instead of LOC100003999) as a positive control. Extract of E. coli harboring pRSET B empty vector was used as a negative control.
Zgc:113054 assay using [6,6-2H2]-EEV as substrate
A reaction mixture (25 μl) containing potassium phosphate buffer (10 mM, pH 7.4), NAD+ (2 mM), CoCl2 (0.2 mM), S-adenosylmethionine (5 mM), [6,6-2H2]-EEV (4 mM), and purified zgc:113054 (0.1 mM) was incubated at 30°C for 2 hr. Boiled zgc:113054 was used as a negative control.
TLC analysis of EEV and gadusol
Analytical TLC was performed using silica gel plates (60 Å) with a fluorescent indicator (254 nm), which were visualized with a UV lamp and ceric ammonium molybdate (CAM) or 5% FeCl3 in MeOH-H2O (1:1) solutions.
GC-MS analysis of EEV
The enzymatic reaction mixtures were lyophilized, and the products were extracted with MeOH. The MeOH extract was then dried and Tri-Sil HTP (Thermo Scientific, Waltham, MA) (100 μl) was added and left to stand for 20 min. The solvent was removed in a flow of argon gas, and the silylated products were extracted with hexanes (100 μl) and injected into the GC-MS (Hewlett Packard 5890 SERIES II Gas chromatograph).
Enzymatic synthesis, purification, and analysis of gadusol
Fifty eppendorf tubes containing reaction mixtures (100 μl each), which consist of potassium phosphate buffer (10 mM, pH 7.4), SH7P (5 mM), NAD+ (2 mM), CoCl2 (0.2 mM), and LOC100003999 cell-free extract (40 μl), were incubated at 30°C. After 6 hr, S-adenosylmethionine (5.5 mM) and zgc:113054 cell-free extracts (30 μl) were added. The reaction mixtures were incubated at 30°C for another 6 hr. The reaction mixtures were quenched with 2 vol of MeOH, left to stand at −20°C for 20 min, then centrifuged at 14,500 rpm for 20 min. The supernatants were pooled and dried under vacuum. The residual water was frozen and lyophilized. The crude sample was dissolved in water (1 ml) and subjected to Sephadex LH-20 column chromatography using phosphate buffer (2.5 mM, pH 7) as an eluant. Fractions containing the product as judged by MS were combined and lyophilized. Furthermore, the product was purified by HPLC (Shimadzu LC-20AD, C18 column [YMC], 250 × 10 mm, 4 μm, flow rate 1 ml/min). Solvent system: MeOH—phosphate buffer (5 mM, pH 7), gradient 1–100% of MeOH (0–40 min). Peak at 12.74 min was collected and dried to give gadusol (0.4 mg). 1H NMR (700 MHz, D2O, cryo-probe): δ 4.10 (s, 1H, H-4), 3.71 (d, J = 12 Hz, H-7α), 3.56 (d, J = 12 Hz, H-7β), 3.49 (s, 3H, OCH3), 2.68 (d, J = 17 Hz, H-6α), 2.38 (d, J = 17 Hz, H-6β). HR-MS (ESI-TOF) (m/z): (M+H)+ calculated for C8H13O6, 205.0707; found, 205.0709.
Zebrafish lines and embryos
Adult wild-type 5D zebrafish were housed at the Sinnhuber Aquatic Research Laboratory on a recirculating system maintained at 28 ± 1°C with a 14 hr light per 10 hr dark schedule. Embryos were collected from group spawns of adult zebrafish as described previously (Reimers et al., 2006), and all experiments were conducted with fertilized embryos according to Oregon State University Institutional Animal Care and Use Protocols. Embryos were staged accordingly as previously described (Kimmel et al., 1995) and collected by hand for all experiments. Embryos were reared in media consisting of 15 mM NaCl, 0.5 mM KCl, 1 mM MgSO4, 0.15 mM KH2PO4, 0.05 mM Na2HPO4, and 0.7 mM NaHCO3 (Westerfield, 2000).
Polymerase chain reaction
All polymerase chain reaction (PCR) reactions were performed according to manufacturer's specifications. Cycling conditions: 96°C for 3 min, 95°C for 1 min, 65°C for 1 min, and 72°C for 1 min per kB DNA; 35 cycles were used followed by 10 min at 72°C. All PCR products were characterized on an agarose gel. If needed, the PCR product was excised from the gel and purified using the E.Z.N.A. Gel Extraction Kit (Omega Bio-tek, Norcross, GA).
Quantitative PCR of zebrafish samples
qPCR was performed on a Applied Biosystems StepOnePlus machine. The super mix PerfeCTa SYBR Green FastMix, ROX (Quanta biosciences, Gaithersburg, MD) was used. cDNA (100 ng) from time points at 6, 12, 24, 48, 72, 96, and 120 hpf was used. Super mix (18 µl) was added to bring the final volume to 20 µl. PCR conditions suggested by the supplier were used. For total RNA isolation, 30 embryos were homogenized in RNAzol (Molecular Research Center, Cincinnati, OH); RNA was purified according to the manufacturer's protocol. RNA was quantified by A260/280 ratios measured using a SynergyMx microplate reader (Biotek, Winooski, VT) and analyzed with the Gen5 Take3 module. 1 µg of RNA was used for cDNA synthesis. Superscript III First-Strand Synthesis (Life Technologies) and oligo d(T) primers were used to synthesize cDNA from the total RNA.
Isolation of gadusol from zebrafish
Embryos were collected and euthanized at 72 hpf by induced hypoxia through rapid chilling on ice for 30 min. Embryo media were removed until about 5 ml were left and frozen at −80°C. Embryos were lyophilized overnight. The freeze-dried embryos were then ground with a pestle and mortar under liquid nitrogen. The powder was collected and placed in a pre-weighed glass vial. The mortar was washed with MeOH-H2O (80:20), and the solvent was added to the powder. The solvent was evaporated, and powder was weighed. The embryo powder was extracted twice with MeOH-H2O (80:20). The two extracts were combined, dried, and weighed. The extract was suspended in MeOH-H2O (80:20) (1 ml) and extracted twice with hexanes. The aqueous layer was recovered, dried, and weighed. The extract was suspended in MeOH for analysis by mass spectrometry. The extract was dissolved in phosphate buffer pH 7.0 for identification by HPLC (Shimadzu SPD-20A system, YMC ODS-A column (4.6 id × 250 mm), MeOH—5 mM phosphate buffer (1% MeOH for 20 min followed by a gradient from 1 to 95% MeOH in 20 min), flow rate 0.3 ml/min, 296 nm. The isolated gadusol was analyzed by MS (ThermoFinnigan LCQ Advantage system) and NMR (in D2O; Bruker Unity 300 [300.15 MHz] spectrometer).
Construction of yeast mutants
The yeast strains used are listed in Supplementary file 4. The TRP1 gene was replaced in BY4742 tal1∆::KanMX4 with a wild-type URA3 allele from S288c by standard methods (Baudin et al., 1993). The deletion was confirmed by PCR using primer pairs TRP1DisUP/TRP1DisLO and URA3DisUP/TRP1DisLO. The BY4742 tal1∆::KanMX4 trp1∆::URA3 strain was then co-transformed (Gietz et al., 1992) with pXP416 and pXP420 to generate an empty vector control strain and with pXP420-EEVS and pXP416-MT-Ox to generate a gadusol-producing strain. The RAD1 gene was replaced in BY4742 tal1∆::KanMX4 trp1∆::URA3 with a wild-type LEU2 allele from S288c by standard methods (Reynolds et al., 1987). The deletion was confirmed by PCR using primer pairs RAD1UP/RAD1LO. The resultant BY4742 tal1∆::KanMX4 trp1∆::URA3rad1Δ::LEU2 strain was then co-transformed with pXP416 and pXP420.
Media and yeast growth conditions
Cells were pre-grown in YEPD (1% yeast extract, 2% peptone, and 2% glucose) for transformations, and in YNB (Bacto yeastnitrogen base without amino acids) + 2% glucose supplemented with 30 µg/ml leucine and 30 µg/ml lysine to select for transformants and to produce gadusol. Liquid media were sterilized by filtration using a 0.45-µm filter, and agar-based media were sterilized by autoclaving. Liquid cultures were grown at 30°C for 48 hr and 200 rpm; plates were incubated at 30°C.
Yeast overexpression plasmid construction
Plasmids are listed in Supplementary file 5. Primers used for PCR are listed in Supplementary file 6. PCR amplicons with SpeI and XhoI terminal restriction sites were generated for the EEVS gene and MT-Ox gene using pRSETB-EEVS and pRSETB-MTOx as templates, respectively. The EEVS and MT-Ox amplicons were then digested with SpeI and XhoI and ligated into SpeI-digested pXP420 and XhoI-digested pXP420 and pXP416, respectively, and introduced into competent E. coli (Top 10; Life Technologies) by transformation. E. coli transformants were selected on LB plates supplemented with ampicillin (100 µg/ml). Transformants were then screened by digesting plasmid DNA with SpeI and XhoI restriction enzymes and analyzing fragments by agarose gel electrophoresis.
Identification of gadusol production in S. cerevisiae
S. cerevisiae cell pellets from 5 ml cultures were extracted with MeOH, and the supernatant was extracted with nBuOH. Extracts were concentrated and analyzed by HPLC (Shimadzu SPD-20A system, YMC ODS-A column [4.6 id × 250 mm], MeOH—5 mM phosphate buffer (1% MeOH for 20 min followed by a gradient from 1 to 95% MeOH in 20 min), flow rate 0.3 ml/min, 296 nm.
Irradiation protocol
A rad1∆ mutant (MATα his3∆1 leu2∆0 lys2∆0 trp1∆::URA3ura3∆0 rad1∆::LEU2tal1∆::KanMX4/pXP416, pXP420) or wild-type RAD1 strain (S288c, MATα SUC2 gal2 mal2 mel flo1 flo8-1 hap1 ho bio1 bio6) was grown at 30°C and 200 rpm in YNB + 2% glucose + 30 µg/ml leu + 30 µg/ml lys. Cells were harvested after 24 hr by centrifugation, washed twice in the ninefold concentrated supernatant of either the gadusol-producing strain BY4742 tal1∆ trp1∆/pXP416-MTOx, pXP420-EEVS or of the control strain BY4742 tal1∆ trp1∆/pXP416, pXP420, and suspended in the respective concentrated supernatants at 107 cells/ml. Cells (375 µl) were irradiated with UVB (302 nm) at the indicated doses in wells of a 24-well microtiter plate shaken at 900 rpm. 3 µl aliquots of cells were then spotted onto a YEPD plate, which was incubated 24 hr at 30°C prior to being photographed. The supernatants of the gadusol producing and control strains were obtained by centrifugation following 5 day of growth in YNB + 2% glucose + 30 µg/ml leucine + 30 µg/ml lysine at 30°C and 200 rpm. Supernatants were freeze-dried, dissolved in a volume of distilled water 1/10 of the initial culture volume, and stored at 4°C until use. Just prior to suspension of cells, the concentrated supernatant was adjusted to 50 mM phosphate, pH 7.0 resulting in a final ninefold concentrate.eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see review process). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.Thank you for sending your work entitled “De novo Synthesis of a Sunscreen Compound in Vertebrates” for consideration at eLife. Your article has been favorably evaluated by Ian Baldwin (Senior Editor) and 3 reviewers, one of whom is a memaber of our Board of Reviewing Editors.The following individuals responsible for the peer review of your submission have agreed to reveal their identity: Reviewing editor, Wilfred van der Donk; Emily Balskus, peer reviewer. A third reviewer remains anonymous.The Reviewing editor and the other reviewers discussed their comments, and all three reviewers agree that this study is of general interest to the eLife readership, provided the authors can address the concerns listed below.The authors show in this manuscript that genes encoding a 2-epi-5-epi-valiolone synthase (EEVS) and a bifunctional O-methyltransferase/NAD+ dependent oxidoreductase genes are present in various egg-laying vertebrates. They also show that the two genes are expressed in zebrafish and that gadusol can be isolated from zebrafish embryos. The authors introduced both genes into an engineered strain of yeast that accumulates the EEVS substrate sedoheptulose-7-phosphate and demonstrate the production of gadusol. Finally, they show that extracts from gadusol-producing yeast protect both wild-type and mutant yeast strains from UV-B radiation. Collectively, these data provide strong support that vertebrates can make these compounds and that they need not be produced by symbionts or obtained in diet. Overall, the biochemical data is very convincing, with some questions raised by the reviewers. The bioinformatics/evolution data was considered less robust and needs strengthening. Results from the phylogenetic analyses suggest a series of repeated transfers between bacterial and eukaryotic homologs that appear to be an interesting part of the story. Yet, the methods listed are not sufficient to make this claim. Below are several ways to validate/improve these phylogenetic results. The reviewers agree that the requested revisions should be doable.The Reviewing editor has assembled the following comments to help you prepare a revised submission:1) The inclusion of bacterial and eukaryotic homologs into the evolutionary history of these genes needs to be justified by a reciprocal blast hit analysis in which the authors switch the BLAST query sequence between bacteria and eukaryotes to see if they always retrieve the same hits in the database. This concern is particularly relevant when using protein sequences, because convergent evolution rather than horizontal gene transfer may drive the appearance of reciprocal orthology when there isn't any.2) A model of protein evolution needs to be statistically selected to build the tree. The authors appear to have used the simple, default parameter in Geneious, which is insufficient for accuracy in phylogenetic tree building. To select a model of protein evolution, ProtTest needs to be run, and the selected model needs to be implemented in the tree building.3) In addition to building the tree with protein sequences, can the authors build the tree with DNA sequences? The DNA divergences may be too high to do this for all of the sequences in the tree, but subsections of the tree could be reanalyzed with DNA sequences, starting with reciprocal blast analyses and then model selection (using ModelTest for DNA) and tree building.4) It is highly desirable to use multiple phylogenetic inference methods to validate evolutionary relationships. Thus, in addition to the reported Max Likelihood analysis, a Mr. Bayes analysis should be run alongside.5) Bootstrap and posterior probability support values must be reported in a phylogenetic tree analysis to evaluate the strength of the evolutionary results.6) The authors express and purify the proteins of interest, but the experimental section suggests that for some experiments they were assayed as cell free extracts. It is not clear which results discussed in the text and shown in the figures were obtained with purified protein and which with cell free extracts, but the reader is left with the impression that all data was obtained with purified proteins. The authors should also explain why cell free extracts were used when purified proteins were apparently available.7) The authors should provide more information about the in vitro biochemical characterization of the zebrafishEEVS. Did they test the activity of the enzyme toward sugar substrates other than sedoheltulose-7-phosphate? Also the TLC shown in Figure 1–figure supplement 2B seems to show multiple new products being formed by this enzyme. These products do not appear to be present in the assay with ValA, the bacterial EEVS used as a positive control. What are these alternate products?8) The authors should provide some additional commentary in their discussion regarding the proposed biological roles of gadusol and MAAs in fish and the issue of their origin (endogenous synthesis vs dietary), and highlight issues that remain unresolved with the current results. For instance, it has been demonstrated experimentally that fish can accumulate MAAs from their diets (see Mar Biol 144:1057-1064, Comp Biochem Physiol Part A Mol Integr Physiol 120:587-598). Are the relative concentrations of gadusol and MAAs in fish roe known? Just because there is an endogenous pathway for gadusol synthesis in fish does not mean that this compound cannot also be acquired from dietary sources along with the MAAs or that it is the primary contributor to UV protection.1) The inclusion of bacterial and eukaryotic homologs into the evolutionary history of these genes needs to be justified by a reciprocal blast hit analysis in which the authors switch the BLAST query sequence between bacteria and eukaryotes to see if they always retrieve the same hits in the database. This concern is particularly relevant when using protein sequences, because convergent evolution rather than horizontal gene transfer may drive the appearance of reciprocal orthology when there isn't any.We have performed a reciprocal blast hit analysis by switching the BLAST query sequence between bacteria and eukaryotes and found that from 55 bacterial EEVS sequences blasted against the vertebrate EEVS sequences, 17 unique vertebrate EEVS subjects were hit. Then the vertebrate EEVS sequences were blasted against all proteins (minus vertebrate proteins) used for phylogenetic analysis, which hit 7 unique bacterial EEVS subjects. Two pairs of reciprocal best hits were identified; Actinoplanes sp. A40644 (BAD07382.1) was best hit with Falco peregrinus (XP_005230087.1), and Cystobacter fuscus DSM 2262 (EPX59479.1) with Dicentrarchus labrax (CBN80976.1). The result is presented in Supplementary file 3.2) A model of protein evolution needs to be statistically selected to build the tree. The authors appear to have used the simple, default parameter in Geneious, which is insufficient for accuracy in phylogenetic tree building. To select a model of protein evolution, ProtTest needs to be run, and the selected model needs to be implemented in the tree building.As suggested, we have used ProtTest to determine the best model of protein evolution (LG+G).3) In addition to building the tree with protein sequences, can the authors build the tree with DNA sequences? The DNA divergences may be too high to do this for all of the sequences in the tree, but subsections of the tree could be reanalyzed with DNA sequences, starting with reciprocal blast analyses and then model selection (using ModelTest for DNA) and tree building.We did try to build the tree with DNA sequences, and as suspected by the reviewers the DNA divergences are too high and we didn’t see a clear separation between clades. We then decided to focus only on the EEVS DNA sequences, but still didn’t see a clear separation between bacterial and vertebrate sequences. Finally, we narrowed down our analysis to vertebrate mRNAs only, and the result is shown in Figure 3–figure supplement 3. MEGA6 was used to determine the best fit nucleic acid evolutionary model (K80+G), and for maximum likelihood analysis of the vertebrate mRNA sequences with tree robustness assessed by bootstrap (500 replicates).
Figure 3—figure supplement 3.
Maximum likelihood tree of vertebrate EEVS mRNA sequences.
E. coli cglD was used as an out-group. Numbers represent bootstrap confidence values.
DOI:
http://dx.doi.org/10.7554/eLife.05919.020
4) It is highly desirable to use multiple phylogenetic inference methods to validate evolutionary relationships. Thus, in addition to the reported Max Likelihood analysis, a Mr. Bayes analysis should be run alongside.We have used RAxML for maximum likelihood analysis, and the robustness of the trees was assessed by bootstrap analysis (1000 replicates). As suggested, Bayesian analysis was performed by MrBayes (version 3.2.3), using a random starting tree, running eight chains for 4,000,000 generations, sampling every 250 trees. The first 5,000 trees were discarded as the burnin, with the remaining trees used to calculate posterior probability. RAxML and MrBayes were run on the CIPRES science gateway. The Bayesian phylogenetic tree is presented in Figure 1, whereas the maximum likelihood phylogenetic tree is presented in Figure 1–figure supplement 2. The results are fully consistent with the previous ones, and even place both algal EEVS sequences closer to the vertebrate sequences, further strengthening our horizontal gene transfer proposal.5) Bootstrap and posterior probability support values must be reported in a phylogenetic tree analysis to evaluate the strength of the evolutionary results.We have included the bootstrap and posterior probability values for main branches in the phylogenetic trees. For clarity, values for branches within the clades were excluded.6) The authors express and purify the proteins of interest, but the experimental section suggests that for some experiments they were assayed as cell free extracts. It is not clear which results discussed in the text and shown in the figures were obtained with purified protein and which with cell free extracts, but the reader is left with the impression that all data was obtained with purified proteins. The authors should also explain why cell free extracts were used when purified proteins were apparently available.Characterization of the zebrafishEEVS was carried out with a purified LOC100003999 protein as shown in Figure 1–figure supplement 3. However we discovered that the purified EEVS was relatively unstable and the activity was much lower than that in the cell free extracts. Therefore, for the characterization of the second enzyme (purified MT-Ox), we used cell free extracts containing EEVS to make sure that enough EEV (the substrate for MT-Ox) is produced. We have specified whether purified proteins or cell-free extracts were used in the figure legends.
Figure 1—figure supplement 3.
SDS PAGE of EEVS and MT-Ox proteins and thin-layer chromatography (TLC) analysis of their products.
(A) SDS PAGE of purified recombinant ValA, LOC100003999, and zgc:113054, and E. coli cell free extracts containing the enzymes. (B) TLC analysis of enzymatic products of purified ValA and purified LOC100003999. (C) TLC analysis of EEV and the zgc:113054 product (gadusol) using FeCl3 solution as a spraying agent. Only gadusol, but not EEV, can be detected as a maroon spot.
DOI:
http://dx.doi.org/10.7554/eLife.05919.007
7) The authors should provide more information about the in vitro biochemical characterization of the zebrafishEEVS. Did they test the activity of the enzyme toward sugar substrates other than sedoheltulose-7-phosphate? Also the TLC shown in
seems to show multiple new products being formed by this enzyme. These products do not appear to be present in the assay with ValA, the bacterial EEVS used as a positive control. What are these alternate products?Since the amino acid sequence of zebrafishEEVS is highly similar to the bacterial EEVS, sharing all the 14 conserved active site residues, we only tested the activity of the protein toward sedoheptulose 7-phosphate and confirmed its identity as an EEVS. We did not test the activity of the enzyme toward other sugar substrates. Regarding the additional spots on the TLC, we are not sure what they are. However, they do not appear to be the products of the enzyme. We have replaced the TLC with a new one with no or very little of these spots.8) The authors should provide some additional commentary in their discussion regarding the proposed biological roles of gadusol and MAAs in fish and the issue of their origin (endogenous synthesis vs dietary), and highlight issues that remain unresolved with the current results. For instance, it has been demonstrated experimentally that fish can accumulate MAAs from their diets (see Mar Biol 144:1057-1064, Comp Biochem Physiol Part A Mol Integr Physiol 120:587-598). Are the relative concentrations of gadusol and MAAs in fish roe known? Just because there is an endogenous pathway for gadusol synthesis in fish does not mean that this compound cannot also be acquired from dietary sources along with the MAAs or that it is the primary contributor to UV protection.Based on the present study, we believe that gadusol and MAAs found in marine fish are synthesized through two different pathways. While we propose that gadusol is synthesized de novo by the fish, the accumulation of MAAs in fish would still appear to be of dietary origin. Therefore, we have added the following comment in the Discussion “However, as MAAs are synthesized via a different pathway and there is no evidence that fish have those biosynthetic enzymes, the accumulation of MAAs in fish would still appear to be of dietary origin (Mason et al., 1998; Zamzow, 2004).”
Authors: Julie Colléter; David J Penman; Stéphane Lallement; Christian Fauvel; Tanja Hanebrekke; Renate D Osvik; Hans C Eilertsen; Helena D'Cotta; Béatrice Chatain; Stefano Peruzzi Journal: PLoS One Date: 2014-10-20 Impact factor: 3.240
Authors: Chad Garner; Richard Ahn; Yuan Chun Ding; Linda Steele; Samantha Stoven; Peter H Green; Alessio Fasano; Joseph A Murray; Susan L Neuhausen Journal: PLoS One Date: 2014-07-07 Impact factor: 3.240
Authors: Melissa A Morgan; Corey M Griffith; Meredith M Dinges; Yana A Lyon; Ryan R Julian; Cynthia K Larive Journal: Aquat Toxicol Date: 2019-05-01 Impact factor: 4.964
Authors: Andrew R Osborn; Kelsey M Kean; Khaled M Alseud; Khaled H Almabruk; Shumpei Asamizu; Janet A Lee; P Andrew Karplus; Taifo Mahmud Journal: ACS Chem Biol Date: 2017-02-17 Impact factor: 5.100