Cordula Boehm1, Mark C Field1,2. 1. School of Life Sciences, University of Dundee, Dow Street, Dundee, DD1 5EH, UK. 2. Biology Centre, Institute of Parasitology, Czech Academy of Sciences, České Budějovic, 37005, Czech Republic.
Abstract
Background: The eukaryotic endomembrane system most likely arose via paralogous expansions of genes encoding proteins that specify organelle identity, coat complexes and govern fusion specificity. While the majority of these gene families were established by the time of the last eukaryotic common ancestor (LECA), subsequent evolutionary events has moulded these systems, likely reflecting adaptations retained for increased fitness. As well as sequence evolution, these adaptations include loss of otherwise canonical components, the emergence of lineage-specific proteins and paralog expansion. The exocyst complex is involved in late exocytosis and additional trafficking pathways and a member of the complexes associated with tethering containing helical rods (CATCHR) tethering complex family. CATCHR includes the conserved oligomeric Golgi (COG) complex, homotypic fusion and vacuole protein sorting (HOPS)/class C core vacuole/endosome tethering (CORVET) complexes and several others. The exocyst is integrated into a complex GTPase signalling network in animals, fungi and other lineages. Prompted by discovery of Exo99, a non-canonical subunit in the excavate protist Trypanosoma brucei, and availability of significantly increased genome sequence data, we re-examined evolution of the exocyst. Methods: We examined the evolution of exocyst components by comparative genomics, phylogenetics and structure prediction. Results: The exocyst composition is highly conserved, but with substantial losses of subunits in the Apicomplexa and expansions in Streptophyta plants, Metazoa and land plants, where for the latter, massive paralog expansion of Exo70 represents an extreme and unique example. Significantly, few taxa retain a partial complex, suggesting that, in general, all subunits are probably required for functionality. Further, the ninth exocyst subunit, Exo99, is specific to the Euglenozoa with a distinct architecture compared to the other subunits and which possibly represents a coat system. Conclusions: These data reveal a remarkable degree of evolutionary flexibility within the exocyst complex, suggesting significant diversity in exocytosis mechanisms. Copyright:
Background: The eukaryotic endomembrane system most likely arose via paralogous expansions of genes encoding proteins that specify organelle identity, coat complexes and govern fusion specificity. While the majority of these gene families were established by the time of the last eukaryotic common ancestor (LECA), subsequent evolutionary events has moulded these systems, likely reflecting adaptations retained for increased fitness. As well as sequence evolution, these adaptations include loss of otherwise canonical components, the emergence of lineage-specific proteins and paralog expansion. The exocyst complex is involved in late exocytosis and additional trafficking pathways and a member of the complexes associated with tethering containing helical rods (CATCHR) tethering complex family. CATCHR includes the conserved oligomeric Golgi (COG) complex, homotypic fusion and vacuole protein sorting (HOPS)/class C core vacuole/endosome tethering (CORVET) complexes and several others. The exocyst is integrated into a complex GTPase signalling network in animals, fungi and other lineages. Prompted by discovery of Exo99, a non-canonical subunit in the excavate protist Trypanosoma brucei, and availability of significantly increased genome sequence data, we re-examined evolution of the exocyst. Methods: We examined the evolution of exocyst components by comparative genomics, phylogenetics and structure prediction. Results: The exocyst composition is highly conserved, but with substantial losses of subunits in the Apicomplexa and expansions in Streptophyta plants, Metazoa and land plants, where for the latter, massive paralog expansion of Exo70 represents an extreme and unique example. Significantly, few taxa retain a partial complex, suggesting that, in general, all subunits are probably required for functionality. Further, the ninth exocyst subunit, Exo99, is specific to the Euglenozoa with a distinct architecture compared to the other subunits and which possibly represents a coat system. Conclusions: These data reveal a remarkable degree of evolutionary flexibility within the exocyst complex, suggesting significant diversity in exocytosis mechanisms. Copyright:
A sophisticated level of cellular compartmentalisation is the major feature that differentiates prokaryotic from eukaryotic cells and underpins the origins of the nucleus. Early eukaryotic ancestors possessed a complex internal membrane system, suggesting rapid evolution after the first eukaryotic common ancestor (FECA) arose and prior to origin of the major eukaryotic super-groups (
Dacks & Field, 2018;
Guy
;
Koumandou
;
Schlacht
). It is clear that these systems predate the origins of what would be classically recognised as eukaryotes, as some ancestral genes for constructing an endomembrane system were present in prokaryotes, and specifically Archaea (
Eme
;
Spang
) (
Figure 1A).
Figure 1.
Evolution and structure of the exocyst.
A) Cartoon representing the major supergroups, which are referred to in the text. The inferred position of the last eukaryotic common ancestor (LECA) is indicated and the supergroups are colour coordinated with all other figures.
B) Structure of trypanosome Exo99, modelled using Phyre2 (intensive mode). The model for the WD40/b-propeller (blue) is likely highly accurate. The respective orientations of the a-helical regions may form a solenoid or similar, but due to a lack of confidence in the disordered linker regions this is highly speculative.
C and
D) Structure of the
Saccharomyces cerevisiae exocyst holomeric octameric complex. In
C the cryoEM map (at level 0.100) is shown and in
D, the fit for all eight subunits (pdb 5yfp). Colours for subunits are shown as a key, and the orientation of the cryoEM and fit are the same for
C and
D. All structural images were modelled by the authors from PDB using UCSF Chimera.
Evolution and structure of the exocyst.
A) Cartoon representing the major supergroups, which are referred to in the text. The inferred position of the last eukaryotic common ancestor (LECA) is indicated and the supergroups are colour coordinated with all other figures.
B) Structure of trypanosome Exo99, modelled using Phyre2 (intensive mode). The model for the WD40/b-propeller (blue) is likely highly accurate. The respective orientations of the a-helical regions may form a solenoid or similar, but due to a lack of confidence in the disordered linker regions this is highly speculative.
C and
D) Structure of the
Saccharomyces cerevisiae exocyst holomeric octameric complex. In
C the cryoEM map (at level 0.100) is shown and in
D, the fit for all eight subunits (pdb 5yfp). Colours for subunits are shown as a key, and the orientation of the cryoEM and fit are the same for
C and
D. All structural images were modelled by the authors from PDB using UCSF Chimera.An established theme in the evolution of membrane transport is the central role of paralogous protein families in dictating compartmental identify, specificity and supporting transport functions between compartments. These families include small GTPases, SNAREs, coat complexes and tethers. For example, SNARE and Rab paralogs associate with distinct subcellular organelles (
Elias
;
Khurana
;
Zerial & McBridge, 2001), and it is likely that new organelles and/or pathways develop via emergence of novel SNARE and/or Rab paralogs through gene duplication and neofunctionalisation (
Dacks & Field, 2007;
Ramadas & Thattai, 2013). The evolution of some of these families has been reconstructed in some detail (
Elias
;
Hirst
;
Venkatesh
).Deep evolutionary relationships between proteins forming vesicular coats and other structures, including the COPI and II complexes, clathrin/adaptin heterotetramers and the nuclear pore complex, further supports the concept of stepwise acquisition of complexity prior to the last eukaryotic common ancestor (LECA) (
Rout & Field, 2017). Perhaps most remarkable is the presence of a fully differentiated set of coat complexes and specificity-encoding machinery in the LECA, and consequentially, over a billion years this ancestral endomembrane system has expanded, contracted and neofunctionalised such that the current configurations of eukaryotic endomembrane systems vary hugely. More recently it has been speculated, based on the diversity of the architecture of nuclear pore complex subunits, that the nucleus, and possibly the intraflagellar transport system, arose comparatively late, during the transition from FECA to LECA (
Field & Rout, 2019).A further group of proteins central to compartmentalisation are the membrane-tethering complexes (MTCs). Considerably more diverse than Rab and SNARE families in both architecture and mechanism(s) of action, MTCs control Rab GTP cycles, as well as tether vesicles for fusion. MTCs have splendid names that include transport protein particle (TRAPP) I, II and III, conserved oligomeric Golgi (COG), homotypic fusion and vacuole protein sorting (HOPS), class C core vacuole/endosome tethering (CORVET) (plus class C homologs in endosome-vesicle interaction, CHEVI and factors for endosome recycling and retromer interactions, FERARI), dorsalin-1 (Dsl1), Golgi-associated retrograde protein/endosome-associated recycling protein (GARP/EARP) and the exocyst. Significantly, MTCs vary considerably in the number of subunits they possess, but evidence for common evolutionary descent for some MTC subunits has been offered (
Koumandou
;
Whyte & Munro, 2002;
Yu & Hughson, 2010). MTCs are widely distributed among eukaryotic taxa and many subunits share the complexes associated with tethering-containing helical rods (CATCHR) fold, consistent with a common origin for MTCs (
Klinger
;
Koumandou
;
Yu & Hughson, 2010), and further supported by the structural similarity of several exocyst subunits sharing the CATCHR fold, which is almost exclusively α-helical (
Sivaram
;
Vasan
). Further, the
Saccharomyces cerevisiae CATCHR complexes, GARP, COG1–4 subcomplex of COG and HOPS share similar subunit organization (
Chou
). This is not only consistent with possible common ancestry, but also may indicate mechanistic similarities.Exocyst subunits were initially identified as Sec3, 5, 6, 8, 10 and 15 mutants in a screen for trafficking defects (
Novick
). Two additional subunits, Exo70 and Exo84, were subsequently discovered and the holocomplex presented as a stable 19.5S particle (
Bowser
;
Guo
;
TerBush
) (
Figure 1D). Overall, the yeast exocyst forms a loose open rod, but has considerable conformational flexibility (
Heider
;
Picco
;
TerBush
) and interacts with multiple plasma membrane-located GTPases (
Wu
) and can act as an effector to both Rho and Rab GTPases. CryoEM structures of the complex and subunits at 4.4Å resolution revealed a highly conserved architecture for the subunits with between two and four helical ‘CorEx’ bundles, together with an extended N-terminal α-helix that is critical for assembly (
Mei
). Notably, CorEx shares structural similarities with the N-terminus of COG and GARP subunits. Only Sec3 and Exo84 have an additional domain, namely a pleckstrin homology (PH) lipid interaction domain.Both structural and experimental data from yeast suggests that the exocyst is formed of two heterotertrameric subcomplexes; Sec3, 5, 6, 8 and Sec10, 15, Exo 70 and 84. Sec3 appears critical for both assembly and disassembly (
Ahmed
;
Luo
) and Exo84 phosphorylation is implicated in controlling overall exocyst assembly and function. Significantly, these Sec3 and Exo84 are components of different subcomplexes and likely interact with phospholipids through their PH domains. The exocyst has clear roles in secretion but is also implicated in disease susceptibility, host cell invasion by intracellular bacteria and development (
Arasaki
;
Bonnemaijer
;
Lira
) with evidence for additional roles in endocytosis/recycling also published (
Boehm
;
Jose
;
Monteiro
). Also the tetrameric subcomplexes function in autophagy with potential for additional specialisation (
Bodemann
;
Kulich
).Previous comparative genomics studies identified only six of the eight canonical exocyst subunits, with Sec5 and Exo84 evading identification in all trypanosomatids (
Koumandou
), suggesting a simplified exocyst complex in trypanosomatids, possible replacement of otherwise canonical subunits or failure to uncover highly divergent orthologs. The latter possibility was demonstrated following biochemical identification of all eight canonical subunits as well as a ninth, Exo99, in trypanosomes (
Boehm
). Using updated methodology and genome resources, we find evidence for considerable evolutionary flexibility in exocyst subunit retention, with essentially complete loss from some lineages and a tentative suggestion of a connection to novel coat proteins.
Results
Identifying exocyst subunits across the eukaryotes
The earlier failure to identify Sec5 and Exo84 in excavates by comparative genomics (
Koumandou
), and subsequent identification in trypanosomes by immunoisolation and mass spectrometry, indicated that this earlier study lacked sensitivity, and suggested other false negatives within the dataset (
Boehm
). Furthermore, the distribution of the recently identified Exo99 subunit has not been investigated systematically. Considerable genome sequencing has taken place in the period since the earlier analysis, as well as availability of superior search algorithms, prompting this reanalysis.We screened for genes encoding the eight canonical exocyst subunits and the newly identified subunit Exo99 in 87 eukaryotic genomes by BLAST, inspection of alignments and phylogenetic reconstruction. This increased the size of our genome panel approximately five-fold and took advantage of the increased quality of these resources. Furthermore, we were able to harness high quality phylogenetic reconstruction to validate our data. Only subunit predictions that passed reciprocal BLAST, phylogenetic validation and were predicted to be within a similar length as the query, together with homology that extended over more that 50% of the sequence (to avoid calls based exclusively on conservation of small architectural features) were annotated as ‘found’. Example phylogenetic trees for three subunits (Sec15, Exo99 and Exo70) are shown in
Figure 2 and the overall distribution in
Figure 3. Phylogenetic trees for the remaining subunits, as well as accession numbers of identified orthologs are included as
Extended data (Figures S1-6, Table S1).
Figure 2.
Phylogenetic reconstructions for Sec15, Exo99 and Exo70.
Trees show the best Bayesian topology for reconstructions of
A) Sec 15,
B) Exo99 and
C) Exo70. Numerical values at the nodes indicate statistical support from analysis with MrBayes and PhyML. Values for highly supported nodes have been replaced by symbols as indicated in the legend. Species names are coloured for recognised supergroups: Opisthokonta (blue), Amoebozoa (pink), Archeoplastida (green), Stramenopile-Alveolate-Rhizaria (SAR, orange), Excavata (purple). The same colour convention is used throughout this figure and phylogenetic trees.
Figure 3.
Coulson plot of distribution of exocyst components across eukaryotic lineages.
Calls are based on a combination of BLAST, alignments and phylogenetic reconstruction. Filled circles indicate the presence of the protein, open circles that an ortholog was not found and grey indicates that the sequence could not be fully verified by phylogenetics. The numbers in the circles indicate multiple parlogs and the number of individual genes identified. Accession numbers for all reported exocyst subunits are given in Table S1 (
Extended data). Species names are coloured by supergroup as in
Figure 2. The eight canonical exocyst subunits are conserved throughout the eukaryotic lineage, with the exception of the Alveolata. Some of the plants have greatly increased numbers of Exo84 and especially Exo70 genes in their genomes. The presence of Exo99 is restricted to the Excavata.
Phylogenetic reconstructions for Sec15, Exo99 and Exo70.
Trees show the best Bayesian topology for reconstructions of
A) Sec 15,
B) Exo99 and
C) Exo70. Numerical values at the nodes indicate statistical support from analysis with MrBayes and PhyML. Values for highly supported nodes have been replaced by symbols as indicated in the legend. Species names are coloured for recognised supergroups: Opisthokonta (blue), Amoebozoa (pink), Archeoplastida (green), Stramenopile-Alveolate-Rhizaria (SAR, orange), Excavata (purple). The same colour convention is used throughout this figure and phylogenetic trees.
Coulson plot of distribution of exocyst components across eukaryotic lineages.
Calls are based on a combination of BLAST, alignments and phylogenetic reconstruction. Filled circles indicate the presence of the protein, open circles that an ortholog was not found and grey indicates that the sequence could not be fully verified by phylogenetics. The numbers in the circles indicate multiple parlogs and the number of individual genes identified. Accession numbers for all reported exocyst subunits are given in Table S1 (
Extended data). Species names are coloured by supergroup as in
Figure 2. The eight canonical exocyst subunits are conserved throughout the eukaryotic lineage, with the exception of the Alveolata. Some of the plants have greatly increased numbers of Exo84 and especially Exo70 genes in their genomes. The presence of Exo99 is restricted to the Excavata.
Distribution of the canonical octamer subunits
The eight canonical exocyst subunits are well conserved, reflecting their ancient origins and importantly are detected in representatives of all five eukaryotic supergroups. Phylogenetic analysis indicates that these sequences are
bona fide orthologs and, as the topology of the gene and taxon trees are highly similar, there is no evidence for lateral gene transfer (
Figure 2). Given the noted similarities in structures of these subunits and the clear sharing of extensive α-helical bundles revealed by cryoEM, this suggests that the exocyst octamer indeed arose prior to the LECA and most likely by paralog expansion from an ancestral subunit complement.We found few examples of partial retention of subunits and which may also indicate that some of our examples of single subunit losses are artifactual. This pattern of retention may also argue against fully independent functionality for the two subcomplexes and overall is consistent with recent functional studies (
Ahmed
).We found expansions of Sec6 and 15 in vertebrates and the close relatives
Brachiostoma (commonly lancelets), where multiple subunits have expanded. This pattern may reflect tissue complexity, but the absence of an obvious interaction between Sec6 and 15 suggests this is unlikely to be associated with a specific subfunction. However, the locations of these two subunits, located at opposite ends of the octameric complex, may suggest that this facilitates variation in interactions between exocytic vesicles and plasma membrane docking sites in different tissues (
Heider
;
Heider & Munson, 2012). Sec10 is also expanded in the fungi, and there is complete absence of the complex in Rhizopus, the only such example in the Opistokhonta sampled here. As this fungal taxon lacks an ability for septation, this may explain the loss of the entire exocyst, although this is highly speculative.Both major Amoebozoa lineages retain a full complement of exocyst subunits in the genomes of the majority of species sampled. The absence of several subunits from
Entamoaeba invadens and one from
E. mutabilli is noteworthy, as is the expansion of Sec10, a feature shared with fungi. While this could raise the possibility that a Sec10 duplication occurred at the root of the unikhonts, this is not supported by phylogenetic reconstruction, which suggests independent,
albeit likely basal, origins for fungal and Entamoeba Sec10 paralogs (Figure S5,
Extended data).The most significant subunit expansion within the entire dataset is within the Streptophyta plants. Interestingly, in the closely allied algal Chlorophytes, loss is the dominant evolutionary trend, with
Cyanidioschyzon merolae and
Ostreococcus tauri lacking sufficient subunits to build a canonical exocyst, which may suggest alternate functions or mechanisms, at least for
O. tauri and
Micromonas where only four subunits could be identified. Interestingly, for Chlorophytes lacking many subunits, Sec6 and Sec10 are retained, which significantly are components of distinct subcomplexes and unlikely to physically interact (
Ahmed
). A small number of plants also have multiple Sec10 paralogs and, in common with other taxa (see above), the origin of the duplication was likely taxon-specific (Figure S5,
Extended data).The most extreme expansions within Streptophyta subunits are Exo70 and Exo84, with at least 26 copies detected in
Populus trichocarpa and 18 in rice. We are aware that the total number of genes we have predicted for all eight exocyst subunits in plants varies to some extent from those previously published, mostly because we did not distinguish between genes with ≥99% sequence homology. However, this does not affect the overall consensus between our and previous studies; namely, that all exocyst subunits in plants are expanded to some extent.Phylogeny indicates a complex evolutionary pathway for Exo84, and while our reconstruction suggests that most paralogs arose
via lineage-specific expansions, the absence of good statistical support makes this conclusion equivocal (Figure S6,
Extended data). By contrast, it is clear that Sec15 and Exo70 expansions began at the root of the Streptophyta and, in the case of Exo70, this has continued in a lineage-specific manner to create a family of paralogs of considerable diversity (
Figure 2). Land plant Exo70 paralogs can be grouped into three clades (
Cvrčková
;
Synek
), which indicates an early establishment of these subfamilies. Live imaging in mammalian cells suggests that Exo70 is the first subunit to contact the plasma membrane (
Ahmed
), and hence the presence of so many Exo70 variants is likely a result of tissue-specific and/or plasma membrane-domain targeting specificity. In plant cells the presence of multiple cortical subdomains and differential interactions with Exo70 paralogs has been described, as well as the presence of multiple Exo70 paralogs within a single cell (
Synek
;
Žárský
;
Žárský
). Thus multiple factors likely underpin the Exco70 expansions, including differential targeting, noncanomical functions as well as defence against pathogens (
Žárský
;
Zhao
). The evolution of multiple trafficking pathways based around expansion of canonical systems also extends to other MTC-mediated pathways, specifically HOPS and CORVET (
Takemoto
).All sampled Aplicomplexa, including
Plasmodium falciparum and
Toxoplasma gondii,
lack the entire exocyst complex, while other alveolates within the Ciliata lineage have retained a subset of subunits. Since the retained subunits vary between the two ciliates analysed, including Exo70 and Exo84, this suggests individual losses rather than a stepwise loss of exocyst function during the evolution of the alveolates and raises the question for the existence of an exocyst-independent exocytic pathway in these organisms. It is, however, clear that loss from the Apicomplexa is an ancestral event. Apicomplexa are known for a patchy distributions of other tethering complexes like COG, GARP, Dsl1 and TRAPPII (
Koumandou
), and which may reflect simplifications of trafficking systems in these obligate intracellular parasites. Many Apicomplexa possess unique secretory organelles, including micronemes and rhoptries, that are essential for host cell invasion, but these organelles appear not to require the canonical MTC systems (
Tomavo, 2014).Amongst Stramenopiles, there is also a complex pattern of retention and loss. There is near full retention amongst the Oomycota, which contrasts with the many losses in the sister taxon Ochrophyta. It may be significant that in these organisms, Exo84 and, in a more limited manner, Exo70 are most commonly absent, similar to the ciliates. A limited number of expansions are also detected, principally in Sec3, Sec5 and Sec6, which may suggest more diversity within vesicular cargo transport than at the plasma membrane, as all three of these subunits are components of a single subcomplex that likely interfaces with the incoming vesicle (
Ahmed
).Only single orthologues of all exocyst subunits were found in the kinetoplastids, with possible duplications in
Bodo saltans and
Trypanoplasma borelli. The few apparent losses, for example in
T. cruzi and
Phytomonas HART are most likely the result of incomplete sequence data/assembly, with the suggestion that, for these taxa, the composition of the canonical octameric exocyst component is essentially invariant.
Exo99, a taxon-restricted subunit with distinct structure
Exocyst subunits were originally identified
via yeast secretory mutant screens, which uncovered six subunits (Sec3, 5, 6, 8, 10 and 15), and interaction between Sec15 and Sec4, a small GTPase at the plasma membrane required for secretion and orthologous to Rab11 (
Novick
) and with continually expanding roles in endosomal dynamics (
Zulkefli
). Two additional subunits, Exo70 and Exo84, were subsequently described, and the entire system demonstrated by biochemical and multiple interactome analyses to be a stable 19.5S complex,
albeit with evidence for the presence of additional forms (
Bowser
;
Guo
;
Morgera
). A ninth subunit, Exo99, was identified by affinity isolation in African trypanosomes. Exo99 phenocopies Sec15 under knockdown, indicating that it is a
bona fide member of the complex (
Boehm
). Exo99 is present in all kinetoplastids and related bodonids (
Bodo saltans), suggesting a unique aspect in export pathways in these organisms.The structure of Exo99 is highly distinct from the canonical exocyst subunits and possesses a confidently predicted seven blade β-propeller at the N-terminus, together with an α-helical C-terminus (
Figure 1B). The topology of several short stretches of the C-terminal region is predicted as disordered, preventing assessment of the overall architecture of the α-helical region. Hence, it is unclear if this region adapts a fold similar to the CATCHR family and hence other exocysts subunits or is distinct. Very weak homologs were also found in
Naegleria gruberi, Trichomonas vaginalis, Giardia lamblia and social amoeba. Structure prediction suggests that the
N. gruberi sequence may well share architecture with the kinetoplastida (Data archive 2, Extended data), but that the other possible orthologs do not, indicating likely restriction to Euglenozoa, as well as possibly the heterolobosids.It is tempting to speculate that Exo99 is a divergent member of the protocoatomer family, which populate the endomembrane system. These proteins are associated with vesicular transport and related functions and bear the β-propeller N-terminus as well as an α-helical C-terminal domain, perhaps best recognized in the heavy chain of the endocytosis coat protein clathrin (
Rout & Field, 2017). However, in that instance the helices form a coiled-coil solenoid, a specific type of higher order architecture, whilst for Exo99 it is unclear if this is the case. Clearly, more precise structural data are required to evaluate this possibility, as well as the location of Exo99 within the trypanosome exocyst. It is also unclear if additional coat-like components are associated with the trypanosomatid exocyst, but not captured in the affinity isolation. Most significant is that the presence of this divergent subunit, which evaded detection by
in silico methods due to its novelty, opens the possibility of additional lineage-specific exocyst components in other species.
Discussion
Cellular complexity was revolutionised by the development of an endomembrane system during eukaryogenesis. Subsequently losses and gains have moulded the ancestral transport system into the huge variety observed across the range of eukaryotes. Secondary loss of components is common, while expansion of individual paralogs is also a high frequency event. It is less common to uncover the birth of completely new components or complexes. Overall, this may well reflect the range of cellular processes with which the exocyst has been implicated, and while most of these are membrane transport or likely to be connected intimately, exocyst function may extend beyond to include, for example, translation (
Lipschutz
). The interaction of the exocyst with members of multiple subfamilies of the Ras GTPases, including Rabs, Arfs, Rho and others, together with compositional changes reflects this promiscuity, and which is, to some extent, also mirrored by the present study. The exocyst is an open, monomeric rod, with each component present as single copy, but several complexes appear to be required for vesicle fusion (
Ahmed
). All canonical subunits share the CorEx predominantly α-helical secondary structure suggesting a stepwise pathway for exocyst origins, and supported further by the share the CATCHR architecture of additional MTC subunits (
Sivaram
;
Vasan
).There is considerable diversity in the retention of genes encoding exocyst subunits, with many examples of complete loss or spectacular expansion (
Figure 4). Considering subunit losses there are examples in fungi, plants and multiple protozoan lineages. Some, such as
C. merolae, may reflect known unusual biology, and a complete absence of excess subunits in Apicomlexa is also consistent with the highly unusual and reduced secretory systems in a predominantly parasitic lineage. For example, there is repeated loss of adaptins and degeneration of the Golgi complex in Apicomplexa (
Nevin & Dacks, 2009), which correlates well with the absence of much of the COG, Dsl and TRAPPII complexes (
Klinger
), while loss of the exocyst dates back to the origins of the Chromista (
Woo
).
Figure 4.
Schematic eukaryotic evolutionary tree illustrating the major shifts in exocyst complexity.
Examples of individual lineages are shown at the top, and taxon names are colour-coded as before. Red dots indicate substantial losses, blue dots the origins of subunits and teal dots indicate major expansions. Many other smaller scale events, for example single subunit loss or duplication in a single species, are not illustrated for greater clarity.
Schematic eukaryotic evolutionary tree illustrating the major shifts in exocyst complexity.
Examples of individual lineages are shown at the top, and taxon names are colour-coded as before. Red dots indicate substantial losses, blue dots the origins of subunits and teal dots indicate major expansions. Many other smaller scale events, for example single subunit loss or duplication in a single species, are not illustrated for greater clarity.Expansions generally involve a subset of subunits, with Sec6, 10 and 15 duplicated across more than one supergroup, a pattern indicating independent events. Larger scale expansions are Exo70 and 84 in the higher plants. Significantly, Exo70 in
Arabidopsis thaliana carries multiple motifs for interactions with Atg8 (
Cvrčková & Zárský, 2013) and Exo70 has been subjected to extreme paralogous expansion in Streptophyte plants and is likely the product of selective pressures (
Zárský
,
Zárský
). Significantly, plants also possess expanded Rab11 (RabD) paralogs (
Rutherford & Moore, 2002), but a specific relationship between Rab11 and Exo70 paralogs has not been demonstrated. Retention of the exocyst is, in the main, all-or-nothing, indicating that the complex functions essentially as a single unit, and loss of even one subunit compromises function, consistent with the structure of the complex (
Ahmed
;
Heider
;
Mei
). Exo99 is currently the sole example of a lineage-specific exocyst subunit, and may be part of a larger coat complex, based on similarity to protocoatomer. The presence of Exo99 further highlights evolutionary modifications to membrane trafficking pathways and underscores the flexibility of these pathways across evolution, as well as suggesting that there may be additional exocyst components in other lineages (
Manna
;
Rout & Field, 2017). Overall, despite considerable conservation, there is remarkable sculpting of exocyst complex composition, which suggests that, despite the already significant catalog of activities, a range of functional roles may remain to be uncovered. This significant evolutionary flexibility offers an interesting paradigm for unravelling the evolutionary pressures that have moulded eukaryote evolution.
Methods
Comparative genomics of exocyst components
Candidate exocyst components were identified by scanning a panel of eukaryotic predicted proteomes (Table S1 Extended data) with known exocyst component sequence queries using BLAST (
Altschul
). For each subunit, one query sequence was selected from each of the following predicted proteomes:
Homo sapiens, Saccharomyces cerevisiae, Trypanosoma brucei, Dictyostelium discoideum, a chromalveolate (
Phytophthora capsici/Albugo laibachii/Phytophthora sojae/Phytophthora ramorum), an Archaeplastida
(Arabidopsis thaliana/Chlamydomonas reinhardtii/Selaginella moellendorffii) (accession numbers for initial queries provided in Table S1). For each subunit, the top BLAST hits from each of these scans were pooled in a neighbour-joining tree after alignment with ClustalW with default parameters to remove erroneous sequences (
Thompson
). The gene IDs on the tree were then annotated with predicted protein length (based on) alignments with known exocyst components, pfam domain predictions (pfam server default parameters at
https://pfam.xfam.org) and notes of which (if any) of the six initial query sequences detected the ID as a reciprocal best BLAST hit. This annotation allowed the identification of a cluster of robust candidates by neighbour joining (NJ). Off-target matches were identified by manual inspection of both the annotated NJ tree and the underlying alignment and these were excluded. Furthermore, the overall length of the predicted protein and the region of homology were considered, to exclude proteins that were likely only related through possession of a common domain.In cases where a candidate was not found, additional datasets were queried by web-based BLAST searches at
TriTrypDB,
JGI and
NCBI as appropriate. Alignments were created using MUSCLE (
Edgar, 2004). Only unambiguous homologous regions were retained for phylogenetic analysis, performed by two separate methods. To obtain the Bayesian tree topology and posterior probability values, the program
MrBayes version 3.2.2 was used (
Ronquist & Huelsenbeck, 2003) running 8,000,000 generations. Maximum-likelihood (ML) analysis was performed using
PhyML v3.0 (
Guindon & Gascuel, 2003) with 100 bootstrap replicates. Nodes with better than 0.95 posterior probability and 80% bootstrap support were considered robust, and nodes with better than 0.80 posterior probability and 50% bootstrap support are shown.
Structure prediction
The structures of both
T. brucei and N. gruberi Exo99 proteins were predicted using the
Phyre2 server running under intensive mode (
Kelley
, full output available in Extended data). The data for this, as well as for the exocyst octameric complex of
S. cerevisiae, were visualized using
UCSF Chimera (
Pettersen
). Data for experimentally determined structures were retrieved from PDB (
https://www.rcsb.org).
Graphics production
The Coulson plot in
Figure 3 was produced using Coulson Plot Generator (
Field
) and binaries available at
http://sourceforge.net/projects/coulson. All images were prepared for final production in Adobe Illustrator 23.0.3.
Data availability
Underlying data
All data underlying the results are available as part of the article and no additional source data are required.
Extended data, including additional trees, accessions for all hits and structure prediction outputs from Phyre2 are available at
https://figshare.com/articles/Extended_data/8167724I have no further concerns.I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.The manuscript, “
Evolution of late steps in exocytosis: conservation, specialization”, addresses an interesting question, namely the evolution of the exocyst complex. I think the authors do a good job of describing the comparative genomics, phylogenetics and structure prediction of the exocyst.Where the authors could do better is describing the many cellular functions of the exocyst and offering a possible mechanism for this. For example, in the abstract the authors write, “The exocyst complex is involved in late exocytosis, and possibly additional pathways, and is a member…”. In the introduction, the authors describe some of the other additional pathways, “The exocyst has clear roles in secretion but is also implicated in disease susceptibility, host cell invasion by intracellular bacteria and development (Arasaki et al., 2018
[1]; Bonnemaijer et al., 2018
[2]; Lira et al., 2019
[3]) with evidence for additional roles in endocytosis/recycling also published (Boehm et al., 2017
[4]; Jose et al., 2015
[5])”. This is not a complete list as the exocyst has also been shown to be centrally involved in basolateral protein transport (Grindstaff et al, Cell, 1998
[6]), ciliogenesis (Zuo et al, Mol Biol Cell, 2009
[7]), and protein translocation in the ER (Toikkanen et al, J Biol Chem, 2003
[8]; Lipschutz et al, J Biol Chem, 2003
[9]). How does the exocyst perform these myriad functions? The authors allude to a possible mechanism when they write, “Based on prolific interactions with multiple Ras-superfamily GTPases, the exocyst has been proposed to act as a tether to bring secretory vesicles to the plasma membrane, and recent in vivo imaging supports this model (Ahmed et al., 2018
[10]; Picco et al., 2017
[11])”. The exocyst has actually been shown to be regulated by multiple GTPases of the Arf (e.g. Seixas et al, Mol Biol Cell, 2016
[12]; ), Ral (e.g. Moskalenko et al, Nat Cell Biol, 2002
[13]; Sugihara et al, Nat Cell Biol, 2002
[14]; Brymora et al, J Biol Chem 2001
[15]), Rab (Knodler et al, PNAS, 2010
[16]), and Rho (Phiel et al, J Biol Chem, 2001
[17]; Choi et al, J Am Soc Nephrol, 2013
[18]) families. A possible model is that the exocyst is like a Swiss army knife and has the ability to accomplish multiple tasks depending on which small GTPase is regulating it. I think this kind of discussion would strengthen the manuscript.I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.Mark Field and colleagues opened the field of phylogenetic analyses of tethering complexes evolution more than ten years ago and this report, using five fold increase of well selected genomes, enhances the resolution of the exocyst status in extant eukaryotes.p.4. - In Novick et al. (1980)
[1] bunch of first sec mutants were described (including sec subunits of the exocyst), but exocyst was stepwise discovered later (culminating in TerBush et al. 1996
[2] EMBO J). Similarly on p.7 – interaction of Sec4p with Sec15p containing particle is described in Bowser et al. (1992, JCB)[ref3] not in Novick et al. (1980)
[1].In the description of the complex in the Intro it would be important to mention, that it is effector of both RAB and RHO GTPases, and that also EXO70 contributes a crucial direct interaction with the membrane lipids.The extent of the expansion of EXO70 family in land plants is not comparable to any other subunit in any other organism (e.g. 23 in Arabidopsis and 47 in rice) and possibly would be correct to highlight it also in the Abstract (nicely demonstrated by the Fig. 2C).Evolution of plant EXO70s was first addressed by Marek Eliáš in Synek et al. (2006)
[4] clearly indicating deep early land plant EXO70 divergence into three subfamilies. Existence of these three subfamilies was fully corroborated by Cvrčková et al. (2012)
[5] – liverwort Marchantia has three EXO70 paralogs, each in one subfamily (Cvrčková in Rawat et al. 2017)
[6]. The existence of different plant cell cortical secretory domains and exocyst complexes based on differential EXO70 participation is described in Žárský et al. (2009
[7] and 2013
[8]). Importantly data in plant cells specific transcriptomes and proteomes as well as experimental data clearly indicated that in single plant cell several EXO70 paralogs (and therefore exocyst complexes) might operate – tissue specific expression does not explain full multiplicity of EXO70s in land plants. The major driving force of evolution esp. in EXO70.2 land plants subfamily seems to be competition with parasites.Two exocyst subcomplexes (4+4) mostly work together, however work on exocyst in autophagy (Bodemann et al 2011
[9]; Kulich et al. 2013
[10]) indicates possibility of functional exocyst subcomplexes specialization.“Octamer” instead of octomer – as in Fig. 4 – see root of the tree.Typo p. 9 not EXO74 and 80, but EXO70 and 84.This bioinformatics analysis highlight post-LECA evolutionary destiny of exocyst vesicle tethering complex marked by both local expansions as well as gene/whole complex losses well correlating with lineage specific biological contexts.I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.This study investigates the evolution of the exocyst using comparative genomics, phylogenetics and structure prediction. The focus is the evolutionary flexiblity of the exocyst complex, which was prompted by the initial difficulty in identifying all of the eight canonical exocyst subunits in Trypanosomatids along with the recent discovery of an additional 9
th component, Exo99, in these organisms. The challenge is to identify those orthologues that have diverged significantly and as well as the implications of this additional 9
th subunit in African trypanosomes.The results, nicely presented and summarised in Figs. 3 & 4, reveal the degree of flexibility in exocyst evolution across eukaryotic lineages. There are examples of expansion, notably Exo70 but also Sec 10, absence/loss of individual components, Sec 8 in Entamoaeba, as well as complete loss in Apicomplexa. The other notable finding is that Exo99 appears to be a taxon-restricted subunit most likely with a distinct structure.The message of the study is that the conventional view of the exocyst, as a conserved octameric protein complex, needs to be more nuanced. The authors speculate that subtleties in exocyst complex composition (loss as well as presence of new components) suggest a diversity in exocytosis mechanisms. This seems a reasonable proposition, as pointed out here and elsewhere Apicomplexa possess unique secretory organelles that do not appear to require the canonical tether machinery for biosynthesis, while the presence of Exo99 all kinetoplastids (and also the related bodonids), suggests possibly unique elements in export pathways in these organisms.I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
Authors: Margaret R Heider; Mingyu Gu; Caroline M Duffy; Anne M Mirza; Laura L Marcotte; Alexandra C Walls; Nicholas Farrall; Zhanna Hakhverdyan; Mark C Field; Michael P Rout; Adam Frost; Mary Munson Journal: Nat Struct Mol Biol Date: 2015-12-14 Impact factor: 15.369
Authors: K K Grindstaff; C Yeaman; N Anandasabapathy; S C Hsu; E Rodriguez-Boulan; R H Scheller; W J Nelson Journal: Cell Date: 1998-05-29 Impact factor: 41.582
Authors: Alexander Schlacht; Emily K Herman; Mary J Klute; Mark C Field; Joel B Dacks Journal: Cold Spring Harb Perspect Biol Date: 2014-10-01 Impact factor: 10.005