Literature DB >> 31562761

Hexa-Longin domain scaffolds for inter-Rab signalling.

Luis Sanchez-Pulido1, Chris P Ponting1.   

Abstract

SUMMARY: CPLANE is a protein complex required for assembly and maintenance of primary cilia. It contains several proteins, such as INTU, FUZ, WDPCP, JBTS17 and RSG1 (REM2- and RAB-like small GTPase 1), whose genes are mutated in ciliopathies. Using two contrasting evolutionary analyses, coevolution-based contact prediction and sequence conservation, we first identified the INTU/FUZ heterodimer as a novel member of homologous HerMon (Hermansky-Pudlak syndrome and MON1-CCZ1) complexes. Subsequently, we identified homologous Longin domains that are triplicated in each of these six proteins (MON1A, CCZ1, HPS1, HPS4, INTU and FUZ). HerMon complexes are known to be Rab effectors and Rab GEFs (Guanine nucleotide Exchange Factors) that regulate vesicular trafficking. Consequently, INTU/FUZ, their homologous complex, is likely to act as a GEF during activation of Rab GTPases involved in ciliogenesis. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 31562761      PMCID: PMC7703760          DOI: 10.1093/bioinformatics/btz739

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Many diverse cell processes are regulated by small GTPases, switching between active (GTP-bound) and inactive (GDP-bound) states. Small GTPases are switched on by guanine-nucleotide exchange factors (GEFs) that promote the exchange of bound GDP by GTP (Bourne ). Mutations in small GTPases and GEFs are frequent in Mendelian diseases and cancer (Blacque ; Bos ). Multiple small GTPases of the Rab family and their GEFs, for example, are critical for the assembly of cilia (ciliogenesis) and can be mutated in ciliopathies (Blacque ). Mouse mutants for genes encoding the Rab-like small GTPase RSG1 or ciliopathy-associated proteins Fuzzy (FUZ) and Inturned (INTU) show developmental abnormalities characteristic of decreased cilia-dependent Hedgehog signalling (Agbu ; Gray ; Zeng ). These three proteins interact as members of the ciliogenesis and planar polarity effector (CPLANE) complex that controls recruitment of intraflagellar transport machinery to the basal body (Toriyama ). The precise molecular and cellular roles in ciliogenesis of these proteins remain unknown. This is in large part, it is proposed, because they lack discernible domain homologues (Adler and Wallingford, 2017). INTU protein is a scaffolding subunit of the CPLANE complex and, with the sole exception of a PDZ (PSD-95/discs large/ZO-1) domain, no other functional domain has been identified within its 942 residues length (Adler and Wallingford, 2017; Chang ; Toriyama ; Wang ; Yang ). To investigate the evolutionary provenance of the INTU protein family, we embarked on a deep sequence analysis taking advantage of both protein sequence conservation and coevolution-based contact prediction approaches.

2 Results and discussion

2.1 A new Longin domain in INTU

We initiated our analyses with a JackHMMER iterative search (Finn ) of the UniRef50 database (The UniProt Consortium, 2019) using the human INTU protein sequence as query. This identified full-length INTU homologues across the animal kingdom. A full-length multiple sequence alignment, generated with T-Coffee (Notredame ), revealed an evolutionarily conserved region (INTU_HUMAN amino acids 305–439) just after its PDZ domain (Fig. 1).
Fig. 1.

(A) HerMon family domain architecture. Domains coloured in gold are the Longin domains identified first by Kinch and Grishin in MON1A, CCZ1, HPS1 and HPS4 (ovals labelled M1, C1, H11 and H41) (Kinch and Grishin, 2006). The similarity between the N-terminal regions of Ccz1 and Hps4 was originally found by Hoffman-Sommer et al. (Hoffman-Sommer ) and termed the CHiPS domain, corresponding to Longin domains labelled C1 and H41. The first and third Longin FUZ domains (F1 and F3; coloured in blue) were previously proposed, without statistical evidence from sequence similarities, using the GenTHREADER method of structure prediction (Gray ; Lobley ; Toriyama ). In the second Longin domains of HPS1 and HPS4 there are long insertions showing poor evolutionary conservation (H12 and H42; broken ovals). The PDZ domain of INTU annotated in the SMART domain database (hexagon coloured in green) (Letunic and Bork, 2018). Newly identified Longin domains are shown in red (F2, I1, I2, I3, M2, M3, C2, C3, H12, H13, H42 and H43). (B) MON1A contact maps. Cartoon of the Longin domain structure of C. thermophilum MON1 (PDB: 5LDD_A; amino acids 222-316) core structure (β-strands are labelled 1 to 5 and coloured in purple, cyan, green, yellow and red, respectively) generated using PyMOL (https://pymol.org/). Anti-parallel β-strand pairs are clearly observable in the contact map calculated from the first Longin domain (M1) of C.thermophilum MON1 (PDB: 5LDD_A) (see β-strand pairs 1/2, 1/5, 3/4 and 4/5), whose structure is known (Kiontke ), generated using the Cocomaps server (input: 5LDD_A versus 5LDD_A, cut-off distance value = 7 Ångstroms) (Vangone ). Two similar contact patterns, predicted with RaptorX (Wang ), are observed in two conserved regions in human MON1A protein (M2 and M3, amino acids 316-415 and 444-544, respectively) (Supplementary Figs S3, S10 and S12). (C) HHpred comparison E-values among pairs of HerMon Longin domains. Numbers overlaid onto green arrows correspond to HHpred profile-versus-profile comparison Erpt-values (Söding ). Erpt is the estimated number of alignments with a particular score, or higher, in a reduced search space of 18 Longin domain profiles (those shown in panel A) (Söding ) and indicates the significance of profile-profile alignment scores conditional to these proteins harbouring at least one Longin domain. Numbers overlaid on red arrows correspond to HHpred profile-versus-profile comparison Erpt -values of 6 profiles that each represents each of the three Longin domains in FUZ, MON1A and HPS1, or in INTU, CCZ1 and HPS4 (indicated within circles with dotted lines). Multiple sequence alignments, on which these profiles are based, are provided in Supplementary Figures S1, S2 and S9–S12. Arrows indicate the profile search direction. Only E-values < 0.005 are shown

(A) HerMon family domain architecture. Domains coloured in gold are the Longin domains identified first by Kinch and Grishin in MON1A, CCZ1, HPS1 and HPS4 (ovals labelled M1, C1, H11 and H41) (Kinch and Grishin, 2006). The similarity between the N-terminal regions of Ccz1 and Hps4 was originally found by Hoffman-Sommer et al. (Hoffman-Sommer ) and termed the CHiPS domain, corresponding to Longin domains labelled C1 and H41. The first and third Longin FUZ domains (F1 and F3; coloured in blue) were previously proposed, without statistical evidence from sequence similarities, using the GenTHREADER method of structure prediction (Gray ; Lobley ; Toriyama ). In the second Longin domains of HPS1 and HPS4 there are long insertions showing poor evolutionary conservation (H12 and H42; broken ovals). The PDZ domain of INTU annotated in the SMART domain database (hexagon coloured in green) (Letunic and Bork, 2018). Newly identified Longin domains are shown in red (F2, I1, I2, I3, M2, M3, C2, C3, H12, H13, H42 and H43). (B) MON1A contact maps. Cartoon of the Longin domain structure of C. thermophilum MON1 (PDB: 5LDD_A; amino acids 222-316) core structure (β-strands are labelled 1 to 5 and coloured in purple, cyan, green, yellow and red, respectively) generated using PyMOL (https://pymol.org/). Anti-parallel β-strand pairs are clearly observable in the contact map calculated from the first Longin domain (M1) of C.thermophilum MON1 (PDB: 5LDD_A) (see β-strand pairs 1/2, 1/5, 3/4 and 4/5), whose structure is known (Kiontke ), generated using the Cocomaps server (input: 5LDD_A versus 5LDD_A, cut-off distance value = 7 Ångstroms) (Vangone ). Two similar contact patterns, predicted with RaptorX (Wang ), are observed in two conserved regions in human MON1A protein (M2 and M3, amino acids 316-415 and 444-544, respectively) (Supplementary Figs S3, S10 and S12). (C) HHpred comparison E-values among pairs of HerMon Longin domains. Numbers overlaid onto green arrows correspond to HHpred profile-versus-profile comparison Erpt-values (Söding ). Erpt is the estimated number of alignments with a particular score, or higher, in a reduced search space of 18 Longin domain profiles (those shown in panel A) (Söding ) and indicates the significance of profile-profile alignment scores conditional to these proteins harbouring at least one Longin domain. Numbers overlaid on red arrows correspond to HHpred profile-versus-profile comparison Erpt -values of 6 profiles that each represents each of the three Longin domains in FUZ, MON1A and HPS1, or in INTU, CCZ1 and HPS4 (indicated within circles with dotted lines). Multiple sequence alignments, on which these profiles are based, are provided in Supplementary Figures S1, S2 and S9–S12. Arrows indicate the profile search direction. Only E-values < 0.005 are shown HHpred searches against the PDB70 profile database (Söding ) using this conserved region as input detected significant sequence similarity with the Longin domain from Chaetomium thermophilum CCZ1 (Kiontke ) with an E-value of 0.01 (Probability: 95.4). Moreover, in support of this top match, the next most statistically significant similarities were with additional members of the Longin superfamily. Furthermore, the predicted secondary structure (Jones, 1999) of this INTU conserved region was consistent with known Longin domain structures (Supplementary Fig. S1). Longin domains were described initially as evolutionarily conserved N-terminal regions of VAMP7 (Vesicle-associated membrane protein 7) and Ykt6 protein families (Filippini ). Structural analysis subsequently identified similarities between two AP2 (adaptor protein 2) complex subunits (AP2A2 (subunit alpha-2) and AP2M1 (subunit mu)) and Sec22b protein and Ykt6 Longin domain (Collins ). Longin domains have since been found widely across eukaryotes (Pfam family Longin, accession: PF13774; InterPro, accession: IPR011012) (Mitchell ; Punta ) and have often been implicated in aspects of membrane dynamics regulation (Daste ). In structural terms, the Longin core, and its related roadblock fold, are composed of an α/β fold containing two α-helices organized around a central β-sheet of five anti-parallel β-strands (Fig. 1) (Kinch and Grishin, 2006; Kiontke ; Levine ). These similarities in primary sequence and secondary structure correspondence indicate that INTU is a previously undescribed member of the Longin domain-containing protein family. We were struck by INTU's interacting partner FUZ also containing a proposed N-terminal Longin domain (Toriyama ) because Longin domains commonly heterodimerise with other Longin domains (Kinch and Grishin, 2006; Kiontke ; Levine ). Consequently, we decided to further analyze this putative N-terminal Longin domain in FUZ (amino acids 10-141) and found it to have statistically significant sequence similarity to the Longin domain of C. thermophilum MON1 (HHpred E-value = 1.7x10−8; Probability: 98.6) (Kiontke ) (Supplementary Fig. S2). This pair of N-terminal INTU/FUZ Longin domains were thus strikingly each found, by HHpred searches, to be homologues of the pair of Longin domains that heterodimerise in CCZ1 and MON1, respectively. Our analysis thus indicates that INTU/FUZ is a third and unanticipated, heterodimer of the HerMon family that was previously represented by only MC1 (MON1/CCZ1 heterodimer) and BLOC3 (Biogenesis of lysosome-related organelles complex 3) complexes (the latter composed of the HPS1/HPS4 heterodimer) (Supplementary Figs S1 and S2) (Barr, 2014; Carmona-Rivera ; Chiang ; Gerondopoulos ; Kiontke ; Kinch and Grishin, 2006; Nazarian ).

2.2 Tandemly repeated Longin domains in the HerMon family

To date, there is structural and/or statistically significant sequence similarity evidence for only a single N-terminal Longin domain within MON1, CCZ1, HPS1 and HPS4 proteins (M1, C1, H11 and H41; indicated in gold in Fig. 1) (Kiontke ; Kinch and Grishin, 2006). To identify putative domains in the unassigned C-terminal regions of these four proteins and the INTU/FUZ heterodimer, we took advantage of two distinct types of evolutionary information, namely protein sequence conservation and coevolution-based contact predictions. Residue pairs in close contact in protein 3D structures often show a correlated mutational signature. This is due to a missense mutation in one residue often being compensated by a missense mutation in its paired residue, so as to preserve protein stability, folding or function (Rollins ; Schmiedel and Lehner, 2019). Coevolution-based contact predictions methods are able to identify such mutationally coupled residues across deep multiple sequence alignments (Wang ). Coevolution-based contact predictions using RaptorX (Wang ) revealed a repeated contact pattern, observed three-times in each of MON1, CCZ1, INTU and FUZ HerMon family members (Fig. 1; Supplementary Figs S3–S6). This common pattern then allowed us to define the boundaries delimiting three repeated regions. In MON1 and CCZ1 the first of these regions correspond to their structurally determined Longin domains (Kiontke ). In particular, their longer β-strands 1 and 5, buried within the structural core of the Longin fold, contribute a strong feature of the triplicated contact pattern (Fig. 1B;Supplementary Figs S3 and S4). This repeated contact pattern was not evident for HPS1 and HPS4 (Supplementary Figs S7 and S8), likely owing to the limited phyletic range, and thus sequence divergence, within these families. Even the previously identified N-terminal Longin domains in HPS1 and HPS4 (Kinch and Grishin, 2006), and confirmed by us (Supplementary Figs S1 and S2), are not apparent from these contact prediction maps (Supplementary Figs S7 and S8). Similarities between HPS1 or HPS4, and MON1 or CCZ1, respectively, were observed from detailed sequence analysis (Fig. 1; Supplementary Figs S9–S12). These findings, based on sequence conservation and coevolution-based contact predictions, led us to a hypothesis that each of these triplicated regions contains a Longin domain, and motivated us to generate 18 multiple protein sequence alignments and profiles, three for each of the six HerMon family proteins: MON1, CCZ1, INTU, FUZ, HPS1 and HPS4 (Supplementary Figs S1, S2 and S9–S12). Subsequent pairwise comparison of sequence conservation among these profiles using HHpred (Söding ) yielded statistically significant sequence similarities among these repeated regions (E < 5.0x10−3; Fig. 1; Supplementary Figs S1, S2 and S9–S12) that are indicative of homology. Consistent with homology, 3 D models generated using RaptorX (Wang ) for the second and third MON1 repeats are each consistent with a Longin fold. The highest ranked RaptorX models for the second and third repeats were significantly similar to Longin structures (DALI scores of Z = 6.0 and 7.7, respectively, exceeding the Z-score = 2 threshold for statistical significance) (Holm and Laakso, 2016). The common triplicated Longin domain architecture of HerMon proteins (Fig. 1) indicates that these 6 proteins diverged from a common ancestral protein pair (MON1/CCZ1 heterodimer), whose evolutionary precursor was a single homodimer containing three consecutively repeated Longin domains.

2.3 Functional conservation in HerMon complexes

Homology among the three pairs of HerMon proteins is likely to reflect their similar functional mechanisms. MC1 and BLOC3 complexes are signal transducers in Rab cascades both as effectors of an active Rab (GTP-bound state) (Rab5 for MC1 and Rab9 for BLOC3) and as GEFs of an inactive Rab (GDP-bound state) (Rab7 for MC1 and Rab32/Rab37 for BLOC3) that, consequently, guide the directionality of vesicular traffic to lysosome and lysosome-related organelles (Gerondopoulos ; Hegedus ; Kiontke ; Kinchen and Ravichandran, 2010; Kloer ; Mahanty ; Nordmann ; Pfeffer, 2013, 2017). This suggests that the INTU/FUZ heterodimer also orchestrates a Rab signaling cascade in ciliogenesis, involving RAB8 and RSG1, two Rab proteins known to be components of the CPLANE complex (Agbu ; Toriyama ; Zilber ).

3 Conclusion

In summary, we have identified the INTU/FUZ heterodimer as the third pair of HerMon heterodimeric complexes and discovered that all six HerMon proteins harbor three Longin domains. Our identification of each HerMon complex as a hexa-Longin domain scaffold should aid in the design of further experiments that investigate their contributions to diverse transport-related processes and inter-Rab signaling pathways. During proof corrections of this work, Gerondopoulos provided experimental validation of INTU/FUZ as a Rab-GEF complex, and proposed a similar three Longin domain structures without presenting alignments or statistical results.

Funding

This work was supported by the Medical Research Council UK. Conflict of Interest: none declared. Click here for additional data file.
  45 in total

1.  Protein secondary structure prediction based on position-specific scoring matrices.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-09-17       Impact factor: 5.469

Review 2.  The GTPase superfamily: a conserved switch for diverse cell functions.

Authors:  H R Bourne; D A Sanders; F McCormick
Journal:  Nature       Date:  1990-11-08       Impact factor: 49.962

3.  Identification of two evolutionarily conserved genes regulating processing of engulfed apoptotic cells.

Authors:  Jason M Kinchen; Kodi S Ravichandran
Journal:  Nature       Date:  2010-03-21       Impact factor: 49.962

4.  PCP effector gene Inturned is an important regulator of cilia formation and embryonic development in mammals.

Authors:  Huiqing Zeng; Amber N Hoover; Aimin Liu
Journal:  Dev Biol       Date:  2010-01-11       Impact factor: 3.582

5.  The ciliopathy-associated CPLANE proteins direct basal body recruitment of intraflagellar transport machinery.

Authors:  Michinori Toriyama; Chanjae Lee; S Paige Taylor; Ivan Duran; Daniel H Cohn; Ange-Line Bruel; Jacqueline M Tabler; Kevin Drew; Marcus R Kelly; Sukyoung Kim; Tae Joo Park; Daniela A Braun; Ghislaine Pierquin; Armand Biver; Kerstin Wagner; Anne Malfroot; Inusha Panigrahi; Brunella Franco; Hadeel Adel Al-Lami; Yvonne Yeung; Yeon Ja Choi; Yannis Duffourd; Laurence Faivre; Jean-Baptiste Rivière; Jiang Chen; Karen J Liu; Edward M Marcotte; Friedhelm Hildebrandt; Christel Thauvin-Robinet; Deborah Krakow; Peter K Jackson; John B Wallingford
Journal:  Nat Genet       Date:  2016-05-09       Impact factor: 38.330

6.  Dali server update.

Authors:  Liisa Holm; Laura M Laakso
Journal:  Nucleic Acids Res       Date:  2016-04-29       Impact factor: 16.971

7.  20 years of the SMART protein domain annotation resource.

Authors:  Ivica Letunic; Peer Bork
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

8.  The PCP effector Fuzzy controls cilial assembly and signaling by recruiting Rab8 and Dishevelled to the primary cilium.

Authors:  Yulia Zilber; Sima Babayeva; Jung Hwa Seo; Jia Jia Liu; Steven Mootin; Elena Torban
Journal:  Mol Biol Cell       Date:  2013-01-09       Impact factor: 4.138

9.  Rab9A is required for delivery of cargo from recycling endosomes to melanosomes.

Authors:  Sarmistha Mahanty; Keerthana Ravichandran; Praneeth Chitirala; Jyothi Prabha; Riddhi Atul Jani; Subba Rao Gangi Setty
Journal:  Pigment Cell Melanoma Res       Date:  2016-01       Impact factor: 4.693

10.  The CPLANE protein Intu protects kidneys from ischemia-reperfusion injury by targeting STAT1 for degradation.

Authors:  Shixuan Wang; Aimin Liu; Guangyu Wu; Han-Fei Ding; Shuang Huang; Stanley Nahman; Zheng Dong
Journal:  Nat Commun       Date:  2018-03-26       Impact factor: 14.919

View more
  2 in total

1.  Identification of a novel variant of the ciliopathic gene FUZZY associated with craniosynostosis.

Authors:  William B Barrell; Hadeel Adel Al-Lami; Jacqueline A C Goos; Sigrid M A Swagemakers; Marieke van Dooren; Elena Torban; Peter J van der Spek; Irene M J Mathijssen; Karen J Liu
Journal:  Eur J Hum Genet       Date:  2021-11-01       Impact factor: 4.246

2.  Structure of the Mon1-Ccz1 complex reveals molecular basis of membrane binding for Rab7 activation.

Authors:  Björn U Klink; Eric Herrmann; Claudia Antoni; Lars Langemeyer; Stephan Kiontke; Christos Gatsogiannis; Christian Ungermann; Stefan Raunser; Daniel Kümmel
Journal:  Proc Natl Acad Sci U S A       Date:  2022-02-08       Impact factor: 11.205

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.