
Phylogenetic analysis of RhoGAP domain-containing proteins.

Marcelo M Brandão, Karina L Silva-Brandão, Fernando F Costa, Sara T O Saad.   

Abstract

Proteins containing an Rho GTPase-activating protein (RhoGAP) domain work as molecular switches involved in the regulation of diverse cellular functions. The ability of Rho GTPases to regulate a wide range of cellular processes comes from their interactions with multiple effectors and inhibitors, including the RhoGAP family, which stimulates their intrinsic GTPase activity. Here, a phylogenetic approach was applied to study the evolutionary relationship among 59 RhoGAP domain-containing proteins. The sequences were aligned by their RhoGAP domains and the phylogenetic hypotheses were generated using Maximum Parsimony and Bayesian analyses. The character tracing of two traits, GTPase activity and presence of other domains, indicated a significant phylogenetic signal for both of them.

Year:  2006        PMID: 17127216      PMCID: PMC5054073          DOI: 10.1016/S1672-0229(06)60031-4

Source DB:  PubMed          Journal:  Genomics Proteomics Bioinformatics        ISSN: 1672-0229            Impact factor:   7.691


Introduction

The Rho GTPase-activating proteins (RhoGAPs) are defined by the presence of a 150-amino-acid homologous region designated as the RhoGAP domain. This domain is necessary and sufficient for GAP activity and shares at least 20% sequence identity among family members (1, 2). Proteins containing an RhoGAP domain act as molecular switches involved in the regulation of diverse cellular functions, including actin cytoskeleton rearrangements, regulation of gene transcription, cell cycle regulation, control of apoptosis, and membrane trafficking (2, 3, 4, 5). Rho GTPases cycle between an active GTP-bound state and an inactive GDP-bound state. These states are controlled by three main classes of proteins: guanine nucleotide exchange factors, guanine nucleotide dissociation inhibitors, and GAPs. To date, at least 21 Rho GTPases have been defined, among which only three (RhoA, Cdc42, and Rac1) are well characterized; most studies have therefore focused on these three proteins. The ability of Rho GTPases to regulate a wide range of cellular processes comes from their interactions with multiple effectors or inhibitors. One class of these inhibitors is the RhoGAP family, which stimulates GTPase activity by enhancing the intrinsic rate of GTP hydrolysis. In the early analyses of the human genome sequence, 77 different genes containing the RhoGAP domain were found. Further studies have demonstrated that many of these genes are simple gene sequence variations or single nucleotide polymorphisms (6, 7). The structural data available for RhoGAP domain-containing proteins in complex with Rho GTPases (Cdc42 and RhoA) demonstrated the 3D mechanism of RhoGAP-mediated GTP hydrolysis, and highlighted the importance of a well-conserved arginine residue in the active site that acts as a conformation stabilizer needed for hydrolysis (8, 9, 10).
Recently, a novel member of the RhoGAP family, ARHGAP21 (Rho GTPase-activating protein 21, alias ARHGAP10), was cloned and characterized in our laboratory (. In addition to the RhoGAP domain, ARHGAP21 presents a PH domain and a P-loop-containing PDZ domain. This gene is widely expressed at high levels in muscle and brain, and is up-regulated during myeloid and erythroid maturation, suggesting a potential role for this RhoGAP in regulating cell differentiation (. The aim of this study is to infer the evolution of the RhoGAP superfamily using a phylogenetic approach, to determine the roles of other domains and their main GTPase activities in this evolutionary history, and to provide a tool that could render some insights regarding subfamily protein functions using RhoGAP-containing proteins as a model.

Results

The full dataset contains 267 amino acid positions, of which 161 are parsimony-informative. Parsimony searches of the equally weighted dataset resulted in 12 equally parsimonious trees with 3,213 steps (CI=0.449; RI=0.525). The strict consensus tree was largely unresolved at internal nodes (data not shown), but almost all terminal branches showed strong bootstrap support (Figure 1).
Fig. 1

The Bayesian tree based on amino acid sequences of the RhoGAP domain from RhoGAP domain-containing proteins. Values above the branches indicate the Posterior Probability (PP) and the bold numbers indicate the bootstrap values from 500 replications (where they exceed 50%).
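The consistency index (CI) and retention index (RI) reported for the parsimony search are not defined in the text. The standard definitions, which we assume are the ones reported by the parsimony software, are sketched below. The symbols M, S, and G are introduced here for illustration: M is the minimum conceivable number of steps for the dataset, S is the observed tree length, and G is the maximum conceivable number of steps.

```latex
\mathrm{CI} = \frac{M}{S}, \qquad \mathrm{RI} = \frac{G - S}{G - M}
```

Under these definitions, and assuming the indices were computed over all characters, S = 3,213 and CI = 0.449 would imply M of roughly 0.449 × 3,213 ≈ 1,443 steps. Both indices equal 1 when the data fit the tree without homoplasy.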

The Bayesian tree recovered mostly the same terminal relationships as the Maximum Parsimony (MP) analyses, with strong values of Posterior Probability (PP). However, internal branches have low to moderate PP values (Figure 1). The two characters investigated (domains and GTPase activity) showed a significant phylogenetic signal (P=0.003), which suggests that the distribution of these traits among the proteins can be explained by their phylogenetic relationships (Figure 2). The optimization of the other domains over our phylogenetic hypothesis suggests that the ancestral state of these proteins involves solely the presence of the RhoGAP domain and the activity toward Rac1.
Fig. 2

Domain presence (left side) and GTPase activity (right side) in the RhoGAP domain-containing proteins traced along the proposed phylogeny.

The character tracing of these traits suggests an overall pattern in which proteins sharing the same domains also share the same GTPase activity. The clade joining KIAA0672 + 3BP1 + RICH1 + Nadrin was recovered by both MP and Bayesian analyses with strong support. All proteins in this clade share the presence of the BAR (Bin-Amphiphysin-Rvs) domain and GTPase activity toward Rac1, solely or in addition to other GTPases. The same rule can be applied to the clade joining STARTdom + GT650 + DLC1 + ARHGAP7. All of them share the presence of the START (steroidogenic acute regulatory protein-related lipid transfer) domain and all but STARTdom have GTPase activity toward RhoA. GTPase activity is unknown for STARTdom. The clade joining GMIP + HA1 + PARG is composed of proteins containing the C1 domain in addition to their GTPase activity toward RhoA. The members of the clade joining ARHGAP11A + ARHGAP20 + ARHGAP1 + ARHGAP8 have in common the absence of domains other than RhoGAP. The GTPase activity of this group is known only for ARHGAP1, which is active toward Cdc42. Considering other terminal clades, we can presume that the other ARHGAPs within this group may show the same GTPase activity toward Cdc42. On the other hand, the clade joining P115 + srGAP3 + srGAP1 + srGAP2 shares the presence of the FCH (FER/CIP4-homology) domain; however, P115 has GTPase activity toward Rac1, while the other three proteins have activity toward Cdc42.

Discussion

Phylogenetic reconstruction and bioinformatics analyses that integrate evolutionary considerations are becoming increasingly important tools for applied fields. Numerous gene sequences were generated in the genomics age with little or no accompanying experimental determination of functional information or evolutionary relationships. Previous works from Peck et al. ( and Moon and Zheng ( also present a phylogenetic approach to the RhoGAP family; however, the authors did not indicate the methodology applied, nor did they present any support analyses for their cladograms. In this work, bioinformatics and phylogenomics tools were used to present a phylogenetic relationship of 59 members of the RhoGAP superfamily. All amino acid alignments and subsequent phylogenetic tree constructions were based on the RhoGAP domain sequence. We demonstrated that these RhoGAP domain-containing proteins, with the conserved arginine residue, form a monophyletic group, that is, all of them share a common protein ancestor in their evolutionary history. The tracing of GAP activity toward the most studied Rho GTPases (RhoA, Rac1, and Cdc42) (Figure 2) indicates that this trait presents a strong phylogenetic signal (P=0.003), contrasting with previous findings of Peck et al. (. The analysis of the resulting phylogenetic tree has suggested that the ancestral state for GTPase activity is the affinity to Rac1. It is still difficult to determine the GAP activity by analyzing only the protein sequence; the GAP assay (13, 14) is the most reliable way to determine activity. The phylogenetic approach may give a clue, since it is capable of clustering together different proteins that share common substrates, as can be seen in the clades of KIAA0672 + 3BP1 + RICH1 + Nadrin and srGAP3 + srGAP1 + srGAP2.
Speculations regarding protein-specific functions (using only the GTPase activity character) should be avoided for now, because affinity for the same GTPase does not imply the same function, since each GTPase may present contrasting functions in different pathways (. Furthermore, structural and molecular biology studies are needed to elucidate the exact amino acid composition involved in determining specificity and how the differences in this composition can affect the 3D protein structure and its interaction with Rho GTPases. In addition to the RhoGAP domain, the members of this superfamily usually contain other functional motifs. Therefore, RhoGAPs might catalyze or participate in enzymatic reactions other than the enhancement of the intrinsic GTP hydrolysis of Rho GTPases, sometimes apparently aiding the Rho protein in signaling (. The presence of additional domains appears to be linked to the RhoGAP domain structure because, even with the alignment focused exclusively on the RhoGAP domain sequence, the phylogeny grouped into clades different proteins sharing the same additional domains, with strong bootstrap and PP support. For instance, the ARHGAPs were divided into two groups. One is composed of a clade including ARHGAPs 9, 12, 21, and 23, presenting an RhoGAP domain and a PH domain, accompanied or not by additional domains (Figure 2). The other is composed of the terminals ARHGAPs 1, 8, 11A, and 20, presenting only the RhoGAP domain, except for ARHGAP20, which presents an RA domain. Interactions among genomics, evolution, and bioinformatics go further than sequence alignment and relationship elucidation among species. Evolutionary analysis may help researchers design new strategies to understand protein or gene interactions and their functionalities and might provide insight for new experiments.
In conclusion, a phylogenetic study of the RhoGAP domain-containing proteins has demonstrated that there is a strong evolutionary relationship among the RhoGAP superfamily members, especially when they share common motifs or GAP activity.

Materials and Methods

Materials

All protein sequences used here were obtained from the GenBank database (http://ncbi.nlm.nih.gov/Genbank/) at the National Center for Biotechnology Information (NCBI), as well as from the Swiss-Prot/TrEMBL database (http://expasy.org/sprot/) at the Swiss Institute for Bioinformatics and at the European Bioinformatics Institute (Table 1).
Table 1

Selected Rho GTPase-Activating Proteins

No.  Protein                               GenBank                  SwissProt
1    3BP-1                                 -                        Q9Y3L3
2    ABR                                   NP_068781.2              -
3    α-Chimaerin                           CAA35769.1               -
4    ARAP1                                 NP_056057.1              -
5    ARAP2                                 BAA25506.1               -
6    ARAP3                                 CAC83946.1               -
7    ARHGAP1                               NP_004299                -
8    ARHGAP6                               NP_038286.1              -
9    ARHGAP7                               -                        Q63744
10   ARHGAP8                               CAB90248.1               -
11   ARHGAP9                               BAB56159.1               -
12   ARHGAP11A                             NP_055598.1              -
13   ARHGAP12                              NP_060757.4              -
14   ARHGAP18 MacGAP                       NP_277050                -
15   ARHGAP19                              NP_116289.4              -
16   ARHGAP20                              AAS45466.1               -
17   ARHGAP21                              AF480466.1               -
18   ARHGAP23                              BAA96025.1               -
19   ARHGAP24                              NP_112595.1              -
20   ARHGAP25b                             NP_055697.1              -
21   β-Chimaerin                           AAA16836.1               -
22   BCR                                   NP_004318.2              -
23   CAC17688.2                            CAC17688.2               -
24   CHR5ORF                               NP_057687.1              -
25   DLC-1                                 NP_006085.2              -
26   GAPDro                                AAF44627.1               -
27   GMIP                                  NP_057657.1              -
28   GRAF                                  NP_055886.1              -
29   GRAF-2                                BAB61771                 -
30   GT650 (DLC2)                          NP_443083.1              -
31   HA-1 (KIAA0233)                       BAA13212.1               -
32   H-Graf                                CAA71414.2               -
33   INPP5B                                AAA79207.1               -
34   KIAA0672                              BAA31647                 -
35   KIAA1204                              BAA86518.1               -
36   KIAA1314                              BAA92552.1               -
37   KIAA1688                              BAB21779.1               -
38   MgcRacGAP                             NP_037409.2              -
39   Myosin_IXA                            NP_008832.1              -
40   Myosin_IXB                            NP_004136.2              -
41   Nadrin                                NP_060524.4              -
42   N-Chimaerin homolog                   AAB81198.1               -
43   OCRL-1                                NP_001578.2              -
44   Oligophrenin-1                        NP_002538.1              -
45   p190-A                                AAF80386.1               -
46   p190-B                                NP_001164.2              -
47   P85-alpha                             -                        P27986
48   P85-beta                              NP_005018.1              -
49   PARG1                                 NP_004806.2              -
50   PSGAP                                 AAK18175.1               -
51   RALBP1                                NP_006779.1              -
52   RHG4 (p115)                           CAA55394.1               -
53   RICH-1                                CAC37948.1               -
54   RLIP76                                AAB00103.1               -
55   srGAP1                                BAA92542.1               -
56   srGAP2                                BAA32301.1               -
57   srGAP3                                CAC22407.1               -
58   START domain containing 8 (KIAA0189)  NP_055540.2, CAC22407.1  -
59   HA-1 (KIAA0233)                       BAA13212.1               -

The sequences were aligned by their RhoGAP domains together with an additional 100 N-terminal residues, primarily using ClustalW version 1.83 ( under default settings, followed by manual adjustment in BioEdit version 6.0.7 (Ibis Therapeutics, Carlsbad, USA). All alignment files, the protein sequences in FASTA format, and other related colored materials are available for download at http://www.hemocentro.unicamp.br/submission/.
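The region selection described above (the RhoGAP domain plus 100 N-terminal residues) can be illustrated with a minimal Python sketch. The function name `domain_with_flank`, the 1-based inclusive coordinates, and the toy sequence are assumptions introduced for illustration; the text does not state how the regions were actually cut, nor what was done when fewer than 100 residues precede the domain (the sketch simply clamps at the start of the protein).

```python
def domain_with_flank(sequence: str, domain_start: int, domain_end: int,
                      flank: int = 100) -> str:
    """Return the domain plus up to `flank` residues on its N-terminal side.

    Coordinates are 1-based and inclusive (an assumption of this sketch).
    """
    if not (1 <= domain_start <= domain_end <= len(sequence)):
        raise ValueError("domain coordinates fall outside the sequence")
    # Clamp at residue 1 when fewer than `flank` residues precede the domain.
    region_start = max(1, domain_start - flank)
    return sequence[region_start - 1:domain_end]


# Toy protein: 120 'N' residues, a 150-residue 'D' domain, then 30 'C' residues.
toy = "N" * 120 + "D" * 150 + "C" * 30
region = domain_with_flank(toy, domain_start=121, domain_end=270)
print(len(region))
```

The extracted regions would then be written to a FASTA file and passed to the alignment program.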

Phylogenetic analyses

The Bayesian analysis was carried out using MrBayes version 3.1.2 (17, 18) with the mixed model of amino acid substitution provided in the package. Six simultaneous chains were run for 1.0×10^6 generations, sampling trees every 100 cycles. The first 1,000 trees were discarded as “burn in”. For all analyses, chr5orf was used as an outgroup to root the tree, based on the absence of the conserved arginine residue. The MP analyses were performed with PAUP* 4.0b10 ( on the entire dataset using a heuristic search with 500 random taxon addition replicates, TBR branch-swapping, gaps scored as missing data, and all characters equally weighted. A strict consensus tree was computed whenever multiple equally parsimonious trees were obtained. The robustness of each branch was determined using the nonparametric bootstrap test ( with 500 replicates and 10 random taxon additions.
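As a quick check of what the sampling settings above imply, the short Python sketch below works out how many trees are sampled and retained. It assumes one tree is stored per sampling interval and does not count the starting tree, which some programs also record; the variable names are illustrative only.

```python
generations = 1_000_000   # chain length reported in the text
sample_every = 100        # sampling interval reported in the text
burn_in_trees = 1_000     # trees discarded as burn-in

# Assumption: one tree is stored per sampling interval; the starting tree
# is not counted here.
sampled_trees = generations // sample_every
retained_trees = sampled_trees - burn_in_trees
burn_in_fraction = burn_in_trees / sampled_trees

print(sampled_trees, retained_trees, burn_in_fraction)
```

Under these assumptions the burn-in corresponds to the first 10% of the sampled trees, leaving 9,000 trees for the posterior summary.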

Character optimization

MacClade 4.08 ( was used to perform character optimization analyses. We investigated the evolution of two characters that were superimposed onto the Bayesian tree proposed for the RhoGAP-containing proteins: the presence of different domains in addition to RhoGAP, and the GTPase-enhancing activity toward the most studied Rho GTPases (Rac1, RhoA, and Cdc42). For domain identification, a search of the PFAM database version 19.0 ( was performed using the HMMPFAM tool from the HMMER suite version 2.3.2 ( with the per-sequence E-value cutoff set to 1.0E-10. This character had 20 unordered character states plus a 21st character state corresponding to the absence of additional domains other than RhoGAP. The GTPase character had eight character states representing the affinity toward one or two Rho GTPases; these data were mined by searching PubMed (http://www.ncbi.nlm.nih.gov/entrez) at NCBI. To test whether there was a phylogenetic signal in the characters traced, we used the methodology proposed by Wahlberg ( that was modified from the PTP test described by Faith and Cranston (. The test compared the number of steps on the tree for the actual data with the number of steps obtained for each random reshuffling of the character states. We performed 300 random reshufflings of character states among the fixed terminal proteins by using Mesquite version 1.06 (http://www.mesquiteproject.org). The probability (P) that the observed pattern does not differ from random is given as (n + 1)/300, where n is the number of replicates whose step count is no greater than that of the actual data. A phylogenetic signal was considered significant when P was less than 0.05 (.
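The reshuffling test described above can be sketched in Python. This is not the Mesquite implementation: it is a minimal illustration that assumes a rooted, fully binary tree and counts steps for a single unordered character with the Fitch algorithm. The function names, the nested-tuple tree encoding, and the eight-protein toy tree are all hypothetical.

```python
import random


def fitch_steps(tree, states):
    """Count parsimony steps for one unordered character on a rooted binary tree.

    `tree` is a leaf name (str) or a pair (left, right); `states` maps leaf -> state.
    """
    def visit(node):
        if isinstance(node, str):
            return {states[node]}, 0
        left_set, left_steps = visit(node[0])
        right_set, right_steps = visit(node[1])
        shared = left_set & right_set
        if shared:
            return shared, left_steps + right_steps
        # No state in common: take the union and count one change.
        return left_set | right_set, left_steps + right_steps + 1

    return visit(tree)[1]


def permutation_p_value(tree, states, reshufflings=300, seed=0):
    """Return (observed steps, P), with P = (n + 1) / reshufflings as in the text."""
    rng = random.Random(seed)
    observed = fitch_steps(tree, states)
    leaves = list(states)
    values = [states[leaf] for leaf in leaves]
    n = 0  # reshuffled replicates needing no more steps than the real data
    for _ in range(reshufflings):
        rng.shuffle(values)  # reassign states among the fixed terminals
        if fitch_steps(tree, dict(zip(leaves, values))) <= observed:
            n += 1
    return observed, (n + 1) / reshufflings


# Hypothetical toy tree: two clades, each uniform for one GTPase.
toy_tree = ((("p1", "p2"), ("p3", "p4")), (("p5", "p6"), ("p7", "p8")))
toy_states = {"p1": "Rac1", "p2": "Rac1", "p3": "Rac1", "p4": "Rac1",
              "p5": "RhoA", "p6": "RhoA", "p7": "RhoA", "p8": "RhoA"}
steps, p = permutation_p_value(toy_tree, toy_states)
print(steps, p)
```

In the toy example the real data need a single step, so only reshufflings that again place each state in its own clade count toward n, which keeps P small. A tree with polytomies, such as a consensus tree, would need a step-counting routine that handles nodes with more than two children.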

Authors’ contributions

MMB participated in the design of the study, sequence alignment, bioinformatics analyses, and drafted the manuscript. KLSB participated in all automatic alignment and eye refinements, interpretations of the bioinformatics results, and drafted the manuscript. FFC and STOS conceived the study, participated in its design and coordination, and helped to draft the manuscript. All authors read and approved the final manuscript.

Competing interests

The authors have declared that no competing interests exist.
References

1.  The phylogenetics and biochemistry of host-plant specialization in Melitaeine butterflies (Lepidoptera: Nymphalidae).

Authors:  N Wahlberg
Journal:  Evolution       Date:  2001-03       Impact factor: 3.694

Review 2.  Rho GTPase-activating proteins in cell regulation.

Authors:  Sun Young Moon; Yi Zheng
Journal:  Trends Cell Biol       Date:  2003-01       Impact factor: 20.808

3.  ARHGAP10, a novel human gene coding for a potentially cytoskeletal Rho-GTPase activating protein.

Authors:  Daniela Sanchez Bassères; Edna Vedelago Tizzei; Adriana A S Duarte; Fernando Ferreira Costa; Sara Teresinha Olalla Saad
Journal:  Biochem Biophys Res Commun       Date:  2002-06-14       Impact factor: 3.575

4.  MrBayes 3: Bayesian phylogenetic inference under mixed models.

Authors:  Fredrik Ronquist; John P Huelsenbeck
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

5.  Pfam: a comprehensive database of protein domain families based on seed alignments.

Authors:  E L Sonnhammer; S R Eddy; R Durbin
Journal:  Proteins       Date:  1997-07

Review 6.  Rho GTPases and signaling networks.

Authors:  L Van Aelst; C D'Souza-Schorey
Journal:  Genes Dev       Date:  1997-09-15       Impact factor: 11.361

7.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms.

Authors:  R Sachidanandam; D Weissman; S C Schmidt; J M Kakol; L D Stein; G Marth; S Sherry; J C Mullikin; B J Mortimore; D L Willey; S E Hunt; C G Cole; P C Coggill; C M Rice; Z Ning; J Rogers; D R Bentley; P Y Kwok; E R Mardis; R T Yeh; B Schultz; L Cook; R Davenport; M Dante; L Fulton; L Hillier; R H Waterston; J D McPherson; B Gilman; S Schaffner; W J Van Etten; D Reich; J Higgins; M J Daly; B Blumenstiel; J Baldwin; N Stange-Thomann; M C Zody; L Linton; E S Lander; D Altshuler
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

Review 8.  GAPs for rho-related GTPases.

Authors:  N Lamarche; A Hall
Journal:  Trends Genet       Date:  1994-12       Impact factor: 11.639

Review 9.  Human RhoGAP domain-containing proteins: structure, function and evolutionary relationships.

Authors:  Jeremy Peck; Gilbert Douglas; Catherine H Wu; Peter D Burbelo
Journal:  FEBS Lett       Date:  2002-09-25       Impact factor: 4.124

10.  Structure at 1.65 A of RhoA and its GTPase-activating protein in complex with a transition-state analogue.

Authors:  K Rittinger; P A Walker; J F Eccleston; S J Smerdon; S J Gamblin
Journal:  Nature       Date:  1997-10-16       Impact factor: 49.962
