| Literature DB >> 12620104 |
Eugene V Koonin1, Kira S Makarova, Igor B Rogozin, Laetitia Davidovic, Marie-Claude Letellier, Luca Pellegrini.
Abstract
BACKGROUND: The rhomboid family of polytopic membrane proteins shows a level of evolutionary conservation unique among membrane proteins. They are present in nearly all the sequenced genomes of archaea, bacteria and eukaryotes, with the exception of several species with small genomes. On the basis of experimental studies with the developmental regulator rhomboid from Drosophila and the AarA protein from the bacterium Providencia stuartii, the rhomboids are thought to be intramembrane serine proteases whose signaling function is conserved in eukaryotes and prokaryotes.Entities:
Mesh:
Substances:
Year: 2003 PMID: 12620104 PMCID: PMC153459 DOI: 10.1186/gb-2003-4-3-r19
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Multiple alignment of the conserved core of the rhomboid family proteins. The alignment includes the majority of the detected rhomboid family proteins; some closely related sequences were omitted. Only the six conserved (predicted) transmembrane helices (TMH) and short surrounding regions are shown. The boundaries of the predicted TMH are indicated by gray shading and overline and they are numbered 1-6. The number of amino-acid residues in the omitted terminal and internal regions are indicated. The consensus shows amino-acid residues present in at least 90% of the aligned sequences; h stands for hydrophobic residues (A, C, I, L, V, M, F, Y, W in the single-letter amino-acid code) and s for small residues (G, A, S, D, N, V). The proposed catalytic serine (TMH4) and histidine (TMH6) as well as conserved residues in TMH2 with possible ancillary roles in catalysis are highlighted in color. The proteins are identified with the gene identification (GI) number from the nonredundant database and an abbreviated species name. Bacterial species are color-coded green, eukaryotic species blue and archaeal species yellow. Species name abbreviations: Aerpe, Aeropyrum pernix; Agrtu, Agrobacterium tumefaciens; Anoga, Anopheles gambiae; Arath, Arabidopsis thaliana; Arcfu, Archaeoglobus fulgidus; Bacsu, Bacillus subtilis; Brume, Brucella melitensis; Caeel, Caenorhabditis elegans; Caucr, Caulobacter crescentus; Chlte, Chlorobium tepidum; Cloac, Clostridium acetobutilicum; Corgl, Corynebacterium glutamicum; Deira, Deinococcus radiodurans; Dicdi, Dictyostelium discoideum; Drome, Drosophila melanogaster; Escco, Escherichia coli; Haein, Haemophilus influenzae; Halsp, Halobacterium sp.; Homsa, Homo sapiens; Lacla, Lactococcus lactis; Lisin, Listeria innocua; Metja, Methanoccocus jannaschii; Metka, Methanopyrus kandleri; Metma, Methanosarcina mazei; Meslo, Mesorhizobium loti; Mycle, Mycobacterium leprae; Myctu, Mycobacterium tuberculosis; Neucr, Neurospora crassa; Nossp, Nostoc sp.; Prost, Providencia stuartii; Pyrab, Pyrococcus abyssi; Pyrae, Pyrobaculum aerophilum; Ralso, Ralstonia solanaraceum; Sacce, Saccharomyces cerevisiae; Schpo, Schizosaccharomyces pombe; Sinme, Sinorhizobium meliloti; Strco, Streptomyces coelicolor; Strpn, Streptococcus pneumoniae; Sulso, Sulfolobus solfataricus; Sulto, Sulfolobus tokodaii; Synsp, Synechocystis sp.; Theac, Thermoplasma acidophilum; Thema, Thermotoga maritima; Thete, Thermus thermophilus; Vibch, Vibrio cholerae; Xanca, Xanthomonas campestris; Xylfa, Xylella fastidiosa.
Figure 2Phylogenetic tree of the rhomboid family. The sequences and their regions used to construct the tree are exactly those shown in Figure 1. The color coding and abbreviations are as in Figure 1. The two major eukaryotic subfamilies are denoted as RHO and PARL (see text) and four clusters containing unexpected (from a phylogenetic viewpoint) sets of species are denoted 1-4. The clades that were investigated in the KH test are denoted A through D. Although the tree is shown in a pseudorooted form for convenience, this is an unrooted tree. Internal nodes with at least 70% RELL bootstrap supported are denoted by black circles and nodes with a 50-70% support by blue circles. The posterior probabilities reported by the MRBAYES program are indicated for some key internal branches. Domain architectures are connected to the respective proteins by brackets or lines. The domain key is shown at the bottom of the figure.
Log-likelihood analysis of possible placements of selected branches of maximum likelihood trees for the proteins analyzed
| Tree* | Diff lnL† | SE‡ | RELL-BP§ |
| Original tree | 0.0 | - | 0.9702 |
| A → B | -18.9 | 10.2 | 0.0264 |
| B → A | -46.6 | 14.6 | 0.0003 |
| A → C | -30.3 | 12.8 | 0.0031 |
| A → D | -47.9 | 15.6 | 0.0000 |
*A-D, clades that were subjected to local rearrangements in the tree as indicated in Figure 2 and discussed in the text. †Difference of the log-likelihoods relative to the best tree. ‡Standard error of Diff lnL. §Bootstrap probability of the given tree calculated using the RELL method (resampling of estimated log-likelihoods).
Figure 3Hypothesis-specific constraint tree for the rhomboid family. (a) Hypothesis 1, polyphyletic origin of eukaryotic rhomboids from prokaryotic progenitors. The RHO and PARL subfamilies are denoted; the remaining clusters include prokaryotic rhomboids designated as in Figure 2 (with 'a' added to the GI number). Within each cluster, the branches were collapsed into a multifurcation. (b) Hypothesis 2, monophyletic origin of eukaryotic rhomboids. All eukaryotic and prokaryotic sequences were collapsed into the two respective clusters. The trees are unrooted, although shown in a pseudorooted form.
Statistical comparisons of the best neighbor-joining tree with the hypothesis 1 and hypothesis 2 trees
| Kishino-Hasegawa test | |||||
| Tree | Length | Length difference | SD (difference) | ||
| Best | 4951 | - | |||
| Hypothesis 1 | 4966 | 15 | 11.9 | 1.26 | 0.211 |
| Hypothesis 2 | 4974 | 23 | 10.8 | 2.12 | 0.036 |
| Templeton (Wilcoxon signed-ranks) test | |||||
| Tree | Length | Rank sums | |||
| Best | 4951 | - | |||
| Hypothesis 1 | 4966 | 1418.0 | 69 | -1.33 | 0.185 |
| -997.0 | |||||
| Hypothesis 2 | 4974 | 1244.5 | 62 | -1.97 | 0.048 |
| -708.5 | |||||
| Winning-sites (sign) test | |||||
| Tree | Length | Counts | |||
| Best | 4951 | ||||
| Hypothesis 1 | 4966 | 36 | 0.810 | ||
| -33 | |||||
| Hypothesis 2 | 4974 | 40 | 0.031 | ||
| -22 | |||||
*Probability of getting a more extreme test statistic under the null hypothesis of no difference between the two trees (two-tailed test).
Figure 4A hypothetical scenario for the origin and dissemination of the rhomboid family proteases. The figure schematically shows the proposed three stages of evolution of the rhomboid family. In (a), the progenitor of the rhomboid family functions as a transporter for a regulatory peptide in some bacterial lineage. In (b), the catalytic site of the intramembrane protease evolves, allowing the switch to RIP as the mechanism of the regulatory peptide release. In (c), the emergence of RIP is followed by a burst of HGT. R, regulatory peptide. The transmembrane helices of rhomboid are designated as in Figure 1; their topology in the membrane is based on that proposed in [7]. The catalytic histidine and serine are shown and connected by a dotted line to indicate the proposed charge-relay system of the protease; possible ancillary catalytic residues are not shown.