Literature DB >> 18081935

Orthologs of the small RPB8 subunit of the eukaryotic RNA polymerases are conserved in hyperthermophilic Crenarchaeota and "Korarchaeota".

Eugene V Koonin1, Kira S Makarova, James G Elkins.   

Abstract

Although most of the key components of the transcription apparatus, and in particular, RNA polymerase (RNAP) subunits, are conserved between archaea and eukaryotes, no archaeal homologs of the small RPB8 subunit of eukaryotic RNAP have been detected. We report that orthologs of RPB8 are encoded in all sequenced genomes of hyperthermophilic Crenarchaeota and a recently sequenced "korarchaeal" genome, but not in Euryarchaeota or the mesophilic crenarchaeon Cenarchaeum symbiosum. These findings suggest that all 12 core subunits of eukaryotic RNAPs were already present in the last common ancestor of the extant archaea.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18081935      PMCID: PMC2234397          DOI: 10.1186/1745-6150-2-38

Source DB:  PubMed          Journal:  Biol Direct        ISSN: 1745-6150            Impact factor:   4.540


Findings

The core components of the information-processing systems, and in particular, the transcription machinery, are conserved between archaea and eukaryotes, and distinct from the bacterial versions. The heteromultimeric eukaryotic RNAPs consist of 12 subunits (Rpb1–12), of which 11 are conserved in archaea and eukaryotes whereas one, Rpb8, is thought to be unique for eukaryotes [1-6]. Rpb8 is a small protein that typically consists of ~120–150 amino acids and shows relatively poor sequence conservation in eukaryotes. The structure of Rpb8 has been solved, originally, by solution NMR [7] and, subsequently, as part of the RNAP II core, by X ray crystallography [8]. The two structures are in good agreement and indicate that Rpb8 forms a distinct version of the OB(oligonucleotide-oligosaccharide-binding) fold [9] that is characterized by a distinct pattern of 9 β-strands and a pair of invariant glycines in the turn between strands 7 and 8. In the RNAP II structure, Rpb8 interacts with the so-called pore module of Rpb1 at a defined motif that is conserved in both eukaryotic and archaeal Rpb1 orthologs [8]. In addition, Rpb8 genetically interacts with another small subunit, Rpb6, that is also adjacent to the pore module in the core RNAP structure [6]. It has been suggested that Rpb8 and Rpb6, together with the pore module, form a distinct functional unit [6]. The exact role of the small subunits remains unknown although both Rpb6 and Rpb8 are conserved in all eukaryotes, are shared between RNAP I, II and III, and are essential for yeast growth. Regardless of the precise function of Rpb8, the purported absence of this RNAP subunit ortholog in archaea is the major gap in the picture of the otherwise exact correspondence between the core transcriptional machineries of eukaryotes and archaea, and is highly unexpected considering the conservation of all other subunits including the functional partner of Rpb8, Rpb6. In the course of the genome annotation for the first sequenced member of the "Korarchaeota", a putative deep branch of archaea ([10] and manuscript in preparation), one of us (JGE) identified a short (110 amino acids) predicted protein for which some of the best hits in a BLAST search [11] were the eukaryotic Rpb8 subunits. Although the sequence similarity was not statistically significant, this observation prompted a systematic search for possible archaeal homologs of Rpb8. BLAST searches started with the sequences of Rpb8 subunits of various eukaryotic species, again, showed statistically not significant similarity to a distinct set of crenarchaeal proteins that were similar in size to (or slightly shorter than) Rpb8. However, reciprocal iterative PSI-BLAST searches (inclusion E-value cut-off 0.01) started with some of these crenarcheal sequences showed significant similarity to Rpb8. For example, a search with the sequence from Hyperthermus butylicus (Hbut_0467) used as the query retrieved the Rpb8 sequence from fission yeast Schizosaccharomyces pombe in the 2nd iteration, with a E-value of 0.004, and numerous eukaryotic Rpb8 sequences in the 3rd iteration, with highly significant E-values. Similarly, a search with the sequence from Igniococcus hospitalis (Igni_1165) retrieved Rpb8 from Entamoeba histolytica in the 5th iteration, with an E-value of 4 × 10-5). Examination of a multiple alignment of the Rpb8 sequences from diverse eukaryotes and the putative archaeal counterparts showed remarkable conservation of all elements of the OB-fold, in particular, the diagnostic motif between strands 7 and 8 (although one of the two glycines that are invariant in Rpb8 is replaced in a subset of the putative archaeal homologs (Fig. 1a). Furthermore, secondary structure prediction for the archaeal proteins showed a near-perfect superposition with the secondary structure elements extracted from the crystal structure of Rpb8 [8] (Fig. 1a). Taken together, these findings indicate that the detected small archaeal proteins are bona fide homologs of Rpb8.
Figure 1

Orthologs of Rpb8 in archaea. (a) Multiple alignment of eukaryotic Rpb8 subunits and their archaeal orthologs (RpoG). The alignment was constructed using the combination of the results obtained with PROMALS [15] and MUSCLE [16], followed by manual correction on the basis of secondary structure prediction that was obtained using PSIPRED [17] and local alignments generated by PSI-BLAST. Sequences are denoted by their numeric Genbank Identifiers (GI numbers) and species names. The full species names are given in Figure 2. The positions of the first and the last residues of the aligned region in the corresponding protein are indicated for each sequence. The numbers within the alignment represent poorly conserved inserts that are not shown. The numbers of omitted amino acids for T. pendens and G. lamblia are indicated by reverse shading. Positions with identical amino acids in all aligned sequences are in bold face. The coloring is based on the consensus shown underneath the alignment; 'h' indicates hydrophobic residues (ACFILMVWYH), 'p' indicates polar residues (STEDKRNQH), 's' indicates small residues (AGCVDS). Secondary structure is shown for the crystal structure of human Rpb8 (pdb 2F3I); 'H' indicates α-helix and 'E' indicates extended conformation (β-strand). The PSIPRED secondary structure prediction is shown underneath the experimental secondary structure. The glycine doublet that is invariant in eukaryotic Rpb8 sequences is boxed. (b) Phylogenetic and genomic contexts of Rpb8/RpoG. The maximum likelihood phylogenetic tree of Rpb8/RpoG was constructed by local rearrangement of an original minimum evolution (Fitch) tree [18] using the MOLPHY program [19]. MOLPHY was also used to compute RELL bootstrap probabilities, which are indicated (as percentages) for selected major branches. Each terminal node of the tree is labeled by the full species name and the GI number. The genomic neighborhoods of the rpoG gene in Crenarchaeota and the "korarchaeal" genome are shown to the right of the respective branches of the tree. Orthologous genes are shown by arrows of the same color.

Orthologs of Rpb8 in archaea. (a) Multiple alignment of eukaryotic Rpb8 subunits and their archaeal orthologs (RpoG). The alignment was constructed using the combination of the results obtained with PROMALS [15] and MUSCLE [16], followed by manual correction on the basis of secondary structure prediction that was obtained using PSIPRED [17] and local alignments generated by PSI-BLAST. Sequences are denoted by their numeric Genbank Identifiers (GI numbers) and species names. The full species names are given in Figure 2. The positions of the first and the last residues of the aligned region in the corresponding protein are indicated for each sequence. The numbers within the alignment represent poorly conserved inserts that are not shown. The numbers of omitted amino acids for T. pendens and G. lamblia are indicated by reverse shading. Positions with identical amino acids in all aligned sequences are in bold face. The coloring is based on the consensus shown underneath the alignment; 'h' indicates hydrophobic residues (ACFILMVWYH), 'p' indicates polar residues (STEDKRNQH), 's' indicates small residues (AGCVDS). Secondary structure is shown for the crystal structure of human Rpb8 (pdb 2F3I); 'H' indicates α-helix and 'E' indicates extended conformation (β-strand). The PSIPRED secondary structure prediction is shown underneath the experimental secondary structure. The glycine doublet that is invariant in eukaryotic Rpb8 sequences is boxed. (b) Phylogenetic and genomic contexts of Rpb8/RpoG. The maximum likelihood phylogenetic tree of Rpb8/RpoG was constructed by local rearrangement of an original minimum evolution (Fitch) tree [18] using the MOLPHY program [19]. MOLPHY was also used to compute RELL bootstrap probabilities, which are indicated (as percentages) for selected major branches. Each terminal node of the tree is labeled by the full species name and the GI number. The genomic neighborhoods of the rpoG gene in Crenarchaeota and the "korarchaeal" genome are shown to the right of the respective branches of the tree. Orthologous genes are shown by arrows of the same color. Small proteins homologous to Rpb8 were identified in all 10 sequenced genomes of hyperthermophilic Crenarchaeota and the only available korarchaeal genome. Each of these genomes encodes a single Rpb8 homolog which mimics the situation in eukaryotes where no paralogs of Rpb8 are detectable. By contrast, n homologs of these proteins were identified in Euryarchaeota and the mesophilic crenarchaeon Cenarchaeum symbiosum, despite extensive search including running a position-specific scoring matrix for Rpb8 and their crenarchaeal-korachaeal homologs against a dedicated database of euryarchaeal and C. symbiosum protein sequences. Thus, we conclude that all thermophilic Crenarchaeota and at least one korarchaeote encode a single ortholog of eukaryotic Rpb8 whereas Euryarchaeota and C. symbiosum (the only mesophilic crenarchaeon for which the genome sequence is currently available) do not. In retrospect, we became aware that the protein we identified as the crenarchaeal ortholog of Rpb8 has already been described as one of the 13 experimentally defined subunits of the RNAP of the crenarchaeon Sulfolobus acidocaldarius and designated RpoG [12], and the ortholog encoded in the genome of S. solfataricus has been accordingly annotated [13]. Given these data, it appears sensible to adopt the designation RpoG for the archaeal orthologs of Rpb8. In a subsequent global analysis of mRNA stability in the two Sulfolobus species, it has been shown that the RpoG mRNA is markedly more stable than mRNAs of other RNAP subunits, and the apparent uniqueness of this subunit in Sulfolobus has been emphasized [14]. The present analysis clarifies the situation by showing that RpoG is conserved throughout the hyperthermophilic Crenarchaeota (and at least one korarchaeote). Although the small size of Rpb8 and its archaeal orthologs (RpoG) hampers reliable phylogenetic analysis, the maximum likelihood tree we constructed shows a clear separation of the eukaryotic and crenarchaeal branches, and within the latter, the split between Thermoproteales and Sulfolobales (Fig. 1b). Interestingly, the korarchaeal RpoG clustered with Thermoproteales in a strongly supported branch (Fig. 1b). Broader implications of this observation for the evolution of the "Korarchaeota" remain to be investigated. In the archaeal genomes, the rpoG gene is embedded in a notable, partially conserved genomic context (Fig. 1b). With few exceptions, rpoG forms either a codirectional or a divergent but potentially coregulated gene pair with a gene encoding a small RNA-binding protein (COG1958) that is orthologous to eukaryotic Lsm6 and is implicated in RNA-processing. In most of the Sulfolobales, the latter gene is adjacent to a gene for a tRNA modification enzyme, queuine/archaeosine tRNA-ribosyltransferase. Another common neighbor of rpoG is the gene for transcription elongation factor TFIIIB, with a divergent orientation in the majority of Thermoproteales and a codirectional orientation in the korarchaeote. Finally, Thermofilum pendens appears to have a more complex operon organization, with the genes for another RNAP subunit, a transcription factor and a ribosomal protein in the same predicted operon with rpoG (Fig. 1b). This genomic context suggests that, in Crenarchaeota, RpoG is likely to be involved in a tight functional cooperation with TFIIIB, and could also contribute to coupling transcription with RNA processing and modification. The finding described here fills the last gap in the one-to-one correspondence between the RNAP subunits of archaea and eukaryotes, with the implication that the archaeal "parent" of eukaryotes already possessed the intricate 12-subunit organization of RNAP. Surprisingly, however, Euryarchaeota and the only available genome of a mesophilic crenarchaeon appear to lack an ortholog of Rpb8, a conclusion that is compatible with the report on the reconstruction of a fully active RNAP of the euryarchaeon Methanocaldococcus jannaschii from 12 recombinant proteins which, obviously, did not include Rpb8 [1]. Depending on the adopted evolutionary scenario, it is conceivable that Rpb8 emerged in the crenarchaeal lineage or, perhaps, more plausibly, that it was already present in the common ancestor of all extant archaea but lost at the base of the euryarchaeal branch. Regardless of the solution to this conundrum, experimental study of functional differences between RNAPs of Euryarchaeota and Crenarchaeota should be illuminating, given the unusual difference in their predicted subunit composition.

Abbreviations

RNAP: RNA polymerase.

Competing interests

The author(s) declare that they have no competing interests.

Reviewers' reports

Reviewer 1: Purificacion Lopez-Garcia, Universite Paris-Sud This manuscript reports the identification of genes orthologous to RPB8 in archaea. This observation is very interesting and worth publishing, as RPB8 was the only protein conserved in the eukaryotic core of RNA polymerases I, II and III for which orthologues in archaea had not been found. The fact that this protein is apparently missing from Euryarchaeota is intriguing, suggesting that either its role can be fulfilled by other elements or that its primary sequence has evolved beyond recognition. At any rate, the finding of archaeal RPB8 homologues indicates that the complete RNA polymerase core in both archaea and eukaryotes share a common ancestry. Reviewer 2: Chris Ponting, Oxford University This manuscript demonstrates convincingly the presence of an orthologue of eukaryotic RPB8 in hyperthermophilic Crenarchaeota. As such, it provides the last missing piece in the RNAP "puzzle" and should thus be of interest to many working in this field.

Authors' contributions

EVK contributed to sequence analysis and wrote the manuscript; KSm contributed to sequence analysis; JGE made the original observation on the presence of an Rpb8 homolog among the "korarchaeal" proteins; all authors read, edited and approved the final version of the manuscript.
  17 in total

1.  The PSIPRED protein structure prediction server.

Authors:  L J McGuffin; K Bryson; D T Jones
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

2.  The complete genome of the crenarchaeon Sulfolobus solfataricus P2.

Authors:  Q She; R K Singh; F Confalonieri; Y Zivanovic; G Allard; M J Awayez; C C Chan-Weiher; I G Clausen; B A Curtis; A De Moors; G Erauso; C Fletcher; P M Gordon; I Heikamp-de Jong; A C Jeffries; C J Kozera; N Medina; X Peng; H P Thi-Ngoc; P Redder; M E Schenk; C Theriault; N Tolstrup; R L Charlebois; W F Doolittle; M Duguet; T Gaasterland; R A Garrett; M A Ragan; C W Sensen; J Van der Oost
Journal:  Proc Natl Acad Sci U S A       Date:  2001-06-26       Impact factor: 11.205

3.  Structural basis of transcription: RNA polymerase II at 2.8 angstrom resolution.

Authors:  P Cramer; D A Bushnell; R D Kornberg
Journal:  Science       Date:  2001-04-19       Impact factor: 47.728

4.  A recombinant RNA polymerase II-like enzyme capable of promoter-specific transcription.

Authors:  Finn Werner; Robert O J Weinzierl
Journal:  Mol Cell       Date:  2002-09       Impact factor: 17.970

5.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

6.  Transcription in archaea: similarity to that in eucarya.

Authors:  D Langer; J Hain; P Thuriaux; W Zillig
Journal:  Proc Natl Acad Sci U S A       Date:  1995-06-20       Impact factor: 11.205

7.  Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods.

Authors:  J Felsenstein
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

8.  Partners of Rpb8p, a small subunit shared by yeast RNA polymerases I, II and III.

Authors:  J F Briand; F Navarro; P Rematier; C Boschiero; S Labarre; M Werner; G V Shpakovski; P Thuriaux
Journal:  Mol Cell Biol       Date:  2001-09       Impact factor: 4.272

9.  OB(oligonucleotide/oligosaccharide binding)-fold: common structural and functional solution for non-homologous sequences.

Authors:  A G Murzin
Journal:  EMBO J       Date:  1993-03       Impact factor: 11.598

10.  PROMALS web server for accurate multiple protein sequence alignments.

Authors:  Jimin Pei; Bong-Hyun Kim; Ming Tang; Nick V Grishin
Journal:  Nucleic Acids Res       Date:  2007-04-22       Impact factor: 16.971

View more
  20 in total

1.  An archaeal origin for the actin cytoskeleton: Implications for eukaryogenesis.

Authors:  Rolf Bernander; Anders E Lind; Thijs J G Ettema
Journal:  Commun Integr Biol       Date:  2011-11-01

2.  A korarchaeal genome reveals insights into the evolution of the Archaea.

Authors:  James G Elkins; Mircea Podar; David E Graham; Kira S Makarova; Yuri Wolf; Lennart Randau; Brian P Hedlund; Céline Brochier-Armanet; Victor Kunin; Iain Anderson; Alla Lapidus; Eugene Goltsman; Kerrie Barry; Eugene V Koonin; Phil Hugenholtz; Nikos Kyrpides; Gerhard Wanner; Paul Richardson; Martin Keller; Karl O Stetter
Journal:  Proc Natl Acad Sci U S A       Date:  2008-06-05       Impact factor: 11.205

3.  Soaking of DNA into crystals of archaeal RNA polymerase achieved by desalting in droplets.

Authors:  Magdalena N Wojtas; Nicola G A Abrescia
Journal:  Acta Crystallogr Sect F Struct Biol Cryst Commun       Date:  2012-08-31

Review 4.  The dispersed archaeal eukaryome and the complex archaeal ancestor of eukaryotes.

Authors:  Eugene V Koonin; Natalya Yutin
Journal:  Cold Spring Harb Perspect Biol       Date:  2014-04-01       Impact factor: 10.005

5.  Structure-Based Deep Mining Reveals First-Time Annotations for 46 Percent of the Dark Annotation Space of the 9,671-Member Superproteome of the Nucleocytoplasmic Large DNA Viruses.

Authors:  Yeva Mirzakhanyan; Paul David Gershon
Journal:  J Virol       Date:  2020-11-23       Impact factor: 5.103

Review 6.  Archaea and the origin of eukaryotes.

Authors:  Laura Eme; Anja Spang; Jonathan Lombard; Courtney W Stairs; Thijs J G Ettema
Journal:  Nat Rev Microbiol       Date:  2017-11-10       Impact factor: 60.633

7.  The origin and early evolution of eukaryotes in the light of phylogenomics.

Authors:  Eugene V Koonin
Journal:  Genome Biol       Date:  2010-05-05       Impact factor: 13.583

8.  Insights into the evolution of Archaea and eukaryotic protein modifier systems revealed by the genome of a novel archaeal group.

Authors:  Takuro Nunoura; Yoshihiro Takaki; Jungo Kakuta; Shinro Nishi; Junichi Sugahara; Hiromi Kazama; Gab-Joo Chee; Masahira Hattori; Akio Kanai; Haruyuki Atomi; Ken Takai; Hideto Takami
Journal:  Nucleic Acids Res       Date:  2010-12-15       Impact factor: 16.971

9.  Identification of an ortholog of the eukaryotic RNA polymerase III subunit RPC34 in Crenarchaeota and Thaumarchaeota suggests specialization of RNA polymerases for coding and non-coding RNAs in Archaea.

Authors:  Fabian Blombach; Kira S Makarova; Jeannette Marrero; Bettina Siebers; Eugene V Koonin; John van der Oost
Journal:  Biol Direct       Date:  2009-10-14       Impact factor: 4.540

10.  Rearrangement of the RNA polymerase subunit H and the lower jaw in archaeal elongation complexes.

Authors:  Sebastian Grünberg; Christoph Reich; Mirijam E Zeller; Michael S Bartlett; Michael Thomm
Journal:  Nucleic Acids Res       Date:  2009-12-29       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.