| Literature DB >> 24214024 |
Jin-Mei Feng1, Hai-Feng Tian, Jian-Fan Wen.
Abstract
As a nucleolar complex for small-subunit (SSU) ribosomal RNA processing, SSU processome has been extensively studied mainly in Saccharomyces cerevisiae but not in diverse organisms, leaving open the question of whether it is a ubiquitous mechanism across eukaryotes and how it evolved in the course of the evolution of eukaryotes. Genome-wide survey and identification of SSU processome components showed that the majority of all 77 yeast SSU processome proteins possess homologs in almost all of the main eukaryotic lineages, and 14 of them have homologs in archaea but few in bacteria, suggesting that the complex is ubiquitous in eukaryotes, and its evolutionary history began with abundant protein homologs being present in archaea and then a fairly complete form of the complex emerged in the last eukaryotic common ancestor (LECA). Phylogenetic analysis indicated that ancient gene duplication and functional divergence of the protein components of the complex occurred frequently during the evolutionary origin of the LECA from prokaryotes. We found that such duplications not only increased the complex's components but also produced some new functional proteins involved in other nucleolar functions, such as ribosome biogenesis and even some nonnucleolar (but nuclear) proteins participating in pre-mRNA splicing, implying the evolutionary emergence of the subnuclear compartment-the nucleolus-has occurred in the LECA. Therefore, the LECA harbored not only complicated SSU processomes but also a nucleolus. Our analysis also revealed that gene duplication, innovation, and loss, caused further divergence of the complex during the divergence of eukaryotes.Entities:
Keywords: LECA; SSU processome; evolution; nucleolus; origin
Mesh:
Substances:
Year: 2013 PMID: 24214024 PMCID: PMC3879963 DOI: 10.1093/gbe/evt173
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Summary of Distribution of the SSU Processome Proteins in the Five Eukaryotic Supergroups
| Subcomplex | Protein Name | Opisthokonts | Amoebozoa | Plantae | Excavates | Chromalveolates | LECA |
|---|---|---|---|---|---|---|---|
| U3 snoRNP | Nop1 | + | + | + | + | + | + |
| Nop56 | + | + | + | + | + | + | |
| Nop58 | + | + | + | + | + | + | |
| Snu13 | + | + | + | + | + | + | |
| Rrp9 | + | + | + | + | + | + | |
| Mpp10 | Imp3 | + | + | + | + | + | + |
| Imp4 | + | + | + | + | + | + | |
| Mpp10 | + | + | + | + | + | + | |
| UtpA | Utp4 | + | + | + | + | + | + |
| Utp5 | + | + | + | + | + | + | |
| Utp8 | + | − | − | − | − | − | |
| Utp9 | + | − | − | − | − | − | |
| Utp10 | + | + | + | + | + | + | |
| Utp15 | + | + | + | + | + | + | |
| Utp17 | + | + | + | + | + | + | |
| UtpB | Utp1 | + | + | + | + | + | + |
| Utp6 | + | + | + | + | + | + | |
| Utp12 | + | + | + | + | + | + | |
| Utp13 | + | + | + | + | + | + | |
| Utp18 | + | + | + | + | + | + | |
| Utp21 | + | + | + | + | + | + | |
| UtpC | Rrp7 | + | + | + | − | + | + |
| Utp22 | + | + | + | + | + | + | |
| Rrp36 | + | + | + | + | + | + | |
| Bms1 | + | + | + | + | + | + | |
| Rcl1 | + | + | + | + | + | + | |
| Unclassified | Utp2 | + | + | + | + | + | + |
| Utp3 | + | + | + | + | + | + | |
| Utp7 | + | + | + | + | + | + | |
| Utp11 | + | + | + | + | + | + | |
| Utp14 | + | + | + | + | + | + | |
| Utp16 | + | − | − | − | − | − | |
| Noc4 | + | + | + | + | + | + | |
| Utp20 | + | + | + | + | + | + | |
| Utp23 | + | + | + | + | + | + | |
| Utp24 | + | + | + | + | + | + | |
| Utp25 | + | + | + | + | + | + | |
| Dbp8 | + | + | + | + | + | + | |
| Dhr1 | + | + | + | + | + | + | |
| Dhr2 | + | + | + | + | + | + | |
| Emg1 | + | + | + | + | + | + | |
| Krr1 | + | + | + | + | + | + | |
| Rok1 | + | + | + | + | + | + | |
| Rrp3 | + | + | + | + | + | + | |
| Rrp5 | + | + | + | + | + | + | |
| Sof1 | + | + | + | + | + | + | |
| Confirmed ribosomal proteins | RPS4 | + | + | + | + | + | + |
| RPS6 | + | + | + | + | + | + | |
| RPS7 | + | + | + | + | + | + | |
| RPS9 | + | + | + | + | + | + | |
| RPS14 | + | + | + | + | + | + | |
Note.—“+” means the protein present in all or the majority of the member groups of a certain supergroup.
Phylogenetic Distribution of Eukaryotic SSU Processome Protein Homologs in Archaea
| Subcomplex | Protein | Crenarchaeota (18) | Euryarchaeota (48) | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Desulfurococcales (6) | Sulfolobales (5) | Thermoproteales (7) | Archaeoglobales (3) | Halobacteriales (10) | Methanobacteriale (4) | Methanococcales (8) | Methanomicrobial (5) | Methanopyrales (1) | Methanosarcinale (6) | Thermococcales (7) | Thermoplasmales (3) | Nanoarchaeota (1) | |||
| Nonribosomal proteins | U3 snoRNP | Nop1 | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Nop56 | + | + | + | + | + | + | + | + | + | + | + | + | + | ||
| Nop58 | |||||||||||||||
| Snu13 | + | + | + | + | + | + | + | + | + | + | + | + | + | ||
| Mpp10 | Imp3 | + | + | + | + | + | + | + | + | + | + | + | + | + | |
| Imp4 | + | + | + | + | _ | + | + | _ | + | 5 (+) | + | _ | _ | ||
| Unclassified proteins | Utp23 | + | + | + | + | + | + | + | 4 (+) | + | + | + | + | + | |
| Utp24 | |||||||||||||||
| Emg1 | + | + | + | + | _ | _ | 5 (+) | _ | + | 1 (+) | + | + | _ | ||
| Krr1 | + | + | + | + | + | + | + | + | + | + | + | + | + | ||
| Ribosomal proteins | Rps4 | + | + | + | + | + | + | + | + | + | + | + | + | + | |
| Rps6 | + | + | + | + | + | + | + | + | + | + | + | + | + | ||
| Rps9 | + | + | + | + | + | + | + | + | + | + | + | + | + | ||
| Rps14 | + | + | + | + | + | + | + | + | + | + | + | + | + | ||
Note.—“+” represents presence in all species of this clade. “−” represents absence in all species of this clade. N(+) represents only present in N species and absent in others of a given clade.
Domain Composition of the 37 Eukaryote-Specific SSU Processome Proteins
| Protein Name | Domain Composition | Functional Domain in Yeast | Domain in Archaea | Domain in Bacteria | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Rrp5 | Ancient | S1(PF00575.17) | S1 | S1 | |||||||
| Dbp8 | Ancient | DEAD(PF00270.23) | Helicase_C(PF00271.25) | DEAD | Helicase_C | DEAD | Helicase_C | ||||
| Rcl1 | Ancient | RTC(PF01137.15) | RTC_insert(PF05189.7) | RTC | RTC_insert | RTC | RTC_insert | ||||
| Rok1 | Ancient | DEAD(PF00270.23) | Helicase_C(PF00271.25) | DEAD | Helicase_C | DEAD | Helicase_C | ||||
| Rrp3 | Ancient | DEAD(PF00270.23) | Helicase_C(PF00271.25) | DEAD | Helicase_C | DEAD | Helicase_C | ||||
| Bms1 | Ancient + A + E | GTP_EFTU(PF00009.21) | AARP2CN(PF08142.6) | DUF663(PF04950.6) | GTP_EFTU | DUF663 | GTP_EFTU | ||||
| Dhr1 | Ancient + B | DEAD(PF00270.23) | Helicase_C(PF00271.25) | HA2(PF04408.17) | OB_NTP_bind(PF07717.10) | DEAD | Helicase_C | DEAD | Helicase_C | HA2 | OB_NTP_bind |
| Dhr2 | Ancient + B | DEAD(PF00270.23) | Helicase_C(PF00271.25) | HA2(PF04408.17) | OB_NTP_bind(PF07717.10) | DEAD | Helicase_C | DEAD | Helicase_C | HA2 | OB_NTP_bind |
| Rrp9 | B | WD40(PF00400.26) | WD40 | ||||||||
| Utp4 | B | WD40(PF00400.26) | WD40 | ||||||||
| Utp17 | B | WD40(PF00400.26) | WD40 | ||||||||
| Utp18 | B | WD40(PF00400.26) | WD40 | ||||||||
| Sof1 | B + E | WD40(PF00400.26) | Sof1(PF04158.8) | WD40 | |||||||
| Utp1 | B + E | WD40(PF00400.26) | Utp12(PF04003.6) | WD40 | |||||||
| Utp7 | B + E | WD40(PF00400.26) | BING4CT(PF08149.5) | WD40 | |||||||
| Utp12 | B + E | WD40(PF00400.26) | Utp12(PF04003.6) | WD40 | |||||||
| Utp13 | B + E | WD40(PF00400.26) | Utp13(PF08625.5) | WD40 | |||||||
| Utp15 | B + E | WD40(PF00400.26) | UTP15_C(F09384.4) | WD40 | |||||||
| Utp21 | B + E | Utp21(PF04192.6) | WD40(PF00400.26) | WD40 | |||||||
| Mpp10 | E | Mpp10(PF04006.6) | |||||||||
| Utp2 | E | Nop14(PF04147.6) | |||||||||
| Utp3 | E | Sas10_Utp3(PF04000.9) | Sas10_Utp3_C(PF09368.4) | ||||||||
| Utp5 | E | Utp12(PF04003.6) | |||||||||
| Utp6 | E | U3_assoc_6(PF08640.5) | DUF1740(PF08424.4) | ||||||||
| Utp8 | E | Utp8(PF10395.3) | |||||||||
| Utp9 | E | Utp12(PF04003.6) | |||||||||
| Utp10 | E | U3snoRNP10(PF12397.2) | BP28CT(PF08146.6) | ||||||||
| Utp11 | E | Utp11(PF03998.7) | |||||||||
| Utp14 | E | Utp14(PF04615.7) | |||||||||
| Utp16 | E | U3_snoRNA_assoc(PF08297.5) | |||||||||
| Noc4 | E | CBF(PF03914.11) | |||||||||
| Utp20 | E | DRIM(PF07539.6) | |||||||||
| Utp22 | E | Nrap(PF03813.8) | |||||||||
| Utp25 | E | DUF1253(PF06862.6) | |||||||||
| Rps7 | E | Ribosomal_S7e(PF01251.12) | |||||||||
| Rrp7 | E | RRP7(PF12923.1) | |||||||||
| Rrp36 | E | DUF947(PF06102.6) | |||||||||
FMaximum likelihood phylogenetic tree of the 182 obtained homologous sequences to Snu13. The 131 conserved sites in the alignment were used for the tree construction. Numbers at branches represent bootstrap values. The scale bar represents the average number of substitutions per site.
FMaximum likelihood phylogenetic tree of the 165 obtained homologous sequences to Imp4. The 374 conserved sites in the alignment were used for the tree construction. Numbers at branches represent bootstrap values. The scale bar represents the average number of substitutions per site.
FMaximum likelihood phylogenetic tree of the 288 obtained homologous sequences to Nop56 and Nop58. The 390 conserved sites in the alignment were used for the tree construction. Numbers at branches represent bootstrap values. The scale bar represents the average number of substitutions per site.
FSSU processome components traceable to the LECA. SSU processome composition in the LECA based on two alternative roots of the eukaryote tree. In the left-hand tree, excavates are the outgroup. The right-hand tree is rooted on the basis of the unikont/bikont bifurcation. Gains (+) and losses (−) in different lineages are indicated under each scenario. Where gains and losses are equally probable, these are marked by (?).