| Literature DB >> 25566315 |
A Maxwell Burroughs1, L Aravind1.
Abstract
Ribosomal biogenesis has been extensively investigated, especially to identify the elusive nucleases and cofactors involved in the complex rRNA processing events in eukaryotes. Large-scale screens in yeast identified two biochemically uncharacterized proteins, TSR3 and TSR4, as being key players required for rRNA maturation. Using multiple computational approaches we identify the conserved domains comprising these proteins and establish sequence and structural features providing novel insights regarding their roles. TSR3 is unified with the DTW domain into a novel superfamily of predicted enzymatic domains, with the balance of the available evidence pointing toward an RNase role with the archaeo-eukaryotic TSR3 proteins processing rRNA and the bacterial versions potentially processing tRNA. TSR4, its other eukaryotic homologs PDCD2/rp-8, PDCD2L, Zfrp8, and trus, the predominantly bacterial DUF1963 proteins, and other uncharacterized proteins are unified into a new domain superfamily, which arose from an ancient duplication event of a strand-swapped, dimer-forming all-beta unit. We identify conserved features mediating protein-protein interactions (PPIs) and propose a potential chaperone-like function. While contextual evidence supports a conserved role in ribosome biogenesis for the eukaryotic TSR4-related proteins, there is no evidence for such a role for the bacterial versions. Whereas TSR3-related proteins can be traced to the last universal common ancestor (LUCA) with a well-supported archaeo-eukaryotic branch, TSR4-related proteins of eukaryotes are derived from within the bacterial radiation of this superfamily, with archaea entirely lacking them. This provides evidence for "systems admixture," which followed the early endosymbiotic event, playing a key role in the emergence of the uniquely eukaryotic ribosome biogenesis process.Entities:
Keywords: 18S rRNA; 20S; DTW domain; TSR3; TSR4; endosymbiosis; rRNA; tRNA
Year: 2014 PMID: 25566315 PMCID: PMC4275035 DOI: 10.3389/fgene.2014.00424
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1TDD domain alignment and genome contextual information. (A) Multiple sequence alignment of the TDD domain, with predicted secondary structure provided at the top of the alignment. Sequences are labeled with gene name, organism abbreviation, and ncbi gene identifier (gi) number; families are annotated to the right of the alignment. Numbers bookending sequences represent positions of the domain within the sequence. Numbers within alignment represent the number of amino acids excised in regions of poor conservation. The alignment is colored based on following consensus: h, hydrophobic shaded in yellow; p, polar in blue; s, small in green; l, aliphatic in yellow; o, hydroxylic; a, aromatic in yellow; b, big in gray. Conserved residues with predicted roles in catalysis are shaded in red and colored in white. The absolutely-conserved tryptophan residue is shaded in orange and colored in white. Organism abbreviations are expanded in Supplementary Material. (B) Contextual information for TDD domain. Examples of conserved domain architectures and gene neighborhoods identified for TDD families are boxed in purple and orange, respectively, with the TDD domain always colored in orange. Protein-encoding genes are colored in red and non-coding RNA genes are colored in green. Non-conserved genes within a neighborhood are colored in gray. Domain abbreviations: ZnR, zinc ribbon; ZnK, zinc knuckle; U1-ZF, U1-ribonucleoprotein type C2H2 zinc finger; RGG, arginine/glycine/glycine-rich repeat region.
Figure 2TYPP domain alignment and genome contextual information. (A) Multiple sequence alignment of the TYPP domain; formatting and coloring scheme same as Figure 1A. Additionally: u, tiny shaded in green; +, positively-charged in purple; c, charged in purple. Sporadically-conserved CxxC motifs are highlighted. Insert sites and starting points of the two repeats are labeled above secondary structure in gray and black, respectively. Secondary structure is based on the crystal structure of 1PV5. Organism abbreviations are expanded in Supplementary Material. (B) Genome contextual information for TYPP domain. Conserved domain architectures and gene neighborhoods formatted as in Figure 1B. Additional abbreviations: Ank, Ankyrin; TM, transmembrane helix.
Figure 3Structural overview and features of TYPP domain. (A) Topology diagram of TYPP domain provided to the left. Strands from the N- and C-terminal repeats are respectively colored in dark red and green. The loop corresponding to the standard insert region and the MYND domain insert are colored in gray and yellow, respectively. The well-conserved Q residues in each repeat is marked in red. The poorly-conserved helical segment is shown as a dotted line and colored in gray. Cartoon rendering of the TYPP domain (pdb id: 1PV5) is given to the right. Coloring the same as topology diagram. (B) Molecular surface renderings of TYPP domain. Surfaces are colored by repeat congruent to (A). Predicted binding pocket view shown on left. Middle rendering overlays same view on the cartoon depiction. Right view shows the contribution of the insert region to the cleft with polar residues lining the cleft colored yellow.