Mitochondria and chloroplasts host genomes inherited from their bacterial ancestors. Their biogenesis thus requires fully functional gene expression machineries. RNA metabolism in organelles is particular and has attracted considerable attention since it combines prokaryotic features with unique traits that evolved in their eukaryotic host cell. For a long time, most of the molecular factors governing these processes have remained elusive. Recent research has shown that proteins of the pentatricopeptide repeat (PPR) family are major players of organelle gene expression.PPR proteins make a large group of eukaryote-specific RNA-binding proteins encoded in the nucleus and predominantly located to organelles. This protein family has been identified over a decade ago and has been the subject of a fast growing number of studies ever since.- PPR proteins have escaped identification until complete genomic sequences were available because their primary sequences are very degenerate. Their identifying features are motifs of ca. 35 amino acids occurring as tandem arrays. Contrary to the sequence, the tri-dimensional organization of these modules seems highly conserved, with each repeat arranged in a helix-turn-helix structure and the succession of PPR motifs forming a solenoid.PPR proteins are ubiquitous in eukaryotes and are totally absent from bacteria, with the exception of a small number of plant parasites such as Ralstonia that probably acquired PPR genes by horizontal gene transfer. Still, the distribution of PPR genes among eukaryote clades is very uneven. They are for example present in relatively small numbers in fungi and animals, they occur in higher numbers in trypanosomes, and are present in very large numbers in higher plants. In the latter, hundreds of PPR genes are present, representing as much as 1% of nuclear genomes.The identification and functional analysis of PPR proteins has already helped to solve long-standing questions regarding organelle gene expression. It has for example enabled to understand how sequence specificity is achieved for hundreds of cytidine to uridine editing sites present in higher plant organelle transcriptomes. It has also settled the long-lasting debate on the existence of protein-only enzymes holding RNase P activity., Since their discovery, the function of PPR proteins has been associated with most aspect of gene expression in organelle, i.e., transcription, RNA splicing, RNA editing, RNA ends maturation, and translation. Research on PPR proteins is thus instrumental to understand organelle RNA metabolism.Initial functional data on PPR proteins have mainly been derived from genetic studies. The most recent advances now unravel the mode of action of PPR proteins. Because of their modular structure, it had been proposed from the start that RNA recognition by PPR proteins would involve the recognition of specific RNA moieties by individual PPR motifs. Such a recognition code has indeed been uncovered. The occurrence of specific amino acids mainly at two to three given positions in PPR motifs accounts for the specificity of the respective motifs for individual nucleotides.-This special issue of RNA Biology covers all the major aspects and recent advances on PPR proteins throughout eukaryotes. It features four original articles as well as 12 in-depth reviews. Individual reviews deal with the current knowledge on PPR proteins mode of action, on their functions in yeast, in animals, in trypanosomes, in green algae, in the moss Physcomitrella patens, and in higher plant chloroplasts. Other reviews describe specific functional groups of PPR proteins; i.e., P-class PPR proteins required for the end maturation of mitochondrial transcripts, PPR proteins belonging to the subgroup of “restorer of fertility” proteins, and PPR proteins involved in RNA editing. Finally, other reviews deal with PPR proteins containing specific additional domains such as the “Small MutS-Related” (SMR) domain and the “Nedd4-BP1 YacP Nuclease” (NYN) domain, characterizing PPR proteins holding RNase P activity. Original articles describe the identification of DYW domains similar to those of land plant editing factors in diverse eukaryotes, identify novel RNA editing factors in Arabidopsis, and show that in plants the dual targeting of PPR proteins to mitochondria and plastids is more widespread than previously envisaged.Despite the progress in understanding PPR proteins biology, many important challenges remain for the future. A first aspect will be to better understand how the function of PPR proteins is integrated in a wider context, i.e., to know how PPR functions are connected with other gene expression pathways and in general with other cellular processes. Answers to this question might be brought by the determination of both protein–protein and protein–RNA interaction networks of PPR proteins and the biochemical characterization of complexes involving PPR proteins.Another critical question will be to understand at the atomic level how the PPR code functions, i.e., how base specificity is achieved by individual residues in PPR motifs. Answers to this question might come through the determination of tri-dimensional structures of PPR proteins in complex with their RNA substrates.The full exploitation of the potential held by PPR proteins also remains a major challenge for the future. Custom designed PPR proteins obtained through the reshuffling of PPR motifs or through point mutations are expected to specifically recognize any RNA sequence and/or structure of interest. The possibility to design protein specificity toward RNA obviously opens appealing perspectives for a tremendous number of biotechnological applications.Finally, the thorough understanding of PPR proteins mode of action might help to derive universal rules for target recognition by a much wider group of proteins. Indeed, according to their tri-dimensional fold, PPR proteins fall into the large family of helical-repeat proteins that bind nucleic acids, which includes PUF, TALE, HAT, mTERF, and OPR proteins families. Similar to PPR proteins, recognition codes connecting individual protein motifs and nucleotides identity have been described for both PUF and TALE proteins., It is thus possible that similar codes remain to be identified for the other protein families and that grand rules exist to define how target are recognized by all these solenoid forming proteins.
Authors: Johann Holzmann; Peter Frank; Esther Löffler; Keiryn L Bennett; Christopher Gerner; Walter Rossmanith Journal: Cell Date: 2008-10-31 Impact factor: 41.582
Authors: Anthony Gobert; Bernard Gutmann; Andreas Taschner; Markus Gössringer; Johann Holzmann; Roland K Hartmann; Walter Rossmanith; Philippe Giegé Journal: Nat Struct Mol Biol Date: 2010-05-16 Impact factor: 15.369
Authors: Alice Barkan; Margarita Rojas; Sota Fujii; Aaron Yap; Yee Seng Chong; Charles S Bond; Ian Small Journal: PLoS Genet Date: 2012-08-16 Impact factor: 5.917