| Literature DB >> 32265953 |
José Pedro Melo1, Maria Kalyna2, Paula Duque1.
Abstract
To colonize different terrestrial habitats, early land plants had to overcome the challenge of coping with harsh new environments. Alternative splicing - an RNA processing mechanism through which splice sites are differentially recognized, originating multiple transcripts and potentially different proteins from the same gene - can be key for plant stress tolerance. Serine/arginine-rich (SR) proteins constitute an evolutionarily conserved family of major alternative splicing regulators that in plants subdivides into six subfamilies. Despite being well studied in animals and a few plant species, such as the model angiosperm Arabidopsis thaliana and the crop Oryza sativa, little is known of these splicing factors in early land plants. Establishing the whole complement of SR proteins in different species is essential to understand the functional and evolutionary significance of alternative splicing. An in silico search for SR proteins in the extant moss Physcomitrella patens revealed inconsistencies both in the published data and available databases, likely arising from automatic annotation lacking adequate manual curation. These misannotations interfere with the description not only of the number and subfamily classification of Physcomitrella SR proteins but also of their domain architecture, potentially hindering the elucidation of their molecular functions. We therefore advise caution when looking into P. patens genomic resources. Our systematic survey nonetheless confidently identified 16 P. patens SR proteins that fall into the six described subfamilies and represent counterparts of well-established members in Arabidopsis and rice. Intensified research efforts should disclose whether SR proteins were already determining alternative splicing modulation and stress tolerance in early land plants.Entities:
Keywords: Arabidopsis thaliana; Physcomitrella patens; RNA splicing; SR proteins; alternative splicing; evolution; gene annotation; stress
Year: 2020 PMID: 32265953 PMCID: PMC7105729 DOI: 10.3389/fpls.2020.00286
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
FIGURE 1Schematic representation of the Arabidopsis thaliana and Physcomitrella patens SR protein families. All SR proteins are characterized by at least one RNA Recognition Motif (RRM) and an arginine/serine-rich (RS) region. Proteins in the RSZ subfamily include a glycine-rich region (G) after the first RRM. RSZ and RS2Z proteins also contain one and two zinc knuckles (Zn), respectively, with RS2Z members harboring an SP-rich extension after the RS region. The SCL subfamily is characterized by an N-terminal charged extension. The light-green shaded box indicates the three plant-specific SR protein subfamilies. Genes with incorrect reference models in the publicly available databases are marked with an asterisk (∗).
FIGURE 2SR protein numbers in six representative plant species. The column on the right indicates the sources of the data. Gray shading indicates SR proteins requiring further annotation. For Chara braunii, marked with an asterisk, refer to Supplementary Table S1. Values in bold indicate Physcomitrella patens SR proteins manually verified and curated in this study. The green shaded subfamilies are plant-specific. Lines on the left are a schematic representation of the phylogenetic relationships among the species.