Literature DB >> 19179357

SECISaln, a web-based tool for the creation of structure-based alignments of eukaryotic SECIS elements.

Charles E Chapple1, Roderic Guigó, Alain Krol.   

Abstract

SUMMARY: Selenoproteins contain the 21st amino acid selenocysteine which is encoded by an inframe UGA codon, usually read as a stop. In eukaryotes, its co-translational recoding requires the presence of an RNA stem-loop structure, the SECIS element in the 3 untranslated region of (UTR) selenoprotein mRNAs. Despite little sequence conservation, SECIS elements share the same overall secondary structure. Until recently, the lack of a significantly high number of selenoprotein mRNA sequences hampered the identification of other potential sequence conservation. In this work, the web-based tool SECISaln provides for the first time an extensive structure-based sequence alignment of SECIS elements resulting from the well-defined secondary structure of the SECIS RNA and the increased size of the eukaryotic selenoproteome. We have used SECISaln to improve our knowledge of SECIS secondary structure and to discover novel, conserved nucleotide positions and we believe it will be a useful tool for the selenoprotein and RNA scientific communities. AVAILABILITY: SECISaln is freely available as a web-based tool at http://genome.crg.es/software/secisaln/.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19179357      PMCID: PMC2647825          DOI: 10.1093/bioinformatics/btp020

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


Selenoproteins are a diverse family of proteins characterized by the presence of the 21st amino acid, selenocysteine (Sec or U). Selenocysteine is co-translationally inserted into the growing polypeptide chain in response to UGA, otherwise read as a stop codon. The correct recoding of UGA to Sec requires the presence of a stem-loop structure, the SECIS element in the 3 untranslated region (UTR) of selenoprotein gene transcripts. Accordingly, the presence of a suitable SECIS element has been used in many studies as a tool for the computational prediction of novel selenoproteins (Castellano et al., 2001; Kryukov et al., 1999; Lescure et al.; 1999) and a specialized tool for SECIS prediction, SECISearch (Kryukov et al., 2003), has already been described and has been widely used. There are two types of eukaryotic SECISes, type I and type II differing at the apex by the presence of the additional helix 3 in type II (Fagegaltier et al., 2000; Grundner-Culemann et al., 1999; Walczak et al., 1996 see Fig. 1. Although the SECIS structure is conserved, there is little sequence conservation beyond the consecutive non-Watson-Crick base pairs UGAN/KGAW constituting the quartet, an unpaired A 5 to UGAN and a run of As in the apical loop/internal loop 2 (Fagegaltier et al., 2000; Walczak et al., 1996). Of these only the UGA/GA of the quartet is invariable (e.g. Buettner et al., 1996; Lobanov et al., 2007). Here, we describe SECISaln, a web-based tool that creates structure-based alignments of an extensive dataset of eukaryotic SECIS sequences. Its implementation led us to uncover novel, conserved sequence elements.
Fig. 1.

Eukaryotic SECIS element consensus sequence. Novel conserved residues are shown in magenta. Where a specific nucleotide is shown, it was observed in that position in 50% or more of the aligned sequences. Where a class of nucleotides is shown, that class was observed in that position in 70% or more of the aligned sequences. Y=U or C, K=G or U, N=any nucleotide, W=A or U, R=A or G, M=A or C. Quartet: four consecutive non-Watson–Crick base pairs. Base pairs forming the quartet were called abcd/a′b′c′d′ for the sake of clarity in the text. Position ‘z’ is the first nucleotide after the run of Ms, positions 2H3/2′H3 are the second base pair of Helix 3 and 1ap the first nucleotide of the apical loop. The range of possible lengths for helix 1 is hard to determine because it depends on the local 2D structure of the mRNA 3′UTR.

Eukaryotic SECIS element consensus sequence. Novel conserved residues are shown in magenta. Where a specific nucleotide is shown, it was observed in that position in 50% or more of the aligned sequences. Where a class of nucleotides is shown, that class was observed in that position in 70% or more of the aligned sequences. Y=U or C, K=G or U, N=any nucleotide, W=A or U, R=A or G, M=A or C. Quartet: four consecutive non-Watson–Crick base pairs. Base pairs forming the quartet were called abcd/a′b′c′d′ for the sake of clarity in the text. Position ‘z’ is the first nucleotide after the run of Ms, positions 2H3/2′H3 are the second base pair of Helix 3 and 1ap the first nucleotide of the apical loop. The range of possible lengths for helix 1 is hard to determine because it depends on the local 2D structure of the mRNA 3′UTR. SECISaln will predict a SECIS element in the query sequence, split it into its constituent parts and align these against a precompiled database of eukaryotic SECIS elements. The user can choose whether the database sequences are sorted by protein family or by species, thereby offering the possibility of comparing the submitted sequence to other, known SECISes. In addition, SECISaln returns a graphical image of the predicted structure of the user-submitted sequence as well as a multiple structural alignment of all SECIS elements of that type already present in the database. SECISaln uses SECISearch for the SECIS prediction step, described in detail in (Kryukov et al. (2003) and is not intended as a replacement for SECISearch. Our patterns and free-energy cutoffs are not stringent and will result in a high false positive rate if used to identify novel SECIS elements. Ideally, SECISaln should be used on sequences which are known to contain a SECIS element, and its main application is the detailed characterization of structural features in the identified SECIS elements, through the multiple structural comparison to other known SECIS elements. In addition to being the first structural alignment tool for SECIS elements, SECISaln also provides the largest available, manually curated collection of eukaryotic SECISes. Our SECIS collection was built by searching for homologs of all known eukaryotic selenoproteins in NCBIs Refseq mRNA and TIGRs EGO databases. We ran TBLASTN searches using the human (when available, other species when not) selenoproteins as queries. We then extracted the relevant mRNA sequence from the database and identified its SECIS element. We also manually added insect SECIS sequences that had been previously identified (Chapple and Guigó, 2008), but which are not yet present in mRNA databases. This process resulted in a collection of 62 type I and 224 type II SECISes, a clear indication that type II constitute the major part of SECIS elements. Interestingly, although all selenoprotein families had a type II SECIS in at least one species, SelO, SelT, MsrA, DI2, SelS, 15kDa, TR3, SelI, Gpx3 and TR2 had type II SECISes in all species investigated. GPx1 and DI1 had type I SECISes in all species except Danio rerio. Analyzing the structural alignments produced by SECISaln provided a more detailed picture of SECIS structural features. For instance the length of helix 2, which was previously set to 14 bp, is less constrained and ranges in fact from 11 bp to 16 bp. SECISaln also highlighted previously unknown conserved residues in eukaryotic SECIS elements (see Supplementary Table 1), which can be summarized as a new consensus core sequence for eukaryotic SECIS elements as shown in Figure 1. Most striking of these is an overrepresentation of G at position 1 (3 to abcd) and a corresponding overrepresentation of Y (C or U) at position 1. We also observed a clear overrepresentation of U in type I elements, and Y in type II at position −4. This is particularly surprising since no cross-species sequence conservation has ever been observed five to the quartet, with the exception of the conserved R, and may be connected to the SBP2-SECIS contacts observed in this area (Cléry et al., 2007; Fletcher et al., 2001) In conclusion, we believe that SECISaln, as has already been demonstrated by the analyses presented here, will be a very useful tool for the analysis and understanding of SECIS elements.
  13 in total

1.  Structural analysis of new local features in SECIS RNA hairpins.

Authors:  D Fagegaltier; A Lescure; R Walczak; P Carbon; A Krol
Journal:  Nucleic Acids Res       Date:  2000-07-15       Impact factor: 16.971

2.  In silico identification of novel selenoproteins in the Drosophila melanogaster genome.

Authors:  S Castellano; N Morozova; M Morey; M J Berry; F Serras; M Corominas; R Guigó
Journal:  EMBO Rep       Date:  2001-08       Impact factor: 8.807

3.  The selenocysteine incorporation machinery: interactions between the SECIS RNA and the SECIS-binding protein SBP2.

Authors:  J E Fletcher; P R Copeland; D M Driscoll; A Krol
Journal:  RNA       Date:  2001-10       Impact factor: 4.942

4.  New mammalian selenocysteine-containing proteins identified with an algorithm that searches for selenocysteine insertion sequence elements.

Authors:  G V Kryukov; V M Kryukov; V N Gladyshev
Journal:  J Biol Chem       Date:  1999-11-26       Impact factor: 5.157

5.  Novel selenoproteins identified in silico and in vivo by using a conserved RNA structural motif.

Authors:  A Lescure; D Gautheret; P Carbon; A Krol
Journal:  J Biol Chem       Date:  1999-12-31       Impact factor: 5.157

6.  A highly efficient form of the selenocysteine insertion sequence element in protozoan parasites and its use in mammalian cells.

Authors:  Sergey V Novoselov; Alexey V Lobanov; Deame Hua; Marina V Kasaikina; Dolph L Hatfield; Vadim N Gladyshev
Journal:  Proc Natl Acad Sci U S A       Date:  2007-04-30       Impact factor: 11.205

7.  Characterization of mammalian selenoproteomes.

Authors:  Gregory V Kryukov; Sergi Castellano; Sergey V Novoselov; Alexey V Lobanov; Omid Zehtab; Roderic Guigó; Vadim N Gladyshev
Journal:  Science       Date:  2003-05-30       Impact factor: 47.728

8.  An improved definition of the RNA-binding specificity of SECIS-binding protein 2, an essential component of the selenocysteine incorporation machinery.

Authors:  A Cléry; V Bourguignon-Igel; C Allmang; A Krol; C Branlant
Journal:  Nucleic Acids Res       Date:  2007-03-01       Impact factor: 16.971

9.  Relaxation of selective constraints causes independent selenoprotein extinction in insect genomes.

Authors:  Charles E Chapple; Roderic Guigó
Journal:  PLoS One       Date:  2008-08-13       Impact factor: 3.240

10.  Evolutionary dynamics of eukaryotic selenoproteomes: large selenoproteomes may associate with aquatic life and small with terrestrial life.

Authors:  Alexey V Lobanov; Dmitri E Fomenko; Yan Zhang; Aniruddha Sengupta; Dolph L Hatfield; Vadim N Gladyshev
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

View more
  16 in total

Review 1.  Threading the needle: getting selenocysteine into proteins.

Authors:  Jesse Donovan; Paul R Copeland
Journal:  Antioxid Redox Signal       Date:  2010-04-01       Impact factor: 8.401

2.  Bioinformatics of Selenoproteins.

Authors:  Didac Santesmasses; Marco Mariotti; Vadim N Gladyshev
Journal:  Antioxid Redox Signal       Date:  2020-04-23       Impact factor: 8.401

3.  Reconstitution of selenocysteine incorporation reveals intrinsic regulation by SECIS elements.

Authors:  Nirupama Gupta; Louise W DeMong; Sowmya Banda; Paul R Copeland
Journal:  J Mol Biol       Date:  2013-04-23       Impact factor: 5.469

4.  The selenocysteine-specific elongation factor contains a novel and multi-functional domain.

Authors:  Jonathan N Gonzalez-Flores; Nirupama Gupta; Louise W DeMong; Paul R Copeland
Journal:  J Biol Chem       Date:  2012-09-19       Impact factor: 5.157

5.  Processive incorporation of multiple selenocysteine residues is driven by a novel feature of the selenocysteine insertion sequence.

Authors:  Sumangala P Shetty; Ryan Sturts; Michael Vetick; Paul R Copeland
Journal:  J Biol Chem       Date:  2018-10-15       Impact factor: 5.157

6.  Selenoprofiles: profile-based scanning of eukaryotic genome sequences for selenoprotein genes.

Authors:  M Mariotti; R Guigó
Journal:  Bioinformatics       Date:  2010-09-21       Impact factor: 6.937

7.  Identification, characterization of selenoprotein W and its mRNA expression patterns in response to somatostatin 14, cysteamine hydrochloride, 17β-estradiol and a binary mixture of 17β-estradiol and cysteamine hydrochloride in topmouth culter (Erythroculter ilishaeformis).

Authors:  Haiyan Dong; Wenbo Chen; Chao Sun; Jianwei Sun; Yanlin Wang; Chao Xie; Qianwen Fu; Junjie Zhu; Jinyun Ye
Journal:  Fish Physiol Biochem       Date:  2016-08-09       Impact factor: 2.794

Review 8.  Eukaryotic selenoproteins and selenoproteomes.

Authors:  Alexey V Lobanov; Dolph L Hatfield; Vadim N Gladyshev
Journal:  Biochim Biophys Acta       Date:  2009-05-27

9.  Selenocysteine insertion sequence binding protein 2L is implicated as a novel post-transcriptional regulator of selenoprotein expression.

Authors:  Jesse Donovan; Paul R Copeland
Journal:  PLoS One       Date:  2012-04-17       Impact factor: 3.240

10.  Novel structural determinants in human SECIS elements modulate the translational recoding of UGA as selenocysteine.

Authors:  Lynda Latrèche; Olivier Jean-Jean; Donna M Driscoll; Laurent Chavatte
Journal:  Nucleic Acids Res       Date:  2009-08-03       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.