| Literature DB >> 24194593 |
Frédéric Romagné1, Didac Santesmasses, Louise White, Gaurab K Sarangi, Marco Mariotti, Ron Hübler, Antje Weihmann, Genís Parra, Vadim N Gladyshev, Roderic Guigó, Sergi Castellano.
Abstract
SelenoDB (http://www.selenodb.org) aims to provide high-quality annotations of selenoprotein genes, proteins and SECIS elements. Selenoproteins are proteins that contain the amino acid selenocysteine (Sec) and the first release of the database included annotations for eight species. Since the release of SelenoDB 1.0 many new animal genomes have been sequenced. The annotations of selenoproteins in new genomes usually contain many errors in major databases. For this reason, we have now fully annotated selenoprotein genes in 58 animal genomes. We provide manually curated annotations for human selenoproteins, whereas we use an automatic annotation pipeline to annotate selenoprotein genes in other animal genomes. In addition, we annotate the homologous genes containing cysteine (Cys) instead of Sec. Finally, we have surveyed genetic variation in the annotated genes in humans. We use exon capture and resequencing approaches to identify single-nucleotide polymorphisms in more than 50 human populations around the world. We thus present a detailed view of the genetic divergence of Sec- and Cys-containing genes in animals and their diversity in humans. The addition of these datasets into the second release of the database provides a valuable resource for addressing medical and evolutionary questions in selenium biology.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24194593 PMCID: PMC3965025 DOI: 10.1093/nar/gkt1045
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Comparison of features in the first and second releases of SelenoDB
| Features | Release 1.0 | Release 2.0 |
|---|---|---|
| Number of species | 8 | 59 |
| Number of protein families | 20 | 28 |
| Number of genes | 81 | 2801 |
| Alternative transcripts | Not present | For one species |
| Variation data | Not present | For one species |
| Curation method | Manual | Manual and automatic |
Figure 1.Human glutathione peroxidase 1 (GPx1) transcript. Note the non-synonymous, synonymous and non-coding SNPs annotated in the transcript sequence. The gene structure and transcript sequence is shown in forward despite being annotated in the reverse strand of the reference human genome.
Figure 2.Lizard selenoprotein I (SelI). Note the predicted Sec (U) in the protein sequence as well as the TAA (#) termination codon. The N-terminal of the protein is missing due to lack of sequence similarity between the protein sequence profile used by Selenoprofiles and a divergent lizard genome sequence.
Protein families annotated in the second release of SelenoDB
| Glutathione peroxidase (GPx) | |
| Iodothyronine deiodinadse (DI) | |
| 15 kDa selenoprotein (Sel15) | |
| 15 kDa selenoprotein-like protein (Fep15) | |
| FrnE (FrnE) | |
| Methionine sulfoxide reductase A (MsrA) | |
| Selenophosphate synthetase (SPS) | |
| Selenoprotein H (SelH) | |
| Selenoprotein I (SelI) | |
| Selenoprotein J (SelJ) | |
| Selenoprotein K (SelK) | |
| Selenoprotein L (SelL) | |
| Selenoprotein M (SelM) | |
| Selenoprotein N (SelN) | |
| Selenoprotein O (SelO) | |
| Selenoprotein P (SelP) | |
| Selenoprotein R (SelR) | |
| Selenoprotein S (SelS) | |
| Selenoprotein T (SelT) | |
| Selenoprotein U (SelU) | |
| Selenoprotein V (SelV) | |
| Selenoprotein W (SelW) | |
| Thioredoxin reductase (TR) | |
| Eukaryotic elongation factor (eEFSec) | |
| Phosphoseryl-tRNA kinase (PSTK) | |
| SECIS binding protein 2 (SBP2) | |
| Selenocysteine synthase (SecS) | |
| tRNA Sec 1 associated protein 1 (SECp43) |
tRNA, transfer RNA.
Figure 3.Variant report for a non-synonymous (Y to H) SNP in the human GPx6 gene. An ancestral T (present in the genome of the ancestor of humans and chimpanzees) has mutated to C in humans reaching higher frequencies in some African populations. Populations are grouped according to their geographical region of origin.