| Literature DB >> 22479358 |
Marco Mariotti1, Perry G Ridge, Yan Zhang, Alexei V Lobanov, Thomas H Pringle, Roderic Guigo, Dolph L Hatfield, Vadim N Gladyshev.
Abstract
BACKGROUND: Selenium is an essential trace element in mammals due to its presence in proteins in the form of selenocysteine (Sec). Human genome codes for 25 Sec-containing protein genes, and mouse and rat genomes for 24. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2012 PMID: 22479358 PMCID: PMC3316567 DOI: 10.1371/journal.pone.0033066
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Evolution of the vertebrate selenoproteome.
The ancestral vertebrate selenoproteome is indicated in red, and its changes across the investigated vertebrates are depicted along their phylogenetic tree. The ancestral selenoproteins found uniquely in vertebrates are underlined. The creation of a new selenoprotein (here always by duplication of an existing one) is indicated by its name in green. Loss is indicated in grey. Replacement of Sec with Cys is indicated in blue (apart from SelW2c in pufferfish, which is with arginine). Events of conversion of Cys to Sec were not found. On the right, the number of selenoproteins predicted in each species is shown.
Vertebrate Selenoproteins.
| Selenoproteins | Commonly used abbreviations | Fish | Frog | Birds | Mammals | ||
| Platypus | Marsupials | Placentals | |||||
| 15 kDa selenoprotein | Sep15, Sel15 | + | + | + | + | + | + |
| Fish 15 kDa selenoprotein-like | Fep15 | + | |||||
| Glutathione peroxidase 1a | GPx1, GSHPx1, GPx, cGPx | + | + | + | + | + | + |
| Glutathione peroxidase 1b | GPx1b | + | |||||
| Glutathione peroxidase 2 | GPx2, GSHPx-GI, GPRP, GI-GPx, GSGPx-2 | + | + | + | + | + | + |
| Glutathione peroxidase 3 | GPx3, pGPx, GPx-P, GSHPx-3, GSHPx-P, EGPx | + | + | + | + | + | + |
| Glutathione peroxidase 3b | GPx3b | + | |||||
| Glutathione peroxidase 4a | GPx4, PHGPx, MCSP, snGPx, snPHGPx, mtPHGPx | + | + | + | + | + | + |
| Glutathione peroxidase 4b | GPx4b | + | |||||
| Glutathione peroxidase 6 | GPx6, OMP | + | |||||
| Iodothyronine Deiodinase 1 | Dio1, DI1, 5DI, TXDI1, ITDI1 | + | + | + | + | + | + |
| Iodothyronine Deiodinase 2 | Dio2, DI2, D2, 5DII, TXDI2, SelY | + | + | + | + | + | + |
| Iodothyronine Deiodinase 3a | Dio3, DI3, 5DIII, TXDI3 | + | + | + | + | + | + |
| Iodothyronine Deiodinase 3b | Dio3b, DI3b | + | |||||
| Methionine-R-Sulfoxide Reductase 1a | MsrB1, SelR, SelX, SepR, | + | + | + | + | + | + |
| Methionine-R-Sulfoxide Reductase 1b | MsrB1b | + | |||||
| Selenophosphate Synthetase 2a | SPS2a, SEPHS2, Ysg3 | + | + | + | + | + | |
| Selenophosphate Synthetase 2b | SPS2b | + | + | ||||
| Selenoprotein H | SelH, SepH | + | + | + | + | + | + |
| Selenoprotein I | SelI, SepI | + | + | + | + | + | + |
| Selenoprotein J | SelJ | + | |||||
| Selenoprotein J2 | SelJ2 | + | |||||
| Selenoprotein K | SelK, SelG, SepK | + | + | + | + | + | + |
| Selenoprotein L | SelL | + | |||||
| Selenoprotein M | SelM, SepM | + | + | + | + | + | + |
| Selenoprotein N | SelN, SepN1, RSS, MDRS1, RSMD1 | + | + | + | + | + | + |
| Selenoprotein O | SelO, SepO | + | + | + | + | + | + |
| Selenoprotein O2 | SelO2 | + | |||||
| Selenoprotein P | SelP, SeP, SepP1, Se-P, SelPa | + | + | + | + | + | + |
| Selenoprotein Pb | SelPb | + | + | + | + | ||
| Selenoprotein S | SelS, VIMP, ADO15, SBBI8, SepS1, AD-015 | + | + | + | + | + | + |
| Selenoprotein T1a | SelT1a, SepT | + | + | + | + | + | + |
| Selenoprotein T1b | SelT1b | + | |||||
| Selenoprotein T2 | SelT2 | + | |||||
| Selenoprotein U1a | SelU1, SepU1 | + | + | + | |||
| Selenoprotein U1b | SelU1b, SepU1b | + | |||||
| Selenoprotein U1c | SelU1c, SepU1c | + | |||||
| Selenoprotein V | SelV, SepV | + | |||||
| Selenoprotein W1 | SelW1, SeW, SepW1 | + | + | + | + | + | + |
| Selenoprotein W2a | SelW2a | + | + | ||||
| Selenoprotein W2b | SelW2b | + | |||||
| Selenoprotein W2c | SelW2c | + | |||||
| Thioredoxin reductase 1 | TR1, TxnRd1, TxnR, TrxR1, GRIM-12 | + | + | + | + | + | + |
| Thioredoxin reductase 3 | TR3, TR2, TxnRd2, SelZ, TrxR2, TR-Beta | + | + | + | + | + | + |
| Thioredoxin/glutathione reductase | TGR, TR2, TR3, TxnRd3, TrxR3 | + | + | + | + | + | |
Selenoproteins detected by genomic searches in vertebrate genomes are shown. The groups for which a given selenoprotein was found in at least one organism are marked.
Figure 2Replacement of a multiexon SPS2a by an intronless SPS2b.
In the figure, the SPS2 genes found in some representative species are shown. The positions of introns along the protein sequence are displayed with black lines, and the Sec residue is displayed in red. In a few cases, the predicted genes were incomplete because of poor sequence data (e.g., the N-terminal region in platypus). Placental mammals (bottom) possess a single intronless gene, SPS2b. Non-mammalian vertebrates (top) and platypus possess a single multiexon gene, SPS2a. Marsupials (opossum and wallaby) possess both.
Figure 3Multiple sequence alignment of SelV and SelW.
The last 9 residues of SelV exon 1 and exons 2–5 are shown aligned to complete SelW sequences. The last residue of each exon is marked in black and the Sec in red.
Figure 4Phylogenetic tree of GPx family in eukaryotes.
The figure shows a ML tree computed using the JTT substitution model. In the phylogram, Sec-containing proteins are shown in red and Cys-containing homologs are shown in blue. The GPx families are indicated on the right. The distance scale in substitutions per position is indicated at the bottom left. The branch support is shown in red.
Figure 5Multiple sequence alignment of SelI and its homologs.
The multiple sequence alignment of the active site and preceding regions of CHPT1, CEPT1, and SelI is shown. Homologs are labeled with the annotated name. Proteins in the bottom section comprise a large group of diverse proteins containing the same domain. The most critical residues are marked in red. The residue in green marks the end of the first transmembrane domain. The cysteine residue near the active site emerged specifically in SelI proteins is marked in orange.The full length alignment is provided in Supplementary Figure S37 and the corresponding phylogenetic tree in Supplementary Figure S38.
Figure 6SECIS elements of SelM and SelO.
Multiple sequence alignment of SelM (A) and SelO (B) SECIS elements. Critical regions are marked in red.
Mammalian selenoprotein pseudogenes.
| Selenoprotein | # Pseudogenes | Organisms (# pseudogenes) |
| SelT | 9 | Human (2), Chimpanzee (2), Mouse (2), Rabbit (2), Horse |
| GPx1 | 3 | Human (1), Squirrel (1), Rabbit (1) |
| GPx2 | 1 | Human (1) |
| GPx4 | 4 | Mouse (2), Rat (1), Microbat (1) |
| MsrB1 | 3 | Human (1), Chimpanzee (1), Macaque (1) |
| SelH | 5 | Rat (1), Rabbit (1), Shrew (1), Hedgehog (1), Armadillo (1) |
| SelK | >27 | Human (3), Chimpanzee (3), Macaque (3), Galago (2), Mouse (5), Rat (4), Guinea Pig (1), Squirrel (2), Dog (1), Cat (1), Microbat (2) |
| SelS | 1 | Hedgehog (1) |
| SelW | 19 | Human (2), Chimpanzee (2), Orangutan (1), Gibbon (1), Macaque (2), Rat (1), Cow (2), Dog (1), Cat (2), Microbat (1), Hedgehog (1), Elephant (1), Armadillo (2) |
| Sep15 | 5 | Galago (1), Dog (1), Armadillo (2), Opossum (1) |
| SPS2b | 4 | Human (2), Macaque (1), Guinea Pig (1) |
The selenoproteins with pseudogenes, the number of total pseudogenes identified in all mammals, and their occurrence in individual organisms are given.
Figure 7Multiple sequence alignment of selenoprotein genes and pseudogenes.
A. GPx1. Multiple sequence alignment of human and chimpanzee GPx1 pseudogenes. B. SelW. The last residue of each exon is marked in black and Sec in red. Residues marked in green are described in the text. C. SECIS elements of SelW and SelW pseudogene.
Figure 8Multiple sequence alignment of Sep15 and a Sep15 alternative isoform.
The last residue of each exon is marked in black and Sec in red. For human and mouse, ESTs support the presence of the isoform. For the other species shown, the protein sequences were predicted simulating skipping the 4th exon.
Figure 9Phylogeny of SelU family in vertebrates.
ML tree computed using the JTT substitution model. Sec-containing proteins are shown in red, whereas the Cys-containing homologs are shown in blue. At the bottom left, the distance scale in substitutions per position is shown. Branch support is shown along the tree in red.