| Literature DB >> 35883471 |
Yan Zhang1,2, Jiao Jin1, Biyan Huang1, Huimin Ying3, Jie He1, Liang Jiang1.
Abstract
Selenium (Se) is an important trace element that mainly occurs in the form of selenocysteine in selected proteins. In prokaryotes, Se is also required for the synthesis of selenouridine and Se-containing cofactor. A large number of selenoprotein families have been identified in diverse prokaryotic organisms, most of which are thought to be involved in various redox reactions. In the last decade or two, computational prediction of selenoprotein genes and comparative genomics of Se metabolic pathways and selenoproteomes have arisen, providing new insights into the metabolism and function of Se and their evolutionary trends in bacteria and archaea. This review aims to offer an overview of recent advances in bioinformatics analysis of Se utilization in prokaryotes. We describe current computational strategies for the identification of selenoprotein genes and generate the most comprehensive list of prokaryotic selenoproteins reported to date. Furthermore, we highlight the latest research progress in comparative genomics and metagenomics of Se utilization in prokaryotes, which demonstrates the divergent and dynamic evolutionary patterns of different Se metabolic pathways, selenoprotein families, and selenoproteomes in sequenced organisms and environmental samples. Overall, bioinformatics analyses of Se utilization, function, and evolution may contribute to a systematic understanding of how this micronutrient is used in nature.Entities:
Keywords: bioinformatics; comparative genomics; evolution; selenium; selenocysteine; selenoprotein
Mesh:
Substances:
Year: 2022 PMID: 35883471 PMCID: PMC9312934 DOI: 10.3390/biom12070917
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1A general scheme of the three Se utilization traits in bacteria. Proteins involved in each pathway are shown in red.
A complete list of currently reported selenoprotein families/subfamilies in prokaryotes.
| Selenoprotein Family or Subfamily Name | Domain ID (Name) | Sec-Related Motif | Representative Sequence (Genbank/Refseq) | Ref. |
|---|---|---|---|---|
|
| ||||
| Formate dehydrogenase alpha subunit * | COG0243 (BisC) | - | WP_010904702.1 | [ |
| Formylmethanofuran dehydrogenase subunit B * | COG1029 (FwdB) | - | CAA67419.1 | [ |
| Selenophosphate synthetase * | COG0709 (SelD) | UxxK | WP_083774555.1 | [ |
| Coenzyme F420-reducing hydrogenase alpha subunit * | COG3259 (FrhA) | UxxC | WP_083774535.1 | [ |
| Methylviologen-reducing (or F420-nonreducing) hydrogenase alpha subunit * | COG3259 (FrhA) | UxxC | P0C1V6.2 | [ |
| Coenzyme F420-reducing hydrogenase delta subunit * | COG1908 (FrhD) | - | WP_010870703.1 | [ |
| Heterodisulfide reductase alpha subunit * | COG1148 (HdrA) | CxxU | WP_162484757.1 | [ |
| HesB-like protein * | TIGR01911 (HesB_rel_seleno) | - | WP_083774540.1 | [ |
| Glycine reductase complex selenoprotein A | pfam04723 (GRDA) | CxxU | WP_079747582.1 | [ |
| Glycine reductase complex selenoprotein B | pfam07355 (GRDB) | UxxC | WP_246895825.1 | [ |
| D-proline reductase | TIGR04483 (D_pro_red_PrdB) | UxxC | WP_079281142.1 | [ |
| Peroxiredoxin (Prx) | COG1225 (Bcp) | TxxU | WP_011365628.1 | [ |
| Thioredoxin (Trx) | pfam00085 (Thioredoxin) | UxxC | WP_010956703.1 | [ |
| Glutaredoxin (Grx) | pfam00462 (Glutaredoxin) | UxxC | WP_010943784.1 | [ |
| Methione sulfoxide reductase A | COG0225 (MsrA) | - | MBI4965933.1 | [ |
| Arsenite methyltransferase | PRK11873 (arsM) | - | WP_011987699.1 | [ |
|
| ||||
| Radical SAM domain protein | TIGR04167 (rSAM_SeCys) | - | AAR34688.1 | [ |
| Rhodanese-like domain-containing protein | pfam00581 (Rhodanese) | - | WP_010941598.1 | [ |
| Rhodanese-related sulfurtransferase COG0607 form 1 | COG0607 (PspE) | - | MBM9537886.1 | [ |
| Rhodanese-related sulfurtransferase COG0607 form 2 | COG0607 (PspE) | CxU | TKB26178.1 | [ |
| Prx-like thiol:disulfide oxidoreductase * | pfam00578 (AhpC-TSA) | UxxC, UxxU | WP_010940744.1 | [ |
| Thiol:disulfide interchange protein | pfam13098 (Thioredoxin_2) | UxxC | WP_011366075.1 | [ |
| Selenoprotein W (SELENOW)-like protein | pfam10262 (Rdx) | CxxU | AOH51717.1 | [ |
| Glutathione peroxidase (GPX)-like protein | pfam00255 (GSHPx) | UxxT | WP_010957027.1 | [ |
| Homolog of AhpF N-terminal domain (Grx-like domain protein) | TIGR02187 (GlrX_arch) | UxxC | ABB15282.1 | [ |
| DsbG-like protein | pfam13462 (Thioredoxin_4) | UxxC | [ | |
| Fe-S oxidoreductase-like protein | COG0247 (GlpC) | - | WP_174406253.1 | [ |
| DsrE-like protein | pfam02635 (DsrE) | UxxC | WP_014524487.1 | [ |
| FAD-dependent oxidoreductase (CoA-disulfide reductase, NADH oxidase) | COG0446 (FadH2) | - | WP_011365774.1 | [ |
| Distant Alkylhydroperoxidase (AhpD) homolog | COG0599 (YurZ) | CxxU |
| [ |
| AhpD-like protein | COG2128 (YciW) | CxxU | MCB9421940.1 | [ |
| Arsenate reductase | COG1393 (ArsC) | UxxS | MBT3519430.1 | [ |
| Molybdopterin-synthase adenylyltransferase MoeB | COG0476 (ThiF) | - | MBT7809913.1 | [ |
| DsbA-like protein | pfam01323 (DSBA) | UxxC | NIP15863.1 | [ |
| Glutathione S-transferase-like (GST-like) | COG0625 (GstA) | - |
| [ |
| Deiodinase-like protein | pfam00837 (T4_deiodinase) | UxxC |
| [ |
| Thiol-disulfide isomerase-like protein | pfam13905 (Thioredoxin_8) | UxxC |
| [ |
| Carboxymuconolactone decarboxylase(CMD)-like protein | pfam02627 (CMD) | CxxU | MBW1767730.1 | [ |
| Hypothetical protein 1 (Sargasso Sea metagenome) | - | CxxU |
| [ |
| OsmC-like protein | COG1765 (YhfA) | UxxT |
| [ |
| Rhodanase-related sulfurtransferase | COG2897 (SseA) | - |
| [ |
| NADH:ubiquinone oxidoreductase subunit E | COG2209 (NqrE) | TxxU | - | [ |
| Putative mercuric transport protein | pfam02411 (MerT) | - | ABB16073.1 | [ |
| Cation-transporting ATPase, E1-E2 family | COG2217 (ZntA) | UxxC | ABB15669.1 | [ |
| Methylated-DNA-protein-cysteine methyltransferase | COG0350 (AdaB) | - | ABB14497.1 | [ |
| UGSC-containing protein | - | UxxC | ABI76733.1 | [ |
| DUF3179 domain-containing protein | pfam11376 (DUF3179) | UxxC/T |
| [ |
| YHS domain-containing protein | pfam04945 (YHS) | - | - | [ |
| Putative redox protein | - | - |
| [ |
| DUF166 domain-containing protein | pfam02593 (DUF166) | - | - | [ |
| DUF1573 domain-containing protein | pfam07610 (DUF1573) | UGC |
| [ |
| Hypothetical protein OS_HP3 | - | - | SMF39960.1 | [ |
| Putative mercuric reductase | PRK13748 (PRK13748) | UxxU |
| [ |
| Hypothetical protein OS_HP4 | - | UxxC | - | [ |
| Cobalamin synthesis protein CobW-like | COG0523 (YejR) | UxxC |
| [ |
| AhpC/TSA family protein | pfam13911 (AhpC-TSA_2) | UxxS |
| [ |
| Hypothetical protein OS_HP5 | - | - | - | [ |
| Distant Grx-like protein 1 | TIGR02196 (GlrX_YruB) | UxxT | MBW2590879.1 | [ |
| Arsenate reductase-like protein | COG1393 (ArsC) | UxxC |
| [ |
| Fe-S cluster domain-containing protein | PRK07118 (PRK07118) | UxxC |
| [ |
| (2Fe-2S)-binding protein (copper chaperone Copz family) form 1 | cd10141 (CopZ-like_Fer2_BFD-like) | - | WP_245779778.1 | [ |
| (2Fe-2S)-binding protein (copper chaperone Copz family) form 2 | cd10141 (CopZ-like_Fer2_BFD-like) | - | MBF1269327.1 | [ |
| Hypothetical protein predicted in Moorella thermoacetica | - | - |
| [ |
| Alkylmercury lyase MerB-like protein | pfam03243 (MerB) | - | WP_238493467.1 | [ |
| DUF1858 domain-containing protein | pfam08984 (DUF1858) | CxxU | WP_012065717.1 | [ |
| Proline reductase-associated electron transfer protein PrdC form 1 | TIGR04481 (PR_assoc_PrdC) | CxxU | WP_243183503.1 | [ |
| Proline reductase-associated electron transfer protein PrdC form 2 | TIGR04481 (PR_assoc_PrdC) | - | WP_245122565.1 | [ |
| cytochrome c family protein | pfam13435 (Cytochrome_C554) | - |
| [ |
| MtrB/PioB family outer membrane beta-barrel protein | pfam11854 (MtrB_PioB) | - |
| [ |
| UshA-like protein | COG0737 (UshA) | CxU |
| [ |
| C-GCAxxG-C-C family protein | pfam09719 (C_GCAxxG_C_C) | - | WP_012158890.1 | [ |
| CO dehydrogenase/acetyl-CoA synthase gamma subunit | COG1456 (CdhE) | - |
| [ |
| YeeE/YedE family protein | pfam04143 (Sulf_transp) | - | WP_012471001.1 | [ |
| UGC-containing Prx-like protein | pfam00578 (AhpC-TSA) | UGC |
| [ |
| Ferredoxin-thioredoxin reductase | COG4802 (FtrB) | CxU |
| [ |
| Trypsin-like serine protease | pfam00089 (Trypsin) | - | - | [ |
| Putative regulatory protein, FmdB family | TIGR02605 (CxxC_CxxC_SSSS) | U/CxxU | - | [ |
| PDZ domain-containing protein | pfam13899 (Thioredoxin_7) | CxxU |
| [ |
| Hypothetical protein GOS_A | - | - | - | [ |
| Hypothetical protein GOS_B | - | - | NBR19009.1 | [ |
| Hypothetical protein GOS_C | cd02973 (TRX_GRX_like) | UxxC |
| [ |
| Redoxin family protein | - | UxxC |
| [ |
| Crotonase/enoyl-CoA hydratase family protein | PRK06023 (PRK06023) | - |
| [ |
| Cobalamin binding protein BtuF | cd01144 (BtuF) | CxxU |
| [ |
| KCU-star family selenoprotein (or DUF466 protein) | NF033934 (KCU-star) | - |
| [ |
| Thioredoxin-like selenoprotein Sec.1 | pfam13192 (Thioredoxin_3) | CxU | WP_232817751.1 | [ |
| Thioredoxin-like selenoprotein Sec.2 | pfam13192 (Thioredoxin_3) | UxC | WP_218069652.1 | [ |
* Selenoprotein families detected in both archaea and bacteria. ** Italic font: only truncated form of selenoprotein is annotated (no Sec included).
Figure 2Distribution of the top ten selenoprotein families in Sec-utilizing prokaryotes. (a) Bacteria; (b) archaea. Data used to generate this figure can be found in refs. [46,47,65].