| Literature DB >> 17626042 |
Yan Zhang1, Vadim N Gladyshev.
Abstract
Selenocysteine (Sec) and pyrrolysine (Pyl) are rare amino acids that are cotranslationally inserted into proteins and known as the 21st and 22nd amino acids in the genetic code. Sec and Pyl are encoded by UGA and UAG codons, respectively, which normally serve as stop signals. Herein, we report on unusually large selenoproteomes and pyrroproteomes in a symbiont metagenomic dataset of a marine gutless worm, Olavius algarvensis. We identified 99 selenoprotein genes that clustered into 30 families, including 17 new selenoprotein genes that belong to six families. In addition, several Pyl-containing proteins were identified in this dataset. Most selenoproteins and Pyl-containing proteins were present in a single deltaproteobacterium, delta1 symbiont, which contained the largest number of both selenoproteins and Pyl-containing proteins of any organism reported to date. Our data contrast with the previous observations that symbionts and host-associated bacteria either lose Sec utilization or possess a limited number of selenoproteins, and suggest that the environment in the gutless worm promotes Sec and Pyl utilization. Anaerobic conditions and consistent selenium supply might be the factors that support the use of amino acids that extend the genetic code.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17626042 PMCID: PMC1976440 DOI: 10.1093/nar/gkm514
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.A schematic diagram of the search algorithm. Details of the search process are provided in Materials and methods section.
Known selenoprotein families identified in the Olavius algarvensis symbionts
| Protein family | Total selenoproteins | Number of Cys homolog | |||||
|---|---|---|---|---|---|---|---|
| δ1 | δ4 | γ1 | γ3 | Unassigned | |||
| F420-reducing hydrogenase, delta subunit (FrhD) | 12 | 5 | 2 | 0 | 0 | 5 | 6 |
| Heterodisulfide reductase, subunit A (HdrA) | 10 | 4 | 4 | 0 | 0 | 2 | 3 |
| Rhodanese-related sulfurtransferase | 8 | 4 | 2 | 0 | 0 | 2 | 0 |
| AhpD-like | 7 | 5 | 1 | 0 | 0 | 1 | 4 |
| Prx-like thiol:disulfide oxidoreductase | 6 | 2 | 3 | 0 | 0 | 1 | 4 |
| Proline reductase (PR) | 5 | 5 | 0 | 0 | 0 | 0 | 0 |
| Formate dehydrogenase alpha subunit (FdhA) | 4 | 2 | 1 | 0 | 0 | 1 | >10 |
| Sulfurtransferase COG2897 | 3 | 1 | 2 | 0 | 0 | 0 | 4 |
| DsrE-like | 3 | 2 | 1 | 0 | 0 | 0 | 0 |
| DsbA-like | 2 | 2 | 0 | 0 | 0 | 0 | 0 |
| F420-reducing hydrogenase, alpha subunit (FrhA) | 2 | 1 | 1 | 0 | 0 | 0 | 3 |
| Selenophosphate synthetase (SelD) | 2 | 1 | 1 | 0 | 0 | 0 | 1 |
| HesB-like | 2 | 1 | 0 | 0 | 0 | 1 | 0 |
| Fe-S oxidoreductase (GlpC) | 2 | 1 | 1 | 0 | 0 | 0 | 10 |
| Distant AhpD homolog | 2 | 2 | 0 | 0 | 0 | 0 | 2 |
| Sulfurtransferase COG0607 | 2 | 1 | 0 | 0 | 0 | 1 | >10 |
| Methione sulfoxide reductase A (MsrA) | 2 | 1 | 1 | 0 | 0 | 0 | 6 |
| Methylated-DNA-protein-cysteine methyltransferase | 2 | 0 | 0 | 0 | 0 | 2 | 8 |
| DsbG-like | 1 | 0 | 0 | 0 | 0 | 1 | 0 |
| Peroxiredoxin (Prx) | 1 | 1 | 0 | 0 | 0 | 0 | 4 |
| Thioredoxin (Trx) | 1 | 1 | 0 | 0 | 0 | 0 | >10 |
| NADH oxidase | 1 | 1 | 0 | 0 | 0 | 0 | 1 |
| Glutaredoxin | 1 | 0 | 0 | 0 | 0 | 1 | 2 |
| UGSC-containing protein | 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| SelW-like | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Glutathione peroxidase (GPx) | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Homolog of AhpF N-terminal domain | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| Thiol:disulfide interchange protein | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| Glycine reductase selenoprotein A (GrdA) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Glycine reductase selenoprotein B (GrdB) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Arsenate reductase | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Molybdopterin biosynthesis MoeB protein | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| Glutathione S-transferase (GST) | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Deiodinase-like | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Thiol-disulfide isomerase-like protein | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| Hypothetical protein 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| OsmC-like protein | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| NADH:ubiquinone oxidoreductase | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| Radical SAM domain protein | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Putative mercuric transport protein | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Cation-transporting ATPase, E1-E2 family | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
*Homologs of known thiol-based oxidoreductases or thioredoxin-like fold proteins.
Figure 2.Multiple sequence alignment of FrhD family. Conserved residues are highlighted. Sec (U) and the corresponding Cys (C) residues are shown in red and blue, respectively.
Figure 3.Multiple sequence alignment of several known selenoprotein families containing new features. New Sec positions are shown in pink. Contigs containing these new features are also highlighted in green background. (A) Rhodanese-related sulfurtransferase; (B) F420-reducing hydrogenase, alpha subunit.
Novel selenoproteins identified in the Olavius algarvensis symbionts
| Protein family | Total selenoproteins | Number of Cys homolog | |||||
|---|---|---|---|---|---|---|---|
| δ1 | δ4 | γ1 | γ3 | Unassigned | |||
| YHS domain protein | 5 | 4 | 0 | 0 | 0 | 1 | 2 |
| Putative redox protein | 3 | 3 | 0 | 0 | 0 | 0 | 2 |
| OS_HP1 | 3 | 3 | 0 | 0 | 0 | 0 | 2 |
| Conserved protein COG1810 | 2 | 1 | 1 | 0 | 0 | 0 | 0 |
| OS_HP2 | 2 | 1 | 1 | 0 | 0 | 0 | 0 |
| OS_HP3 | 2 | 1 | 1 | 0 | 0 | 0 | 0 |
*OS_HP, Olavius symbiont's hypothetical protein.
Figure 4.Multiple sequence alignments of new selenoproteins and their Cys homologs. The alignments show Sec-flanking regions in detected proteins. Both selenoprotein sequences detected in the Olavius symbionts’ metagenome dataset and their Sec/Cys-containing homologs present in indicated organisms are shown. Conserved residues are highlighted. Predicted Sec (U) and the corresponding Cys (C) residues in other homologs are shown in red and blue, respectively.
Figure 5.Predicted bacterial SECIS elements in representative sequences of new selenoprotein families. Only sequences downstream of in-frame UGA codons are shown. In-frame UGA codons and conserved guanosines in the apical loop are shown in red. (A) YHS domain protein, AASZ01000529; (B) Putative redox protein, AASZ01002486; (C) OS_HP1, AASZ01000351; (D) Conserved protein (COG1810), AASZ01000538; (E) OS_HP3, AASZ01001720.
Known Pyl-containing proteins and Pyl operon proteins identified in the Olavius algarvensis symbionts
| Protein family | Total sequences | The | Other homologs | ||||
|---|---|---|---|---|---|---|---|
| δ1 | δ4 | γ1 | γ3 | Unassigned | |||
| MtmB | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| MtbB | 7 | 6 | 0 | 0 | 0 | 1 | 0 |
| MttB | 3 | 2 | 0 | 0 | 0 | 1 | >10 |
| PylSn | 1 | 1 | 0 | 0 | 0 | 0 | |
| PylSc | 1 | 1 | 0 | 0 | 0 | 0 | |
| PylB | 1 | 1 | 0 | 0 | 0 | 0 | |
| PylC | 1 | 1 | 0 | 0 | 0 | 0 | |
| PylD | 1 | 1 | 0 | 0 | 0 | 0 | |
| PylT | 1 | 1 | 0 | 0 | 0 | 0 | |
Figure 6.Occurrence of genes for Pyl-containing proteins and Pyl operon proteins in Olavius symbionts’ metagenomic sequences. The mtbB and mttB genes and other Pyl operon genes are shown by the indicated color scheme in contigs containing these genes.
Figure 7.Multiple alignments of Pyl-flanking regions in methylamine methyltransferase families (MtbB and MttB). Pyl is shown by X and its location in the alignment is highlighted in red.
Figure 8.Relationship among habitats, genome size, proteome size and selenoproteomes. Sec-containing organisms were classified into four groups based on different habitats: aquatic, host-associated, multiple and terrestrial. (A) Correlation between genome size and proteome size. (B) Correlation between proteome size and selenoproteomes. δ1 and δ4 symbionts are indicated in the figure.
Selenoproteins in sequenced symbiotic/host-associated bacteria
| Phylum | Organism | Total number of proteins | Number of selenoproteins | Habitat | Oxygen requirement |
|---|---|---|---|---|---|
| Actinobacteria | 2367 | 2 | Human gut | Anaerobic | |
| 6716 | 1 | Human smegma | Aerobic | ||
| 5120 | 1 | Lung | Aerobic | ||
| Betaproteobacteria/ Burkholderiaceae | 5025 | 1 | Mammals | Aerobic | |
| 6604 | 1 | Human lung | Aerobic | ||
| 7845 | 1 | Root nodules of tropical legumes | Aerobic | ||
| Deltaproteobacteriadelta | 1185 | 1 | Mucosa of the lower intestinal tract in animals | Facultative | |
| Epsilonproteobacteria | 2039 | 6 | Human oral cavity and gastrointestinal tract | Microaerophilic | |
| 1921 | 4 | Human oral cavity and gastrointestinal tract | Microaerophilic | ||
| 1719 | 4 | Human blood | Microaerophilic | ||
| 1875 | 1 | Mucosal layer of the gastrointestinal tract | Microaerophilic | ||
| 2043 | 1 | Gastrointestinal tract | Microaerophilic | ||
| Gammaproteobacteria/ Enterobacteriales | 4243 | 3 | Lower intestine | Facultative | |
| 4683 | 1 | The gut of an entomopathogenic nematode | Facultative | ||
| 4427 | 3 | Gastrointestinal tract in animals | Facultative | ||
| 4425 | 3 | Gastrointestinal tract in animals | Facultative | ||
| 4136 | 3 | Gastrointestinal tract in animals | Facultative | ||
| 4274 | 2 | Gastrointestinal tract in animals | Facultative | ||
| 4182 | 3 | Gastrointestinal tract in animals | Facultative | ||
| 4223 | 3 | Gastrointestinal tract in animals | Facultative | ||
| Gammaproteobacteria/ Pasteurellaceae | 2012 | 2 | Lower respiratory tract of pigs | Facultative | |
| 1883 | 2 | Bovine rumen | Anaerobic | ||
| 1717 | 1 | Animal mucous membranes | Anaerobic | ||
| 1791 | 2 | Animal mucous membranes | Facultative | ||
| 2380 | 2 | Bovine rumen | Anaerobic | ||
| 2015 | 1 | Mucous membranes of the intestinal, genital and respiratory tissues | Facultative | ||
| Spirochaetales | 2767 | 6 | Oral cavity | Anaerobic |