BACKGROUND: Selenocysteine (Sec) is a rare amino acid which occurs in proteins in major domains of life. It is encoded by TGA, which also serves as the signal for termination of translation, precluding identification of selenoprotein genes by available annotation tools. Information on full sets of selenoproteins (selenoproteomes) is essential for understanding the biology of selenium. Herein, we characterized the selenoproteome of the largest microbial sequence dataset, the Sargasso Sea environmental genome project. RESULTS: We identified 310 selenoprotein genes that clustered into 25 families, including 101 new selenoprotein genes that belonged to 15 families. Most of these proteins were predicted redox proteins containing catalytic selenocysteines. Several bacterial selenoproteins previously thought to be restricted to eukaryotes were detected by analyzing eukaryotic and bacterial SECIS elements, suggesting that eukaryotic and bacterial selenoprotein sets partially overlapped. The Sargasso Sea microbial selenoproteome was rich in selenoproteins and its composition was different from that observed in the combined set of completely sequenced genomes, suggesting that these genomes do not accurately represent the microbial selenoproteome. Most detected selenoproteins occurred sporadically compared to the widespread presence of their cysteine homologs, suggesting that many selenoproteins recently evolved from cysteine-containing homologs. CONCLUSIONS: This study yielded the largest selenoprotein dataset to date, doubled the number of prokaryotic selenoprotein families and provided insights into forces that drive selenocysteine evolution.
BACKGROUND:Selenocysteine (Sec) is a rare amino acid which occurs in proteins in major domains of life. It is encoded by TGA, which also serves as the signal for termination of translation, precluding identification of selenoprotein genes by available annotation tools. Information on full sets of selenoproteins (selenoproteomes) is essential for understanding the biology of selenium. Herein, we characterized the selenoproteome of the largest microbial sequence dataset, the Sargasso Seaenvironmental genome project. RESULTS: We identified 310 selenoprotein genes that clustered into 25 families, including 101 new selenoprotein genes that belonged to 15 families. Most of these proteins were predicted redox proteins containing catalytic selenocysteines. Several bacterial selenoproteins previously thought to be restricted to eukaryotes were detected by analyzing eukaryotic and bacterial SECIS elements, suggesting that eukaryotic and bacterial selenoprotein sets partially overlapped. The Sargasso Sea microbial selenoproteome was rich in selenoproteins and its composition was different from that observed in the combined set of completely sequenced genomes, suggesting that these genomes do not accurately represent the microbial selenoproteome. Most detected selenoproteins occurred sporadically compared to the widespread presence of their cysteine homologs, suggesting that many selenoproteins recently evolved from cysteine-containing homologs. CONCLUSIONS: This study yielded the largest selenoprotein dataset to date, doubled the number of prokaryotic selenoprotein families and provided insights into forces that drive selenocysteine evolution.
Authors: M Nirenberg; T Caskey; R Marshall; R Brimacombe; D Kellogg; B Doctor; D Hatfield; J Levin; F Rottman; S Pestka; M Wilcox; F Anderson Journal: Cold Spring Harb Symp Quant Biol Date: 1966
Authors: Bing Hao; Weimin Gong; Tsuneo K Ferguson; Carey M James; Joseph A Krzycki; Michael K Chan Journal: Science Date: 2002-05-24 Impact factor: 47.728
Authors: Sergi Castellano; Alexey V Lobanov; Charles Chapple; Sergey V Novoselov; Mario Albrecht; Deame Hua; Alain Lescure; Thomas Lengauer; Alain Krol; Vadim N Gladyshev; Roderic Guigó Journal: Proc Natl Acad Sci U S A Date: 2005-10-31 Impact factor: 11.205
Authors: Janinah Baclaocos; Didac Santesmasses; Marco Mariotti; Katarzyna Bierła; Michael B Vetick; Sharon Lynch; Rob McAllen; John J Mackrill; Gary Loughran; Roderic Guigó; Joanna Szpunar; Paul R Copeland; Vadim N Gladyshev; John F Atkins Journal: J Mol Biol Date: 2019-08-20 Impact factor: 5.469
Authors: Sergey V Novoselov; Alexey V Lobanov; Deame Hua; Marina V Kasaikina; Dolph L Hatfield; Vadim N Gladyshev Journal: Proc Natl Acad Sci U S A Date: 2007-04-30 Impact factor: 11.205