| Literature DB >> 19995434 |
Daniel Lundin1, Eduard Torrents, Anthony M Poole, Britt-Marie Sjöberg.
Abstract
BACKGROUND: Ribonucleotide reductases (RNRs) catalyse the only known de novo pathway for deoxyribonucleotide synthesis, and are therefore essential to DNA-based life. While ribonucleotide reduction has a single evolutionary origin, significant differences between RNRs nevertheless exist, notably in cofactor requirements, subunit composition and allosteric regulation. These differences result in distinct operational constraints (anaerobicity, iron/oxygen dependence and cobalamin dependence), and form the basis for the classification of RNRs into three classes. DESCRIPTION: In RNRdb (Ribonucleotide Reductase database), we have collated and curated all known RNR protein sequences with the aim of providing a resource for exploration of RNR diversity and distribution. By comparing expert manual annotations with annotations stored in Genbank, we find that significant inaccuracies exist in larger databases. To our surprise, only 23% of protein sequences included in RNRdb are correctly annotated across the key attributes of class, role and function, with 17% being incorrectly annotated across all three categories. This illustrates the utility of specialist databases for applications where a high degree of annotation accuracy may be important. The database houses information on annotation, distribution and diversity of RNRs, and links to solved RNR structures, and can be searched through a BLAST interface. RNRdb is accessible through a public web interface at http://rnrdb.molbio.su.se.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19995434 PMCID: PMC2795772 DOI: 10.1186/1471-2164-10-589
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
General characteristics of RNR classes
| Class I | Class II | Class III | |
|---|---|---|---|
| Operation | Aerobic | Oxygen independent | Anaerobic |
| Structure | α2 β2 | α or α2 | α2 |
| Subunit names | Ia: NrdA, NrdB | NrdJ | NrdD |
| Ib: NrdE, NrdF | specific activase: NrdG | ||
| Radical/cofactor | Tyr122 (in β) | AdoCbl | Gly580 (in α) |
| Reductant | Ia: Thioredoxin/Glutaredoxin | Thioredoxin | Formate |
| Ib: NrdH-redoxin | |||
| In archaea | Limited distribution | Yes | Yes |
| In bacteria | Yes | Yes | Yes |
| In eukaryotes | Yes | Limited distribution | Limited distribution |
| In virusesa | Yes | Bacteriophage; one eukaryotic virus | Bacteriophage |
The catalytic subunits, NrdA, NrdE, NrdJ and NrdD are homologous, as are the two different class I radical generating subunits, NrdB and NrdF. The class III activase, NrdG, is not homologous to any other RNR subunit. Residue numbering refers to the Escherichia coli K12 class I radical generating subunit and the Bacteriophage T4 class III enzyme sequences respectively. Adapted from [2].
aRNR genes are only found in some dsDNA viruses.
Distribution of RNR proteins
| Class | Subunita | Archaea | Archaeal viruses | Bacteria | Bacteriophages | Eukaryotes | Eukaryotic viruses | Totals |
|---|---|---|---|---|---|---|---|---|
| Ia | NrdA | 8 | 2 | 893 | 39 | 188 | 124 | 1254 |
| NrdB | 8 | 2 | 910 | 47 | 247 | 140 | 1354 | |
| Ib | NrdE | 605 | 7 | 612 | ||||
| NrdF | 647 | 8 | 655 | |||||
| NrdH | 393 | 7 | 400 | |||||
| NrdI | 633 | 5 | 638 | |||||
| II | NrdJ | 62 | 4 | 702 | 18 | 7 | 1 | 794 |
| III | NrdD | 49 | 950 | 30 | 8 | 1037 | ||
| NrdG | 48 | 923 | 21 | 992 | ||||
| All classes | NrdR | 6 | 1268 | 1274 | ||||
| Totals | 181 | 8 | 7924 | 182 | 450 | 265 | 9010 | |
Distribution of RNR classes and NrdR regulator among cellular domains and viruses.
a Protein roles for subunits are catalytic component (NrdA, NrdE), radical harbouring component (NrdB, NrdF), reductant (NrdH), flavodoxin component (NrdI), core enzyme (NrdJ, NrdD), activase (NrdG), and transcriptional regulator (NrdR).
Figure 1RNRdb pipeline. RNRdb is loaded from upstream databases (see text for details) using a semi automated pipeline. Before inclusion, each sequence is manually vetted.
RNR annotation
| Correct class | Correct role | Correct function | Single domain | N | Percent |
|---|---|---|---|---|---|
| Yes | Yes | Yes | NA | 1702 | 23 |
| Yes | No | Yes | NA | 569 | 8 |
| No | Yes | NA | NA | 2339 | 32 |
| No | No | Yes | NA | 1185 | 16 |
| No | No | Almosta | NA | 196 | 3 |
| No | No | No | Yes | 105 | 1 |
| No | No | No | No | 1155 | 16 |
| 7251 | 100 | ||||
GenBank definitions of proteins in RNRdb. Four different aspects of RNR functional annotation have been investigated: RNR class, protein role (subunit identity), general function (involved with ribonucleotides) and single domain identification (the ATP cone [35] that regulates activity in many enzymatic subunits). Only NCBI entries for full-length sequences were used, and nine sequences that were difficult to classify were excluded. NA, not applicable.
a Proteins annotated such that they contain the string "ribonucl" for all RNR proteins, or "repressor" for NrdR, but not a full description of the function such that they would classify as correctly functionally annotated, have been classified as almost correctly annotated with respect to function.
RNR distribution
| Combination | Archaea | Archaeal viruses | Bacteria | Bacteriophages | Eukaryotes | Eukaryotic viruses | Totals |
|---|---|---|---|---|---|---|---|
| I | 0 | 0 | 211 | 18 | 56 | 81 | 366 |
| II | 20 | 1 | 87 | 14 | 1 | 0 | 123 |
| III | 15 | 0 | 11 | 1 | 0 | 0 | 27 |
| I+II | 2 | 0 | 82 | 0 | 2 | 1 | 87 |
| I+III | 0 | 0 | 222 | 14 | 1 | 0 | 237 |
| II+III | 15 | 0 | 47 | 0 | 0 | 0 | 62 |
| I+II+III | 1 | 0 | 54 | 0 | 2 | 0 | 57 |
| None | 0 | 0 | 3 | 0 | 2 | 0 | 5 |
| Totals | 53 | 1 | 717 | 47 | 64 | 82 | 964 |
Combinations of RNR classes in the cellular domains and viruses. This table only contains data from fully sequenced genomes. We have exhaustively verified absence of RNRs in organismal, but not in viral genomes.