| Literature DB >> 26594329 |
Reinier Gesto-Borroto1, Miriam Sánchez-Sánchez1, Raúl Arredondo-Peter1.
Abstract
Globins (Glbs) are proteins widely distributed in organisms. Three evolutionary families have been identified in Glbs: the M, S and T Glb families. The M Glbs include flavohemoglobins (fHbs) and single-domain Glbs (SDgbs); the S Glbs include globin-coupled sensors (GCSs), protoglobins and sensor single domain globins, and the T Glbs include truncated Glbs (tHbs). Structurally, the M and S Glbs exhibit 3/3-folding whereas the T Glbs exhibit 2/2-folding. Glbs are widespread in bacteria, including several rhizobial genomes. However, only few rhizobial Glbs have been characterized. Hence, we characterized Glbs from 62 rhizobial genomes using bioinformatics methods such as data mining in databases, sequence alignment, phenogram construction and protein modeling. Also, we analyzed soluble extracts from Bradyrhizobium japonicum USDA38 and USDA58 by (reduced + carbon monoxide (CO) minus reduced) differential spectroscopy. Database searching showed that only fhb, sdgb, gcs and thb genes exist in the rhizobia analyzed in this work. Promoter analysis revealed that apparently several rhizobial glb genes are not regulated by a -10 promoter but might be regulated by -35 and Fnr (fumarate-nitrate reduction regulator)-like promoters. Mapping analysis revealed that rhizobial fhbs and thbs are flanked by a variety of genes whereas several rhizobial sdgbs and gcss are flanked by genes coding for proteins involved in the metabolism of nitrates and nitrites and chemotaxis, respectively. Phenetic analysis showed that rhizobial Glbs segregate into the M, S and T Glb families, while structural analysis showed that predicted rhizobial SDgbs and fHbs and GCSs globin domain and tHbs fold into the 3/3- and 2/2-folding, respectively. Spectra from B. japonicum USDA38 and USDA58 soluble extracts exhibited peaks and troughs characteristic of bacterial and vertebrate Glbs thus indicating that putative Glbs are synthesized in B. japonicum USDA38 and USDA58.Entities:
Keywords: Burkholderia; Cupriavidus; Rhizobium; flavohemoglobin; globin-coupled sensor; single-domain globin; truncated (2/2) hemoglobin
Year: 2015 PMID: 26594329 PMCID: PMC4648194 DOI: 10.12688/f1000research.6392.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Classification of the α- and β-rhizobia and rhizobial genomes analyzed in this work.
| α-rhizobia
| |||
|---|---|---|---|
| Family | Genus | Species/strain | Database |
| Bradyrhizobiaceae |
|
| JGI Genome Portal (
|
| Phyllobacteriaceae |
|
| JGI Genome Portal (
|
| Rhizobiaceae |
|
| CCG-UNAM (
|
|
|
| GGL (
| |
| Xanthobacteraceae |
|
| JGI Genome Portal (
|
| β-rhizobia
| |||
| Family | Genus | Species/strain | Database |
| Burkholderiaceae |
|
| JGI Genome Portal (
|
|
|
| Goettingen Genomics Laboratory (
|
*Formerly classified as Alcaligenes eutrophus and Ralstonia eutropha.
Figure 1. Venn diagram illustrating the distribution of glb genes in the rhizobial bacteria analyzed in this work.
Numbers correspond to rhizobial genomes containing glbs.
Number of glb copies detected in the rhizobial genomes analyzed in this work.
|
| No. of genomes |
|---|---|
|
| |
| 1 | 5 |
| 2 | 1 |
|
| |
| 1 | 5 |
| 2 | 11 |
| 3 | 4 |
| 4 | 2 |
|
| |
| 1 | 12 |
| 2 | 14 |
|
| |
| 1 | 22 |
| 2 | 36 |
| 3 | 3 |
Figure 2. Phenetic relationships among Glbs detected in the genomes of rhizobial bacteria.
Phenogram was obtained from the Glbs sequence alignment shown in Figure S2. The fHb, SDgb, GCS, tHb class 1, tHb class 2 and tHb class 3 clusters are indicated with light blue, dark blue, red, light green, bright green and dark green, respectively. Stars indicate Glbs selected for the detection of promoter sequences upstream to the glb genes and Glb protein modeling.
Position of canonical and Fnr-like promoter sequences and Shine-Dalgarno sequence within 130 nucleotides upstream to selected rhizobial glb genes.
Consensus sequences are indicated in parenthesis. Identical and non-identical nucleotides into the Fnr-like promoter sequences to the consensus Fnr promoter sequence are indicated with upper- and lowercase letters, respectively. N.D., non-detected.
|
| Canonical promoters | Fnr promoter | Shine-Dalgarno
| ||
|---|---|---|---|---|---|
| -10 promoter | -35 promoter | Sequence | Position | ||
| (TATAAT) | (TTGACA) | (TTTAAGAGGCCAAT) | |||
|
| |||||
|
| N.D. | -36 to -41 | TcTAAGcGaCtgAT | -102 to -115 | -10 to -13 |
|
| N.D. | -43 to -48 | N.D. | -8 to -12 | |
|
| N.D. | -46 to -54 | TTTAAaAcGgagcc | -5 to -18 | -10 to -15 |
|
| N.D. | -46 to -52 | N.D. | -10 to -15 | |
|
| N.D. | -29 to -36 | aTcAAGgcGgCgAg | -64 to -77 | -8 to -12 |
|
| N.D. | -32 to -37 | N.D. | -9 to -12 | |
|
| N.D. | -48 to -56 | gTcAAGgaGCCAAa | -12 to -25 | -8 to -12 |
| ggTtgGgGtCCAcT | -61 to -74 | ||||
|
| |||||
|
| N.D. | -38 to -42 | gccAgGAGtCCgAT | -2 to -15 | -8 to -12 |
|
| N.D. | -36 to -43 | TaTAAGgacatcAT | -114 to -127 | -7 to -11 |
|
| N.D. | -34 to -40 | N.D. | -7 to -12 | |
|
| N.D. | -61 to -66 | TTTttGgGGCaAAT | -71 to -84 | N.D. |
|
| N.D. | -40 to -44 | TTTAcGAGGCtgcT | -11 to -24 | -16 to -22 |
|
| N.D. | -41 to -46 | TTTcAGAactCAtT | -22 to -35 | -8 to -12 |
| cTTcgGttaCCAAT | -56 to -69 | ||||
|
| N.D. | -53 to -58 | N.D. | -6 to -9 | |
|
| N.D. | -60 to -65 | N.D. | -7 to -11 | |
|
| |||||
|
| N.D. | -55 to -60 | gTTtcGcctCCgAT | -21 to -34 | -6 to -11 |
|
| N.D. | -44 to -50 | N.D. | -7 to -9 | |
|
| N.D. | -31 to -36 | gTggAGAGGaCcgT | -91 to -104 | -2 to -5 |
|
| N.D. | -44 to -51 | TTTAAccaGgCAtc | -80 to -93 | -5 to -11 |
|
| N.D. | -65 to -70 | N.D.
| N.D. | |
|
| N.D. | -51 to -55 | TgatcGAGGCaAgg | -33 to -46 | -8 to -10 |
|
| N.D. | -47 to -52 | gTggtGAGGaCcgT | -90 to -103 | -3 to -8 |
|
| N.D. | -23 to -29 | TTcAgcgGGCCAca | -47 to -60 | -6 to -8 |
|
| N.D. | -31 to -36 | N.D. | -6 to -8 | |
|
| |||||
|
| N.D. | -54 to -59 | TgctgGAcGCCAAc | -95 to -108 | -5 to -7 |
|
| N.D. | -61 to -67 | N.D. | -7 to -12 | |
|
| N.D. | -56 to -61 | TTTgAGAtaCCtAT | -15 to -28 | -3 to -5 |
|
| N.D. | -34 to -39 | gTTgAGAGcCgcca | -59 to -72 | -2 to -4 |
|
| N.D. | -38 to -42 | TaTAtcAGGgCAca | -23 to -36 | -7 to -9 |
|
| N.D. | -32 to -37 | N.D. | -11 to -14 | |
|
| -8 to -13 | -31 to 35 | TaTAAacGGtaAcT | -90 to -103 | -2 to -4 |
|
| -23 to -28 | -43 to -47 | cTgAtGcGGCCAgc | -70 to -83
| N.D. |
|
| N.D. | N.D. | TcgctaAGGCCgcT | 47 to -60 | -7 to -11 |
|
| N.D. | -43 to -47 | N.D. | -7 to -9 | |
|
| N.D. | -61 to -66 | TTgtAGtGGgCgAc | -95 to -108 | -8 to -12 |
|
| N.D. | -39 to -43 | TgcAAGccGCCAtc | -47 to -60 | -9 to -12 |
|
| N.D. | -32 to -37 | TaTAtGAGGagcgg | -28 to -41 | N.D. |
|
| N.D. | -59 to -64 | N.D. | -8 to -10 | |
|
| N.D. | -54 to -58 | TTggAatGGaCAAT | -58 to -71 | -6 to -10 |
|
| N.D. | -31 to -35 | TTcgAcAtGCaAAT | -90 to -103 | N.D. |
|
| -17 to -22 | -40 to -45 | N.D. | -4 to -7 | |
|
| N.D. | N.D. | N.D. | -7 to -11 | |
|
| N.D. | -42 to -49 | TTTgtcAaGCCctg | -102 to -115 | -3 to -6 and
|
|
| N.D. | -57 to -63 | cTTgtcgGGCagAT | -87 to -100 | -5 to -7 |
Predicted structures of rhizobial Glbs deposited in the Caspur protein model dataBase ( http://bioinformatics.cineca.it/PMDB/).
| Glb | ID number |
|---|---|
|
| |
| BurphySTM815fHb | PM0079658 |
| CupnecHPC(L)fHb | PM0079659 |
| CupnecJMP134fHb | PM0079660 |
| CupnecN-1 ATCC43291fHb1 | PM0079661 |
| CupnecN-1 ATCC43291fHb2 | PM0079662 |
| RhilegUPM1137fHb | PM0079663 |
| Sinmel1021fHb | PM0079672 |
|
| |
| AzodoeUFLA1-100SDgb | PM0079664 |
| BraelkUSDA94SDgb2 | PM0079665 |
| BraelkUSDA3254SDgb1 | PM0079666 |
| BraelkUSDA3254SDgb2 | PM0079667 |
| BraelkUSDA3259SDgb1 | PM0079668 |
| BraelkWSM1741SDgb2 | PM0079669 |
| BrajapUSDA38SDgb2 | PM0079670 |
| BrajapUSDA124SDgb1 | PM0079671 |
|
| |
| Brajapin8p8GCS globin domain | PM0079673 |
| RhietlCIAT652GCS1 globin domain | PM0079674 |
| RhietlCIAT652GCS2 globin domain | PM0079675 |
| Rhietl8C-3GCS globin domain | PM0079676 |
| RhietlCFN42DSM 11541GCS1 globin domain | PM0079677 |
| RhilegGB30GCS1 globin domain | PM0079678 |
| RhilegGB30GCS2 globin domain | PM0079679 |
| SinfreGR64GCS globin domain | PM0079680 |
| Sinmel1021GCS globin domain | PM0079681 |
|
| |
| AzodoeUFLA1-100tHb1 | PM0079682 |
| AzodoeUFLA1-100tHb2 | PM0079683 |
| BraelkUSDA76tHb2 | PM0079684 |
| BraelkUSDA94tHb1 (globin domain +
| PM0079701 |
| BrajapUSDA38tHb2 | PM0079685 |
| BrajapUSDA123tHb1 | PM0079686 |
| BurphySTM815tHb1 | PM0079687 |
| BurphySTM815tHb2 | PM0079688 |
| CupnecN-1 ATCC43291tHb1 | PM0079689 |
| CupnecN-1 ATCC43291tHb2 | PM0079690 |
| MescicCMG6tHb | PM0079691 |
| MeslotNZP2037tHb2 | PM0079692 |
| RhietlCNPAF512tHb | PM0079693 |
| RhietlKim5tHb | PM0079694 |
| RhilegGB30tHb2 | PM0079695 |
| RhilegVc2tHb1 | PM0079696 |
| RhilupHPC(L)tHb1 | PM0079697 |
| RhilupHPC(L)tHb2 | PM0079698 |
| SinfreHH103tHb | PM0079699 |
| Sinmel1021tHb2 | PM0079700 |
Figure 3. Predicted structure of rhizobial fHbs (blue) overlapped to structural homologues (green).
Structural homologues are indicated in Dataset 3. Distal and proximal amino acids to the heme Fe and amino acids that interact with the FAD cofactor are shown in brown. Heme and FAD are shown in red and yellow, respectively. Helices within the globin domain are indicated with letters A to H. All structures are displayed in the same orientation.
Figure 8. Predicted structure of selected rhizobial tHbs class 3 (blue) overlapped to structural homologues (green).
Structural homologues are indicated in Dataset 3. Distal and proximal amino acids to the heme Fe are shown in brown; only potential distal E11 is shown in the tHbs structure. Heme is shown in red. Helices are indicated with letters A to H. All structures are displayed in the same orientation.
Figure 4. Predicted structure of selected rhizobial SDgbs (blue) overlapped to structural homologues (green).
Structural homologues are indicated in Dataset 3. Distal and proximal amino acids to the heme Fe are shown in brown. Heme is shown in red. Helices are indicated with letters A to H. All structures are displayed in the same orientation.
Figure 5. Predicted structure of selected rhizobial GCSs globin domain (blue) overlapped to structural homologues (green).
Structural homologues are indicated in Dataset 3. Distal and proximal amino acids to the heme Fe are shown in brown. Heme is shown in red. Helices are indicated with letters A to H. All structures are displayed in the same orientation.
Figure 6. Predicted structure of class 1 CupnecN1tHb1 (blue) overlapped to the structural homologue Tetrahymena pyriformis tHb (PDB ID 3AQ5) (green).
Distal and proximal amino acids to the heme Fe are shown in brown; only potential distal E11 is shown in the CupnecN1tHb1 structure. Heme is shown in red. Helices are indicated with letters A to H.
Figure 7. Predicted structure of selected rhizobial tHbs class 2 (blue) overlapped to structural homologues (green).
Structural homologues are indicated in Dataset 3. Distal and proximal amino acids to the heme Fe are shown in brown; only potential distal E11 is shown in the tHbs structure. Heme is shown in red. Helices are indicated with letters A to H. Pre-helix F is indicated with the Greek letter φ. All structures are displayed in the same orientation.
Absorption peaks and troughs in the Soret and Q regions from the (dithionite reduced + CO minus dithionite reduced) differential spectra of rhizobial soluble extracts and other bacterial and vertebrate Glbs.
| Rhizobial soluble extract/Glb | Soret region | Q region | Reference | ||||
|---|---|---|---|---|---|---|---|
| Peak
| Trough
| Peak (nm) | Trough (nm) | ||||
|
| |||||||
|
| 425 | 448 | 535 | 573 | 549 | 600 | This work |
|
| 416 | 437 | 535 | 573 | 554 | 601 | This work |
|
| 422 | 443 | 529 | 574 | 558 | 598 | 24 |
|
| 417 | 434 | 540 | 569 | 556 | n.i. | 22 |
|
| 421 | 439 | 539 | 563 | 547 | 590 | 25 |
|
| 424 | 443 | 535 | 574 | 555 | n.i. | 23 |
|
| |||||||
|
| 418 | 436 | 534 | 567 | 551 | 590 | 25 |
|
| 420 | 437 | 530 | 570 | 555 | 592 | 55 |
|
| |||||||
| Sperm whale myoglobin | 419 | 436 | 538 | 578 | 558 | 596 | This work |
| Bovine blood hemoglobin | 417 | 432 | 533 | 570 | 554 | 588 | This work |
n.i., non-identified