| Literature DB >> 24019874 |
James B Howard1, Katerina J Kechris, Douglas C Rees, Alexander N Glazer.
Abstract
Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification provides the bases for new analyses of the three-dimensional structure and for mutagenesis studies.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24019874 PMCID: PMC3760896 DOI: 10.1371/journal.pone.0072751
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Three-dimensional structure of the α2β2 tetramer of A. vinelandii Component 1 (3U7Q.pdb).
The figure is centered on the approximate two-fold axis between the αβ pairs. Red is the α-subunit and blue is the β-subunit with the three metal centers shown in space filling PCK models. The Component 2 (Fe-protein) docking site is along the axis (arrow) identifying the P-cluster. Figure was prepared using Pymol (http://pymol.org/).
Figure 2Phylogeny of species used for multi-sequence alignment of NifD and NifK.
The species in the data analysis set (identifiers and species are in Table S1) were superimposed on a simplified whole-proteome tree from Jun et al. (Figure 2 in [34], constructed with whole proteomes of 884 prokaryotes). Identifiers are based upon the six nitrogenase groups; species with both Nif and either Anf or Vnf have more than one identifier.
Figure 3Diagram showing co-aligned regions of gene D and gene K used to identify amino acid variants.
Shaded blocks are the regions co-aligned across all 95 sequences. Lines between blocks have one or more insertions or deletions and are not included in the co-alignment. Numbering is based upon the A. vinelandii proteins. Gene D and Gene K co-aligned residues are explicitly given in Table S2.
Invariant and Single Variant Residues.
| α-subunit | β-subunit | |
| Sequence size | 462–578 | 454–548 |
| Aligned residues | 422 | 386 |
| Invariant residues | 41 | 27 |
| % invariant | 9.7% | 7.0% |
| Total Single variant | 39 | 33 |
| % single variant | 9.2% | 8.5% |
Values are for 95 aligned Nif, Anf, and Vnf sequences.
Range of full sequence lengths.
Residues common to nif, anf, vnf exclusive of extensions, insertions or deletions.
Based upon total number of aligned residues.
Invariant Residues, α-Subunit, Common Between Groups.
| # Sequences | Group | I | II | III | IV | Anf | Vnf |
| 45 | I | 144 | 71 | 73 | 93 | 68 | 72 |
| 18 | II | 110 | 59 | 84 | 70 | 68 | |
| 8 | III | 120 | 105 | 78 | 85 | ||
| 3 | IV | 359 | 131 | 138 | |||
| 12 | Anf | 256 | 159 | ||||
| 9 | Vnf | 246 |
Group III includes Sec as invariant with Cys.
Invariant Residues, β-Subunit, Common Between Groups.
| # Sequences | Group | I | II | III | IV | Anf | Vnf |
| 45 | I | 70 | 44 | 46 | 54 | 44 | 47 |
| 18 | II | 85 | 48 | 67 | 56 | 58 | |
| 8 | III | 96 | 72 | 56 | 67 | ||
| 3 | IV | 328 | 97 | 103 | |||
| 12 | Anf | 198 | 128 | ||||
| 9 | Vnf | 171 |
Number of Strong Motif Residues, α-Subunit.
| # Sequences | Group | I | II | III | IV | Anf | Vnf |
| 45 | I | 9 | 5 | 0 | 1 | 0 | 0 |
| 18 | II | 7 | 0 | 0 | 1 | 0 | |
| 8 | III | 2 | 1 | 0 | 0 | ||
| 3 | IV | 32 | 0 | 2 | |||
| 12 | Anf | 23 | 15 | ||||
| 9 | Vnf | 15 |
Number of Strong Motif Residues, β-Subunit.
| # Sequences | Group | I | II | III | IV | Anf | Vnf |
| 45 | I | 3 | 0 | 0 | 0 | 0 | 0 |
| 18 | II | 2 | 0 | 0 | 0 | 0 | |
| 8 | III | 1 | 1 | 0 | 0 | ||
| 3 | IV | 30 | 0 | 0 | |||
| 12 | Anf | 3 | 6 | ||||
| 9 | Vnf | 1 |
Figure 4Cofactor environment showing amino acid residues at 5 Å contact.
Cofactor including homocitric acid, α-His442, and α-Cys 275 ligands are shown as CPK spheres. Waters are red dot spheres. Dark green surface and sticks represent invariant residues. Light teal surface and sticks represent single variant residues. Bright orange surface and sticks represent multiple variant residues. (See Tables S9 and S10) A. Cofactor with α-Cys275 and α-His442 ligands. B. Invariant and single variant residues added. C Multiple variant residues added. Water sheet between homocitric acid and β-subunit is on the right. Amino acids that interact only by H-bond through water atoms are omitted. Figure was prepared using 3U7Q.pdb and Pymol (http://pymol.org/).