| Literature DB >> 22848735 |
John Christian Gaby1, Daniel H Buckley.
Abstract
The nifH gene is the most widely sequenced marker gene used to identify nitrogen-fixing Bacteria and Archaea. Numerous PCR primers have been designed to amplify nifH, but a comprehensive evaluation of nifH PCR primers has not been performed. We performed an in silico analysis of the specificity and coverage of 51 universal and 35 group-specific nifH primers by using an aligned database of 23,847 nifH sequences. We found that there are 15 universal nifH primers that target 90% or more of nitrogen fixers, but that there are also 23 nifH primers that target less than 50% of nifH sequences. The nifH primers we evaluated vary in their phylogenetic bias and their ability to recover sequences from commonly sampled environments. In addition, many of these primers will amplify genes that do not mediate nitrogen fixation, and thus it would be advisable for researchers to screen their sequencing results for the presence of non-target genes before analysis. Universal primers that performed well in silico were tested empirically with soil samples and with genomic DNA from a phylogenetically diverse set of nitrogen-fixing strains. This analysis will be of great utility to those engaged in molecular analysis of nifH genes from isolates and environmental samples.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22848735 PMCID: PMC3405036 DOI: 10.1371/journal.pone.0042149
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Coverage of the nifH gene by sequences and primers in the nifH database.
The number of sequences in the nifH database is depicted in relation to alignment position along the gene. Alignment positions are referenced to the nifH nucleotide position from Azotobacter vinelandii (Genbank ACCN# M20568). Universal nifH primer sequences listed in Table 1 and 2 are indicated by grey horizontal lines.
Properties of universal primers and their coverage for phylogenetic and environmental groupings in the nifH database; continued in Table 2.
|
| Specific groupings | Environ. | |||||||||||||||||
| Primer | Name | Pos. | Deg. | Tm (°C) | 0 | 1 | 2 | Pr | Cy | III | IA | Fr | Pb | Ep | IV | Soil | Mat | Sea | Ref. |
| GCIWTYTAYGGNAARGG | Nh21F | 19–35 | 64 | 51.8–61.9 | 93 | 96 | 103 | 89 | 100 | 100 | 100 | 100 | 91 | 83 | 76 | 78 | 100 | 100 |
|
| GCIWTITAYGGNAARGGNGG | nifH19F | 19–38 | 128 | 59.5–69.5 | 94 | 96 | 96 | 89 | 100 | 100 | 100 | 100 | 91 | 100 | 79 | 78 | 100 | 100 |
|
| GCIWTYTAYGGIAARGGIGG | Ueda19F | 19–38 | 16 | 62.4–67.9 | 93 | 96 | 96 | 89 | 100 | 100 | 100 | 100 | 91 | 83 | 76 | 78 | 100 | 100 |
|
| GCIWTHTAYGGIAARGGIGGIATHGGIAA | IGK3 | 19–47 | 72 | 69.4–75.3 | 92 | 95 | 95 | 87 | 100 | 98 | 96 | 100 | 91 | 100 | 78 | 72 | 100 | 100 |
|
| GCGTTCTACGGTAAGGGCGGTATCGGNAAR | K07-F | 19–48 | 8 | 71.0–72.8 | 1 | 3 | 10 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| TTYTAYGGNAARGGNGG | nifH4 | 22–38 | 128 | 49.8–63.5 | 52 | 91 | 94 | 71 | 100 | 4 | 43 | 93 | 91 | 13 | 23 | 43 | 100 | 78 |
|
|
| primer-f | 23–44 | 1 | 66.5 | 10 | 14 | 31 | 13 | 0 | 13 | 0 | 0 | 0 | 23 | 0 | 0 | 0 | 0 |
|
| TACGGCAARGGTGGNATHG | FGPH19 | 25–43 | 24 | 58.2–66.2 | 7 | 24 | 63 | 9 | 0 | 4 | 1 | 71 | 0 | 2 | 4 | 3 | 0 | 0 |
|
| TACGGYAARGGBGGYATCGG | IGK-Poly | 25–44 | 24 | 60.3–70.6 | 18 | 50 | 72 | 22 | 45 | 6 | 16 | 29 | 17 | 0 | 9 | 6 | 100 | 33 |
|
| TACGG(P/K)AAKGG(P/G)GG(P/K)ATPGG | PicenoF44 | 25–44 | 8 | NA | 50 | 85 | 96 | 59 | 96 | 35 | 25 | 79 | 33 | 59 | 31 | 34 | 100 | 89 |
|
| TAYGGIAARGGIGGIATYGGIAARTC | F1 | 25–50 | 4096 | 60.4–74.5 | 79 | 95 | 96 | 84 | 100 | 69 | 65 | 93 | 83 | 87 | 80 | 83 | 100 | 100 |
|
| GGHAARGGHGGHATHGGNAARTC | MehtaF | 28–50 | 1296 | 57.4–72.9 | 59 | 78 | 87 | 46 | 96 | 77 | 44 | 94 | 44 | 51 | 69 | 67 | 100 | 100 |
|
| AARGGNGGNATHGGNAA | IGK | 31–47 | 384 | 62.1–72.5 | 90 | 98 | 104 | 91 | 100 | 93 | 70 | 95 | 89 | 94 | 95 | 95 | 100 | 100 |
|
| AAAGGYGGWATCGGYAARTCCACCAC | nifHF-Rösch | 31–56 | 16 | 66.0–71.6 | 14 | 25 | 44 | 11 | 34 | 5 | 9 | 70 | 22 | 14 | 0 | 34 | 100 | 5 |
|
| AAAGGYGGWATCCGYAARTCCACCAC | rösch F-1b | 31–56 | 16 | 66.0–71.6 | 0 | 14 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| GGTATYGGYAARTCSACSAC | RL28 | 37–56 | 32 | 57.7–64.1 | 41 | 70 | 84 | 43 | 21 | 41 | 62 | 57 | 6 | 12 | 3 | 24 | 100 | 5 |
|
| ATHGTIGGITGYGAYCCIAARGCIGA | KAD3 | 106–131 | 16 | 70.1–76.8 | 70 | 84 | 94 | 89 | 90 | 15 | 80 | 97 | 80 | 82 | 1 | 70 | 100 | 24 |
|
| GGNTGYGAYCCNAARGC | 469 | 112–128 | 128 | 53.4–67.4 | 88 | 92 | 98 | 82 | 94 | 92 | 95 | 98 | 79 | 96 | 35 | 77 | 100 | 73 |
|
| GGITGTGAYCCNAAVGCNGA | nif112 | 112–131 | 96 | 60.9–70.4 | 29 | 90 | 92 | 21 | 37 | 30 | 34 | 36 | 28 | 69 | 11 | 30 | 60 | 22 |
|
| GGITGYGAYCCNAAVGCNGA | nifH-univ-f112 | 112–131 | 192 | 60.9–72.7 | 88 | 91 | 98 | 82 | 91 | 92 | 96 | 98 | 79 | 97 | 35 | 77 | 100 | 73 |
|
| TGYGAYCCNAARGCNGA | nifH2 | 115–131 | 128 | 54.0–68.1 | 95 | 98 | 98 | 93 | 98 | 96 | 98 | 98 | 98 | 96 | 37 | 93 | 99 | 89 |
|
| TGYGAYCCIAARGCIGA | Kadino | 115–131 | 8 | 60.2–67.9 | 95 | 98 | 98 | 93 | 98 | 96 | 98 | 98 | 98 | 96 | 37 | 93 | 99 | 89 |
|
| TGYGAYCCIAAIGCIGA | F2 | 115–131 | 4 | 62.3–67.9 | 96 | 98 | 98 | 94 | 98 | 98 | 99 | 99 | 98 | 98 | 37 | 94 | 99 | 93 |
|
| TGCGAYCCSAARGCBGACTC | polF | 115–134 | 24 | 63.8–70.1 | 39 | 70 | 88 | 51 | 12 | 26 | 41 | 61 | 34 | 6 | 3 | 54 | 15 | 8 |
|
| GAYCCNAARGCNGACTC | nifH11 | 118–134 | 64 | 52.7–63.4 | 72 | 96 | 98 | 79 | 67 | 50 | 82 | 97 | 78 | 48 | 17 | 78 | 66 | 55 |
|
| CTCCGGGCCRCCNGAYTC | FGPH273′ | 262–279 | 16 | 63.7–70.7 | 11 | 44 | 79 | 11 | 0 | 3 | 6 | 93 | 0 | 3 | 1 | 14 | 2 | 5 |
|
| GMRCCIGGIGTIGGYTGYGC | nifH-2f | 277–296 | 16 | 69.2–78.3 | 87 | 99 | 99 | 92 | 80 | 74 | 94 | 98 | 95 | 72 | 20 | 85 | 85 | 79 |
|
| CCRCCRCANACMACGTC | Cy55Nh428R | 388–404 | 32 | 56.6–67.5 | 66 | 94 | 99 | 68 | 76 | 49 | 63 | 76 | 55 | 49 | 23 | 56 | 65 | 80 |
|
| AAICCRCCRCAIACIACRTC | Ueda407R | 388–407 | 8 | 63.9–70.6 | 91 | 99 | 99 | 92 | 95 | 85 | 90 | 99 | 64 | 79 | 71 | 87 | 95 | 93 |
|
| ATIGCRAAICCICCRCAIACIACRTC | DVV | 388–413 | 8 | 71.7–75.8 | 94 | 98 | 99 | 93 | 94 | 94 | 96 | 98 | 90 | 95 | 52 | 91 | 96 | 93 |
|
| GGCATNGCRAANCCVCCRCANAC | MehtaR | 394–416 | 768 | 63.2–75.1 | 92 | 98 | 99 | 92 | 94 | 89 | 95 | 98 | 89 | 89 | 49 | 89 | 97 | 92 |
|
Data indicate primer binding to all nifH sequences in the database with 0, 1, and 2 mismatches allowed. In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.
Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV).
Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).
Sequences are given in the 5′ to 3′ direction, IUPAC characters are used, and I = Inosine.
Position is relative to A. vinelandii nifH (Genbank ACCN# M20568).
Degeneracy is given as the number of oligonucleotides that comprise the primer.
References in which the primers are described.
We altered these primer names in order to distinguish them from primers with similar name and sequence composition that originate from other sources.
The 5′ linker sequence ATA GGA TCC was removed from this primer.
NA: Data not available as described in Methods.
Properties of universal primers and their coverage for phylogenetic and environmental groupings in the nifH database; continued from Table 1.
|
| Specific groupings | Environ. | |||||||||||||||||
| Primer | Name | Pos. | Deg. | Tm (°C) | 0 | 1 | 2 | Pr | Cy | III | IA | Fr | Pb | Ep | IV | Soil | Mat | Sea | Ref. |
| ATIGGCATIGCRAAICCICCRCAIAC | VCG | 394–419 | 4 | 73.9–76.7 | 93 | 98 | 99 | 93 | 94 | 94 | 95 | 98 | 90 | 96 | 33 | 91 | 96 | 91 |
|
| TGGGCYTTGTTYTCRCGGATYGGCAT | nifHRc | 412–437 | 16 | 69.1–74.2 | 11 | 34 | 51 | 18 | 0 | 0 | 11 | 0 | 13 | 0 | 0 | 12 | 3 | 9 |
|
| TGSGCYTTGTCYTCRCGGATBGGCAT | nifHRb | 412–437 | 48 | 70.0–76.0 | 0 | 33 | 56 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
|
| SACGATGTAGATPTCCTG | PicenoR436 | 436–453 | 4 | NA | 34 | 60 | 83 | 51 | 10 | 24 | 13 | 97 | 7 | 1 | 3 | 36 | 28 | 22 |
|
| TCIGGIGARATGATGGC | R6 | 457–473 | 2 | 61.1–62.5 | 96 | 97 | 99 | 97 | 99 | 86 | 99 | 99 | 102 | 97 | 15 | 94 | 99 | 97 |
|
| ATSGCCATCATYTCRCCGGA | polR | 457–476 | 8 | 63.7–67.5 | 35 | 63 | 86 | 36 | 15 | 19 | 40 | 90 | 23 | 12 | 0 | 55 | 7 | 2 |
|
| ADNGCCATCATYTCNCC | nifH1 | 460–476 | 96 | 52.5–63.9 | 94 | 99 | 99 | 94 | 96 | 91 | 96 | 99 | 103 | 80 | 13 | 91 | 98 | 87 |
|
| ADWGCCATCATYTCRCC | nifH22 | 460–476 | 24 | 53.2–60.9 | 17 | 89 | 98 | 16 | 23 | 21 | 22 | 1 | 49 | 20 | 3 | 11 | 31 | 36 |
|
| ANDGCCATCATYTCNCC | nifH2-ZANIi | 460–476 | 96 | 52.5–63.6 | 54 | 98 | 99 | 48 | 63 | 70 | 73 | 11 | 83 | 83 | 10 | 63 | 61 | 76 |
|
| TANANNGCCATCATYTCNCC | 470 | 460–479 | 512 | 53.8–65.7 | 80 | 82 | 98 | 84 | 43 | 62 | 88 | 99 | 80 | 98 | 9 | 79 | 6 | 95 |
|
| GCRTAIABNGCCATCATYTC | nifH-univ-463r | 463–482 | 48 | 55.7–63.8 | 85 | 87 | 88 | 91 | 50 | 62 | 91 | 99 | 88 | 99 | 8 | 93 | 83 | 72 |
|
| GCRTAIAIIGCCATCATYTC | Emino | 463–482 | 4 | 60.2–63.4 | 86 | 87 | 88 | 91 | 50 | 64 | 91 | 99 | 88 | 100 | 8 | 93 | 83 | 76 |
|
| ATGATGGCSATGTAYGCSGCSAACAA | nifHR-2i | 466–491 | 16 | 70.0–71.7 | 49 | 58 | 87 | 36 | 17 | 35 | 53 | 99 | 60 | 58 | 0 | 72 | 100 | 4 |
|
| TTGTTSGCSGCRTACATSGCCATCAT | nifHR | 466–491 | 16 | 70.0–71.7 | 49 | 58 | 87 | 36 | 17 | 35 | 53 | 99 | 60 | 58 | 0 | 72 | 100 | 4 |
|
| TTGTTGGCIGCRTASAKIGCCAT | nifH-3r | 469–491 | 8 | 68.5–72.1 | 48 | 89 | 94 | 66 | 7 | 39 | 72 | 4 | 30 | 44 | 3 | 77 | 100 | 0 |
|
| ATRTTRTTNGCNGCRTA | nifH3 | 494–478 | 128 | 46.1–61.5 | 94 | 95 | 98 | 93 | 96 | 86 | 88 | 100 | 78 | 93 | 50 | 93 | 100 | 100 |
|
| YAAATRTTRTTNGCNGCRTA | YAA-polyi | 478–497 | 256 | 49.5–63.5 | 1 | 12 | 51 | 0 | 0 | 7 | 1 | 0 | 0 | 0 | 5 | 0 | 0 | 21 |
|
| CAGATCAGVCCGCCSAGRCGMAC | RL25 | 532–554 | 24 | 67.5–74.1 | 4 | 29 | 61 | 5 | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
|
|
| primer-r | 619–638 | 1 | 64.3 | 4 | 16 | 43 | 3 | 0 | 24 | 0 | 0 | 0 | 29 | 0 | 0 | 0 | 0 |
|
| GCTACTACYTCGCCSGA | AMR-R | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| |||
Data indicate primer binding to all nifH sequences in the database with 0, 1, and 2 mismatches allowed. In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.
Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV).
Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).
Sequences are given in the 5′ to 3′ direction, IUPAC characters are used, and I = Inosine.
Position is relative to A. vinelandii nifH (Genbank ACCN# M20568).
Degeneracy is given as the number of oligonucleotides that comprise the primer.
References in which the primers are described.
We altered these primer names in order to distinguish them from primers with similar name and sequence composition that originate from other sources.
NA: Data not available as described in Methods.
Properties of group-specific primers and their coverage for phylogenetic and environmental groupings in the nifH database.
|
| Specific groupings | Environ. | ||||||||||||||||||
| Primer | Name | Tg. | Pos. | Deg. | Tm (°C) | 0 | 1 | 2 |
|
| III | IA |
|
| Ep | IV | Soil | Mat | Sea | Ref. |
| CGCIWTYTACGGIAARGGIGG | ChenBR1 | BR | 18–38 | 512 | 66.6–69.8 | 42 | 81 | 94 | 63 | 6 | 10 | 60 | 85 | 60 | 27 | 9 | 0 | 0 | 0 |
|
| GCSTTCTACGGMAAGGGTGG | nifH-f1-forA | Fr | 19–38 | 4 | 63.9–66.7 | 3 | 7 | 24 | 0 | 0 | 0 | 0 | 85 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| GCRTTYTACGGYAARGGSGG | nifH-a1-forA | AP | 19–38 | 32 | 60.6–69.1 | 14 | 38 | 70 | 26 | 20 | 0 | 4 | 0 | 18 | 0 | 0 | 0 | 100 | 11 |
|
| TACGGNAARGGSGGNATCGGCAA | nifHF | R | 25–47 | 64 | 66.7–73.9 | 20 | 47 | 78 | 32 | 4 | 11 | 12 | 0 | 17 | 7 | 10 | 17 | 100 | 11 |
|
| GGTATYGGYAARTGYACYAC | primer-3 | RA | 37–56 | 32 | 52.6–64.8 | 0 | 19 | 56 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
|
| nifHf1 | Fr | 43–61 | 1 | 67.0 | 1 | 2 | 4 | 0 | 0 | 0 | 0 | 30 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| ATYGTCGGYTGYGAYCCSAARGC | Olsen1 | AM | 106–128 | 64 | 65.0–73.6 | 37 | 66 | 81 | 53 | 2 | 3 | 58 | 57 | 0 | 14 | 0 | 38 | 100 | 0 |
|
|
| cyanoF | Cy | 108–131 | 1 | 68.8 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
|
| nifH-b1-forB | AB | 112–131 | 1 | 68.3 | 1 | 10 | 32 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 2 | 0 | 0 |
|
|
| nifH-g1-forB | GP | 112–131 | 1 | 64.1 | 0 | 3 | 10 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
|
| GGWTGTGATCCWAARGCVGA | nifH-c1-forB | AN | 112–131 | 24 | 58.7–64.3 | 1 | 8 | 25 | 1 | 1 | 1 | 0 | 0 | 0 | 4 | 0 | 1 | 0 | 4 |
|
|
| nifH-a2-forB | AP | 112–131 | 1 | 70.3 | 1 | 10 | 33 | 2 | 0 | 0 | 0 | 0 | 23 | 0 | 0 | 1 | 0 | 0 |
|
| GGMTGCGAYCCSAARGCSGA | nifH-a1-forB | AP | 112–131 | 32 | 66.2–72.7 | 27 | 58 | 73 | 29 | 1 | 31 | 41 | 22 | 33 | 5 | 8 | 15 | 20 | 0 |
|
| GGBTGYGACCCSAASGCYGA | nifH-f1-forB | Fr | 112–131 | 48 | 65.9–72.9 | 22 | 48 | 74 | 15 | 1 | 9 | 17 | 91 | 2 | 3 | 3 | 22 | 20 | 0 |
|
|
| nifHFor | MS | 136–160 | 1 | 74.7 | 11 | 20 | 32 | 21 | 0 | 0 | 5 | 0 | 0 | 0 | 0 | 16 | 0 | 9 |
|
| TAARGCTCAAACTACCGTAT | cylnif-F | Cs | 156–175 | 2 | 56.2–57.9 | 1 | 1 | 3 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
|
|
| NIFH2F | TB | 231–250 | 1 | 63.1 | 0 | 2 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 |
|
|
| NIFH5R | TB | 306–326 | 1 | 61.6 | 10 | 22 | 36 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 18 | 0 | 7 |
|
|
| nifH-269 | Fr | 325–341 | 1 | 64.2 | 3 | 3 | 3 | 0 | 0 | 0 | 0 | 85 | 0 | 0 | 0 | 4 | 0 | 0 |
|
|
| cylnif-R | Cs | 356–375 | 1 | 54.6 | 1 | 1 | 4 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
|
|
| NifHRev | MS | 427–452 | 1 | 67.5 | 13 | 29 | 43 | 23 | 0 | 0 | 6 | 0 | 3 | 0 | 0 | 15 | 1 | 3 |
|
| GACGATGTAGATYTCCTG | primer 4 = AQE | RA | 436–453 | 2 | 53.8–55.1 | 24 | 54 | 81 | 33 | 8 | 21 | 12 | 69 | 7 | 0 | 1 | 19 | 24 | 13 |
|
|
| cyanoR | Cy | 460–482 | 1 | 63.6 | 4 | 8 | 23 | 2 | 25 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 8 |
|
| GCGTACATSGCCATCATCTC | nifH-f1-rev | Fr | 463–482 | 2 | 62.2–62.3 | 23 | 44 | 60 | 14 | 0 | 3 | 35 | 94 | 0 | 7 | 0 | 20 | 0 | 0 |
|
|
| nifH-b1-rev | AB | 463–482 | 1 | 62.3 | 8 | 32 | 53 | 9 | 0 | 3 | 33 | 6 | 0 | 7 | 0 | 18 | 0 | 0 |
|
|
| nifH-g1-rev | GP | 463–482 | 1 | 62.3 | 8 | 32 | 53 | 9 | 0 | 3 | 33 | 6 | 0 | 7 | 0 | 18 | 0 | 0 |
|
| GCATAYASKSCCATCATYTC | nifH-c1-rev | AN | 463–482 | 8 | 55.4–62.3 | 1 | 13 | 58 | 1 | 0 | 3 | 1 | 0 | 0 | 4 | 0 | 2 | 0 | 0 |
|
|
| nifH-a2-rev | AP | 463–482 | 1 | 64.0 | 2 | 17 | 43 | 4 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 3 | 0 | 0 |
|
|
| nifH-a1-rev | AP | 463–482 | 1 | 62.0 | 9 | 16 | 33 | 17 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 |
|
| ATGGTGTTGGCGGCRTAVAKSGCCATCAT | Olsen2 | AM | 466–494 | 24 | 71.5–75.3 | 0 | 32 | 54 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
|
| nifHr | Fr | 671–690 | 1 | 65.9 | 0 | 3 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 24 | 0 | 0 | 0 | 0 |
|
| GGIKCRTAYTSGATIACIGTCAT | ChenBR2 | BR | 676–698 | 1024 | 63.6–69.1 | 31 | 67 | 87 | 40 | 0 | 0 | 7 | 71 | 0 | 0 | 0 | 39 | 0 | 0 |
|
|
| FGPH750 | Fr | 759–778 | 1 | 66.8 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| AGCATGTCYTCSAGYTCNTCCA | nifHI | R | 785–806 | 32 | 63.3–68.8 | 24 | 41 | 51 | 44 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 |
|
|
| FGPD913′ | Fr | NA | 1 | 66.3 | 10 | 10 | 10 | 0 | 0 | NA | NA | 100 | NA | NA | 0 | NA | NA | NA |
|
Data indicate primer binding to all nifH sequences in the database with 0, 1, and 2 mismatches allowed. In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.
Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha-, Beta-, and Gammaproteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilonproteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV).
Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).
Sequences are given in the 5′ to 3′ direction, IUPAC characters are used, and I = Inosine.
Abbreviations indicate the Target Group (Tg.) which the primer was intended to amplify as follows: β-Rhizobia (BR); Frankia (Fr); Alphaproteobacteria (AP); Symbiotic rhizobia (R); reamplification of Cluster I (RA); aerobic and microaerophilic diazotrophs (AM); Cyanobacteria (Cy); Alpha- and Betaproteobacteria (AB); Gammaproteobacteria (GP); alternative nitrogenase cluster (AN); designed to match multiple species of Azospirillum, Burkholderia, Gluconoacetobacter, Azotobacter, Herbaspirillum and Azoarcus (MS); species of the cyanobacterial genus Cylindrospermopsis (Cs); Bradyrhizobium sp. prevalent in truffles (TB).
Position is relative to A. vinelandii nifH (Genbank ACCN# M20568).
Degeneracy is given as the number of oligonucleotides that comprise the primer.
References in which the primers are described.
This binding position for this primer sequence lies beyond the stop codon of Frankia sp. (Genbank ACCN# M21132) and cannot be represented using the A. vinelandii numbering system.
NA: Data not available as described in Methods.
Properties of universal primer pairs and their coverage for phylogenetic and environmental groupings in the nifH database.
| Specific groupings | Environ. | |||||||||||||
| Primer set | Pos. | Len. |
|
|
| III | IA |
|
| Ep | IV | Soil | Mat | Sea |
| Nh21F/Cy55Nh428R | 19–404 | 386 | 67 | 71 | 98 | 45 | 74 | 62 | 73 | 25 | 13 | 93 | 100 | 89 |
| Ueda19F/407R | 19–407 | 389 | 86 | 86 | 100 | 82 | 100 | 100 | 80 | 75 | 48 | 0 | 100 | 100 |
| NH21F/nifH1 | 19–476 | 458 | 91 | 90 | 100 | 85 | 100 | 100 | 100 | 82 | 1 | NA | 100 | 100 |
| nifH19F/nifH-univ463R | 19–482 | 464 | 88 | 86 | 96 | 76 | 100 | 100 | 100 | 100 | 1 | NA | 100 | 100 |
| Ueda19F/nifH-univ463r | 19–482 | 464 | 87 | 86 | 96 | 76 | 100 | 100 | 100 | 82 | 1 | NA | 100 | 100 |
| 19F/nifH3 | 19–494 | 476 | 92 | 87 | 96 | 100 | 100 | 100 | 100 | 82 | 32 | NA | 100 | 100 |
| Nh21F/nifH3 | 19–494 | 476 | 92 | 87 | 96 | 100 | 100 | 100 | 100 | 82 | 32 | NA | 100 | 100 |
| nifH3/nifH4 | 22–494 | 473 | 49 | 68 | 96 | 0 | 0 | 100 | 100 | 27 | 14 | NA | 100 | 78 |
| Primer-f/Primer-r | 23–638 | 616 | 8 | 10 | 0 | 9 | 0 | 0 | 0 | 39 | 0 | NA | 0 | 0 |
| FGPH19/FGPH273 | 25–279 | 255 | 3 | 3 | 0 | 1 | 0 | 71 | 0 | 0 | 0 | 0 | 0 | 0 |
| PicenoF44/PicenoR436 | 25–453 | 429 | 33 | 52 | 2 | 5 | 6 | 85 | 0 | 2 | 0 | 12 | 0 | 0 |
| F1/R6 | 25–473 | 449 | 85 | 88 | 100 | 61 | 94 | 92 | 100 | 93 | 6 | 94 | 100 | 100 |
| FGPH19/PolR | 25–476 | 452 | 6 | 6 | 0 | 0 | 7 | 77 | 0 | 0 | 0 | 0 | 0 | 0 |
| F1/nifH3r | 25–491 | 467 | 51 | 67 | 8 | 42 | 87 | 0 | 75 | 39 | 3 | 25 | 100 | 0 |
| MehtaF/MehtaR | 28–416 | 389 | 56 | 44 | 81 | 73 | 44 | 88 | 75 | 43 | 55 | 52 | 100 | 95 |
| IGK/FGPH273′ | 31–279 | 249 | 9 | 12 | 0 | 1 | 7 | 35 | 0 | 0 | 1 | 8 | 0 | 0 |
| IGK/DVV | 31–413 | 383 | 83 | 84 | 85 | 86 | 68 | 87 | 89 | 88 | 70 | 78 | 100 | 85 |
| IGK/VCG | 31–419 | 389 | 86 | 86 | 84 | 91 | 90 | 87 | 78 | 96 | 37 | 74 | 100 | 100 |
| nifHF/nifHRb | 31–437 | 407 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| nifHF/nifHRc | 31–437 | 407 | 3 | 4 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| IGK/primer-4 = AQE | 31–453 | 423 | 30 | 39 | 6 | 7 | 3 | 81 | 13 | 0 | 0 | 19 | 0 | 0 |
| IGK/PolR | 31–476 | 446 | 32 | 32 | 22 | 31 | 68 | 24 | 25 | 44 | 0 | 32 | 100 | 0 |
| nifHF-Rösch/nifHR | 31–491 | 461 | 26 | 17 | 34 | 10 | 42 | 66 | 50 | 33 | 0 | 56 | 100 | 0 |
| IGK/YAA = nifH3 | 31–494 | 464 | 93 | 90 | 97 | 96 | 100 | 100 | 100 | 100 | 49 | 100 | 100 | 100 |
| RL28/RL25 | 37–554 | 518 | 5 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| KAD3/DVV | 106–413 | 308 | 66 | 84 | 83 | 14 | 81 | 96 | 64 | 77 | 1 | 54 | 100 | 25 |
| KAD3/VCG | 106–419 | 314 | 70 | 84 | 84 | 15 | 69 | 96 | 64 | 83 | 1 | 62 | 100 | 30 |
| 469/R6 | 112–473 | 362 | 82 | 79 | 92 | 69 | 76 | 98 | 53 | 96 | 9 | 61 | 100 | 80 |
| 469/nifH1 | 112–476 | 365 | 81 | 76 | 92 | 78 | 69 | 98 | 53 | 77 | 8 | 57 | 100 | 71 |
| 469/470 | 112–479 | 368 | 83 | 78 | 91 | 72 | 78 | 98 | 53 | 95 | 8 | 63 | 100 | 79 |
| nifHFor/470 | 112–479 | 368 | 82 | 78 | 91 | 72 | 78 | 98 | 53 | 95 | 8 | 62 | 100 | 79 |
| nif112/nifH-univ463R | 112–482 | 371 | 39 | 28 | 64 | 53 | 48 | 31 | 47 | 73 | 4 | 46 | 60 | 64 |
| nifB/nifHRev | 112–482 | 371 | 17 | 8 | 63 | 21 | 14 | 20 | 27 | 13 | 0 | 12 | 0 | 64 |
| PolF/primer-4 = AQE | 115–453 | 339 | 18 | 26 | 1 | 13 | 6 | 40 | 0 | 0 | 0 | 24 | 4 | 2 |
| F2/R6 | 115–473 | 359 | 95 | 95 | 98 | 84 | 98 | 98 | 103 | 97 | 13 | 92 | 99 | 91 |
| nifH2/R6 | 115–473 | 359 | 94 | 94 | 98 | 83 | 97 | 98 | 103 | 95 | 13 | 90 | 99 | 88 |
| nifH1/nifH2 | 115–476 | 362 | 92 | 91 | 96 | 88 | 94 | 98 | 104 | 77 | 11 | 86 | 99 | 81 |
| PolF/PolR | 115–476 | 362 | 25 | 30 | 2 | 11 | 32 | 59 | 21 | 3 | 0 | 51 | 0 | 0 |
| Kadino/Emino | 115–482 | 368 | 83 | 87 | 51 | 67 | 79 | 98 | 87 | 97 | 7 | 84 | 100 | 76 |
| Kadino/nifH-univ-463R | 115–482 | 368 | 82 | 86 | 51 | 65 | 79 | 98 | 87 | 96 | 7 | 84 | 100 | 72 |
| nifH11/nifH22 | 118–476 | 359 | 12 | 15 | 10 | 7 | 17 | 1 | 47 | 6 | 1 | 8 | 17 | 22 |
| nifH-2f/nifH-3r | 277–491 | 215 | 45 | 63 | 6 | 31 | 66 | 3 | 30 | 33 | 0 | 74 | 100 | 0 |
Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV). In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.
Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).
Position of amplicon in nifH is relative to A. vinelandii nifH (Genbank ACCN# M20568).
Length expected for PCR amplicon.
Data indicate primer binding with 0 mismatches to all nifH sequences in the database.
NA Data not available as nucleotide information is not available for the target group in the region of primer binding.
Properties of group-specific primer pairs and their coverage for phylogenetic and environmental groups.
| Specific groupings | Environ. | |||||||||||||
| Primer set | Pos. | Len. |
|
|
| III | IA |
|
| Ep | IV | Soil | Mat | Sea |
| ChenBR1/ChenBR2 | 18–698 | 681 | 19 | 35 | 0 | 0 | 7 | 75 | 0 | 0 | 0 | NA | 0 | 0 |
| nifH-a1-forA/nifH-a1-rev | 19–482 | 464 | 5 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | 0 | 0 |
| nifH-f1-forA/nifH-f1-rev | 19–482 | 464 | 3 | 0 | 0 | 0 | 0 | 92 | 0 | 0 | 0 | NA | 0 | 0 |
| nifHF/nifHI | 25–806 | 782 | 15 | 33 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 |
| primer-3/primer-4 = AQE | 37–453 | 417 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| nifHf1/nifH-269 | 43–341 | 299 | 1 | 0 | 0 | 0 | 0 | 19 | 0 | 0 | 0 | 0 | 0 | 0 |
| nifHf1/nifHr | 43–690 | 648 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Olsen1/Olsen2 | 106–494 | 389 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| cyanoF/cyanoR | 108–482 | 375 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| nifH-a1-forB/nifH-a1-rev | 112–482 | 371 | 5 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 |
| nifH-a2-forB/nifH-a2-rev | 112–482 | 371 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| nifH-b1-forB/nifH-b1-rev | 112–482 | 371 | 1 | 2 | 0 | 0 | 6 | 0 | 0 | 0 | 0 | 5 | 0 | 0 |
| nifH-c1-forB/nifH-c1-rev | 112–482 | 371 | 1 | 1 | 0 | 3 | 1 | 0 | 0 | 4 | 0 | 4 | 0 | 0 |
| nifH-f1-forB/nifH-f1-rev | 112–482 | 371 | 20 | 4 | 0 | 2 | 7 | 87 | 0 | 0 | 0 | 6 | 0 | 0 |
| nifH-g1-forB/nifH-g1-rev | 112–482 | 371 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 2 | 0 | 0 |
| nifHFor/NifHRev | 136–452 | 317 | 5 | 9 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 3 | 0 | 0 |
| cylnif-F/cylnif-R | 156–375 | 220 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| NIFH2F/NIFH5R | 231–326 | 96 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| FGPH750/FGPD913′ | 759−f | 116 | 0 | 0 | 0 | NA | NA | 0 | NA | NA | 0 | NA | NA | NA |
Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV). In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.
Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).
Position of amplicon in nifH is relative to A. vinelandii nifH (Genbank ACCN# M20568).
Length expected for PCR amplicon.
Data indicate primer binding with 0 mismatches to all nifH sequences in the database.
This binding position for the reverse primer sequence lies beyond the stop codon of Frankia sp. (Genbank ACCN# M21132) and cannot be represented using the A. vinelandii numbering system.
NA Data not available as nucleotide information is not available for the target group in the region of primer binding.
Empirical results of PCR using different nifH primer sets with DNA from isolates and soilsa.
| AT (°C) | Dv | Gu | Av | Fs | Ml | Kp | Xa | Rs | Rl | Pn | Ec | AS | LS | NT | |
| F2/R6 | 51 | − | + | + | − | + | + | − | ns | − | − | − | + | + | − |
| IGK3/DVV | 58 | + | + | + | + | + | + | + | + | + | + | − | + | + | − |
| Ueda19F/388R | 51 | + | + | + | − | + | + | + | + | ns | + | ns | + | + | − |
| nifH2/R6 | 44 | ns | + | + | + | + | + | ns | − | + | − | ns | s | s | − |
| nH21f/nifH1 | 46 | ns | Ns | + | − | ns | s | s | − | ||||||
| nifH1/nifH2 | 46 | ns | + | + | − | ns | s | s | − | ||||||
| Ueda19f/univ463r | 46 | + | ns | + | − | + | + | + | − | + | + | ns | s | s | − |
| nifH3/nH21f | 41 | ns | ns | + | − | ns | s | s | − |
DNA samples and their phylogenetic affiliation in the nifH phylogeny from Figure S2 are: Desulfovibrio vulgaris Hildenborough (Dv), cluster III; Geobacter uraniireducens Rf4 (Gu), subcluster IA; Azotobacter vinelandii DJ (Av), Alpha-, Beta- and Gamma-Proteobacteria; Frankia sp. CcI3 (Fs), Frankia; Mastigocladus laminosus UTEX LB 1931 (Ml), Cyanobacteria; Klebsiella pneumoniae 342 (Kp), Alpha-, Beta- and Gammaproteobacteria; Xanthobacter autotrophicus Py2 (Xa), Alpha-, Beta- and Gammaproteobacteria; Rhodobacter sphaeroides 2.4.1 (Rs), Alpha-, Beta- and Gammaproteobacteria; Rhizobium leguminosarium bv. trifolii (Rl), Alpha-, Beta- and Gammaproteobacteria; Polaromonas naphthalenivorans CJ2 (Pn), Alpha-, Beta- and Gammaproteobacteria; Eschericia coli (Ec), genomic-DNA negative control; agricultural soil (AS); lawn soil (LS); No Template Control (NT). The symbols used are: product of the correct size (+), no product produced (−), non-specific amplification producing multiple bands or a single band of the wrong size (ns), a smeared band of indiscriminate size overlapping in size with the expected product (s). Blank cells indicate that the evaluation was not performed.
Annealing Temperature (AT) used in PCR.