Literature DB >> 22848735

A comprehensive evaluation of PCR primers to amplify the nifH gene of nitrogenase.

John Christian Gaby1, Daniel H Buckley.   

Abstract

The nifH gene is the most widely sequenced marker gene used to identify nitrogen-fixing Bacteria and Archaea. Numerous PCR primers have been designed to amplify nifH, but a comprehensive evaluation of nifH PCR primers has not been performed. We performed an in silico analysis of the specificity and coverage of 51 universal and 35 group-specific nifH primers by using an aligned database of 23,847 nifH sequences. We found that there are 15 universal nifH primers that target 90% or more of nitrogen fixers, but that there are also 23 nifH primers that target less than 50% of nifH sequences. The nifH primers we evaluated vary in their phylogenetic bias and their ability to recover sequences from commonly sampled environments. In addition, many of these primers will amplify genes that do not mediate nitrogen fixation, and thus it would be advisable for researchers to screen their sequencing results for the presence of non-target genes before analysis. Universal primers that performed well in silico were tested empirically with soil samples and with genomic DNA from a phylogenetically diverse set of nitrogen-fixing strains. This analysis will be of great utility to those engaged in molecular analysis of nifH genes from isolates and environmental samples.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22848735      PMCID: PMC3405036          DOI: 10.1371/journal.pone.0042149

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Nitrogen-fixing microorganisms are globally significant in that they provide the only natural biological source of fixed nitrogen in the biosphere. These organisms enzymatically transform dinitrogen gas from the atmosphere into ammonium equivalents needed for biosynthesis of essential cellular macromolecules. Nitrogen-fixing bacteria are diverse, and most of the known taxa have not yet been cultivated in the laboratory [1]. Nitrogen fixation is carried out by the nitrogenase enzyme whose multiple subunits are encoded by the genes nifH, nifD, and nifK (as reviewed in [2]). Of the three, nifH (encoding the nitrogenase reductase subunit) is the most sequenced and has become the marker gene of choice for researchers studying the phylogeny, diversity, and abundance of nitrogen-fixing microorganisms. Thus, many PCR primers have been developed to target the nifH gene with the purpose of amplifying this gene sequence from environmental samples. Through use of nifH as a marker gene, researchers have been able to characterize aspects of the diversity and ecology of nitrogen-fixing Bacteria and Archaea. A wide range of environments have been sampled for nifH gene diversity including marine [3], terrestrial [4], extreme [5], anthropogenic [6], host-associated [7], and agricultural [8]. Analysis of these data indicate that the distribution of diazotrophs in the environment varies as a function of habitat type [1]. While more than 3,358 OTU0.05 nifH sequence types have been determined, the global census of diazotroph diversity remains far from complete [9]. Rates of nitrogen fixation have been associated with both nifH abundance [10] and nifH diversity [11], and thus knowledge of diazotroph community structure and dynamics is required to understand the ecological constraints on nitrogen fixation in microbial communities. Phylogenetic analyses of nifH gene sequences have revealed five primary clusters of genes homologous to nifH [12]–[15]. Cluster I consists of aerobic nitrogen fixers including Proteobacteria, Cyanobacteria, Frankia, and Paenibacillus. Cluster II is generally thought of as the alternative nitrogenase cluster because it contains sequences from FeFe and FeV nitrogenases which differ from the conventional FeMo cofactor-containing nitrogenase. Cluster III consists of anaerobic nitrogen fixers from Bacteria and Archaea including for instance the Desulfovibrionaceae, Clostridia, Spirochataes, and Methanobacteria. Cluster IV and cluster V contain sequences that are paralogs of nifH and which are not involved in nitrogen fixation [13]. We set out to provide a comprehensive evaluation of primer coverage for researchers wishing to use the nifH gene as a molecular marker for the study of nitrogen-fixing Bacteria and Archaea. Primers that target diverse nifH sequences must be degenerate to encompass the sequence variability of the nifH gene, and Zehr and McReynolds were the first to design such degenerate primers [16], [17]. There have since been numerous efforts to design both universal and group-specific nifH primer sets. In a survey of the literature, we have found 51 universal and 35 group-specific primers that have been paired to make 42 universal and 19 group-specific primer sets. We have performed an in silico evaluation of all of these nifH primers using an aligned database of all publicly available nifH sequences which we constructed previously [9]. We then performed empirical tests of the best of these primers using genomic DNA from a phylogenetically diverse set of nitrogen fixers and DNA from soil.

Results

Any effort to assess PCR primer coverage in silico must account for variation in sequence depth along the gene alignment of the database being queried. We observe that nucleotide positions near the beginning and end of the nifH gene alignment are under-represented in sequence databases relative to nucleotide positions in the middle of the gene alignment (Figure 1). This problem occurs because a majority of nifH sequences have been generated using PCR primers that bind to conserved nucleotide positions found within the nifH gene. A majority of the 393 full-length nifH sequences currently present in the nifH database are derived from sequenced genomes. The two dips in nucleotide coverage (at position 199 and 350 in Figure 1) result from insertions in the Azotobacter vinelandii nifH reference sequence relative to other genes in the alignment. In addition, some sequences in the alignment have insertions relative to A. vinelandii (data not shown). Due to the variations observed in sequence depth along the alignment, all estimates of primer coverage were calculated with respect to the total number of sequences available at the alignment positions where each primer binds.
Figure 1

Coverage of the nifH gene by sequences and primers in the nifH database.

The number of sequences in the nifH database is depicted in relation to alignment position along the gene. Alignment positions are referenced to the nifH nucleotide position from Azotobacter vinelandii (Genbank ACCN# M20568). Universal nifH primer sequences listed in Table 1 and 2 are indicated by grey horizontal lines.

Coverage of the nifH gene by sequences and primers in the nifH database.

The number of sequences in the nifH database is depicted in relation to alignment position along the gene. Alignment positions are referenced to the nifH nucleotide position from Azotobacter vinelandii (Genbank ACCN# M20568). Universal nifH primer sequences listed in Table 1 and 2 are indicated by grey horizontal lines.
Table 1

Properties of universal primers and their coverage for phylogenetic and environmental groupings in the nifH database; continued in Table 2.

nifH a (%)Specific groupingsb (%)Environ.c (%)
Primerd NamePos.e Deg.f Tm (°C)012PrCyIIIIAFrPbEpIVSoilMatSeaRef.g
GCIWTYTAYGGNAARGGNh21F19–356451.8–61.993961038910010010010091837678100100 [42]
GCIWTITAYGGNAARGGNGGnifH19F19–3812859.5–69.594969689100100100100911007978100100 [43]
GCIWTYTAYGGIAARGGIGGUeda19F19–381662.4–67.99396968910010010010091837678100100 [44]
GCIWTHTAYGGIAARGGIGGIATHGGIAAIGK319–477269.4–75.3929595871009896100911007872100100 [45]
GCGTTCTACGGTAAGGGCGGTATCGGNAARK07-F19–48871.0–72.8131020000000000 [27]
TTYTAYGGNAARGGNGGnifH422–3812849.8–63.552919471100443939113234310078 [46]
TCTACGGAAAGGGCGGTATCGG primer-f23–44166.510143113013000230000 [47]
TACGGCAARGGTGGNATHGFGPH1925–432458.2–66.272463904171024300 [48]
TACGGYAARGGBGGYATCGGIGK-Polyh 25–442460.3–70.61850722245616291709610033 [20]
TACGG(P/K)AAKGG(P/G)GG(P/K)ATPGGPicenoF4425–448NA50859659963525793359313410089 [49]
TAYGGIAARGGIGGIATYGGIAARTCF125–50409660.4–74.57995968410069659383878083100100 [18]
GGHAARGGHGGHATHGGNAARTCMehtaF28–50129657.4–72.9597887469677449444516967100100 [5]
AARGGNGGNATHGGNAAi IGK31–4738462.1–72.590981049110093709589949595100100 [26]
AAAGGYGGWATCGGYAARTCCACCACnifHF-Röschh 31–561666.0–71.61425441134597022140341005 [24]
AAAGGYGGWATCCGYAARTCCACCACrösch F-1bh 31–561666.0–71.60142500000000000 [25]
GGTATYGGYAARTCSACSACRL2837–563257.7–64.141708443214162576123241005 [50]
ATHGTIGGITGYGAYCCIAARGCIGAKAD3106–1311670.1–76.87084948990158097808217010024 [45]
GGNTGYGAYCCNAARGC469112–12812853.4–67.488929882949295987996357710073 [20]
GGITGTGAYCCNAAVGCNGAnif112112–1319660.9–70.42990922137303436286911306022 [43]
GGITGYGAYCCNAAVGCNGAnifH-univ-f112112–13119260.9–72.788919882919296987997357710073 [21]
TGYGAYCCNAARGCNGAnifH2115–13112854.0–68.19598989398969898989637939989 [16]
TGYGAYCCIAARGCIGAKadino115–131860.2–67.99598989398969898989637939989 [20]
TGYGAYCCIAAIGCIGAF2115–131462.3–67.99698989498989999989837949993 [18]
TGCGAYCCSAARGCBGACTCpolF115–1342463.8–70.13970885112264161346354158 [20]
GAYCCNAARGCNGACTCnifH11118–1346452.7–63.47296987967508297784817786655 [51]
CTCCGGGCCRCCNGAYTCFGPH273′262–2791663.7–70.711447911036930311425 [48]
GMRCCIGGIGTIGGYTGYGCnifH-2f277–2961669.2–78.38799999280749498957220858579 [19]
CCRCCRCANACMACGTCCy55Nh428R388–4043256.6–67.56694996876496376554923566580 [42]
AAICCRCCRCAIACIACRTCUeda407R388–407863.9–70.69199999295859099647971879593 [44]
ATIGCRAAICCICCRCAIACIACRTCDVV388–413871.7–75.89498999394949698909552919693 [45]
GGCATNGCRAANCCVCCRCANACMehtaR394–41676863.2–75.19298999294899598898949899792 [5]

Data indicate primer binding to all nifH sequences in the database with 0, 1, and 2 mismatches allowed. In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.

Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV).

Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).

Sequences are given in the 5′ to 3′ direction, IUPAC characters are used, and I = Inosine.

Position is relative to A. vinelandii nifH (Genbank ACCN# M20568).

Degeneracy is given as the number of oligonucleotides that comprise the primer.

References in which the primers are described.

We altered these primer names in order to distinguish them from primers with similar name and sequence composition that originate from other sources.

The 5′ linker sequence ATA GGA TCC was removed from this primer.

NA: Data not available as described in Methods.

Table 2

Properties of universal primers and their coverage for phylogenetic and environmental groupings in the nifH database; continued from Table 1.

nifH a (%)Specific groupingsb (%)Environ.c (%)
Primerd NamePos.e Deg.f Tm (°C)012PrCyIIIIAFrPbEpIVSoilMatSeaRef.g
ATIGGCATIGCRAAICCICCRCAIACVCG394–419473.9–76.79398999394949598909633919691 [45]
TGGGCYTTGTTYTCRCGGATYGGCATnifHRc412–4371669.1–74.2113451180011013001239 [24]
TGSGCYTTGTCYTCRCGGATBGGCATnifHRb412–4374870.0–76.00335610000000100 [24]
SACGATGTAGATPTCCTGPicenoR436436–4534NA3460835110241397713362822 [49]
TCIGGIGARATGATGGCR6457–473261.1–62.596979997998699991029715949997 [18]
ATSGCCATCATYTCRCCGGApolR457–476863.7–67.53563863615194090231205572 [20]
ADNGCCATCATYTCNCCnifH1460–4769652.5–63.994999994969196991038013919887 [16]
ADWGCCATCATYTCRCCnifH22460–4762453.2–60.917899816232122149203113136 [51]
ANDGCCATCATYTCNCCnifH2-ZANIi 460–4769652.5–63.65498994863707311838310636176 [46]
TANANNGCCATCATYTCNCC470460–47951253.8–65.780829884436288998098979695 [20]
GCRTAIABNGCCATCATYTCnifH-univ-463r463–4824855.7–63.8858788915062919988998938372 [43]
GCRTAIAIIGCCATCATYTCEmino463–482460.2–63.48687889150649199881008938376 [20]
ATGATGGCSATGTAYGCSGCSAACAAnifHR-2i 466–4911670.0–71.7495887361735539960580721004 [24]
TTGTTSGCSGCRTACATSGCCATCATnifHR466–4911670.0–71.7495887361735539960580721004 [27]
TTGTTGGCIGCRTASAKIGCCATnifH-3r469–491868.5–72.14889946673972430443771000 [19]
ATRTTRTTNGCNGCRTAnifH3494–47812846.1–61.59495989396868810078935093100100 [46]
YAAATRTTRTTNGCNGCRTAYAA-polyi 478–49725649.5–63.511251007100050021 [20]
CAGATCAGVCCGCCSAGRCGMACRL25532–5542467.5–74.14296150810000100 [50]
GGCACGAAGTGGATCAGCTG primer-r619–638164.3416433024000290000 [47]
GCTACTACYTCGCCSGAAMR-R00000000000000 [27]

Data indicate primer binding to all nifH sequences in the database with 0, 1, and 2 mismatches allowed. In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.

Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV).

Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).

Sequences are given in the 5′ to 3′ direction, IUPAC characters are used, and I = Inosine.

Position is relative to A. vinelandii nifH (Genbank ACCN# M20568).

Degeneracy is given as the number of oligonucleotides that comprise the primer.

References in which the primers are described.

We altered these primer names in order to distinguish them from primers with similar name and sequence composition that originate from other sources.

NA: Data not available as described in Methods.

We mapped the 51 universal primers to their complementary binding positions along the A. vinelandii nifH gene (Figure 1, Figure S1). Many primers bind to the same region (Figure 1, Figure S1), and thus may vary only slightly in binding position, oligonucleotide length, or degeneracy. Data indicate primer binding to all nifH sequences in the database with 0, 1, and 2 mismatches allowed. In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%. Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV). Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea). Sequences are given in the 5′ to 3′ direction, IUPAC characters are used, and I = Inosine. Position is relative to A. vinelandii nifH (Genbank ACCN# M20568). Degeneracy is given as the number of oligonucleotides that comprise the primer. References in which the primers are described. We altered these primer names in order to distinguish them from primers with similar name and sequence composition that originate from other sources. The 5′ linker sequence ATA GGA TCC was removed from this primer. NA: Data not available as described in Methods. Data indicate primer binding to all nifH sequences in the database with 0, 1, and 2 mismatches allowed. In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%. Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV). Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea). Sequences are given in the 5′ to 3′ direction, IUPAC characters are used, and I = Inosine. Position is relative to A. vinelandii nifH (Genbank ACCN# M20568). Degeneracy is given as the number of oligonucleotides that comprise the primer. References in which the primers are described. We altered these primer names in order to distinguish them from primers with similar name and sequence composition that originate from other sources. NA: Data not available as described in Methods. The quality and characteristics of universal nifH PCR primers vary widely (Table 1 and Table 2). Of the universal primers 15 of the 51 were found to hit 90% or more of all nifH sequences while 23 hit less than 50% of these sequences and 9 hit 10% or fewer sequences (Table 1 and Table 2). In general, those universal primers that had >90% coverage for clusters I and III did not demonstrate systematic bias against individual phylogenetic groups within these clusters (Table 1 and Table 2). The primer KAD3 is notable, however, because it misses much of cluster III relative to cluster I (Table 1 and Table 2). Those primers with the highest coverage also tended to recognize a number of non-target sequences from cluster IV (Table 1 and Table 2). The group-specific primers we evaluated generally show poor coverage of the phylogenetic groups they have been designed to target, except for the Frankia-specific primers nifH-f1-forA, nifH-f1-forB, nifH-269, and nifH-f1-rev (Table 3). The primer cyanoR targets Cyanobacteria, but has coverage of only 25%, and its intended pair, primer cyanoF, has a coverage of only 1% of cyanobacterial sequences (Table 3).
Table 3

Properties of group-specific primers and their coverage for phylogenetic and environmental groupings in the nifH database.

nifH a (%)Specific groupingsb (%)Environ.c (%)
Primerd NameTg.e Pos.f Deg.g Tm (°C)012 Pr Cy IIIIA Fr Pb EpIVSoilMatSeaRef.h
CGCIWTYTACGGIAARGGIGGChenBR1BR18–3851266.6–69.842819463610608560279000 [52]
GCSTTCTACGGMAAGGGTGGnifH-f1-forAFr19–38463.9–66.73724000085000000 [21]
GCRTTYTACGGYAARGGSGGnifH-a1-forAAP19–383260.6–69.114387026200401800010011 [21]
TACGGNAARGGSGGNATCGGCAAnifHFR25–476466.7–73.920477832411120177101710011 [53]
GGTATYGGYAARTGYACYACprimer-3RA37–563252.6–64.80195600000000000 [54]
GGCAAGTCCACCACCCAGC nifHf1Fr43–61167.0124000030000000 [55]
ATYGTCGGYTGYGAYCCSAARGCOlsen1AM106–1286465.0–73.6376681532358570140381000 [56]
CGTAGGTTGCGACCCTAAGGCTGA cyanoFCy108–131168.800101000000000 [56]
GGCTGCGATCCCAAGGCTGA nifH-b1-forBAB112–131168.31103220010000200 [21]
GGTTGTGACCCGAAAGCTGA nifH-g1-forBGP112–131164.1031011000000100 [21]
GGWTGTGATCCWAARGCVGAnifH-c1-forBAN112–1312458.7–64.3182511100040104 [21]
GGCTGCGATCCGAAGGCCGA nifH-a2-forBAP112–131170.311033200002300100 [21]
GGMTGCGAYCCSAARGCSGAnifH-a1-forBAP112–1313266.2–72.7275873291314122335815200 [21]
GGBTGYGACCCSAASGCYGAnifH-f1-forBFr112–1314865.9–72.92248741519179123322200 [21]
ACCCGCCTGATCCTGCACGCCAAGG nifHForMS136–160174.71120322100500001609 [57]
TAARGCTCAAACTACCGTATcylnif-FCs156–175256.2–57.911303000000001 [58]
GAAGGTCGGCTACCAGAACA NIFH2FTB231–250163.102610000000200 [59]
AAGTTGATCGAGGTGATGACG NIFH5RTB306–326161.61022362100000001807 [59]
CCGGCCTCCTCCAGGTA nifH-269Fr325–341164.2333000085000400 [60]
ATTTAGACTTCGTTTCCTAC cylnif-RCs356–375154.611403000000001 [58]
ACGATGTAGATTTCCTGGGCCTTGTT NifHRevMS427–452167.51329432300603001513 [57]
GACGATGTAGATYTCCTGprimer 4 = AQERA436–453253.8–55.1245481338211269701192413 [54]
GCATACATCGCCATCATTTCACC cyanoRCy460–482163.64823225010000108 [56]
GCGTACATSGCCATCATCTCnifH-f1-revFr463–482262.2–62.3234460140335940702000 [21]
GCGTACATGGCCATCATCTC nifH-b1-revAB463–482162.3832539033360701800 [21]
GCGTACATGGCCATCATCTC nifH-g1-revGP463–482162.3832539033360701800 [21]
GCATAYASKSCCATCATYTCnifH-c1-revAN463–482855.4–62.31135810310040200 [21]
GCGTAGAGCGCCATCATCTC nifH-a2-revAP463–482164.02174340010010300 [21]
GCATAGAGCGCCATCATCTC nifH-a1-revAP463–482162.091633170100000200 [21]
ATGGTGTTGGCGGCRTAVAKSGCCATCATOlsen2AM466–4942471.5–75.30325400000000000 [56]
CTCGATGACGGTCATCCGGC nifHrFr671–690165.9036000000240000 [55]
GGIKCRTAYTSGATIACIGTCATChenBR2BR676–698102463.6–69.131678740007710003900 [52]
GAAGACGATCCCGACCCCGA FGPH750Fr759–778166.8011000025000000 [48]
AGCATGTCYTCSAGYTCNTCCAnifHIR785–8063263.3–68.824415144000000010000 [53]
GGTCGGGACCTCATCCTCGA FGPD913′FrNAi 166.310101000NANA100NANA0NANANA [48]

Data indicate primer binding to all nifH sequences in the database with 0, 1, and 2 mismatches allowed. In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.

Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha-, Beta-, and Gammaproteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilonproteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV).

Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).

Sequences are given in the 5′ to 3′ direction, IUPAC characters are used, and I =  Inosine.

Abbreviations indicate the Target Group (Tg.) which the primer was intended to amplify as follows: β-Rhizobia (BR); Frankia (Fr); Alphaproteobacteria (AP); Symbiotic rhizobia (R); reamplification of Cluster I (RA); aerobic and microaerophilic diazotrophs (AM); Cyanobacteria (Cy); Alpha- and Betaproteobacteria (AB); Gammaproteobacteria (GP); alternative nitrogenase cluster (AN); designed to match multiple species of Azospirillum, Burkholderia, Gluconoacetobacter, Azotobacter, Herbaspirillum and Azoarcus (MS); species of the cyanobacterial genus Cylindrospermopsis (Cs); Bradyrhizobium sp. prevalent in truffles (TB).

Position is relative to A. vinelandii nifH (Genbank ACCN# M20568).

Degeneracy is given as the number of oligonucleotides that comprise the primer.

References in which the primers are described.

This binding position for this primer sequence lies beyond the stop codon of Frankia sp. (Genbank ACCN# M21132) and cannot be represented using the A. vinelandii numbering system.

NA: Data not available as described in Methods.

Data indicate primer binding to all nifH sequences in the database with 0, 1, and 2 mismatches allowed. In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%. Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha-, Beta-, and Gammaproteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilonproteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV). Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea). Sequences are given in the 5′ to 3′ direction, IUPAC characters are used, and I =  Inosine. Abbreviations indicate the Target Group (Tg.) which the primer was intended to amplify as follows: β-Rhizobia (BR); Frankia (Fr); Alphaproteobacteria (AP); Symbiotic rhizobia (R); reamplification of Cluster I (RA); aerobic and microaerophilic diazotrophs (AM); Cyanobacteria (Cy); Alpha- and Betaproteobacteria (AB); Gammaproteobacteria (GP); alternative nitrogenase cluster (AN); designed to match multiple species of Azospirillum, Burkholderia, Gluconoacetobacter, Azotobacter, Herbaspirillum and Azoarcus (MS); species of the cyanobacterial genus Cylindrospermopsis (Cs); Bradyrhizobium sp. prevalent in truffles (TB). Position is relative to A. vinelandii nifH (Genbank ACCN# M20568). Degeneracy is given as the number of oligonucleotides that comprise the primer. References in which the primers are described. This binding position for this primer sequence lies beyond the stop codon of Frankia sp. (Genbank ACCN# M21132) and cannot be represented using the A. vinelandii numbering system. NA: Data not available as described in Methods. Given that PCR requires two primers used in combination, a useful indication of specificity must account for the coverage obtained when using specific primer pairs (Tables 4 and 5). We evaluated both primer combinations that have been reported in the literature as well as new primer combinations. As expected, the coverage obtained with primer pairs is always lower than the coverage obtained for each individual primer. We evaluated 42 universal primer pair combinations, of which 7 hit >90% of nifH sequences in the database, 24 hit >50%, and 6 hit 10% or less. Those primer sets which had >90% coverage are 19F/nifH3, Nh21F/nifH1, Nh21F/nifH3, IGK/nifH3, F2/R6, nifH2/R6, and nifH1/nifH2 (ie: the Zehr and McReynolds primers). The 6 primer sets which hit 10% or less of cluster I and III are Primer-f/Primer-r, FGPH19/FGPH273′, FGPH19/PolR, IGK/FGPH273′, nifHF/nifHRb, and nifHF/nifHRc. While we evaluated 19 group-specific primer combinations, very few primer sets had high coverage of the designated target groups (Table 5). The primer set ChenBR1/ChenBR2 is designed to target β-Rhizobia but also hits 35% of the sequences within the Alpha-, Beta-, and Gammaproteobacteria and 75% of Frankia sequences. The Frankia-specific primer sets nifH-f1-forA/nifH-f1-rev and nifH-f1-forB/nifH-f1-rev hit 92% and 87% of Frankia respectively.
Table 4

Properties of universal primer pairs and their coverage for phylogenetic and environmental groupings in the nifH database.

Specific groupingsa (%)Environ.b (%)
Primer setPos.c Len.d nifH e Pr Cy IIIIA Fr Pb EpIVSoilMatSea
Nh21F/Cy55Nh428R19–4043866771984574627325139310089
Ueda19F/407R19–4073898686100821001008075480100100
NH21F/nifH119–476458919010085100100100821NA100100
nifH19F/nifH-univ463R19–482464888696761001001001001NA100100
Ueda19F/nifH-univ463r19–48246487869676100100100821NA100100
19F/nifH319–4944769287961001001001008232NA100100
Nh21F/nifH319–4944769287961001001001008232NA100100
nifH3/nifH422–494473496896001001002714NA10078
Primer-f/Primer-r23–63861681009000390NA00
FGPH19/FGPH27325–2792553301071000000
PicenoF44/PicenoR43625–4534293352256850201200
F1/R625–473449858810061949210093694100100
FGPH19/PolR25–4764526600777000000
F1/nifH3r25–491467516784287075393251000
MehtaF/MehtaR28–4163895644817344887543555210095
IGK/FGPH273′31–27924991201735001800
IGK/DVV31–4133838384858668878988707810085
IGK/VCG31–41938986868491908778963774100100
nifHF/nifHRb31–437407000000000000
nifHF/nifHRc31–437407340200000100
IGK/primer-4 = AQE31–45342330396738113001900
IGK/PolR31–47644632322231682425440321000
nifHF-Rösch/nifHR31–49146126173410426650330561000
IGK/YAA = nifH331–4944649390979610010010010049100100100
RL28/RL2537–554518570000000000
KAD3/DVV106–413308668483148196647715410025
KAD3/VCG106–419314708484156996648316210030
469/R6112–473362827992697698539696110080
469/nifH1112–476365817692786998537785710071
469/470112–479368837891727898539586310079
nifHFor/470112–479368827891727898539586210079
nif112/nifH-univ463R112–48237139286453483147734466064
nifB/nifHRev112–482371178632114202713012064
PolF/primer-4 = AQE115–45333918261136400002442
F2/R6115–4733599595988498981039713929991
nifH2/R6115–4733599494988397981039513909988
nifH1/nifH2115–4763629291968894981047711869981
PolF/PolR115–4763622530211325921305100
Kadino/Emino115–482368838751677998879778410076
Kadino/nifH-univ-463R115–482368828651657998879678410072
nifH11/nifH22118–4763591215107171476181722
nifH-2f/nifH-3r277–491215456363166330330741000

Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV). In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.

Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).

Position of amplicon in nifH is relative to A. vinelandii nifH (Genbank ACCN# M20568).

Length expected for PCR amplicon.

Data indicate primer binding with 0 mismatches to all nifH sequences in the database.

NA Data not available as nucleotide information is not available for the target group in the region of primer binding.

Table 5

Properties of group-specific primer pairs and their coverage for phylogenetic and environmental groups.

Specific groupingsa (%)Environ.b (%)
Primer setPos.c Len.d nifH e Pr Cy IIIIA Fr Pb EpIVSoilMatSea
ChenBR1/ChenBR218–698681193500775000NA00
nifH-a1-forA/nifH-a1-rev19–4824645120000000NA00
nifH-f1-forA/nifH-f1-rev19–4824643000092000NA00
nifHF/nifHI25–8067821533000000010000
primer-3/primer-4 = AQE37–453417000000000000
nifHf1/nifH-26943–3412991000019000000
nifHf1/nifHr43–690648000000000000
Olsen1/Olsen2106–494389000000000000
cyanoF/cyanoR108–482375000000000000
nifH-a1-forB/nifH-a1-rev112–4823715110000000200
nifH-a2-forB/nifH-a2-rev112–482371000000000000
nifH-b1-forB/nifH-b1-rev112–482371120060000500
nifH-c1-forB/nifH-c1-rev112–482371110310040400
nifH-f1-forB/nifH-f1-rev112–48237120402787000600
nifH-g1-forB/nifH-g1-rev112–482371110000010200
nifHFor/NifHRev136–452317590010000300
cylnif-F/cylnif-R156–375220000000000000
NIFH2F/NIFH5R231–32696010000000100
FGPH750/FGPD913′759−f 116000NANA0NANA0NANANA

Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV). In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%.

Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea).

Position of amplicon in nifH is relative to A. vinelandii nifH (Genbank ACCN# M20568).

Length expected for PCR amplicon.

Data indicate primer binding with 0 mismatches to all nifH sequences in the database.

This binding position for the reverse primer sequence lies beyond the stop codon of Frankia sp. (Genbank ACCN# M21132) and cannot be represented using the A. vinelandii numbering system.

NA Data not available as nucleotide information is not available for the target group in the region of primer binding.

Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV). In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%. Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea). Position of amplicon in nifH is relative to A. vinelandii nifH (Genbank ACCN# M20568). Length expected for PCR amplicon. Data indicate primer binding with 0 mismatches to all nifH sequences in the database. NA Data not available as nucleotide information is not available for the target group in the region of primer binding. Data indicate primer binding to specific groupings in the nifH phylogeny. Abbreviations are as follows: Alpha, Beta, and Gamma Proteobacteria (Pr); Cyanobacteria (Cy); Cluster III (III); Cluster IA (IA); Paenibacillus (Pb); Frankia (Fr); Epsilon Proteobacteria Containing Cluster (Ep); paralogous sequences in Cluster IV (IV). In some cases highly degenerate primers bind to multiple positions in the sequence generating coverage values that exceed 100%. Primer coverage queried against sequences recovered from specific environments (Environ.) as described in methods. Environments include: soils (Soil), microbial mats (Mat), and pelagic marine samples (Sea). Position of amplicon in nifH is relative to A. vinelandii nifH (Genbank ACCN# M20568). Length expected for PCR amplicon. Data indicate primer binding with 0 mismatches to all nifH sequences in the database. This binding position for the reverse primer sequence lies beyond the stop codon of Frankia sp. (Genbank ACCN# M21132) and cannot be represented using the A. vinelandii numbering system. NA Data not available as nucleotide information is not available for the target group in the region of primer binding. Primer sets with high in silico coverage were used for empirical tests. When tested with DNA from soil, the primer combinations nifH2/R6, nH21f/nifH, nifH1/nifH2, Ueda19f/univ463r, and nifH3/nH21f all produced PCR products of indiscriminate size producing smeared bands in gel electrophoresis and also produced an amplified product from E. coli indicating a lack of specificity for nifH under the amplification conditions tested (Table 6, Figures S9, S10, S11, S12, S13, S14, S15). The primer combinations F2/R6, IGK3/DVV, and Ueda 19F/388R produced a band of the expected size for a diverse range of genomic and soil DNA templates (Table 6, Figures S3, S4, S5, S6, S7, S8), though Ueda 19F/388R was observed to produce an amplified product from E. coli indicating a lack of specificity for nifH under the amplification conditions tested. Overall, the primer pair IGK3/DVV produced the best performance in our empirical analysis, producing PCR products of the expected size from all nitrogen-fixing strains and soil DNA samples tested, while not generating PCR product from the negative controls or producing non-specific PCR products (Table 6, Figures S5 and S6).
Table 6

Empirical results of PCR using different nifH primer sets with DNA from isolates and soilsa.

AT (°C)b DvGuAvFsMlKpXaRsRlPnEcASLSNT
F2/R651++++ns++
IGK3/DVV58++++++++++++
Ueda19F/388R51+++++++ns+ns++
nifH2/R644ns+++++ns+nsss
nH21f/nifH146nsNs+nsss
nifH1/nifH246ns++nsss
Ueda19f/univ463r46+ns++++++nsss
nifH3/nH21f41nsns+nsss

DNA samples and their phylogenetic affiliation in the nifH phylogeny from Figure S2 are: Desulfovibrio vulgaris Hildenborough (Dv), cluster III; Geobacter uraniireducens Rf4 (Gu), subcluster IA; Azotobacter vinelandii DJ (Av), Alpha-, Beta- and Gamma-Proteobacteria; Frankia sp. CcI3 (Fs), Frankia; Mastigocladus laminosus UTEX LB 1931 (Ml), Cyanobacteria; Klebsiella pneumoniae 342 (Kp), Alpha-, Beta- and Gammaproteobacteria; Xanthobacter autotrophicus Py2 (Xa), Alpha-, Beta- and Gammaproteobacteria; Rhodobacter sphaeroides 2.4.1 (Rs), Alpha-, Beta- and Gammaproteobacteria; Rhizobium leguminosarium bv. trifolii (Rl), Alpha-, Beta- and Gammaproteobacteria; Polaromonas naphthalenivorans CJ2 (Pn), Alpha-, Beta- and Gammaproteobacteria; Eschericia coli (Ec), genomic-DNA negative control; agricultural soil (AS); lawn soil (LS); No Template Control (NT). The symbols used are: product of the correct size (+), no product produced (−), non-specific amplification producing multiple bands or a single band of the wrong size (ns), a smeared band of indiscriminate size overlapping in size with the expected product (s). Blank cells indicate that the evaluation was not performed.

Annealing Temperature (AT) used in PCR.

DNA samples and their phylogenetic affiliation in the nifH phylogeny from Figure S2 are: Desulfovibrio vulgaris Hildenborough (Dv), cluster III; Geobacter uraniireducens Rf4 (Gu), subcluster IA; Azotobacter vinelandii DJ (Av), Alpha-, Beta- and Gamma-Proteobacteria; Frankia sp. CcI3 (Fs), Frankia; Mastigocladus laminosus UTEX LB 1931 (Ml), Cyanobacteria; Klebsiella pneumoniae 342 (Kp), Alpha-, Beta- and Gammaproteobacteria; Xanthobacter autotrophicus Py2 (Xa), Alpha-, Beta- and Gammaproteobacteria; Rhodobacter sphaeroides 2.4.1 (Rs), Alpha-, Beta- and Gammaproteobacteria; Rhizobium leguminosarium bv. trifolii (Rl), Alpha-, Beta- and Gammaproteobacteria; Polaromonas naphthalenivorans CJ2 (Pn), Alpha-, Beta- and Gammaproteobacteria; Eschericia coli (Ec), genomic-DNA negative control; agricultural soil (AS); lawn soil (LS); No Template Control (NT). The symbols used are: product of the correct size (+), no product produced (−), non-specific amplification producing multiple bands or a single band of the wrong size (ns), a smeared band of indiscriminate size overlapping in size with the expected product (s). Blank cells indicate that the evaluation was not performed. Annealing Temperature (AT) used in PCR.

Discussion

We report a comprehensive evaluation of nifH PCR primers. Our analysis of nifH primers reveals disparities in their sequence coverage. Variation in coverage is especially notable for primers designed to be universal, where 23 out of 51 target fewer than 50% of known nifH sequences and only 15 target more than 90% of sequences (Table 1). There could be several reasons for the disparity in primer coverage and specificity. Adequate primer design requires use of a sequence database representing the entire sequence diversity to be targeted by the primer. The number of sequences available in public databases has grown dramatically in recent years and earlier efforts at primer design were constrained in the past by the limited number and diversity of nifH sequences available. There is also a reasonable tendency to seek minimally degenerate primers due to undesirable effects that high levels of primer degeneracy can have on PCR performance. Decisions to lower degeneracy, however, could come at the cost of adequate coverage of target sequences. Our efforts to evaluate universal nifH primers expand upon previous work to design universal primers for this gene. Marusina et al. designed nifH primers based upon a diverse set of nifH sequences and tested the resulting primers against DNA from cultivated strains [18]. The F2/R6 primer set they designed was one of the best performing in our comparison (Tables 4 and 6, Figures S3 and S4). Fedorov et al. later reexamined some of the primers of Marusina et al. because they found that primer R6 contained mismatches to certain methylobacterial nifH sequences, and they sought to design primers that included this group [19]. The coverage of their new primer, nifH-3r, however, is considerably lower than that of the original R6 primer matching 48% and 96% of nifH sequences respectively (Table 1). Poly et al. also designed a universal primer set, PolF/PolR, and showed that it amplified 19 of 19 test strains and worked well in soils [20]. However, the test strains they used consisted of Alpha-, Beta-, and Gammaproteobacteria, Firmicutes and Actinobacteria and did not include cluster IA, Cyanobacteria, or cluster III sequences. We found that the PolF/PolR primer set only encompassed 25% of nifH diversity in our database (Table 4). By mapping the 51 universal primers to their complementary binding positions along the A. vinelandii nifH gene (Figure 1), it is evident that the majority of the primers correspond to conserved regions of the nifH gene that encode essential functions like the P-loop, Switch I, and Switch II (Figure 1; [22] ). Sequence coverage is high in regions of universal primer binding (Figure 1), and the shape of the coverage profile suggests that primer sequences have not been trimmed from a large number of sequences. If this is indeed the case, then there could be some bias in our results since the sequence fidelity between primer and target can vary as a function of the specificity of PCR conditions. If primer sequences have replaced existing nifH polymorphism in database sequences, then the net result would be a bias towards overestimating primer coverage. This is a common problem in public sequence databases and illustrates the need for depositors to remove primer sequences prior to sequence deposition. Some of the primer sequences we evaluated have unusually low coverage perhaps indicating that the published sequences contain errors, a phenomenon which is not that uncommon as it has been noted in another review of primer sequences [23]. In particular, there appear to be errors in the sequences published for the primers YAA-poly, nifHRb, and röschF-1b [20], [24], [25]. In the case of primer YAA-poly it appears that the first part of the primer name “YAA” was appended to the 5′ end of the primer sequence in [20] because the original YAA primer sequence does not have these nucleotides [26]. The coverage values for the original YAA primer (the one without the 5′ YAA nucleotides) are actually those of the primer nifH3 (Table 2). For primers nifHRb and röschF-1b there appear to be single base pair errors in the primer sequences. If a single base pair mismatch is allowed for these primers it causes coverage to increase substantially (Table 1, Table 2). The primer röschF-1b [25] differs from the primer nifHF-Rösch [24] in that a G rather than a C is present at the 13th nucleotide from the 5′ terminus. In addition, the primer AMR-R, though reported as a nifH primer [27], does not match nifH and thus appears to be erroneous. We evaluate primer coverage in silico but it is important to point out that universal nifH PCR primers have been used under a wide range of reaction conditions and variation in annealing temperatures and cycle parameters will have dramatic impacts on actual primer performance. Lowering of PCR annealing temperature, for example, lowers reaction specificity and may permit amplification of templates with mismatches in the primer binding region. Notably, for many primer sets either a nested, touchdown, or stepdown PCR approach was needed to achieve amplification of nifH genes from environmental samples (e.g. [28], [29] ). In Tables 1–3 we indicate primer coverage with up to two mismatches to provide an indication of the potential effects that reducing reaction stringency may have on primer performance. In addition, there are several other factors which could impact the specificity and coverage realized using PCR primers at the bench relative to predictions made using sequence databases. These factors include primer dimerization [30], hairpin formation [31], GC content [32], the location of mismatches [33], and the thermodynamics of primer binding to template [34]. For example, mismatches at the 3′ end of a primer may have a greater impact on specificity than those at the 5′ end [33] and some methods of primer design exploit this tendency in order to increase primer coverage [35]. Thus, the real test of primer performance comes at the bench. We performed empirical assessment of coverage for primers which we found targeted 90% or more of sequences in the nifH database. The primer combinations F2/R6, IGK3/DVV, and Ueda 19F/388R performed well with DNA from a diversity of phylogenetic groups and from soil, with IGK3/DVV performing best of all. In contrast, the primer sets Ueda19f/univ463r and nifH1/nifH2 (ie: the Zehr-McReynolds primers) had mediocre performance with soils, producing smeared bands indicative of non-specific amplification, and producing a PCR product from negative controls (Table 6, Figures S13 and S14). All other primer combinations tested had drawbacks such as poor or no soil amplification and amplification of negative controls (Table 6, Figures S9, S10, S11, S12 and S15). There are several limitations to our approach which must be considered. First, only a few full-length nifH sequences are currently available and this lowers the sequence diversity represented along the termini of the nifH gene (Figure 1). Hence, evaluation of primers that bind near the beginning or end of the alignment must be interpreted with care, especially for phylogenetic groups that are underrepresented in sequence databases. Likewise, nifH diversity remains poorly characterized in some and thus estimates primer performance in specific environments must also be interpreted with care when the number of sequences from those environments are small. We refer the reader to the supplementary material (Dataset S1) which provides the number of sequences currently available for each phylogenetic group and for each environment queried. As the number of sequenced genomes increases, full length nifH sequences from more diverse nitrogen fixers will become available aiding future efforts at primer design and analysis. Secondly, we have made no effort to assess coverage for nested and semi-nested reactions, which are common approaches. Nested amplification strategies, when coupled with low stringency reaction conditions, can allow investigators to amplify a wider diversity of templates than would be predicted through in silico analysis. Logically, however, in silico results from nested designs would always produce a reduction in coverage relative to a single primer set design. Some of the universal nifH primers amplify paralogous genes not involved in nitrogen-fixation, for example cluster IV genes (Table 1 and Table 2). The nifH gene shares conserved regions with genes of cluster IV and cluster V which is involved in bacteriochlorophyll synthesis [13], [22]. We find that a substantial number of nifH universal primers will amplify cluster IV sequences (Table 1 and Table 2). It would therefore be wise for researchers interested in assessing the diversity and phylogeny of nitrogen-fixation genes from the environment to screen their sequences for the presence of cluster IV and cluster V genes prior to OTU clustering. Our work outlines a comprehensive approach to primer evaluation. Molecular-based studies are dependent on the effectiveness of the primer sets used to generate the sequence data which serves as our window to the microbial world. These results show that many supposedly universal primer sets miss significant portions of known nifH diversity. Several of the primers that performed well in silico were tested empirically against genomic DNA from a phylogenetically diverse set of strains. The primers that performed well both in silico and empirically should have the greatest utility in further studies of the nifH gene diversity in environmental samples.

Materials and Methods

Primer coverage analyses were performed using an updated version of our previously described nifH database [9]. The current version of the database contains 23,847 sequences, representing all nifH sequences available in Genbank as of July 14, 2010. The database was constructed using the ARB software package [36] as described in [9]. Alignment positions are numbered relative to the Azotobacter vinelandii gene sequence (Genbank ACCN# M20568). The environmental origins of sequences (Tables 1–5) were determined by keyword searches of the sequence records in the nifH database using ARB as described in [9]. The phylogenetic trees and sequence configurations for the environmental groups may be examined as part of the ARB nifH database used for this work which is available at http://www.css.cornell.edu/faculty/buckley/nifH_database_2010_07_14. arb. The phylogenetic groups evaluated (Table 1–5) are labeled on the phylogenetic tree of Figure S2 which corresponds to the tree in the ARB database. We visualized the nucleotide representation of nifH sequence fragments within our nifH database relative to the A. vinelandii nifH sequence (Figure 1) by first exporting in FASTA format all nifH sequences from the ARB database using the A. vinlandii nifH sequence as a filter so that only positions in the alignment where A. vinelandii nifH had a nucleotide were exported. The FASTA file was then opened in BioEdit [37] where we could calculate a positional nucleotide numerical summary, and the total number of sequences containing sequence information was then plotted for each position in the alignment (Figure 1). Primer coverage calculations were performed using the EMBOSS programs fuzznuc, dreg, and primersearch [38] to analyze sequence alignment data exported in FASTA format from our nifH database. The program fuzznuc calculates the number of sequences in a given alignment hit by a given primer. Mismatches, or fuzzy searches, are allowed by the program and were performed with the nifH evaluations (Table 1–3). The program primersearch was used for the evaluation of primer pairs (Tables 4 and 5). The program dreg was used to determine the number of records in an alignment that contained sequence data in the alignment region targeted by each primer or primer pair (Tables 1–5). However, because dreg eliminates the gap characters from the FASTA alignment file from ARB, the flanking gap characters were converted to the IUPAC character S, which is preserved by dreg, and the intervening gap characters were subsequently converted to the IUPAC character N. This allowed the original column positions from the ARB alignment to be maintained and reported as output from dreg. To calculate primer and primer pair coverage, the number of hits obtained from fuzznuc or primersearch were divided by the total number of sequences with nucleotide representation in the target region(s) as indicated by dreg. Unix bash shell scripts were employed to increase the throughput of the in silico primer evaluations by automating the input of multiple primer sequences and other evaluation parameters into the EMBOSS programs. The scripts were also used to parse the output files and organize the data into tables. These scripts, which would be useful for similar evaluations using databases for other functional genes, are available as supplementary material online (Text S1, S2, S3). Primer annealing temperatures were calculated with SciTools Oligoanalyzer version 3.1 which calculates oligonucleotide melting temperatures based on nearest neighbor thermodynamics [39]. Oligoanalyzer can account for Inosine but not for P or K bases and thus melting temperatures were not calculated for PicenoF44 and PicenoR436 (Table 1). The parameters used for the calculations were 0.25 µM oligonucleotides, 50 mM Na+, 1.5 mM Mg++, and 0 mM dNTPs. Genomic DNA was extracted from cultures of the bacterial strains listed in Table 6 according to a standard enzymatic, phenol-chloroform extraction protocol [40]. DNA concentration was determined with a Nanodrop model 1000 (Thermo Fischer Scientific, Wilmington, DE), and DNA was diluted to 1 ng µl−1 prior to PCR. Soil DNA was obtained from a long-term agricultural site at the William H. Miner institute, Chazy, NY described previously [41]. The agricultural soil sample comes from a tilled site used to grow corn for more than 30 years while the lawn soil sample is from a non-cultivated control site that is adjacent to the agricultural site and contains a mixed community of perennial grasses (Table 6). Soil samples were obtained by coring at 0–5 cm depth. Soil samples were sieved to 2 mm, frozen in the field using liquid nitrogen, and stored at −80°C. DNA was extracted from soils using the PowerSoil DNA Isolation Kit (MoBio, Carlsbad, CA). Primers were synthesized and desalted by Integrated DNA technologies. All PCR reaction volumes were 50 µL with the following final reagent concentrations: 1X PCR Gold Buffer (ABI, Foster City, CA), 2.5 mM MgCl2 solution (ABI, Foster City, CA), 0.05% BSA (NEB, Ipswich, MA), 0.2 mM dNTPs, 1 µM each primer, 2.5 U Amplitaq Gold DNA polymerase (ABI, Foster City, CA). As template, 1 ng of genomic DNA was added, or 1 µl of soil DNA extract. To visualize the PCR products, 10 µL of the reactions were loaded onto a 50 ml, 1% agarose gel with 1 µL of SYBR Safe dye (Molecular Probes, Eugene, OR). 5 µl of Hyperladder I (Bioline, Taunton, MA) was loaded onto each gel as a molecular weight marker. Gels ran for 45 minutes at 100 volts and 500 miliamps and were then visualized and photographed. Photos of the electrophoresis gels are available as supplementary material online (Figures S3, S4, S5, S6, S7, S8, S9, S10, S11, S12, S13, S14, S15). Universal primer map. Universal nifH primers (grey lines with names) are mapped onto the sequence of Azotobacter vinelandii (Genbank ACCN# M20568). (EPS) Click here for additional data file. Phylogenetic tree of sequences in the database. The principle groups from Tables 1–5 are labeled. (EPS) Click here for additional data file. F2/R6 primer pair at 51°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. F2/R6 primer pair at 51°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. IGK3/DVV primer pair at 58°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. IGK3/DVV primer pair at 58°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. Ueda19F/388R primer pair at 51°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. Ueda19F/388R primer pair at 51°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. nifH2/R6 primer pair at 44°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. nifH2/R6 primer pair at 44°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. nH21f/nifH1 primer pair at 46°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. nifH1/nifH2 primer pair at 46°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. Ueda19f/univ463r primer pair at 46°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. Ueda19f/univ463r primer pair at 46°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. nifH3/nH21f primer pair at 41°C annealing temperature. Gel image of PCR products generated using the primers indicated with a range of different DNA templates. Results are summarized and full strain names are reported in Table 6. The gel images have been inverted from black to white. (TIF) Click here for additional data file. (XLS) Click here for additional data file. (TXT) Click here for additional data file. (TXT) Click here for additional data file. (TXT) Click here for additional data file.
  50 in total

1.  New molecular screening tools for analysis of free-living diazotrophs in soil.

Authors:  Helmut Bürgmann; Franco Widmer; William Von Sigler; Josef Zeyer
Journal:  Appl Environ Microbiol       Date:  2004-01       Impact factor: 4.792

Review 2.  Nitrogenase gene diversity and microbial community structure: a cross-system comparison.

Authors:  Jonathan P Zehr; Bethany D Jenkins; Steven M Short; Grieg F Steward
Journal:  Environ Microbiol       Date:  2003-07       Impact factor: 5.491

3.  Microbial community shifts influence patterns in tropical forest nitrogen fixation.

Authors:  Sasha C Reed; Alan R Townsend; Cory C Cleveland; Diana R Nemergut
Journal:  Oecologia       Date:  2010-05-09       Impact factor: 3.225

4.  Impacts of warming and fertilization on nitrogen-fixing microbial communities in the Canadian High Arctic.

Authors:  Julie R Deslippe; Keith N Egger; Greg H R Henry
Journal:  FEMS Microbiol Ecol       Date:  2005-01-12       Impact factor: 4.194

5.  Quantification of the detrimental effect of a single primer-template mismatch by real-time PCR using the 16S rRNA gene as an example.

Authors:  D Bru; F Martin-Laurent; L Philippot
Journal:  Appl Environ Microbiol       Date:  2008-01-11       Impact factor: 4.792

6.  Bias in template-to-product ratios in multitemplate PCR.

Authors:  M F Polz; C M Cavanaugh
Journal:  Appl Environ Microbiol       Date:  1998-10       Impact factor: 4.792

7.  Classification of rhizobia based on nodC and nifH gene analysis reveals a close phylogenetic relationship among Phaseolus vulgaris symbionts.

Authors:  Gisèle Laguerre; Sarah M Nour; Valérie Macheret; Juan Sanjuan; Pascal Drouin; Noëlle Amarger
Journal:  Microbiology       Date:  2001-04       Impact factor: 2.777

8.  Phylogenetic diversity of nitrogenase (nifH) genes in deep-sea and hydrothermal vent environments of the Juan de Fuca Ridge.

Authors:  Mausmi P Mehta; David A Butterfield; John A Baross
Journal:  Appl Environ Microbiol       Date:  2003-02       Impact factor: 4.792

9.  Diazotrophic bacterioplankton in a coral reef lagoon: phylogeny, diel nitrogenase expression and response to phosphate enrichment.

Authors:  Ian Hewson; Pia H Moisander; Amanda E Morrison; Jonathan P Zehr
Journal:  ISME J       Date:  2007-05       Impact factor: 10.302

10.  Variation in Frankia populations of the Elaeagnus host infection group in nodules of six host plant species after inoculation with soil.

Authors:  Babur S Mirza; Allana Welsh; Ghulam Rasul; Julie P Rieder; Mark W Paschke; Dittmar Hahn
Journal:  Microb Ecol       Date:  2009-03-31       Impact factor: 4.552

View more
  87 in total

1.  Efficient Nitrogen-Fixing Bacteria Isolated from Soybean Nodules in the Semi-arid Region of Northeast Brazil are Classified as Bradyrhizobium brasilense (Symbiovar Sojae).

Authors:  Elaine Martins da Costa; Paula R Almeida Ribeiro; Teotonio Soares de Carvalho; Rayssa Pereira Vicentin; Eduardo Balsanelli; Emanuel Maltempi de Souza; Liesbeth Lebbe; Anne Willems; Fatima M de Souza Moreira
Journal:  Curr Microbiol       Date:  2020-04-22       Impact factor: 2.188

2.  Diazotrophic bacterial community variability in a subtropical deep reservoir is correlated with seasonal changes in nitrogen.

Authors:  Lina Wang; Zheng Yu; Jun Yang; Jing Zhou
Journal:  Environ Sci Pollut Res Int       Date:  2015-08-18       Impact factor: 4.223

3.  In Planta Sporulation of Frankia spp. as a Determinant of Alder-Symbiont Interactions.

Authors:  G Schwob; M Roy; A C Pozzi; A Herrera-Belaroussi; M P Fernandez
Journal:  Appl Environ Microbiol       Date:  2018-11-15       Impact factor: 4.792

4.  The Overproduction of Indole-3-Acetic Acid (IAA) in Endophytes Upregulates Nitrogen Fixation in Both Bacterial Cultures and Inoculated Rice Plants.

Authors:  Roberto Defez; Anna Andreozzi; Carmen Bianco
Journal:  Microb Ecol       Date:  2017-02-14       Impact factor: 4.552

5.  Impact of Peat Mining and Restoration on Methane Turnover Potential and Methane-Cycling Microorganisms in a Northern Bog.

Authors:  Max Reumer; Monika Harnisz; Hyo Jung Lee; Andreas Reim; Oliver Grunert; Anuliina Putkinen; Hannu Fritze; Paul L E Bodelier; Adrian Ho
Journal:  Appl Environ Microbiol       Date:  2018-01-17       Impact factor: 4.792

6.  Abundance and diversity of diazotrophs in the surface sediments of Kongsfjorden, an Arctic fjord.

Authors:  T Jabir; P V Vipindas; K P Krishnan; A A Mohamed Hatha
Journal:  World J Microbiol Biotechnol       Date:  2021-02-05       Impact factor: 3.312

7.  The Use of Degenerate Primers in qPCR Analysis of Functional Genes Can Cause Dramatic Quantification Bias as Revealed by Investigation of nifH Primer Performance.

Authors:  John Christian Gaby; Daniel H Buckley
Journal:  Microb Ecol       Date:  2017-04-07       Impact factor: 4.552

8.  Elevated level of arsenic negatively influences nifH gene expression of isolated soil bacteria in culture condition as well as soil system.

Authors:  Arindam Chakraborty; Atif Aziz Chowdhury; Kiron Bhakat; Ekramul Islam
Journal:  Environ Geochem Health       Date:  2019-02-14       Impact factor: 4.609

9.  The Diversity and Co-occurrence Patterns of N₂-Fixing Communities in a CO₂-Enriched Grassland Ecosystem.

Authors:  Qichao Tu; Xishu Zhou; Zhili He; Kai Xue; Liyou Wu; Peter Reich; Sarah Hobbie; Jizhong Zhou
Journal:  Microb Ecol       Date:  2015-08-18       Impact factor: 4.552

10.  Active nitrogen-fixing heterotrophic bacteria at and below the chemocline of the central Baltic Sea.

Authors:  Hanna Farnelid; Mikkel Bentzon-Tilia; Anders F Andersson; Stefan Bertilsson; Günter Jost; Matthias Labrenz; Klaus Jürgens; Lasse Riemann
Journal:  ISME J       Date:  2013-02-28       Impact factor: 10.302

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.