| Literature DB >> 35722279 |
Sada Egenriether1, Robert Sanford2, Wendy H Yang1,2,3, Angela D Kent1,4.
Abstract
Background: Assessments of the soil microbiome provide valuable insight to ecosystem function due to the integral role microorganisms play in biogeochemical cycling of carbon and nutrients. For example, treatment effects on nitrogen cycling functional groups are often presented alongside one another to demonstrate how agricultural management practices affect various nitrogen cycling processes. However, the functional groups commonly evaluated in nitrogen cycling microbiome studies range from phylogenetically narrow (e.g., N-fixation, nitrification) to broad [e.g., denitrification, dissimilatory nitrate reduction to ammonium (DNRA)]. The bioinformatics methods used in such studies were developed for 16S rRNA gene sequence data, and how these tools perform across functional genes of different phylogenetic diversity has not been established. For example, an OTU clustering method that can accurately characterize sequences harboring comparatively little diversity may not accurately resolve the diversity within a gene comprised of a large number of clades. This study uses two nitrogen cycling genes, nifH, a gene which segregates into only three distinct clades, and nrfA, a gene which is comprised of at least eighteen clades, to investigate differences which may arise when using heuristic OTU clustering (abundance-based greedy clustering, AGC) vs. true hierarchical OTU clustering (Matthews Correlation Coefficient optimizing algorithm, Opti-MCC). Detection of treatment differences for each gene were evaluated to demonstrate how conclusions drawn from a given dataset may differ depending on clustering method used.Entities:
Keywords: OTU clustering; bioinformatics; dissimilatory nitrate reduction to ammonium; microbial ecology; microbiome; mother; nitrogen cycling; nitrogen fixation
Year: 2022 PMID: 35722279 PMCID: PMC9201982 DOI: 10.3389/fmicb.2022.730340
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
FIGURE 1Workflow used for each analysis.
nifH OTU table and rarefaction characteristics for each clustering method.
| Method | OTUs pre-rarefaction | Singleton OTUs | Percent singletons | Avg OTUs post-rarefaction | OTUs lost post-rarefaction | Percent lost |
| AGC-0.03 | 4,242 | 2,768 | 65% | 444 | 3,798 | 90% |
| MCC-0.03 | 5,810 | 3,928 | 68% | 583 | 5,227 | 90% |
| AGC-0.02 | 9,768 | 7,033 | 72% | 687 | 9,081 | 93% |
nifH OTU table statistics for each agricultural treatment.
| Method | Treatment | Avg rarefied coverage | Unrarefied skewness | Avg unrarefied distance to median |
| AGC-0.03 | T1 | 96.7% | 37.9 | 0.654 |
| T2 | 95.8% | 29.4 | 0.653 | |
| T3 | 97.1% | 42.9 | 0.653 | |
| T4 | 90.7% | 28.2 | 0.654 | |
|
| ||||
| MCC-0.03 | T1 | 95.2% | 42.5 | 0.655 |
| T2 | 94.9% | 32.1 | 0.652 | |
| T3 | 95.7% | 45.5 | 0.654 | |
| T4 | 87.7% | 31.8 | 0.643 | |
FIGURE 2Chao1 and Shannon alpha diversity measures for nifH for AGC-0.03 (A) and MCC-0.03 (B) methods. Letters indicate significance at α < 0.05 via ANOVA with Tukey’s HSD correction for multiple comparisons.
nifH community analysis results for each clustering method across ten independent rarefaction trials.
| Treatment comparison | Method | Avg sig ANOSIM rarefied R | Sig PERMANOVA of 10 rarefactions for all OTUs | Sig PERMANOVA of 10 rarefactions for |
| T1 vs. T2 | AGC-0.03 | 0.282 | 10 | 10 |
| MCC-0.03 | 0.360 | 10 | 10 | |
| AGC-0.02 | 0.309 | 10 | 10 | |
|
| ||||
| T1 vs. T3 | AGC-0.03 | n.s. | 10 | 10 |
| MCC-0.03 | n.s. | 10 | 10 | |
| AGC-0.02 | n.s. | 10 | 10 | |
|
| ||||
| T1 vs. T4 | AGC-0.03 | 0.640 | 0 | 0 |
| MCC-0.03 | 0.671 | 0 | 0 | |
| AGC-0.02 | 0.624 | 0 | 0 | |
|
| ||||
| T2 vs. T3 | AGC-0.03 | 0.297 | 0 | 0 |
| MCC-0.03 | 0.287 | 0 | 0 | |
| AGC-0.02 | 0.315 | 0 | 0 | |
|
| ||||
| T2 vs. T4 | AGC-0.03 | 0.717 | 10 | 10 |
| MCC-0.03 | 0.741 | 10 | 10 | |
| AGC-0.02 | 0.775 | 10 | 10 | |
|
| ||||
| T3 vs. T4 | AGC-0.03 | 0.797 | 2 | 7 |
| MCC-0.03 | 0.876 | 4 | 9 | |
| AGC-0.02 | 0.839 | 2 | 6 | |
Significance assessed at α = 0.05. n.s. in various spots means “not significant.”
FIGURE 3Differentially abundant nifH taxa among the top 100 OTUs for the AGC-0.03 method (A) and the MCC-0.03 method (B). Significance of differential abundance assessed at α = 0.01.
nrfA OTU table and rarefaction characteristics for each clustering method.
| Method | OTUs pre-rarefaction | Singleton OTUs | Percent singletons | Avg OTUs post-rarefaction | OTUs lost post-rarefaction | Percent lost |
| AGC-0.03 | 12,144 | 5,280 | 43% | 7,791 | 4,353 | 36% |
| MCC-0.03 | 16,497 | 8,320 | 50% | 9,912 | 6,585 | 40% |
| AGC-0.02 | 44,495 | 24,323 | 55% | 25,367 | 19,128 | 43% |
nrfA OTU table statistics for each agricultural treatment.
| Method | Treatment | Avg rarefied coverage | Unrarefied skewness | Avg unrarefied distance to median |
| AGC-0.03 | T1 | 95.2% | 59.0 | 0.645 |
| T2 | 93.4% | 41.5 | 0.644 | |
| T3 | 96.2% | 61.5 | 0.648 | |
| T4 | 95.3% | 40.4 | 0.648 | |
|
| ||||
| MCC-0.03 | T1 | 93.7% | 60.8 | 0.646 |
| T2 | 92.3% | 24.6 | 0.649 | |
| T3 | 94.6% | 64.6 | 0.651 | |
| T4 | 93.8% | 42.5 | 0.652 | |
FIGURE 4Chao1 and Shannon alpha diversity measures for nrfA for AGC-0.03 (A) and MCC-0.03 (B) methods. Letters indicate significance at α < 0.05 via ANOVA with Tukey’s HSD correction for multiple comparisons.
nrfA community analysis results for each clustering method across 10 independent rarefaction trials.
| Treatment comparison | Method | Avg sig ANOSIM rarefied R | Sig PERMANOVA of 10 rarefactions for all OTUs | Sig PERMANOVA of 10 rarefactions for |
| T1 vs. T2 | AGC-0.03 | 0.590 | 4 | 10 |
| MCC-0.03 | 0.583 | 10 | 0 | |
| AGC-0.02 | 0.492 | 7 | 0 | |
|
| ||||
| T1 vs. T3 | AGC-0.03 | 0.590 | 2 | 9 |
| MCC-0.03 | 0.584 | 10 | 7 | |
| AGC-0.02 | 0.559 | 1 | 0 | |
|
| ||||
| T1 vs. T4 | AGC-0.03 | 0.714 | 0 | 0 |
| MCC-0.03 | 0.731 | 0 | 0 | |
| AGC-0.02 | 0.618 | 0 | 0 | |
|
| ||||
| T2 vs. T3 | AGC-0.03 | 0.729 | 4 | 1 |
| MCC-0.03 | 0.715 | 10 | 0 | |
| AGC-0.02 | 0.624 | 9 | 0 | |
|
| ||||
| T2 vs. T4 | AGC-0.03 | 0.759 | 10 | 10 |
| MCC-0.03 | 0.724 | 10 | 0 | |
| AGC-0.02 | 0.685 | 10 | 0 | |
|
| ||||
| T3 vs. T4 | AGC-0.03 | 0.738 | 0 | 0 |
| MCC-0.03 | 0.670 | 0 | 0 | |
| AGC-0.02 | 0.607 | 0 | 0 | |
Significance assessed at α = 0.05.
FIGURE 5Differentially abundant nrfA taxa among the top 100 OTUs for the AGC-0.03 method classified using the FunGene database (A), the MCC-0.03 method using the FunGene database (B), the AGC-0.03 method using the clade-based database (C), and MCC-0.03 using the clade-based database (D). Significance of differential abundance assessed at α = 0.01.