| Literature DB >> 32132159 |
Kasia Piwosz1,2, Tanja Shabarova3, Jakob Pernthaler4, Thomas Posch4, Karel Šimek3,5, Petr Porcal3,5, Michaela M Salcher3,4.
Abstract
High-throughput sequencing (HTS) of gene amplicons is a preferred method of assessing microbial community composition, because it rapidly provides information from a large number of samples at high taxonomic resolution and low costs. However, mock community studies show that HTS data poorly reflect the actual relative abundances of individual phylotypes, casting doubt on the reliability of subsequent statistical analysis and data interpretation. We investigated how accurately HTS data reflect the variability of bacterial and eukaryotic community composition and their relationship with environmental factors in natural samples. For this, we compared results of HTS from three independent aquatic time series (n = 883) with those from an established, quantitative microscopic method (catalyzed reporter deposition-fluorescence in situ hybridization [CARD-FISH]). Relative abundances obtained by CARD-FISH and HTS disagreed for most bacterial and eukaryotic phylotypes. Nevertheless, the two methods identified the same environmental drivers to shape bacterial and eukaryotic communities. Our results show that amplicon data do provide reliable information for their ecological interpretations. Yet, when studying specific phylogenetic groups, it is advisable to combine HTS with quantification using microscopy and/or the addition of internal standards.IMPORTANCE High-throughput sequencing (HTS) of amplified fragments of rRNA genes provides unprecedented insight into the diversity of prokaryotic and eukaryotic microorganisms. Unfortunately, HTS data are prone to quantitative biases, which may lead to an erroneous picture of microbial community composition and thwart efforts to advance its understanding. These concerns motivated us to investigate how accurately HTS data characterize the variability of microbial communities, the relative abundances of specific phylotypes, and their relationships with environmental factors in comparison to an established microscopy-based method. We compared results obtained by HTS and catalyzed reporter deposition-fluorescence in situ hybridization (CARD-FISH) from three independent aquatic time series for both prokaryotic and eukaryotic microorganisms (almost 900 data points, the largest obtained with both methods so far). HTS and CARD-FISH data disagree with regard to relative abundances of bacterial and eukaryotic phylotypes but identify similar environmental drivers shaping bacterial and eukaryotic communities.Entities:
Keywords: CARD-FISH; amplicon sequencing; bacterial communities; bacterial community structure; bacterial dynamics; eukaryotic communities; eukaryotic community structure; eukaryotic dynamics; microbial abundance; microbial communities; microbial community structure; microbial dynamics
Mesh:
Substances:
Year: 2020 PMID: 32132159 PMCID: PMC7056804 DOI: 10.1128/mSphere.00052-20
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1(A) Scatterplot of relative abundances of studied eukaryotic groups by 454 sequencing libraries (HTS) and CARD-FISH. (B) Scatterplot of differences between relative abundances of studied eukaryotic groups estimated by CARD-FISH and HTS against the average of the two values. (C) Scatterplot of ranked relative abundances of studied eukaryotic groups by HTS and CARD-FISH. (D) Scatterplot of differences between ranked relative abundances of studied eukaryotic groups estimated by CARD-FISH and HTS against the average of the two values. Black lines in panels A and C show a 1:1 relationship. Solid black lines in panels B and D show the average difference for the whole data set, solid gray lines show 1 standard deviation, and dashed gray lines show 2 standard deviations. Different eukaryotic groups are color coded. Individual plots for panels A and C are shown in Fig. S1 and S2 in the supplemental material, respectively.
Statistics for regressions and Spearman correlations between relative contributions to HTS or CARD-FISH data
| Group | Regression | Spearman correlation | |||||
|---|---|---|---|---|---|---|---|
| Adjusted | Slope | Rho | |||||
| Eukaryotes—abundance | |||||||
| Chlorophyta | −0.01 | 0.13 | 0.4420 | <0.0001 | 0.10 | 0.5881 | 31 |
| Pedinellales | 0.04 | 0.31 | 0.1608 | 0.0029 | 0.36 | 0.0580 | 28 |
| Cryptophyceae | −0.02 | −0.32 | 0.5409 | 0.0167 | 0.02 | 0.8991 | 30 |
| CRY1 cryptophytes | 0.04 | 0.46 | 0.1869 | 0.1295 | 0.47 | 0.0320 | 21 |
| | 0.62 | 0.62 | 0.0042 | 0.0411 | 0.92 | 0.0005 | 10 |
| | 0.70 | 0.69 | 0.0015 | 0.0661 | 0.92 | 0.0005 | 10 |
| | −0.04 | −0.13 | 0.7570 | <0.0001 | −0.42 | 0.2696 | 9 |
| Eukaryotes—biomass | |||||||
| Chlorophyta | −0.02 | 0.12 | 0.4980 | <0.0001 | 0.16 | 0.3954 | 31 |
| Pedinellales | 0.11 | 0.42 | 0.0468 | 0.0084 | 0.45 | 0.0181 | 28 |
| Cryptophyceae | 0.23 | 0.95 | 0.0043 | 0.8612 | 0.54 | 0.0024 | 30 |
| CRY1 cryptophytes | 0.08 | 0.46 | 0.1138 | 0.0644 | 0.49 | 0.0258 | 21 |
| | 0.53 | 0.58 | 0.0101 | 0.0419 | 0.81 | 0.0082 | 10 |
| | 0.53 | 0.62 | 0.0106 | 0.0774 | 0.75 | 0.0184 | 10 |
| | −0.14 | −0.01 | 0.9410 | 0.0005 | −0.23 | 0.5517 | 9 |
| Bacteria—Jiřická Pond | |||||||
| | −0.01 | −0.38 | 0.4043 | 0.0053 | −0.27 | 0.2094 | 24 |
| | 0.48 | 0.47 | <0.0001 | <0.0001 | 0.72 | 0.0001 | 24 |
| “ | 0.90 | 0.74 | <0.0001 | <0.0001 | 0.93 | <0.0001 | 24 |
| Luna-2 cluster, | 0.17 | 0.51 | 0.0279 | 0.0354 | 0.39 | 0.0681 | 23 |
| “ | 0.71 | 0.73 | <0.0001 | 0.0146 | 0.84 | <0.0001 | 23 |
| “ | 0.20 | 0.83 | 0.0374 | 0.6432 | 0.81 | <0.0001 | 18 |
| | 0.23 | 0.52 | 0.0100 | 0.0149 | 0.45 | 0.0277 | 24 |
| | 0.35 | 0.59 | 0.0015 | 0.0187 | 0.56 | 0.0055 | 24 |
| Uncult. lineage GKS98 | 0.46 | 0.71 | 0.0002 | 0.0758 | 0.71 | 0.0002 | 24 |
| | 0.59 | 0.73 | <0.0001 | 0.0408 | 0.76 | <0.0001 | 24 |
| | −0.05 | −0.08 | 0.8340 | 0.0078 | 0.59 | 0.0036 | 22 |
| | 0.03 | −0.19 | 0.2100 | <0.0001 | −0.39 | 0.0808 | 21 |
| All | 0.35 | 0.66 | 0.0013 | 0.0684 | 0.60 | 0.0020 | 24 |
| | 0.19 | 0.64 | 0.0190 | 0.1770 | 0.31 | 0.1433 | 24 |
| | 0.70 | 1.04 | <0.0001 | 0.7979 | 0.89 | <0.0001 | 24 |
| | 0.76 | 0.72 | <0.0001 | 0.0034 | 0.57 | 0.0046 | 24 |
| “ | −0.03 | 0.15 | 0.5310 | 0.0020 | −0.15 | 0.4739 | 24 |
| | 0.65 | 0.98 | <0.0001 | 0.8775 | 0.81 | <0.0001 | 24 |
| | 0.04 | 0.69 | 0.1795 | 0.5489 | 0.01 | 0.9597 | 24 |
| All | 0.67 | 1.28 | <0.0001 | 0.1508 | 0.70 | 0.0002 | 24 |
| Bacteria—Lake Zurich | |||||||
| | 0.67 | 0.82 | <0.0001 | 0.1430 | 0.75 | <0.0001 | 24 |
| “ | −0.02 | −0.06 | 0.44 | <0.0001 | −0.17 | 0.4241 | 24 |
| “ | −0.03 | 0.12 | 0.56 | 0.0002 | 0.02 | 0.9164 | 24 |
| | 0.17 | 0.73 | 0.025 | 0.3874 | 0.52 | 0.0100 | 24 |
Regression statistics include adjusted r2, slope value, and significance level (P [r]) and Spearman correlation statistics include rho and significance level (P [S]) between relative contributions (percentages) to HTS or CARD-FISH data. A P value slope of 1 indicates a significance level against the desired value of 1, while a P value slope of >0.05 indicates that the slope is not significantly different from 1. n, number of data points for each group. Uncult., uncultured.
FIG 2(A) Scatterplot of relative abundances of studied bacterial groups (pooled data sets from both lakes) by 454 sequencing libraries (HTS) and CARD-FISH. (B) Scatterplot of differences between relative abundances of studied bacterial groups estimated by CARD-FISH and HTS against the average of the two values. (C) Scatterplot of ranked relative abundances of studied bacterial groups by HTS and CARD-FISH. (D) Scatterplot of differences between ranked relative abundances of studied bacterial groups estimated by CARD-FISH and HTS against the average of the two values. Black lines in panels A and C show a 1:1 relationship. Solid black lines in panels B and D show average differences for the whole data set, solid gray lines show 1 standard deviation, and dashed gray lines show 2 standard deviations. Different bacterial groups are color coded, and lakes of sample collection are indicated by shape. Individual plots for panels A and C are shown in Fig. S6 and S7, respectively.
FIG 3Scatterplots of relative abundances of the same bacterial groups by 454 sequencing libraries (HTS) and CARD-FISH in Jiřická Pond and Lake Zurich. HTS data for each lake were generated with a different primer set (Table S1). Black lines show 1:1 relationship.
DistML models for the eukaryotic data set calculated from HTS and CARD-FISH data (relative abundance and biovolume)
| Sample(s) | Variable | DistML model for data calculated from: | ||
|---|---|---|---|---|
| HTS: relative abundance | CARD-FISH | |||
| Relative abundance | Relative biovolume | |||
| All samples | SRP | 0.0276 (13.5) | 0.0165 (12.8) | No significant model |
| No outlier sample | SRP | No significant model | 0.0015 (14.2) | No significant model |
| Temp | 0.0072 (14.5) | |||
SRP, soluble reactive phosphorus.
DistML models for bacterial data sets calculated from HTS and CARD-FISH data
| Sampling site | Variable | DistML model for data calculated from: | |
|---|---|---|---|
| HTS: relative abundance | CARD-FISH: relative abundance [ | ||
| Lake Zurich | Temp | 0.0001 (42.9) | 0.0097 (18.9) |
| VLP | 0.0092 (14.1) | 0.05 (11.3) | |
| Jiřická Pond | WRT 0.5m | 0.0001 (29.4) | 0.0004 (10.8) |
| DOC | 0.0001 (18.9) | 0.0001 (37.3) | |
| TP | 0.0002 (14.9) | 0.0009 (15.9) | |
| DN | 0.0061 (6.4) | ||
| Chl-a 0.5m | 0.0231 (5.1) | ||
VLP, abundance of virus-like particles; WRT 0.5m, water residence time at 0.5-m depth; DOC, dissolved organic carbon; TP, total phosphorus; DN, dissolved nitrogen; Chl-a 0.5m, chlorophyll a at 0.5-m depth.