| Literature DB >> 35630359 |
Jiahui Xu1, Jianlin Han1, Hua Su1, Changyu Zhu1,2, Zijing Quan1, Lei Wu1, Zhenzhen Yi1.
Abstract
Protists are a dominant group in marine microplankton communities and play important roles in energy flux and nutrient cycling in marine ecosystems. Environmental sequences produced by high-throughput sequencing (HTS) methods are increasingly used for inferring the diversity and distribution patterns of protists. However, studies testing whether methods disentangling biological variants affect the diversity and distribution patterns of protists using field samples are insufficient. Oligotrich (s.l.) ciliates are one group of the abundant and dominant planktonic protists in coastal waters and open oceans. Using oligotrich (s.l.) ciliates in field samples as an example, the present study indicates that DADA2 performs better than SWARM, UNOISE, UPARSE, and UCLUST for inferring diversity patterns of oligotrich (s.l.) ciliates in the Pearl River Estuary and surrounding regions. UPARSE and UNOISE might underestimate species richness. SWARM might not be suitable for the resolution of alpha diversity owing to its rigorous clustering and sensitivity to sequence variations. UCLUST with 99% clustering threshold overestimates species richness, and the beta diversity pattern inferred by DADA2 is more reasonable than that of the other methods. Additionally, salinity is shown to be one of the key factors responsible for variations in the community distribution of ciliates, but infrequent marine-freshwater transitions occurred during evolutionary terms of this group.Entities:
Keywords: diversity; ecological transition; environmental sequences; protist; salinity
Year: 2022 PMID: 35630359 PMCID: PMC9147439 DOI: 10.3390/microorganisms10050913
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Figure 1The location of sampling regions. Sampling areas and their abbreviations are as follows: Daya Bay (DY1, DY2, DY5), Pearl River Estuary (PRE1–PRE10), Guangzhou (GZ1–GZ5), Shenzhen (SZ1–SZ4), Zhuhai (ZH1–ZH5), and Zhanjiang (ZJ1–ZJ4, ZJ6). The numbers shown in parentheses represent the salinity of the sampling sites (unit: ‰), and salinity values of freshwater, low-brackish, high-brackish, and marine areas are colored in blue, green, orange, and black, respectively. Adapted with permission from [34] published by John Wiley and Sons, 2022.
The number of total and oligotrich (s.l.) OTUs/ASVs in six datasets.
| DADA2–100 | SWARM–100 | UNOISE–100 | UPARSE–97 | UCLUST–97 | UCLUST–99 | |
|---|---|---|---|---|---|---|
| Number of total OTUs/ASVs | 3890 | 6656 | 2915 | 3095 | 7613 | 19,993 |
| Number of oligotrich (s.l.) OTUs/ASVs | 103 | 137 | 89 | 63 | 104 | 248 |
Number and proportion of oligotrich (s.l.) OTUs/ASVs in each sampling area from the six datasets. The proportions in parentheses indicate the ratio of the oligotrich (s.l.) OTU/ASV number in each sampling area to the total number of oligotrich (s.l.) OTUs/ASVs within each dataset.
| Sampling Areas | DADA2–100 | SWARM–100 | UNOISE–100 | UPARSE–97 | UCLUST–97 | UCLUST–99 |
|---|---|---|---|---|---|---|
| DY | 6 (5.83%) | 17 (12.41%) | 14 (15.73%) | 15 (23.81%) | 14 (13.46%) | 17 (6.85%) |
| GZ | 31 (30.10%) | 54 (39.42%) | 49 (55.06%) | 31 (49.21%) | 39 (37.50%) | 88 (35.48%) |
| PRE | 47 (45.63%) | 76 (55.47%) | 58 (65.17%) | 45 (71.43%) | 67 (64.42%) | 137 (55.24%) |
| SZ | 8 (7.77%) | 15 (10.95%) | 12 (13.48%) | 10 (15.87%) | 16 (15.38%) | 30 (12.10%) |
| ZH | 23 (22.33%) | 40 (29.20%) | 40 (44.94%) | 28 (44.44%) | 27 (25.96%) | 51 (20.56%) |
| ZJ | 32 (31.07%) | 63 (45.99%) | 46 (51.69%) | 32 (50.79%) | 45 (43.27%) | 110 (44.35%) |
Figure 2Hierarchical clustering analysis of oligotrich (s.l.) community of 32 samples from six datasets. (a) DADA2–100; (b) SWARM–100; (c) UNOISE–100; (d) UPARSE–97; (e) UCLUST–97; (f) UCLUST–99. Samples from the same sampling area are indicated by the same color except for samples from PRE. Samples named PRE1–4 and PRE5–10 are indicated by different colors.
Figure 3The distributions of oligotrich (s.l.) communities from six datasets ((a). DADA2–100; (b). SWARM–100; (c). UNOISE–100; (d). UPARSE–97; (e). UCLUST–97; (f). UCLUST–99) based on Principal Co-ordinates Analysis (PCoA). Samples from the same sampling area are indicated by the same color except for samples from PRE. Samples named PRE1–4 and PRE5–10 are indicated by different colors. The ellipses represent 95% confidence intervals except for DY (too few samples to calculate an ellipse). P represents global significance among oligotrich (s.l.) communities from different sampling areas based on ADONIS. p values < 0.05 indicate significant differences among community structures.
Figure 4Maximum likelihood (ML) tree based on SSU rDNA of 103 environmental sequences (ASVs in DADA2–100) and 197 sequences of identified species from GenBank (accession numbers in Supplementary Table S1). The tree shows the topology and transition pattern; support values (ML/BI) > 50%/0.50 are labeled. The number of collapsed sequences and ecological characteristics are shown in parentheses. F, freshwater; L, low-brackish; H, high-brackish; M, marine.