| Literature DB >> 34133198 |
Sebastián Díaz1, Juan S Escobar2, Frank W Avila1.
Abstract
The bacterial microbiota of the mosquito influences numerous physiological processes of the host. As low-microbial-biomass ecosystems, mosquito tissues are prone to contamination from the laboratory environment and from reagents commonly used to isolate DNA from tissue samples. In this report, we analyzed nine 16S rRNA data sets, including new data obtained by us, to gain insight into the impact of potential contaminating sequences on the composition, diversity, and structure of the mosquito tissue microbial community. Using a clustering-free approach based on the relative abundance of amplicon sequence variants (ASVs) in tissue samples and negative controls, we identified candidate contaminating sequences that sometimes differed from, but were consistent with, results found using established methodologies. Some putative contaminating sequences belong to bacterial taxa previously identified as contaminants that are commonly found in metagenomic studies but that have also been identified as part of the mosquito core microbiota, with putative physiological relevance for the host. Using different relative abundance cutoffs, we show that contaminating sequences have a significant impact on tissue microbiota diversity and structure analysis. IMPORTANCE The study of tissue-associated microbiota from mosquitoes (primarily from the gut) has grown significantly in the last several years. Mosquito tissue samples represent a challenge for researchers given their low microbial biomass and similar taxonomic composition commonly found in the laboratory environment and in molecular reagents. Using new and published data sets that identified mosquito tissue microbiota from gut and reproductive tract tissues (and their respective negative controls), we developed a simple method to identify contamination microbiota. This approach uses an initial taxonomic identification without operational taxonomic unit (OTU) clustering and evaluates the relative abundance of control sample sequences, allowing the identification and removal of purported contaminating sequences in data sets obtained from low-microbial-biomass samples. While it was exemplified with the analysis of tissue microbiota from mosquitos, it can be extended to other data sets dealing with similar technical artifacts.Entities:
Keywords: bar coding analysis; contamination; gut microbiota; metabarcoding analysis; mosquito; reproductive tract microbiota
Year: 2021 PMID: 34133198 PMCID: PMC8265668 DOI: 10.1128/mSphere.00506-21
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
Information on the examined data sets
| Data set | BioProject accession no. | Host species | Sample source | DNA extraction method | 16S rRNA region | Tissue | No. of samples | No. Negative controls | Normalization | Reference |
|---|---|---|---|---|---|---|---|---|---|---|
| Aedes_Gut | Lab | Phenol-chloroform | V3-V4 | Gut | 23 (pools) | 6 | Yes | This study | ||
| Aedes_URT | Lab | Phenol-chloroform | V3-V4 | URT | 19 (pools) | 6 | Yes | This study | ||
| Aedes_LRT | Lab | Phenol-chloroform | V3-V4 | LRT | 24 (pools) | 6 | Yes | This study | ||
| Aegypti_Gut | Field | DNeasy kit (Qiagen) | V5-V6 | Gut | 26 (individuals) | 5 | No | |||
| Albopictus_Gut | Field | DNeasy kit (Qiagen) | V5-V6 | Gut | 33 (individuals) | 1 | Yes | |||
| Anopheles1_Gut | Field | Salt precipitation | V2 | Gut | 62 (individuals) | 2 | No | |||
| Anopheles2_Gut | Field | DNeasy kit (Qiagen) | V4 | Gut | 52 (individuals) | 1 | No | |||
| Anopheles2_URT | Field | DNeasy kit (Qiagen) | V4 | URT | 57 (individuals) | 1 | No | |||
| Anopheles2_LRT | Field | DNeasy kit (Qiagen) | V4 | LRT | 59 (individuals) | 1 | No |
ASVs with an overall relative abundance of ≥1% in negative-control samples
| Data set | ASV | Abundance | Decontam | ||||
|---|---|---|---|---|---|---|---|
| Controls | Gut samples | URT samples | LRT samples | Contamination | |||
| Aedes | 40.06 | 25.89 | 16.73 | 22.48 | No | ||
| 1.11 | 0 | 0 | 0 | No | |||
| Aegypti | 6.34 | 0 | NA | NA | No | ||
| 5.22 | 0 | NA | NA | No | |||
| 4.54 | 0 | NA | NA | No | |||
| 3.79 | 0 | NA | NA | No | |||
| 2.18 | 0 | NA | NA | No | |||
| 2.16 | 0 | NA | NA | No | |||
| 2.14 | 0 | NA | NA | No | |||
| 2.06 | 0 | NA | NA | No | |||
| 1.78 | 0 | NA | NA | No | |||
| 1.67 | 0 | NA | NA | No | |||
| 1.24 | 0 | NA | NA | No | |||
| 1.16 | 0 | NA | NA | No | |||
| 1.15 | 0 | NA | NA | No | |||
| 1.08 | 0 | NA | NA | No | |||
| 1.06 | 0 | NA | NA | No | |||
| 1.06 | 0 | NA | NA | No | |||
| Albopictus | |||||||
| Anopheles1 | 17.74 | 9.93 | NA | NA | No | ||
| 12.38 | 8.75 | NA | NA | No | |||
| 3.55 | 5.31 | NA | NA | No | |||
| 3.26 | 1.61 | NA | NA | No | |||
| 1.54 | 3.06 | NA | NA | No | |||
| 1.38 | 0.23 | NA | NA | No | |||
| Anopheles2 | |||||||
| 19.83 | 0 | 0 | 0 | No | |||
Overall relative abundance for negative-control samples and tissue samples for each data set. NA, not applicable.
ASVs found as contamination in the Decontam prevalence-based method are listed in bold, with the associated P value.
ASV distribution in the negative-control samples
| Data set | ASVs ≥1% | ASVs ≥5% | ASVs ≥10% | Total ASVs | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total | No. shared | % in tissue samples | Total | No. shared | % in tissue samples | Total | No. shared | % in tissue samples | Total | No. shared | % in tissue samples | |
| Aedes_Gut | 4 | 3 | 49.10 | 3 | 3 | 49.10 | 2 | 2 | 38.94 | 1,992 | 138 | 66.84 |
| Aedes_URT | 4 | 3 | 25.93 | 3 | 3 | 25.93 | 2 | 2 | 23.00 | 1,992 | 75 | 62.70 |
| Aedes_LRT | 4 | 3 | 34.04 | 3 | 3 | 34.04 | 2 | 2 | 30.91 | 1,992 | 221 | 56.98 |
| Aegypti_Gut | 16 | 0 | 0 | 2 | 0 | 0 | NA | NA | NA | 307 | 5 | 0.08 |
| Albopictus_Gut | 7 | 7 | 7.99 | 1 | 1 | 2.76 | 1 | 1 | 2.76 | 1,030 | 307 | 56.76 |
| Anopheles1_Gut | 12 | 12 | 42.70 | 4 | 4 | 22.78 | 2 | 3 | 18.68 | 299 | 267 | 56.58 |
| Anopheles2_Gut | 7 | 6 | 7.63 | 4 | 3 | 6.46 | 3 | 2 | 4.36 | 26 | 16 | 11.78 |
| Anopheles2_URT | 7 | 6 | 9.33 | 4 | 3 | 7.97 | 3 | 2 | 5.75 | 26 | 22 | 10.91 |
| Anopheles2_LRT | 7 | 6 | 8.63 | 4 | 3 | 7.28 | 3 | 2 | 5.01 | 26 | 18 | 10.50 |
Total ASVs and ASVs with overall relative abundances of ≥1%, ≥5%, and ≥10% in control samples. ASVs shared with tissue samples and their overall relative abundance in the tissue samples are also shown. NA, not applicable.
FIG 1Alpha diversity. (A) Number of OTUs, (B) Shannon diversity index, and (C) Pielou’s evenness index of the unmodified data set or after the removal treatment indicated. Different letters correspond to significant differences at a P value of <0.05 for paired t tests or Wilcoxon signed-rank tests.
FIG 2Beta diversity. (A) Jaccard index and (B) Bray-Curtis index of the unmodified data set or after the removal treatment indicated. Different letters correspond to significant differences at a P value of <0.05 for paired t tests or Wilcoxon signed-rank tests.