| Literature DB >> 32655509 |
Emilie Lejal1, Agustín Estrada-Peña2, Maud Marsot3, Jean-François Cosson1, Olivier Rué4,5, Mahendra Mariadassou4,5, Cédric Midoux4,5,6, Muriel Vayssier-Taussat7, Thomas Pollet1,8.
Abstract
BACKGROUND: The development of high-throughput sequencing technologies has substantially improved analysis of bacterial community diversity, composition, and functions. Over the last decade, high-throughput sequencing has been used extensively to identify the diversity and composition of tick microbial communities. However, a growing number of studies are warning about the impact of contamination brought along the different steps of the analytical process, from DNA extraction to amplification. In low biomass samples, e.g., individual tick samples, these contaminants may represent a large part of the obtained sequences, and thus generate considerable errors in downstream analyses and in the interpretation of results. Most studies of tick microbiota either do not mention the inclusion of controls during the DNA extraction or amplification steps, or consider the lack of an electrophoresis signal as an absence of contamination. In this context, we aimed to assess the proportion of contaminant sequences resulting from these steps. We analyzed the microbiota of individual Ixodes ricinus ticks by including several categories of controls throughout the analytical process: homogenization, DNA extraction, and DNA amplification.Entities:
Keywords: contaminant; high-throughput sequencing; low biomass sample; microbiota; tick
Year: 2020 PMID: 32655509 PMCID: PMC7325928 DOI: 10.3389/fmicb.2020.01093
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Analyzing process for samples and added controls at each steps: Tick homogenization, DNA extraction, DNA amplification, and 16s rRNA gene sequencing. For each step, information stated in black correspond to the initial sample and potential controls added at a previous step, and blue stated information correspond to the controls added at the present step.
FIGURE 2Comparison of the 16S rRNA gene sequencing taxonomic assignments between negative controls and samples. Percentages are represented for the OTUs with proportions corresponding to at least 1% of the total number of sequences assigned to a category of samples or controls. Those corresponding to less than 1% are assigned as “Others.” OTUs belonging to the same genus or family (if multi-affiliated at the scale of genus) were assigned under the same name. One OTU was only identified until the class level (multi-affiliated Gammaproteobacteria). Most OTUs detected in homogenization and DNA extraction controls seems to represent a high proportion of reads detected in samples, particularly in nymph and male samples.
FIGURE 3Impact of contamination on the final number of sequence per sample according to the tick instar/sex. A high proportion of sequences have been identified as contaminants and therefore removed from the dataset. Nymph and male samples seem to be more impacted, with almost 75% of samples containing more than 50% of contaminating sequences.
FIGURE 4Impact of contamination on alpha diversity estimations according to the tick instars. Alpha diversity has been estimated according to the observed number of OTUs, the Shannon and Simpson indexes and the Faith’s phylogenetic diversity (PD), for each instar on filtered (turquoise) and not filtered (orange-red) datasets. Results of the statistical test comparing alpha diversity measurement between instars, as well as between the filtered and not filtered groups, are represented on colored bands at both the top and the bottom of the frame, respectively. NS, not significant, ***p < 0.001, **p < 0.01.