| Literature DB >> 32878926 |
Mohammad Razavi1,2, Erik Kristiansson1,3, Carl-Fredrik Flach1,2, D G Joakim Larsson4,2.
Abstract
Insertion sequences (ISs) are abundant mobile genetic elements on bacterial genomes, responsible for mobilization of many genes, including antibiotic resistance genes (ARGs). As ARGs often occur in similar genetic contexts, understanding which ISs tend to be associated with known ARGs could be a first step toward discovering novel ARGs through predictive or experimental strategies. This could be valuable, as early identification of ARGs in pathogens could facilitate surveillance, confinement actions, molecular diagnostics, and drug development. Here, we present a comprehensive analysis of the association of specific ISs with known ARGs. A large collection of bacterial genomes was used to characterize the immediate context of 2,437 known ARGs and 3,768 ISs. While many ARGs were consistently found close to specific ISs, the contexts around all ISs were more variable. Nevertheless, a subset of individual ISs, as well as tentative composite transposons, showed significant associations with ARGs. These included, e.g., insertion sequences classified as IS6, Tn3, IS4, and IS1 that were not only strongly associated with diverse ARGs but also highly abundant in pathogens. Therefore, we conclude that the context of this subset of ISs and tentative composite transposons would be particularly valuable to discover novel ARGs through modeling or empirical approaches. A set of 1,891 metagenomes were analyzed to identify environments where those ISs commonly associated with ARGs were particularly abundant. The associations found in metagenomes were similar to those found in genomes.IMPORTANCE The emergence and spread of antibiotic resistance genes (ARGs) among pathogens threaten the prevention and treatment of bacterial infections as well as our food production chains. Early knowledge about mobile ARGs that are present in pathogens or that have the potential to become clinically relevant could help mitigate potential negative consequences. Recently, exploring integron gene cassettes was shown to be successful for identifying novel mobilized ARGs, some of which were already circulating in pathogens. Still, only a subset of ARGs is mobilized by integrons, and the contexts of other mobile genetic elements associated with ARGs remain unexplored. This includes insertion sequences (ISs) responsible for the mobilization of many ARGs. Our analyses identified ISs, species, and environments where ARG-IS relationships are particularly strong. This could be a first step to guide the discovery of novel ARGs, while also providing insights into mechanisms involved in the mobilization and transfer of ARGs.Entities:
Keywords: antibiotic resistance; bacterial genomes; environment; insertion sequences; metagenomics; resistome
Mesh:
Substances:
Year: 2020 PMID: 32878926 PMCID: PMC7471000 DOI: 10.1128/mSphere.00418-20
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1Relative frequencies of putative functions of the open reading frames (ORFs) within 10 ORF distance (see Materials and Methods and Fig. 7) from ARGs (left panel) and IS variants (right panel), in chromosomes and in plasmids and ICE sets, which are grouped by classes of antibiotics and IS domains, respectively. Increasing the ORF distance will inflate the “other” category, probably reflecting an increase in genes not functionally associated with the transposon. We selected an ORF distance of 10, which is roughly where we observed that the inflation of the “other” categories began more clearly.
FIG 7Conceptual example of how ORFs located downstream and upstream of three core genes (each an IS or ARG) were numbered. Note that for each occurrence of an ARG or IS, a new region with surrounding ORFs was analyzed.
FIG 2The relative frequencies of putative functions of the ORFs around IS domains in three groups of bacteria are presented. The antibiotic selection pressure in the environment that the bacteria inhabit and the extent of horizontal gene transfer that they engage in appear to be linked to the association of IS variants with ARGs and virulence factors. Only chromosomal ORFs within 10 ORF distance away from each IS were considered in the analyses. As a comparison, the average relative frequencies of the contents within 10 ORF distance of 10,000 randomly selected genes on the chromosomes of the included genomes are presented for each group.
FIG 3Distribution of ISs that are significantly associated with different types of ARGs within 10 ORF distance (see Materials and Methods). Different colors show various IS domains, and the size of the circles represents the presence of their associations with ARGs in different bacterial genera.
FIG 4The variability of all ORFs versus all ARGs within tentative composite transposons. The colors represent different IS domains, and the sizes of the markers are scaled to reflect the ARG richness within tentative composite transposons.
FIG 5Insertion sequences and ARGs in metagenomic data sets. (a) Relative abundances of ARGs categorized by ARG families. (b) Relative abundances of IS names categorized by IS domains. (c) Number of different ISs (IS richness) found in each environment grouped by IS domains. (d) Bray-Curtis measurement matrix of environments. The data show similarities between different environments over the presence/absence data of ISs. The main diagonal shows the within-environment similarities.
FIG 6Tracing the associations of ISs with ARGs, identified from analyzing bacterial genome data, in metagenomes. (a) A scatterplot showing the difference between average correlations of the relative abundances of ISs with two groups of ARGs across different metagenomes, i.e., those ARGs significantly associated with ISs from the genome data and those less extensively associated with ISs (see Materials and Methods). A Wilcoxon test confirms a highly significant difference between them (absence of a difference is indicated by a horizontal dashed red line). (b) (Top panel) Average Spearman correlation of ISs with all the ARGs. The sizes of the markers are scaled according to the ARG richness from genome data. (Bottom panel) Stack bar chart showing the distribution of ISs across different environments. The average of relative abundances of ISs across metagenomes in each environment was used.
ISs associated with a high diversity of ARGs (group 1) or with a generally variable context (group 2)
| Group | Family | IS | Abundance | Jaccard | ARG | Selected host(s) (genus) | Environment(s) |
|---|---|---|---|---|---|---|---|
| 1 | IS | IS | 26,941 | 0.086 | 113 | W, G, A | |
| IS | 2,323 | 0.297 | 43 | W, I, R | |||
| IS | 9,286 | 0.214 | 17 | S, A | |||
| IS | 7,930 | 0.030 | 20 | W, A, O | |||
| Tn | Tn | 7,214 | 0.26 | 31 | O, I, W | ||
| Tn | 20,532 | 0.176 | 54 | W, A, G | |||
| Tn | 15,492 | 0.165 | 64 | O, A, W | |||
| IS | IS | 14,872 | 0.185 | 44 | W, Se, A | ||
| IS | IS | 15,540 | 0.052 | 44 | W, G, So | ||
| IS | 8,502 | 0.08 | 39 | W, G, A | |||
| IS | IS | 3,006 | 0.263 | 24 | W, Se, A | ||
| IS | 223 | 0.067 | 13 | W, A, O | |||
| IS | 4,146 | 0.098 | 15 | O, W | |||
| IS | IS | 8,627 | 0.26 | 27 | W, R, Se | ||
| IS | IS | 950 | 0.269 | 8 | O, G, A | ||
| 2 | IS | IS | 1,474 | 0.084 | 0 | W, R, Se | |
| IS | IS | 536 | 0.087 | 0 | A, W, Se | ||
| IS | IS | 831 | 0.078 | 0 |
| M | |
| IS | IS | 693 | 0.060 | 0 | O, G, A | ||
For unit transposases such as members of Tn3 families, averages over all the identified composite transposons with at least one Tn3 transposase were used.
M, marine; W, wastewater/sludge; G, gut; So, soil; S, skin; A, animal associated; R, river; O, oral; Se, sediment; V, vaginal; P, industrially polluted.