| Literature DB >> 35298369 |
Timothy M Ghaly1, Anahit Penesyan1,2, Alexander Pritchard3, Qin Qi1, Vaheesan Rajabal1,2, Sasha G Tetu1,2, Michael R Gillings1,2.
Abstract
Integrons are microbial genetic elements that can integrate mobile gene cassettes. They are mostly known for spreading antibiotic resistance cassettes among human pathogens. However, beyond clinical settings, gene cassettes encode an extraordinarily diverse range of functions important for bacterial adaptation. The recovery and sequencing of cassettes has promising applications, including: surveillance of clinically important genes, particularly antibiotic resistance determinants; investigating the functional diversity of integron-carrying bacteria; and novel enzyme discovery. Although gene cassettes can be directly recovered using PCR, there are no standardised methods for their amplification and, importantly, for validating sequences as genuine integron gene cassettes. Here, we present reproducible methods for the amplification, sequence processing, and validation of gene cassette amplicons from complex communities. We describe two different PCR assays that either amplify cassettes together with integron integrases, or gene cassettes together within cassette arrays. We compare the performance of Nanopore and Illumina sequencing, and present bioinformatic pipelines that filter sequences to ensure that they represent amplicons from genuine integrons. Using a diverse set of environmental DNAs, we show that our approach can consistently recover thousands of unique cassettes per sample and up to hundreds of different integron integrases. Recovered cassettes confer a wide range of functions, including antibiotic resistance, with as many as 300 resistance cassettes found in a single sample. In particular, we show that class one integrons are collecting and concentrating resistance genes out of the broader diversity of cassette functions. The methods described here can be applied to any environmental or clinical microbiome sample.Entities:
Keywords: Gene cassette PCR; antibiotic resistance; class 1 integrons; sedentary chromosomal integrons
Mesh:
Substances:
Year: 2022 PMID: 35298369 PMCID: PMC9176274 DOI: 10.1099/mgen.0.000788
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Experimental and bioinformatic workflow for gene cassette amplicon sequencing. (a) Components of integrons amplified by the two PCR assays. The primer set HS287/HS286 targets cassettes that lie between two attC sites. Potentially any gene cassette(s) can be amplified by this primer set. The primer set intI-R / HS286 targets diverse integron integrases (intI) and cassette recombination sites (attC). The resulting amplicons include ~800 bp of intI and at least the first cassette(s) of an array. (b) The bioinformatic steps and software (in parentheses) used to process and filter amplicon data. Methods are shown for both primer sets sequenced with either Oxford Nanopore (ONT) or Illumina technologies.
Fig. 2.Diversity of recovered gene cassette ORFs. Redundancy was removed using a 100 % amino acid identity of translated protein sequences. (a) Total non-redundant cassette ORFs amplified using the primers HS287/HS286. (b) Cassette ORF diversity was normalised for sequencing depth, based on averages (±1 S.E) of triplicate 50-megabase subsamples of raw sequence reads. Total (c) and normalised (d) cassette ORF diversity are shown for the intI-R / HS286 primer set. Average (±1 S.E) diversity for each analysis are shown on the right-hand side of each panel. The degree of statistical significance is shown by asterisks as determined by two-sample T-tests or Wilcoxon rank sum tests (depending on the normality of the data). NS: P>0.05, *: P<0.05, **: P<0.01, ***: P<0.001, ****: P<0.0001.
Fig. 3.Diversity of integron integrases recovered by the intI-R / HS286 primer set. (a) Total non-redundant (100 % amino acid identity) integron integrases (IntIs) recovered. (b) IntI diversity was normalised for sequencing, based on averages (±1 S.E) of triplicate 50-megabase subsamples of raw sequence reads. Total (c) and normalised (d) diversity of IntI classes (using a 94 % amino acid clustering threshold) are shown. Average (±1 S.E) diversity for each analysis are shown on the right-hand side of each panel. Differences between Nanopore (ONT) and Illumina MiSeq technologies were not significant (NS) as determined by Wilcoxon rank sum tests.
Most prevalent integron integrase (IntI) classes
|
No. of IntIs in cluster/class |
Prevalence among samples (%) |
|
|
|---|---|---|---|
|
94 |
100 |
Class 1 integron - Multispecies |
99.3 |
|
71 |
91.7 |
Xanthomonadales (Rhodanobacteraceae and Xanthomonadaceae) |
~70 |
|
19 |
91.7 |
Multiple phyla (Deltaproteobacteria, Nitrospinae, Chloroflexi) |
~60 |
|
26 |
66.7 |
Xanthomonadaceae ( |
~70 |
|
10 |
58.3 |
Betaproteobacteria |
~80 |
|
13 |
41.7 |
‘IntI1-like’ - Multispecies |
~91 |
|
13 |
41.7 |
Rhodanobacteraceae |
~75 |
|
12 |
41.7 |
Xanthomonadales (Rhodanobacteraceae and Xanthomonadaceae) |
~74 |
|
8 |
41.7 |
Xanthomonadales (Rhodanobacteraceae and Xanthomonadaceae) |
~72 |
|
7 |
41.7 |
Planctomycetes |
~67 |
Fig. 4.COG functional analysis of cassette-encoded proteins recovered with the HS287/HS286 primer set. (a) Average (±1 S.E) percentage of proteins per sample (n=12) that can be classified into functional categories. On average ~20 % of cassette-encoded proteins can be classified by a COG category, half of which fall into categories of known function. (b) The average (±1 S.E) proportion of proteins within a sample assigned to each of the known functional categories. The complete list of protein sequences assigned a COG functional category is presented in Table S1.
Fig. 5.Abundance and diversity of antibiotic resistance gene (ARG) cassettes. (a) Abundance of ARGs recovered from either primer set. (b) The average (±1 S.E) abundance of each ARG type recovered from Nanopore (ONT) sequencing per sample (n=12).
Fig. 6.Proportions of gene cassette recombination sites (attCs) assigned to bacterial taxa. Taxonomic predictions are based on a selection of six (a–f) of the eleven available taxonomic models of chromosomal attCs. Each figure panel shows the proportion of attCs across each sample that exhibit sequence and structure conserved among that taxon. For a comparison of all eleven taxa, see Fig. S5.