| Literature DB >> 35084301 |
David Lund1,2, Nicolas Kieffer2,3, Marcos Parras-Moltó1,2, Stefan Ebmeyer2,3, Fanny Berglund2,3, Anna Johnning1,2,4, D G Joakim Larsson2,3, Erik Kristiansson1,2.
Abstract
Macrolides are broad-spectrum antibiotics used to treat a range of infections. Resistance to macrolides is often conferred by mobile resistance genes encoding Erm methyltransferases or Mph phosphotransferases. New erm and mph genes keep being discovered in clinical settings but their origins remain unknown, as is the type of macrolide resistance genes that will appear in the future. In this study, we used optimized hidden Markov models to characterize the macrolide resistome. Over 16 terabases of genomic and metagenomic data, representing a large taxonomic diversity (11 030 species) and diverse environments (1944 metagenomic samples), were searched for the presence of erm and mph genes. From this data, we predicted 28 340 macrolide resistance genes encoding 2892 unique protein sequences, which were clustered into 663 gene families (<70 % amino acid identity), of which 619 (94 %) were previously uncharacterized. This included six new resistance gene families, which were located on mobile genetic elements in pathogens. The function of ten predicted new resistance genes were experimentally validated in Escherichia coli using a growth assay. Among the ten tested genes, seven conferred increased resistance to erythromycin, with five genes additionally conferring increased resistance to azithromycin, showing that our models can be used to predict new functional resistance genes. Our analysis also showed that macrolide resistance genes have diverse origins and have transferred horizontally over large phylogenetic distances into human pathogens. This study expands the known macrolide resistome more than ten-fold, provides insights into its evolution, and demonstrates how computational screening can identify new resistance genes before they become a significant clinical problem.Entities:
Keywords: HMM; antimicrobial resistance; horizontal gene transfer; microbiome; phylogenetics
Mesh:
Substances:
Year: 2022 PMID: 35084301 PMCID: PMC8914350 DOI: 10.1099/mgen.0.000770
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Summary of predicted macrolide resistance genes and the analysed datasets. Numbers within brackets indicate the number of genomes or metagenomic samples associated with each dataset
|
Erm |
Erm |
Mph |
Mph | |||
|---|---|---|---|---|---|---|
|
Dataset |
Size (nt) |
Genes |
Families
|
Genes |
Families
|
Ref. |
|
| ||||||
|
NCBI RefSeq [15,438] |
6.21×1010 |
330 |
10/21 |
1107 |
13/59 |
[ |
|
NCBI Assembly [412,184] |
1.71×1012 |
12423 |
29/314 |
14033 |
14/210 |
[ |
|
| ||||||
|
HMP [757] |
4.69×1012 |
82 |
7/7 |
8 |
1/1 |
[ |
|
Human gut 1 [170] |
1.93×1011 |
15 |
6/5 |
2 |
1/1 |
[ |
|
Human gut 2 [114] |
1.32×1012 |
14 |
7/3 |
2 |
1/1 |
[ |
|
Pig gut [295] |
1.74×1012 |
145 |
10/9 |
17 |
1/0 |
[ |
|
Wild baboon gut [48] |
1.37×1011 |
0 |
0/0 |
0 |
0/0 |
[ |
|
Wild rhino gut [17] |
6.21×1010 |
0 |
0/0 |
0 |
0/0 |
[ |
|
WWTP [70] |
4.82×1011 |
49 |
6/35 |
8 |
4/4 |
[ |
|
Pune river [62] |
3.91×1011 |
45 |
6/33 |
13 |
4/7 |
[ |
|
Tara oceans [245] |
4.89×1012 |
2 |
0/2 |
1 |
0/1 |
[ |
|
Antarctic soil [3] |
6.25×109 |
0 |
0/0 |
0 |
0/0 |
[ |
|
Forest soil [36] |
1.99×1011 |
6 |
1/5 |
6 |
3/2 |
[ |
|
Oilspill [13] |
2.75×1011 |
0 |
0/0 |
0 |
0/0 |
[ |
|
Lake Hazen [8] |
2.75×1011 |
32 |
0/21 |
0 |
0/0 |
[ |
|
Amazon river [106] |
2.88×1011 |
0 |
0/0 |
0 |
0/0 |
[ |
|
|
|
13143 |
|
15197 |
|
|
a, Amino acid identity <70%.
b, Known/new.
c, Non-redundant.
HMP, Human Microbiome Project; WWTP, Wastewater treatment plant.
Proportions of the 427622 genomes and 12742 unique species in the NCBI database that carried macrolide ARGs
|
Genomes (%) |
Species (%) | |
|---|---|---|
|
| ||
|
Known |
2.64 |
3.63 |
|
New |
0.33 |
3.67 |
|
Total |
2.97 |
6.94 |
|
| ||
|
Known |
3.19 |
1.81 |
|
New |
0.33 |
2.81 |
|
Total |
3.52 |
4.50 |
Fig. 1.Enrichment analysis of bacterial phyla harbouring an over- or under-representation of macrolide resistance genes. The ratios and their significance were calculated using Fisher’s exact test and a star is used to denote significant results (p<0.001). (a) Odds ratios of known and new erm genes. (b) Odds ratios of known and new mph genes.
Fig. 2.The number of reconstructed full-length macrolide ARGs per gigabase for each metagenomic dataset, divided between new and known genes. (a) Reconstructed Erm 23S rRNA methyltransferases per gigabase. (b) Reconstructed Mph macrolide 2'-phosphotransferases per gigabase. Abbreviations: HMP: Human Microbiome Project, WWTP: Wastewater treatment plant.
Descriptions of identified, previously unknown macrolide resistance genes of high interest
|
Family [genes] |
Closest known homologue [amino acid sequence identity] |
Mean fold-change erythromycin (32 µg ml−1) |
Mean fold-change azithromycin (2 µg ml−1) |
Tested gene |
Host phylum |
Pathogenic host(s) |
Associated MGE(s) [no. of isolates] |
Co-localized ARG(s) [no. of isolates] |
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
UGF311 [4] |
Erm(30) [44.7–45.2 %] |
3.2* |
3.8* |
G1525 ( |
Proteobacteria |
|
IS IS |
|
|
|
|
|
|
|
|
|
| |
|
UGF171 [371] |
Erm(F) [59.5–64.3 %] |
|
|
G883 ( |
Bacteroidetes, Verrucomicrobia |
|
MPFB [69], MOBV [52], IS |
|
|
|
|
|
|
|
|
|
| |
|
UGF246 [7] |
Erm(A) [62.3–62.8 %] |
1.0 |
1.2 |
G1415 ( |
Firmicutes |
– |
– |
– |
|
|
|
|
|
|
|
|
| |
|
UGF90 [3] |
Erm(A) [60.1–62.1 %] |
2.8* |
3.0* |
G612 ( |
Firmicutes |
– |
– |
– |
|
|
|
|
|
|
|
|
| |
|
UGF35 [4] |
Erm(T) [51.8–52.7 %] |
3.4* |
4.0* |
G351 ( |
Firmicutes |
|
|
– |
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
| |
|
UGF46 [18] |
Erm(42) [45.7–50.0 %] |
2.3* |
1.8 |
G423 ( |
Proteobacteria |
|
Integrase (Int1) [17], IS |
|
|
|
|
|
|
|
|
|
| |
|
UGF122 [12] |
Erm(42) [47.0–50.3 %] |
1.0 |
1.1 |
G752 ( |
Proteobacteria |
|
|
|
|
|
|
|
|
|
|
|
| |
|
UGF20 [28] |
Erm(53) [66.7–67.5 %] |
– |
– |
– |
Firmicutes, Proteobacteria |
– |
MOBQ [2], MPFFATA [1], MPFFA [1] |
|
|
|
|
|
|
|
|
|
|
|
|
UGF5 [15] |
Mph(E) [59.9–61.8 %] |
3.4* |
4.0* |
G373 ( |
Proteobacteria, Bacteroidetes |
|
IS |
|
|
UGF100 [25] |
Mph(O) [47.9–50.2 %] |
3.2* |
4.3* |
G1169 ( |
Proteobacteria |
– |
– |
– |
|
UGF37 [5] |
Mph(B) [64.3–67.2 %] |
1.8* |
1.9 |
G879 ( |
Firmicutes |
– |
IS |
– |
a, Tested without replicates
*Significant increase in growth (p<0.001)
Fig. 3.Phylogenetic tree depicting the Erm 23S rRNA methyltransferases predicted in this study. Known ARGs and new gene families of high interest are annotated in the tree and experimentally validated new ARGs are marked by a star. Each leaf is coloured based on the phylum of the identified host(s), whether it was found only in metagenomic data, or if it was discovered in multiple phyla (mobile). The tree scale is displayed at the bottom right of the figure. Additional details, including bootstrap support values, can be found in Fig. S5.