| Literature DB >> 34611248 |
Sergio Morgado1, Érica Fonseca2, Ana Carolina Vicente2.
Abstract
Arr is an ADP-ribosyltransferase enzyme primarily reported in association with rifamycin resistance, which has been used to treat tuberculosis in addition to Gram-positive infections and, recently, pan-resistant Gram-negative bacteria. The arr gene was initially identified on the Mycolicibacterium smegmatis chromosome and later on Proteobacteria plasmids. This scenario raised concerns on the distribution and spread of arr, considering the Bacteria domain. Based on 198,082 bacterial genomes/metagenomes, we performed in silico analysis, including phylogenetic reconstruction of Arr in different genomic contexts. Besides, new arr alleles were evaluated by in vitro analysis to assess their association with rifampin resistance phenotype. The arr gene was prevalent in thousands of chromosomes and in hundreds of plasmids from environmental and clinical bacteria, mainly from the phyla Actinobacteria, Proteobacteria, Firmicutes, and Bacteroidetes. Furthermore, this gene was identified in other and new genomic contexts. Interestingly, Arr sequences associated with rifampin resistance were distributed across all phylogeny, indicating that, despite the diversity, their association with rifampin resistance phenotype were maintained. In fact, we found that the key residues were highly conserved. In addition, other analyzes have raised evidence of another Arr function, which is related to guanidine metabolism. Finally, this scenario as a whole also suggested the Actinobacteria phylum as a potential ancestral source of arr within the Bacteria domain.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34611248 PMCID: PMC8492726 DOI: 10.1038/s41598-021-99255-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Distribution of Arr sequences and median GC content of the arr and genome of their hosts.
| Phylum | # Sequences | Genome median GC (%) | min | max | |||
|---|---|---|---|---|---|---|---|
| 7 | 0.58 | 0.56 | 0.53 | 0.61 | 414 | 0.0336 | |
| 3559 | 0.64 | 0.63 | 0.48 | 0.75 | 426 | 0.0388 | |
| 401 | 0.38 | 0.43 | 0.35 | 0.64 | 438 | 0.0620 | |
| 1 | 0.48 | 0.38 | 0.38 | 0.38 | 438 | – | |
| 20 | 0.47 | 0.52 | 0.46 | 0.61 | 417 | 0.0340 | |
| 2 | 0.65 | 0.55 | 0.5 | 0.61 | 412.5 | 0.0778 | |
| 1398 | 0.4 | 0.38 | 0.27 | 0.61 | 417 | 0.0604 | |
| 3668 | 0.57 | 0.47 | 0.32 | 0.74 | 453 | 0.0687 | |
| 1 | 0.72 | 0.68 | 0.68 | 0.68 | 417 | - | |
| 6 | 0.6 | 0.59 | 0.55 | 0.65 | 417 | 0.0320 | |
| Phylum | # Sequences | Plasmid median GC (%) | min | max | |||
| 2 | 0.715 | 0.685 | 0.68 | 0.69 | 417 | 0.0070 | |
| 2 | 0.32 | 0.38 | 0.38 | 0.38 | 447 | 0 | |
| 7 | 0.37 | 0.32 | 0.31 | 0.33 | 417 | 0.00690 | |
| 306 | 0.52 | 0.47 | 0.36 | 0.63 | 453 | 0.0368 | |
*SD standard derivation.
Figure 1Comparison of GC content of arr genes and host genomes. Box plots are grouped based on the phyla with the greatest abundance of arr. The inner square of the boxplots corresponds to the mean value of the GC content.
Figure 2Arr phylogeny generated by the maximum likelihood method. The blue (clade I) and yellow (clade II) branches denote the two main clades. Some sub-clades of clade II are highlighted. Arr sequences functionally verified by this study and other studies are labeled green and red, respectively. Bootstrap values above 50 are shown. The outer color strips indicate the bacterial phylum of the sequence. The presence of colored circles beside the strips indicates whether the sequence has been identified in a mobile element. Reference sequences are ticked in red and have been functionally characterized.
Figure 3Arr phylogenetic tree and sequence alignment. The colored labels indicate the phyla of the organisms from which the sequences were obtained: red, Proteobacteria; aqua, Actinobacteria; purple, Firmicutes; black, environment. The value in parentheses corresponds to the MIC given by Arr expressed in heterologous systems. Highlighted residues are based on the clustalX color scheme. Conserved residues and motifs are shown below the alignment, as follows: NAD+ binding sites, blue; catalytic residue, olive; RIF binding sites, red. Highly conserved residues (≥ 90%), considering all Arr sequences analyzed in the phylogeny, are represented below the alignment by the fuchsia blocks. References: Arr-2, Arr-sc, and Arr-ms[1]; Arr-4 and Arr-5[6]; Arr-8[8]; Arr-w1, Arr-w2, Arr-w3, and Arr-w4[4]; ArrMab[7]; Arr-cb[2].
Figure 4arr genomic context in different organisms.
E-test MIC values for rifampicin in E. coli and wild organisms.
| Strains | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Rifampicin MIC (µg/mL) | 4 | 16 | 8 | 32 | 32 | 16 | 16 | 32 | 32 |
Figure 5Structure of Arr-ms (PDB 2HW2) highlighting motifs (α-helices, red; β-sheets, yellow; loops, green) and the highly conserved residues identified in all Arr sequences used in the phylogeny (blue residues). Rifampin corresponds to the gray structure.