| Literature DB >> 32641975 |
Nancy Fayad1,2, Mireille Kallassy Awad2, Jacques Mahillon1.
Abstract
Insertion sequences (IS) are ubiquitous transposable elements with a very simple organization: two inverted repeats flanking a transposase coding gene. IS982 is one of 26 insertion sequence families known so far. With 70 registered members in the ISFinder database, this family remains somewhat unexplored, despite the association of many of its members with important features such as antibiotic resistance. IS982 has a fairly simple organization with a mean length of ca. 1 Kb, two inverted repeats with conserved 5' AC 3' ends flanking a transposase coding gene and direct repeats of variable lengths. Its transposase has a RNAse-H like chemistry with an atypical DDE motif. In this study, we first highlight the current knowledge on the IS982 family by dissecting its registered members and their characteristics. Secondly, we bring new insights into this old, yet uncharted IS family, by exploring its registered elements, as well as the genomic and proteomic databases of bacterial and archaeal strains. This probing showed that the presence and distribution of this family goes far beyond the clear-cut registry of ISFinder database.Entities:
Keywords: Antibiotic resistance; Archaea; DDE motif; Insertion sequences; Transposase
Year: 2020 PMID: 32641975 PMCID: PMC7335449 DOI: 10.1186/s13100-020-00221-z
Source DB: PubMed Journal: Mob DNA
List of IS982 elements from the ISFinder database; a original host in which the element was found. b Bacterial/archaeal group to which belongs the original host of the IS982 element according to LifeMap [26] and the NCBI taxonomy. c length of the IS982 element including transposase CDS, left and right inverted repeats. d length of the IS982 TnpA protein and coordinates of its CDS. e length of identical nt in the left and right IR over the total length. f Direct repeats in the element’s original host/species. * = Disrupted transposase coding gene
| Name | Origina | Gram (+/−) or Archaea | Bacterial and Archeal host Group b | IS Length (bp)c | TnpA Length (aa)d | IR (bp)e | DR (bp)f |
|---|---|---|---|---|---|---|---|
| IS | – | 1028 | 326 (34–1014) | 21/22 | 7 | ||
| IS | – | 1027 | 294 (100–984) | 18/18 | 2 | ||
| IS1599 | – | 1026 | 293 (102–983) | 18/18 | 0 | ||
| IS | + | 1038 | 301 (111–1016) | 21/21 | 9 | ||
| IS | – | 1070 | 300 (176–1078) | 12/12 | 8 | ||
| IS | + | 1028 | 302 (101–1009) | 25/25 | 8 | ||
| IS | + | 999 | 296 (91–981) | 19/20 | 8 | ||
| IS | + | 999 | 296 (91–981) | 19/20 | 8 | ||
| IS | + | 999 | 296 (91–981) | 19/20 | 0 | ||
| IS | – | 975 | 292 (90–968) | 18/22 | 0 | ||
| IS | – | 972 | 279 (83–922) | 17/20 | 7 | ||
| IS | – | 1003 | 301 (84–988) | 12/12 | 6 | ||
| IS | – | 975 | 291 (85–960) | 17/17 | 7 | ||
| IS | – | 974 | 293 (84–965) | 17/17 | 8 | ||
| IS | – | 972 | 289 (91–960) | 11/17 | 9 | ||
| IS | – | 980 | 287 (95–958) | 13/13 | 8 | ||
| IS | – | 979 | 287 (95–958) | 12/12 | 6 | ||
| IS | + | 996 | 290 (102–975) | 18/21 | 0 | ||
| IS | – | 920 | 274 (86–910) | 21/25 | 6 | ||
| IS | – | 1000 | 291 (105–980) | 21/22 | 4 | ||
| IS | – | 999 | 291 (104–979) | 18/19 | 6 | ||
| IS | + | 1017 | 301 (84–989) | 24/26 | 0 | ||
| IS | + | 989 | 301 (68–973) | 11/11 | 0 | ||
| IS | + | 1157 | 313 (115–1056) | 11/13 | 0 | ||
| IS | + | 1151 | 315 (109–1056) | 22/29 | 7 | ||
| IS | + | 845 | 257 (65–838) | 18/23 | 3 | ||
| IS | + | 845 | 257 (65–838) | 19/24 | 6 | ||
| IS | + | 907 | 271 (55–870) | 19/19 | 5 | ||
| IS | + | 1041 | 302 (111–1020) | 22/22 | 7 | ||
| IS | – | 998 | 277 (150–982) | 18/18 | 0 | ||
| IS | – | 963 | 284 (92–945) | 11/11 | 7 | ||
| IS | + | 1004 | 292 (105–983) | 14/14 | 9 | ||
| IS | + | 1001 | 292 (102–980) | 14/14 | 9 | ||
| IS | – | 1115 | 305 (185–1102) | 25/32 | 0 | ||
| IS | + | 998 | 296 (90–980) | 20/20 | 8 | ||
| IS | + | 994 | 297 (99–992) | 16/17 | 6 | ||
| IS | + | 962 | 271 (137–952) | 32/35 | 7 | ||
| IS | + | 965 | 240 (78–800) | 27/28 | 8 | ||
| IS | + | 965 | 285 (98–955) | 27/28 | 0 | ||
| IS | + | 965 | 285 (98–955) | 27/28 | 8 | ||
| IS | + | 998 | 296 (90–980) | 21/27 | 0 | ||
| IS | + | 999 | 296 (91–981) | 18/20 | 4 | ||
| IS | + | 985 | 292 (97–975) | 29/35 | 8 | ||
| IS | – | 996 | 292 (97–972) | 18/18 | 8 | ||
| IS | – | 1000 | 208 (350–973) | 21/21 | 3 | ||
| IS | – | 1028 | 297 (121–1011) | 16/17 | 6 | ||
| IS | – | 988 | 251 (226–980) | 15/18 | 5 | ||
| IS | – | 1018 | 292 (124–999) | 17/17 | 5 | ||
| IS | – | 1023 | 292 (128–1003) | 16/17 | 4 | ||
| IS | – | 952 | 275 (105–929) | 22/22 | 10 | ||
| IS | Archaea | 933 | 289 (59–930) | 15/16 | 2 | ||
| IS | – | 988 | 294 (91–975) | 15/17 | 2 | ||
| IS | – | 990 | 294 (93–977) | 15/17 | 0 | ||
| IS | – | 1021 | 324 (33–1007) | 21/22 | 7 | ||
| IS | – | 1282 | 239 (282–1001) | 09/18 | 7 | ||
| IS | – | 983 | 294 (90–974) | 20/21 | 3 | ||
| IS | – | 983 | 292 (4688–5566) | 14/16 | 0 | ||
| IS | + | 1112 | 312 (92–1030) | 21/28 | 9 | ||
| IS | + | 962 | 287 (90–953) | 24/24 | 9 | ||
| IS | + | 962 | 287 (89–952) | 22/25 | 6 | ||
| IS | – | 989 | 295 (89–976) | 12/17 | 8 | ||
| IS | + | 951 | 297 (49–942) | 17/18 | 0 | ||
| IS | – | 995 | 292 (104–982) | 15/20 | 9 | ||
| IS | + | 962 | 287 (89–952) | 21/23 | 8 | ||
| IS | + | 962 | 302 (44–952) | 27/28 | 2 | ||
| IS | – | 916 | 279 (75–914) | 15/16 | 0 | ||
| IS | Archaea | 905 | 280 (58–902) | 15/16 | 2 | ||
| IS | – | 1001 | 293 (95–976) | 17/17 | 6 | ||
| IS | – | 981 | 290 (95–967) | 18/18 | 0 | ||
| IS | – | 980 | 294 (93–977) | 20/26 | 7 |
Fig. 1IS982 inverted repeats left and right sequence logo, generated by WebLogo 3 [28]. The x-axis represents position of the corresponding nt. The y-axis represents bits, which indicate the maximum entropy for the given sequence type (log2 4 = 2 bits for DNA/RNA). The height of symbols within the stack reflects the relative frequency of the corresponding nt at that position
Fig. 2a Dendrogram representing the relationship among the 69 IS982 transposases. Protein sequences were first aligned with MAFFT, and the relationship tree was established via neighbor joining, with a bootstrap value of 100, followed by a rooting [35]. Blue, pink and green colors refer to the IS original hosts as Gram-negative bacteria, Gram-positive bacteria and archaea, respectively; b WebLogo [28] 3 comparison of the left (top) and right (bottom) IR of each cluster; c complete IS element nt length range (bp); d transposase protein length range (aa); e Direct repeats length range (bp) and f count of bacterial and archaeal groups within each cluster
Fig. 3Multiple Sequence alignment of 15 randomly chosen IS982 elements from the seven clusters. Alignment was done using Mega-X via a ClustalW algorithm [34]. Only residues conserved at a minimum of 50% are highlighted. Predicted helix-turn-helix and DDE domains are indicated by orange and grey arrows. The catalytic triad DDE and the potential missing K/R residue are indicated by red and green marks, respectively
Distribution of IS982-related elements based on a BLAST. P search
| Element | Original host genus (bacterial/archaeal group) | Total number of genera* | Number of new genera (%)** | Most prevalent group in the new genera (%) |
|---|---|---|---|---|
| 4 | 2 (50%) | |||
| 4 | 2 (50%) | |||
| 80 | 75 (93.75%) | |||
| 140 | 125 (89.3%) | |||
| 132 | 120 (90.9%) | |||
| 81 | 79 (97.5%) | High GC Gram+ (86%) | ||
| 133 | 20 (85%) |
* Number of genera where the specific IS982-related element was found
** Genera not previously known to hold IS982-related elements
a PVC group: Planctomycetes, Verrucomicrobia, and Chlamydiae group
b FCB group: Fibrobacteres, Chlorobi and Bacteroidetes group
Consequences of the insertion of IS982 family elements into the promotor region (A) or the coding DNA sequence (B) of antibiotic resistance or virulence genes. The effects include the complete or partial activation/increase of expression (↑) or inactivation (↓) of the corresponding gene(s)
| Insertion site | Element | Effect | Affected gene | Consequences |
|---|---|---|---|---|
| A. Within the promotor region | IS | ↑ | Resistance to carbapenem | |
| IS | ↑ | |||
| IS | ↑ | |||
| IS | ↑ | |||
| IS | ↑ | Resistance to carbapenem | ||
| IS | ↓ Partial | Between | Reduction of β-galactosidase activity | |
| B. Within the CDS | IS | ↓ | D-Alanine:D-Alanine ligase | Resistance to vancomycin and teicoplanin |
| IS | ↓ | D-Alanine:D-Alanine ligase | Resistance to vancomycin and teicoplanin | |
| IS | ↓ | Resistance to carbapenem | ||
| IS | ↓ | Loss of virulence | ||
| IS | ↓ | Loss of virulence | ||
| IS | ↓ | Negative impact on the cell metabolism and physiology | ||
| IS | ↓ Partial | Cysteine protease | Disruption of its Arg-X cleavage ➔ Decrease in virulence |