| Literature DB >> 35283840 |
Jessica Comín1,2, Isabel Otal2,3,4, Sofía Samper1,2,4.
Abstract
The insertion sequence (IS) 6110 is a repetitive mobile element specific for the Mycobacterium tuberculosis complex (MTBC) used for years to diagnose and genotype this pathogen. It contains the overlapping reading frames orfA and orfB that encode a transposase. Its genetic variability is difficult to study because multiple copies are present in the genome. IS6110 is randomly located, nevertheless some preferential locations have been reported, which could be related to the behaviour of the strains. The aim of this work was to determine the intra- and inter-strain genetic conservation of this element in the MTBC. For this purpose, we analysed 158 sequences of IS6110 copies from 55 strains. Eighty-four copies were from 17 strains for which we knew all the locations in their genome. In addition, we studied 74 IS6110 copies in 38 different MTBC strains in which the location was characteristic of different families including Haarlem, LAM, S, and L6 strains. We observed mutation in 13.3% of the copies studied and we found 10 IS6110 variants in 21 copies belonging to 16 strains. The high copy number strains showed 6.2% of their IS6110 copies mutated, in contrast with the 31.1% in the low-copy-number strains. The apparently more ancient copy localised in the DR region was that with more variant copies, probably because this was the most studied location. Notably, all Haarlem and X family strains studied have an IS6110 in Rv0403c, suggesting a common origin for both families. Nevertheless, we detected a variant specific for the X family that would have occurred in this location after the phylogenetic separation. This variant does not prevent transposition although it may occur at a lower frequency, as X strains remain with low copy number (LCN) of IS6110.Entities:
Keywords: IS6110; IS6110 genomic variability; Mycobacterium tuberculosis complex; tuberculosis; tuberculosis evolution
Year: 2022 PMID: 35283840 PMCID: PMC8912993 DOI: 10.3389/fmicb.2022.767912
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Summary of IS6110 copies analysed in the strains for which the location of all the IS copies within their genomes is known, detailing the IS copies mutated by comparison with the reference IS in Mycobacterium tuberculosis H37Rv.
| Strain | Lineage (Family) | CN strain | IS analysed ( | Mutated copies location | Mutation detected | Non-mutated copies location | Reference |
|---|---|---|---|---|---|---|---|
| H37Rv | L4.9 | HCN | 16 | - | - | all wt | NC_000962 |
| CDC1551 | L4.1.1.3 | LCN | 4 |
|
|
| AE000516.2 |
| L6 | LCN | 5 |
| all wt but one |
| ||
| L6 | HCN | 7 | DR region | orfA syn 33 aa | all wt but one | FR878060.1 | |
| L Animal | LCN | 1 | - | - | all wt | LT708304.1 | |
|
| L Animal | LCN | 1 | - | - | all wt | CP033311.1 |
|
| L Animal | LCN | 2 | - | - | all wt | CP014566.1 |
|
| L Animal | LCN | 1 | - | - | all wt | AM408590.1 |
| L4.8 | HCN | 12 | - | - | all wt |
| |
| L2 | HCN | 18 | - | - | all wt |
| |
| L Animal | LCN | 2 | - | - | DR region, Rv0756c:phoP |
| |
| HMS 2382 | L6 | LCN | 3 | DR region | orfB Asp2Gly | moaX,Rv0963c |
|
| HMS 2407 | L6 | LCN | 3 | - | - | lipX:mshB,moaX, DR region |
|
| HCU 3445 | L4 | LCN | 1 | - | - | DR region | |
| HMS 2485 | L4.1.1.3 | LCN | 2 |
|
| DR region | |
| HCU 3717 | L4.1.1.3 | LCN | 4 | DR region ( |
|
| |
| HMS 2445 | L4.1.1.3 | LCN | 2 |
|
| DR region |
Even though HCU3717 strain only has four out of five IS copies analysed, it has been included in this section because the fifth copy is located in ppe46 gene (not successfully amplified). CN, Copy number; HCN, high copy number; and LCN, low copy number. *Mutation associated to X family.
Summary of the IS6110 sequences localised in preferential sites published for determined Mycobacterium tuberculosis families (Reyes et al., 2012; Comín et al., 2021).
| Strain | Lineage | Family by SIT | CN STRAIN | Number of IS | Mutated copies location | Mutation detected | Non-mutated copies location |
|---|---|---|---|---|---|---|---|
| HSJ 238 | L4.1.2.1 | Haarlem3 | HCN | 11/3 |
|
|
|
| HMS 18009 | L4.1.2.1 | Haarlem3 | HCN | 10/2 |
|
|
|
| HMS18021 | L4.1.2.1 | Haarlem3 | HCN | 12/2 |
|
|
|
| HMS 18037 | L4.1.2.1 | Haarlem3 | HCN | 11/4 |
|
| |
| HCU 3729 | L4.1.2.1 | Haarlem3 | HCN | 7/5 |
|
| |
| HMS 18031 | L4.1.2.1 | Haarlem3 | HCN | 8/2 |
|
|
|
| HMS 18022 | L4.1.2.1 | Haarlem1 | LCN | 6/4 | - | - | |
| HMS 18007 | L4.1.2.1 | Haarlem1 | HCN | 10/1 | - | - |
|
| HMS 18046 | L4.1.2.1 | Haarlem1 | HCN | 9/4 |
|
| |
| HSJ 234 | L4.1.2.1 | Haarlem3 | HCN | 11/3 |
|
|
|
| HMS 18001 | L4.1.2.1 | Haarlem1 | HCN | 8/4 |
|
| |
| HMS 18002 | L4.1.2.1 | Haarlem1 | HCN | 10/4 |
|
| |
| HMS 18005 | L4.3 | LAM3 | LCN | 5/1 |
|
|
|
| HSJ 241 | L4.3 | LAM3 | HCN | 15/2 |
|
|
|
| HMS 18045 | L4.3 | LAM3 | HCN | 15/2 |
|
|
|
| HMS 18017 | L4.3 | LAM12 | HCN | 14/2 |
|
|
|
| HMS 18025 | L4.3 | LAM9 | HCN | 11/2 |
|
|
|
| HMS 18010 | L4.3 | LAM9 | HCN | 14/2 | - | - |
|
| HMS18047 | L4.3 | LAM4 | HCN | 14/1 | - | - |
|
| HMS 18048 | L4.3 | LAM9 | HCN | 10/3 | - | - | |
| HMS 18018 | L4.3 | LAM3 | HCN | 12/3 | - | - | |
| Ara217 | L4.3 | LAM9 | HCN | 13/2 | - | - |
|
| HMS 18019 | L4.4.1.1 | S | HCN | 11/1 | - | - |
|
| HCU 3718 | L4 | T1 | HCN | 9/1 | - | - |
|
| HMS 18035 | L4 | T2 | LCN | 5/1 | - | - |
|
| HMS 18040 | L4 | T5 | HCN | 8/1 | - | - |
|
| HMS 18042 | L4 | T2 | HCN | 7/1 | - | - |
|
| HMS 18041 | L4 | T1 | HCN | 9/1 | - | - |
|
| HMS 18044 | L4 | T5_MAD2 | LCN | 6/1 | - | - |
|
| HMS 18014 | L4 | T5_MAD2 | LCN | 6/1 |
|
| - |
| HMS 18038 | L3 | CAS1_DELHI | HCN | 16/1 | - | - |
|
| HCU 2828 | L6 |
| HCN | 8/1 | - | - |
|
| HCU 3775 | L6 |
| LCN | 3/1 | - | - |
|
| HMS 1693 | L6 |
| LCN | 3/1 |
|
| - |
| HMS 14017 | L6 |
| LCN | 3/1 |
|
| - |
| HMS 1942 | L6 |
| LCN | unk/1 | - | - |
|
| HMS 2000 | L6 |
| LCN | 5/1 | - | - |
|
| HSJ 66 | L6 |
| LCN | 3/1 | - | - |
|
CN, Copy number; HCN, high copy number; LCN, low copy number; and SIT, Spoligo International Type.
Total number of IS6110 based in the IS6110 RFLP pattern.
Outbreak strain (.
Figure 1Summary of the different IS6110 copies studied in this work.
Figure 2Sequence of IS6110, highlighting in different coloured squares the points, the nucleotide changes, and the different strains where the 10 mutations were found. orfA is written in yellow, orfB is in red, and the overlapping region is in green. The black letters at the ends represent the inverted repeats.
Figure 3(A) Configuration of the ppe38 locus in the MtZ strain. (B) Configuration of the ppe38 locus in the Beijing GC1237 strain. The red lines indicate a truncation of the gene.