| Literature DB >> 31486766 |
Christopher J Harmer1, Ruth M Hall1.
Abstract
The relationships within a curated set of 112 insertion sequences (ISs) currently assigned to the IS6 family, here re-named the IS6/IS26 family, in the ISFinder database were examined. The encodedEntities:
Keywords: IS26; ISFinder; insertion sequence; phylogenetic analysis
Mesh:
Substances:
Year: 2019 PMID: 31486766 PMCID: PMC6807381 DOI: 10.1099/mgen.0.000291
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
IS257/IS431 pairwise DNA identity (%)
Lighter shading denotes the >95 % nucleotide identity cut-off applied by ISFinder to define isoforms. Darker shading denotes the nucleotide identity to IS257R1, the IS used to define isoforms in this group. Lengths of ISs as used to calculate DNA identity: 788 bp, IS431L; 789 bp, IS257-3 and IS257R2; 790 bp, IS257-2, IS431R, IS431mec and IS257R1; 791 bp, IS257-1.
|
IS |
IS |
IS |
IS |
IS |
IS |
IS |
IS | |
|---|---|---|---|---|---|---|---|---|
|
IS |
100 |
88.86 |
88.83 |
86.15 |
86.29 |
86.31 |
86.82 |
86.57 |
|
IS |
100 |
92.52 |
94.92 |
94.8 |
95.06 |
95.57 |
95.32 | |
|
IS |
100 |
95.3 |
96.7 |
95.18 |
95.69 |
95.44 | ||
|
IS |
100 |
97.59 |
97.21 |
97.72 |
97.97 | |||
|
IS |
100 |
98.23 |
97.97 |
97.72 | ||||
|
IS |
100 |
99.24 |
98.99 | |||||
|
IS |
100 |
99.49 | ||||||
|
IS |
100 |
IS257/IS431 transposase amino acid similarity (%)
Lighter shading denotes the >98 % amino acid similarity cut-off applied by ISFinder to define isoforms. Darker shading denotes the amino acid similarity to IS257R1, the IS used to define isoforms in this group. Lengths of transposases as used to calculate amino acid similarity: 221 aa, IS257-2; 224 aa, IS257-1, IS257-3, IS431L, IS431R, IS431mec, IS257R1 and IS257R2.
|
IS |
IS |
IS |
IS |
IS |
IS |
IS |
IS | |
|---|---|---|---|---|---|---|---|---|
|
IS |
100 |
96.55 |
97.04 |
96.05 |
96.05 |
95.56 |
96.05 |
96.55 |
|
IS |
100 |
98.66 |
98.21 |
98.66 |
98.21 |
98.66 |
99.10 | |
|
IS |
100 |
98.66 |
99.10 |
98.66 |
99.10 |
99.10 | ||
|
IS |
100 |
98.66 |
98.21 |
98.66 |
99.55 | |||
|
IS |
100 |
99.55 |
99.10 |
98.66 | ||||
|
IS |
100 |
98.55 |
99.10 | |||||
|
IS |
100 |
99.10 | ||||||
|
IS |
100 |
Archaeal ISs analysed
Only ISs conforming to the following criteria were included in this analysis: IS must follow the current ISFinder rules (IS sharing >98 % amino acid similarity and/or >95 % nucleotide identity are considered the same IS), IS must include an intact transposase, and the transposase must contain a conserved DDE catalytic triad.
|
IS |
IS length (bp) |
Tnp length (aa) |
TIR (bp)* |
TIR end† |
TSD (bp)‡ |
Accession no.§ |
|---|---|---|---|---|---|---|
|
ISC735 |
735 |
214 |
15 |
TA/GA |
nd |
AY671942 |
|
ISC774 |
778 |
231 |
16 |
GT |
nd |
NC_002754 |
|
ISH14 |
696 |
211 |
14/15 |
GG |
8 |
NC_006396 |
|
ISH15 |
697 |
211 |
15/17 |
CG/CA |
9 |
NC_006396 |
|
ISH17 |
745 |
226 |
14 |
GG |
8 |
NC_006393 |
|
ISH29 |
697 |
211 |
16/17 |
GG |
8 |
NC_002608 |
|
ISHarch12 |
698 |
211 |
15/17 |
GG |
8 |
CP002988 |
|
ISHarch13 |
698 |
211 |
15/17 |
GG |
nd |
CP002988 |
|
ISHarch14 |
698 |
211 |
15/17 |
GG/CA |
8 |
CP002988 |
|
ISHarch4 |
743 |
223 |
18 |
GG |
8 |
CP002989 |
|
ISHbo1 |
745 |
225 |
15 |
GG |
8 |
CP001692 |
|
ISHla13 |
745 |
226 |
14/15 |
GG |
nd |
CP001366 |
|
ISHli8 |
745 |
226 |
15/16 |
GG |
nd |
CP024845|| |
|
ISHli9 |
742 |
224 |
18/22 |
GG |
8 |
CP024845|| |
|
ISHme1 |
740 |
225 |
15 |
GG |
nd |
CP001871 |
|
ISHmu2 |
671 |
211 |
13/14 |
GG/GA |
8 |
CP001688 |
|
ISHmu9 |
697 |
206 |
14/22 |
GG/GT |
nd |
CP001688 |
|
ISHtu2 |
743 |
225 |
15/18 |
GG |
8 |
CP001861 |
|
ISMja1 |
703 |
214 |
19/22 |
GA |
nd |
NC_000909 |
|
ISNgar3 |
697 |
211 |
14/15 |
GG/CA |
8 |
CP003377 |
|
ISNamo10 |
697 |
211 |
13/17 |
GG/TG |
nd |
HF582854 |
|
ISNamo11 |
698 |
211 |
15/17 |
CA/CC |
8 |
HF582854 |
|
ISNamo12 |
697 |
211 |
14/17 |
GG |
8 |
HF582854 |
|
ISNamo13 |
745 |
226 |
16/18 |
GG |
8 |
HF582854 |
|
ISNamo7 |
697 |
211 |
14/17 |
GG |
8 |
HF582854 |
|
ISNamo8 |
696 |
211 |
15/17 |
GG |
nd |
HF582854 |
|
ISNamo9 |
696 |
211 |
15/17 |
GG |
nd |
HF582854 |
|
ISNma4 |
744 |
226 |
12 |
GG |
nd |
CP001932 |
|
ISNpe4 |
696 |
211 |
12/14 |
GG |
8 |
CP003372 |
|
ISNpe9 |
742 |
224 |
12/14 |
GG/GA |
3 |
CP003372 |
|
ISNph1 |
697 |
211 |
15/17 |
GG/CA |
nd |
NC_007426 |
|
ISPfu1 |
781 |
233 |
15/16 |
GA |
8 |
NC_003413 |
|
ISPfu2 |
782 |
233 |
15/16 |
GA |
8 |
NC_003413 |
|
ISPfu5 |
779 |
233 |
15/16 |
GA |
9 |
NC_003413 |
|
ISSis1 |
735 |
215 |
18 |
GA |
8 |
NC_006424 |
|
ISSte1 |
746 |
217 |
16 |
GT |
8 |
NC_005969 |
|
ISSto2 |
853 |
237 |
33/34 |
GG |
5 |
NC_003106 |
*TIR length as recorded in ISFinder.
†5′ and complementary 3′ sequence.
‡TSD length as recorded in ISFinder. nd denotes no TSD observed in the sequence submitted to ISFinder, but does not preclude the presence of a TSD in other sequenced examples of that particular IS.
§Reference sequence recorded in ISFinder.
||No accession number recorded in ISFinder. Accession number identified by performing a blast search using the sequence in the ISFinder entry.
Fig. 1.Un-rooted neighbour-joining phylogeny of 112 IS6/IS26 family transposases. The consensus tree was reconstructed by resampling the analysis 10 000 times. ISs derived from bacteria are coloured green (Gram-negative), red (Gram-positive) or purple (divergent longer bacterial ISs derived from species), and ISs of archaeal origin are blue. Clades encompassing ISs of bacterial origin are labelled I to VIII, and clades encompassing ISs of archaeal origin are labelled a to d.
Bacterial Gram-positive IS6 family members analysed*
Only ISs conforming to the following criteria were included in this analysis: IS must follow the current ISFinder rules (IS sharing >98 % amino acid similarity and/or >95 % nucleotide identity are considered the same IS), IS must include an intact transposase, and the transposase must contain a conserved DDE catalytic triad.
|
IS |
IS length (bp) |
Tnp length (aa) |
TIR (bp)* |
TSD (bp)† |
Clade |
Accession no.‡ |
|---|---|---|---|---|---|---|
|
| ||||||
|
IS |
808 |
226 |
24 |
nd |
II |
U49512 |
|
IS |
861 |
235 |
16/17 |
nd |
V |
M23740 |
|
IS |
817 |
241 |
16/17 |
nd |
V |
CP015730§ |
|
IS |
790 |
221 |
20/26 |
8 |
II |
X53951 |
|
IS |
790 |
224 |
18/20 |
8 |
II |
M13290 |
|
ISBth6 |
864 |
238 |
17/18 |
nd |
V |
NZ_AAJM01000806 |
|
ISBth20 |
808 |
228 |
16/18 |
nd |
II |
CP003766 |
|
ISBwe2 |
863 |
235 |
16 |
nd |
V |
NC_010180 |
|
ISBwe3 |
843 |
235 |
15/16 |
nd |
V |
NC_010180 |
|
ISCap |
793 |
224 |
22 |
nd |
II |
EF177828 |
|
ISEnfa1|| |
808 |
228 |
18/19¶ |
nd |
II |
KX579977 |
|
ISLgar4 |
809 |
223 |
17/18 |
nd |
II |
AMFE010000010 |
|
ISLmo3 |
807 |
226 |
17 |
nd |
II |
CP022021 |
|
ISLmo4 |
812 |
226 |
17 |
nd |
II |
CP006611§ |
|
ISS1N |
808 |
226 |
18 |
8 |
II |
M37395 |
|
ISS1S |
808 |
226 |
18 |
8 |
II |
M18294 |
|
ISS1W |
809 |
226 |
17/18 |
8 |
II |
M37396 |
|
ISSau6 |
793 |
224 |
21/22 |
nd |
II |
NC_002952 |
|
ISSau10 |
793 |
225 |
16 |
8# |
II |
FN390947 |
|
ISTeha2 |
809 |
226 |
17 |
nd |
II |
AP012046 |
|
| ||||||
|
ISCbo1 |
1307 |
344 |
20/25 |
8 |
VII |
NC_015425 |
|
ISClte3 |
1274 |
340 |
21/32 |
nd |
VII |
AE015927 |
|
ISCpe7 |
1274 |
340 |
12/15 |
8 |
VII |
NC_008262 |
|
ISDsp3 |
1456 |
319 |
28/36 |
8 |
Solo |
NC_009455 |
|
ISNth1 |
1226 |
353 |
17/23 |
8 |
VII |
NZ_ABKR01000003 |
|
ISTps1 |
1349 |
343 |
18/23 |
8 |
VII |
CP000924 |
|
ISTps1a |
1339 |
343 |
12/16 |
nd |
VII |
NC_010320 |
|
| ||||||
|
ISHahy6 |
1713 |
453 |
14/19 |
8 |
VIII |
CP002304 |
|
ISHco2 |
1727 |
453 |
14/19 |
nd |
VIII |
FNDF01000001 |
|
ISTet2 |
1761 |
454 |
15/23 |
8 |
VIII |
AEYS01000011 |
|
| ||||||
|
ISFal2 |
826 |
236 |
13/19 |
nd |
Solo |
NC_008278 |
*TIR length as recorded in ISFinder.
†TSD length as recorded in ISFinder. nd denotes no TSD observed in the sequence submitted to ISFinder, but does not preclude the presence of a TSD in other sequenced examples of that particular IS.
‡Reference sequence recorded in ISFinder.
§No accession number recorded in ISFinder. Accession number identified by performing a blast search using the sequence in the ISFinder entry.
||The sequence in ISFinder for ISEnfa1 (GenBank accession number AY884205) included a 10 bp deletion, resulting in a truncated transposase. A second complete sequence for ISEnfa1 was identified (GenBank accession number KX579977) and used to replace the truncated sequence.
¶TIRs were incorrectly defined in ISFinder. Additional available sequences were examined, and the TIRs were modified by removing 1 G nucleotide at the left and right ends of ISEnfa1 based on GenBank accession number CP031028. The IS length was adjusted accordingly.
#No TSD recorded in ISFinder, but an 8 bp TSD was reported by Gomez-Sanz et al. in 2013.[56].
Fig. 2.Structure of representative bacterial and archaeal IS6/IS26 family members. The extent and orientation of the transposases are shown with the N-terminal (N) and C-terminal (C) marked. Protein secondary structures predicted by JPred are marked, including predicted catalytic domains (blue ovals), H-HTH motifs (red boxes), zinc fingers (brown zig-zags) and an unknown domain (yellow triangle), and a signal peptide (S, green star). TIRs are shaded in grey. Drawn to scale in Gene Construction Kit (GCK, version 4.0) from the corresponding entries in the ISFinder database before transposing into Adobe Illustrator.
Fig. 3.Alignment of the amino acid sequences of representative transposases of bacterial IS6/IS26 family members. The transposases of three representatives from clades I to VI, and ISFal2 are shown. The extents of the H-HTH putative DNA binding domain and the DDE catalytic domain are marked above. Residues in the catalytic domain that are conserved in at least 62 of the 65 transposases in the complete alignment (Fig. S1) are marked with a capital letter. Residues for which only two variations exist are marked with smaller letters. The completely conserved DDE residues are marked by red letters. Amino acids are shaded as follows: black, 100 % similarity; dark grey, 80–99 % similarity; light grey, 60–79 % similarity; unshaded, less than 59 % similarity.
Fig. 4.Alignment of the TIRs of the short bacterial IS6/IS26 family members. The first 20 nucleotides and the complement of the last 20 nucleotides of each of the bacterial ISs from clades I to VI are aligned on the left and right, respectively, with each base coloured specifically. The consensus is shown above.
Fig. 5.Alignment of the ten long ISs and ten representatives of the short bacterial ISs from clades I to VI. A ClustalW alignment (BLOSUM matrix, gap open cost 10, gap extend cost 0.1) was generated in Geneious (version 7.1.9) to align the transposases of the ten long bacterial ISs and ten representatives of the short bacterial ISs from clades I to VI. Amino acids are shaded as follows: black, 100 % similarity; dark grey, 80–99 % similarity; light grey, 60–79 % similarity; unshaded, less than 59 % similarity.
Fig. 6.Alignment of the TIRs of the long bacterial IS6/IS26 family members. The first 20 nucleotides and the complement of the last 20 nucleotides of each of the bacterial ISs from clades VII and VIII are aligned on the left and right, respectively, with each base coloured specifically. The consensus is shown above.
Fig. 7.Alignment of representative transposases from archaeal IS6/IS26 family members. The transposases of four representatives from archaeal clades a to d are shown. Amino acids are shaded as follows: black, 100 % similarity; dark grey, 80–99 % similarity; light grey, 60–79 % similarity; unshaded, less than 59 % similarity.
Fig. 8.Alignment of the TIRs of the archaeal IS6/IS26 family members. The first 20 nucleotides and the complement of the last 20 nucleotides of each of the archaeal ISs from clades a to d are aligned on the left and right, respectively, with each base coloured specifically. The consensus is shown above.
Bacterial Gram-negative IS6 family members analysed
Only ISs conforming to the following criteria were included in this analysis: IS must follow the current ISFinder rules (IS sharing >98 % amino acid similarity and/or >95 % nucleotide identity are considered the same IS), IS must include an intact transposase, and the transposase must contain a conserved DDE catalytic triad.
|
IS |
IS length (bp) |
Tnp length (aa) |
TIR (bp)* |
TSD (bp)† |
Clade |
Accession no.‡ |
|---|---|---|---|---|---|---|
|
| ||||||
|
IS |
820 |
234 |
14 |
8 |
I |
X00011 |
|
IS |
819 |
234 |
16 |
nd |
I |
NC_004361 |
|
IS |
819 |
234 |
18 |
nd |
I |
AJ250860 |
|
IS |
820 |
234 |
17 |
nd |
I |
AJ251307 |
|
IS |
810 |
234 |
15/16 |
nd |
IV |
X87144 |
|
IS |
861 |
245 |
14 |
nd |
I |
Y18002 |
|
IS |
880 |
254 |
14 |
8 |
I |
X53635 |
|
ISEas1 |
811 |
231 |
16 |
nd |
IV |
CP011588 |
|
ISEc59 |
891 |
240 |
17/23 |
nd |
VI |
KX246266 |
|
ISPpr9 |
858 |
245 |
18/22 |
nd |
I |
NC_005871 |
|
ISPsa2 |
863 |
242 |
17/19 |
nd |
V |
HM563000 |
|
ISSba18 |
829 |
230 |
14/16 |
nd |
IV |
NC_009999 |
|
ISVsa4 |
861 |
246 |
15/18 |
nd |
I |
NC_011311 |
|
ISYps1 |
809 |
231 |
16 |
8 |
IV |
FM178282|| |
|
| ||||||
|
IS |
815 |
228 |
16/19 |
nd |
IV |
AF118548 |
|
ISApr5 |
806 |
231 |
11/12 |
nd |
III |
NZ_ABHC01000011 |
|
ISBj7 |
824 |
253 |
14/16 |
nd |
III |
BA000040 |
|
ISMex25 |
887 |
250 |
13/15 |
nd |
VI |
CP001513|| |
|
ISMno28 |
892 |
250 |
18/21 |
nd |
VI |
NC_011892 |
|
ISMno34 |
889 |
250 |
14/15 |
nd |
VI |
NC_011892 |
|
ISMno36 |
888 |
250 |
15/18 |
nd |
VI |
NC_011892 |
|
ISMno37 |
889 |
250 |
25/34 |
nd |
VI |
NC_011894 |
|
ISMno6 |
816 |
233 |
15 |
nd |
VI |
NC_011887 |
|
ISMpo1 |
888 |
250 |
15/16 |
nd |
VI |
NC_010727 |
|
ISMtsp1 |
893 |
250 |
15 |
nd |
VI |
NZ_ABAY01000077 |
|
ISMtsp2 |
894 |
250 |
16/18 |
nd |
VI |
NZ_ABAT01000047 |
|
ISPko6 |
817 |
227 |
15/19 |
nd |
IV |
KP294352|| |
|
ISPpa9 |
812 |
236 |
15 |
nd |
III |
EU909903 |
|
ISPmar1 |
821 |
227 |
18/21 |
8 |
IV |
GU997095 |
|
ISRle39A |
890 |
250 |
18/22 |
nd |
VI |
X99520 |
|
ISRle6 |
891 |
256 |
15/17 |
nd |
VI |
NC_008382 |
|
ISRle7 |
891 |
250 |
14 |
nd |
VI |
NC_008382 |
|
ISRH1 |
809 |
250 |
15/16¶ |
nd |
III |
AF023675 |
|
ISRsp9 |
818 |
237 |
18/20 |
nd |
III |
FO082821|| |
|
ISRssp3 |
810 |
236 |
20/24 |
nd |
III |
NZ_AAMC01000007 |
|
ISStag1 |
889 |
255 |
17/18 |
nd |
I |
NZ_AAUW01000024 |
|
| ||||||
|
ISBmu21 |
836 |
240 |
18/24 |
nd |
IV |
AP009387 |
|
| ||||||
|
IS |
841 |
236 |
20 |
nd |
I |
AF121000 |
|
ISAcma1 |
844 |
236 |
13/14¶ |
nd |
V |
NC_009932 |
|
ISCca2 |
834 |
235 |
12/13 |
8 |
IV |
CBQZ010000005 |
|
ISCef5 |
843 |
236 |
21/23¶ |
nd |
I |
NC_004320 |
|
ISDge13 |
806 |
237 |
19 |
nd |
III |
NC_009939 |
|
ISDge15 |
747 |
230 |
16/17 |
nd |
III |
NC_008010 |
|
ISSus3 |
823 |
237 |
33/34 |
nd |
V |
NC_003106 |
*TIR length as recorded in ISFinder.
†TSD length as recorded in ISFinder. nd denotes no TSD observed in the sequence submitted to ISFinder, but does not preclude the presence of a TSD in other sequenced examples of that particular IS.
‡Reference sequence recorded in ISFinder.
§Origin in ISFinder is recorded as Mycobacterium fortuitum (Actinobacter), though this IS is overwhelmingly found in Gammaproteobacteria.
||No accession number recorded in ISFinder. Accession number identified by performing a blast search using the sequence in the ISFinder entry.
¶TIRs were incorrectly defined in ISFinder. Additional available sequences were examined, and the TIRs were modified by removing 1 G nucleotide at the left and right ends of ISCef5 and ISAcma1 (based on GenBank accession numbers CP010451 and CP000844, respectively), or by removing 1 C nucleotide at each end of ISRH1 (based on GenBank accession number NZ_PIQN00000000). The IS lengths were adjusted accordingly.
#IS1628 and ISCef5, Actinobacteria Actinobacteria; ISCca2, Bacteroidetes Cytophagia; ISAcma1, Cyanobacteria Synococcales; ISDge13 and ISDge15, Deonococcus–Thermus Dinococci; ISSus3, Acidobacteria Solibacteres.