| Literature DB >> 29735891 |
Gilles Vergnaud1, Cédric Midoux2, Yann Blouin3, Maria Bourkaltseva4, Victor Krylov5, Christine Pourcel6.
Abstract
Transposable phages, also called saltoviruses, of which the Escherichia coli phage Mu is the reference, are temperate phages that multiply their genome through replicative transposition at multiple sites in their host chromosome. The viral genome is packaged together with host DNA at both ends. In the present work, genome sequencing of three Pseudomonas aeruginosa transposable phages, HW12, 2P1, and Ab30, incidentally gave us access to the location of thousands of replicative integration sites and revealed the existence of a variable number of hotspots. Taking advantage of deep sequencing, we then designed an experiment to study 13,000,000 transposon integration sites of bacteriophage Ab30. The investigation revealed the presence of 42 transposition hotspots adjacent to bacterial interspersed mosaic elements (BIME) accounting for 5% of all transposition sites. The rest of the sites appeared widely distributed with the exception of coldspots associated with low G-C content segments, including the putative O-antigen biosynthesis cluster. Surprisingly, 0.4% of the transposition events occurred in a copy of the phage genome itself, indicating that the previously described immunity against such events is slightly leaky. This observation allowed drawing an image of the phage chromosome supercoiling into four loops.Entities:
Keywords: chromosomal domain; deep sequencing; hotspots; imaging; supercoiling; transposable phages; transposon integration
Mesh:
Substances:
Year: 2018 PMID: 29735891 PMCID: PMC5977238 DOI: 10.3390/v10050245
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Main characteristics of phages Ab30, 2P1, and HW12.
| Features | vB_PaeS_PAO1_Ab30 | vB_PaeS_PcyII-10_2P1 | vB_PaeS_PAO1_HW12 |
|---|---|---|---|
| common name | Ab30 | 2P1 | HW12 |
| source | sewage water | sewage water | clinical isolate |
| geographic origin | Abidjan, Côte d’Ivoire | Orsay, France | Moscow, Russia |
| year of isolation | 2012 | 2014 | 1983 * |
| genome size | 37,205 bp | 37,087 bp | 37,496 bp |
| accession number | LN610590 | LN907801 | LT999987 |
| host strain for phage growth | PA14 | PcyII-10 | PAO1 |
| sequencing reads accessions | ERR1596849 | ERR1596851 | ERR2402365 |
| Average phage sequencing coverage | 1200× | 10,000× | 12,000× |
| publication | [ | this report | [ |
* Year of recovery of the phage from the clinical P. aeruginosa strain provided to V.K. by Professor Bruce Holloway.
Figure 1Distribution of transposition sites on Pseudomonas aeruginosa at base-pair resolution. Panel (A) The distribution of 25,000 HW12 transposition sites. A maximum of five coincident insertions (one in the reverse orientation and four in the forward orientation) is observed at position 4,720,184. Panel (B) The distribution of 19,300 2P1 transposition events. Eight peaks above ten coincident insertions are observed. The tentative HW12 hotspot is one of them. The highest peak corresponds to 27 coincident insertions at positions 265,012–265,013.
Figure 2The phage Ab30 major transposition hotspots. The distribution of the 13 million Ab30 transposition events is plotted along the PA14 bacterial host genome at a one base-pair resolution. On the vertical axis, the number of transposition events at a particular position is shown. The 14 sites with at least one position hosting more than three thousand insertions are numbered.
A list of the 14 main Ab30 insertion hotspots (at least one nucleotide position with more than 3000 events).
| # | Position PA14 NC_008463 * | Position PAO1 NC_002516 * | CDS Locus Tag ** | Distance | Ratio Forward $ | Distance from Consensus Motif (bp) £ | Mismatches £ | VNTR § |
|---|---|---|---|---|---|---|---|---|
| 1 | 254,808; 254,824 | 265,008; 265,024 | PA14_RS01160 | 28,587/24,475 | 41% | 56 | 2 | ms211 |
| 2 | 806,635; 806,652 | 4,720,167; 4,720,184 | PA14_RS30080 | 2414/16,824 | 58% | 58 | 0 | |
| 3 | 1,569,481; 1,569,485 | 3,988,644; 3,988,648 | PA14_RS07340 | 68,541/26,332 | 47% | 57 | + | |
| 4 | 1,682,632; 1,682,636 | 3,858,411; 3,858,415 | between PA14_RS07780 and PA14_RS07785 | 299/18,473 | 35% | 57 | 2 | |
| 5 | 2,879,877; 2,879,884 | 2,752,911; 2,752,918 | PA14_RS13425 | 336/4118 | 35% | 57 | 3 | |
| 6 | 2,938,509; 2,938,513 | 2,705,279; 2,705,283 | PA14_RS13610 | 4732/32,396 | 48% | 56 | 0 | ms214 |
| 7 | 3,079,036; 3,079,040 | 2,558,697; 2,558,701 | between PA14_RS14100 and PA14_RS14105 | 1443/34,807 | 50% | 56 | 0 | |
| 8 | 3,079,172 | 2,556,658 | PA14_RS14105 | 1579/34,675 | 47% | 60 | 0 | |
| 9 | 3,325,225; 3,325,229 | 3,874,426; 3,874,430 | between PA14_RS15200 and PA14_RS15205 | 6907/23,919 | 61% | 57 | 0 | |
| 10 | 3,555,249; 3,555,253 | 2,077,078; 2,077,082 | PA14_RS16160 | 495/7668 | 47% | 60 | 0 | |
| 11 | 5,214,540; 5,214,544 | 5,050,834; 5,050,838 | between PA14_RS23875 and PA14_RS23880 | 8737/1571 | 54% | 57 | + | |
| 12 | 5,725,984; 5,725,999 | 5,455,320; 5,455,335 | between PA14_RS26245 and PA14_RS26250 | 44,700/98 | 80% | 54 | 1 | ms223 |
| 13 | 5,732,031; 5,732,035 | 5,461,367; 5,461,371 | PA14_RS26270 | 5927/9363 | 51% | 59 | 1 | |
| 14 | 5,743,027; 5,743,031 | 5,471,620; 5,471,624 | between PA14_RS26335 and PA14_RS26340 | 1622/2499 | 58% | 58 | 1 | ms224 |
# the major hotspots numbered from 1–14 as shown in Figure 2. * the indicated positions in the PA14 or PAO1 reference genomes are immediately adjacent to at least 3000 right-ends; insertion of the phage DNA is not necessarily elevated at internal positions as illustrated in Figure 3 for hotspot 11; ** the intra or intergenic status of the hotspots is indicated. $ the distance from the hotspots to the nearest flanking NsiI sites is indicated together with the ratio of forward versus reverse integrations of the phage genome. A significant and artefactual bias is observed due to the host-phage junctions enrichment process used, when the hotspot is immediately adjacent to an NsiI site as in hotspots 4, 5, and 12. £ distance in base-pairs between the hotspot and the motif, and number of mismatches between the motif and the 35 bp consensus, as shown in Figure S4 and Figure 4. +, indel in the motif compared to the consensus. §, the Variable Number of Tandem Repeats (VNTR). Some motifs are part of a tandemly repeated element used in genotyping assays [35,36].
Figure 3A focus on Ab30 hotspot 11. The position of the transposition events is shown at a one base-pair resolution. Insertions in the forward or reverse orientations are presented in the red or blue colors, respectively. The small inset is obtained by a five base-pairs shift of the upper part of the graph. The almost perfect symmetry illustrates the absence of bias in transposition orientation.
Figure 4Definition of a highly conserved consensus. The conserved sequences adjacent to each of the fourteen hotspots are shown with the 35 bp consensus sequence on top.
Figure 5The discovery of a pattern of transposition events along the Ab30 genome. The position along the phage Ab30 genome of transposition events which occurred within the phage is plotted. (A) The number of reads in 50 bp bins is shown on the axis. The orientation of the insertion is depicted with colors. The position of the two NsiI sites present in the Ab30 genome is indicated. The NsiI site at position 6170 creates an artifact hotspot. Seven main peaks are observed defining nine domains numbered accordingly. (B) Proposed topology of the phage genome. The pattern of transposition events along the phage genome presented in panel (A) defining domains D1 to D9 suggested the existence of four loops (L1–L4) including four fixed crossing points inaccessible to transposition. The locations of the synaptonemal complex (transposome) and predicted strong gyrase site (SGS) are indicated.