| Literature DB >> 35894699 |
Jacob Cote1, Colin Welch1,2, Madeline Kimble1,2, Dakota Archambault1,2, John Curtis Ross1, Hector Orellana1, Katelyn Amero1,2, Claire Bourett1,2, Andre Daigle1, Keith W Hutchison1,2, Sally D Molloy1,2.
Abstract
Mycobacterium abscessus is an emerging pathogen of concern in cystic fibrosis and immunocompromised patients and is considered one of the most drug-resistant mycobacteria. The majority of clinical Mycobacterium abscessus isolates carry 1 or more prophages that are hypothesized to contribute to virulence and bacterial fitness. The prophage McProf was identified in the genome of the Bergey strain of Mycobacterium chelonae and is distinct from previously described prophages of Mycobacterium abscessus. The McProf genome increases intrinsic antibiotic resistance of Mycobacterium chelonae and drives expression of the intrinsic antibiotic resistance gene, whiB7, when superinfected by a second phage. The prevalence of McProf-like genomes was determined in sequenced mycobacterial genomes. Related prophage genomes were identified in the genomes of 25 clinical isolates of Mycobacterium abscessus and assigned to the novel cluster, MabR. They share less than 10% gene content with previously described prophages; however, they share features typical of prophages, including polymorphic toxin-immunity systems.Entities:
Keywords: zzm321990 Mycobacteriumzzm321990 ; bacteriophage; genome; prophage
Mesh:
Substances:
Year: 2022 PMID: 35894699 PMCID: PMC9434293 DOI: 10.1093/g3journal/jkac188
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.542
M. abscessus bacterial strains carrying MabR prophage.
| Accession no. |
|
| Additional prophage | Coordinates |
| Subspecies | Strain | |
|---|---|---|---|---|---|---|---|---|
| FSAT01 | GCA_900131665.1 | prophiFSAT01-1 | C1 2,104,368–2,172,096 | – | – | United Kingdom |
| 280 |
| FSIL01 | GCA_900136245.1 | prophiFSIL01-1 | C6 162,543–229,039 | prophiFSIL01-2 (MabA1) | C2 491,511–553,312 | United Kingdom |
| 1,009 |
| FSGY01 | GCA_900135415.1 | prophiFSIL01-1 | C4 326,208–259,712 | prophiFSIL01-2 (MabA1) | C2 209,626–147,825 | United Kingdom |
| 62 |
| FSGZ01 | GCA_900135455.1 | prophiFSIL01-1 | C7 228,809–162,313 | prophiFSIL01-2 (MabA1) | C2 491,649–553,450 | United Kingdom |
| 63 |
| FSHA01 | GCA_900135465.1 | prophiFSIL01-1 | C6 228,812–162,316 | prophiFSIL01-2 (MabA1) | C2 209,313–147,512 | United Kingdom |
| 64 |
| FSHB01 | GCA_900135495.1 | prophiFSIL01-1 | C2 229,051–162,555 | prophiFSIL01-2 (MabA1) | C2 563,712–501,911 | United Kingdom |
| 314 |
| FSHC01 | GCA_900135485.1 | prophiFSIL01-1 | C6 125,119–191,615 | prophiFSIL01-2 (MabA1) | C3 209,298–147,497 | United Kingdom |
| 66 |
| FSHD01 | GCA_900135515.1 | prophiFSIL01-1 | C7 125,118–191,614 | prophiFSIL01-2 (MabA1) | C3 491,671–553,472 | United Kingdom |
| 67 |
| FSHE01 | GCA_900135475.1 | prophiFSIL01-1 | C7 125,120–191,616 | prophiFSIL01-2 (MabA1) | C2 209,310–147,509 | United Kingdom |
| 68 |
| FSHF01 | GCA_900135505.1 | prophiFSIL01-1 | C1 826,179–162,313 | prophiFSIL01-2 (MabA1) | C1 491,510–553,311 | United Kingdom |
| 69 |
| FSHG01 | GCA_900135535.1 | prophiFSIL01-1 | C6 228,810–162,314 | prophiFSIL01-2 (MabA1) | C2 479,152–417,351 | United Kingdom |
| 70 |
| FSHI01 | GCA_900135525.1 | prophiFSIL01-1 | C7 228,802–162,306 | prophiFSIL01-2 (MabA1) | C3 209,315–147,514 | United Kingdom |
| 71 |
| FSIG01 | GCA_900136185.1 | prophiFSIL01-1 | C5 228,798–162,543 | prophiFSIL01-2 (MabA1) | C3 491,467–553,268 | United Kingdom |
| 991 |
| FSIH01 | GCA_900136155.1 | prophiFSIL01-1 | C1 1,678,669–1,745,165 | prophiFSIL01-2 (MabA1) | C2 707,577–769,378 | United Kingdom |
| 993 |
| FSIJ01 | GCA_900136115.1 | prophiFSIL01-1 | C6 125,125–191,621 | prophiFSIL01-2 (MabA1) | C3 209,291–147,490 | United Kingdom |
| 996 |
| FSIQ01 | GCA_900136355.1 | prophiFSIL01-1 | C6 228,795–162,299 | prophiFSIL01-2 (MabA1) | C2 491,560–553,361 | United Kingdom |
| 1,019 |
| FSKF01 | GCA_900137275.1 | prophiFSIL01-1 | C5 125,178–191,674 | prophiFSIL01-2 (MabA1) | C2 491,625–553,426 | United Kingdom |
| 1,024 |
| FVMH01 | GCA_900136085.1 | prophiFSIL01-1 | C6 228,779–162,283 | prophiFSIL01-2 (MabA1) | C1 1,082,839–1,144,640 | United Kingdom |
| 994 |
| FVPC01 | GCA_900137305.1 | prophiFSIL01-1 | C1 544,352–477,856 | prophiFSIL01-2 (MabA1) | C1 879,057–817,256 | United Kingdom |
| 1,026 |
| FSQJ01 | GCA_900141335.1 | prophiFSQJ01-1 | C10 102,082–169,883 | prophiFSQJ01-3 (MabD) | C12 50,449–101,334 | United States |
| 712 |
| FSMS01 | GCA_900139245.1 | prophiFSQJ01-1 | C13 99,951–167,702 | prophiFSMS01-2 (MabD), | C7 85,005–135,096 | United States |
| 699 |
| prophiFSMS01-3 (MabG) | C7 156,631–209,932 | |||||||
| FSOD01 | GCA_900140065.1 | prophiFSQJ01-1 | C13 85,252–17,501 | – | – | United States |
| 686 |
| FVXT01 | GCA_900141255.1 | prophiFSQJ01-1 | C10 93,905–26,154 | prophiFSMS01-2 (MabD) | C7 84,991–135,082 | United States |
| 698 |
| prophiFSMS01-3 (MabG) | C7 156,617–209,918 | |||||||
| FVLO01 | GCA_900135885.1 | prophiFVLQ01-1 | C1 163,540–230,227 | prophiFVLQ01-2 (MabD), | C15 73,430–126,907 | Australia |
| 874 |
| prophiFVLQ01-3 (MabC) | C5 363,193–311,358 | |||||||
| FVLQ01 | GCA_900135895.1 | prophiFVLQ01-1 | C2 360,992–427,679 | prophiFVLQ01-2 (MabD), | C4 73,988–127,465 | Australia |
| 875 |
| prophiFVLQ01-3 (MabC) | C7 510–52,345 |
GenBank assembly accession numbers.
Prophages are named after the first genome where they were first isolated, identical prophage in other genomes use the same name.
MabR prophage in the genomes FSGY01, FSGZ01, FSHE01, FSHG01, FVHM01, FSMS01, and FVXT01 have single-nucleotide differences with their representative prophage genome.
The contig number (C1, C2, etc.) is shown followed by the coordinates within that contig.
MabR prophage coordinates in representative host genomes (FSAT01, FSIL01, FSQJ01, and FVLQ01) are ordered from attL to attR.
All genome samples were isolated from the respiratory system of diseased hosts in the country indicated.
Genome characteristics of cluster MabR prophages.
| Prophage |
| Coordinates | Length | ORFs |
| Accession |
|---|---|---|---|---|---|---|
| McProf |
| 1,521,426–1,589,648 | 68,223 | 99 | TGCGCCGTCAGGGGCTCGAACCCCGGACCCGCTGATTAAGAGTCA | BK061309 |
| prophiFSAT01-1 |
| C1 2,104,368–2,172,096 | 67,729 | 99 | TGCGCCGTCAGGGGCTCGAACCCCGGACCCGCTGATTAAGAGTCA | BK061308 |
| prophiFSIL01-1 |
| C6 162,543–229,039 | 66,497 | 99 | TGCGCCGTCAGGGTTTCGAACCCCAGACCCGCTGATTAAGAGTCA TGCGCCGTCAGGGGCTCGAACCCCGGACCCGCTGATTAAGAGTCA | BK061311 |
| prophiFSQJ01-1 |
| C10 102,082–169,883 | 67,752 | 102 | CCCCTGTAGGGCTCGAACCTACGACCTACTGATTAAAAGTCAG CCCCACCAGGGCTCGAACCTGGGACCTGCGGATTAAAAGTCCG | BK061312 |
| prophiFVLQ01-1 |
| C2 360,992–427,679 | 66,688 | 100 | TGACTCTTAATCAGCGGGTCCGGGGTTCGAGCCCCTGACGGCGCA | BK061310 |
attB-18 was identified by Dedrick .
Coordinates of the selected phage in the host where it was first identified (e.g. prophiFSAT01-1 in the genome FSAT01). The contig number (C1, C2, etc.) is shown followed by the coordinates within that contig. Coordinates are arranged attL to attR.
Prophage lengths include 2 copies of the attachment sites.
McProf is a previously described prophage (Cushman ) found in the M. chelonae genome CCUG 47445.
Fig. 1.Diversity of MabR prophages. a) Dotplot comparison of MabR prophages. b) Phylogenetic network representation of cluster MabR prophages and M. abscessus prophages (Dedrick ) based on shared gene content as described by Pope ). Nodes represent individual prophage; circles represent prophage clusters. Scale marker indicates substitutions/site.
attB sites of cluster MabR prophage.
|
| Core sequence | Prophages | Genomic feature | Coordinates |
|---|---|---|---|---|
|
| CTGGTGCGCCGTCAGGGGCTCGAACCCCGGACCCGCTGATTAAGAGTC | McProf, prophiFSAT01-1, prophiFVLQ01-1 | Mab_t5022c; 3′ end tRNA-Lys | 1,550,157–1,550,204 |
|
| TGCGCCGTCAGGG | prophiFSIL01-1 | Mab_t5030c; 3′ half of tRNA-Lys | 2,089,033–2,089,077 |
|
| CCCC | prophiFSQJ01-1 | In Mab_0771c | 770,355–770,397 |
attB-18 was identified by Dedrick .
Sequence shared between attL and attR sites within and near the genomic feature for each attB site; mismatches are shown in bold. Novel attB sites (attB 19, 20) have no mismatches when aligning to attR sites in their respective phage.
As defined in Table 2.
Genes and coodinates in the M. abscessus strain ATCC1997.
Fig. 2.MabR prophage integration locations. The 3 integration schemes used by MabR prophage are shown as attB site locations (black bars) shown relative to M. abscessus ATCC 19977 genes for reference. Rightward and leftward transcribed genes are indicated by arrows with their ATCC 19977 gene number. Both tRNAs (t5022c and t5030c) are transcribed in the leftward direction. Not shown are McProf and prophiFVLQ01-1, which utilize the attB-18 site described by Dedrick .
Fig. 3.Organization of MabR genomes. MabR genomes are shown with pairwise nucleotide sequence similarity displayed by colors between genomes: purple is the most similar and red is the least similar above a BLASTN E threshold of 10−5. The ruler represents the coordinates of the genome. Forward and reverse-transcribed genes are shown as boxes above and below the ruler, respectively. Maps were generated using Phamerator and the database, “Actino_Mab_Draft (version 20)” (Cresawn ).
Fig. 4.Genome organization of a) prophiMcProf and b) prophiFSQJ01-1. The ruler represents the coordinates of the genome. Forward and reverse-transcribed genes are shown as boxes above and below the ruler, respectively. Genes are colored according to their assigned Phamily with the Phamily number shown above each gene with the number of Phamily members in parentheses. Genome maps were generated using Phamerator and the database, “Actino_Mab_Draft (version 20)” (Cresawn ).
Fig. 5.Organization of MabR PT-Imm systems. MabR genomes are aligned at their PT-Imm systems beginning at the 3′ end of the predicted immunity proteins. Genomes are displayed as described in Fig. 2 but are ordered in such a way that genomes with the most similarity in this region are next to each other. Also shown are the motifs/domains found at the N- and C-termini of MabR PTs. All predicted PTs have a single WXG-100 motif at the N-terminus while the C-terminus is variable. Note that gp99 in prophiFSIL01-1 and prophiFSAT01-1 has no predicted function and is included to show the relationship of the PT systems to the genome ends.
Fig. 6.MabR PT-Imm systems found in non-MabR prophage. Genomes are displayed as described in Figs. 2 and 5. prophiGD43-5 and prophiGD79-1 belong to clusters MabK and MabQ, respectively.