| Literature DB >> 17900363 |
Michael Käser1, Simona Rondini, Martin Naegeli, Tim Stinear, Francoise Portaels, Ulrich Certa, Gerd Pluschke.
Abstract
BACKGROUND: Comparative genomics has greatly improved our understanding of the evolution of pathogenic mycobacteria such as Mycobacterium tuberculosis. Here we have used data from a genome microarray analysis to explore insertion-deletion (InDel) polymorphism among a diverse strain collection of Mycobacterium ulcerans, the causative agent of the devastating skin disease, Buruli ulcer. Detailed analysis of large sequence polymorphisms in twelve regions of difference (RDs), comprising irreversible genetic markers, enabled us to refine the phylogenetic succession within M. ulcerans, to define features of a hypothetical M. ulcerans most recent common ancestor and to confirm its origin from Mycobacterium marinum.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17900363 PMCID: PMC2098775 DOI: 10.1186/1471-2148-7-177
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Positions of RD1 to RD12 on the . Widths of the bars correspond to the sizes of deletions.
Figure 2Geographical distribution of the five . The origin of M. ulcerans strains included in this study is shown in the world map, with each dot representing one patient isolate as defined in materials and methods. The five InDel haplotypes are encircled.
Genomic properties of M. ulcerans haplotypes as compared to the M. marinum strain M sequence in the twelve RDs
| size (kb) | in RD | quantity (qualifier) | in RD | size (kb) | in RD | quantity | in RD | quantity | in RD | quantity | in RD | |
| 5.0 | RD2 | |||||||||||
| 0.8 a | RD3A | |||||||||||
| 11.1::IS | RD1 | |||||||||||
| 1.8 | RD4 | |||||||||||
| 3.8::IS | RD3B | |||||||||||
| 7.3 | RD7 | 1 | RD12 | 1 | RD12 | |||||||
| 17.2::14.7 d | RD7 | |||||||||||
| 0.5 | RD10 | |||||||||||
| 30.5::IS | RD9B | |||||||||||
| 3.9 a | RD12A | |||||||||||
| 27.5 | RD12B | |||||||||||
| 0.8 a | RD3A | |||||||||||
| 60.7 | RD5,10 | |||||||||||
| 4.6::IS | RD11A | |||||||||||
| 51.5 | RD8 | |||||||||||
| 3.9 a | RD12A | |||||||||||
| 0.8 a | RD3A | |||||||||||
| 24.1 | RD9A | |||||||||||
| 3.6 | RD9 | |||||||||||
| 4.6::IS | RD11A | |||||||||||
| 3.4 | RD10 | |||||||||||
| 3.9 a | RD12A | |||||||||||
| 42.0 | RD12C | |||||||||||
| 3 (IS | RD1 | |||||||||||
| 1 (IS | RD3 | |||||||||||
| 1 (IS | RD10 | |||||||||||
| 2 (IS | RD5 | |||||||||||
| 2 (IS | RD6 | |||||||||||
| 1 (IS | RD7 | 1.7::IS | RD1 | |||||||||
| 2 | RD2 | 1 | RD2 | |||||||||
| 0.8 a | RD3A | 1 (IS | RD8 | 1 | RD2 | |||||||
| 3 | RD5,10 | 2 | RD5,10 | |||||||||
| 2.5 | RD8A | 3 (IS | RD12 | 2.1::0.3 d | RD2 | 3 | RD5,10 | |||||
| 1 | RD6 | 2 | RD6 | |||||||||
| 1.8 | RD9C | 2 (IS | RD2 | 1.0::15.7 d | RD6 | 1 | RD6 | |||||
| 2 | RD8 | 1 | RD8 | |||||||||
| 3.9 a | RD12A | 1 (IS | RD10 | 6.9::IS | RD8 | 2 | RD8 | |||||
| 1 | RD9 | 1 | RD9 | |||||||||
| 5 (IS | RD7 | 4.2::IS | RD9D | |||||||||
| 1 (IS | RD8 | |||||||||||
| 1 (IS | RD11 | |||||||||||
| 1 (IS | RD2 | |||||||||||
| 1 (IS | RD6 | |||||||||||
| 2 (IS | RD12 | |||||||||||
| as above, additionally: 3.5 (RD3C) | as above | as above | as above | as above | as above | |||||||
a: these two deletions are common for all investigated M. ulcerans strains
b: inserts consist of complete IS elements plus some additional flanking nucleotides
c: this insertion consists of one functional IS2404 element and an additional truncated IS2404 fragment
d: the inserted DNA stretches are not represented in the M. marinum strain M genome sequence; gene bank blast and fasta searches identified sequences of i) bacteria other than mycobacteria with about 60% nucleotide identity over up to 3100 bp for the InDel in RD7; ii) mammalia with about 68% identity over 320 bp for the InDel in RD2; and iii) mycobacteria and other environmental or pathogenic bacteria with identities between 57 and 83% over 600–1900 bp, most of them grouping into insertion sequences and phages, for the InDel in RD6.
Figure 3Linear genomic comparison of sections within RDs. Comparisons are made with three sequences each using ACT (the Artemis Comparison Tool software release 5) over at least 30 kb: M. marinum M on top, M. ulcerans Agy99 at the bottom, and M. ulcerans of either haplotype, the Asian (RD2 and RD5) or the South American (RD9 and RD10) in the middle. Regions of sequence conformity are shown in parallel light grey plains, inverted DNA segments are depicted in dark grey and inverted surfaces, and white areas represent non-homologous regions like deletions and insertions. Some sequence displacements are visualized as grey areas displaying across the panels. Cut-off value for inclusion of sequence identity was 100 bp. The principal genetic backbone of the Asian and South American haplotypes (both members of the ancestral lineage) is identical for each alignment shown, but – as a matter of how the RDs were found – the particular excluded haplotypes reveal deletions in the respective RDs. Although showing the same genetic backbone as M. marinum in the marginal parts, the Mexican strains disclose large deletions over their respective RDs and are therefore not included in this computational analysis. The sequence regions were retrieved by scanning the contigs by PCR, and by cloning and sequencing of critical segments.
Figure 4Genome comparison of . Only selected RDs that contribute to the understanding of the M. ulcerans phylogeny are shown. Dashed lines represent sequence identity; Δ = deletion; :: = substitution; shaded boxes are IS2404 and IS2606 as indicated. M. ulcerans MRCA = most recent common ancestor. Since the Mexican strains showed either deletions expanding the whole RD or did not show any significant microarray hit in other RDs, the Mexican haplotype was only included in the illustration of RD11 where it revealed informative differences. a) Alignment of the members of the African/Australian haplotype shows sequence identity in the tested crucial genome regions. b) Haplotype Australia 5142/47 is identical in all regions except for RD3C as indicated; here the breakpoints differ from deletion RD3B in the South American haplotype.
Positions of deletions used for the phylogenetic description
| RD3A | 823 | 3.702.623 – 3.703.446 | |
| RD3B | 3785 | 3.705.487 – 3.709.281 | |
| RD3C | 3452 | 3.705.557 – 3.709.018 | |
| RD8A | 2481 | 1.395.048 – 1.398.043 | MURD25 |
| RD9A | > 24067 | 4.338.150 – 4.362.217 | |
| RD9B | 30474 | 4.348.127 – 4.378.601 | |
| RD9C | 1763 | 4.348.699 – 4.350.461 | MURD94 |
| RD9D | 4230 | 4.357.791 – 4.362.021 | MURD95 |
| RD11A | 4565 | 3.108.140 – 3.112.725 | |
| RD12A | 3938 | 4.899.809 – 4.903.746 | MURD105 |
| RD12B | 27484 | 4.908.774 – 4.936.258 | |
| RD12C | 41961 | 4.915.409 – 4.957.370 |
Positions refer to the M. marinum M genome sequence [34] following Stinear et al., 2007 [5]. Discrepancies between position numbers and deletion sizes are due to nucleotide variations between the M. marinum and M. ulcerans genomes in these regions
Figure 5Evolutionary scenario for . All strains with a strain identifier added to the right depict recent isolates. Note that both the M. marinum progenitor and the M. ulcerans MRCA are hypothetical strains. Features differentiating clusters or strains are dedicated to the branches between the nodes. RDs indicated here are all differentiated by features that are also shown in Fig. 4, whereas more RDs bear supporting features between the nodes (Table 1). The lengths of the internodes do not reflect time or genetic distance.