| Literature DB >> 26884275 |
Guillaume Sapriel1,2,3, Julie Konjek4,5, Mickael Orgeur6, Laurent Bouri7, Lise Frézal8, Anne-Laure Roux9, Emilie Dumas10, Roland Brosch11, Christiane Bouchier12, Sylvain Brisse13, Mathias Vandenbogaert14, Jean-Michel Thiberge15, Valérie Caro16, Yun Fong Ngeow17, Joon Liang Tan18, Jean-Louis Herrmann19,20, Jean-Louis Gaillard21,22, Beate Heym23,24, Thierry Wirth25,26.
Abstract
BACKGROUND: In mycobacteria, conjugation differs from the canonical Hfr model, but is still poorly understood. Here, we quantified this evolutionary processe in a natural mycobacterial population, taking advantage of a large clinical strain collection of the emerging pathogen Mycobacterium abscessus (MAB).Entities:
Mesh:
Substances:
Year: 2016 PMID: 26884275 PMCID: PMC4756508 DOI: 10.1186/s12864-016-2448-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Densitree of 130 STs belonging to the MAB. The phylogenetic reconstruction is based on the concatenated housekeeping genes (3576 bp) and the GTR + I + G evolutionary model was implemented in BEAST 1.7.5. The best 8000 trees generated during the Markov chain are drawn transparently (in green). As a result, areas where the vast majority of the trees agree in topology and branch lengths show up as deep green, while areas with little agreement show up as webs. The root canal tree (blue lines) has a topology that is selected from the set of trees and has the highest probability of clades in the set. The scale corresponds to the genetic distance
Fig. 2Ancestry of 280 strains belonging to the MAB. a Distribution of the three subspecies (plus admixed strains) within a minimal spanning tree (MSTREE) based on the degree of allele sharing. Circles are proportional to the number of strains and are colored according to the subspecies identification obtained from the Bayesian algorithm STRUCTURE. The thickness of the line is function of the number of shared alleles. b Proportion of ancestry from the subspecies M. abscessus, M. bolletii and M. massiliense as inferred by STRUCTURE and their assignment to three rpoB groups as displayed with DISTRUCT [78]. c Genomic locations of the 7 housekeeping genes and the diagnostic genes rpoB. d Synthetic network representing the amount of gene flow between the three subspecies based on the sole STs and the STRUCTURE assignments. For each subspecies, percentage of foreign allele (form other subspecies) is indicated with 99 % confidence interval. e Proportions of admixed ST for each subspecies. Left bar: not admixed (ie. no exogenous allele). Right bar: at least one exogenous allele (ie. from another MAB subspecies). *P < 0.05; **P < 0.01 (Fisher exact two-sided test)
Population estimates of mutation rates (θ) and recombination rates (ρ) per base
|
|
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| MAB |
|
|
| MAB |
|
|
| MAB | |
|
| 0.007 | 0.006 | 0.015 | 0.018 | 0.005 | 0 | 0 | 0.001 | 0.714 | 0 | 0 | 0.056 |
|
| 0.001 | 0.012 | 0.011 | 0.012 | 0.004 | 0.016a | 0.004 | 0.008a | 4 | 1.333a | 0.364 | 0.667a |
|
| 0.007 | 0.008 | 0.006 | 0.01 | 0.002 | 0.011 | 0.001 | 0.004 | 0.286 | 1.375 | 0.167 | 0.4 |
|
| 0.004 | 0.012 | 0.014 | 0.016 | 0.005 | 0 | 0.006 | 0.004a | 1.25 | 0 | 0.429 | 0.25a |
|
| 0.009 | 0.012 | 0.008 | 0.015 | 0 | 0 | 0.001 | 0 | 0 | 0 | 0.125 | 0 |
|
| 0.009 | 0.006 | 0.008 | 0.01 | 0.016a | 0 | 0.012a | 0.006 | 1.778 | 0 | 1.5 | 0.6 |
|
| 0.011 | 0.003 | 0.011 | 0.012 | 0.007a | 0.008a | 0.005a | 0.005 | 0.636 | 2.667 | 0.455 | 0.417a |
| Average. | 0.007 | 0.008 | 0.01 | 0.013 | 0.006 | 0.005 | 0.004 | 0.004 | 0.813 | 0.593 | 0.397 | 0.29 |
| Concat.b | 0.008 | 0.008 | 0.011 | 0.013 | 0.041c | 0.009 | 0.03 c | 0.004 | 5.125a | 1.125 a | 2.727a | 0.308a |
| CI 95 % | 0.038–0.044 | 0.008–0.010 | 0.028–0.032 | 0.004–0.004 | 4.75–5.5 | 1–1.25 | 2.55–2.9 | 0.308–0.308 | ||||
The analyses are realised within lineages and for the MAB complex (pooled genotypes)
Values for rho (ρ) were obtained by dividing the per-locus recombination rate estimate from LDhat by the sequence length. μ: mutation rate per nucleotide; r: recombination rate per nucleotide. a: significant recombination estimates. Concat. bconcatenated data set. c: recombination significantly higher than intra-genic recombination rates (a and c: 95 % confidence interval)
Fig. 3Pathogenic types within an MSTREE and distribution of admixed strains according to clinical profiles. a Each strain is represented by a circle or a fraction of a circle, and colours correspond to different pathogenic types. Circled numbers indicate frequently encountered ST complexes or outbreak strains. Black lines connecting pairs of STs indicate that they share six (thick lines), five (thin) or four alleles (dotted) out of seven. Boxed strains correspond to the three MAB reference strains. Moreover, they include three additional admixed strains selected for whole-genome sequencing and comparative analyses. b Pathogenic type distribution according to the three subspecies (the frequencies did not differ significantly). c Admixture spectrum frequency according to the principal clinical profiles. Non-admixed: strain with absence of exogenous allele. Admixed: strain with at least one exogenous allele coming from another subspecies. *P < 0.05; **P < 0.01 (Fisher exact two-sided test)
Fig. 4Circular representation of MAB subspecies reference and admixed strains. a: M. abscessus, b: M. massiliense. c: M. bolletii. d: M. massiliense strain M139, e: M. abscessus strain 137, f: M. massiliense strain 23. For the outer circle, colours were reported following subspecies identification (red: M. abscessus, blue: M. massiliense. yellow: M. bolletii, black: regions with no significant subspecies attribution or not belonging to core genome). From outer to inner circle: subspecies type identification; GC% in 5-kb non-overlapping windows (yellow: higher than 64 %; green: lower than 64 %); GC skew in 5-kb non-overlapping windows (red: positive; blue: negative); sequenced genes for identification studies (MLST genes and rpoB) in grey; prophage regions in green; tRNA in magenta; and IS element in orange. Circular plots were generated by using CIRCOS [79]
Core, variable, and strain specific open reading frames (ORFs)
| Organism | ORFs | Core ORFs | Var ORFs | Strain spe. ORFs | Core ORFs (%) | Var ORFs (%) | Strain spe. ORFs (%) |
|---|---|---|---|---|---|---|---|
|
| 5477 | 4160 | 1317 | 617 | 76 | 24 | 11.3 |
|
| 5247 | 4135 | 1112 | 249 | 78.8 | 21.2 | 4.7 |
|
| 5275 | 4153 | 1122 | 520 | 78.7 | 21.3 | 9.9 |
|
| 5269 | 4145 | 1124 | 435 | 78.7 | 21.3 | 8.3 |
|
| 5267 | 4136 | 1131 | 459 | 78.5 | 21.5 | 8.7 |
|
| 5332 | 4122 | 1210 | 481 | 77.3 | 22.7 | 9 |
ORFs were identified on MAGE plateform, and core genome was defined by Bidirectional Best Blast Hit (BBH) with a 50 % sequence identity threshold, and at least 80 % coverage
Nucleotide BLAST results of admixed genomes genomic islands
| Strain Query | Genomic island Start position | Genomic island End position | Length (bp) | First nucleotide blast resulta | Total score | Query cover | E.value | Max |
|---|---|---|---|---|---|---|---|---|
|
| 1922873 | 1992300 | 69427 |
| 1.12E + 005 | 99 % | 0 | 99 % |
|
| 4564885 | 4576047 | 11162 |
| 8504 | 74 % | 0 | 80 % |
|
| 2800023 | 2810210 | 10187 |
| 3918 | 77 % | 0 | 81 % |
|
| 254418 | 263728 | 9310 |
| 1960 | 38 % | 0 | 84 % |
|
| 870688 | 892899 | 22212 |
| 21406 | 63 % | 0 | 99 % |
|
| 2385439 | 2401289 | 15851 |
| 10443 | 56 % | 0 | 89 % |
|
| 1922873 | 1992300 | 69428 |
| 1.07E + 005 | 88 % | 0 | 99 % |
|
| 2172909 | 2194917 | 22009 |
| 3410 | 41 % | 0 | 98 % |
|
| 4564885 | 4576047 | 11163 |
| 2879 | 55 % | 0 | 93 % |
aFirst nucleotide blast result with more than 80 % identity, and covering at least 30 % of the sequence of the tested Genomic Island
bpathogenic mycobacteria