| Literature DB >> 30115606 |
Thomas A Kohl1, Dag Harmsen2, Jörg Rothgänger3, Timothy Walker4, Roland Diel5, Stefan Niemann6.
Abstract
BACKGROUND: Global tuberculosis (TB) control is challenged by uncontrolled transmission of Mycobacterium tuberculosis complex (Mtbc) strains, esp. of multidrug (MDR) or extensively resistant (XDR) variants. Precise analysis of transmission networks is the basis to trace outbreak M/XDR clones and improve TB control. However, classical genotyping tools lack discriminatory power due to the high similarity of strains of particular successful lineages, e.g. Beijing or outbreak strains. This can be overcome by whole genome sequencing (WGS) approaches, but these are not yet standardized to facilitate larger investigations encompassing different laboratories or outbreak tracing across borders.Entities:
Keywords: Core genome MLST; Genotyping; Molecular epidemiology; Mycobacterium tuberculosis; Whole genome sequencing
Mesh:
Year: 2018 PMID: 30115606 PMCID: PMC6116475 DOI: 10.1016/j.ebiom.2018.07.030
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Fig. 1Overview of the 251 isolates included in the evaluation and 45 isolates in the training set of isolates, shown in a neighbour joining tree built with BioNumerics from 41,774 SNP positions in logarithmic scaling. The 45 strains included in the training set are marked in yellow.
Evaluation of the cgMLST scheme suggested previously (Kohl et al. 2014).
| Number of strains | Perc. mean | Perc. min | Perc. max | |
|---|---|---|---|---|
| Lineage 1 | 35 | 97.7 | 96.4 | 98.1 |
| Lineage 2 | 71 | 97.8 | 96.9 | 98.2 |
| Lineage 3 | 28 | 98.1 | 97.4 | 98.4 |
| Lineage 4 | 74 | 99.0 | 96.9 | 99.7 |
| Lineage 5 | 14 | 97.2 | 96.6 | 97.4 |
| Lineage 6 | 15 | 97.7 | 97.3 | 98.0 |
| Lineage 7 | 2 | 97.7 | 97.6 | 97.7 |
| Canettii | 2 | 94.4 | 94.2 | 94.6 |
| Animal | 10 | 97.1 | 96.6 | 97.5 |
| total | 251 | 98.1 | 94.2 | 99.7 |
Percentage of cgMLST genes fulfilling quality criteria, given as mean, minimum, and maximum value.
Including M. caprae, M. bovis, M. orygis, M. microti, and M. pinnipedii.
Evaluation of the improved cgMLST scheme constructed from the 45 isolates representing the phylogenetic diversity of the Mtbc.
| Number of strains | Perc. mean | Perc. min | Perc. max | |
|---|---|---|---|---|
| Lineage 1 | 35 | 98.4 | 97.4 | 98.7 |
| Lineage 2 | 71 | 98.6 | 98.0 | 98.8 |
| Lineage 3 | 28 | 98.9 | 98.5 | 99.1 |
| Lineage 4 | 74 | 99.5 | 98.0 | 99.8 |
| Lineage 5 | 14 | 98.2 | 98.0 | 98.4 |
| Lineage 6 | 15 | 98.3 | 98.1 | 98.5 |
| Lineage 7 | 2 | 98.3 | 98.2 | 98.3 |
| Canettii | 2 | 97.8 | 97.7 | 97.9 |
| Animal | 10 | 97.8 | 97.4 | 98.3 |
| total | 251 | 98.8 | 97.4 | 99.8 |
Percentage of cgMLST genes fulfilling quality criteria, given as mean, minimum, and maximum value.
Including M. caprae, M. bovis, M. orygis, M. microti, and M. pinnipedii.
Fig. 2a: Within host and between host observed allelic diversity across paired isolates. From left to right, paired isolates from within host (pulmonary and non-pulmonary sampled within 6 months of each other); paired pulmonary isolates sampled >6 months apart from within individual hosts; pairwise distances between patients within households; and pairwise distances between patients across 11 different community clusters, stratified by type of epidemiological link. 22 of the 38 links within the 25 household clusters also occur within community clusters (ie, known linkage) but are shown with household isolates and not with community isolates. Top horizontal dashed line indicates the threshold above which direct transmission can be judged to be unlikely; bottom horizontal dashed line indicates the threshold below which transmission should be investigated. b: All against all comparison of the number of allele differences and MIRU-VNTR locus differences for genomes for which full 24-locus MIRU-VNTR profiles were available. Zero MIRU-VNTR differences implies identical MIRU-VNTR profiles. The size of the blue circles indicates the number of comparisons with X MIRU-VNTR locus differences and Y allele differences.
Fig. 3Comparison of the SNP and cgMLST approach employed for seven clusters defined by traditional genotyping methods for the Hamburg surveillance study. Minimum spanning trees built from either identified SNP positions (A) or from the cgMLST approach (B), shown in logarithmic scale. Colors indicate genomic clusters defined by a maximum distance of 12 SNP positions or allele variants, with orange and yellow denoting identified genomic clusters, and isolates marked in grey ungrouped. Isolates labeled in red exhibit differences between both approaches in their position in the derived tree.
Evaluation of the improved cgMLST scheme with 52 isolates clustered by traditional genotyping.
| Cluster ID | Number of strains | Subgroup | Genotype (Coll 2014) | Discerning SNPs | Discerning alleles | Perc. Min | Perc. Max |
|---|---|---|---|---|---|---|---|
| Cluster 48 | 9 | LAM | 4.3.4.2 | 8 | 8 | 99.4 | 99.5 |
| Cluster 56 | 5 | Beijing | 2.2.1 | 57 | 41 | 98.6 | 98.7 |
| Cluster 61 | 5 | Euro-American | 4.1.2 | 4 | 4 | 99.7 | 99.7 |
| Cluster 80 | 26 | Haarlem | 4.1.2.1 | 322 | 206 | 99.7 | 99.7 |
| Cluster 188 | 2 | Beijing | 2.2.1 | 0 | 0 | 98.6 | 98.7 |
| Cluster 189 | 2 | LAM | 4.3.3 | 15 | 11 | 99.9 | 99.9 |
| Cluster 199 | 3 | Euro-American | 4.8 | 5 | 3 | 99.5 | 99.6 |
| Total: | 52 | – | – | 411 | 273 | 98.6 | 98.9 |
Percentage of cgMLST genes fulfilling quality criteria, given as minimum and maximum value.