| Literature DB >> 32234123 |
Benjamin Sobkowiak1,2, Louis Banda3, Themba Mzembe4, Amelia C Crampin5, Judith R Glynn4, Taane G Clark4,2.
Abstract
Understanding host and pathogen factors that influence tuberculosis (TB) transmission can inform strategies to eliminate the spread of Mycobacterium tuberculosis (Mtb). Determining transmission links between cases of TB is complicated by a long and variable latency period and undiagnosed cases, although methods are improving through the application of probabilistic modelling and whole-genome sequence analysis. Using a large dataset of 1857 whole-genome sequences and comprehensive metadata from Karonga District, Malawi, over 19 years, we reconstructed Mtb transmission networks using a two-step Bayesian approach that identified likely infector and recipient cases, whilst robustly allowing for incomplete case sampling. We investigated demographic and pathogen genomic variation associated with transmission and clustering in our networks. We found that whilst there was a significant decrease in the proportion of infectors over time, we found higher transmissibility and large transmission clusters for lineage 2 (Beijing) strains. By performing evolutionary convergence testing (phyC) and genome-wide association analysis (GWAS) on transmitting versus non-transmitting cases, we identified six loci, PPE54, accD2, PE_PGRS62, rplI, Rv3751 and Rv2077c, that were associated with transmission. This study provides a framework for reconstructing large-scale Mtb transmission networks. We have highlighted potential host and pathogen characteristics that were linked to increased transmission in a high-burden setting and identified genomic variants that, with validation, could inform further studies into transmissibility and TB eradication.Entities:
Keywords: Bayesian analysis; Mycobacterium tuberculosis; bioinformatics; molecular epidemiology; pathogen transmission; tuberculosis
Mesh:
Year: 2020 PMID: 32234123 PMCID: PMC7276699 DOI: 10.1099/mgen.0.000361
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.(a) Maximum-likelihood phylogenetic tree of the 1857 Karonga strains used in this study, showing lineages clustering into monophyletic groups. Lineage 1 is shown in blue, lineage 2 in red, lineage 3 in green and lineage 4 in orange. (b) The total number of sequenced cases and the proportion of strains collected, by lineage, over the study period 1995–2014 in 4-year categories.
Fig. 2.(a) Pairwise SNP distance by lineage between all Karonga strains used in the study. (b) Transmission cluster size by lineage, excluding non-clustered individuals. Clusters are defined as sampled strains (excluding non-sampled cases) linked either through direct transmission or with up to two transmission events inferred between individuals. P-values are a comparison against lineage 4 using the Wilcoxon rank sum test. (c) Median SNP distance in transmission clusters (sampled cases only). P-values are a comparison against lineage 4 using the Wilcoxon rank sum test. (d) Pairwise SNP distance between individuals in direct transmission events. P-values are a comparison against lineage 4 using the Wilcoxon rank sum test.
Fig. 3.An example transmission network of lineage 2 strains reconstructed with TransPhylo software. Sampled cases are illustrated by filled dots and non-sampled, inferred intermediate hosts are indicated with unfilled dots. The direction of transmission is from left to right with the transmission date on the x-axis.
Demographic characteristics associated with recent infection, defined as cases for whom the source of infection has been inferred within the last 3 years. Cases prior to 1999 are excluded. Odds ratios and P-values are calculated through logistic regression and Wald chi-Squared test, adjusted for age, sex, year and lineage
|
Recent infection/total |
% recent infection |
Odds ratio (95 % CI) |
| |
|---|---|---|---|---|
|
| ||||
|
1999–2001 |
181/367 |
49.3 |
1 | |
|
2002–2004 |
171/379 |
45.1 |
0.9 (0.6–1.2) | |
|
2005–2007 |
137/298 |
46.0 |
0.9 (0.7–1.2) | |
|
2008–2010 |
93/219 |
42.5 |
0.8 (0.5–1.1) | |
|
2011–2014 |
108/219 |
49.3 |
1.0 (0.7–1.4) |
0.52 |
|
| ||||
|
1 |
86/232 |
37.1 |
0.7 (0.5–0.9) | |
|
2 |
46/64 |
71.9 |
2.8 (1.7–5.1) | |
|
3 |
88/187 |
47.1 |
1.0 (0.7–1.4) | |
|
4 |
470/999 |
47.0 |
1 |
<0.01 |
|
| ||||
|
<20 |
33/64 |
51.6 |
1.8 (1.0–3.1) | |
|
20–29 |
204/370 |
55.1 |
2.1 (1.5–2.9) | |
|
30–39 |
236/528 |
44.7 |
1.4 (1.0–1.9) | |
|
40–49 |
132/290 |
45.5 |
1.4 (1.0–2.0) | |
|
50+ |
85/230 |
37.0 |
1 |
<0.01 |
|
| ||||
|
Female |
361/752 |
48.0 |
1 | |
|
Male |
329/730 |
45.1 |
0.9 (0.7–1.1) |
0.33 |
|
| ||||
|
Negative |
235/514 |
45.7 |
1 | |
|
Positive on ART |
64/123 |
52.0 |
1.3 (0.8–1.9) | |
|
Positive no ART |
274/584 |
46.9 |
1.1 (0.8–1.3) |
0.51 |
|
| ||||
|
Yes |
101/167 |
60.5 |
1.9 (1.4–2.6) | |
|
No |
589/1315 |
44.8 |
1 |
<0.01 |
|
| ||||
|
Smear positive |
540/1144 |
47.2 |
1 | |
|
Smear negative |
135/294 |
45.9 |
0.9 (0.7–1.2) | |
|
Extrapulmonary |
15/44 |
34.1 |
0.5 (0.3–1.0) |
0.09 |
|
| ||||
|
Completed |
485/1023 |
47.4 |
1 | |
|
Died |
144/286 |
50.3 |
1.1 (0.9–1.5) | |
|
Lost/transferred |
55/149 |
36.9 |
0.6 (0.4–0.9) |
0.02 |
|
| ||||
|
Resistant |
59/99 |
59.6 |
1.8 (1.2–2.7) | |
|
Sensitive |
609/1313 |
46.4 |
1 |
<0.01 |
|
| ||||
|
Resistant |
5/12 |
41.7 |
0.7 (0.2–2.2) | |
|
Sensitive |
664/1399 |
47.5 |
1 |
0.65 |
|
| ||||
|
Karonga |
571/1158 |
49.3 |
1 | |
|
Other Malawi |
84/228 |
36.8 |
0.6 (0.4–0.8) | |
|
Other country |
21/66 |
31.8 |
0.5 (0.3–0.8) |
<0.01 |
|
| ||||
|
Karonga |
453/906 |
50.0 |
1 | |
|
Other Malawi |
121/300 |
40.3 |
0.7 (0.5–0.9) | |
|
Other country |
102/252 |
40.5 |
0.7 (0.5–0.9) |
<0.01 |
ART, antiretroviral therapy.
Fig. 4.Cluster size by the number of cases per year (calculated as the date range of a cluster divided by the number of cases) in clusters with ≥five cases. Large clusters are defined as clusters of ≥10 cases and >1 case per year (red dashed lines), and are shown in the upper right quadrant of the figure.
Demographic characteristics associated with large clusters (cases in clusters of ≥10 cases and >1 case per year). Cases in clusters ≥10 cases but <1 case per year, and cases in clusters with the index case after 2010, are excluded. Odds ratios are calculated through logistic regression and P-values by the Wald chi-squared test, adjusted for age, sex, year and lineage
|
Large clusters/total |
% large clusters |
Odds ratio (95 % CI) |
| |
|---|---|---|---|---|
|
| ||||
|
1995–1998 |
81/304 |
26.6 |
1 | |
|
1999–2001 |
80/302 |
26.5 |
1.0 (0.7–1.4) | |
|
2002–2004 |
95/303 |
31.4 |
1.2 (0.9–1.8) | |
|
2005–2007 |
82/247 |
33.2 |
1.4 (0.9–2.0) | |
|
2008–2010 |
57/184 |
31.0 |
1.2 (0.8–1.8) | |
|
2011–2014 |
52/188 |
27.7 |
1.0 (0.7–1.5) |
0.39 |
|
| ||||
|
1 |
69/257 |
26.8 |
0.9 (0.7–1.3) | |
|
2 |
27/47 |
57.4 |
3.4 (1.9–6.3) | |
|
3 |
59/176 |
33.5 |
1.3 (0.9–1.8) | |
|
4 |
292/1048 |
27.9 |
1 |
<0.01 |
|
| ||||
|
<20 |
23/68 |
33.8 |
1.9 (1.0–3.4) | |
|
20–29 |
130/403 |
32.6 |
1.8 (1.2–2.6) | |
|
30–39 |
155/527 |
29.4 |
1.5 (1.1–2.2) | |
|
40–49 |
89/299 |
29.8 |
1.5 (1.0–2.3) | |
|
50+ |
50/231 |
21.6 |
1 |
0.05 |
|
| ||||
|
Female |
220/761 |
28.9 |
1 | |
|
Male |
227/767 |
29.6 |
1.0 (0.8–1.3) |
0.80 |
|
| ||||
|
Negative |
131/503 |
26.0 |
1 | |
|
Positive on ART |
40/108 |
37.0 |
1.6 (1.0–2.6) | |
|
Positive no ART |
187/583 |
32.1 |
1.3 (1.0–1.8) |
0.08 |
|
| ||||
|
Yes |
55/148 |
37.2 |
1.5 (1.0–2.1) | |
|
No |
392/1380 |
28.4 |
1 |
0.12 |
|
| ||||
|
Smear-positive |
254/818 |
31.1 |
1 | |
|
Smear-negative |
79/294 |
26.9 |
0.9 (0.6–1.1) | |
|
Extrapulmonary |
37/163 |
22.7 |
0.8 (0.4–1.3) |
0.38 |
|
| ||||
|
Completed |
307/1003 |
30.6 |
1 | |
|
Died |
96/315 |
30.5 |
1.0 (0.8–1.3) | |
|
Lost/transferred |
43/190 |
22.6 |
0.7 (0.5–1.0) |
0.11 |
|
| ||||
|
Resistant |
24/99 |
24.2 |
0.7 (1.0–1.2) | |
|
Sensitive |
408/1366 |
29.9 |
1 |
0.20 |
|
| ||||
|
Resistant |
1/15 |
6.7 |
0.2 (0.0–0.9) | |
|
Sensitive |
432/1451 |
29.8 |
1 |
0.09 |
|
| ||||
|
Karonga |
360/1114 |
32.3 |
1 | |
|
Other Malawi |
51/248 |
20.6 |
0.5 (0.4–0.8) | |
|
Other country |
16/89 |
18.0 |
0.5 (0.3–0.8) |
<0.01 |
|
| ||||
|
Karonga |
281/907 |
31.0 |
1 | |
|
Other Malawi |
85/315 |
27.0 |
0.8 (0.5–1.1) | |
|
Other country |
69/264 |
26.1 |
0.8 (0.6–1.1) |
0.21 |
ART, antiretroviral therapy.
Demographic characteristics associated with infector cases that have transmitted to another sampled or non-sampled case within 4 years. Cases collected after 2010 and extrapulmonary cases are excluded. Odds ratios are calculated through ordered logistic regression by number of secondary infections, and P-values by the Wald chi-squared test, adjusted for age, sex, year and lineage
|
Infector /total |
% infectors |
Odds ratio (95 % CI) |
| |
|---|---|---|---|---|
|
| ||||
|
1995–1998 |
99/337 |
29.4 |
1 | |
|
1999–2001 |
95/346 |
27.5 |
0.9 (0.6–1.2) | |
|
2002–2004 |
89/367 |
24.3 |
0.8 (0.6–1.1) | |
|
2005–2007 |
61/288 |
21.2 |
0.6 (0.4–0.9) | |
|
2008–2010 |
25/218 |
11.5 |
0.3 (0.2–0.5) |
<0.01 |
|
| ||||
|
1 |
48/243 |
19.8 |
0.8 (0.6–1.2) | |
|
2 |
20/61 |
32.8 |
1.6 (0.9–2.7) | |
|
3 |
48/185 |
25.9 |
1.4 (0.9–1.8) | |
|
4 |
253/1067 |
23.7 |
1 |
0.05 |
|
| ||||
|
<20 |
13/55 |
23.6 |
1.6 (0.7–3.2) | |
|
20–29 |
116/411 |
28.2 |
2.1 (1.4–3.2) | |
|
30–39 |
132/547 |
24.1 |
1.8 (1.2–2.7) | |
|
40–49 |
69/290 |
23.8 |
1.8 (1.2–2.8) | |
|
50+ |
39/253 |
15.4 |
1 |
0.01 |
|
| ||||
|
Female |
206/815 |
25.3 |
1 | |
|
Male |
163/741 |
22.0 |
0.8 (0.7–1.1) |
0.18 |
|
| ||||
|
Negative |
112/469 |
23.9 |
1 | |
|
Positive on ART |
14/78 |
17.9 |
1.0 (0.5–1.9) | |
|
Positive no ART |
155/639 |
24.3 |
1.0 (0.7–1.3) |
1.0 |
|
| ||||
|
Yes |
35/171 |
20.4 |
0.9 (0.6–1.3) | |
|
No |
334/1385 |
24.1 |
1 |
0.43 |
|
| ||||
|
Smear-positive |
329/1218 |
27.0 |
1 | |
|
Smear-negative |
40/338 |
11.8 |
0.4 (0.2–0.5) |
<0.01 |
|
| ||||
|
Completed |
262/1011 |
25.9 |
1 | |
|
Died |
67/337 |
19.9 |
0.6 (0.5–0.9) | |
|
Lost/transferred |
40/191 |
20.9 |
0.7 (0.4–1.0) |
<0.01 |
|
| ||||
|
Resistant |
19/108 |
17.6 |
0.6 (0.4–1.0) | |
|
Sensitive |
347/1402 |
24.8 |
1 |
0.07 |
|
| ||||
|
Resistant |
2/15 |
13.3 |
0.5 (0.1–1.8) | |
|
Sensitive |
363/1495 |
24.3 |
1 |
0.36 |
|
| ||||
|
Karonga |
263/1131 |
23.3 |
1 | |
|
Other Malawi |
59/247 |
23.9 |
1.0 (0.7–1.3) | |
|
Other country |
20/90 |
22.2 |
0.8 (0.5–1.4) |
0.78 |
|
| ||||
|
Karonga |
242/942 |
25.7 |
1 | |
|
Other Malawi |
52/299 |
17.4 |
0.6 (0.4–0.8) | |
|
Other country |
66/281 |
23.5 |
0.9 (0.6–1.2) |
0.02 |
ART, antiretroviral therapy
Genomic variants related to transmissibility identified through evolutionary convergence testing (phyC) and genome-wide association study (GWAS) mixed model approach (SNPs) and aggregated mutation mixed model (gene)
|
Locus tag |
Gene name |
Function |
|
Identification method |
SNP, gene, insertion or deletion |
|---|---|---|---|---|---|
|
Rv3343c |
|
|
0.01 |
phyC |
Δ12 bp deletion |
|
Rv3751 |
. |
Probable integrase |
8.8×10−5 |
GWAS |
S* SNP (A31A) |
|
Rv0974c |
|
Acetyl-CoA carboxylase |
2.6×10−4 |
GWAS |
NS** SNP (N125H) |
|
Rv3812 |
|
|
3.1×10−4 |
GWAS |
Gene |
|
Rv0056 |
|
50S ribosomal protein L9 |
3.9×10−4 |
GWAS |
Δ3 bp insertion |
|
Rv2077c |
. |
Possible transmembrane protein |
6.2×10−4 |
GWAS |
Δ3 bp deletion |
*, synonymous; **, non-synonymous.