Literature DB >> 25336729

Recurrence due to relapse or reinfection with Mycobacterium tuberculosis: a whole-genome sequencing approach in a large, population-based cohort with a high HIV infection prevalence and active follow-up.

José Afonso Guerra-Assunção1, Rein M G J Houben1, Amelia C Crampin2, Themba Mzembe3, Kim Mallard4, Francesc Coll4, Palwasha Khan1, Louis Banda3, Arthur Chiwaya3, Rui P A Pereira4, Ruth McNerney4, David Harris5, Julian Parkhill5, Taane G Clark6, Judith R Glynn1.   

Abstract

BACKGROUND: Recurrent tuberculosis is a major health burden and may be due to relapse with the original strain or reinfection with a new strain.
METHODS: In a population-based study in northern Malawi, patients with tuberculosis diagnosed from 1996 to 2010 were actively followed after the end of treatment. Whole-genome sequencing with approximately 100-fold coverage was performed on all available cultures. Results of IS6110 restriction fragment-length polymorphism analyses were available for cultures performed up to 2008.
RESULTS: Based on our data, a difference of ≤10 single-nucleotide polymorphisms (SNPs) was used to define relapse, and a difference of >100 SNPs was used to define reinfection. There was no evidence of mixed infections among those classified as reinfections. Of 1471 patients, 139 had laboratory-confirmed recurrences: 55 had relapse, and 20 had reinfection; for 64 type of recurrence was unclassified. Almost all relapses occurred in the first 2 years. Human immunodeficiency virus infection was associated with reinfection but not relapse. Relapses were associated with isoniazid resistance, treatment before 2007, and lineage-3 strains. We identified several gene variants associated with relapse. Lineage-2 (Beijing) was overrepresented and lineage-1 underrepresented among the reinfecting strains (P = .004).
CONCLUSIONS: While some of the factors determining recurrence depend on the patient and their treatment, differences in the Mycobacterium tuberculosis genome appear to have a role in both relapse and reinfection.
© The Author 2014. Published by Oxford University Press on behalf of the Infectious Diseases Society of America.

Entities:  

Keywords:  HIV; RFLP; recurrence; reinfection; relapse; tuberculosis; whole-genome sequence

Mesh:

Substances:

Year:  2014        PMID: 25336729      PMCID: PMC4354982          DOI: 10.1093/infdis/jiu574

Source DB:  PubMed          Journal:  J Infect Dis        ISSN: 0022-1899            Impact factor:   5.226


Recurrent tuberculosis adds considerably to the burden of tuberculosis [1]. It arises through relapse of the original infection or through reinfection with a new strain. The 2 mechanisms have different implications for the patient and the control program. High rates of relapse may indicate unsuccessful treatment (relapse usually occurs early after the end of treatment and is associated with cavitary disease and drug resistance) [2-4]. A high rate of recurrence due to reinfection suggests a failure to develop protective immunity after the first episode and is associated with human immunodeficiency virus (HIV) infection [2, 3, 5]. Distinguishing the 2 mechanisms requires molecular marker analysis of cultures from both episodes, so results are available from only a few studies. These have shown varying proportions of disease due to reinfection [6-8], from <3% [9] to >60% [10, 11]. High proportions of apparent reinfections may be due to mislabeling, to cross-contamination or mixed infections, or to differences in patient populations, length of follow-up [7], and background risk of infection with Mycobacterium tuberculosis. A high proportion of apparent relapses could indicate insufficient variability of strains in the population with the marker used. Most studies to date have used IS6110-based restriction fragment-length polymorphism (RFLP) analysis or mycobacterial interspersed repetitive unit–variable number of tandem repeat (MIRU-VNTR) typing [12]. Whole-genome sequencing gives greater discrimination than RFLP analysis or MIRU-VNTR analysis, which should improve the ability to distinguish relapse from reinfection. The one study to date that evaluated whole-genome sequencing for this purpose compared whole-genome sequencing data to MIRU-VNTR data, resulting in reclassification of several strains and allowing detection of mixed infections [12]. Whole-genome sequencing also allows genetic determinants of recurrence to be examined. In a long-term population-based study of tuberculosis in northern Malawi with active follow-up beyond the end of treatment, we previously analyzed RFLP data from 1996 to 2005 and found that HIV infection increased the risk of recurrence due to reinfection but not the risk due to relapse [2]. Here, we have now used whole-genome sequences of all available strains obtained from patients up to 2010, and assessed the effect of HIV, M. tuberculosis lineage [13], and other factors on the rate of recurrence due to reinfection or relapse.

METHODS

Setting and Patients

The Karonga Prevention Study has been studying tuberculosis in the whole of Karonga district, northern Malawi, since 1986 [14, 15]. The area is largely rural, with a population of around 300 000 and a prevalence of HIV infection among adults of about 10%. Individuals with tuberculosis are identified through enhanced passive surveillance, with project staff based at health centers to identify those with chronic cough. Sputum and other specimens (eg, lymph node aspirates) are processed in the project laboratory. Cultures that appear to be positive for M. tuberculosis are sent to the United Kingdom for species confirmation and drug resistance testing, initially for resistance to isoniazid and rifampicin and, if resistant to these, then for resistance to further drugs. At least 3 sputum samples are obtained at the time of diagnosis, with further samples collected at 2 months and at the end of treatment. Ethics approval for the study was given by the ethics committee of the London School of Hygiene and Tropical Medicine and the Malawian National Health Science Research Committee. Informed consent was sought from each participant. The incidence of smear-positive tuberculosis in adults in this district peaked at 125 cases/100 000 in the mid 1990s and then fell to 80–90 cases/100 000 [15]. The annual risk of infection with M. tuberculosis in the 1980s and 1990s was estimated to be 1% per year [14]. Since 1988, HIV testing has been offered to all patients with tuberculosis, following counseling and consent; nearly two thirds are HIV positive. Free antiretroviral therapy (ART) has been available in the district since 2005 to individuals with World Health Organization stage 3 or 4 disease (which includes tuberculosis) or with CD4+ T-cell counts of <250 cells/mL. Patients are treated according to Malawi government guidelines. Beginning in 1996, new smear-positive patients received a 2-month course of treatment with streptomycin, isoniazid, rifampicin, and pyrazinamide (2SHRZ) and a 6-month regimen of ethambutol and isoniazid (6HE) therapy. Ethambutol replaced streptomycin beginning in 2001. New smear-negative patients were treated with 1SEH/11HE until 2001, after which treatment was the same as that for smear-positive patients. Since January 2007, a 6-month regimen involving rifampicin throughout has been used for all new patients (2HRZE/4HR). Beginning in 1996, individuals requiring retreatment received a regimen containing rifampicin throughout [15]. Individuals were defined as having laboratory-confirmed tuberculosis if they had positive smears (excluding those with single scanty smears) or M. tuberculosis–positive cultures or histological evidence of tuberculosis. Outcomes beyond the end of treatment were ascertained by linkage to our other studies in the district and by home visits in 2005 to those whose vital status was unknown [16]. Since 2007 all patients have been visited annually, with sputum specimens collected from those who are symptomatic. Recurrences were defined as further episodes of diagnosed tuberculosis after the end of treatment, whether or not there was microbiological evidence of cure of the previous episode. Recurrences could be laboratory confirmed or not.

Molecular Methods

From 1996 to 2008, IS6110 RFLP analysis was done on all cultures [17, 18]. Whole-genome sequencing was carried out on all available cultures from 1996 to 2010 (n = 1933) at the Sanger Institute, using the Illumina HiSeq 2000 system, with paired-end reads of 100 base pairs. For each of the samples, Trimmomatic software (http://www.usadellab.org/cms/?page=trimmomatic) was used to remove low-quality sequences and low-quality 3′ ends of reads, retaining only reads at least 50 base pairs long, with nucleotides above quality score Q27 (equivalent to a risk of error of <0.2% per read per base pair). The trimmed sequence reads for each sample were mapped against the H37Rv reference genome (GenBank accession no. AL123456.3) and other publicly available mycobacterial genomes, using the BWA-mem algorithm (http://bio-bwa.sourceforge.net/). Samples (n = 79) were excluded if the best match was a non-tuberculous mycobacterium or if their average genomic coverage was <10-fold. The average coverage of the remaining samples was high (median, 88-fold [range, 12–608-fold]; mean, 127-fold). Single-nucleotide polymorphism (SNP) positions were identified using SAMtools (http://samtools.sourceforge.net/). Sample genotypes were called using the majority allele (minimum frequency, 75%) in positions supported by at least 20-fold total coverage; they were otherwise classified as missing. This strategy ignores heterozygous calls. Samples with >15% missing genotype calls were excluded, thereby removing those that may be contaminated, those containing mixed isolates, or those involving technical errors. Similarly, we excluded SNPs with >15% missing genotypes and those in highly repetitive and variable regions (eg, PE/PPE genes). In the final analysis, 94% of the M. tuberculosis genome was analyzed for variants. The final data set consists of 1718 samples (89% of the original cultures) and 40 046 SNPs. Raw sequence data are available from the European Nucleotide Archive (accession numbers ERP000436 and ERP001072). Spoligotyping was performed in silico using SpolPred [19]. Lineages were defined from spoligotype families [20]. For samples with RFLP results but no whole-genome sequence data available, lineage was deduced from spoligotypes of these specimens or others with the same RFLP within the data set, as previously described [21].

Mixed Infections

To explore whether any of the apparent reinfections were explained by the presence of a mixed infection, the number of heterozygous positions was calculated. A position was classified as heterozygous if >1 allele accounted for ≥30% of the reads (and there were >30 reads).

Genomic Associations

To explore genomic associations with relapse, the genomes from the index episode of all individuals with molecularly confirmed relapses were compared with those from patients who were actively followed up and had no evidence of recurrence. Our in-house implementation of the PhyC algorithm [22] was used to explore associations showing evidence of positive selection, using a phylogenetic tree produced by RAxML [23]. Genome-wide association analysis for SNPs and annotated protein coding genes was performed using R statistical software, using logistic regression and including lineage as a covariate. For each gene, the number of mutations was aggregated by individual. A Bonferroni cutoff based on the number of independent genes with SNPs (3689) suggested a multiple testing–adjusted significance threshold of 0.000014.

Epidemiological Analysis

Rates of laboratory-confirmed recurrence were calculated from the end of treatment to the end of June 2011. The time at risk for recurrence was censored at the date of death, leaving the district, or the first recurrence of tuberculosis, whether confirmed or not (since those receiving treatment are not at risk). HIV status was treated as a time-varying covariate for those who were HIV negative or had an unknown HIV status during the first episode. The date of seroconversion was taken as halfway between the last negative test result (or the end of the first episode if the HIV status during that episode was unknown) and the first positive test result (or the date of ART initiation if this was the first evidence of HIV positivity). Calendar time and age were also treated as time-varying covariates. Survival methods and Nelson-Aalen plots of cumulative rates were used. Rate ratios were calculated using Poisson regression, stratifying by time since the end of treatment. STATA 13 was used for all statistical analyses. Risk factors for relapse and reinfection were examined separately.

RESULTS

Differentiating Relapse and Reinfection

After quality control, 1718 whole-genome sequences were available for analysis. These included some individuals with >1 sample from the same or different disease episode. For each of these individuals, the first and last sample were selected if there were >2 samples available. This left 92 individuals with paired samples, including 66 with samples from different episodes of disease. The number of SNPs between each of the pairs is shown in Figure 1. There is a clear gap, with pairs having a difference of ≤8 SNPs or >100 SNPs. RFLPs were available for both members of 42 pairs. All pairs (7 of 7) with a difference of >100 SNPs had different RFLPs. All pairs (35 of 35) with ≤8 SNPs different had identical RFLPs (n = 28) or RFLPs that differed by only 1 band (n = 7).
Figure 1.

Number of single-nucleotide polymorphisms between pairs of samples from individuals from the same episode of tuberculosis or from an index episode and a recurrent episode.

Number of single-nucleotide polymorphisms between pairs of samples from individuals from the same episode of tuberculosis or from an index episode and a recurrent episode. For the main analysis, only pairs from different episodes were used. Recurrence was defined as being due to relapse if there was a difference of ≤10 SNPs and as due to reinfection if there was a difference of >100 SNPs. For individuals with recurrent episodes with RFLP but no whole-genome sequencing data available, the RFLP data were used to define relapse and reinfection. Relapses had identical patterns or differed by 1 band. Reinfections had completely different patterns. In the whole data set, the number of heterozygous positions was low for most samples, with a few having large numbers of heterozygotes. From inspection of the plot of the number of heterozygotes per sample (Supplementary Figure 1), a cutoff of 140, the inflection point in the empirical distribution of the data, was chosen to identify possible mixtures. None of the reinfections had evidence of mixed infections in either episode of disease. Two of the patients with subsequent relapse had evidence of a mixed infection in the first episode (679 and 945 heterozygous positions). All further analysis was restricted to individuals who received a diagnosis of laboratory-confirmed tuberculosis between January 1996 and February 2010, who were normally resident in Karonga district, and who had completed treatment. There were 1535 such individuals (Figure 2). Information beyond the end of treatment was available for 1471 (96%). Of these, 139 had laboratory-confirmed recurrences, and 64 had unconfirmed recurrences. Of the confirmed recurrences, 55 were due to relapse (defined on the basis of SNP data in 46 cases and RFLP data in 9 cases), and 20 were due to reinfection (defined on the basis of SNP data in 14 cases and RFLP data in 6 cases); 64 were unclassified (no paired samples were available). The 64 unclassified recurrences were similar to those that could be classified in terms of age, sex, HIV status, and year.
Figure 2.

Selection of patients. Abbreviations: RFLP, restriction fragment-length polymorphism data; WGS, whole-genome sequence data.

Selection of patients. Abbreviations: RFLP, restriction fragment-length polymorphism data; WGS, whole-genome sequence data.

Recurrence

The rate of recurrence decreased by time since the end of the index episode. There were 71 recurrences in the first year, for a rate of 5.4 recurrences/100 person-years at risk (95% confidence interval [CI], 4.3–6.8); 27 in the second year, for a rate of 2.7/100 person-years at risk (95% CI, 1.8–3.9); and 41 after the second year, for a rate of 1.0/100 person-years at risk (95% CI, .76–1.4). The factors associated with recurrence within the first 2 years and beyond are shown in Table 1, and the development of recurrences, relapses, and reinfections over time are shown in Figure 3. Almost all known relapses (51 of 55) occurred in the first 2 years, whereas reinfections occurred at a more constant rate over time.
Table 1.

Rate of Tuberculosis Recurrence, by Patient and Index Tuberculosis Episode Characteristics

CharacteristicRate of Recurrence per 100 Person-Years at Risk
During the First 2 Years
After the First 2 Years
All Recurrences (Relapse, Reinfection)Person-Years at RiskRate (95% CI)All Recurrences (Relapse, Reinfection)Person-Years at RiskRate (95% CI)
Overall98 (51, 8)23114.2 (3.5–5.2)41 (4, 12)39951.0 (.76–1.4)
Age, y
 <3022 (13, 4)6713.3 (2.2–5.0)7 (2, 0)7580.92 (.44–1.9)
 30–3935 (17, 2)7704.5 (3.3–6.3)16 (0, 7)13141.2 (.75–2.0)
 40–4919 (9, 0)4174.6 (2.9–7.1)7 (1, 1)8030.80 (.38–1.7)
 ≥5022 (12, 2)4524.9 (3.2–7.4)11 (1, 4)10511.0 (.58–1.9)
Sex
 Female43 (25, 6)12463.5 (2.6–4.7)28 (1, 7)21161.1 (.72–1.6)
 Male55 (26, 2)10655.2 (4.0–6.7)18 (3, 5)18790.96 (.60–1.5)
HIV status
 Negative35 (21, 0)9223.8 (2.7–5.3)9 (1, 1)19780.46 (.24–.87)
 Positive59 (29, 7)10875.4 (4.2–7.0)31 (3, 11)14122.2 (1.5–3.1)
 Unknown4 (1, 1)3021.3 (.50–3.5)1 (0, 0)6050.17 (.02–1.2)
Tuberculosis type
 Smear-positive pulmonary82 (47, 4)18944.3 (3.5–5.4)35 (3, 10)32441.1 (.77–1.5)
 Smear-negative pulmonary13 (3, 3)2694.8 (2.8–8.3)4 (1, 1)4400.91 (.34–2.4)
 Extrapulmonary3 (1, 1)1482.0 (.65–6.3)2 (0, 1)3100.65 (.16–2.6)
Previous tuberculosis
 No90 (49, 7)21674.2 (3.4–5.1)38 (4, 11)37591.0 (.74–1.4)
 Yes8 (2, 1)1445.5 (2.8–11.1)3 (0, 1)2351.3 (.41–4.0)
M. tuberculosis lineage
 111 (6, 1)2704.1 (2.3–7.4)8 (0, 3)4681.7 (.85–3.4)
 21 (1, 0)701.4 (.20–10.1)1 (0, 1)1560.64 (.09–4.5)
 322 (16, 0)16713.2 (8.7–20.0)1 (0, 0)2120.47 (.07–3.4)
 441 (27, 6)11923.4 (2.5–4.7)25 (4, 8)21581.2 (.78–1.7)
 Unknown23 (1, 1)6143.7 (2.5–5.6)6 (0, 0)10000.60 (.27–1.3)
Isoniazid resistance
 No76 (44, 8)18434.1 (3.3–5.2)33 (3, 9)33441.0 (.70–1.4)
 Yes9 (6, 0)8111.1 (5.8–21.4)4 (1, 3)1093.7 (1.4–9.8)
 Unknown13 (1, 0)3873.4 (2.0–5.8)4 (0, 0)5420.74 (.28–2.0)
Year of follow-up
 1996–200025 (11, 6)5894.2 (2.9–6.3)3 (0, 1)1512.0 (.64–6.2)
 2001–200426 (19, 2)6294.1 (2.8–6.1)11 (2, 4)10421.1 (.58–1.9)
 2005 and later47 (21, 0)10934.3 (3.2–5.7)27 (2, 7)28020.96 (.66–1.4)
Year of treatment initiation
 Before 200780 (47, 8)17864.5 (3.6–5.6)39 (4, 12)38621.0 (.74–1.4)
 2007 and later18 (4, 0)5253.4 (2.2–5.4)2 (0, 0)1321.5 (.38–6.0)

Abbreviations: CI, confidence interval; HIV, human immunodeficiency virus; M. tuberculosis, Mycobacterium tuberculosis.

Figure 3.

Cumulative hazard of confirmed recurrent tuberculosis overall (A) and by human immunodeficiency virus (HIV) status (B). C and D, Cumulative hazard for recurrences due to relapse and reinfection, by HIV status.

Rate of Tuberculosis Recurrence, by Patient and Index Tuberculosis Episode Characteristics Abbreviations: CI, confidence interval; HIV, human immunodeficiency virus; M. tuberculosis, Mycobacterium tuberculosis. Cumulative hazard of confirmed recurrent tuberculosis overall (A) and by human immunodeficiency virus (HIV) status (B). C and D, Cumulative hazard for recurrences due to relapse and reinfection, by HIV status.

Relapse

Because of the very low number and rate of (confirmed) relapses after 2 years, risk factors for relapse were investigated within the first 2 years (Table 2). Relapses were more common soon after the end of treatment. They were not associated with age, sex, or HIV status. Fifty-five patients had strains that were isoniazid resistant in the index episode (including 5 that were also rifampicin resistant). Patients with isoniazid-resistant strains had 3 times the rate of relapse of those who had isoniazid-susceptible strains. Those who started treatment after 1 January 2007, who would have received rifampicin throughout their treatment, had much lower rates of relapse (rate ratio [RR], 0.29 [95% CI, .10–.80]). Those with lineage-3 strains had much higher relapse rates than those with strains from other lineages (RR, 4.2 [95% CI, 2.3–7.8], compared with lineage-4). All of these associations persisted after adjustment for each other, and additional adjustment for the other factors made no difference to the results (Table 2).
Table 2.

Relative Rates for Relapse (Within 2 Years) and Reinfection (At Any Time)

CharacteristicRate Ratio for Relapse Within 2 Years (95% CI)
P ValuebRate Ratio for Reinfection Disease (95% CI)
P Valueb
UnivariableMultivariableaUnivariableMultivariablea
Age, y, at start
 <3011
 30–391.1 (.55–2.3)1.5 (.48–5.0)
 40–491.1 (.48–2.6)0.28 (.03–2.5)
 ≥501.4 (.63–3.0)1.4 (.40–5.1)
Sex
 Female11
 Male1.2 (.70–2.1)0.61 (.25–1.5)
HIV status
 Negative111<.001c
 Positive1.2 (.67–2.1)20.9 (2.8–156)21.0 (2.8–157)
 Unknown0.15 (.02–1.1)3.2 (.20–51.1)2.5 (.15–40.2)
Tuberculosis type
 Smear-positive pulmonary11
 Smear-negative pulmonary0.45 (.14–1.4)2.1 (.68–6.3)
 Extrapulmonary0.27 (.04–2.0)1.6 (.36–6.3)
Previous tuberculosis
 No11
 Yes0.61 (.15–2.5)1.7 (.40–7.5)
M. tuberculosis lineage
 10.99 (.41–2.4)1.0 (.42–2.4).004c1.3 (.43–3.9)
 20.63 (.09–4.6)0.65 (.09–4.8)1.1 (.14–8.0)
 34.2 (2.3–7.8)4.2 (2.2–7.8)
 4111
 Unknown0.07 (.01–.53)0.09 (.009–.87)0.15 (.02–1.1)
Isoniazid resistance
 No11.05c11.06c
 Yes3.1 (1.3–7.3)2.6 (1.1–6.0)4.8 (1.4–16.5)4.1 (1.2–14.0)
 Unknown0.11 (.01–.79)0.91 (.09–9.1)
Calendar year
 1996–2000111.001
 2001–20041.6 (.77–3.4)0.38 (.13–1.1)0.20 (.06–.66)
 2005 and later1.0 (.50–2.1)0.19 (.07–0.54)0.10 (.03–.32)
Follow-up
 First year11.00211.03
 Second year0.40 (.21–.77)0.39 (.20–.74)3.9 (.79–19.4)4.7 (.94–23.3)
 Third year and later2.0 (.44–8.8)5.9 (1.2–29.7)
Year of treatment initiation
 Before 20071.006
 2007 and later0.29 (.10–.80)0.29 (.10–.83)

Abbreviations: CI, confidence interval; HIV, human immunodeficiency virus; M. tuberculosis, Mycobacterium tuberculosis.

a Adjusted for the other factors shown in the multivariable model.

b From the likelihood ratio χ2 test.

c From the model excluding unknowns for this variable.

Relative Rates for Relapse (Within 2 Years) and Reinfection (At Any Time) Abbreviations: CI, confidence interval; HIV, human immunodeficiency virus; M. tuberculosis, Mycobacterium tuberculosis. a Adjusted for the other factors shown in the multivariable model. b From the likelihood ratio χ2 test. c From the model excluding unknowns for this variable. The genomes of isolates from the index tuberculosis episodes of 52 patients with relapse were compared with those from 843 patients with active follow-up and no recurrence of tuberculosis. An analysis of SNPs showing evidence of convergent evolution and positive selection based on the PhyC algorithm found 9 SNPs (from 8 genes) that were significant at the 5% level (Supplementary Table 1). Six SNPs overlapped protein-coding genes and caused nonsynonymous changes. These include the known isoniazid-resistance associated gene, katG. Genome-wide association analyses showed no significant genic results, after correction for multiple testing. Supplementary Table 2 lists all genes identified at a less stringent threshold (P < .05); this includes katG.

Reinfection

The main factor associated with reinfection was HIV status (Table 2). Among HIV-negative individuals, only 1 of 44 recurrences was due to reinfection, 22 were relapses, and 21 could not be classified. Among HIV-positive individuals, 18 of 90 recurrences were due to reinfection, 32 were relapses, and 40 could not be unclassified. The one HIV-negative patient with reinfection was confirmed as being HIV negative at the time of the second episode. Reinfections were less common in more recent calendar years and were less common in the first year after the end of treatment than they were subsequently. These associations became slightly stronger after adjustment for each other, with no further change after adjustment for the other available factors. Isoniazid resistance in the first episode was also weakly associated with recurrence due to reinfection (Table 2). Of the 20 strains causing reinfection, 1 (5%) was lineage-1, 5 (25%) were lineage-2, 2 (10%) were lineage-3, and 12 (60%) were lineage-4. This compares with a distribution of lineages of 16%, 4%, 11%, and 68%, respectively, for the 4 lineages for the index episode (P = .004, by the Fisher exact test). The proportion of index episodes due to lineages 1 and 2 changed little over the study, so this did not explain the difference. Of the 19 patients with reinfection for whom the lineage was known in both episodes, 10 (53%) had different lineages in the 2 episodes (compared with 48% expected by chance, given the lineage distribution in the population). For the 14 patients with reinfection defined on the basis of SNP data, the number of differing SNPs was 100–200 for 4 patients, 600–800 for 2 patients, and >1000 for 8 patients. All those with <1000 differing SNPs had lineage-4 strains in both episodes. All those with a difference of >1000 SNPs had different lineages in the 2 episodes. The one reinfection in an HIV-negative patient was with a lineage-2 strain, following lineage-1 in the initial episode.

DISCUSSION

Whole-genome sequencing of strains from initial and recurrent episodes of disease showed a clear distinction between those with few (0–8) different SNPs and those with many (>100) differences. That this was an appropriate cutoff for distinguishing relapse and reinfection is supported by the distribution of SNP differences within the same episode (Figure 1), by the RFLP results, and by the strong correlation of relapse and reinfection so defined with time since the end of treatment and with HIV status. The cutoff is slightly higher than suggested in some other studies of SNP differences within the same episode or during transmission between patients [24-26], but there is considerable uncertainty about the appropriate cutoff, and the value will vary depending on the pipeline used and the genome coverage achieved [26]. The correlation between SNP and RFLP results suggests that an RFLP that is identical or differs by 1 band is a good approximation for defining relapse when no sequencing data are available. In this study, none of the reinfections were explained by mixed infection. This is in contrast to findings from Cape Town, where the annual risk of infection with M. tuberculosis is much higher [12, 27]. We have previously found a low proportion of mixed infections in this population [28]. The timing and risk factors for relapse and reinfection in this population are strikingly different. Relapse was most common in the first year and rare after the second year. Recurrences due to reinfection were less common in the first year, but then the rate was approximately constant. HIV infection greatly increased the rate of recurrence due to reinfection but was not associated with relapse. Calendar year was also associated with recurrence due to reinfection, with higher rates before 2001, when the incidence of new tuberculosis cases was also higher [15]. Relapse was associated with isoniazid resistance (as expected [29]), treatment in earlier years (before rifampicin was given in the continuation phase, consistent with the reported effect of prolonged rifampicin on recurrence [30]), and lineage. There is increasing interest in the extent to which genomic variation in M. tuberculosis may explain the variations seen in disease occurrence, patterns, and outcomes. Several studies have shown an increased rate of relapse for lineage-2 (Beijing) strains [31-33]. We did not find this, but there were only 43 patients with lineage-2 strains in the index episode, so study power was limited. We found greatly increased relapse rates for those with lineage-3 strains (East African Indian/CAS), independent of other factors. This has not been described before. In a recent study from Vietnam [33], relapse rates were compared between the Beijing strain and all other strains; lineage-3 made up about half of the comparison group. Even if all of the relapses had occurred in patients with this lineage, the rate would have been lower than the rate for those with Beijing strains. Using the PhyC algorithm and genome-wide association analysis, we found no genomic variants strongly associated with relapse. The expected finding of associations with katG, which is known to be involved in isoniazid resistance, supports the validity of the methods and suggests that the other identified genes should be investigated further for clues for mechanisms for relapse. Although this is one of the largest studies to compare paired samples to differentiate relapse and reinfection, power is limited for genetic comparisons of this type. The lack of paired samples for 46% of the recurrences (64 of 139) will have led to underestimation of relapse and reinfection rates but is unlikely to have biased the results of associations with relapse and reinfection, as individuals without paired samples were similar to those with paired samples. In the only previous study to use whole-genome sequencing to look at reinfection, it was suggested, based on 3 reinfections, that reinfections with strains differing by >1000 SNPs (representing different lineages) might be more likely [12]. In this analysis, 9 of 19 patients with reinfections with the lineage known for both episodes had reinfections with the same lineage, which does not support this hypothesis. However, there was some evidence that the lineage of the reinfecting strain might be important. Among episodes of recurrence due to reinfection, we found that lineage-2 strains were overrepresented and that lineage-1 (Indo-Oceanic) strains were underrepresented. This strengthens our earlier observations in this population [2] and is consistent with suggested higher virulence for lineage-2 strains [34] and lower virulence of lineage-1 strains [21]. Other studies of reinfection have not reported the results by lineage. The use of whole-genome sequencing surmounts some of the potential problems with older techniques, namely, excluding mixed infections and knowing how much change to allow when comparing specimens from the different episodes of disease. It validated our earlier results with RFLP analysis [2] and allows the genomic associations with relapse and reinfection to be examined directly. In this large population-based study with active follow-up, relapses were rare after the first 2 years and were associated with drug resistance and lineage-3 genotypes. Reinfections occurred at a more constant rate, were largely restricted to HIV-positive individuals, and were not more likely to be due to different lineages from the index episode but were due to lineage-2 genotypes more often than would be expected by chance.

Supplementary Data

Supplementary materials are available at The Journal of Infectious Diseases online (http://jid.oxfordjournals.org). Supplementary materials consist of data provided by the author that are published to benefit the reader. The posted materials are not copyedited. The contents of all supplementary data are the sole responsibility of the authors. Questions or messages regarding errors should be addressed to the author.
  34 in total

Review 1.  Recurrence in tuberculosis: relapse or reinfection?

Authors:  Marie-Laurence Lambert; Epco Hasker; Armand Van Deun; Dominique Roberfroid; Marleen Boelaert; Patrick Van der Stuyft
Journal:  Lancet Infect Dis       Date:  2003-05       Impact factor: 25.071

2.  Influence of initial drug resistance on the response to short-course chemotherapy of pulmonary tuberculosis.

Authors:  D A Mitchison; A J Nunn
Journal:  Am Rev Respir Dis       Date:  1986-03

3.  HIV-1 and recurrence, relapse, and reinfection of tuberculosis after cure: a cohort study in South African mineworkers.

Authors:  P Sonnenberg; J Murray; J R Glynn; S Shearer; B Kambashi; P Godfrey-Faussett
Journal:  Lancet       Date:  2001-11-17       Impact factor: 79.321

4.  Effects of human immunodeficiency virus infection on recurrence of tuberculosis after rifampin-based treatment: an analytical review.

Authors:  Eline L Korenromp; Fabio Scano; Brian G Williams; Christopher Dye; Paul Nunn
Journal:  Clin Infect Dis       Date:  2003-06-23       Impact factor: 9.079

5.  Rifapentine and isoniazid once a week versus rifampicin and isoniazid twice a week for treatment of drug-susceptible pulmonary tuberculosis in HIV-negative patients: a randomised clinical trial.

Authors:  Debra Benator; Mondira Bhattacharya; Lorna Bozeman; William Burman; Antonino Cantazaro; Richard Chaisson; Fred Gordin; C Robert Horsburgh; James Horton; Awal Khan; Christopher Lahart; Beverly Metchock; Constance Pachucki; Llewellyn Stanton; Andrew Vernon; M Elsa Villarino; Yong Chen Wang; Marc Weiner; Stephen Weis
Journal:  Lancet       Date:  2002-08-17       Impact factor: 79.321

6.  DNA fingerprint changes in tuberculosis: reinfection, evolution, or laboratory error?

Authors:  Judith R Glynn; Malcolm D Yates; Amelia C Crampin; Bagrey M Ngwira; Frank D Mwaungulu; Gillian F Black; Steven D Chaguluka; Donex T Mwafulirwa; Sian Floyd; Caroline Murphy; Francis A Drobniewski; Paul E M Fine
Journal:  J Infect Dis       Date:  2004-08-03       Impact factor: 5.226

7.  The temporal dynamics of relapse and reinfection tuberculosis after successful treatment: a retrospective cohort study.

Authors:  Florian M Marx; Rory Dunbar; Donald A Enarson; Brian G Williams; Robin M Warren; Gian D van der Spuy; Paul D van Helden; Nulda Beyers
Journal:  Clin Infect Dis       Date:  2014-03-18       Impact factor: 9.079

8.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

9.  Inferring patient to patient transmission of Mycobacterium tuberculosis from whole genome sequencing data.

Authors:  Josephine M Bryant; Anita C Schürch; Henk van Deutekom; Simon R Harris; Jessica L de Beer; Victor de Jager; Kristin Kremer; Sacha A F T van Hijum; Roland J Siezen; Martien Borgdorff; Stephen D Bentley; Julian Parkhill; Dick van Soolingen
Journal:  BMC Infect Dis       Date:  2013-02-27       Impact factor: 3.090

10.  Mycobacterium tuberculosis Beijing genotype and risk for treatment failure and relapse, Vietnam.

Authors:  Nguyen Thi Ngoc Lan; Hoang Thi Kim Lien; Le B Tung; Martien W Borgdorff; Kristin Kremer; Dick van Soolingen
Journal:  Emerg Infect Dis       Date:  2003-12       Impact factor: 6.883

View more
  67 in total

Review 1.  Metabolic Perspectives on Persistence.

Authors:  Travis E Hartman; Zhe Wang; Robert S Jansen; Susana Gardete; Kyu Y Rhee
Journal:  Microbiol Spectr       Date:  2017-01

2.  Tuberculosis along the continuum of HIV care in a cohort of adolescents living with HIV in Ethiopia.

Authors:  D Jerene; W Abebe; K Taye; P G Suarez; Y Feleke; I Hallström; A J Ruff
Journal:  Int J Tuberc Lung Dis       Date:  2017-01-01       Impact factor: 2.373

3.  Direct DNA Extraction from Mycobacterium tuberculosis Frozen Stocks as a Reculture-Independent Approach to Whole-Genome Sequencing.

Authors:  K Bjorn-Mortensen; J Zallet; T Lillebaek; A B Andersen; S Niemann; E M Rasmussen; T A Kohl
Journal:  J Clin Microbiol       Date:  2015-05-27       Impact factor: 5.948

4.  Beyond the SNP Threshold: Identifying Outbreak Clusters Using Inferred Transmissions.

Authors:  James Stimson; Jennifer Gardy; Barun Mathema; Valeriu Crudu; Ted Cohen; Caroline Colijn
Journal:  Mol Biol Evol       Date:  2019-03-01       Impact factor: 16.240

5.  Phase variation in Mycobacterium tuberculosis glpK produces transiently heritable drug tolerance.

Authors:  Hassan Safi; Pooja Gopal; Subramanya Lingaraju; Shuyi Ma; Carly Levine; Veronique Dartois; Michelle Yee; Liping Li; Landry Blanc; Hsin-Pin Ho Liang; Seema Husain; Mainul Hoque; Patricia Soteropoulos; Tige Rustad; David R Sherman; Thomas Dick; David Alland
Journal:  Proc Natl Acad Sci U S A       Date:  2019-09-05       Impact factor: 11.205

Review 6.  Advances in the understanding of Mycobacterium tuberculosis transmission in HIV-endemic settings.

Authors:  Julian S Peters; Jason R Andrews; Mark Hatherill; Sabine Hermans; Leonardo Martinez; Erwin Schurr; Yuri van der Heijden; Robin Wood; Roxana Rustomjee; Bavesh D Kana
Journal:  Lancet Infect Dis       Date:  2018-12-13       Impact factor: 25.071

Review 7.  Deciphering Within-Host Microevolution of Mycobacterium tuberculosis through Whole-Genome Sequencing: the Phenotypic Impact and Way Forward.

Authors:  A Van Rie; R M Warren; S D Ley; M de Vos
Journal:  Microbiol Mol Biol Rev       Date:  2019-03-27       Impact factor: 11.056

8.  23-valent pneumococcal polysaccharide vaccine elicits hierarchical antibody and cellular responses in healthy and tuberculosis-cured elderly, and HIV-1-infected subjects.

Authors:  Huichang Huang; Xiaohua Qian; Rong Pan; Ling Shen; Shanshan Liang; Feifei Wang; Peng Zhang; Hongbo Shen; Zheng W Chen
Journal:  Clin Immunol       Date:  2018-05-09       Impact factor: 3.969

9.  Older age at first tuberculosis diagnosis is associated with tuberculosis recurrence in HIV-negative persons.

Authors:  Y F van der Heijden; F Karim; T Chinappa; G Mufamadi; L Zako; B E Shepherd; F Maruri; M-Y S Moosa; T R Sterling; A S Pym
Journal:  Int J Tuberc Lung Dis       Date:  2018-08-01       Impact factor: 2.373

10.  Predictors and short-term outcomes of recurrent pulmonary tuberculosis, Uganda: a cohort study.

Authors:  Nelson Kalema; Christina Lindan; Dave Glidden; Samuel D Yoo; Achilles Katamba; Andama Alfred; Winceslaus Katagira; Patrick Byanyima; Emmanuel Musisi; Sylvia Kaswabuli; Sanyu Ingvar; Josephine Zawedde; Christina Yoon; Irene Ayakaka; J Lucian Davis; Laurence Huang; William Worodria; Adithya Cattamanchi
Journal:  S Afr Respir J       Date:  2017
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.