Literature DB >> 30817803

Overcoming the pitfalls of automatic interpretation of whole genome sequencing data by online tools for the prediction of pyrazinamide resistance in Mycobacterium tuberculosis.

Tomotada Iwamoto1, Yoshiro Murase2, Shiomi Yoshida3, Akio Aono2, Makoto Kuroda4, Tsuyoshi Sekizuka4, Akifumi Yamashita4, Kengo Kato4, Takemasa Takii5, Kentaro Arikawa1, Seiya Kato6, Satoshi Mitarai2,7.   

Abstract

OBJECTIVES: Automated online software tools that analyse whole genome sequencing (WGS) data without the need for bioinformatics expertise can motivate the implementation of WGS-based molecular drug susceptibility testing (DST) in routine diagnostic settings for tuberculosis (TB). Pyrazinamide (PZA) is a key drug for current and future TB treatment regimens; however, it was reported that predictive power for PZA resistance by the available tools is low. Therefore, this low predictive power may make users hesitant to use the tools. This study aimed to elucidate why and to uncover the real performance of the tools when taking into account their variation calling lists (manual inspection), not just their automated reporting system (default setting) that was evaluated by previous studies.
METHODS: WGS data from 191 datasets comprising 108 PZA-resistant and 83 susceptible strains were used to evaluate the potential performance of the available online tools (TB Profiler, TGS-TB, PhyResSE, and CASTB) for predicting phenotypic PZA resistance.
RESULTS: When taking into consideration the variation calling lists, 73 variants in total (47 non-synonymous mutations and 26 indels) in pncA were detected by TGS-TB and PhyResSE, covering all mutations for the 108 PZA-resistant strains. The 73 variants were confirmed by Sanger sequencing. TB Profiler also detected all but three complete loss, two large deletion at the 3'-end, and one relatively large insertion of pncA. On the other hand, many of the 73 variants were lacking in the automated reporting systems except by TGS-TB; of these variants, CASTB detected only 20. By applying the 'non-wild type sequence' approach for predicting PZA resistance, accuracy of the results significantly improved compared with that of the automated results obtained by each tool.
CONCLUSION: Users can obtain more accurate predictions for PZA resistance than previously reported by manually checking the results and applying the 'non-wild type sequence' approach.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30817803      PMCID: PMC6394917          DOI: 10.1371/journal.pone.0212798

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Mycobacterium tuberculosis (MTB) is a slow-growing bacillus and thus requires several weeks to months for phenotypic drug-susceptibility testing (DST). The essential role and ubiquitous use of pyrazinamide (PZA) in tuberculosis (TB) therapy underlines the importance of DST against PZA [1,2]. However, the test for PZA is more complicated and less reliable than those for other anti-TB drugs due to the difficulty of growing TB under the acidic (pH < 6.0) conditions required for the assay [3-5]. As the current phenotypic test is difficult and time consuming, clinical laboratories are eager for a more accurate and rapid assay to detect PZA resistance. Genotypic DST is an ideal approach for bypassing the difficulty, inaccuracy, and lengthy process of phenotypic DST, providing rapid results [6,7]. However, genotypic DST for PZA is extremely challenging because the pncA mutations that induce loss of pyrazinamidase (PZase) activity (the major mechanism leading to PZA resistance) are highly diverse and scattered throughout the pncA gene [8,9]. Whole-genome sequencing (WGS) can theoretically be used to rule in resistance to all drugs, which would accelerate DST to design appropriate treatment regimens. To assist the interpretation of complex WGS data, several automated online software tools are available for TB DST, including CASTB [10], PhyResSE [11], TB Profiler [12], Mykrobe predictor [13], and TGS-TB [14]. These tools are expected to overcome the barrier of implementing WGS methodologies in routine diagnostic settings, which usually lack bioinformatics expertise. However, it has been reported that the sensitivity and specificity for predicting resistance varies significantly between the different tools and antibiotics tested [15]. Moreover, previous studies that evaluated the predictive accuracy of these automated tools for drug resistance revealed that the accuracy for predicting PZA resistance is low [12,15-17]. This may make users hesitant to use the tools. Herein, we wanted to emphasise that a lack of drug resistance-associated variants in the automatically generated plain text report by the online tools does not necessarily mean the tools did not detect mutations in the region of interest, which are possibly causing resistance. Indeed, TB Profiler, TGS-TB, and PhyResSE provide variant calling lists in addition to the plain text report of drug resistance prediction no matter if they are included or not in the pre-defined mutation catalogue for automated reporting systems. Therefore, real performance evaluation of the tools should be performed by two different aspects, one at the level of the automated reporting system and the other at the full potential of its performance for the detection of mutations in a region of interest; all previous reports that evaluated online tool performance analysed the former aspect but did not take into account the latter [15-17]. To clarify the reasons for the reported low accuracy of PZA resistance prediction and to reveal the real performance of the tools, we evaluated the tools using WGS data from 191 datasets on resistant and susceptible strains at the level of automated reporting and the full potential of the tools.

Materials and methods

Specimens

A total of 191 MTB isolates collected from two different isolate stocks were used in this study. To avoid any unambiguous results by heteroresistance due to the existence of a mixed population [18], single colony isolation from the stock culture was performed. Sample set 1 included 112 multidrug-resistant MTB (MDR-TB) isolates from 112 patients, collected through the RYOKEN TB research consortium during 2012–2013 as part of a national drug resistance survey. Sample set 2 included 79 isolates obtained from Kinki-chuo Chest Medical Center located in the western part of Japan between 2001 and 2015; of these, 53 strains were MDR-TB. The isolates were obtained from 77 single patients including two cases where non-MDR-TB developed into MDR-TB in a single patient.

Ethics statement

All of the clinical isolates in this study were fully anonymized before we accessed them. Clinical isolates of M. tuberculosis in set 2 were used with approval of the ethical committee of Kinki-chuo Chest Medical Center (reference 2015/493). Clinical isolates in set 1 were obtained through the survey followed Japanese national guidelines for epidemiological research and ethical approval was not required for this laboratory study based on M. tuberculosis isolates.

Phenotypic DST of PZA

For all 191 strains, PZA susceptibility was measured using MGIT AST PZA (Becton Dickinson, Franklin Lakes, NJ) according to manufacturer’s instructions. The critical concentration used in this study was 100 μg/mL. Extended analysis using the PZase assay by Wayne’s method [19] was performed for the 16 strains that showed discrepant results between MGIT AST PZA and mutations in pncA or rpsA to determine PZA susceptibility. This analysis was also performed for all strains in sample set 1 (n = 112) to supplement MGIT AST PZA testing. Minimum inhibitory concentration (MIC) measurement for PZA was performed for 7 of the 16 strains whose PZA susceptibility could not be determined even after the PZase assay. MIC was determined [18] from PZA concentrations of 12.5, 25, 50, 100, 200, 300, 400, and 800 μg/mL in MGIT culture media, using the MGIT EpiCenter system. The MIC for M. tuberculosis H37Rv (control for PZA susceptibility) and M. bovis BCG Tokyo (control for PZA resistance) was found to be 50 μg/mL and > 800 μg/mL, respectively. We considered MIC values ≤ 100 as PZA susceptible.

WGS data analysis

Genomic DNA of each isolate was prepared from MTB grown on Ogawa medium using an Isoplant Kit (Wako Pure Chemical Industries Ltd, Tokyo, Japan). A DNA-seq library was prepared with the QIAseq FX DNA library kit (Qiagen, Hilden, Germany) using 50 ng TB genomic DNA, followed by paired-end sequencing using Illumina MiSeq (Illumina, San Diego, CA) and the MiSeq Reagent Kit v3 (600 cycles) according to manufacturer’s instructions. The read data were then analysed using TB Profiler (online version, September 2018) [12], TGS-TB v2 [14], PhyResSE v1.0 [11], and CASTB v1.5 [10]. In addition to the text report obtained through automated interpretation (default result), we manually inspected the tool variation calling lists for pncA, rpsA, panD, and their promoter regions (manual result); this was not performed with CASTB as it does not provide a variation calling list. The synonymous pncA Gln122Gln mutation and the known phylogenetic informative mutations [12] rpsA Met432Thr (for LAM lineage) and rpsA Arg212Arg (for Beijing lineage) detected in this study were excluded from further analysis. Phenotypic PZA susceptibility testing (MGIT testing) complemented with the PZase assay and MIC measurement were used as the standard reference for evaluating the online tools’ prediction powers for PZA resistance.

Sanger sequencing for pncA

To validate the mutation call by online tools, we performed sanger sequencing for all mutation variants of pncA detected in this study as previously described [20].

Statistical analysis

All statistical analyses were performed with EZR [21], a graphical user interface for R.

Read data deposit

Raw sequence reads of all 191 MTB strains subjected to WGS were deposited in the DDBJ Sequence Read Archive (DRA; https://www.ddbj.nig.ac.jp/dra/index.html) under the study accession number DRA006844 (S1 Table).

Results

Variation calling lists by online tools

Mutation variants detected in the 191 strains are summarised in Table 1.
Table 1

Mutation variants and detection by online tools.

GeneMutationNo. of variationTB ProfilerTGS-TBPhyResCASTB
DefaultManualNoneDefaultManualNoneDefaultManualNoneDefaultNone
pncASNV47351204250331401037
 Indel*26020**6260002601016
rpsASNV***808017017008
Indel101001001001
panD promoterSNV101001001001

None, not detected by either default or mannual inspection.

*Variation of the three complete and two 3' end deletions were discriminated based on their regions of deletion.

**Five deletions which showed discrepancy with TGS-TB and PhyRes were included (see S2 Table).

***Met432Thr is not included since it is a LAM defined SNP.

None, not detected by either default or mannual inspection. *Variation of the three complete and two 3' end deletions were discriminated based on their regions of deletion. **Five deletions which showed discrepancy with TGS-TB and PhyRes were included (see S2 Table). ***Met432Thr is not included since it is a LAM defined SNP. All mutations in panA detected through automated interpretation and manual inspection by three online tools (TB Profiler, TGS-TB, and PhyResSE) were validated by Sanger sequencing. Forty-seven single-nucleotide variants (SNVs) were detected by all three tools and were consistent with Sanger sequencing results. Regarding the indels, TGS-TB and PhyResSE called 26 variations that were consistent with Sanger sequencing, whereas TB Profiler showed different calls for 5 indel variants and no calls for 6 indels (three complete losses of pncA, two large deletions at the 3ʹ-end of pncA, and one relatively large insertion in pncA); details of these discrepancies are listed in S2 Table. Furthermore, eight SNVs and one indel in rpsA as well as one SNV in the panD promoter were detected by all three tools; S1 Fig shows an image of the detection of these variants by TGS-TB. In total, 83 variants (47 SNVs and 26 indels in pncA, 8 SNVs and 1 indel in rpsA, and 1 SNV in the panD promoter) were detected in this study and used for further analysis to evaluate the online tools’ prediction powers for PZA resistance. All strains with variants and phenotypic PZA susceptibility informationare listed in S1 Table.

Determination of PZA susceptibility

Of the 191 strains analysed, 98 demonstrated resistance by the MGIT test with mutations in pncA and its promoter (60 in set 1 and 38 in set 2) and were judged PZA resistant (Table 2). The 77 strains that were deemed susceptible by MGIT and without mutations (40 in set 1 and 37 in set 2) were judged PZA susceptible. These classifications correspond with PZase analysis results (S1 Table).
Table 2

Association between PZA resistant conferring mutations and MGIT PZA susceptibility testing.

 MGIT (set 1, n = 112)MGIT (set 2, n = 79)
RSRS
Default mutations by TB-Profiler355214
Mutations other than default by TB Profiler257170
No mutations040037

PZA, pyrazinamide; R, resistant; S, sensitive

PZA, pyrazinamide; R, resistant; S, sensitive Meanwhile, 16 strains of 191 showed discrepant results between genotypic and phenotypic results (Table 2); nine strains were found negative by PZase analysis (the major mechanism leading to PZA resistance) with mutations in the pncA gene and were thus characterised as PZA resistant (Table 3).
Table 3

Extended analysis of the 16 M. tuberculosis strains for determining PZA susceptibility.

Strain IDSourceDefault mutations by TB ProfilerMutations other than default by TB Profiler in candidate genesMGIT PZAPZaseMIC (μg/mL)Conclusion
126set 2pncA (Lys96Gln)noneSneg.N/AR
135set 2pncA (Asp63Gly)noneSneg.N/AR
131set 2pncA (Gly78Ser)noneSneg.N/AR
020set 1pncA (Leu182Ser)noneSneg.N/AR
011set 1pncA (T-11C promoter)noneSneg.N/AR
018set 1pncA (Lys96Asn)noneSneg.N/AR
017set 1pncA (Met175Ile)noneS±200R
019set 1pncA (Leu4Ser)rpsA (Asp15Ala)Sneg.N/AR
129set 2pncA (Gly97Asp)rpsA (1834845: TCGCCGCCGCCG /TCGCCGCCG), panD promoter (A-13G)Sneg.N/AR
052set 1nonepncA (Ser67Stop)Sneg.N/AR
049set 1nonepncA (Val44Ala)S±100S
048set 1nonerpsA (1834845: TCGCCGCCGCCG /TCGCCGCCG)Spos.50S
047set 1nonerpsA (Asp123Ala)Spos.50S
042set 1nonerpsA (Thr42Ala)Spos.100S
043set 1nonerpsA (Asp27Asn)Spos.100S
041set 1nonerpsA (Tyr109His), pncA (Stop187Gly)S±50S

R, resistant; S, sensitive; N/A., not applicable; neg., negative; pos., positive; ±, indeterminant

R, resistant; S, sensitive; N/A., not applicable; neg., negative; pos., positive; ±, indeterminant Of the remaining seven strains that were further analysed by MICs, six were deemed PZA susceptible and one was PZA resistant (Table 3). None of the 191 tested isolates possessed mutations in the panD gene. Overall, the tested isolates were phenotypically classified into 108 PZA-resistant and 83 PZA-susceptible isolates.

Mutations in PZA resistance-conferring genes

All 108 PZA-resistant strains possessed mutations in the pncA gene which consists of 44 different non-synonymous mutations, 26 different indels, and one mutation in the promoter region (S3 Table). None of the 71 mutations were found in the 83 PZA-susceptible strains. Among these mutations, 61 genetic variants had single pncA mutations per strain, one variant showed double pncA mutations, nine variants exhibited double mutations in both pncA and rpsA genes, and one variant had triple mutations in pncA, rpsA, and the panD promoter. All 26 indels in the pncA-containing region detected in this study led to PZA resistance, whereas two non-synonymous mutations in pncA (Stop187Gly, Val44Ala) did not confer PZA resistance. None of the 108 PZA-resistant strains possessed mutations in either rpsA or panD alone, which are putative PZA resistance-conferring genes. Among the 83 PZA-susceptible isolates, one possessed a pncA Val44Ala mutation, four exhibited single mutations in rpsA (Thr42Ala, Asp27Asn, Asp123Ala, or deletion at 1824845), and one showed double mutations in pncA Stop187Gly and rpsA Tyr109His (Table 2). pncA mutations in this study were found distributed across the entire gene (Table 4), as previously reported [7,22,23].
Table 4

Non-synonymous mutations in pncA and their presence in the published lists.

MutationPZA senositivitypncA mutations in the published lists
no. of Rno. of SMiotto [22]Yadon [23]Manson [24]Miotto [25]Farhat [26]
pncA (Met1Leu)10noyesnonono
pncA (Ala3Glu)60yesyesnoyesno
pncA (Leu4Ser)10noyesnoyesyes
pncA (Gln10Arg)10yesyesnoyesyes
pncA (Asp12Ala)30yesnonoyesyes
pncA (Asp12Glu)10yesyesnonoyes
pncA (Cys14Stop)10yesyesnoyesno
pncA (Cys14Tyr)10noyesnonono
pncA (Tyr34Asp)80nononoyesyes
pncA (Tyr34Stop)10yesnoyesyesno
pncA (His51Gln)10yesnoyesyesno
pncA (Pro54Leu)30yesyesnoyesyes
pncA (His57Gln)10yesyesnonono
pncA (Leu159Pro)10noyesnonono
pncA (Asp63Gly)10yesyesnoyesyes
pncA (Ser67Stop)10noyesnonoyes
pncA (Trp68Arg)10yesyesnoyesyes
pncA (Thr76Pro)10yesnonoyesyes
pncA (Gly78Ser)30nonononono
pncA (Lys96Asn)20yesyesnoyesno
pncA (Lys96Gln)10yesyesnonono
pncA (Gly97Asp)30yesyesnoyesyes
pncA (Thr100Pro)10yesnononoyes
pncA (Tyr103Ser)10nonononono
pncA (Tyr103Stop)10yesyesyesyesyes
pncA (Ser104Ile)20yesnononono
pncA (Leu116Arg)10yesyesnoyesno
pncA (Trp119Cys)10yesnononono
pncA (Val125Gly)10yesnononono
pncA (Val128Gly)10yesnonoyesyes
pncA (Gly132Ala)10yesyesnoyesno
pncA (Asp136Gly)10nonononoyes
pncA (Cys138Arg)10yesyesnonono
pncA (Val139Ala)20noyesnoyesyes
pncA (Val139Met)10noyesnonoyes
pncA (Gln141Pro)30yesnonoyesyes
pncA (Ala146Thr)10noyesnonono
pncA (Val155Met)10yesyesnonono
pncA (Val163Gly)10nonononono
pncA (Ser164Pro)10yesyesnonono
pncA (Leu172Pro)10yesyesnoyesyes
pncA (Met175Ile)20yesyesnoyesno
pncA (Val180Phe)10yesyesnoyesyes
pncA (Leu182Ser)20yesyesnonoyes
pncA (Stop187Gly)01nonononono
pncA (Val44Ala)01nonononono

R, resistant; S, sensitive

R, resistant; S, sensitive All 45 non-synonymous mutations in pncA—except for three mutations (Gly78Ser, Thr103Ser, Val163Gly)—detected in this study from 108 PZA-resistant strains were included in either of five comprehensively validated mutation lists as high confidence mutations [22-26] (Table 4). Although these three mutations were not on the list, the strains carrying these mutations showed negative results by PZase analysis and resistance via the MGIT test (S1 Table). Two strains harbouring the Stop187Gly and Val44Ala mutations in pncA were determined as PZA-susceptible by the MGIT test and neither mutation was listed in the five published lists (Table 4). The MIC values for these strains were not greater than 100 μg/mL, whereas M. tuberculosis H37Rv and M. bovis BCG Tokyo showed MICs of 50 μg/mL and > 800 μg/mL, respectively.

Sensitivity and specificity of PZA resistance prediction by online tools

Using WGS data from 191 strains, we analysed the sensitivity and specificity of PZA resistance predictions by different tools available online (TB profiler, TGS-TB, PhyResSE, and CASTB). To evaluate tool performance at the automated reporting system level and at their full potential for mutation detection, we categorised the mutations detected by TB profiler, TGS-TB, and PhyResSE as either default or manual (S3 Table and Table 1); default indicates the automated interpretation report and manual indicates all mutations detected by the tools other than default. Remarkable differences between the tools were found in their detection of indels. TGS-TB returned all 26 indel variants in the pncA gene through default settings, whereas the other two tools did not (Table 1). The default detection rate of SNVs in pncA varied among the tools; TGS-TB exhibited the best detection rate (42/47), followed by TB Profiler (35/47), PhyResSE (33/47), and CASTB which detected only 10 SNVs and 10 indels (Table 1). As a result, the default sensitivity of the tools was low (60.1%, 49.1%, and 31.5% for TB Profiler, PhyResSE, and CASTB, respectively), except for TGS-TB (97.2%; Table 5) which is consistent with previous studies [12,15-17].
Table 5

Sensitivity and specificity of the online tools in predicting PZA resistance.

   PZA susceptibilitySensitivity(95% CI)Specificity(95% CI)PPV(95% CI)NPV(95% CI)
   R (n = 108)S (n = 83)
Defalt settingTGS-TBR10510.972 (0.921–0.991)0.988 (0.935–0.998)0.991(0.949–1.00)0.965(0.900–0.993)
S382
TB ProfilerR6500.601 (0.510–0.694)1(0.935–1.000)1.000(0.918–1.000)0.659(0.569–0.741)
S4383
PhyResSER5310.491 (0.396–0.585)0.988 (0.935–0.998)0.981(0.901–1.000)0.599(0.511–0.681)
S5582
CASTBR3410.315 (0.227–0.402)0.988 (0.935–0.998)0.971(0.851–0.999)0.526(0.444–0.606)
S7482
Manual settingTGS-TBR10861(0.950–1.000)0.928 (0.872–0.983)0.947(0.889–0.980)1.000(0.931–1.000)
S077
TB ProfilerR10260.944(0.883–0.979)0.928 (0.849–0.973)0.944(0.883–0.979)0.928(0.849–0.973)
S677
PhyResSER10861(0.950–1.000)0.928 (0.872–0.983)0.947(0.889–0.980)1.000(0.931–1.000)
S077

R, resistance; S, Sensitive; CI, confidence interval; PPV, positive predictive value; NPV, negative predictive value

All strains with indels or mutations detected by manual setting are classified as PZA resistance.

R, resistance; S, Sensitive; CI, confidence interval; PPV, positive predictive value; NPV, negative predictive value All strains with indels or mutations detected by manual setting are classified as PZA resistance. This drastically changed when we evaluated the detection rate with manual settings. PhyResSE detected all 26 indels while TB Profiler missed 6 indels (Table 1); of these 6 indels, two were large deletions at the 3ʹ region of pncA, three were complete losses of pncA, and one was a large fragment insertion at 2288788 (S3 Table). Five indels that were miscalled by TB Profiler (S2 Table) were treated as the call for PZA resistance prediction by TB Profiler. All mutations in rpsA and the panD promoter in our sample set were detected by the three tools through searches beyond the default report (Table 1). When we deemed the strains with these indels and non-synonymous mutations as PZA resistant (interpret ‘non-wild type sequence’ as PZA resistant), the three online tools demonstrated remarkably high sensitivity and specificity in predicting PZA resistance (Table 5). Specificity slightly decreased when manual results were compared with those of default (Table 5). This is due to the inclusion of mutations in rpsA whose association with PZA resistance is debatable [27,28].

Discussion

Our analysis of online tool performance in detecting PZA resistance demonstrated that the tools can predict resistance with more sensitivity than previously reported when all detected variants (non-synonymous mutations and indels) are taken into account (Table 5). In fact, all or most mutations in the pncA gene of 108 PZA-resistant strains were detected by TB Profiler, TGS-TB, and PhyResSE but many variants were missing in the automated interpretation for plain text report. As a result, the accuracy of PZA resistance prediction by online tools differed substantially between the default report and manual inspection (Table 5). Thus, the fundamental cause for the reported low sensitivity of PZA resistance prediction is the tool interpretation pipeline, which is built upon pre-defined mutation catalogues and not due to the inability of software algorithms in detecting genetic variants. The main aim of the online tools is to make complex WGS data analysis accessible for routine use in clinical settings where there is usually a lack in bioinformatics expertise. The advantage of these tools is automated interpretation for drug resistance upon pre-defined mutation catalogues. However, this study emphasised that proper knowledge of drug resistance-conferring mutations with some background in bioinformatics is still needed to avoid misinterpretations, i.e., a lack of drug resistance-associated variants in the automated report does not necessarily mean that there are no mutations in the region of interest causing resistance. Users should be aware that three online tools (TB Profiler, TGS-TB, PhyResSE) used in this study provide variant calling lists in addition to the plain text report of drug resistance prediction no matter if they are included or not in the pre-defined mutation catalogue for automated reporting systems. The previously reported low accuracy of PZA resistance prediction can be improved by a manual inspection of the list (Table 5). Although the tools do not guide users for this adequately, the variation calling list provided by the tools is a straightforward approach. A recent genome-wide association study (GWAS) indicated that PZA resistance prediction can be enhanced by the inclusion of insertions and deletions [29], which was also confirmed by our data. It is clear that the superiority of TGS-TB over the other tools regarding the default setting was mainly due to the inclusion of indels for reporting PZA resistance. Our data showed a higher ratio of strains with structural variants by indels in PZA resistance (29.6%) compared with that of the previously reported values (16%, 19.6%, and 22.7%) for global samples [7], the Chinese province Zhejian [20], and the Chinese city Hanzhou [30], respectively. This higher ratio was observed for both sample set 1 (28.3%) and set 2 (30.0%) in our study, thereby reflecting the situation in Japan. These data imply that the impact of structural variants on PZA resistance is more significant in Japan where the ancient Beijing genotype is predominant compared with that of other regions [31,32]. Further studies are needed to clarify this distribution. In this study, we interpreted strains with any non-synonymous mutations or indels in pncA as genotypic PZA-resistant strains; this ‘non-wild type sequence’ approach was implemented in the Genoscholar PZA-TB II line probe assay (PZA-LPA2; NIPRO Corporation, Osaka, Japan) [28]. One concern with this approach is the overestimation of resistant strains caused by non-resistance-associated mutations, including synonymous and lineage-specific mutations. Unlike the line probe assay, WGS-based DST can exempt them from mutations conferring PZA resistance, but with a lack of reliable lists as a drawback. Yadon et al. [23] identified many non-synonymous amino acid substitutions that do not confer PZA resistance by using a comprehensive library of pncA mutations in M. tuberculosis created by random PCR mutagenesis. Although these mutations were created in vitro, such a list would be helpful in identifying exemptions from the ‘non-wild type sequence’ approach. To evaluate the impact of non-resistance-associated mutations on this approach in clinical settings, we examined recent studies performed in clinical settings under well-controlled conditions. The degree of adverse effects by non-resistance-associated mutations varied but was low, i.e., one mutation out of 400 PZA-susceptible isolates (1/400) [3], 5/156 [20], 0/57 [28], and 2/83 in this study. This implies that the occurrence of such mutations inside a patient’s body is rare. Although the overestimation of resistant strains is unavoidable by the ‘non-wild type sequence’ approach, it can effectively screen PZA-susceptible strains and help reduce the need for culture-based DST, which is labour-intensive, requires facilities with biosafety measures, and is usually less reliable for PZA than other key drugs [33]. Inclusion of other PZA resistance-associated genes such as panD [34] may improve the accuracy of screening for PZA-susceptible strains, although it was not detected in this study. In our sample set, inclusion of mutations in rpsA as PZA resistance-conferring mutations decreased specificity but did not increase sensitivity of WGS-based tools for PZA resistance prediction (Table 5). It remains debatable if rpsA is associated with PZA resistance [27,28]. Moreover, to draw conclusions on its use for the prediction of PZA resistance, MIC measurement for a large number of strains with mutations in rpsA should be performed. It is of note that detection of a complete region of interest deletion, such as for pncA in this study and katG for isoniazid resistance [35], is troublesome. TGS-TB and PhyResSE can detect complete deletions; however, we had to check the mapping status of PhyResSE results, which is not straightforward from the variation calling list and would likely be missed. TGS-TB was much more user friendly in this regard. TB profiler did not report the deletion as the tool has no function to visualise mapping status. To make the tools more user friendly, the developers should consider reporting the intactness of the region of interest as ‘intact and wild-type’ and including it in the automated reporting system, especially for pncA and possibly katG as well. The study limitation is that we did not evaluate heteroresistant strains of mixed populations that appear in clinical settings [18]. How to deal with phenotype/genotype discrepant results, mutations conferring low levels of resistance, and heteroresistance are the practical questions that need to be addressed by WGS data analysis [36,37]. In conclusion, our findings demonstrated that users can obtain more accurate predictions for PZA resistance than previously reported by carefully checking the results manually and applying the ‘non-wild type sequence’ approach. It would be worthwhile for developers to include all pncA mutations without synonymous and lineage-specific mutations into their automated reporting system for PZA resistance prediction as already implemented by TGS-TB for indels. This makes online PZA resistance prediction tools more suitable for clinical setting where there is usually a lack in bioinformatics expertise.

Capture image of mapping results by TGS-TB.

(PPTX) Click here for additional data file.

Strains analyzed in this study and their profiles of PZA resistance.

(XLSX) Click here for additional data file.

Variants in pncA those showed discrepant mutation calls among the online tools.

(XLSX) Click here for additional data file.

Variation of the mutations detected in this study.

(XLSX) Click here for additional data file.
  37 in total

1.  Conditions that may affect the results of susceptibility testing of Mycobacterium tuberculosis to pyrazinamide.

Authors:  Ying Zhang; Sallie Permar; Zhonghe Sun
Journal:  J Med Microbiol       Date:  2002-01       Impact factor: 2.472

2.  Genetic Determinants of Drug Resistance in Mycobacterium tuberculosis and Their Diagnostic Value.

Authors:  Maha R Farhat; Razvan Sultana; Oleg Iartchouk; Sam Bozeman; James Galagan; Peter Sisk; Christian Stolte; Hanna Nebenzahl-Guimaraes; Karen Jacobson; Alexander Sloutsky; Devinder Kaur; James Posey; Barry N Kreiswirth; Natalia Kurepina; Leen Rigouts; Elizabeth M Streicher; Tommie C Victor; Robin M Warren; Dick van Soolingen; Megan Murray
Journal:  Am J Respir Crit Care Med       Date:  2016-09-01       Impact factor: 21.405

3.  Population structure dynamics of Mycobacterium tuberculosis Beijing strains during past decades in Japan.

Authors:  Tomotada Iwamoto; Riyo Fujiyama; Shiomi Yoshida; Takayuki Wada; Chika Shirai; Yasuto Kawakami
Journal:  J Clin Microbiol       Date:  2009-08-26       Impact factor: 5.948

4.  Potential for erroneous results indicating resistance when using the Bactec MGIT 960 system for testing susceptibility of Mycobacterium tuberculosis to pyrazinamide.

Authors:  Pamela Chedore; Lina Bertucci; Joyce Wolfe; Meenu Sharma; Frances Jamieson
Journal:  J Clin Microbiol       Date:  2009-11-18       Impact factor: 5.948

5.  Mycobacterium tuberculosis resistance prediction and lineage classification from genome sequencing: comparison of automated analysis tools.

Authors:  Viola Schleusener; Claudio U Köser; Patrick Beckert; Stefan Niemann; Silke Feuerriegel
Journal:  Sci Rep       Date:  2017-04-20       Impact factor: 4.379

6.  PhyResSE: a Web Tool Delineating Mycobacterium tuberculosis Antibiotic Resistance and Lineage from Whole-Genome Sequencing Data.

Authors:  Silke Feuerriegel; Viola Schleusener; Patrick Beckert; Thomas A Kohl; Paolo Miotto; Daniela M Cirillo; Andrea M Cabibbe; Stefan Niemann; Kurt Fellenberg
Journal:  J Clin Microbiol       Date:  2015-04-08       Impact factor: 5.948

Review 7.  Mycobacterium tuberculosis and whole genome sequencing: a practical guide and online tools available for the clinical microbiologist.

Authors:  G Satta; A Atzeni; T D McHugh
Journal:  Clin Microbiol Infect       Date:  2016-09-15       Impact factor: 8.067

8.  Investigation of the freely available easy-to-use software 'EZR' for medical statistics.

Authors:  Y Kanda
Journal:  Bone Marrow Transplant       Date:  2012-12-03       Impact factor: 5.483

9.  Population-based resistance of Mycobacterium tuberculosis isolates to pyrazinamide and fluoroquinolones: results from a multicountry surveillance project.

Authors:  Matteo Zignol; Anna S Dean; Natavan Alikhanova; Sönke Andres; Andrea Maurizio Cabibbe; Daniela Maria Cirillo; Andrei Dadu; Andries Dreyer; Michèle Driesen; Christopher Gilpin; Rumina Hasan; Zahra Hasan; Sven Hoffner; Ashaque Husain; Alamdar Hussain; Nazir Ismail; Mostofa Kamal; Mikael Mansjö; Lindiwe Mvusi; Stefan Niemann; Shaheed V Omar; Ejaz Qadeer; Leen Rigouts; Sabine Ruesch-Gerdes; Marco Schito; Mehriban Seyfaddinova; Alena Skrahina; Sabira Tahseen; William A Wells; Ya Diul Mukadi; Michael Kimerling; Katherine Floyd; Karin Weyer; Mario C Raviglione
Journal:  Lancet Infect Dis       Date:  2016-07-07       Impact factor: 25.071

10.  TGS-TB: Total Genotyping Solution for Mycobacterium tuberculosis Using Short-Read Whole-Genome Sequencing.

Authors:  Tsuyoshi Sekizuka; Akifumi Yamashita; Yoshiro Murase; Tomotada Iwamoto; Satoshi Mitarai; Seiya Kato; Makoto Kuroda
Journal:  PLoS One       Date:  2015-11-13       Impact factor: 3.240

View more
  3 in total

Review 1.  The Bewildering Antitubercular Action of Pyrazinamide.

Authors:  Elise A Lamont; Nicholas A Dillon; Anthony D Baughn
Journal:  Microbiol Mol Biol Rev       Date:  2020-03-04       Impact factor: 11.056

2.  Evaluation of whole-genome sequence data analysis approaches for short- and long-read sequencing of Mycobacterium tuberculosis.

Authors:  Nilay Peker; Leonard Schuele; Nienke Kok; Miguel Terrazos; Stefan M Neuenschwander; Jessica de Beer; Onno Akkerman; Silke Peter; Alban Ramette; Matthias Merker; Stefan Niemann; Natacha Couto; Bhanu Sinha; John Wa Rossen
Journal:  Microb Genom       Date:  2021-11

3.  Prediction of Mycobacterium tuberculosis pyrazinamidase function based on structural stability, physicochemical and geometrical descriptors.

Authors:  Rydberg Roman Supo-Escalante; Aldhair Médico; Eduardo Gushiken; Gustavo E Olivos-Ramírez; Yaneth Quispe; Fiorella Torres; Melissa Zamudio; Ricardo Antiparra; L Mario Amzel; Robert H Gilman; Patricia Sheen; Mirko Zimic
Journal:  PLoS One       Date:  2020-07-31       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.