| Literature DB >> 30337678 |
Matthew Ezewudo1, Amanda Borens1, Álvaro Chiner-Oms2, Paolo Miotto3, Leonid Chindelevitch4, Angela M Starks5, Debra Hanna1, Richard Liwski1, Matteo Zignol6, Christopher Gilpin6, Stefan Niemann7, Thomas Andreas Kohl8, Robin M Warren9, Derrick Crook10, Sebastien Gagneux11, Sven Hoffner12, Camilla Rodrigues13, Iñaki Comas14, David M Engelthaler15, David Alland16, Leen Rigouts17, Christoph Lange18, Keertan Dheda19, Rumina Hasan20, Ruth McNerney21, Daniela M Cirillo3, Marco Schito1, Timothy C Rodwell22,23, James Posey24.
Abstract
Drug-resistant tuberculosis poses a persistent public health threat. The ReSeqTB platform is a collaborative, curated knowledgebase, designed to standardize and aggregate global Mycobacterium tuberculosis complex (MTBC) variant data from whole genome sequencing (WGS) with phenotypic drug susceptibility testing (DST) and clinical data. We developed a unified analysis variant pipeline (UVP) ( https://github.com/CPTR-ReSeqTB/UVP ) to identify variants and assign lineage from MTBC sequence data. Stringent thresholds and quality control measures were incorporated in this open source tool. The pipeline was validated using a well-characterized dataset of 90 diverse MTBC isolates with conventional DST and DNA Sanger sequencing data. The UVP exhibited 98.9% agreement with the variants identified using Sanger sequencing and was 100% concordant with conventional methods of assigning lineage. We analyzed 4636 publicly available MTBC isolates in the ReSeqTB platform representing all seven major MTBC lineages. The variants detected have an above 94% accuracy of predicting drug based on the accompanying DST results in the platform. The aggregation of variants over time in the platform will establish confidence-graded mutations statistically associated with phenotypic drug resistance. These tools serve as critical reference standards for future molecular diagnostic assay developers, researchers, public health agencies and clinicians working towards the control of drug-resistant tuberculosis.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30337678 PMCID: PMC6194142 DOI: 10.1038/s41598-018-33731-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Evaluation of Unified variant pipeline SNP calling versus SNP detection using Sanger sequencing and PhyResSE analysis pipeline.
| Resistance gene | # of SNPs detected by Sanger sequencing | # of SNPs detected by UVP | # of discordant SNPs between UVP and Sanger sequencing | % Concordance between UVP and Sanger sequencing | # of SNPs detected by PhyResSE | % Concordance between UVP and PhyResSE |
|---|---|---|---|---|---|---|
|
| 19 | 19 | 0 | 100 | 19 | 100 |
|
| 66 | 67 | 1 | 98.9 | 67 | 100 |
|
| 25 | 25 | 0 | 100 | 25 | 100 |
|
| 19 | 20 | 1 | 98.9 | 20 | 100 |
|
| 16 | 17 | 1 | 98.9 | 17 | 100 |
|
| 11 | 12 | 1 | 98.9 | 12 | 100 |
Comparisons of Indels calls by UVP and validated variants previously shown to be present in given isolates.
| Sample ID | PubMed ID | Affected gene | H37Rv reference genomic position | Indel | UVP inferred Indel | Lineage information | |
|---|---|---|---|---|---|---|---|
| ERR502929 | 27021327 | dfrA, thyA | 3073130–3074471 | complete deletion | complete deletion | 2.2.1 | present |
| ERR751361 | 27021327 | Wt | 3073130–3074471 | no deletion | no deletion | 2.2.1 | present |
| ERR751453 | 27021327 | dfrA, thyA | 3073130–3074471 | complete deletion | complete deletion | 2.2.1 | present |
| ERR751483 | 27021327 | Wt | 3073130–3074471 | no deletion | no deletion | 2.2.1 | present |
| ERR775373 | 27021327 | dfrA, thyA | 3073130–3074471 | complete deletion | complete deletion | 2.2.1 | present |
| ERR779910 | 27021327 | dfrA, thyA | 3073130–3074471 | complete deletion | complete deletion | 2.2.1 | present |
| SRR2333215 | 27021327 | dfrA, thyA | 3073130–3074471 | complete deletion | complete deletion | 2.2.2 | absent |
| SRR1952721 | 25977398 | pncA | 2288681–2289241 | complete deletion | complete deletion | 2.2.1 | present |
| SRR1948177 | 25977398 | pncA | 2288681–2289241 | complete deletion | complete deletion | 2.2.1 | present |
| R721_C13 | 26496891 | atsD | 756970 | C > CG | C > CG | 2.2.1.1 | present |
| R721_C13 | 26496891 | regX3 | 581317 | G > GCG | G > GCG | 2.2.1.1 | present |
| R966_C5 | 26496891 | Rv3861 | 4338065 | AC > A | AC > A | 2.2.1 | present |
| R160_C13 | 26496891 | adhD | 3452173 | CG > CGCCAT | CG > CGCCATG | 4.1.1.3 | absent |
| R376_C11 | 26496891 | Rv3256c | 3636924 | ACC > AC | ACC > AC | 4.1.2.1 | absent |
| R912_C11 | 26496891 | yrbE4A | 3920814 | GA > G | GAA > GA | 1.2.2 | absent |
| R965_C1 | 26496891 | adhA | 2110232 | A > AG | AG > AGG | 2.2.1 | present |
| R458_C3 | 26496891 | espR | 4323810 | GC > G | GC > G | 4.1.2 | absent |
| R641_C1 | 26496891 | Rv2375 | 2655550 | A > AT | AT > ATT | 2.2.1.1 | present |
Figure 1Distribution of isolates on ReSeqTB platform among the seven major lineages of the Mycobacterium tuberculosis complex (MTBC). This schematic shows the proportional representation of the major MTBC lineages in our dataset. The Euro American lineage has the most representation, but every other major lineage including Lineage 7 and animal strains are represented in the dataset.
Sensitivity, specificity and accuracy values of drug resistance prediction by high confidence mutations inferred to be associated with antibiotic resistance in isolates within the data set with DST results from the BACTEC MGIT 960 Mycobacteria liquid culture method.
| Drug | TP | FP | TN | FN | Sensitivity | 95%CI Low | 95% CI High | Specificity | 95%CI Low | 95%CI High | Accuracy (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Isoniazid | 321 | 1 | 463 | 41 | 0.887 | 0.848 | 0.917 | 0.997 | 0.986 | 0.999 | 94.9 |
| Levofloxacin | 23 | 10 | 313 | 2 | 0.92 | 0.725 | 0.986 | 0.969 | 0.942 | 0.984 | 96.6 |
| Ofloxacin | 35 | 1 | 312 | 2 | 0.946 | 0.805 | 0.991 | 0.997 | 0.978 | 0.999 | 99.1 |
| Rifampicin | 275 | 2 | 533 | 19 | 0.935 | 0.899 | 0.96 | 0.996 | 0.985 | 0.999 | 97.5 |