| Literature DB >> 34919513 |
Carla Mariner-Llicer1, Galo A Goig2,3, Laura Zaragoza-Infante4, Manuela Torres-Puente1, Luis Villamayor5, David Navarro6,7, Rafael Borras6,7, Álvaro Chiner-Oms1, Iñaki Comas1,8.
Abstract
A rapid and accurate diagnostic assay represents an important means to detect Mycobacterium tuberculosis, identify drug-resistant strains and ensure treatment success. Currently employed techniques to diagnose drug-resistant tuberculosis include slow phenotypic tests or more rapid molecular assays that evaluate a limited range of drugs. Whole-genome-sequencing-based approaches can detect known drug-resistance-conferring mutations and novel variations; however, the dependence on growing samples in culture, and the associated delays in achieving results, represents a significant limitation. As an alternative, targeted sequencing strategies can be directly performed on clinical samples at high throughput. This study proposes a targeted sequencing assay to rapidly detect drug-resistant strains of M. tuberculosis using the Nanopore MinION sequencing platform. We designed a single-tube assay that targets nine genes associated with drug resistance to seven drugs and two phylogenetic-determining regions to determine strain lineage and tested it in nine clinical isolates and six sputa. The study's main aim is to calibrate MinNION variant calling to detect drug-resistance-associated mutations with different frequencies to match the accuracy of Illumina (the current gold-standard sequencing technology) from both culture and sputum samples. After calibrating Nanopore MinION variant calling, we demonstrated 100% agreement between Illumina WGS and our MinION set up to detect known drug resistance and phylogenetic variants in our dataset. Importantly, other variants in the amplicons are also detected, decreasing the recall. We identify minority variants and insertions/deletions as crucial bioinformatics challenges to fully reproduce Illumina WGS results.Entities:
Keywords: MinION; Mycobacterium tuberculosis; Nanopore; amplicon sequencing; drug-resistance; genomics; genotyping
Mesh:
Substances:
Year: 2021 PMID: 34919513 PMCID: PMC8767343 DOI: 10.1099/mgen.0.000740
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Accession numbers of samples included in this study
|
Illumina |
MinION | |||
|---|---|---|---|---|
|
|
|
|
|
|
|
G841 |
PRJEB22237 |
ERR2099789 |
PRJEB44496 |
ERS6293816 |
|
N0067 |
PRJEB3223 |
ERR233356 |
PRJEB44496 |
ERS6293819 |
|
G1335_W27 |
PRJEB37609 |
ERR3994128 |
PRJEB44496 |
ERS6293807 |
|
G1646_W19 |
PRJEB37609 |
ERR3994105 |
PRJEB44496 |
ERS6293808 |
|
G2103_W23 |
PRJEB37609 |
ERR3994108 |
PRJEB44496 |
ERS6293812 |
|
G2267_W20 |
PRJEB37609 |
ERR3994106 |
PRJEB44496 |
ERS6293813 |
|
G2284_W26 |
PRJEB37609 |
ERR3994111 |
PRJEB44496 |
ERS6293814 |
|
G107 |
PRJEB38719 |
ERR4195665 |
PRJEB44496 |
ERS6293806 |
|
G1800 |
PRJEB38719 |
ERR4195305 |
PRJEB44496 |
ERS6293809 |
|
G1961 |
PRJEB38719 |
ERR4195372 |
PRJEB44496 |
ERS6293811 |
|
G770 |
PRJEB38719 |
ERR4195357 |
PRJEB44496 |
ERS6293815 |
|
G870 |
PRJEB38719 |
ERR4195619 |
PRJEB44496 |
ERS6293817 |
|
G981 |
PRJEB38719 |
ERR4195376 |
PRJEB44496 |
ERS6293818 |
|
182320_M20 |
PRJEB44496 |
ERS6337728 |
PRJEB44496 |
ERS6293805 |
|
G1952 |
PRJEB44496 |
ERS6337729 |
PRJEB44496 |
ERS6293810 |
Description of the dataset included in this study
|
Sample |
Sample type |
Run MinION |
Resistance profile |
Resistance agreement |
Lineage |
Lineage agreement |
|---|---|---|---|---|---|---|
|
182320_M20 |
Sputum |
Run1 |
Susceptible |
100 |
Lineage4 |
100 |
|
G107 |
Culture |
Run1 |
Susceptible |
100 |
Lineage3 |
100 |
|
G1800 |
Culture |
Run1 |
FQ, RMP, INH, KAN, EMB |
100 |
Lineage2 |
100 |
|
G1952 |
Culture |
Run1 |
Susceptible |
100 |
Lineage6 |
100 |
|
G1961 |
Culture |
Run1 |
RMP, EMB |
100 |
Lineage5 |
100 |
|
G770 |
Culture |
Run1 |
Susceptible |
100 |
Lineage4 |
100 |
|
G841 |
Culture |
Run1 |
RMP |
100 |
Lineage4 |
100 |
|
G870 |
Culture |
Run1 |
FQ, RMP, INH, KAN, EMB |
100 |
Lineage2 |
100 |
|
G981 |
Culture |
Run1 |
Susceptible |
100 |
Lineage2 |
100 |
|
N0067 |
Culture |
Run1 |
Susceptible |
100 |
Lineage1 |
100 |
|
G1335_W27 |
Sputum |
Run2 |
Susceptible |
100 |
Lineage2 |
100 |
|
G1646_W19 |
Sputum |
Run2 |
Susceptible |
100 |
Lineage4 |
100 |
|
G2103_W23 |
Sputum |
Run2 |
Susceptible |
100 |
Lineage4 |
100 |
|
G2267_W20 |
Sputum |
Run2 |
Susceptible |
100 |
Lineage4 |
100 |
|
G2284_W26 |
Sputum |
Run2 |
Susceptible |
100 |
Lineage4 |
100 |
TP, true positive variants; FN, false negative variants; FP, false positive variants; TN, true negative variants; TPR, true positive rate; TNR, true negative rate.
Genes and phylogenetic regions included in the gene panel, their corresponding positions in the MTBC genome and the antibiotic they confer resistance to
|
Amplicon MTBC genomic positions |
Gene |
Prediction | |
|---|---|---|---|
|
Start |
End | ||
|
4193 |
7279 |
|
Fluoroquinolones |
|
7300 |
9882 |
|
Fluoroquinolones |
|
759779 |
763387 |
|
Rifampicin |
|
1471177 |
1474319 |
|
Streptomycin |
|
1673088 |
1675890 |
|
Isoniazid |
|
2153843 |
2156356 |
|
Isoniazid |
|
2287571 |
2290686 |
|
Pirazinamid |
|
2713324 |
2716363 |
|
Kanamycin |
|
4246440 |
4249901 |
|
Ethambutol |
|
4356331 |
4359445 |
|
L1, L2, L5 |
|
1279773 |
1282976 |
|
L3, L4, L6 |
Fig. 1.Diagram of the target regions of the gene panel. The lengths of regions are not scaled to genome size.
Fig. 2.Workflow diagram summarizing the methodology.
Fig. 3.qPCR of the multiplex PCR product. Each colour represents a different gene. Two replicates per gene and negative controls were included.
Fig. 4.Depth of coverage by gene in each MinION sequencing run. Genes are ordered by length (from the shortest to the longest). (a) Bar plot representing the median depth by sample and by region of the gene panel. Colours represent the sequencing run and the pattern of the bars represent sample type. (b) Box plot representing the distribution of sequencing depths of each genomic region in the 15 samples. Plots are coloured according to the MinION sequencing run (purple for the first run, blue for the second run).
Median depth of coverage for each gene per MinION run
|
Region |
Run 1 |
Run 2 | ||||
|---|---|---|---|---|---|---|
|
Median depth |
IQR |
Min–max |
Median depth |
IQR |
Min–max | |
|
|
6677 |
737.25 |
4213–7493 |
1719 |
373 |
231–1856 |
|
|
6505 |
1561 |
4453–8510 |
2383 |
702 |
2056–3129 |
|
|
5802 |
1059.25 |
3916–7695 |
1479 |
410 |
863–1580 |
|
|
6222 |
1590 |
1310–7489 |
3011 |
695 |
686–3058 |
|
|
4564 |
801 |
2962–6318 |
3592 |
976.5 |
2833–4370 |
|
|
4358 |
1754.75 |
2086–6780 |
3888 |
960 |
587–4763 |
|
|
2156 |
1135.75 |
1299–3985 |
1211 |
1067 |
152–1967 |
|
|
2016 |
2011.75 |
1605–2643 |
84 |
78 |
3–118 |
|
|
2022 |
1088 |
852–4278 |
166 |
241 |
40–495 |
|
L125 |
2124 |
1597.5 |
994–3109 |
1520 |
1684 |
37–1896 |
|
L346 |
5222 |
1618 |
3844–7428 |
3593 |
3930.625 |
1860–3943 |
IQR, interquartile range.
Fig. 5.ROC curve used to set the frequency threshold employed to call variants in MinION. Points represent the values for recall and false positive rate obtained when applying different frequency values in MinION variant calling. Both axes are truncated. (a) ROC curve used to set the frequency threshold to call variable SNPs in MinION. Recall and false positive rate value obtained using different variant calling frequency cut-offs for MinION (from 0.1 to 0.9 using increments of 0.1) and comparing with Illumina variant calls at a 0.1 fixed threshold. (b) ROC curve used to determine the frequency threshold to call fixed SNPs in MinION. Both axes are truncated. Recall and false positive rate value obtained using different variant calling frequency cut-offs for MinION (from 0.3 to 0.9 using increments of 0.1) and comparing with Illumina variant calls at a 0.9 fixed threshold. For a detailed result see Fig. S1.
Fig. 6.Comparison of variants found in Illumina and MinION. (a) Number of variants by sample obtained in MinION and Illumina that passed all quality filters for credible variants defined in Methods. Blue bars represent all variants detected in MinION reads, yellow bars represent variants obtained in Illumina reads, and purple bars represent common variants detected by both sequencing platforms. TP, FP and FN values in the inset box were obtained considering 15 samples. (b) Variant frequency correlation between Illumina and MinION reads. Only SNPs that passed all filters are represented. Points represent variants that appear in both platforms, squares represent discrepant variants only present in MinION reads, and triangles represent variants only present in Illumina. Orange represents phylogenetic variants, purple represents variants associated with antibiotic resistance, and yellow represents other variants. Dashed lines represent the threshold values to define fixed variants for MinION (horizontal) and Illumina (vertical).
Comparison of common and discrepant fSNPs
|
Frequency in MinION |
TP |
FP |
FN |
TN |
TPR |
TNR |
|---|---|---|---|---|---|---|
|
0.1 |
129 |
1489 |
0 |
478248 |
1 |
0.9968962 |
|
0.2 |
129 |
98 |
0 |
479592 |
1 |
0.9997957 |
|
0.5 |
129 |
4 |
0 |
479678 |
1 |
0.9999917 |
|
0.4 |
129 |
6 |
0 |
479676 |
1 |
0.9999875 |
|
0.3 |
129 |
16 |
0 |
479666 |
1 |
0.9999666 |
|
0.6 |
129 |
3 |
0 |
479679 |
1 |
0.9999937 |
|
|
|
|
|
479680 |
|
|
|
0.8 |
125 |
2 |
4 |
479680 |
0.9689922 |
0.9999958 |
|
0.9 |
115 |
2 |
14 |
479679 |
0.8914729 |
0.9999958 |
TP, true positive; FN, false negative; FP, false positive; TN, true negative; TPR, true-positive rate; TNR, true-negative rate.
Indels' comparison
|
Frequency in MinION |
Indels MinION |
Indels Illumina |
TP |
|---|---|---|---|
|
Freq 10 |
14413 |
16 |
6 |
|
Freq 20 |
3735 |
16 |
6 |
|
Freq 30 |
1159 |
16 |
5 |
|
Freq 40 |
356 |
16 |
5 |
|
Freq 50 |
81 |
16 |
5 |
|
Freq 60 |
49 |
16 |
5 |
|
Freq 70 |
19 |
16 |
5 |
|
Freq 80 |
17 |
16 |
4 |
|
Freq 90 |
1 |
16 |
1 |
|
Freq 100 |
0 |
16 |
0 |
TP, true positive.
Phylogenetic and drug-resistance-associated variants detected by the gene panel
|
Sample |
Variant |
Gene |
Antibiotic or lineage |
Frequency MinION |
Frequency llumina |
Resistance profile |
|---|---|---|---|---|---|---|
|
G1961_L5 |
761155CT |
|
RMP |
96.33 |
100 |
MDR |
|
G1961_L5 |
1673425CT |
|
INH |
52.94 |
55.67 | |
|
G1961_L5 |
4247431GA |
|
EMB |
89.12 |
100 | |
|
G1961_L5 |
4357657GA |
L125 |
Lineage 5 |
95.37 |
100 | |
|
G1800 |
7570CT |
|
FQ |
87.44 |
100 |
pre-XDR |
|
G1800 |
761095TC |
|
RMP |
92.72 |
100 | |
|
G1800 |
2155168CG |
|
INH |
98.06 |
100 | |
|
G1800 |
2715344GA |
|
KAN |
96.54 |
100 | |
|
G1800 |
4247429AG |
|
EMB |
98.49 |
100 | |
|
G1800 |
4357804TG |
L125 |
Lineage 2 |
98.17 |
100 | |
|
G870 |
6750CT |
|
FQ |
95.26 |
100 |
pre-XDR |
|
G870 |
761155CT |
|
RMP |
95.76 |
99.17 | |
|
G870 |
2155168CG |
|
INH |
98.11 |
100 | |
|
G870 |
2715346GA |
|
KAN |
92.68 |
100 | |
|
G870 |
4247429AG |
|
EMB |
98.28 |
100 | |
|
G870 |
4357804TG |
L125 |
Lineage 2 |
98.49 |
100 | |
|
G841 |
761277AT |
|
RMP |
98.67 |
100 |
RIF-R |
|
G841 |
1281771CT |
L346 |
Lineage 4 |
93.07 |
100 | |
|
N0067_L1 |
4357773GA |
L125 |
Lineage 1 |
94.77 |
100 |
Susceptible |
|
G1335_W27 |
4357804TG |
L125 |
Lineage 2 |
97.43 |
100 |
Susceptible |
|
G981_L2 |
4357804TG |
L125 |
Lineage 2 |
98.4 |
100 |
Susceptible |
|
G107_L3 |
1281984GA |
L346 |
Lineage 3 |
97.62 |
100 |
Susceptible |
|
182320_M20 |
1281771CT |
L346 |
Lineage 4 |
92.89 |
100 |
Susceptible |
|
G1646_W19 |
1281771CT |
L346 |
Lineage 4 |
90.75 |
100 |
Susceptible |
|
G2103_W23 |
1281771CT |
L346 |
Lineage 4 |
89.69 |
100 |
Susceptible |
|
G2267_W20 |
1281771CT |
L346 |
Lineage 4 |
90.18 |
100 |
Susceptible |
|
G2284_W26 |
1281771CT |
L346 |
Lineage 4 |
90.96 |
100 |
Susceptible |
|
G770_L4 |
1281771CT |
L346 |
Lineage 4 |
90.9 |
100 |
Susceptible |
|
G1952_L6 |
1281685CG |
L346 |
Lineage 6 |
84.47 |
100 |
Susceptible |
FQ, fluoroquinolones; RMP, rifampicin; SM, streptomicin; INH, isoniazid; PZA, pyrazinamide; KAN, kanamycin; EMB, ethambutol.