| Literature DB >> 30209183 |
David H Wyllie1,2,3, Esther Robinson4, Tim Peto5,3, Derrick W Crook5,3, Adebisi Ajileye4, Priti Rathod4, Rosemarie Allen4, Lisa Jarrett4, E Grace Smith4, A Sarah Walker5,3.
Abstract
The detection of laboratory cross-contamination and mixed tuberculosis infections is an important goal of clinical mycobacteriology laboratories. The objective of this study was to develop a method to detect mixtures of different Mycobacterium tuberculosis lineages in laboratories performing mycobacterial next-generation sequencing (NGS). The setting was the Public Health England National Mycobacteriology Laboratory Birmingham, which performs Illumina sequencing on DNA extracted from positive mycobacterial growth indicator tubes. We analyzed 4,156 samples yielding M. tuberculosis from 663 MiSeq runs, which were obtained during development and production use of a diagnostic process using NGS. The counts of the most common (major) variant and all other variants (nonmajor variants) were determined from reads mapping to positions defining M. tuberculosis lineages. Expected variation was estimated during process development. For each sample, we determined the nonmajor variant proportions at 55 sets of lineage-defining positions. The nonmajor variant proportion in the two most mixed lineage-defining sets (F2 metric) was compared with that of the 47 least-mixed lineage-defining sets (F47 metric). The following three patterns were observed: (i) not mixed by either metric; (ii) high F47 metric, suggesting mixtures of multiple lineages; and (iii) samples compatible with mixtures of two lineages, detected by differential F2 metric elevations relative to F47. Pattern ii was observed in batches, with similar patterns in the M. tuberculosis H37Rv control present in each run, and is likely to reflect cross-contamination. During production, the proportions of samples in the patterns were 97%, 2.8%, and 0.001%, respectively. The F2 and F47 metrics described could be used for laboratory process control in laboratories sequencing M. tuberculosis genomes.Entities:
Keywords: Mycobacterium tuberculosis; cross-contamination; mixed infection; next-generation sequencing; quality control
Mesh:
Substances:
Year: 2018 PMID: 30209183 PMCID: PMC6204665 DOI: 10.1128/JCM.00923-18
Source DB: PubMed Journal: J Clin Microbiol ISSN: 0095-1137 Impact factor: 5.948
FIG 1Laboratory and bioinformatic processing. GP, general practitioner; −ve, negative; +ve, positive; LIMS, lab information management system.
Samples analyzed
| Development stage | No. of sequences (samples and controls) | Date range (mo-day-yr) | No. of individuals providing samples | MiSeq run identifiers | No. of clinical samples | No. of MiSeq Runs |
|---|---|---|---|---|---|---|
| Development | 938 | 04-20-2015 to 12-15-2015 | 630 | 101–291 | 776 | 154 |
| Preproduction | 1,167 | 04-01-2016 to 12-06-2016 | 753 | 1152–1522 | 919 | 163 |
| Production | 2,191 | 12-07-2016 to 12-30-2017 | 1,481 | 1523–2307 | 1,794 | 346 |
FIG 2Mixtures in H37Rv controls. (A) Histogram showing the F2 metric, which reflects the mixture in the two most-mixed lineage-associated sets, in H37Rv control DNA. (B) Median F2 metric among clinical samples other than H37Rv; red lines indicate that the F2 mixture metric in H37Rv controls is raised (as shown in panel A). (C) F2 metric for each M. tuberculosis sequence from a clinical sample.
FIG 3Mixture metrics in the development phase. (A) Samples are arranged first by the order of the MiSeq runs (depicted as solid gray blocks) and the order bioinformatics processing was completed. Only samples yielding M. tuberculosis are shown, which is why some blocks are longer than others. If the H37Rv control samples had increased F2 statistics, a red bar is shown above each sample in panel A. (B to D) We depicted the F2 (B) and F47 metrics (C), as well as the estimated mixture F in each of the 58 lineage-defining sets (D). The arrow illustrates a sample with elevated F2 but low F47 metrics.
Detection of mixtures in clinical samples
| Development stage ( | No. (%) of mixtures with: | ||
|---|---|---|---|
| Neither F2 nor F47 raised | F2 raised, but F47 normal | F47 raised | |
| Preproduction (919) | 900 (98) | 5 (0.003) | 14 (1.1) |
| Production (1,794) | 1,741 (97) | 1 (0.001) | 52 (2.8) |
FIG 4Consistency of isolated F2 elevation in individuals. Six individuals with elevated F2 but not F47 statistics were identified during the preproduction and production phases. The observed minor variant proportions for all deep branches analyzed are shown in a heatmap. For example, patient 4 had two samples taken in December 2015; sample 2 was analyzed twice (sequencing identification numbers [IDs] 0d9d5 and 9276f), and sample 3 was analyzed once. A similar pattern of minor variation is seen in all three samples.
FIG 5The relationship between the F47 metric and the number of uncallable bases is shown. The red line corresponds to the cutoff used to define F47 as being elevated.