Literature DB >> 30209183

Identifying Mixed Mycobacterium tuberculosis Infection and Laboratory Cross-Contamination during Mycobacterial Sequencing Programs.

David H Wyllie1,2,3, Esther Robinson4, Tim Peto5,3, Derrick W Crook5,3, Adebisi Ajileye4, Priti Rathod4, Rosemarie Allen4, Lisa Jarrett4, E Grace Smith4, A Sarah Walker5,3.   

Abstract

The detection of laboratory cross-contamination and mixed tuberculosis infections is an important goal of clinical mycobacteriology laboratories. The objective of this study was to develop a method to detect mixtures of different Mycobacterium tuberculosis lineages in laboratories performing mycobacterial next-generation sequencing (NGS). The setting was the Public Health England National Mycobacteriology Laboratory Birmingham, which performs Illumina sequencing on DNA extracted from positive mycobacterial growth indicator tubes. We analyzed 4,156 samples yielding M. tuberculosis from 663 MiSeq runs, which were obtained during development and production use of a diagnostic process using NGS. The counts of the most common (major) variant and all other variants (nonmajor variants) were determined from reads mapping to positions defining M. tuberculosis lineages. Expected variation was estimated during process development. For each sample, we determined the nonmajor variant proportions at 55 sets of lineage-defining positions. The nonmajor variant proportion in the two most mixed lineage-defining sets (F2 metric) was compared with that of the 47 least-mixed lineage-defining sets (F47 metric). The following three patterns were observed: (i) not mixed by either metric; (ii) high F47 metric, suggesting mixtures of multiple lineages; and (iii) samples compatible with mixtures of two lineages, detected by differential F2 metric elevations relative to F47. Pattern ii was observed in batches, with similar patterns in the M. tuberculosis H37Rv control present in each run, and is likely to reflect cross-contamination. During production, the proportions of samples in the patterns were 97%, 2.8%, and 0.001%, respectively. The F2 and F47 metrics described could be used for laboratory process control in laboratories sequencing M. tuberculosis genomes.
Copyright © 2018 Wyllie et al.

Entities:  

Keywords:  Mycobacterium tuberculosis; cross-contamination; mixed infection; next-generation sequencing; quality control

Mesh:

Substances:

Year:  2018        PMID: 30209183      PMCID: PMC6204665          DOI: 10.1128/JCM.00923-18

Source DB:  PubMed          Journal:  J Clin Microbiol        ISSN: 0095-1137            Impact factor:   5.948


INTRODUCTION

Mycobacterium tuberculosis is an organism which has coevolved with humans during the early migrations of modern humans, diverging from a common M. tuberculosis ancestor about 75,000 years ago (1). Distinct lineages, corresponding to evolution occurring during these early migrations, are readily identified by next-generation sequencing (NGS), with each lineage characterized by ancient single-nucleotide variants (SNVs) which define deep branches in the M. tuberculosis phylogeny (1, 2).

Multiple M. tuberculosis lineages.

Infection by multiple lineages of tuberculosis (TB) is well described and has been detected by observing mixed results on fractional sequencing (e.g., spoligotyping and mycobacterial interspersed repetitive-unit–variable-number tandem-repeat [MIRU-VNTR]) and validated by the characterization of multiple individual picks from solid medium (3). Multilineage infection is characterized by isolates differing by many hundreds of SNVs, in which respect it differs from the increasingly recognized and more common in-host microevolution (4). Reported rates of mixed infection vary markedly, as reviewed previously (3, 5), with rates between 10% and 30% reported in areas of current (5–7) or historical (8, 9) high prevalence. Much lower rates are reported in low-incidence countries (3), although systematic underdetection is likely to occur due to both the limited representation of bacteria in single-sputum samples of pulmonary disease and the decrease in diversity occurring during differential strain growth in broth culture (10).

Implications of mixed infection.

Mixed infection, assessed by either MIRU-VNTR polymorphisms (7) or by heterogeneity in drug susceptibility testing (11), is independently associated with reduced drug treatment response, so there are compelling clinical reasons to try to identify it. There are also important technical implications of isolating mixed TB strains from a culture. First, such a finding may reflect cross-contamination within the laboratory (3). Second, mixed infection complicates the interpretation of drug resistance tests, whether phenotypic or genotypic, as one or other coinfecting strains may dominate the results from these tests. Third, it complicates the understanding of relatedness when techniques, such as SNV distance computation, are applied, as these generally assume that a single sequence is present when basecalling (5, 12–14), marking mixed sites as uncertain. Maximal likelihood tree drawing algorithms assume that such “uncertain” sites contain no information and impute a single nucleotide at each of such positions, an approach which may be inappropriate in the presence of mixtures. Increasingly, NGS-based species and resistance determination is becoming routine in mycobacteriology laboratories and has been deployed in reference laboratories in England (15, 16). As part of the quality control and accreditation of the routine process now operating in these laboratories, we describe an approach to identifying mixed samples using Illumina next-generation sequencing data, illustrating its use by studying over 4,000 consecutive positive cultures from a single reference laboratory.

MATERIALS AND METHODS

Isolation of DNA from mycobacteria and sequencing.

This study includes all mycobacteria processed between 01 June 2015 and 30 December 2017 in the Public Health England Midlands and North Reference Laboratory, whose catchment is approximately 15 million people, or about one-third of England. Clinical specimens were decontaminated and inoculated into mycobacterial growth indicator tubes (MGIT). During the process (Fig. 1), positive MGITs were batched when they became available, either following growth in the local laboratory or following receipt from another laboratory. Since a positive MGIT may contain more than one M. tuberculosis strain, and since subculture is not performed prior to DNA extraction and sequencing, the DNA extracted from such MGIT cultures may derive from multiple M. tuberculosis strains. Positive-control samples (H37Rv strain) were also grown in MGIT cultures. Batches of positive samples were extracted using a manual process, exactly as described in the supplemental methods in a previous study (16). Illumina sequencing libraries were prepared using Nextera XT chemistry from equal amounts of 12 (from 20 April 2015 to 31 July 2015) or 16 (from 01 August 2015 to 30 December 2017) mycobacterial DNA extracts (16), using manual steps (Table 1). Positive-control DNA (H37Rv, obtained from ATCC) was included as one of the 12 or 16 extracts in all libraries, either from a contemporaneously extracted broth culture or from stored DNA. Libraries were loaded into an Illumina MiSeq instrument and sequenced (16).
FIG 1

Laboratory and bioinformatic processing. GP, general practitioner; −ve, negative; +ve, positive; LIMS, lab information management system.

TABLE 1

Samples analyzed

Development stageNo. of sequences (samples and controls)Date range (mo-day-yr)No. of individuals providing samplesMiSeq run identifiersNo. of clinical samplesNo. of MiSeq Runs
Development93804-20-2015 to 12-15-2015630101–291776154
Preproduction1,16704-01-2016 to 12-06-20167531152–1522919163
Production2,19112-07-2016 to 12-30-20171,4811523–23071,794346
Laboratory and bioinformatic processing. GP, general practitioner; −ve, negative; +ve, positive; LIMS, lab information management system. Samples analyzed

Routine bioinformatic processing.

The routine bioinformatics pipeline deployed by Public Health England has been previously described (15). Briefly, reads were first processed using the Mykrobe predictor tool, which identifies Mycobacterium tuberculosis bacteria using species-specific k-mers (17). Specimens identified as containing M. tuberculosis bacteria were further processed (16) and mapped to the H37Rv v2 genome (NCBI RefSeq no. NC_000962.2) (18), as described previously (16), and vcf files were generated using SAMtools mPileup, with additional basecalling using GATK VariantAnnotator v2.1. A consensus base is called from high-quality bases provided one base accounts for >90% of the pileup; otherwise, the base is recorded as uncertain (“N”) (15).

Estimation of the minor variant frequency within a set of lineage-defining positions.

In this study, we evaluated an additional step not present in the production pipeline. This evaluates high-quality base counts (identified by the BaseCounts VCF tag) at positions known to be lineage associated (2), which were extracted and summarized using code available at https://github.com/davidhwyllie/VCFMIX. We elected to study sites known to be subject to lineage-specific variation to maximize power in detecting interlineage mixtures, as nonlineage-associated sites can appear to be mixed for technical reasons, e.g., due to mismapping. Coll et al. (2) described the M. tuberculosis phylogeny and identified 62 sets of nucleotide positions defining the deep branches of the Mycobacterium tuberculosis lineage. At each position within a nucleotide set, in one particular clade, one nucleotide is uniquely present (i.e., is not present in any other of the known clades). These sets contain a median of 108 nucleotide positions (range, 1 to 898 positions). In this analysis, we considered 55 branches, excluding branches 1.2, 3.1, 3.1.2, 4.1.2, 4.3.4.2.1, 4.6, and 4.7 because they contain fewer than 20 positions, making estimates of minor variation in these positions less reliable than estimates in other branches. The minor variant frequency at a set of bases can be due to sequencing error, mapping error, and/or bona fide interlineage mixtures (see Fig. S1A and B in the supplemental material). Minor variant frequencies were determined as follows: if there are n bases in a lineage-defining set, we count the high-quality depths (d) at each base, e.g., if n = 3 and d1 = 30, d2 = 70, and d3 = 100, the total depth (D) is For each position, we also identify the most common base; the minor depth, m, is the total depth minus the most common base depth. If the minor depths are m1 = 3, m2 = 7, and m3 = 10, the total minor depth (M) is We estimate the minor allele fraction p in the set as M/D = 0.1.

F2 and F47 metrics.

If sequences from two different M. tuberculosis lineages are mixed together, the sets which uniquely define these lineages are mixed (Fig. S1C); there are a minimum of two and maximum of eight sets affected (e.g., a lineage 5/7 mixture mixes two sets of lineage-defining nucleotides, a 2.1/4.2.1 mixes five sets, and a 4.1.1.1/3.1.2.1 mixture mixes 8 sets). Only if more than two samples are mixed will more than 8 sets be mixed. In this work, we describe two metrics reflecting mixing. Having computed the minor allele frequency estimates of p1, p2, and p55, we can sort these in descending order, identifying the sets with the highest and lowest minor allele frequencies. We then estimate the minor variant frequency across the nucleotides in the top two (F2 metric) and lowest 47 (F47 metric) sets. For example, if there are n47 nucleotides in the lowest 47 sets, we compute F47 by M47/D47, where and The underpinning assumptions are that mixtures of biological origin are most likely to occur between two lineages, and, therefore, F2 is the most sensitive metric for identifying these. Since between 2 and 8 sets are mixed in such genuine coinfections, the lowest 47 (55 − 8) sets are not be mixed; thus, the F47 metric is more sensitive for identifying laboratory contamination involving more than two samples.

Regression modeling.

Because of high leverage by a small number of observations, we used quantile regression to estimate the relationship between the median values of log-transformed noncallable base numbers and log-transformed F47, using the quantreg R package (R 3.3.1).

Lineages to which the M. tuberculosis strains studied belong.

To describe the samples studied, we identified lineage using consensus basecalling in these 55 branches. If the signature SNV of a branch was called as uncertain, we called only to the level of the branch deeper (i.e., closer to the root) than the uncertain call. If more than one different lineage-defining variant was called or we could not call any lineage-defining positions, we reported the samples as “lineage not defined.” This process does not form part of the computation of F2 or F47 metrics.

Ethics framework.

Only anonymized data were used in this work; therefore, approval from an ethics institution was not required.

Data availability.

The data analyzed are available at https://ora.ox.ac.uk/objects/uuid:5e4ec1f8-e212-47db-8910-161a303a0757.

RESULTS

Samples studied.

A total of 4,156 samples were included since they were identified using MyKrobe (17) as belonging to the M. tuberculosis complex and had at least 0.5 × 106 read pairs mapped to the H37Rv reference genome, a criterion reflecting successful DNA extraction and sequencing. These sequences were highly diverse, originating from six branches of lineage 1 (n = 320), five branches from lineage 2 (n = 278), five branches of lineage 3 (n = 1,010), and 30 branches of lineage 4 (n = 2,266). A total of 106 samples were from M. bovis or M. africanum, and 176 samples did not have their lineages defined (but were included in calculations of F2 and F47 metrics). The laboratory processes operated under the following three different phases: in the first phase, development, laboratory processes were being actively refined; in the second phase, preproduction, laboratory processes were fixed and controlled by standard operating procedures, with version-controlled changes; and the third production state was similar to the second, except that the process had received ISO15189 accreditation.

Variation in lineage-defining positions in H37Rv controls.

The F2 mixture metric reflects the estimated mixture in the two most-mixed lineage-defining sets; in the H37Rv controls, this follows a distribution skewed to the right (Fig. 2A). The MiSeq runs with H37Rv controls with F2 mixture metrics in the top 5% (Fig. 2A) are temporally clustered (red lines on Fig. 2B and C), with a number of examples in the development phase. Among clinical (noncontrol) samples, variation in the F2 metric is explained in part by the MiSeq run (Kruskal-Wallis test, P < 10−16), and a strong correlation exists between the F2 metric in H37Rv controls and that in clinical samples on the same plate (ρ = 0.61; 95% confidence interval [CI], 0.56 to 0.61; Spearman's rank correlation). That is, in plates with elevated F2 metrics in the H37Rv control, the clinical samples are more likely to have elevated F2, as is evident visually (see, e.g., Fig. 2B and C, around run 2,301).
FIG 2

Mixtures in H37Rv controls. (A) Histogram showing the F2 metric, which reflects the mixture in the two most-mixed lineage-associated sets, in H37Rv control DNA. (B) Median F2 metric among clinical samples other than H37Rv; red lines indicate that the F2 mixture metric in H37Rv controls is raised (as shown in panel A). (C) F2 metric for each M. tuberculosis sequence from a clinical sample.

Mixtures in H37Rv controls. (A) Histogram showing the F2 metric, which reflects the mixture in the two most-mixed lineage-associated sets, in H37Rv control DNA. (B) Median F2 metric among clinical samples other than H37Rv; red lines indicate that the F2 mixture metric in H37Rv controls is raised (as shown in panel A). (C) F2 metric for each M. tuberculosis sequence from a clinical sample.

Different patterns of mixtures were observed during development.

We ordered specimens first by the order of the plates analyzed and the order in which the bioinformatics processing was completed, which is the order that an automated quality control monitoring system would encounter output. During the development phase (Fig. 3), blocks of samples derived from runs with elevated mixtures in the H37Rv control are seen (red bars in Fig. 3A), coincident with clear increases in both F2 and F47 metrics (Fig. 3B and C), reflecting elevations in mixed bases across most or all lineage-defining positions (Fig. 3D). These blocks of samples typically span multiple MiSeq runs (Fig. 3A and B). In addition to the blocks of samples with elevated F2 and F47 metrics, we also observed small numbers of single samples with elevated F2 but not F47 metrics (Fig. 3B, arrow). The pattern with single samples is expected in cases of interlineage mixtures of only two samples (see Fig. S1, Supplemental Methods). These patterns were also seen in the subsequent phases (see Fig. S2 to S5, yellow dots).
FIG 3

Mixture metrics in the development phase. (A) Samples are arranged first by the order of the MiSeq runs (depicted as solid gray blocks) and the order bioinformatics processing was completed. Only samples yielding M. tuberculosis are shown, which is why some blocks are longer than others. If the H37Rv control samples had increased F2 statistics, a red bar is shown above each sample in panel A. (B to D) We depicted the F2 (B) and F47 metrics (C), as well as the estimated mixture F in each of the 58 lineage-defining sets (D). The arrow illustrates a sample with elevated F2 but low F47 metrics.

Mixture metrics in the development phase. (A) Samples are arranged first by the order of the MiSeq runs (depicted as solid gray blocks) and the order bioinformatics processing was completed. Only samples yielding M. tuberculosis are shown, which is why some blocks are longer than others. If the H37Rv control samples had increased F2 statistics, a red bar is shown above each sample in panel A. (B to D) We depicted the F2 (B) and F47 metrics (C), as well as the estimated mixture F in each of the 58 lineage-defining sets (D). The arrow illustrates a sample with elevated F2 but low F47 metrics.

Mixtures of multiple lineages are common.

Based on the pattern observed in the development phase, we categorized samples as having one of the following: (i) neither F2 nor F47 raised, (ii) raised F47, or (iii) raised F2 without raised F47. We defined a raised F2 and F47 as more than 10× and 5× the respective median metric during development in all control and clinical samples, cutoffs which correspond to 4.7% (F2) and 0.2% (F47) minor variant frequencies across the relevant lineage-defining sets, respectively. In the preproduction and production phases, F2 and F47 values below these thresholds (reflecting unmixed samples) were observed in 97.5% of the samples studied, raised F47 and F2 values (reflecting a mixture of multiple samples) were in observed 2.5% of the samples, and six samples (0.001%) had raised F2 but normal F47 values (Table 2).
TABLE 2

Detection of mixtures in clinical samples

Development stage (n)No. (%) of mixtures with:
Neither F2 nor F47 raisedF2 raised, but F47 normalF47 raised
Preproduction (919)900 (98)5 (0.003)14 (1.1)
Production (1,794)1,741 (97)1 (0.001)52 (2.8)
Detection of mixtures in clinical samples

Isolated F2 metric elevation is rare.

Isolated elevation of F2 is expected if bacteria from two different lineages are mixed. In Fig. 4, we show the minor variant frequencies from all samples from the six individuals with raised F2 but normal F47 metrics. In one case, patient 3, two technical repeats of the same sample (sample 2) showed the same pattern, as did a separate sample taken contemporaneously. In other cases (patients 1, 5, and 6), the mixed pattern was only observed in one out of two positive samples taken on the same day, and in two cases (patients 2 and 3), only a single sample was positive. Thus, between 1 and 6 samples of the 4,156 samples studied may truly reflect mixed coinfections.
FIG 4

Consistency of isolated F2 elevation in individuals. Six individuals with elevated F2 but not F47 statistics were identified during the preproduction and production phases. The observed minor variant proportions for all deep branches analyzed are shown in a heatmap. For example, patient 4 had two samples taken in December 2015; sample 2 was analyzed twice (sequencing identification numbers [IDs] 0d9d5 and 9276f), and sample 3 was analyzed once. A similar pattern of minor variation is seen in all three samples.

Consistency of isolated F2 elevation in individuals. Six individuals with elevated F2 but not F47 statistics were identified during the preproduction and production phases. The observed minor variant proportions for all deep branches analyzed are shown in a heatmap. For example, patient 4 had two samples taken in December 2015; sample 2 was analyzed twice (sequencing identification numbers [IDs] 0d9d5 and 9276f), and sample 3 was analyzed once. A similar pattern of minor variation is seen in all three samples.

Impact of interlineage variation on basecalling.

One obvious question is whether very low-level cross-contamination impacts the consensus sequence which can discerned from the pileup. As cross-contamination increases, at some point minor variant frequencies in some parts of the genome will start to rise above the 10% cutoff specified by the basecalling algorithm. The numbers of uncalled bases will then rise; this relationship can be observed in Fig. 5, where the number of uncallable bases rises rapidly when F47 exceeds the cutoff value but only slowly below it. Below the cutoff value of 4.7% (red line in Fig. 5), which is 10× the median (black line in Fig. 5), the number of uncallable bases increased by 1.25-fold (95% CI, 1.22- to 1.28-fold) for every 10-fold increase in F47; above the cutoff, the corresponding increase was 9.24-fold (95% CI, 5.5- to 11.2-fold).
FIG 5

The relationship between the F47 metric and the number of uncallable bases is shown. The red line corresponds to the cutoff used to define F47 as being elevated.

The relationship between the F47 metric and the number of uncallable bases is shown. The red line corresponds to the cutoff used to define F47 as being elevated.

DISCUSSION

In this work, we describe methods for monitoring the presence of mixtures of Mycobacterium tuberculosis of different lineages. The methods described are computationally efficient and will add little to the overall costs of sequencing and reporting. Our approach assumes that multiple lineages and sublineages of M. tuberculosis are being sequenced contemporaneously; this is the case in our setting and is also true globally (19, 20). Using single-nucleotide variants, each of which uniquely defines a branch in the phylogenetic tree of M. tuberculosis, we can show two patterns of mixtures. The first, which occurred in about 2.5% of samples during the preproduction and production phases of our project, is indicative of multiple samples being mixed together, since mixtures are seen in most or all of the lineage-defining branches. This occurred in batches, was characterized by cross-contamination at levels of less than 1%, and can be monitored by a metric we term F47. This pattern likely reflects process failures. The strength of the F47 metric is that the depth analyzed is very high, as about 5,000 nucleotides typically contribute across the lineage-defining sets included in it. If there is a sequencing depth of 50 to 100 reads at each of these, the effective sequencing depth analyzed is of the order of 25,000 to 50,000 reads, making the detection of minor variations at sub-1% levels readily feasible with high statistical confidence. Possible points of cross-contamination include DNA extraction, which occurs in batches, and library preparation, which also occurs in batches. Low-level cross-contamination during centrifugation of a batch of samples in a microcentrifuge is one possibility. Whether such cross-contamination would be expected to affect the H37Rv internal control depends on whether this control is processed end to end with each batch or whether H37Rv DNA is preprepared and added at the library construction stage. Such low-level cross-contamination, as observed during our production process and illustrated in Fig. S2 to S4, is likely to have minimal influence on inference drawn from the sequence, unless highly sensitive assays for heteroresistance are required. However, given the rarity of detection of intersample mixtures during production use of M. tuberculosis sequencing, repeating extraction and sequencing of samples in batches with elevated F47 metrics could be considered. A sensitive metric, such as F47, will also allow the early detection of emerging problems and allow a review of the process as part of continuous quality improvement. A second class of mixture, which was rarely detected in this setting, is compatible with coinfection with two organisms of differing lineages within the patient. This kind of mixture is clinically relevant (7, 11) and may be underdetected using the laboratory process we describe here, since culture-based amplification can reduce diversity in the sample inoculated (10). Its frequency may rise if direct-from-sample sequencing is employed or if samples from areas with high endemicity are studied, but here, we identified only one probable case of such mixtures and five other possible cases. Confirmatory approaches are available; microbiological techniques conducted separately on multiple picks from the same samples have been used as validation (3). A limitation of this study is that we could not undertake such work, as only multiply subcultured stored isolates exist for historical samples. Techniques for reconstructing the contributing sequences also have been described in detail (21–23), and we did not study them here. Another limitation is that we were not able to consider six of the lineage-defining sets in the study by Coll et al., because they covered <20 nucleotide positions; therefore, we considered that they did not contain sufficient information to be used in F2 or F47 metrics. A consequence is that our method did not identify mixtures of samples if they only involved mixtures in these excluded branches. Practically, such observations should be assessed on a case-by-case basis, having reviewed results from other samples from the same patient, treatment history, and basecalls at resistance loci for possible heteroresistance. More intensive follow-up could be considered (3), but there are few data to define optimal management. The clinical use of bacterial genome sequencing is rising (16, 17, 24), and given the importance of M. tuberculosis and the complexity of treatment, M. tuberculosis has been one of the first organisms tested in such a way (15). The processes followed involve multiple steps at which the opportunity for cross-contamination exists. The availability of tools monitoring the critical aspects of the laboratory process is required for accreditation under ISO15189, and the F2 and F47 metrics described here will contribute to addressing this gap.
  24 in total

1.  Metagenomic analysis of tuberculosis in a mummy.

Authors:  Jacqueline Z-M Chan; Martin J Sergeant; Oona Y-C Lee; David E Minnikin; Gurdyal S Besra; Ilidkó Pap; Mark Spigelman; Helen D Donoghue; Mark J Pallen
Journal:  N Engl J Med       Date:  2013-07-18       Impact factor: 91.245

2.  Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations.

Authors:  Sergio Pulido-Tamayo; Aminael Sánchez-Rodríguez; Toon Swings; Bram Van den Bergh; Akanksha Dubey; Hans Steenackers; Jan Michiels; Jan Fostier; Kathleen Marchal
Journal:  Nucleic Acids Res       Date:  2015-05-18       Impact factor: 16.971

Review 3.  Mixed-strain mycobacterium tuberculosis infections and the implications for tuberculosis treatment and control.

Authors:  Ted Cohen; Paul D van Helden; Douglas Wilson; Caroline Colijn; Megan M McLaughlin; Ibrahim Abubakar; Robin M Warren
Journal:  Clin Microbiol Rev       Date:  2012-10       Impact factor: 26.132

4.  An investigation on the population structure of mixed infections of Mycobacterium tuberculosis in Inner Mongolia, China.

Authors:  Xiaoying Wang; Haican Liu; Jianhao Wei; Xiaocui Wu; Qin Yu; Xiuqin Zhao; Jianxin Lyu; Yongliang Lou; Kanglin Wan
Journal:  Tuberculosis (Edinb)       Date:  2015-10-19       Impact factor: 3.131

5.  Recurrence due to relapse or reinfection with Mycobacterium tuberculosis: a whole-genome sequencing approach in a large, population-based cohort with a high HIV infection prevalence and active follow-up.

Authors:  José Afonso Guerra-Assunção; Rein M G J Houben; Amelia C Crampin; Themba Mzembe; Kim Mallard; Francesc Coll; Palwasha Khan; Louis Banda; Arthur Chiwaya; Rui P A Pereira; Ruth McNerney; David Harris; Julian Parkhill; Taane G Clark; Judith R Glynn
Journal:  J Infect Dis       Date:  2014-10-21       Impact factor: 5.226

6.  Out-of-Africa migration and Neolithic coexpansion of Mycobacterium tuberculosis with modern humans.

Authors:  Iñaki Comas; Mireia Coscolla; Tao Luo; Sonia Borrell; Kathryn E Holt; Midori Kato-Maeda; Julian Parkhill; Bijaya Malla; Stefan Berg; Guy Thwaites; Dorothy Yeboah-Manu; Graham Bothamley; Jian Mei; Lanhai Wei; Stephen Bentley; Simon R Harris; Stefan Niemann; Roland Diel; Abraham Aseffa; Qian Gao; Douglas Young; Sebastien Gagneux
Journal:  Nat Genet       Date:  2013-09-01       Impact factor: 38.330

7.  Deep Whole-Genome Sequencing to Detect Mixed Infection of Mycobacterium tuberculosis.

Authors:  Mingyu Gan; Qingyun Liu; Chongguang Yang; Qian Gao; Tao Luo
Journal:  PLoS One       Date:  2016-07-08       Impact factor: 3.240

8.  Genomic analysis of globally diverse Mycobacterium tuberculosis strains provides insights into the emergence and spread of multidrug resistance.

Authors:  Abigail L Manson; Keira A Cohen; Thomas Abeel; Christopher A Desjardins; Derek T Armstrong; Clifton E Barry; Jeannette Brand; Sinéad B Chapman; Sang-Nae Cho; Andrei Gabrielian; James Gomez; Andreea M Jodals; Moses Joloba; Pontus Jureen; Jong Seok Lee; Lesibana Malinga; Mamoudou Maiga; Dale Nordenberg; Ecaterina Noroc; Elena Romancenco; Alex Salazar; Willy Ssengooba; A A Velayati; Kathryn Winglee; Aksana Zalutskaya; Laura E Via; Gail H Cassell; Susan E Dorman; Jerrold Ellner; Parissa Farnia; James E Galagan; Alex Rosenthal; Valeriu Crudu; Daniela Homorodean; Po-Ren Hsueh; Sujatha Narayanan; Alexander S Pym; Alena Skrahina; Soumya Swaminathan; Martie Van der Walt; David Alland; William R Bishai; Ted Cohen; Sven Hoffner; Bruce W Birren; Ashlee M Earl
Journal:  Nat Genet       Date:  2017-01-16       Impact factor: 38.330

9.  Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study.

Authors:  Timothy M Walker; Camilla L C Ip; Ruth H Harrell; Jason T Evans; Georgia Kapatai; Martin J Dedicoat; David W Eyre; Daniel J Wilson; Peter M Hawkey; Derrick W Crook; Julian Parkhill; David Harris; A Sarah Walker; Rory Bowden; Philip Monk; E Grace Smith; Tim E A Peto
Journal:  Lancet Infect Dis       Date:  2012-11-15       Impact factor: 25.071

10.  Rapid, comprehensive, and affordable mycobacterial diagnosis with whole-genome sequencing: a prospective study.

Authors:  Louise J Pankhurst; Carlos Del Ojo Elias; Antonina A Votintseva; Timothy M Walker; Kevin Cole; Jim Davies; Jilles M Fermont; Deborah M Gascoyne-Binzi; Thomas A Kohl; Clare Kong; Nadine Lemaitre; Stefan Niemann; John Paul; Thomas R Rogers; Emma Roycroft; E Grace Smith; Philip Supply; Patrick Tang; Mark H Wilcox; Sarah Wordsworth; David Wyllie; Li Xu; Derrick W Crook
Journal:  Lancet Respir Med       Date:  2015-12-04       Impact factor: 102.642

View more
  5 in total

1.  In-host population dynamics of Mycobacterium tuberculosis complex during active disease.

Authors:  Roger Vargas; Luca Freschi; Maximillian Marin; L Elaine Epperson; Melissa Smith; Irina Oussenko; David Durbin; Michael Strong; Max Salfinger; Maha Reda Farhat
Journal:  Elife       Date:  2021-02-01       Impact factor: 8.140

2.  Towards standardisation: comparison of five whole genome sequencing (WGS) analysis pipelines for detection of epidemiologically linked tuberculosis cases.

Authors:  Rana Jajou; Thomas A Kohl; Timothy Walker; Anders Norman; Daniela Maria Cirillo; Elisa Tagliani; Stefan Niemann; Albert de Neeling; Troels Lillebaek; Richard M Anthony; Dick van Soolingen
Journal:  Euro Surveill       Date:  2019-12

3.  Rapid and accurate SNP genotyping of clonal bacterial pathogens with BioHansel.

Authors:  Geneviève Labbé; Peter Kruczkiewicz; James Robertson; Philip Mabon; Justin Schonfeld; Daniel Kein; Marisa A Rankin; Matthew Gopez; Darian Hole; David Son; Natalie Knox; Chad R Laing; Kyrylo Bessonov; Eduardo N Taboada; Catherine Yoshida; Kim Ziebell; Anil Nichani; Roger P Johnson; Gary Van Domselaar; John H E Nash
Journal:  Microb Genom       Date:  2021-09

4.  Application of Metagenomic Next-Generation Sequencing in Mycobacterium tuberculosis Infection.

Authors:  Yaoguang Li; Mengfan Jiao; Ying Liu; Zhigang Ren; Ang Li
Journal:  Front Med (Lausanne)       Date:  2022-04-01

Review 5.  Mycobacterium bovis: From Genotyping to Genome Sequencing.

Authors:  Ana M S Guimaraes; Cristina K Zimpel
Journal:  Microorganisms       Date:  2020-05-03
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.