Literature DB >> 29960078

Towards better prediction of Mycobacterium tuberculosis lineages from MIRU-VNTR data.

Nithum Thain1, Christopher Le1, Aldo Crossa2, Shama Desai Ahuja2, Jeanne Sullivan Meissner2, Barun Mathema3, Barry Kreiswirth4, Natalia Kurepina4, Ted Cohen5, Leonid Chindelevitch6.   

Abstract

The determination of lineages from strain-based molecular genotyping information is an important problem in tuberculosis. Mycobacterial interspersed repetitive unit-variable number tandem repeat (MIRU-VNTR) typing is a commonly used molecular genotyping approach that uses counts of the number of times pre-specified loci repeat in a strain. There are three main approaches for determining lineage based on MIRU-VNTR data - one based on a direct comparison to the strains in a curated database, and two others, on machine learning algorithms trained on a large collection of labeled data. All existing methods have limitations. The direct approach imposes an arbitrary threshold on how much a database strain can differ from a given one to be informative. On the other hand, the machine learning-based approaches require a substantial amount of labeled data. Notably, all three methods exhibit suboptimal classification accuracy without additional data. We explore several computational approaches to address these limitations. First, we show that eliminating the arbitrary threshold improves the performance of the direct approach. Second, we introduce RuleTB, an alternative direct method that proposes a concise set of rules for determining lineages. Lastly, we propose StackTB, a machine learning approach that requires only a fraction of the training data to outperform the accuracy of both existing machine learning methods. Our approaches demonstrate superior performance on a training dataset collected in New York City over 10 years, and the improvement in performance translates to a held-out testing set. We conclude that our methods provide opportunities for improving the determination of pathogenic lineages based on MIRU-VNTR data.
Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Interpretability; Lineage; MIRU-VNTR; Machine learning; Mycobacterium tuberculosis

Mesh:

Year:  2018        PMID: 29960078      PMCID: PMC6708508          DOI: 10.1016/j.meegid.2018.06.029

Source DB:  PubMed          Journal:  Infect Genet Evol        ISSN: 1567-1348            Impact factor:   3.342


  22 in total

1.  Variable-number tandem repeat typing of Mycobacterium tuberculosis isolates with low copy numbers of IS6110 by using mycobacterial interspersed repetitive units.

Authors:  Lauren Steinlein Cowan; Laura Mosher; Lois Diem; Jeffrey P Massey; Jack T Crawford
Journal:  J Clin Microbiol       Date:  2002-05       Impact factor: 5.948

2.  A recently evolved sublineage of the Mycobacterium tuberculosis Beijing strain family is associated with an increased ability to spread and cause disease.

Authors:  M Hanekom; G D van der Spuy; E Streicher; S L Ndabambi; C R E McEvoy; M Kidd; N Beyers; T C Victor; P D van Helden; R M Warren
Journal:  J Clin Microbiol       Date:  2007-03-14       Impact factor: 5.948

3.  Evaluation and strategy for use of MIRU-VNTRplus, a multifunctional database for online analysis of genotyping data and phylogenetic identification of Mycobacterium tuberculosis complex isolates.

Authors:  Caroline Allix-Béguec; Dag Harmsen; Thomas Weniger; Philip Supply; Stefan Niemann
Journal:  J Clin Microbiol       Date:  2008-06-11       Impact factor: 5.948

4.  Transmission of multidrug-resistant tuberculosis in the UK: a cross-sectional molecular and epidemiological study of clustering and contact tracing.

Authors:  Laura F Anderson; Surinder Tamne; Timothy Brown; John P Watson; Catherine Mullarkey; Dominik Zenner; Ibrahim Abubakar
Journal:  Lancet Infect Dis       Date:  2014-03-04       Impact factor: 25.071

5.  Proposal for standardization of optimized mycobacterial interspersed repetitive unit-variable-number tandem repeat typing of Mycobacterium tuberculosis.

Authors:  Philip Supply; Caroline Allix; Sarah Lesjean; Mara Cardoso-Oelemann; Sabine Rüsch-Gerdes; Eve Willery; Evgueni Savine; Petra de Haas; Henk van Deutekom; Solvig Roring; Pablo Bifani; Natalia Kurepina; Barry Kreiswirth; Christophe Sola; Nalin Rastogi; Vincent Vatin; Maria Cristina Gutierrez; Maryse Fauville; Stefan Niemann; Robin Skuce; Kristin Kremer; Camille Locht; Dick van Soolingen
Journal:  J Clin Microbiol       Date:  2006-09-27       Impact factor: 5.948

6.  Major Mycobacterium tuberculosis lineages associate with patient country of origin.

Authors:  Michael B Reed; Victoria K Pichler; Fiona McIntosh; Alicia Mattia; Ashley Fallow; Speranza Masala; Pilar Domenech; Alice Zwerling; Louise Thibert; Dick Menzies; Kevin Schwartzman; Marcel A Behr
Journal:  J Clin Microbiol       Date:  2009-02-11       Impact factor: 5.948

Review 7.  Mycobacterium africanum--review of an important cause of human tuberculosis in West Africa.

Authors:  Bouke C de Jong; Martin Antonio; Sebastien Gagneux
Journal:  PLoS Negl Trop Dis       Date:  2010-09-28

8.  Progression to active tuberculosis, but not transmission, varies by Mycobacterium tuberculosis lineage in The Gambia.

Authors:  Bouke C de Jong; Philip C Hill; Alex Aiken; Timothy Awine; Martin Antonio; Ifedayo M Adetifa; Dolly J Jackson-Sillah; Annette Fox; Kathryn Deriemer; Sebastien Gagneux; Martien W Borgdorff; Keith P W J McAdam; Tumani Corrah; Peter M Small; Richard A Adegbola
Journal:  J Infect Dis       Date:  2008-10-01       Impact factor: 5.226

9.  Mycobacterium tuberculosis mutation rate estimates from different lineages predict substantial differences in the emergence of drug-resistant tuberculosis.

Authors:  Christopher B Ford; Rupal R Shah; Midori Kato Maeda; Sebastien Gagneux; Megan B Murray; Ted Cohen; James C Johnston; Jennifer Gardy; Marc Lipsitch; Sarah M Fortune
Journal:  Nat Genet       Date:  2013-06-09       Impact factor: 38.330

10.  High functional diversity in Mycobacterium tuberculosis driven by genetic drift and human demography.

Authors:  Ruth Hershberg; Mikhail Lipatov; Peter M Small; Hadar Sheffer; Stefan Niemann; Susanne Homolka; Jared C Roach; Kristin Kremer; Dmitri A Petrov; Marcus W Feldman; Sebastien Gagneux
Journal:  PLoS Biol       Date:  2008-12-16       Impact factor: 8.029

View more
  2 in total

1.  Novel methods included in SpolLineages tool for fast and precise prediction of Mycobacterium tuberculosis complex spoligotype families.

Authors:  David Couvin; Wilfried Segretier; Erick Stattner; Nalin Rastogi
Journal:  Database (Oxford)       Date:  2020-12-15       Impact factor: 3.451

Review 2.  Precision, Equity, and Public Health and Epidemiology Informatics - A Scoping Review.

Authors:  David L Buckeridge
Journal:  Yearb Med Inform       Date:  2020-08-21
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.