Literature DB >> 35309001

First-line drug resistance profiling of Mycobacterium tuberculosis: a machine learning approach.

Stephanie J Müller1, Rebone L Meraba1, Gciniwe S Dlamini1, Darlington S Mapiye1.   

Abstract

The persistence and emergence of new multi-drug resistant Mycobacterium tuberculosis (M. tb) strains continues to advance the devastating tuberculosis (TB) epidemic. Robust systems are needed to accurately and rapidly perform drug-resistance profiling, and machine learning (ML) methods combined with genomic sequence data may provide novel insights into drug-resistance mechanisms. Using 372 M. tb isolates, the combined utility of ML and bioinformatics to perform drug-resistance profiling is demonstrated. SNPs, InDels, and dinucleotide frequencies are explored as input features for three ML models, namely Decision Trees, Random Forest, and the eXtreme Gradient Boosted model. Using SNPs and InDels, all three models performed equally well yielding a 99% accuracy, 97% recall, and 99% F1-score. Using dinucleotide frequencies, the XGBoost algorithm was superior with a 97% accuracy, 94% recall and 97% F1-score. This study validates the use of variants and presents dinucleotide features as another effective feature encoding method for ML-based phenotype classification. ©2021 AMIA - All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35309001      PMCID: PMC8861754     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  23 in total

Review 1.  Decision trees: an overview and their use in medicine.

Authors:  Vili Podgorelec; Peter Kokol; Bruno Stiglic; Ivan Rozman
Journal:  J Med Syst       Date:  2002-10       Impact factor: 4.460

2.  The MycoBrowser portal: a comprehensive and manually annotated resource for mycobacterial genomes.

Authors:  Adamandia Kapopoulou; Jocelyne M Lew; Stewart T Cole
Journal:  Tuberculosis (Edinb)       Date:  2010-10-25       Impact factor: 3.131

3.  Next-generation ion torrent sequencing of drug resistance mutations in Mycobacterium tuberculosis strains.

Authors:  Luke T Daum; John D Rodriguez; Sue A Worthy; Nazir A Ismail; Shaheed V Omar; Andries W Dreyer; P Bernard Fourie; Anwar A Hoosen; James P Chambers; Gerald W Fischer
Journal:  J Clin Microbiol       Date:  2012-09-12       Impact factor: 5.948

4.  The PATRIC Bioinformatics Resource Center: expanding data and analysis capabilities.

Authors:  James J Davis; Alice R Wattam; Ramy K Aziz; Thomas Brettin; Ralph Butler; Rory M Butler; Philippe Chlenski; Neal Conrad; Allan Dickerman; Emily M Dietrich; Joseph L Gabbard; Svetlana Gerdes; Andrew Guard; Ronald W Kenyon; Dustin Machi; Chunhong Mao; Dan Murphy-Olson; Marcus Nguyen; Eric K Nordberg; Gary J Olsen; Robert D Olson; Jamie C Overbeek; Ross Overbeek; Bruce Parrello; Gordon D Pusch; Maulik Shukla; Chris Thomas; Margo VanOeffelen; Veronika Vonstein; Andrew S Warren; Fangfang Xia; Dawen Xie; Hyunseung Yoo; Rick Stevens
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

5.  Genome-wide SNP and InDel mutations in Mycobacterium tuberculosis associated with rifampicin and isoniazid resistance.

Authors:  Haicheng Li; Huixin Guo; Tao Chen; Li Yu; Yuhui Chen; Jiao Zhao; Huimin Yan; Mu Chen; Qi Sun; Chenchen Zhang; Lin Zhou; Liang Chen
Journal:  Int J Clin Exp Pathol       Date:  2018-08-01

6.  Classification of COVID-19 and Other Pathogenic Sequences: A Dinucleotide Frequency and Machine Learning Approach.

Authors:  Gciniwe S Dlamini; Stephanie J Muller; Rebone L Meraba; Richard A Young; James Mashiyane; Tapiwa Chiwewe; Darlington S Mapiye
Journal:  IEEE Access       Date:  2020-10-15       Impact factor: 3.367

7.  Performance of MTBDR plus for detecting high/low levels of Mycobacterium tuberculosis resistance to isoniazid.

Authors:  F Brossier; N Veziris; V Jarlier; W Sougakoff
Journal:  Int J Tuberc Lung Dis       Date:  2009-02       Impact factor: 2.373

8.  Application of machine learning techniques to tuberculosis drug resistance analysis.

Authors:  Samaneh Kouchaki; Yang Yang; Timothy M Walker; A Sarah Walker; Daniel J Wilson; Timothy E A Peto; Derrick W Crook; David A Clifton
Journal:  Bioinformatics       Date:  2019-07-01       Impact factor: 6.937

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

10.  Machine learning for classifying tuberculosis drug-resistance from DNA sequencing data.

Authors:  Yang Yang; Katherine E Niehaus; Timothy M Walker; Zamin Iqbal; A Sarah Walker; Daniel J Wilson; Tim E A Peto; Derrick W Crook; E Grace Smith; Tingting Zhu; David A Clifton
Journal:  Bioinformatics       Date:  2018-05-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.