Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Systematic Comparisons for Composition Profiles, Taxonomic Levels, and Machine Learning Methods for Microbiome-Based Disease Prediction.

Literature DB >> 33392266

Systematic Comparisons for Composition Profiles, Taxonomic Levels, and Machine Learning Methods for Microbiome-Based Disease Prediction.

Kuncheng Song¹, Fred A Wright², Yi-Hui Zhou³.

Abstract

Microbiome composition profiles generated from 16S rRNA sequencing have been extensively studied for their usefulness in phenotype trait prediction, including for complex diseases such as diabetes and obesity. These microbiome compositions have typically been quantified in the form of Operational Taxonomic Unit (OTU) count matrices. However, alternate approaches such as Amplicon Sequence Variants (ASV) have been used, as well as the direct use of k-mer sequence counts. The overall effect of these different types of predictors when used in concert with various machine learning methods has been difficult to assess, due to varied combinations described in the literature. Here we provide an in-depth investigation of more than 1,000 combinations of these three clustering/counting methods, in combination with varied choices for normalization and filtering, grouping at various taxonomic levels, and the use of more than ten commonly used machine learning methods for phenotype prediction. The use of short k-mers, which have computational advantages and conceptual simplicity, is shown to be effective as a source for microbiome-based prediction. Among machine-learning approaches, tree-based methods show consistent, though modest, advantages in prediction accuracy. We describe the various advantages and disadvantages of combinations in analysis approaches, and provide general observations to serve as a useful guide for future trait-prediction explorations using microbiome data.

Entities: Chemical Disease Species

Keywords: amplicon sequence variant (ASV); k-mers; machine learning method; operational taxonomic unit (OTU); phenotype prediction; phylogenetic analysis

Year: 2020 PMID： 33392266 PMCID： PMC7772236 DOI： 10.3389/fmolb.2020.610845

Source DB: PubMed Journal: Front Mol Biosci ISSN： 2296-889X

4 in total

1. Benchmark of Data Processing Methods and Machine Learning Models for Gut Microbiome-Based Diagnosis of Inflammatory Bowel Disease.

Authors: Ryszard Kubinski; Jean-Yves Djamen-Kepaou; Timur Zhanabaev; Alex Hernandez-Garcia; Stefan Bauer; Falk Hildebrand; Tamas Korcsmaros; Sani Karam; Prévost Jantchou; Kamran Kafi; Ryan D Martin
Journal: Front Genet Date: 2022-02-14 Impact factor: 4.599

2. Improve the Colorectal Cancer Diagnosis Using Gut Microbiome Data.

Authors: Yi-Hui Zhou; George Sun
Journal: Front Mol Biosci Date: 2022-08-12

Review 3. Towards multi-label classification: Next step of machine learning for microbiome research.

Authors: Shunyao Wu; Yuzhu Chen; Zhiruo Li; Jian Li; Fengyang Zhao; Xiaoquan Su
Journal: Comput Struct Biotechnol J Date: 2021-04-28 Impact factor: 7.271

4. Whole-genome sequencing and gene sharing network analysis powered by machine learning identifies antibiotic resistance sharing between animals, humans and environment in livestock farming.

Authors: Zixin Peng; Alexandre Maciel-Guerra; Michelle Baker; Xibin Zhang; Yue Hu; Wei Wang; Jia Rong; Jing Zhang; Ning Xue; Paul Barrow; David Renney; Dov Stekel; Paul Williams; Longhai Liu; Junshi Chen; Fengqin Li; Tania Dottorini
Journal: PLoS Comput Biol Date: 2022-03-25 Impact factor: 4.475

4 in total