Literature DB >> 30030800

MetaVW: Large-Scale Machine Learning for Metagenomics Sequence Classification.

Kévin Vervier1, Pierre Mahé2, Jean-Philippe Vert3,4,5,6.   

Abstract

Metagenomics is the study of microbial community diversity, especially the uncultured microorganisms by shotgun sequencing environmental samples. As the sequencers throughput and the data volume increase, it becomes challenging to develop scalable bioinformatics tools that reconstruct microbiome structure by binning sequencing reads to reference genomes. Standard alignment-based methods, such as BWA-MEM, provide state-of-the-art performance, but we demonstrate in Vervier et al. (2016) that compositional approaches using nucleotides motifs have faster analysis time, for comparable accuracy. In this work, we describe how to use MetaVW, a scalable machine learning implementation for short sequencing reads binning, based on their k-mers profile. We provide a step-by-step guideline on how we trained the classification models and how it can easily generalize to user-defined reference genomes and specific applications. We also give additional details on what effect parameters in the algorithm have on performances.

Entities:  

Keywords:  Binning; Classification; Machine learning; Metagenomics; Microbiology; Next-generation sequencing

Mesh:

Year:  2018        PMID: 30030800     DOI: 10.1007/978-1-4939-8561-6_2

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  3 in total

1.  High-throughput data on circular RNA reveal novel insights into chronic glomerulonephritis.

Authors:  Ya-Chen Gao; Nan-Nan Jiang; Xiu-Juan Qin; Hui Jiang; Liang-Bing Wei; Jia-Rong Gao
Journal:  Genes Genomics       Date:  2022-10-20       Impact factor: 2.164

2.  Systematic evaluation of supervised machine learning for sample origin prediction using metagenomic sequencing data.

Authors:  Julie Chih-Yu Chen; Andrea D Tyler
Journal:  Biol Direct       Date:  2020-12-10       Impact factor: 4.540

3.  Genomics enters the deep learning era.

Authors:  Etienne Routhier; Julien Mozziconacci
Journal:  PeerJ       Date:  2022-06-24       Impact factor: 3.061

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.