Literature DB >> 30785347

Continuous Embeddings of DNA Sequencing Reads and Application to Metagenomics.

Romain Menegaux1,2, Jean-Philippe Vert1,2,3,4.   

Abstract

We propose a new model for fast classification of DNA sequences output by next-generation sequencing machines. The model, which we call fastDNA, embeds DNA sequences in a vector space by learning continuous low-dimensional representations of the k-mers it contains. We show on metagenomics benchmarks that it outperforms the state-of-the-art methods in terms of accuracy and scalability.

Entities:  

Keywords:  classification; embedding.; metagenomics; sequencing

Mesh:

Substances:

Year:  2019        PMID: 30785347     DOI: 10.1089/cmb.2018.0174

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  6 in total

1.  Deep learning of a bacterial and archaeal universal language of life enables transfer learning and illuminates microbial dark matter.

Authors:  A Hoarfrost; A Aptekmann; G Farfañuk; Y Bromberg
Journal:  Nat Commun       Date:  2022-05-11       Impact factor: 17.694

2.  Variable number tandem repeats mediate the expression of proximal genes.

Authors:  Mehrdad Bakhtiari; Jonghun Park; Yuan-Chun Ding; Sharona Shleizer-Burko; Susan L Neuhausen; Bjarni V Halldórsson; Kári Stefánsson; Melissa Gymrek; Vineet Bafna
Journal:  Nat Commun       Date:  2021-04-06       Impact factor: 14.919

3.  Using Cartesian Doubt To Build a Sequencing-Based View of Microbiology.

Authors:  Braden T Tierney; Erika Szymanski; James R Henriksen; Aleksandar D Kostic; Chirag J Patel
Journal:  mSystems       Date:  2021-10-12       Impact factor: 7.324

Review 4.  Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation.

Authors:  Alban Mathieu; Mickael Leclercq; Melissa Sanabria; Olivier Perin; Arnaud Droit
Journal:  Front Microbiol       Date:  2022-03-14       Impact factor: 5.640

5.  Genomics enters the deep learning era.

Authors:  Etienne Routhier; Julien Mozziconacci
Journal:  PeerJ       Date:  2022-06-24       Impact factor: 3.061

Review 6.  Representation learning applications in biological sequence analysis.

Authors:  Hitoshi Iuchi; Taro Matsutani; Keisuke Yamada; Natsuki Iwano; Shunsuke Sumi; Shion Hosoda; Shitao Zhao; Tsukasa Fukunaga; Michiaki Hamada
Journal:  Comput Struct Biotechnol J       Date:  2021-05-23       Impact factor: 7.271

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.