Literature DB >> 30994904

DeepSignal: detecting DNA methylation state from Nanopore sequencing reads using deep-learning.

Peng Ni1, Neng Huang1, Zhi Zhang1, De-Peng Wang2, Fan Liang2, Yu Miao2, Chuan-Le Xiao3, Feng Luo4, Jianxin Wang1.   

Abstract

MOTIVATION: The Oxford Nanopore sequencing enables to directly detect methylation states of bases in DNA from reads without extra laboratory techniques. Novel computational methods are required to improve the accuracy and robustness of DNA methylation state prediction using Nanopore reads.
RESULTS: In this study, we develop DeepSignal, a deep learning method to detect DNA methylation states from Nanopore sequencing reads. Testing on Nanopore reads of Homo sapiens (H. sapiens), Escherichia coli (E. coli) and pUC19 shows that DeepSignal can achieve higher performance at both read level and genome level on detecting 6 mA and 5mC methylation states comparing to previous hidden Markov model (HMM) based methods. DeepSignal achieves similar performance cross different DNA methylation bases, different DNA methylation motifs and both singleton and mixed DNA CpG. Moreover, DeepSignal requires much lower coverage than those required by HMM and statistics based methods. DeepSignal can achieve 90% above accuracy for detecting 5mC and 6 mA using only 2× coverage of reads. Furthermore, for DNA CpG methylation state prediction, DeepSignal achieves 90% correlation with bisulfite sequencing using just 20× coverage of reads, which is much better than HMM based methods. Especially, DeepSignal can predict methylation states of 5% more DNA CpGs that previously cannot be predicted by bisulfite sequencing. DeepSignal can be a robust and accurate method for detecting methylation states of DNA bases.
AVAILABILITY AND IMPLEMENTATION: DeepSignal is publicly available at https://github.com/bioinfomaticsCSU/deepsignal. SUPPLEMENTARY INFORMATION: Supplementary data are available at bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2019        PMID: 30994904     DOI: 10.1093/bioinformatics/btz276

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  52 in total

1.  Gaussian mixture model-based unsupervised nucleotide modification number detection using nanopore-sequencing readouts.

Authors:  Hongxu Ding; Andrew D Bailey; Miten Jain; Hugh Olsen; Benedict Paten
Journal:  Bioinformatics       Date:  2020-12-08       Impact factor: 6.937

Review 2.  Experimental approaches to tracking mobile genetic elements in microbial communities.

Authors:  Christina C Saak; Cong B Dinh; Rachel J Dutton
Journal:  FEMS Microbiol Rev       Date:  2020-09-01       Impact factor: 16.408

Review 3.  Long-read human genome sequencing and its applications.

Authors:  Glennis A Logsdon; Mitchell R Vollger; Evan E Eichler
Journal:  Nat Rev Genet       Date:  2020-06-05       Impact factor: 53.242

4.  Nanopore sequencing of native adeno-associated virus (AAV) single-stranded DNA using a transposase-based rapid protocol.

Authors:  Marco T Radukic; David Brandt; Markus Haak; Kristian M Müller; Jörn Kalinowski
Journal:  NAR Genom Bioinform       Date:  2020-09-28

5.  DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning.

Authors:  Michael A Boemo
Journal:  BMC Genomics       Date:  2021-06-09       Impact factor: 3.969

6.  Novel thermophilic polyhydroxyalkanoates producing strain Aneurinibacillus thermoaerophilus CCM 8960.

Authors:  Jana Musilova; Xenie Kourilova; Iva Pernicova; Matej Bezdicek; Martina Lengerova; Stanislav Obruca; Karel Sedlar
Journal:  Appl Microbiol Biotechnol       Date:  2022-06-27       Impact factor: 4.813

7.  ClusterMine: A knowledge-integrated clustering approach based on expression profiles of gene sets.

Authors:  Hong-Dong Li; Yunpei Xu; Xiaoshu Zhu; Quan Liu; Gilbert S Omenn; Jianxin Wang
Journal:  J Bioinform Comput Biol       Date:  2020-06       Impact factor: 1.122

8.  Cotton D genome assemblies built with long-read data unveil mechanisms of centromere evolution and stress tolerance divergence.

Authors:  Zhaoen Yang; Xiaoyang Ge; Weinan Li; Yuying Jin; Lisen Liu; Wei Hu; Fuyan Liu; Yanli Chen; Shaoliang Peng; Fuguang Li
Journal:  BMC Biol       Date:  2021-06-03       Impact factor: 7.431

9.  iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool.

Authors:  Xiao Yang; Xiucai Ye; Xuehong Li; Lesong Wei
Journal:  Front Genet       Date:  2021-03-31       Impact factor: 4.599

Review 10.  Modern epigenetics methods in biological research.

Authors:  Yuanyuan Li
Journal:  Methods       Date:  2020-07-06       Impact factor: 3.608

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.