Literature DB >> 33990665

A convolution based computational approach towards DNA N6-methyladenine site identification and motif extraction in rice genome.

Chowdhury Rafeed Rahman1, Ruhul Amin1, Swakkhar Shatabda2, Md Sadrul Islam Toaha1.   

Abstract

DNA N6-methylation (6mA) in Adenine nucleotide is a post replication modification responsible for many biological functions. Automated and accurate computational methods can help to identify 6mA sites in long genomes saving significant time and money. Our study develops a convolutional neural network (CNN) based tool i6mA-CNN capable of identifying 6mA sites in the rice genome. Our model coordinates among multiple types of features such as PseAAC (Pseudo Amino Acid Composition) inspired customized feature vector, multiple one hot representations and dinucleotide physicochemical properties. It achieves auROC (area under Receiver Operating Characteristic curve) score of 0.98 with an overall accuracy of 93.97% using fivefold cross validation on benchmark dataset. Finally, we evaluate our model on three other plant genome 6mA site identification test datasets. Results suggest that our proposed tool is able to generalize its ability of 6mA site identification on plant genomes irrespective of plant species. An algorithm for potential motif extraction and a feature importance analysis procedure are two by products of this research. Web tool for this research can be found at: https://cutt.ly/dgp3QTR .

Entities:  

Year:  2021        PMID: 33990665     DOI: 10.1038/s41598-021-89850-9

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  26 in total

1.  Genomic N(6)-methyladenine determination by MEKC with LIF.

Authors:  Annette M Krais; Michael G Cornelius; Heinz H Schmeiser
Journal:  Electrophoresis       Date:  2010-10       Impact factor: 3.535

2.  Genome-wide high throughput analysis of DNA methylation in eukaryotes.

Authors:  Kyle R Pomraning; Kristina M Smith; Michael Freitag
Journal:  Methods       Date:  2008-10-23       Impact factor: 3.608

3.  pLogo: a probabilistic approach to visualizing sequence motifs.

Authors:  Joseph P O'Shea; Michael F Chou; Saad A Quader; James K Ryan; George M Church; Daniel Schwartz
Journal:  Nat Methods       Date:  2013-10-06       Impact factor: 28.547

4.  i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome.

Authors:  Wei Chen; Hao Lv; Fulei Nie; Hao Lin
Journal:  Bioinformatics       Date:  2019-08-15       Impact factor: 6.937

5.  Identification and analysis of adenine N6-methylation sites in the rice genome.

Authors:  Chao Zhou; Changshi Wang; Hongbo Liu; Qiangwei Zhou; Qian Liu; Yan Guo; Ting Peng; Jiaming Song; Jianwei Zhang; Lingling Chen; Yu Zhao; Zhixiong Zeng; Dao-Xiu Zhou
Journal:  Nat Plants       Date:  2018-07-30       Impact factor: 15.793

6.  iDNA6mA-PseKNC: Identifying DNA N6-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC.

Authors:  Pengmian Feng; Hui Yang; Hui Ding; Hao Lin; Wei Chen; Kuo-Chen Chou
Journal:  Genomics       Date:  2018-01-31       Impact factor: 5.736

7.  Direct detection of DNA methylation during single-molecule, real-time sequencing.

Authors:  Benjamin A Flusberg; Dale R Webster; Jessica H Lee; Kevin J Travers; Eric C Olivares; Tyson A Clark; Jonas Korlach; Stephen W Turner
Journal:  Nat Methods       Date:  2010-05-09       Impact factor: 28.547

8.  PseAAC: a flexible web server for generating various kinds of protein pseudo amino acid composition.

Authors:  Hong-Bin Shen; Kuo-Chen Chou
Journal:  Anal Biochem       Date:  2007-10-13       Impact factor: 3.365

Review 9.  N6-methyl-adenine: an epigenetic signal for DNA-protein interactions.

Authors:  Didier Wion; Josep Casadesús
Journal:  Nat Rev Microbiol       Date:  2006-03       Impact factor: 60.633

10.  6mA-RicePred: A Method for Identifying DNA N 6-Methyladenine Sites in the Rice Genome Based on Feature Fusion.

Authors:  Qianfei Huang; Jun Zhang; Leyi Wei; Fei Guo; Quan Zou
Journal:  Front Plant Sci       Date:  2020-01-31       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.