Literature DB >> 35402983

Pairwise Heuristic Sequence Alignment Algorithm Based on Deep Reinforcement Learning.

Yong-Joon Song1, Dong Jin Ji1, Hyein Seo1, Gyu-Bum Han1, Dong-Ho Cho1.   

Abstract

Goal: Various methods have been developed to analyze the association between organisms and their genomic sequences. Among them, sequence alignment is the most frequently used method for comparative analysis of biological genomes. We intend to propose a novel pairwise sequence alignment method using deep reinforcement learning to break out the old pairwise alignment algorithms.
Methods: We defined the environment and agent to enable reinforcement learning in the sequence alignment system. This novel method, named DQNalign, can immediately determine the next direction by observing the subsequences within the moving window.
Results: DQNalign shows superiority in the dissimilar sequence pairs that have low identity values. And theoretically, we confirm that DQNalign has a low dimension for the sequence length in view of the complexity. Conclusions: This research shows the application method of deep reinforcement learning to the sequence alignment system and how deep reinforcement learning can improve the conventional sequence alignment method.

Entities:  

Keywords:  Deep reinforcement learning; global alignment; pairwise alignment; sequence alignment; sequence comparison

Year:  2021        PMID: 35402983      PMCID: PMC8901008          DOI: 10.1109/OJEMB.2021.3055424

Source DB:  PubMed          Journal:  IEEE Open J Eng Med Biol        ISSN: 2644-1276


  20 in total

1.  Aligning two sequences within a specified diagonal band.

Authors:  K M Chao; W R Pearson; W Miller
Journal:  Comput Appl Biosci       Date:  1992-10

Review 2.  Next-generation sequencing transforms today's biology.

Authors:  Stephan C Schuster
Journal:  Nat Methods       Date:  2007-12-19       Impact factor: 28.547

3.  A novel k-word relative measure for sequence comparison.

Authors:  Jie Tang; Keru Hua; Mengye Chen; Ruiming Zhang; Xiaoli Xie
Journal:  Comput Biol Chem       Date:  2014-11-07       Impact factor: 2.877

4.  Maximum-likelihood estimation of the statistical distribution of Smith-Waterman local sequence similarity scores.

Authors:  R Mott
Journal:  Bull Math Biol       Date:  1992-01       Impact factor: 1.758

5.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

6.  Complete genome sequence of enterohemorrhagic Escherichia coli O157:H7 and genomic comparison with a laboratory strain K-12.

Authors:  T Hayashi; K Makino; M Ohnishi; K Kurokawa; K Ishii; K Yokoyama; C G Han; E Ohtsubo; K Nakayama; T Murata; M Tanaka; T Tobe; T Iida; H Takami; T Honda; C Sasakawa; N Ogasawara; T Yasunaga; S Kuhara; T Shiba; M Hattori; H Shinagawa
Journal:  DNA Res       Date:  2001-02-28       Impact factor: 4.458

7.  Novel association strategy with copy number variation for identifying new risk Loci of human diseases.

Authors:  Xianfeng Chen; Xinlei Li; Ping Wang; Yang Liu; Zhenguo Zhang; Guoping Zhao; Haiming Xu; Jun Zhu; Xueying Qin; Suchao Chen; Landian Hu; Xiangyin Kong
Journal:  PLoS One       Date:  2010-08-20       Impact factor: 3.240

8.  Statistical distributions of optimal global alignment scores of random protein sequences.

Authors:  Hongxia Pang; Jiaowei Tang; Su-Shing Chen; Shiheng Tao
Journal:  BMC Bioinformatics       Date:  2005-10-15       Impact factor: 3.169

9.  Local sequence alignments statistics: deviations from Gumbel statistics in the rare-event tail.

Authors:  Stefan Wolfsheimer; Bernd Burghardt; Alexander K Hartmann
Journal:  Algorithms Mol Biol       Date:  2007-07-11       Impact factor: 1.405

10.  MUMmer4: A fast and versatile genome alignment system.

Authors:  Guillaume Marçais; Arthur L Delcher; Adam M Phillippy; Rachel Coston; Steven L Salzberg; Aleksey Zimin
Journal:  PLoS Comput Biol       Date:  2018-01-26       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.