Literature DB >> 34009299

DeepEBV: A deep learning model to predict Epstein-Barr virus (EBV) integration sites.

Jiuxing Liang1, Zifeng Cui2, Canbiao Wu1, Yao Yu3,4, Rui Tian5, Hongxian Xie6, Zhuang Jin2, Weiwen Fan2, Weiling Xie2, Zhaoyue Huang2, Wei Xu2, Jingjing Zhu2, Zeshan You2, Xiaofang Guo7, Xiaofan Qiu1, Jiahao Ye1,8, Bin Lang9, Mengyuan Li2, Songwei Tan10, Zheng Hu2,11.   

Abstract

MOTIVATION: Epstein-Barr virus (EBV) is one of the most prevalent DNA oncogenic viruses. The integration of EBV into the host genome has been reported to play an important role in cancer development. The preference of EBV integration showed strong dependence on the local genomic environment, which enables the prediction of EBV integration sites.
RESULTS: An attention-based deep learning model, DeepEBV, was developed to predict EBV integration sites by learning local genomic features automatically. First, DeepEBV was trained and tested using the data from the dsVIS database. The results showed that DeepEBV with EBV integration sequences plus Repeat peaks and 2 fold data augmentation performed the best on the training dataset. Furthermore, the performance of the model was validated in an independent dataset. In addition, the motifs of DNA-binding proteins could influence the selection preference of viral insertional mutagenesis. Furthermore, the results showed that DeepEBV can predict EBV integration hotspot genes accurately. In summary, DeepEBV is a robust, accurate and explainable deep learning model, providing novel insights into EBV integration preferences and mechanisms. AVAILABILITY: DeepEBV is available as open-source software and can be downloaded from https://github.com/JiuxingLiang/DeepEBV.gitSupplementary information  Supplementary data are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 34009299     DOI: 10.1093/bioinformatics/btab388

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  1 in total

1.  Systematic Analysis and Accurate Identification of DNA N4-Methylcytosine Sites by Deep Learning.

Authors:  Lezheng Yu; Yonglin Zhang; Li Xue; Fengjuan Liu; Qi Chen; Jiesi Luo; Runyu Jing
Journal:  Front Microbiol       Date:  2022-03-15       Impact factor: 5.640

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.