Literature DB >> 33385273

sgRNACNN: identifying sgRNA on-target activity in four crops using ensembles of convolutional neural networks.

Mengting Niu1, Yuan Lin2, Quan Zou3.   

Abstract

KEY MESSAGE: We proposed an ensemble convolutional neural network model to identify sgRNA high on-target activity in four crops and we used one-hot encoding and k-mers for sequence encoding. As an important component of the CRISPR/Cas9 system, single-guide RNA (sgRNA) plays an important role in gene redirection and editing. sgRNA has played an important role in the improvement of agronomic species, but there is a lack of effective bioinformatics tools to identify the activity of sgRNA in agronomic species. Therefore, it is necessary to develop a method based on machine learning to identify sgRNA high on-target activity. In this work, we proposed a simple convolutional neural network method to identify sgRNA high on-target activity. Our study used one-hot encoding and k-mers for sequence data conversion and a voting algorithm for constructing the convolutional neural network ensemble model sgRNACNN for the prediction of sgRNA activity. The ensemble model sgRNACNN was used for predictions in four crops: Glycine max, Zea mays, Sorghum bicolor and Triticum aestivum. The accuracy rates of the four crops in the sgRNACNN model were 82.43%, 80.33%, 78.25% and 87.49%, respectively. The experimental results showed that sgRNACNN realizes the identification of high on-target activity sgRNA of agronomic data and can meet the demands of sgRNA activity prediction in agronomy to a certain extent. These results have certain significance for guiding crop gene editing and academic research. The source code and relevant dataset can be found in the following link: https://github.com/nmt315320/sgRNACNN.git .

Entities:  

Keywords:  Convolutional neural networks; One-hot encoding; k-mer; sgRNA high on-target activity

Mesh:

Substances:

Year:  2021        PMID: 33385273     DOI: 10.1007/s11103-020-01102-y

Source DB:  PubMed          Journal:  Plant Mol Biol        ISSN: 0167-4412            Impact factor:   4.076


  35 in total

1.  Genome-wide identification of CRISPR/Cas9 off-targets in human genome.

Authors:  Jinzhi Duan; Guangqing Lu; Zhan Xie; Mingliang Lou; Jiao Luo; Lei Guo; Yu Zhang
Journal:  Cell Res       Date:  2014-07-01       Impact factor: 25.617

2.  Deep learning improves prediction of CRISPR-Cpf1 guide RNA activity.

Authors:  Hui Kwon Kim; Seonwoo Min; Myungjae Song; Soobin Jung; Jae Woo Choi; Younggwang Kim; Sangeun Lee; Sungroh Yoon; Hyongbum Henry Kim
Journal:  Nat Biotechnol       Date:  2018-01-29       Impact factor: 54.908

3.  Computational and Biological Methods for Gene Therapy.

Authors:  Liang Cheng
Journal:  Curr Gene Ther       Date:  2019       Impact factor: 4.391

4.  SSizer: Determining the Sample Sufficiency for Comparative Biological Study.

Authors:  Fengcheng Li; Ying Zhou; Xiaoyu Zhang; Jing Tang; Qingxia Yang; Yang Zhang; Yongchao Luo; Jie Hu; Weiwei Xue; Yunqing Qiu; Qiaojun He; Bo Yang; Feng Zhu
Journal:  J Mol Biol       Date:  2020-02-07       Impact factor: 5.469

5.  A deep recurrent neural network discovers complex biological rules to decipher RNA protein-coding potential.

Authors:  Steven T Hill; Rachael Kuintzle; Amy Teegarden; Erich Merrill; Padideh Danaee; David A Hendrix
Journal:  Nucleic Acids Res       Date:  2018-09-19       Impact factor: 16.971

6.  Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9.

Authors:  John G Doench; Nicolo Fusi; Meagan Sullender; Mudra Hegde; Emma W Vaimberg; Jennifer Listgarten; Katherine F Donovan; Ian Smith; Zuzana Tothova; Craig Wilen; Robert Orchard; Herbert W Virgin; David E Root
Journal:  Nat Biotechnol       Date:  2016-01-18       Impact factor: 54.908

7.  ge-CRISPR - An integrated pipeline for the prediction and analysis of sgRNAs genome editing efficiency for CRISPR/Cas system.

Authors:  Karambir Kaur; Amit Kumar Gupta; Akanksha Rajput; Manoj Kumar
Journal:  Sci Rep       Date:  2016-09-01       Impact factor: 4.379

8.  InfAcrOnt: calculating cross-ontology term similarities using information flow by a random walk.

Authors:  Liang Cheng; Yue Jiang; Hong Ju; Jie Sun; Jiajie Peng; Meng Zhou; Yang Hu
Journal:  BMC Genomics       Date:  2018-01-19       Impact factor: 3.969

9.  NOREVA: normalization and evaluation of MS-based metabolomics data.

Authors:  Bo Li; Jing Tang; Qingxia Yang; Shuang Li; Xuejiao Cui; Yinghong Li; Yuzong Chen; Weiwei Xue; Xiaofeng Li; Feng Zhu
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

Review 10.  Computational Methods for Identifying Similar Diseases.

Authors:  Liang Cheng; Hengqiang Zhao; Pingping Wang; Wenyang Zhou; Meng Luo; Tianxin Li; Junwei Han; Shulin Liu; Qinghua Jiang
Journal:  Mol Ther Nucleic Acids       Date:  2019-09-28       Impact factor: 8.886

View more
  15 in total

1.  Substitution effects via aromaticity, polarizability, APT, AIM, IR analysis, and hydrogen adsorption in C20-nTin nanostructures: a DFT survey.

Authors:  Yan Cao; Abdol Ghaffar Ebadi; Zahra Rahmani; Mohammad Reza Poor Heravi; Esmail Vessally
Journal:  J Mol Model       Date:  2021-11-08       Impact factor: 1.810

2.  Cytotoxicity properties of plant-mediated synthesized K-doped ZnO nanostructures.

Authors:  Mojtaba Haghighat; Hajar Q Alijani; Masoomeh Ghasemi; Sajad Khosravi; Fariba Borhani; Fatemeh Sharifi; Siavash Iravani; Kazem Najafi; Mehrdad Khatami
Journal:  Bioprocess Biosyst Eng       Date:  2021-09-28       Impact factor: 3.210

3.  Actin gamma 1 is a critical regulator of pancreatic ductal adenocarcinoma.

Authors:  Yichen Tang; Xuehui Peng; Xiaobing Huang; Jing Li
Journal:  Saudi J Gastroenterol       Date:  2022 May-Jun       Impact factor: 3.214

4.  In vitro and in vivo anticancer effect of pH-responsive paclitaxel-loaded niosomes.

Authors:  Mahmood Barani; Mohammad Reza Hajinezhad; Saman Sargazi; Abbas Rahdar; Sheida Shahraki; Azadeh Lohrasbi-Nejad; Francesco Baino
Journal:  J Mater Sci Mater Med       Date:  2021-12-04       Impact factor: 3.896

5.  Deep-4mCGP: A Deep Learning Approach to Predict 4mC Sites in Geobacter pickeringii by Using Correlation-Based Feature Selection Technique.

Authors:  Hasan Zulfiqar; Qin-Lai Huang; Hao Lv; Zi-Jie Sun; Fu-Ying Dao; Hao Lin
Journal:  Int J Mol Sci       Date:  2022-01-23       Impact factor: 5.923

6.  Interactions between favipiravir and a BNC cage towards drug delivery applications.

Authors:  Azar Asgari Pari; Mohammad Yousefi
Journal:  Struct Chem       Date:  2021-09-08       Impact factor: 1.887

Review 7.  DrugHybrid_BS: Using Hybrid Feature Combined With Bagging-SVM to Predict Potentially Druggable Proteins.

Authors:  Yuxin Gong; Bo Liao; Peng Wang; Quan Zou
Journal:  Front Pharmacol       Date:  2021-11-30       Impact factor: 5.810

8.  Taguchi-Assisted Optimization Technique and Density Functional Theory for Green Synthesis of a Novel Cu-MOF Derived From Caffeic Acid and Its Anticancerious Activities.

Authors:  Malihe Zeraati; Ali Mohammadi; Somayeh Vafaei; Narendra Pal Singh Chauhan; Ghasem Sargazi
Journal:  Front Chem       Date:  2021-11-26       Impact factor: 5.221

Review 9.  Next Generation Cereal Crop Yield Enhancement: From Knowledge of Inflorescence Development to Practical Engineering by Genome Editing.

Authors:  Lei Liu; Penelope L Lindsay; David Jackson
Journal:  Int J Mol Sci       Date:  2021-05-13       Impact factor: 5.923

10.  CRBPDL: Identification of circRNA-RBP interaction sites using an ensemble neural network approach.

Authors:  Mengting Niu; Quan Zou; Chen Lin
Journal:  PLoS Comput Biol       Date:  2022-01-20       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.