Literature DB >> 31267129

Data imbalance in CRISPR off-target prediction.

Yuli Gao1, Guohui Chuai1, Weichuan Yu2, Shen Qu1, Qi Liu1,3.   

Abstract

For genome-wide CRISPR off-target cleavage sites (OTS) prediction, an important issue is data imbalance-the number of true OTS recognized by whole-genome off-target detection techniques is much smaller than that of all possible nucleotide mismatch loci, making the training of machine learning model very challenging. Therefore, computational models proposed for OTS prediction and scoring should be carefully designed and properly evaluated in order to avoid bias. In our study, two tools are taken as examples to further emphasize the data imbalance issue in CRISPR off-target prediction to achieve better sensitivity and specificity for optimized CRISPR gene editing. We would like to indicate that (1) the benchmark of CRISPR off-target prediction should be properly evaluated and not overestimated by considering data imbalance issue; (2) incorporation of efficient computational techniques (including ensemble learning and data synthesis techniques) can help to address the data imbalance issue and improve the performance of CRISPR off-target prediction. Taking together, we call for more efforts to address the data imbalance issue in CRISPR off-target prediction to facilitate clinical utility of CRISPR-based gene editing techniques.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  CRISPR; data imbalance; off-target prediction

Year:  2019        PMID: 31267129     DOI: 10.1093/bib/bbz069

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  6 in total

1.  crisprSQL: a novel database platform for CRISPR/Cas off-target cleavage assays.

Authors:  Florian Störtz; Peter Minary
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

2.  Prediction of CRISPR/Cas9 single guide RNA cleavage efficiency and specificity by attention-based convolutional neural networks.

Authors:  Guishan Zhang; Tian Zeng; Zhiming Dai; Xianhua Dai
Journal:  Comput Struct Biotechnol J       Date:  2021-03-07       Impact factor: 7.271

3.  Effective use of sequence information to predict CRISPR-Cas9 off-target.

Authors:  Zhong-Rui Zhang; Zhen-Ran Jiang
Journal:  Comput Struct Biotechnol J       Date:  2022-01-19       Impact factor: 7.271

4.  ACP-DA: Improving the Prediction of Anticancer Peptides Using Data Augmentation.

Authors:  Xian-Gan Chen; Wen Zhang; Xiaofei Yang; Chenhong Li; Hengling Chen
Journal:  Front Genet       Date:  2021-06-30       Impact factor: 4.599

5.  Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study.

Authors:  Chao Feng; Shufen Liu; Hao Zhang; Renchu Guan; Dan Li; Fengfeng Zhou; Yanchun Liang; Xiaoyue Feng
Journal:  Int J Mol Sci       Date:  2020-03-22       Impact factor: 5.923

Review 6.  Computational approaches for effective CRISPR guide RNA design and evaluation.

Authors:  Guanqing Liu; Yong Zhang; Tao Zhang
Journal:  Comput Struct Biotechnol J       Date:  2019-11-29       Impact factor: 7.271

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.