| Literature DB >> 32123556 |
Guishan Zhang1, Zhiming Dai2,3, Xianhua Dai1,4.
Abstract
CRISPR/Cas9 is a hot genomic editing tool, but its success is limited by the widely varying target efficiencies among different single guide RNAs (sgRNAs). In this study, we proposed C-RNNCrispr, a hybrid convolutional neural networks (CNNs) and bidirectional gate recurrent unit network (BGRU) framework, to predict CRISPR/Cas9 sgRNA on-target activity. C-RNNCrispr consists of two branches: sgRNA branch and epigenetic branch. The network receives the encoded binary matrix of sgRNA sequence and four epigenetic features as inputs, and produces a regression score. We introduced a transfer learning approach by using small-size datasets to fine-tune C-RNNCrispr model that were pre-trained from benchmark dataset, leading to substantially improved predictive performance. Experiments on commonly used datasets showed C-RNNCrispr outperforms the state-of-the-art methods in terms of prediction accuracy and generalization. Source codes are available at https://github.com/Peppags/C_RNNCrispr.Entities:
Keywords: Bidirectional gate recurrent unit network; CRISPR/Cas9; Convolutional neural network; On-target; sgRNA
Year: 2020 PMID: 32123556 PMCID: PMC7037582 DOI: 10.1016/j.csbj.2020.01.013
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1An overview of C-RNNCrispr architecture. We trained the C-RNNCrispr from scratch on the benchmark dataset. Then, we fine-tuned the well-trained pre-trained C-RNNCrispr model on small-size datasets.
The variants of C-RNNCrispr models compared in this work.
| Model | Architecture |
|---|---|
| C-RNNCrispr_std | Using one convolutional layer with 256 1D filtering kernels of length 5 with dropout layer |
| C-RNNCrispr_2conv | Using two convolutional layers with 256 1D filtering kernels of length 5 |
| C-RNNCrispr_len7 | Using one convolutional layer with 256 1D filtering kernels of length 7 |
| C-RNNCrispr_avepool2 | Using average-pooling layer of window size 2 |
| C-RNNCrispr_ndrop | Using one convolutional layer with 256 1D filtering kernels of length 5 without using dropout layer |
Note: The descriptions of four C-RNNCrispr variants are relative to C-RNNCrispr_std model descripted in Section 2.3.
Four transfer learning strategies for C-RNNCrispr model.
| Strategy | Transfer learning procedure |
|---|---|
| Fine tune | Only the weights in the last two fully connected layers of sgRNA and epigenetic branches as well as the last fully connected layer of C-RNNCrispr are trainable |
| Frozen CNN | Freeze the weights of CNN layers |
| Frozen BGRU | Freeze the weights of BGRU layer |
| Frozen FC | Freeze the weights of fully connected layers |
Performance comparisons amongst different architectures under 5-fold cross-validation on benchmark dataset.
| Model | Spearman correlation | AUROC |
|---|---|---|
| C-RNNCrispr_std | ||
| C-RNNCrispr_2conv | 0.833 | 0.972 |
| C-RNNCrispr_len7 | 0.833 | 0.975 |
| C-RNNCrispr_avepool2 | 0.833 | 0.974 |
| C-RNNCrispr_ndrop | 0.833 | 0.969 |
Note: Performance is shown as mean ± standard deviation. This representation also applies to Table 4. The best performance (as measured by each metric) across different architectures is highlighted in bold for clarification. These highlights also apply to Table 4, Table 6, Supplementary Tables 2, 3 and 4.
Performance comparison among C-RNNCrispr and its two variant architectures (i.e., without CNN and without BGRU) on benchmark dataset under 5-fold cross-validation.
| Model | Spearman correlation | AUROC |
|---|---|---|
| C-RNNCrispr | ||
| without CNN | 0.831 | 0.971 |
| without BGRU | 0.817 | 0.942 |
Performance comparisons amongst five deep learning models based on target sequence composition on four cell line datasets under 5-fold cross-validation.
| Model | HCT116 | HEK293T | HELA | HL60 | Average |
|---|---|---|---|---|---|
| (a) Spearman correlation | |||||
| C-RNNCrispr | |||||
| Seq_deepCpf1 | 0.672 | 0.651 | 0.558 | 0.637 | |
| DeepCRISPR | 0.650 | 0.035 | 0.510 | 0.200 | 0.349 |
| DeepCas9 | 0.603 | −0.116 | 0.418 | 0.118 | 0.256 |
| DeepCas9 + TF | 0.683 | 0.572 | 0.675 | 0.495 | 0.606 |
| (b) AUROC | |||||
| C-RNNCrispr | 0.934 | ||||
| Seq_deepCpf1 | 0.921 | 0.928 | 0.942 | ||
| DeepCRISPR | 0.887 | 0.474 | 0.788 | 0.584 | 0.683 |
| DeepCas9 | 0.784 | 0.470 | 0.677 | 0.535 | 0.617 |
| DeepCas9 + TF | 0.902 | 0.905 | 0.902 | 0.887 | 0.899 |
Note: The top table records Spearman correlation values while the bottom one records AUROC values. DeepCas9 + TF: DeepCas9 + transfer learning. The performance of DeepCRISPR is take from [5].
Fig. 2Performance comparison of different transfer learning strategies for sgRNA activity prediction on four cell line-specific training data under 5-fold cross-validation.
Fig. 3Performance comparison of C-RNNCrispr training from scratch and transfer learning via fine tune for each cell line data by 5-fold cross-validation.
Existing deep learning-based methods for sgRNA on-target activity prediction.
| Model | Model | Sequence | Training mode | Reference |
|---|---|---|---|---|
| Seq_deepCpf1 | 1D CNN | sgRNA | Transfer learning | |
| DeepCRISPR | 2D CNN | sgRNA + Epi | Transfer learning | |
| DeepCas9 | 1D CNN | sgRNA | From scratch | |
| C-RNNCrispr | 1D CNN-BGRU | sgRNA + Epi | Transfer learning | – |
Note: sgRNA + Epi, sequence features and epigenetic features.
Fig. 4Performance comparison of C-RNNCrispr and other learning-based prediction models on different testing cell line data under 5-fold cross-validation.
Fig. 5Performance comparison of C-RNNCrispr and other learning-based prediction models on different testing cell line data with a leave-one-cell-out procedure.
Fig. 6Visualization of the importance of different nucleotides and epigenetic features at different positions for our C-RNNCrispr. The colors represent the contribution of the position-specific nucleotide and epigenetic features to determining an efficient sgRNA. The nucleotides and epigenetic features are arranged vertically, whereas the positions of the sequence are placed horizontally.