| Literature DB >> 34104972 |
Shuntaro Chiba1, Kenji Rowel Q Lim2, Narin Sheri2, Saeed Anwar2, Esra Erkut2, Md Nur Ahad Shah2, Tejal Aslesh2, Stanley Woo2, Omar Sheikh2, Rika Maruyama2, Hiroaki Takano1, Katsuhiko Kunitake3, William Duddy4, Yasushi Okuno1,5, Yoshitsugu Aoki3, Toshifumi Yokota2.
Abstract
Exon skipping using antisense oligonucleotides (ASOs) has recently proven to be a powerful tool for mRNA splicing modulation. Several exon-skipping ASOs have been approved to treat genetic diseases worldwide. However, a significant challenge is the difficulty in selecting an optimal sequence for exon skipping. The efficacy of ASOs is often unpredictable, because of the numerous factors involved in exon skipping. To address this gap, we have developed a computational method using machine-learning algorithms that factors in many parameters as well as experimental data to design highly effective ASOs for exon skipping. eSkip-Finder (https://eskip-finder.org) is the first web-based resource for helping researchers identify effective exon skipping ASOs. eSkip-Finder features two sections: (i) a predictor of the exon skipping efficacy of novel ASOs and (ii) a database of exon skipping ASOs. The predictor facilitates rapid analysis of a given set of exon/intron sequences and ASO lengths to identify effective ASOs for exon skipping based on a machine learning model trained by experimental data. We confirmed that predictions correlated well with in vitro skipping efficacy of sequences that were not included in the training data. The database enables users to search for ASOs using queries such as gene name, species, and exon number.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34104972 PMCID: PMC8265194 DOI: 10.1093/nar/gkab442
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of eSkip-Finder.
Selected features
| Selected features for PMO | Selected features for 2OMe | ||||
|---|---|---|---|---|---|
| Name | Description | FIa | Name | Description | FIa |
| ASO concentrationb | Concentration of oligomer used in the experiment | 0.64±0.14 | ASO concentrationb | Concentration of oligomer used in the experiment | 0.11±0.05 |
| Exon v intron %GC after blocking by oligo | %GC in exon when blocked by oligo / %GC 5′ intron 200 bases upstream | 0.68±0.15 | GCs (number of) | Total GCs in ASO sequence | 0.67±0.20 |
| dG (50BaseFlanksAroundTarget) | Predicted binding energy ( | 0.66±0.16 | ACP | Distance in bases from the splice acceptor site to the center of the target site ( | 0.49±0.21 |
| ACC_LAST15 | Predicted accessibility scores ( | 0.32±0.09 | %GC of exon when blocked by oligo | Total remaining %GCs of target exon sequence when blocked by ASOs | 0.46±0.11 |
| niscore_per_base | Cumulative NI score ( | 0.18±0.09 | |||
| ACC_LAST8 | Predicted accessibility scores of the 3′ end of the target (last 8 bases) | 0.12±0.07 | |||
aThe feature importance (FI) was calculated by the permutation importance method (23).
bThe ASO concentration used in the experiment is always included as one of the features of the predictive model.
Figure 2.Predictive performance of SVR models for PMO and 2OMe. Symbols represent oligomer concentration (c) given in μM used in the experiment. The coefficient of determination, R2, was calculated by linear regression (black lines).
Figure 3.Case study on predicting skipping ASOs for exon 44 of the dystrophin pre-mRNA. (A) Input image of the predictive model. A user specifies the length of ASO and its chemistry (PMO or 2OMe). The upstream (200 bases) and downstream (200 bases) intron sequences of the target exon are required in addition to the target exon sequence, which are used to calculate features. (B) Output image. The relative exon-skipping efficacy is predicted by scanning the target exon sequence with a window size of the length specified by the user. Moving averages with 15 bases are plotted with a dashed line. (C) Efficacy of dystrophin exon 44 skipping observed under identical experimental conditions (cell type used = healthy primary human myotubes, ASO chemistry = PMO, ASO length = 30, ASO concentration = 0.5 μM) as previously reported (15), which is not included in the training dataset. The correlation between predicted and experimental skipping efficacies R2 was 0.7 as shown in Supplementary Figure S3.