| Literature DB >> 16526963 |
Xiaochen Bo1, Shaoke Lou, Daochun Sun, Wenjie Shu, Jing Yang, Shengqi Wang.
Abstract
BACKGROUND: Local structures of target mRNAs play a significant role in determining the efficacies of antisense oligonucleotides (ODNs), but some structure-based target site selection methods are limited by uncertainties in RNA secondary structure prediction. If all the predicted structures of a given mRNA within a certain energy limit could be used simultaneously, target site selection would obviously be improved in both reliability and efficiency. In this study, some key problems in ODN target selection on the basis of multiple predicted target mRNA structures are systematically discussed.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16526963 PMCID: PMC1421440 DOI: 10.1186/1471-2105-7-122
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Summary of antisense target genes and their predicted structures used in this study
| Accession | Description | No. structures | No. ODNs |
| X62295 | Rattus mRNA for vascular type-1 angiotensin II receptor. | 50 | 36 |
| XM_051583 | Homo sapiens v-raf-1 murine leukemia viral oncogene homolog 1 (RAF1), mRNA | 50 | 31 |
| M14758 | Homo sapiens P-glycoprotein (PGY1) mRNA | 50 | 22 |
| NM_004996 | Homo sapiens ATP-binding cassette, sub-family C (CFTR/MRP), member 1 (ABCC1), transcript variant 1, mRNA | 50 | 14 |
| M24283 | Human intercellular adhesion molecule-1 (ICAM-1) | 50 | 66 |
| X52479 | Human PKC alpha mRNA for protein kinase C alpha | 37 | 19 |
| NM_001078 | Homo sapiens vascular cell adhesion molecule 1 (VCAM1), transcript variant 1, mRNA | 50 | 35 |
| XM_057446 | Homo sapiens selectin E (endothelial adhesion molecule 1) (SELE), mRNA. | 50 | 11 |
| M30640 | Human endothelial leukocyte adhesion molecule I (ELAM1) mRNA, complete cds | 50 | 4 |
| NM_000877 | Homo sapiens interleukin 1 receptor, type I (IL1R1), mRNA. | 50 | 20 |
| M31585 | Mouse (clone lambda-c5e) intercellular adhesion molecule 1 (ICAM-1) mRNA, complete cds | 39 | 8 |
| BC036531 | Homo sapiens collagen, type I, alpha 1, mRNA (cDNA clone MGC:33668 IMAGE:5264710) | 50 | 19 |
| NM_010784 | Mus musculus midkine (Mdk), mRNA | 17 | 4 |
| M15077 | P.pyralis (firefly) luciferase gene, complete cds | 39 | 8 |
| X03484 | Human mRNA for raf oncogene | 50 | 20 |
| X14805 | Mus musculus mRNA for DNA methyltransferase 1 | 50 | 8 |
| BC005976 | Homo sapiens ras homolog gene family, member A, mRNA | 23 | 13 |
| M10843 | Rabbit beta-globin mRNA | 26 | 24 |
| U45880 | Human X-linked inhibitor of apotosis protein XIAP mRNA | 36 | 6 |
| AF015950 | Human telomerase reverse transcriptase mRNA | 50 | 5 |
| NR_001566 | Homo sapiens telomerase RNA component (TERC) on chromosome 3 | 23 | 5 |
| M34309 | Human epidermal growth factor receptor (HER3) mRNA, complete cds. | 50 | 22 |
| NM_004507 | Homo sapiens HUS1 checkpoint homolog (S. pombe) (HUS1), mRNA. | 33 | 11 |
| AJ278710 | Escherichia coli 23S rRNA gene, strain K12 DSM 30083T | 50 | 7 |
| X03363 | Human c-erb-B-2 mRNA | 50 | 3 |
| M10988 | Human tumor necrosis factor (TNF) mRNA | 26 | 4 |
| NM_000791 | Homo sapiens dihydrofolate reductase (DHFR), mRNA. | 50 | 7 |
| NM_001168 | Homo sapiens baculoviral IAP repeat-containing 5 (survivin) (BIRC5), mRNA | 29 | 5 |
| NM_013642 | Mus musculus dual specificity phosphatase 1 (Dusp1), mRNA | 37 | 8 |
| AF025846 | Co-reporter vector pRL-TK, complete sequence | 50 | 4 |
Figure 1Two representations of multiple predicted structures of rabbit β-globin mRNA (G101-G130). (a) Single-stranded probability profile; (b) 'SUP' representation.
Parameters derived from the SSPP representation
| Parameter | Definition |
| Mean, | |
| Root mean square, | |
| Maximum, | |
| Impulse factor, | |
| Peak factor, | |
| Wave factor, | |
| Mean of difference, |
Parameters derived from the SUP sequence representation
| Parameter | Definition |
| Number of bases in single-stranded region | |
| Number of bases in double-stranded region | |
| Percentage of bases in single-stranded region to the length of ODN | |
| Percentage of bases in double-stranded region to the length of ODN | |
| Maximum length of consecutive subsequence in single-stranded region | |
| Maximum length of consecutive subsequence in base pairing | |
| Maximum length of consecutive subsequence in single-stranded region counting from 5' terminal | |
| Maximum length of consecutive subsequence in base pairing counting from 5' terminal | |
| Maximum length of consecutive subsequence in single-stranded region counting from 3' terminal | |
| Maximum length of consecutive subsequence in base pairing counting from 3' terminal | |
| Structure consistency, |
Figure 2The distribution of ODN length and length-limited features. (a) The distribution of ODN lengths in the dataset; (b) Mean values of some features of ODNs with different lengths.
Correlations between features and efficacy
| Parameter | Pearson Correlation | Spearman Correlation | Kendall Correlation |
| -0.086 | -0.055 | -0.087 | |
| -0.150** | -0.100** | -0.147** | |
| -0.099* | -0.113** | -0.155** | |
| 0.040 | 0.039 | 0.060 | |
| 0.124** | 0.083** | 0.125** | |
| -0.030 | -0.017 | -0.025 | |
| -0.094* | -0.034 | -0.051 | |
| -0.087 | -0.057 | -0.082 | |
| -0.045 | -0.043 | -0.061 | |
| -0.073 | -0.050 | -0.075 | |
| -0.040 | -0.040 | -0.057 | |
| -0.062 | -0.037 | -0.053 | |
| -0.012 | -0.012 | -0.019 | |
| 0.031 | 0.012 | 0.016 | |
| -0.009 | -0.039 | -0.055 | |
| -0.050 | -0.011 | -0.016 | |
| -0.036 | -0.030 | -0.039 | |
| -0.064 | -0.045 | -0.066 |
**. Correlation is significant at the 0.01 level
*. Correlation is significant at the 0.05 level
Performance of Fisher linear discriminators for each parameter
| Parameter | Threshold = 50% | Threshold = 75% | ||
| 0.56 | 0.53 | 0.65 | 0.51 | |
| 0.58 | 0.54 | 0.50 | 0.54 | |
| 0.33 | 0.60 | |||
| 0.37 | 0.67 | 0.48 | 0.38 | |
| 0.50 | 0.61 | 0.56 | 0.59 | |
| 0.42 | 0.58 | 0.63 | 0.43 | |
| 0.52 | 0.50 | 0.52 | 0.50 | |
| 0.56 | 0.50 | 0.56 | 0.48 | |
| 0.44 | 0.51 | 0.52 | 0.49 | |
| 0.54 | 0.52 | 0.58 | 0.51 | |
| 0.49 | 0.45 | 0.54 | ||
| 0.56 | 0.47 | 0.45 | ||
| 0.43 | 0.63 | 0.40 | ||
| 0.31 | 0.30 | |||
| 0.33 | 0.63 | 0.58 | 0.37 | |
| 0.29 | 0.65 | 0.29 | ||
| 0.34 | 0.64 | 0.50 | 0.37 | |
| 0.50 | 0.50 | 0.60 | 0.52 | |
. high specificity (≥ 0.7) or high sensitivity (≥ 0.7)
Dataset for cross-validation experiments
| Networks | Accession of test gene | Number in train set | Number in test set |
| NSSPP1 and NSUP1 | X62295 | 412 | 36 |
| NSSPP2 and NSUP2 | XM_051583 | 417 | 31 |
| NSSPP3 and NSUP3 | M14758 | 426 | 22 |
| NSSPP4 and NSUP4 | M24283 | 356 | 66 |
| NSSPP5 and NSUP5 | NM_001078 | 379 | 35 |
| NSSPP6 and NSUP6 | NM_000877 | 428 | 20 |
| NSSPP7 and NSUP7 | X03484 | 428 | 20 |
| NSSPP8 and NSUP8 | M10843 | 424 | 24 |
The performances of two groups of networks in cross-validation experiments
| Networks | Se | Sp | Acc | ROC area | Networks | Se | Sp | Acc | ROC area |
| Nsspp1 | 0.50 | 0.97 | 0.92 | 0.91 | Nsup1 | 0 | 0.94 | 0.83 | 0.60 |
| Nsspp2 | 0.33 | 0.96 | 0.90 | 0.75 | Nsup2 | 0 | 0.93 | 0.84 | 0.69 |
| Nsspp3 | 0 | 0.93 | 0.59 | 0.71 | Nsup3 | 0 | 1 | 0.64 | 0.65 |
| Nsspp4 | 0 | 1 | 0.88 | 0.66 | Nsup4 | 0.5 | 0.86 | 0.82 | 0.66 |
| Nsspp5 | 0 | 1 | 0.71 | 0.69 | Nsup5 | 0 | 1 | 0.71 | 0.74 |
| Nsspp6 | 0 | 0.94 | 0.85 | 0.81 | Nsup6 | 0 | 1 | 0.9 | 0.89 |
| Nsspp7 | 0 | 1 | 0.70 | 0.98 | Nsup7 | 0.17 | 0.93 | 0.70 | 0.89 |
| Nsspp8 | 0 | 1 | 0.58 | 0.63 | Nsup8 | 0.40 | 0.86 | 0.67 | 0.71 |
. high specificity (≥ 0.7) or high sensitivity (≥ 0.7)
Figure 3ROC curves for efficacy-predicting neural networks. ROC curves are shown for networks (a) NSSPP1 and NSUP1; (b) NSSPP2 and NSUP2; (c) NSSPP3 and NSUP3; (d) NSSPP4 and NSUP4; (e) NSSPP5 and NSUP5; (f) NSSPP6 and NSUP6; (g) NSSPP7 and NSUP7; (h) NSSPP8 and NSUP8.