| Literature DB >> 20470414 |
Donald M Gray1, Carla W Gray, Byong-Hoon Yoo, Tzu-Fang Lou.
Abstract
BACKGROUND: The enumeration of tetrameric and other sequence motifs that are positively or negatively correlated with in vivo antisense DNA effects has been a useful addition to the arsenal of information needed to predict effective targets for antisense DNA control of gene expression. Such retrospective information derived from in vivo cellular experiments characterizes aspects of the sequence dependence of antisense inhibition that are not predicted by nearest-neighbor (NN) thermodynamic parameters derived from in vitro experiments. However, quantitation of the antisense contributions of motifs is problematic, since individual motifs are not isolated from the effects of neighboring nucleotides, and motifs may be overlapping. These problems are circumvented by a next-nearest-neighbor (NNN) analysis of antisense DNA effects in which the overlapping nature of nearest-neighbors is taken into account.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20470414 PMCID: PMC2877693 DOI: 10.1186/1471-2105-11-252
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
64 mRNA triplets and associated inhibition parameters, P.
| NNN triplet or other parameter (5' to 3' in mRNA) | Values of antisense inhibition parameter P (%) from fit to 112 sequences ± standard error from SVD analysis | Number of NNN triplet or other parameter in data set |
|---|---|---|
| AAA | 2.98 ± 2.08 | 30 |
| AAC | 5.96 ± 1.45 | 32 |
| AAG | 0.48 ± 1.82 | 36 |
| AAU | -3.52 ± 1.76 | 30 |
| ACA | -3.78 ± 1.67 | 48 |
| ACC | 4.63 ± 2.79 | 25 |
| ACG | 1.82 ± 1.69 | 28 |
| ACU | 3.81 ± 1.66 | 33 |
| AGA | 3.23 ± 1.99 | 40 |
| AGC | 3.86 ± 1.46 | 39 |
| AGG | 4.83 ± 1.82 | 52 |
| AGU | -4.72 ± 2.20 | 17 |
| AUA | 0.29 ± 2.42 | 15 |
| AUC | 3.27 ± 2.07 | 19 |
| AUG | 0.45 ± 1.63 | 67 |
| AUU | 1.05 ± 1.97 | 21 |
| CAA | 1.83 ± 1.65 | 37 |
| CAC | -1.51 ± 1.31 | 46 |
| CAG | -1.47 ± 1.49 | 38 |
| CAU | 5.77 ± 1.62 | 29 |
| CCA | -0.99 ± 2.14 | 26 |
| CCC | -3.73 ± 2.06 | 30 |
| CCG | 3.09 ± 1.28 | 58 |
| CCU | 5.48 ± 1.76 | 35 |
| CGA | 5.43 ± 1.64 | 40 |
| CGC | 7.38 ± 1.32 | 33 |
| CGG | 5.65 ± 1.49 | 51 |
| CGU | -5.02 ± 1.96 | 19 |
| CUA | 13.27 ± 2.79 | 14 |
| CUC | 1.37 ± 1.75 | 31 |
| CUG | 0.87 ± 1.75 | 44 |
| CUU | -4.62 ± 1.64 | 28 |
| GAA | 3.57 ± 2.00 | 53 |
| GAC | 1.08 ± 1.67 | 37 |
| GAG | -2.19 ± 1.07 | 63 |
| GAU | 4.87 ± 1.71 | 47 |
| GCA | 6.89 ± 1.64 | 46 |
| GCC | -1.26 ± 1.57 | 58 |
| GCG | 3.96 ± 1.59 | 41 |
| GCU | 6.20 ± 1.91 | 31 |
| GGA | 2.96 ± 1.10 | 82 |
| GGC | -2.70 ± 1.10 | 56 |
| GGG | 2.06 ± 1.28 | 76 |
| GGU | 8.28 ± 1.99 | 30 |
| GUA | -8.59 ± 2.99 | 15 |
| GUC | 1.85 ± 1.84 | 24 |
| GUG | -3.40 ± 1.46 | 42 |
| GUU | 10.43 ± 2.95 | 18 |
| UAA | -2.48 ± 3.44 | 8 |
| UAC | 0.94 ± 1.91 | 19 |
| UAG | 10.37 ± 2.55 | 11 |
| UAU | -2.06 ± 2.69 | 16 |
| UCA | 2.50 ± 1.75 | 30 |
| UCC | 4.20 ± 1.98 | 36 |
| UCG | 4.57 ± 2.47 | 16 |
| UCU | -4.60 ± 2.60 | 18 |
| UGA | -4.28 ± 1.93 | 38 |
| UGC | 7.26 ± 1.14 | 48 |
| UGG | -1.93 ± 1.33 | 65 |
| UGU | 1.75 ± 2.00 | 33 |
| UUA | 1.80 ± 3.21 | 10 |
| UUC | 0.19 ± 2.06 | 26 |
| UUG | 4.88 ± 2.28 | 31 |
| UUU | 0.81 ± 2.59 | 25 |
| A549 cell | 10.59 ± 1.78 | 62 |
| CRAF1 | 3.65 ± 2.33 | 52 |
| BCL2 | -3.10 ± 1.85 | 29 |
| AKT2 | 8.97 ± 4.43 | 7 |
| PKC-α | -3.75 ± 2.76 | 24 |
Values of parameters, P, for 64 NNN mRNA triplets obtained from an SVD solution of inhibitory data for 112 antisense S-DNA sequences targeted to mRNAs encoded by four genes, in two cell lines. Except for the NNN parameters for AAA, UUU, CCC, and GGG, and the parameter for the cell line, the parameters are meaningful only in combinations. The parameters are the percent changes in net protein accumulation assigned to the NNN such that they may be summed to give inhibitory values for closed-circular mRNA antisense targets. (Positive values denote a decrease in accumulated protein.)
Figure 1Experimental . Experimental values of percent reduction in net protein accumulation from 112 antisense S-DNA experiments versus values calculated by summing NNN parameters from Table 1. The experimental values and errors are given in additional file 1. The correlation coefficient r = 0.792. Symbols are as follows: × (PKC-α), blue triangle (BCL2), red triangle (AKT2), grey circle (CRAF2 in A549 cells), black circle (CRAF2 in T24 cells).
Inhibition parameters for linearly independent combinations of next-nearest-neighbor triplets.
| Independent sequence (12-mer) (5' to 3' in mRNA) | Antisense inhibition parameter P (%) per 12-mer |
|---|---|
| (ACU)4 | 72.1 ± 14.7 |
| (UUG)4 | 68.2 ± 21.2 |
| (CG)6 | 68.0 ± 14.7 |
| (AACU)3 | 61.7 ± 17.1 |
| (GGUC)3 | 61.0 ± 11.3 |
| (UGC)4 | 57.3 ± 9.0 |
| (UUAG)3 | 53.6 ± 20.7 |
| (AACG)3 | 50.3 ± 10.6 |
| (AUC)4 | 46.2 ± 14.0 |
| (GGUA)3 | 44.7 ± 15.6 |
| (UCC)4 | 44.2 ± 14.0 |
| (UUGC)3 | 41.2 ± 10.0 |
| (AAGC)3 | 39.2 ± 7.5 |
| (AGC)4 | 37.1 ± 10.7 |
| (CCUG)3 | 37.0 ± 10.2 |
| (CCG)4 | 36.8 ± 9.1 |
| (A)12 | 35.7 ± 24.9 |
| (GGAC)3 | 34.5 ± 8.2 |
| (ACG)4 | 33.3 ± 11.7 |
| (UUCG)3 | 30.5 ± 19.6 |
| (AAG)4 | 29.1 ± 16.1 |
| (UUCA)3 | 28.5 ± 9.5 |
| (CGG)4 | 27.6 ± 9.2 |
| (G)12 | 24.8 ± 15.4 |
| (GGCA)3 | 22.7 ± 9.7 |
| (AGG)4 | 22.4 ± 8.6 |
| (GGAU)3 | 19.0 ± 7.5 |
| (AAC)4 | 16.0 ± 11.2 |
| (AAUC)3 | 12.2 ± 11.8 |
| (UGG)4 | 11.8 ± 9.7 |
| (U)12 | 9.7 ± 31.1 |
| (ACC)4 | 8.5 ± 18.1 |
| (GGCU)3 | 7.3 ± 9.1 |
| (AG)6 | 6.2 ± 12.6 |
| (UUAC)3 | 5.8 ± 16.7 |
| (UCG)4 | 5.6 ± 15.6 |
| (AUG)4 | 4.2 ± 12.8 |
| (AUU)4 | 3.2 ± 16.0 |
| (CCAG)3 | 0.4 ± 11.5 |
| (UG)6 | -9.9 ± 15.4 |
| (AU)6 | -10.6 ± 26.0 |
| (AAUG)3 | -11.3 ± 10.1 |
| (AGU)4 | -11.8 ± 15.9 |
| (UC)6 | -19.4 ± 19.8 |
| (AAU)4 | -22.8 ± 17.7 |
| (AC)6 | -31.8 ± 12.4 |
| (UUC)4 | -36.1 ± 14.9 |
| (C)12 | -44.8 ± 24.7 |
| (AAGU)3 | -46.0 ± 20.5 |
Values of inhibition parameters, P (percent change, with positive values indicating a decrease in accumulated protein) for an example set of 49 linearly independent 12-mer mRNA sequence combinations of NNN triplets. Values are ranked in decreasing order. Percent standard errors were derived from the complete SVD variance-covariance matrix (i.e. including the covariances).
Inhibition parameters for independent sequence combinations of NNN that contain only purines or pyrimidines.
| Parameters P (%) for five purine-containing independent sequences | Parameters P (%) for five pyrimidine-containing independent sequences | ||
|---|---|---|---|
| (A)12 | 35.7 ± 24.9 | (UCC)4 | 44.2 ± 14.0 |
| (AAG)4 | 29.1 ± 16.1 | (U)12 | 9.7 ± 31.1 |
| (G)12 | 24.8 ± 15.4 | (UC)6 | -19.4 ± 19.8 |
| (AGG)4 | 22.4 ± 8.6 | (UUC)4 | -36.1 ± 14.9 |
| (AG)6 | 6.2 ± 12.6 | (C)12 | -44.8 ± 24.7 |
Inhibition parameters, P, from Table 2 for the five independent sequence combinations that contain only purines and the five that contain only pyrimidines. The former are generally more favorable target sequence combinations than those containing only pyrimidines.
Inhibition parameters for independent combinations of targeted genes.
| Parameters for each of four gene combinations | Antisense inhibition parameter P (%) |
|---|---|
| CRAF1 minus average | 2.20 ± 2.36 |
| BCL2 minus average | -4.54 ± 1.86 |
| AKT2 minus average | 7.53 ± 4.37 |
| PKC-α minus average | -5.20 ± 2.83 |
Values are the differences from the average of the four gene parameters in Table 1. Any set of three of the four parameters is an independent set. The fourth is a dependent parameter, since it is a linear combination (i.e. the negative sum) of the three parameters chosen to be independent. Percent standard errors were derived from the SVD variance-covariance matrix.
Fits of data sets with NNN or NN parameters.
| Fit with in vivo NNN parameters | Fit with in vivo NN parameters | Fit with in vitro NN parameters | |||||
|---|---|---|---|---|---|---|---|
| CRAF1 (A549 cells) | 26 | 0.636 | <0.001 | 0.146 | 0.474 | 0.242 | 0.234 |
| CRAF1 (T24 cells) | 26 | 0.748 | <0.001 | 0.493 | 0.010 | 0.403 | 0.041 |
| BCL2 (A549 cells) | 29 | 0.635 | <0.001 | 0.148 | 0.441 | 0.231 | 0.229 |
| PKC-α (T24 cells) | 24 | 0.903 | <0.001 | 0.130 | 0.544 | 0.639 | <0.001 |
| Published data PKC-α (A549 cells) [ | 20 | 0.561 | 0.010 | 0.716 | <0.001 | 0.368 | 0.110 |
Correlation coefficients, r, and the significance of correlation, Psig, between the indicated data sets and in vivo next-nearest-neighbor (NNN) parameters or in vivo or in vitro nearest-neighbor (NN) parameters. Psig is the significance of the correlation coefficient by the t-test and is the probability of being wrong in rejecting the null hypothesis. The smaller the value of Psig, the more significant the correlation.