| Literature DB >> 17540034 |
Jyoti K Shah1, Harold R Garner, Michael A White, David S Shames, John D Minna.
Abstract
BACKGROUND: RNA interference has revolutionized our ability to study the effects of altering the expression of single genes in mammalian (and other) cells through targeted knockdown of gene expression. In this report we describe a web-based computational tool, siRNA Information Resource (sIR), which consists of a new open source database that contains validation information about published siRNA sequences and also provides a user-friendly interface to design and analyze siRNA sequences against a chosen target sequence.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17540034 PMCID: PMC1896181 DOI: 10.1186/1471-2105-8-178
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Screen shot of siRNA information resource (sIR). sIR provides a menu interface which can be used to switch between various modes such as Target design mode, Resource (database) mode, Accession Finder mode and Pre-computed siRNA mode. The user can choose from among the siRNA pre-computed designs or input a sequence and select various input parameters such as sequence/accession number, percent GC content, maximal nucleotide runs, open reading frame, pattern and scoring system for a custom analysis.
Weights of the scoring system
| A1 | A/U at position 1 of sense strand | -1.4 |
| A2 | G/C at position 1 of sense strand | 1.11 |
| A3 | A at position 6 of sense strand | 0.70 |
| A4 | U at position 10 of sense strand | 0.25 |
| A5 | G at position 13 of sense strand | -1.66 |
| A6 | U/T at position 13 of sense strand | 0.31 |
| A7 | A/u at position 16 of sense strand | 0.74 |
| A8 | A/U at position 17 of sense strand | 1.20 |
| A9 | A/u at position 18 of sense strand | 1.44 |
| A10 | A/U at position 19 of sense strand | 0.87 |
| A11 | G/C at position 19 of sense strand | -1.02 |
| A12 | GC content | 0.42 |
This table shows the different weights converged upon for the various parameters using a MATLAB algorithm. It can be observed that the criterion A5 with a base 'G' at position 13 of sense strand has the highest absolute weight whereas the criterion A4 with a base 'U' at position 10 of sense strand has the lowest absolute weight. The parameters with positive weights enhance the siRNA efficacy whereas the ones with negative weights suppress the efficacy of siRNA.
Figure 2Correlation between the score and average percentage efficiency. The average efficiency was computed for a score bin size of 5. The error bars are the standard deviations in the average efficiency values. The data was fitted using a polynomial trend of second order and a correlation of 0.95 was observed.
Summary of test set scores
| Least potent avg score | Most potent avg score | T-test p-value | Correlation between efficiency and Inhibitory activity | Low efficiency avg score | Medium efficiency avg score | High efficiency avg score | Very high efficiency avg score | One-way ANOVA test p-value | |
| sIR | 37.9 | 62.9 | 5.60E-115 | 0.422 | 50.7 | 57.9 | 58.3 | 60 | 1.51E-18 |
| Amarzguioui | 40.8 | 62.9 | 4.54E-91 | 0.34 | 51.2 | 58.1 | 59.1 | 60 | 1.51E-18 |
| Ui-Tei | 40.3 | 68.4 | 2.67E-101 | 0.39 | 52.6 | 60.9 | 61.8 | 64 | 1.84E-18 |
| Rational | 21.6 | 50.3 | 1.90E-108 | 0.4 | 34.7 | 39.5 | 40.2 | 43.8 | 4.03E-12 |
This table summarizes the scores attained by ANN (Huesken et al., 2005) and siRecords (Ren et al., 2006) sequences. siRNA scores were computed to compare weighted "sIR" and un-weighted "Rational design", "Ui-tei design" and "Amarzguioui design" scoring systems. This table shows the averages, p-values as well as correlation coefficient values obtained by the siRNA sequences in all four scoring systems.
Parameters used for the precomputation of siRNAs.
| Percent GC Content range | Moderate range, 30% to 50% |
| Multiple nucleotide run (Runs of 4 or more nucleotides) | Avoided |
| Open reading frame | An open reading frame region between 50 nucleotides from the 5' end and 50 nucleotides from the 3' end, downstream and upstream of the mRNA was considered for the design. |
| Score cutoff | Score > 50. siRNAs with scores > 50 were retained |
| Blast Hit cutoff | Number of blast hits < = 2. siRNAs with blast hits < = 2 and percent homology > 80% were retained. |
This table lists the exact parameters used while pre-designing siRNAs for all the genes in the human RefSeq database.