Literature DB >> 18440976

IDT SciTools: a suite for analysis and design of nucleic acid oligomers.

Richard Owczarzy1, Andrey V Tataurov, Yihe Wu, Jeffrey A Manthey, Kyle A McQuisten, Hakeem G Almabrazi, Kent F Pedersen, Yuan Lin, Justin Garretson, Neil O McEntaggart, Chris A Sailor, Robert B Dawson, Andrew S Peek.   

Abstract

DNA and RNA oligomers are used in a myriad of diverse biological and biochemical experiments. These oligonucleotides are designed to have unique biophysical, chemical and hybridization properties. We have created an integrated set of bioinformatics tools that predict the properties of native and chemically modified nucleic acids and assist in their design. Researchers can select PCR primers, probes and antisense oligonucleotides, find the most suitable sequences for RNA interference, calculate stable secondary structures, and evaluate the potential for two sequences to interact. The latest, most accurate thermodynamic algorithms and models are implemented. This free software is available at http://www.idtdna.com/SciTools/SciTools.aspx.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18440976      PMCID: PMC2447751          DOI: 10.1093/nar/gkn198

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Synthetic oligonucleotides are widely employed in various molecular biology applications, e.g. polymerase chain reaction (PCR), molecular beacons, microarrays, mutagenesis, RNAi, antisense and de novo gene construction (1–7). Published bioinformatics algorithms can predict biophysical properties of oligonucleotides from their sequence and estimate performance of oligonucleotides in specific assays both singly and together with other sequences (8,9). Here, we describe an online suite of computational software tools that enable molecular biologists to design, evaluate and make informed decisions about the properties of nucleic acid sequences. The IDT SciTools receives over 7000 unique visitors and 1.5 million hits every month. The web servers consist of several independent applications summarized in Table 1. Instructions and help to each software tool can be found at the top of web input forms. The code is regularly updated when more accurate models and algorithms are published. New applications will be added in the future.
Table 1.

Summary of IDT SciTools

Software toolFeatures
OligoAnalyzerComprehensive oligonucleotide analysis (molecular weight, extinction coefficient, melting temperature, folding and hybridization of strands, effects of mismatches).
PrimerQuestSelect optimal probes and primers for PCR assays.
LNA designTool to design LNA modified probes and primers having specific duplex stability.
ddRNAi designDesign sequences for DNA-directed RNA interference.
RNAi designTool to design siRNA duplex oligomers.
TriFECTa RNAi KitsPredesigned dicer-substrate RNAi duplexes, gene knockdown kits.
Antisense designAntisense oligonucleotide selection tool.
mFoldPrediction of oligonucleotide secondary structure.
DilutionCalcCalculate volumes to dilute oligonucleotide to the desired concentration.
ResuspensionCalcCalculate volumes to dissolve a dry lyophilized nucleic acid to the desired concentration.
Summary of IDT SciTools

ONLINE TOOLS

OligoAnalyzer 3.1

The OligoAnalyzer is the central calculator where various kinds of information about an oligonucleotide sequence can be predicted. The interface is presented on Figure 1. A user can input a nucleotide sequence and conditions, i.e. the concentrations of DNA, Na+, K+, Mg2+ and deoxynucleoside triphosphates. Melting temperature is predicted under these conditions for the duplex where the oligonucleotide hybridizes to the complementary sequence. This complementary strand can be either RNA or DNA; this is selected using the Target Type option. The oligonucleotide sequence can be modified with over 150 different labels and chemical groups (e.g. biotin, phosphorothioate, fluorescent dyes) using symbols listed in the tabbed sections below the sequence box. Seven different analyses can be performed when the specific button is selected on the right side of the interface. Selection of the ANALYZE button results in the physical properties of the oligonucleotide, such as a complementary sequence, oligonucleotide length, content of G and C bases, melting temperature, extinction coefficient at 260 nm and molecular weight (Figure 1). Published nearest-neighbor parameters are employed to calculate the extinction coefficient (10–12). Using values obtained from the published literature or coefficients estimated at Integrated DNA Technologies, the effects of modifications are included in the oligonucleotide extinction coefficient.
Figure 1.

Interface of OligoAnalyzer 3.1. Instructions and help files are at the top of the calculator. The sequence and conditions of experiments are entered in the upper region. Many chemical modifications can be added to a sequence using symbols from tabbed sections in the middle segment. The results are displayed in the lower left segment.

Interface of OligoAnalyzer 3.1. Instructions and help files are at the top of the calculator. The sequence and conditions of experiments are entered in the upper region. Many chemical modifications can be added to a sequence using symbols from tabbed sections in the middle segment. The results are displayed in the lower left segment. The oligonucleotide molecular weight also includes the weights of any chemical modifications. These weights have been experimentally validated (± 2 g/mol) for thousands of synthesized sequences by electrospray-ionization liquid chromatography mass spectrometry (13). Melting temperatures are calculated from the nearest-neighbor model (14–16) and the duplex is assumed to melt in two-state fashion, Oligonucleotide concentration, [S1], is assumed to be significantly larger (at least 6 ×) than the concentration of the complementary target, [S2], as this is seen in many molecular biology assays. In that case, Coligo is equal to [S1] and the concentration of the target can be neglected (17). If [S1] is not significantly larger than [S2], but [S1] ≥ [S2], the following concentration should be entered into the calculator, If [S2] < [S1], Equation (2) is valid when the designation of strands is switched. Transition enthalpy, ΔH°, and entropy, ΔS°, are calculated from the latest nearest-neighbor parameters for DNAs (15,16) and RNAs (18,19). The effects of counterions are modeled using the improved corrections for monovalent ions (20) and magnesium ions (53), This unique biophysical model employed for various counterions is not implemented elsewhere (21–23). Thermodynamic parameters are not available for many modifications that were demonstrated to change duplex stability (e.g. 2′-O-methyl RNA, 2-aminopurine, Cy3 dye) (13), and their effects on melting temperature are therefore neglected in the current version. When these parameters are published, they will be implemented in the predictive algorithm. If a sequence contains degenerate bases, the minimum and the maximum melting temperatures for the mixture of sequences are also estimated (Figure 1). The thermodynamic algorithm was validated using an independent set of over 100 different sequences ranging in length from 8 to 60 base pairs that were not used to derive the algorithm (20). Selection of the HAIRPIN button will present the user with the input form for predicting oligonucleotide secondary structures. This tool uses the mFold algorithm (9,24–26) that is described later. The SELF-DIMER and HETERO-DIMER buttons allow the user to examine possible duplexes when oligonucleotide anneals to itself or another target sequence. Predicted structures from most stable to least stable are shown. These cross-hybridization analyses are important for PCR assays where primer–primer interactions can decrease the efficiency of the reaction and cause secondary by-products. Selection of the NCBI BLAST button sends the sequence to the NCBI website for searching various databases using the short nearly exact matches method (27). This analysis can provide predicted annealing sites of the oligonucleotide within a genome or other group of candidate sequences. Selection of the TM MISMATCH button will allow the user to examine the effects of single base mismatches on duplex stability and oligonucleotide hybridization. Several published sets of nearest-neighbor parameters from SantaLucia's lab are employed to make these predictions (16,28–32). Dangling unpaired bases usually stabilize the duplex, so the predictive algorithm also takes these effects into account (33). If a red target base is clicked, a dropdown box will appear and allow the user to select the desired base mismatch. The target concentration can be set to zero when the target concentration is negligible in comparison with the oligonucleotide concentration. Results will show melting temperatures of perfectly matched and mismatched duplexes as well as the fractions of oligonucleotide bound to the targets. The LNA CONVERSION button will be described later. The tool allows the user to position LNA modifications within a sequence, so that the desired melting temperature of the duplex sequence is achieved.

PrimerQuestSM

Primer and probe selection for the PCR-based assays are important activities in molecular biology. Several software packages were therefore designed for this procedure (34–39). PrimerQuestSM is based on the Primer3 code (37). However, the selection method was improved and a graphical user interface was created. The algorithm finds sequences having desired oligonucleotide length, GC content, melting temperature, content of consecutive GC base pairs and sequence stability at the 3′ end. The intramolecular secondary structures, long repeats of the same bases and cross-hybridization between primers and the probes are minimized in the primer selection model. Furthermore, the oligonucleotide melting temperature is calculated using the same thermodynamic model employed in the OligoAnalyzer. Once the nucleotide sequence is entered in the sequence box, the name and design criteria for the sequence of interest can be set using the appropriate fields under the basic, standard and advanced tabs. The basic interface exposes the minimal information that needs to be entered and hides detailed criteria. These basic settings are suitable for typical PCR experiments. The standard and advanced tabs show increasing amounts of settings that an advanced user can configure to customize their predictive model. The CALCULATE button submits the data for the prediction of primers with the desired properties. Results show several sets of probes and primers that were found to be optimal (Figure 2). The predicted sets are ranked from best to next best. A graphical representation of the sequence is displayed with color bars for included, excluded and targeted regions. The biophysical properties of the primers and probes are also reported.
Figure 2.

Example of PrimerQuestSM results. Primers and a probe were selected for beta hemoglobin mRNA. Included and excluded regions are displayed as horizontal colored graphic bars along the sequence. The target region is included in the amplicon. The primers and the probe are shown as horizontal arrows where the location and direction of the arrow represent hybridization area and orientation. Properties of the primers and the probe are reported below graphical illustrations.

Example of PrimerQuestSM results. Primers and a probe were selected for beta hemoglobin mRNA. Included and excluded regions are displayed as horizontal colored graphic bars along the sequence. The target region is included in the amplicon. The primers and the probe are shown as horizontal arrows where the location and direction of the arrow represent hybridization area and orientation. Properties of the primers and the probe are reported below graphical illustrations.

LNA design

Chimeric probes containing locked nucleic acid residues were demonstrated to increase duplex stability, specificity and mismatch discrimination (40,41). These properties improve genotyping and microarray assays. The LNA design tool suggests positions within a specific sequence, where LNA modifications can be introduced to produce the desired biophysical properties. A user enters desired number of LNA residues and the melting temperature of LNA-modified duplex. The software will attempt to decrease the length of the sequence and introduce LNA modifications, so that the desired melting temperature is achieved. The LNA residues are indicated with ‘+’ symbol in front of the base. Melting temperatures are predicted using the nearest-neighbor two-state model (Equation 1), the latest thermodynamic parameters (16,42), and improved salt corrections for the effects of monovalent and magnesium ions (20,53). The algorithm was tested with a published set of melting data for LNA modified oligomers (40,42).

Antisense design

Expression of specific genes can be suppressed with antisense oligonucleotides (43). Software can be used to select the most effective antisense oligonucleotides based on a model that discriminates between effective and ineffective antisense sequences. The nucleotide sequence of a gene or other target candidate for antisense-based knockdown can be retrieved from NCBI databases using GenBank ID or RefSeq ID. Antisense DNA oligomers are typically from 19 to 26 bases long and modified with phosphorothioates for nuclease resistance. Optionally, 2′-O-methyl RNA residues can be introduced to increase duplex stability and resistance against nucleases. The general algorithm for predicting active sites includes or excludes 3 and 4 base long motifs that are correlated with antisense activity responses (44,45). Sequences with the best score for the number of positive motifs (CCAC, TCCC, ACTC, GCCA, CTCT) are most likely to show antisense activity. A user can modify search criteria and include or exclude various motifs.

RNAi design, ddRNAi design and TriFECTa RNAi kits

In addition to antisense-based gene knockdown, the use of short interfering RNAs to induce RNA interference is a powerful strategy to suppress gene expression in vivo (5,6). These three tools assist in the design of effective siRNAs, as there are several properties that discriminate between effective and ineffective siRNA duplexes (46,47). The ddRNAi tool helps to design siRNAs, which are expressed directly from DNA transfected into cells to make the siRNA (48–50). The RNAi design software tool allows users to predict effective short synthetic 27-mer siRNA duplexes that are delivered to target cells (6). A user can specify criteria for the siRNA duplex and overhangs, e.g. desired duplex length, strand content of G and C bases and various sequence motifs at specific positions. Mixed bases can be introduced and different weights can be assigned to each motif. The algorithm searches a target gene sequence, calculates scores for sequence candidates and ranks optimal siRNA duplexes. The duplexes can have symmetrical overhangs up to 3 bases long. Results show properties of selected siRNA duplexes and their location within the target sequence. SiRNA–TriFECTa sequences are a collection of predesigned dicer-substrate siRNA sequences that have been found to be optimal using dicer-substrate siRNA design criteria. Besides incorporating siRNA activity criteria into the design algorithm, additional analyses are performed, so that chosen siRNA sequences do not target alternatively spliced exons and do not include known polymorphic sites. Gene sequences from the NCBI reference sequence set within eight organisms can be selected and displayed.

mFold

This software tool predicts the most stable secondary structure of an oligonucleotide by minimizing folding free energy (51). Suboptimal energetic secondary structures having free energies close to minimal ΔG° can be predicted as well. The mFold software was developed by and implemented in collaboration with Prof. Michael Zuker. The algorithm has been well tested and described in published sources (9,24–26). A user can input a nucleotide sequence and the conditions, including temperature and ionic concentrations. The folded structures are predicted at the specified temperature. The results show both a dot-plot diagram of possible base pairings and predicted secondary structures. These structures are ranked from the highest to lowest probability using transition free energies. Melting temperature is estimated using a two-state model. The connectivity table for each base and details of the energetics (each loop and stack ΔG° contributions) can also be obtained.

Dilution and resuspension calculations

Oligonucleotide dilutions can be calculated using the dilution calculator. A user inputs initial concentration, volume and desired final concentration. The calculator returns volumes used to mix solutions. Various concentration formats and units are accepted. Similarly, the resuspension calculator determines the volume of solution needed to achieve a specific concentration when known moles or mass of dry oligonucleotide are dissolved. Both calculators can be brought directly from the OligoAnalyzer results. In that case, oligonucleotide properties predicted by OligoAnalyzer are transferred automatically to these calculators.

Biophysics.idtdna.com

Subdomain http://biophysics.idtdna.com contains advanced, unique calculators that are being developed. Software is stable and tested, but it has yet to be included into IDT SciTools. In the current version, the extinction coefficients and UV spectrum from 215 to 310 nm can be predicted for both single-stranded and double-stranded DNA oligomers. The models, parameters and their accuracy tests have been recently published (12). A user can also choose to apply Cavaluzzi–Borer correction for extinction coefficients of DNA bases at 260 nm (52). The predicted UV spectrum is plotted and extinction coefficients at each wavelength are tabulated.

Webservices

Additional tools are also freely available, but the software requires a free login username and password to manage the user's information between sessions due to the computational time requirements. For example, oligonucleotides suitable for creating a gene expression microarray can be designed with the Workbench application. However, the design and storage of possibly thousands of target sequences is not amenable to be done during a single web browser session. The use of a free user login allows this information to be associated with a single user. Finally, for the programmatic incorporation of some of these software engines, IDT also provides a set of web service based interfaces to allow external software developers access to these functions (http://www.idtdna.com/AnalyzerService/AnalyzerService.asmx).

CONCLUSIONS

IDT SciTools web server provides useful predictions of oligonucleotide properties under various experimental conditions. The software tools help to select oligonucleotides that are most likely to exhibit the best performance in biological applications.
  51 in total

1.  Sequence-dependent thermodynamic parameters for locked nucleic acid (LNA)-DNA duplex formation.

Authors:  Patricia M McTigue; Raymond J Peterson; Jason D Kahn
Journal:  Biochemistry       Date:  2004-05-11       Impact factor: 3.162

2.  Constitutive and conditional suppression of exogenous and endogenous genes by anti-sense RNA.

Authors:  J G Izant; H Weintraub
Journal:  Science       Date:  1985-07-26       Impact factor: 47.728

3.  Improved predictions of secondary structures for RNA.

Authors:  J A Jaeger; D H Turner; M Zuker
Journal:  Proc Natl Acad Sci U S A       Date:  1989-10       Impact factor: 11.205

4.  Primer-directed enzymatic amplification of DNA with a thermostable DNA polymerase.

Authors:  R K Saiki; D H Gelfand; S Stoffel; S J Scharf; R Higuchi; G T Horn; K B Mullis; H A Erlich
Journal:  Science       Date:  1988-01-29       Impact factor: 47.728

5.  Calculating thermodynamic data for transitions of any molecularity from equilibrium melting curves.

Authors:  L A Marky; K J Breslauer
Journal:  Biopolymers       Date:  1987-09       Impact factor: 2.505

6.  Oligonucleotide interactions. 3. Circular dichroism studies of the conformation of deoxyoligonucleotides.

Authors:  C R Cantor; M M Warshaw; H Shapiro
Journal:  Biopolymers       Date:  1970       Impact factor: 2.505

7.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information.

Authors:  M Zuker; P Stiegler
Journal:  Nucleic Acids Res       Date:  1981-01-10       Impact factor: 16.971

8.  Fast algorithm for predicting the secondary structure of single-stranded RNA.

Authors:  R Nussinov; A B Jacobson
Journal:  Proc Natl Acad Sci U S A       Date:  1980-11       Impact factor: 11.205

9.  Revised UV extinction coefficients for nucleoside-5'-monophosphates and unpaired DNA and RNA.

Authors:  Michael J Cavaluzzi; Philip N Borer
Journal:  Nucleic Acids Res       Date:  2004-01-13       Impact factor: 16.971

10.  Predicting stability of DNA duplexes in solutions containing magnesium and monovalent cations.

Authors:  Richard Owczarzy; Bernardo G Moreira; Yong You; Mark A Behlke; Joseph A Walder
Journal:  Biochemistry       Date:  2008-04-19       Impact factor: 3.162

View more
  115 in total

1.  In vitro reconstitution of yeast splicing with U4 snRNA reveals multiple roles for the 3' stem-loop.

Authors:  Amy J Hayduk; Martha R Stark; Stephen D Rader
Journal:  RNA       Date:  2012-03-12       Impact factor: 4.942

2.  Quantification of human fecal bifidobacterium species by use of quantitative real-time PCR analysis targeting the groEL gene.

Authors:  Jana Junick; Michael Blaut
Journal:  Appl Environ Microbiol       Date:  2012-02-03       Impact factor: 4.792

3.  Molecular characterization of PgUFGT gene and R2R3-PgMYB transcription factor involved in flavonoid biosynthesis in four tissues of wild pomegranate (Punica granatum L.).

Authors:  Ramanjeet Kaur; Nisha Kapoor; Lubna Aslam; Ritu Mahajan
Journal:  J Genet       Date:  2019-11       Impact factor: 1.166

4.  Mechanism of strand-specific smooth muscle alpha-actin enhancer interaction by purine-rich element binding protein B (Purbeta).

Authors:  Jon E Ramsey; Robert J Kelm
Journal:  Biochemistry       Date:  2009-07-14       Impact factor: 3.162

5.  A self-assembling short oligonucleotide duplex suitable for pretargeting.

Authors:  Prabodhika Mallikaratchy; Jeffery Gardner; Lars Ulrik R Nordstrøm; Nicholas J Veomett; Michael R McDevitt; Mark L Heaney; David A Scheinberg
Journal:  Nucleic Acid Ther       Date:  2013-07-13       Impact factor: 5.486

6.  Development and accuracy of quantitative real-time polymerase chain reaction assays for detection and quantification of enterotoxigenic Escherichia coli (ETEC) heat labile and heat stable toxin genes in travelers' diarrhea samples.

Authors:  Bonnie P Youmans; Nadim J Ajami; Zhi-Dong Jiang; Joseph F Petrosino; Herbert L DuPont; Sarah K Highlander
Journal:  Am J Trop Med Hyg       Date:  2013-11-04       Impact factor: 2.345

7.  RNA-Catalyzed Polymerization of Deoxyribose, Threose, and Arabinose Nucleic Acids.

Authors:  David P Horning; Saikat Bala; John C Chaput; Gerald F Joyce
Journal:  ACS Synth Biol       Date:  2019-05-03       Impact factor: 5.110

8.  Development of a panel of recombinase polymerase amplification assays for detection of common bacterial urinary tract infection pathogens.

Authors:  B Raja; H J Goux; A Marapadaga; S Rajagopalan; K Kourentzi; R C Willson
Journal:  J Appl Microbiol       Date:  2017-08       Impact factor: 3.772

9.  Evaluation of Solasonine Content and Expression Patterns 
of SGT1 Gene in Different Tissues of Two Iranian Eggplant (Solanum melongena L.) Genotypes.

Authors:  Mahmoud Bagheri; Ali Akbar Shahnejat Bushehri; Mohammad Reza Hassandokht; Mohammad Reza Naghavi
Journal:  Food Technol Biotechnol       Date:  2017-06       Impact factor: 3.918

10.  Thermodynamic basis for engineering high-affinity, high-specificity binding-induced DNA clamp nanoswitches.

Authors:  Andrea Idili; Kevin W Plaxco; Alexis Vallée-Bélisle; Francesco Ricci
Journal:  ACS Nano       Date:  2013-11-20       Impact factor: 15.881

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.