Maik Friedel, Swetlana Nikolajewa, Jürgen Sühnel, Thomas Wilhelm.
Abstract
DiProDB (http://diprodb.fli-leibniz.de) is a database of conformational and thermodynamic dinucleotide properties. It includes datasets for both DNA and RNA, and for both single and double strands. The data have been shown to be important for understanding different aspects of nucleic acid structure and function, and they can also be used for encoding nucleic acid sequences. The database is intended to facilitate further applications of dinucleotide properties. A number of property datasets are highly correlated; the database therefore comes with a correlation analysis facility. Authors who have determined new sets of dinucleotide property values are invited to submit these data to DiProDB.
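The abstract's remark that dinucleotide properties can be used to encode nucleic acid sequences can be illustrated with a minimal sketch. It assumes a property set is a lookup table from each of the 16 dinucleotides to one number, and that a sequence is read as overlapping dinucleotides; both are assumptions made here for illustration. The values in `TOY_PROPERTY` are placeholders, not DiProDB data, and `encode_sequence` is a hypothetical helper name.

```python
from itertools import product

# All 16 DNA dinucleotides in a fixed order: AA, AC, AG, AT, CA, ...
DINUCLEOTIDES = ["".join(pair) for pair in product("ACGT", repeat=2)]

# Placeholder values (0.0, 1.0, ..., 15.0). These are NOT real DiProDB
# property values; substitute a dataset obtained from the database.
TOY_PROPERTY = {dn: float(i) for i, dn in enumerate(DINUCLEOTIDES)}


def encode_sequence(seq, prop):
    """Return the property value of each overlapping dinucleotide in seq."""
    seq = seq.upper()
    return [prop[seq[i:i + 2]] for i in range(len(seq) - 1)]


# "ACGT" contains the overlapping dinucleotides AC, CG and GT.
profile = encode_sequence("ACGT", TOY_PROPERTY)
print(profile)
```

A sequence of length n yields a numeric profile of length n - 1, which can then be passed to any numerical method.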
Year: 2008 PMID: 18805906 PMCID: PMC2686603 DOI: 10.1093/nar/gkn597
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Number of dinucleotide property datasets for each category
| Category | Class | Number of datasets |
|---|---|---|
| Nucleic acid type | DNA | 93 |
| Nucleic acid type | DNA/RNA | 7 |
| Nucleic acid type | RNA | 15 |
| Strand information | Double | 103 |
| Strand information | Single | 12 |
| Mode of property determination | Experimental | 33 |
| Mode of property determination | Theoretical/calculated | 82 |
| Property type | Thermodynamical | 34 |
| Property type | Conformational | 74 |
| Property type | Letter-based | 7 |
Figure 1. Screenshot of the DiProDB table displaying search results for the term ‘twist’ (conformational dinucleotide property) in the property name.
Figure 2. Pearson's correlation coefficients for five sets of twist angles. ID (Ref.): 1 (9), 61 (10), 88 (11), 92 (12) and 98 (13). Correlation coefficients >0.8 are coloured in green.
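The kind of comparison summarised in Figure 2 can be sketched as follows: compute Pearson's correlation coefficient for every pair of property sets and report the pairs above 0.8, the threshold named in the caption. This is an illustrative sketch only, not the database's own implementation. The dataset IDs `A`, `B`, `C` and their values are hypothetical placeholders, and `pearson` and `correlated_pairs` are names introduced here.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson's correlation coefficient of two equal-length value lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("correlation is undefined for constant data")
    return cov / (norm_x * norm_y)


def correlated_pairs(datasets, threshold=0.8):
    """List (id1, id2, r) for every pair of datasets with r > threshold."""
    pairs = []
    for id1, id2 in combinations(sorted(datasets), 2):
        r = pearson(datasets[id1], datasets[id2])
        if r > threshold:
            pairs.append((id1, id2, r))
    return pairs


# Placeholder data: 16 values per set, one per dinucleotide.
# These numbers are NOT taken from DiProDB.
base = [float(i) for i in range(16)]
datasets = {
    "A": base,
    "B": [2.0 * v + 1.0 for v in base],         # linear in A
    "C": [(-1.0) ** i for i in range(16)],      # alternating sign
}

pairs = correlated_pairs(datasets)
for id1, id2, r in pairs:
    print(f"{id1} vs {id2}: r = {r:.3f}")
```

With these placeholder values only the pair `A`/`B` exceeds the threshold, because `B` is an exact linear function of `A`.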
Figure 3. Hierarchical clustering of all 23 B-DNA double-strand physicochemical properties and the three letter-based dinucleotide quantities GC content, purine (GA) content and keto (GT) content. The property sets are designated by their IDs and names.
Content of supplementary material
| Item | Description |
|---|---|
| Figure S1 | Single linkage hierarchical clustering of 115 dinucleotide properties. |
| Figure S2 | Ward hierarchical clustering of 115 dinucleotide properties. |
| Figure S3 | 115 dinucleotide properties in the first two principal components. |
| Figure S4 | 115 dinucleotide properties in the first and third principal components. |
| Figure S5 | The 16 dinucleotides in the first two principal components. |
| Table S1 | Percentage of importance of the 15 PCs carrying >10⁻¹⁴% of information. |
| Table S2 | Percentage of importance of the first 10 dinucleotide properties in the first 15 PCs in decreasing order. |
| Table S3 | Involvement of the 10 most important dinucleotide properties in the PCs 1–15. |
Figure 4. All dinucleotide properties plotted in the first two PCs. A few of them are designated by property name and ID.