| Literature DB >> 27570672 |
Joseph D Romano1, Nicholas P Tatonetti1.
Abstract
Venoms and venom-derived compounds constitute a rich and largely unexplored source of potentially therapeutic compounds. To facilitate biomedical research, it is necessary to design a robust informatics infrastructure that will allow semantic computation of venom concepts in a standardized, consistent manner. We have designed an ontology of venom-related concepts - named Venom Ontology - that reuses an existing public data source: UniProt's Tox-Prot database. In addition to describing the ontology and its construction, we have performed three separate case studies demonstrating its utility: (1) An exploration of venom peptide similarity networks within specific genera; (2) A broad overview of the distribution of available data among common taxonomic groups spanning the known tree of life; and (3) An analysis of the distribution of venom complexity across those same taxonomic groups. Venom Ontology is publicly available on BioPortal at http://bioportal.bioontology.org/ontologies/CU-VO.Entities:
Year: 2016 PMID: 27570672 PMCID: PMC5001765
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Distribution of venom complexity across the tree of life, by common taxonomic groups. A venom’s complexity is defined as the number of distinct peptide components it contains.
| Common Name | Minimum | Median | Mean | Maximum | Skewness |
|---|---|---|---|---|---|
| Molluscs | 1 | 4 | 11.230 | 118 | 3.638 |
| Insects | 1 | 2 | 3.101 | 15 | 2.211 |
| Arachnids | 1 | 4 | 13.020 | 293 | 6.576 |
| Fish | 1 | 2 | 2.800 | 6 | 1.517 |
| Amphibians | 1 | 1.5 | 1.500 | 2 | n/a |
| Reptiles | 1 | 4 | 9.496 | 64 | 2.271 |
| Mammals | 6 | 6 | 6.000 | 6 | n/a |
| All Species | 1 | 4 | 9.922 | 293 | 7.987 |
Skewness is the estimated third standardised moment of the empirical distribution. Higher skewness indicates greater lack of symmetry about the mean
Distribution of species and sequences in Venom Ontology across common taxonomic groups. Some groups with no species or sequences are included for completeness.
| Common Name | Taxonomic group(s) | # species in ontology | % total species | # sequences in ontology | % total sequences |
|---|---|---|---|---|---|
| Cnidarians | Cnidaria | 0 | 0.00% | 0 | 0.00% |
| Molluscs | Mollusca | 97 | 15.80% | 1089 | 17.88% |
| Insects | Insecta | 79 | 12.87% | 245 | 4.02% |
| Arachnids | Arachnida | 183 | 29.80% | 1089 | 17.88% |
| Fish | Actinopterygii | 4 | 0.65% | 12 | 0.20% |
| Coelacanthimorpha | 0 | 0.00% | 0 | 0.00% | |
| Chondrichthyes | 1 | 0.16% | 2 | 0.03% | |
| Cyclostomata | 0 | 0.00% | 0 | 0.00% | |
| Dipnoi | 0 | 0.00% | 0 | 0.00% | |
| Amphibians | Amphibia | 2 | 0.33% | 3 | 0.05% |
| Reptiles | Archelosauria | 0 | 0.00% | 0 | 0.00% |
| Squamata | 242 | 39.41% | 2298 | 37.72% | |
| Birds | Aves | 0 | 0.00% | 0 | 0.00% |
| Mammals | Mammalia | 1 | 0.16% | 6 | 0.10% |
| Other/unclassified | 5 | 0.81% | 1348 | 22.13% | |
| Total | 614 | 6092 |
Mann-Whitney U test results for all pairs of venom complexity distributions. A p-value of less than 0.05 signifies that two distributions are statistically different.
| Arachnids | Fish | Insects | Mammals | Molluscs | Reptiles | |
|---|---|---|---|---|---|---|
| Amphibians | 0.1165 | 0.417 | 0.4389 | 0.6667 | 0.1669 | 0.1262 |
| Arachnids | 0.1554 |
| 0.8424 | 0.9086 | 0.7252 | |
| Fish | 0.732 | 0.3657 | 0.2162 | 0.1701 | ||
| Insects | 0.1935 |
|
| |||
| Mammals | 0.8584 | 0.8128 | ||||
| Molluscs | 0.8778 |