| Literature DB >> 34677481 |
James Lever1, Robert Brkljača2, Colin Rix1, Sylvia Urban1.
Abstract
This study provides a review of all isolated natural products (NPs) reported for sponges within the order Verongiida (1960 to May 2020) and includes a comprehensive compilation of their geographic and physico-chemical parameters. Physico-chemical parameters were used in this study to infer pharmacokinetic properties as well as the potential pharmaceutical potential of NPs from this order of marine sponge. In addition, a network analysis for the NPs produced by the Verongiida sponges was applied to systematically explore the chemical space relationships between taxonomy, secondary metabolite and drug score variables, allowing for the identification of differences and correlations within a dataset. The use of scaffold networks as well as bipartite relationship networks provided a platform to explore chemical diversity as well as the use of chemical similarity networks to link pharmacokinetic properties with structural similarity. This study paves the way for future applications of network analysis procedures in the field of natural products for any order or family.Entities:
Keywords: cheminformatics; in silico mapping; natural products; network analysis; verongiida sponges
Mesh:
Substances:
Year: 2021 PMID: 34677481 PMCID: PMC8539549 DOI: 10.3390/md19100582
Source DB: PubMed Journal: Mar Drugs ISSN: 1660-3397 Impact factor: 5.118
Figure 1Classes of SIA, BT and BTOx. Mono spiroisoxazolines (mSIA), bis spiroisoxazolines (bSIA), bis spiroisoxazoline bromotyramine (bSIABT), mono spiroisoxazoline bromotyramine (mSIABT), mono spiroisoxazoline guanidine (mSIAGdn), bromotyramine (BT), bromotyramine oxime (BTOx), bis spiroisoxazoline guanidine (bSIAGdn), mono spiroisoxazoline bromotyrmamine oxime guanidine (mSIABTOxGdn), bromotyramine guanidine (BTGdn), bromotyramine oxime guanidine.
Distribution of SIA and BT compound classes amongst Verongiida sponges (see Supporting Information S1 for references).
| Species | mSIA | bSIA | mSIABT | bSIA | BT | BTOx | mSIAGdn | bSIAGdn | BTGdn | BTOxGdn | mSIABTOxGdn |
|---|---|---|---|---|---|---|---|---|---|---|---|
| + | + | + | + | + | |||||||
| + | |||||||||||
| + | + | + | |||||||||
| + | + | + | |||||||||
| + | + | ||||||||||
| + | + | + | + | ||||||||
| + | + | + | |||||||||
| + | + | + | + | + | |||||||
| + | + | + | + | + | |||||||
| + | |||||||||||
| + | + | + | + | ||||||||
| + | + | ||||||||||
| + | + | + | + | + | + | + | + | ||||
| + | + | ||||||||||
| + | + | + | + | + | + | ||||||
| + | |||||||||||
| + | + | + | + | ||||||||
| + | |||||||||||
| + | |||||||||||
| + | + | ||||||||||
| + | |||||||||||
| + | |||||||||||
| + | |||||||||||
| + | + | + | |||||||||
| + | |||||||||||
| + | + | + | + | + | + | ||||||
| + | + | ||||||||||
| + | + | + | + | + | + | + | |||||
| + | + | + | + | + | + | + | + | ||||
| + | + | + | + | + | + | + | + | + | + | ||
| + | + | + | + | + | + | + | + | + | |||
| + | + | + | |||||||||
| + | + | + | |||||||||
| + | + | ||||||||||
| + | + | + | + | ||||||||
| + | |||||||||||
| + | + | + | |||||||||
| + | + | + | + | + | |||||||
| + | + | + | + | + | + | ||||||
| + | + | + |
Figure 2Proposed biosynthesis of BTAs from brominated tyrosine [9,11,12,64].
Figure 3Known distribution of Verongiida sponges globally, data from Ocean Biodiversity Information System (OBIS) (heat map) [65,66]. Locations where Verongiida sponges have been sampled, yielding natural products (black dot).
Figure 4OBIS distribution of Verongiida sponge families (heat map). Locations of Verongiida sponges that have been sampled yielding NPs (coloured dots).
Figure 5Marine ecoregions of the world (MEOWs) (Black borders) [156] displaying the total number of NPs isolated from Verongiida sponges in each region.
Figure 6Bipartite network representation of species (nodes, right) and compounds (nodes, left) that exist within species (edges = curved lines) for the sponges in the order Verongiida.
Figure 7Monopartite projection of the bipartite network with respect to species (visualised using the Force Atlas layout algorithm in Gephi).
Figure 8(A) Venn diagram for compounds distributed across families of Verongiida. (B) Total number of unique compounds for each species. (C) Node degree for each species in monopartite projection. (D) Number of compounds that are shared with at least one other species.
Figure 9(A) GR Value assessing overall intra- vs. inter-genera sharing of compounds. (B) Number of compounds shared with species of the same genus. (C) Number of compounds shared with species of different genera.
Murcko scaffold analysis for NPs of all genera within the order Verongiida.
| Genus | Natural Products | Murcko Scaffolds | Singleton | Diversity | Novelty |
|---|---|---|---|---|---|
|
| 15 | 8 | 0 | 0.533 | 0 |
|
| 1 | 1 | 0 | 1 | 0 |
|
| 140 | 44 | 20 | 0.314 | 0.143 |
|
| 63 | 19 | 8 | 0.301 | 0.127 |
|
| 12 | 9 | 0 | 0.75 | 0 |
|
| 95 | 29 | 20 | 0.305 | 0.211 |
|
| 232 | 67 | 35 | 0.289 | 0.151 |
|
| 115 | 47 | 24 | 0.409 | 0.209 |
|
| 51 | 28 | 13 | 0.549 | 0.255 |
Figure 10Cumulative scaffold frequency by genus (left). Number of shared scaffolds by genus (right).
Figure 11Most frequently occurring Murcko scaffolds for each genus.
Figure 12(A) Scaffold network (SN) created using HierS type scaffolds that displays species (coloured), initialised compounds (dark grey) and scaffolds (light grey). (B) SimRank score data calculated for each species comparison in the SN, organised via genera comparing intra/inter genera relationships.
Figure 13PCA analysis of chemical descriptors MW, TPSA, nRotB, nHBDon, nHBAcc and cLogP.
PCA descriptive statistics.
| PC1 | PC2 | PC3 | PC4 | PC5 | |
|---|---|---|---|---|---|
| Eigenvalues | 3.8916 | 1.2130 | 0.5068 | 0.2505 | 0.0841 |
| Proportion of variance | 0.649 | 0.202 | 0.084 | 0.014 | 0.009 |
| Cumulative proportion | 0.649 | 0.851 | 0.935 | 0.991 | 1.000 |
PCA loadings for all compound descriptors.
| PC1 | PC2 | PC3 | PC4 | PC5 | |
|---|---|---|---|---|---|
| MW | 0.472 | 0.147 | −0.133 | −0.476 | −0.636 |
| TPSA | 0.458 | −0.338 | 0.020 | 0.049 | 0.538 |
| nHBAcc | 0.471 | −0.199 | 0.122 | −0.453 | 0.277 |
| nHBDon | 0.420 | −0.374 | −0.153 | 0.681 | −0.374 |
| nRotB | 0.307 | 0.522 | 0.748 | 0.270 | −0.002 |
| cLogP | 0.273 | 0.642 | −0.619 | 0.172 | 0.299 |
Figure 14Euclidean distance of genera based on pharmacokinetic properties of compounds.
Figure 15(A) Species drug score network. (B) Drug score distribution by genus. (C) Lipinski statistics by genus.
Figure 16(A) Similarity network with Louvain clustering applied. (B) Average drug score for each cluster arranged by genera. (C) Similarity network highlighting compounds that conform to the Lipinski/Veber rules. (D) Similarity network ranking compounds based on their drug score.
Figure 17Compounds from Verongiida sponges that achieve a drug score of ≥0.75.
Molecular descriptors for compounds that achieve a drug score ≥ 0.75.
| Compound | Cluster | Drug Score | Genus | MW | TPSA | nHBAcc | nHBDon | nRotB | cLogP |
|---|---|---|---|---|---|---|---|---|---|
|
| 3 | 0.81 |
| 323.03 | 23.47 | 2 | 1 | 3 | 2.82 |
|
| 3 | 0.84 |
| 337.05 | 12.47 | 2 | 0 | 4 | 3.09 |
|
| 3 | 0.79 |
| 380.12 | 38.49 | 3 | 1 | 7 | 2.63 |
|
| 3 | 0.81 |
| 396.12 | 58.72 | 4 | 2 | 7 | 1.56 |
|
| 3 | 0.79 |
| 452.19 | 61.8 | 4 | 2 | 9 | 2.42 |
|
| 3 | 0.78 |
| 407.1 | 49.77 | 4 | 1 | 7 | 1.89 |
|
| 3 | 0.75 |
| 408.18 | 15.71 | 3 | 0 | 8 | 3.25 |
|
| 2 | 0.77 |
| 460.13 | 99.6 | 6 | 4 | 7 | 2.56 |
|
| 2 | 0.88 |
| 381.23 | 99.6 | 5 | 3 | 7 | 1.84 |
|
| 2 | 0.81 |
| 323.23 | 118.2 | 5 | 3 | 6 | 0.14 |
|
| 2 | 0.8 |
| 437.13 | 82.95 | 5 | 3 | 8 | 2.39 |
|
| 2 | 0.77 |
| 475.14 | 118.83 | 7 | 5 | 7 | 1.84 |
|
| 2 | 0.77 |
| 379.23 | 127.43 | 5 | 3 | 7 | 1.19 |
|
| 2 | 0.77 |
| 460.13 | 99.6 | 5 | 3 | 7 | 2.56 |
|
| 2 | 0.77 |
| 434.33 | 96.94 | 5 | 3 | 10 | 3.16 |
|
| 2 | 0.75 |
| 475.14 | 125.62 | 6 | 4 | 7 | 2.25 |
|
| 2 | 0.77 |
| 461.11 | 136.62 | 6 | 5 | 6 | 1.98 |
|
| 2 | 0.89 |
| 367.2 | 110.6 | 5 | 4 | 6 | 1.56 |
|
| 2 | 0.75 |
| 475.14 | 125.62 | 7 | 5 | 7 | 2.25 |
|
| 16 | 0.96 |
| 254.29 | 63.19 | 3 | 2 | 1 | 1.01 |
|
| 16 | 0.91 |
| 333.19 | 63.19 | 3 | 2 | 1 | 1.74 |
|
| 7 | 0.93 |
| 201.61 | 66.4 | 3 | 2 | 0 | −0.57 |
|
| 7 | 0.93 |
| 201.61 | 66.4 | 3 | 2 | 0 | −0.57 |
|
| 21 | 0.79 |
| 454.12 | 75.08 | 5 | 2 | 3 | 1.94 |
|
| 21 | 0.77 |
| 468.15 | 66.29 | 5 | 1 | 3 | 2.19 |
|
| 42 | 0.94 |
| 126.11 | 61.69 | 3 | 2 | 0 | 0.06 |
|
| 50 | 0.95 |
| 154.17 | 97.92 | 4 | 5 | 2 | −1.28 |
|
| 39 | 0.98 |
| 137.14 | 46.92 | 2 | 1 | 0 | 0.38 |
|
| 19 | 0.81 |
| 346.07 | 19.03 | 1 | 1 | 3 | 3.2 |
|
| 43 | 0.79 |
| 185.23 | 53.6 | 3 | 2 | 2 | 0.02 |
|
| 66 | 0.97 |
| 256.31 | 63.19 | 3 | 2 | 2 | 0.8 |
|
| 57 | 0.97 |
| 244.29 | 49.41 | 2 | 1 | 2 | 0.85 |
MW = molecular weight, TPSA = total polar surface Area, nHBAcc = number of hydrogen bond acceptors, nHBDon = number of hydrogen bond donors, nRotB = number of rotatable bonds, cLogP = octanol/water partition coefficient.
Taxonomy of sponges in the order Verongiida.
| Order | Family | Genera (Total NPs) |
|---|---|---|
| Verongiida | Aplysinellidae | |
| Aplysinidae | ||
| Ernstillidae | ||
| Ianthellidae | ||
| Pseudoceratinidae |
Figure 18Conceptual scheme for networking NP compound libraries. (A) Collecting the NP literature obtained from SciFinder database with an emphasis on keyword searches such as genus and species name. (B) Curating the literature to form a database including all pertinent attributes of compounds such as species and sample location. (C) Reading and canonising SMILES codes from the NP database using in house python code and RDKit library. (D) Creation of a unique binary fingerprint for all molecules. (E) Calculating the Tanimoto similarity value for all possible compound comparisons. (F) Combining similarity calculations into an edge list describing all edge information for networks and creation of a node list including all nodes that will feature in the network and their attribute information from the NP database. (G) Using the Gephi software package to input node and edge list information to visualise the network.