| Literature DB >> 36235142 |
Edoardo Luca Viganò1, Erika Colombo1, Giuseppa Raitano1, Alberto Manganaro2, Alessio Sommovigo2, Jean Lou Cm Dorne3, Emilio Benfenati1.
Abstract
Read-across applies the principle of similarity to identify the most similar substances to represent a given target substance in data-poor situations. However, differences between the target and the source substances exist. The present study aims to screen and assess the effect of the key components in a molecule which may escape the evaluation for read-across based only on the most similar substance(s) using a new open-access software: Virtual Extensive Read-Across (VERA). VERA provides a means to assess similarity between chemicals using structural alerts specific to the property, pre-defined molecular groups and structural similarity. The software finds the most similar compounds with a certain feature, e.g., structural alerts and molecular groups, and provides clusters of similar substances while comparing these similar substances within different clusters. Carcinogenicity is a complex endpoint with several mechanisms, requiring resource intensive experimental bioassays and a large number of animals; as such, the use of read-across as part of new approach methodologies would support carcinogenicity assessment. To test the VERA software, carcinogenicity was selected as the endpoint of interest for a range of botanicals. VERA correctly labelled 70% of the botanicals, indicating the most similar substances and the main features associated with carcinogenicity.Entities:
Keywords: botanicals; carcinogenicity; read-across; software
Mesh:
Year: 2022 PMID: 36235142 PMCID: PMC9570968 DOI: 10.3390/molecules27196605
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.927
Figure 1Case Study—An overall scheme of the Virtual Extensive Read-Across (VERA) algorithm. Given a target, VERA searches for similar compounds according to the Virtual models for Evaluating the properties of chemicals within a Global Architecture (VEGA) similarity index (>0.65) and the presence of structural alerts (SA, red circle) if the target substance contains any; if all similar substances are concordant in experimental values (toxic or NON-toxic), the target substance will be labelled accordingly. In the figure, molecular groups (MGs) in common between the target and similar compounds (MG1, green circle; MG2, blue circle) and not in common (yellow circles) are highlighted.
Figure 2Reasoning within the VERA algorithm. For the SA in the target molecule, the algorithm analyses the co-presence of SA and MGs for similar compounds to assess the toxicity of the target and searches for any exception rules that may modulate toxicity. In the example of the figure, VERA found MG1 as an exception rule.
Figure 3Case Study—Reasoning of the VERA algorithm. Without SA in the target molecule, VERA follows the reasoning: the algorithm analyses the co-presence of the MG with the higher toxic prevalence in the dataset and other MGs. In this case, VERA found MG2 as the MG with higher toxicity prevalence.
Classification parameters and the confusion matrix of botanicals.
| Sensitivity | 0.69 | ||
| Specificity | 0.70 | ||
| Accuracy | 0.69 | ||
| Precision | 0.69 | ||
|
|
|
|
|
| Carcinogen | 18 | 8 | 4 |
| Non-Carcinogen | 8 | 19 | 6 |
Figure 4Statistical values of several QSAR models (Toxtree and VEGA for carcinogenicity) compared with VERA results.