| Literature DB >> 31936210 |
Irini Furxhi1,2, Finbarr Murphy1,2, Martin Mullins1,2, Athanasios Arvanitis3, Craig A Poland4.
Abstract
Machine Learning (ML) techniques have been applied in the field of nanotoxicology with very encouraging results. Adverse effects of nanoforms are affected by multiple features described by theoretical descriptors, nano-specific measured properties, and experimental conditions. ML has been proven very helpful in this field in order to gain an insight into features effecting toxicity, predicting possible adverse effects as part of proactive risk analysis, and informing safe design. At this juncture, it is important to document and categorize the work that has been carried out. This study investigates and bookmarks ML methodologies used to predict nano (eco)-toxicological outcomes in nanotoxicology during the last decade. It provides a review of the sequenced steps involved in implementing an ML model, from data pre-processing, to model implementation, model validation, and applicability domain. The review gathers and presents the step-wise information on techniques and procedures of existing models that can be used readily to assemble new nanotoxicological in silico studies and accelerates the regulation of in silico tools in nanotoxicology. ML applications in nanotoxicology comprise an active and diverse collection of ongoing efforts, although it is still in their early steps toward a scientific accord, subsequent guidelines, and regulation adoption. This study is an important bookend to a decade of ML applications to nanotoxicology and serves as a useful guide to further in silico applications.Entities:
Keywords: computational; in silico; machine learning; nanoforms; nanoparticle; nanotoxicology
Year: 2020 PMID: 31936210 PMCID: PMC7023261 DOI: 10.3390/nano10010116
Source DB: PubMed Journal: Nanomaterials (Basel) ISSN: 2079-4991 Impact factor: 5.076
Review protocol.
| Subject | Description | Subject | Description |
|---|---|---|---|
|
| Google Scholar, Elsevier (Scopus and ScienceDirect), Web of Science and PubMed |
| Studies predicting nano-properties, environmental outcomes, pharmacokinetic modelling |
|
| nanoparticle, nanomaterial, in silico, computational, machine learning, model, nanotoxicity |
| Peer-reviewed journals and reports |
|
| title, abstract, keywords |
| 2010–2019 |
Figure 1A summarized general roadmap for implementing a model in the field of nanotoxicology. The roadmap can be divided into five main parts: dataset formation, data pre-processing, model implementation, model validation, and applicability domain.
An overview of selection techniques used in the reviewed studies.
| Feature Selection | Description | References |
|---|---|---|
|
| Widely used for analysis of multivariate datasets applies transformation of observations to PC space with an objective to minimize the correlation and maximize the variance. | [ |
|
| Applied to predict a set of dependent variables from independent ones, finding the best correlation between them by extracting a number of latent variables preserves information. PLS reveals the most important variables and determines the influence of inputs on output. Star plots produce qualitative selections regarding descriptor importance. | [ |
|
| A resampling technique preceding bootstrap that estimates variance and bias. | [ |
|
| GA is applied to select from descriptors the best combinations for highest predictivity. Based on biological evolution, GA performs function optimization stochastically. | [ |
|
| ERM is a full search algorithm that avoids local minima and shows little dependency on the initial set of descriptors. As such, it can be preferable to GA, depending on the case. | [ |
|
| The GFA method finds out the most frequent descriptors in a large set. The GFA smoothing factor controls the number of independent variables and is varied to determine the optimal number of descriptors. | [ |
|
| At each step of the selection process, the descriptor that led to the highest model performance is retained until a specified number of descriptors are selected. As an extension to SFS, after each forward selection step, SFFS conducts backward elimination to evaluate descriptors that can be removed. | [ |
|
| (1) In MLR, a set of models is examined for stability and validity. | [ |
|
| (1) Evaluation and ranking for selecting descriptors based on the variance reduction or entropy as a measure of information gain. | [ |
Figure 2Model (cases) categories, their population (left), and detailed breakdown (right, zoomed box) as extracted from the 273 cases derived from the 86 studies gathered. Instance based (Inst Based), decision tress (D. Tree), Bayesian networks (Bayes), neural networks (N. Network), and dimensionality reduction algorithms (D. reduction).
Endpoints predicted by trees category extracted from the studies gathered.
| Reference | NMs Category | Output | Reference | NMs Category | Output |
|---|---|---|---|---|---|
| [ | Carbon-based, Metal, Metal Oxide, Quantum Dots | Accumulation, reproductive toxicity | [ | Metal, Metal oxide | Cellular Viability |
| [ | Carbon-based | Total protein, Macrophages, Membrane integrity, Neutrophils | [ | Metal | |
| [ | Metal, dendrimer, metal oxide, polymeric | Aggregated | [ | Dendrimers | |
| [ | Metal, Metal oxide, Quantum Dots | [ | Carbon-based | ||
| [ | Metal, Metal oxide | Aggregated, Exocytosis, Viability | [ | ||
| [ | Metal | Cell association | [ | Carbon-based, Metal, Metal Oxide, Polymeric, dendrimers, Quantum Dots | |
| [ | Metal | [ | Metal | ||
| [ | Metal Oxide | Cellular uptake | [ | Quantum Dots | |
| [ | Carbon-based | Dose-response | [ | Metal Oxide | |
| [ | Metal Oxide | Membrane integrity | [ | ||
| [ | Metal, Metal oxide | Minimum Inhibitory Concentration (MIC), Viability | [ | ||
| [ | [ | ||||
| [ | Carbon-based | Mitotoxicity | [ | ||
| [ | Metal Oxide | No-Observed-Adverse-Effect concentration (NOAEC), Oxidative stress, Protein carbonylation | |||
Endpoints predicted by regression tools extracted from the studies gathered.
| Reference | NMs Category | Output | Reference | NMs Category | Output |
|---|---|---|---|---|---|
| [ | Carbon-based | Aggregated, Viability | [ | Metal Oxide | Viability |
| [ | Metal Oxide | Apoptosis, Cellular uptake | [ | ||
| [ | Apoptosis | [ | |||
| [ | Metal | Cell association | [ | ||
| [ | [ | ||||
| [ | Carbon-based | Mitotoxicity | [ | ||
| [ | Metal Oxide | Cellular uptake | [ | ||
| [ | [ | ||||
| [ | [ | ||||
| [ | [ | ||||
| [ | [ | ||||
| [ | Metal Oxide, Quantum Dots | Inhibition Ratio, Viability | [ | ||
| [ | Carbon-based | Mutagenicity | [ | ||
| [ | Metal Oxide | Membrane integrity, oxidative stress | [ | ||
| [ | Metal Oxide | Membrane integrity | [ | ||
| [ | [ | ||||
| [ | [ | Dendrimers | |||
| [ | [ | Metal | |||
| [ | [ |
Endpoints predicted by instance-based tools extracted from the studies gathered.
| Reference | NMs Category | Output | Reference | NMs Category | Output |
|---|---|---|---|---|---|
| [ | Carbon-based | exposed/not exposed groups | [ | Metal Oxide | Viability |
| [ | Metal, Metal oxide, Quantum Dots | Aggregated, Cellular uptake | [ | Carbon-based | |
| [ | Metal | Aggregated | [ | Metal Oxide | |
| [ | Metal, dendrimer, metal oxide, polymeric | [ | Dendrimers | ||
| [ | Metal | Cell association | [ | Metal | |
| [ | [ | Carbon-based | |||
| [ | [ | Metal Oxide | Dose-response | ||
| [ | Metal Oxide | Cellular uptake | [ | ||
| [ | [ | Membrane integrity | |||
| [ | Metal, dendrimer, metal oxide, polymeric | Mortality rate | [ | Metal, Metal oxide | MIC, mortality rate, viability |
Endpoints predicted by a neural network extracted from the studies gathered.
| Reference | NMs Category | Output | Reference | NMs Category | Output |
|---|---|---|---|---|---|
| [ | Metal, Metal oxide, Quantum Dots | Aggregated | [ | Polymeric | Viability |
| [ | [ | Metal Oxide | |||
| [ | Metal, Metal oxide | [ | |||
| [ | Metal Oxide | Apoptosis | [ | ||
| [ | Quantum Dots | [ | Metal, Metal Oxide, Quantum Dots | ||
| [ | Metal | Cell association | [ | Metal Oxide | Membrane integrity |
| [ | Metal Oxide | Cellular uptake | [ | Metal Oxide | |
| [ | [ | Carbon-based | Mitotoxicity | ||
| [ | Metal Oxide | Oxidative stress |
Endpoints predicted by a dimensionality reduction extracted from the studies gathered.
| Reference | NMs Category | Output | Reference | NMs Category | Output |
|---|---|---|---|---|---|
| [ | Polymeric | Arginase: iNOS, cathepsin, IL-10/protein, TNF-α/protein | [ | Metal Oxide, Quantum Dots | Viability |
| [ | Metal, Metal oxide, Quantum Dots | Aggregated | [ | Metal Oxide | |
| [ | [ | ||||
| [ | Metal, Metal oxide | [ | Metal, Metal oxide | ||
| [ | [ | ||||
| [ | Metal | Cell association | [ | Metal | Exocytosis |
| [ | [ | Metal Oxide | Membrane integrity | ||
| [ | Carbon-based | Mitotoxicity |
Endpoints predicted by the ensemble extracted from the studies gathered.
| Reference | NMs Category | Output |
|---|---|---|
| [ | Metal, dendrimer, metal oxide, polymeric | Aggregated |
| [ | Metal Oxide, Quantum Dots | Aggregated, cellular uptake, viability |
| [ | Metal Oxide | Cellular uptake |
| [ | Metal, Metal Oxide | MIC, mortality rate, viability |
| [ | Metal Oxide | Viability |
| [ | ||
| [ | ||
| [ | Dendrimers | |
| [ | Carbon-based |
Endpoints predicted by Bayes models extracted from the studies gathered.
| Reference | NMs Category | Output |
|---|---|---|
| [ | Metal, Metal oxide, polymeric | Disrupted cellular processes |
| [ | Quantum Dots | IC50, viability |
| [ | Metal, Metal Oxide | Aggregated |
| [ | Carbon-based, Metal, Metal Oxide | |
| [ | Metal, Metal Oxide | |
| [ | Metal Oxide | Viability |
| [ | Metal | |
| [ | Dendrimers |
Figure 3Machine learning categories used over the last decade (left) and their relation with data size samples (right).
Figure 4Machine learning categories vs. number of descriptors (right) and vs percentage of p-chem data cases over the years (left).
Figure 5Methods determining the applicability domain of a model.
Studies performing goodness-of-fit, robustness, and predictivity and assessing the applicability domain.
| Reference | Algorithm Category | Endpoint Class | Reference | Algorithm Category | Endpoint Class |
|---|---|---|---|---|---|
| [ | Regression | Numerical | [ | Neural Networks | Numerical |
| [ | [ | Meta | |||
| [ | [ | Trees | Binary | ||
| [ | [ | ||||
| [ | [ | Regression, Dimen. Red. | Numerical | ||
| [ | [ | ||||
| [ | [ | Trees, meta | |||
| [ | [ | Neural networks, instance based, trees, regression | |||
| [ | [ | Binary | |||
| [ | [ | Meta, trees, instance based | |||
| [ | [ | ||||
| [ | Instance Based | Numerical | [ | Instance Based | |
| [ | [ |