| Literature DB >> 29453389 |
My Kieu Ha1, Tung Xuan Trinh1, Jang Sik Choi2, Desy Maulina1, Hyung Gi Byun2, Tae Hyun Yoon3.
Abstract
Development of nanotoxicity prediction models is becoming increasingly important in the risk assessment of engineered nanomaterials. However, it has significant obstacles caused by the wide heterogeneities of published literature in terms of data completeness and quality. Here, we performed a meta-analysis of 216 published articles on oxide nanoparticles using 14 attributes of physicochemical, toxicological and quantum-mechanical properties. Particularly, to improve completeness and quality of the extracted dataset, we adapted two preprocessing approaches: data gap-filling and physicochemical property based scoring. Performances of nano-SAR classification models revealed that the dataset with the highest score value resulted in the best predictivity with compromise in its applicability domain. The combination of physicochemical and toxicological attributes was proved to be more relevant to toxicity classification than quantum-mechanical attributes. Overall, by adapting these two preprocessing methods, we demonstrated that meta-analysis of nanotoxicity literatures could provide an effective alternative for the risk assessment of engineered nanomaterials.Entities:
Year: 2018 PMID: 29453389 PMCID: PMC5816655 DOI: 10.1038/s41598-018-21431-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Workflow of data collection, preprocessing, model development, validation and interpretation.
Data attributes.
| Dosage | PChem attributes | QM attributes | Tox attributes | ||
|---|---|---|---|---|---|
| Dose (μg/mL) | Core size (nm) | Surface charge (mV) | Formation enthalpy ΔHsf (eV) | Assay | Cell type (normal/cancer) |
| Method for core size | Method for surface charge | Conduction band energy Ec (eV) | Cell name | Exposure time (hours) | |
| Hydrodynamic size (nm) | Specific surface area (m2/g) | Valence band energy Ev (eV) | Cell species | Viability (%) | |
| Method for hydrodynamic size | Method for specific surface area | Electronegativity χMeO (eV) | Cell origin | ||
Figure 2(a) Missing data map for PChem attributes in the original dataset; (b) Datasets with different preprocessing steps; (c) Effect of data gap filling on PChem score distribution.
Scoring rules for PChem data.
| Attribute | Criteria | Score | |
|---|---|---|---|
| Core size | Data source | - Experimentally measured by the authors | 3 |
| - Adapted from manufacturers’ specifications | 2 | ||
| - Adapted from other references using the same nanomaterials and experimental conditions | 1 | ||
| - No data | 0 | ||
| Data method | - TEM | 2 | |
| - Estimated from specific surface area - Other methods (e.g., SEM/AFM) | 1 | ||
| - No information | 0 | ||
| Hydrodynamic size | Data source | - Experimentally measured by the authors | 3 |
| - Adapted from manufacturers’ specifications | 2 | ||
| - Adapted from other references using the same nanomaterials and experimental conditions | 1 | ||
| - No data | 0 | ||
| Data method | - DLS/NTA | 2 | |
| - Other methods | 1 | ||
| - No information | 0 | ||
| Surface charge | Data source | - Experimentally measured by the authors | 3 |
| - Adapted from manufacturers’ specifications | 2 | ||
| - Adapted from other references using the same nanomaterials and experimental conditions | 1 | ||
| - No data | 0 | ||
| Data method | - Zeta potential | 2 | |
| - Other methods | 1 | ||
| - No information | 0 | ||
| Specific surface area | Data source | - Experimentally measured by the authors | 3 |
| - Adapted from manufacturers’ specifications | 2 | ||
| - Adapted from other references using the same nanomaterials and experimental conditions | 1 | ||
| - No data | 0 | ||
| Data method | - BET | 2 | |
| - Estimated from core size - Other methods | 1 | ||
| - No information | 0 | ||
TEM: Transmission Electron Microscopy; SEM: Scanning Electron Microscopy; AFM: Atomic Force Microscopy; XRD: X-Ray Diffraction; DLS: Dynamic Light Scattering; NTA: Nanoparticle Tracking Analysis; BET: Brunauer-Emmett-Teller method.
Validation results of models built upon datasets with different preprocessing steps.
| I | II | III-A | III-B | |
|---|---|---|---|---|
| Precision* | 80% | 83% | 84% | 91% |
| Sensitivity* | 3% | 65% | 74% | 88% |
| Accuracy | 85% | 94% | 95% | 95% |
| F1 score | 6% | 73% | 79% | 89% |
*Precision and sensitivity were calculated with “Toxic” class as positive.
Figure 3Comparison between “Toxic” and “Nontoxic” data rows in the (a) training set and (b) test set of each dataset.
Applicability domains regarding the numerical attributes.
| Attribute | I | II | III-A | III-B | ||||
|---|---|---|---|---|---|---|---|---|
| Min. | Max. | Min. | Max. | Min. | Max. | Min. | Max. | |
| Dose (μg/mL) | 0 | 10000 | 0 | 167000 | 0 | 1500 | 0 | 1500 |
| Time (h) | 0 | 360 | 1 | 168 | 2 | 72 | 6 | 72 |
| Core size (nm) | 2.7 | 629 | 2.7 | 496 | 5 | 496 | 5.9 | 193 |
| Hydro. size (nm) | 8.6 | 6181 | 8.6 | 2300 | 12.5 | 1463 | 12.5 | 1457 |
| Surface charge (mV) | −63.3 | 61.9 | −63.3 | 61.9 | −52 | 61.9 | −47.6 | 42.8 |
| Surface area (m2/g) | 0.8 | 1150 | 5.5 | 576 | 5.5 | 576 | 6 | 576 |
| ΔHsf (eV) | −64.7 | −1.2 | −64.7 | −1.2 | −26.8 | −1.2 | −26.8 | −1.6 |
| Ec (eV) | −6.6 | −0.1 | −6.6 | −0.1 | −5.2 | −0.1 | −5.3 | −0.3 |
| Ev (eV) | −11.4 | −5.0 | −11.3 | −5.0 | −11.1 | −5.0 | −11.4 | −5.0 |
| χ (eV) | 3.2 | 8.3 | 3.4 | 8.3 | 3.4 | 6.8 | 3.8 | 8.3 |
Validation results of models built upon datasets with different attribute combinations.
| Attributes | III-A | III-B | ||
|---|---|---|---|---|
| Accuracy | F1 score | Accuracy | F1 score | |
| Dose + PChem | 93% | 75% | 96% | 92% |
| Dose + QM | 94% | 74% | 92% | 84% |
| Dose + Tox | 86% | 30% | 89% | 78% |
| Dose + PChem + QM | 93% | 75% | 94% | 87% |
| Dose + PChem + Tox | 95% | 81% | 96% | 93% |
Figure 4Leave-one-out OOB errors against attributes.