| Literature DB >> 36081121 |
Lara Mauri1, Ernesto Damiani1,2.
Abstract
The application of emerging technologies, such as Artificial Intelligence (AI), entails risks that need to be addressed to ensure secure and trustworthy socio-technical infrastructures. Machine Learning (ML), the most developed subfield of AI, allows for improved decision-making processes. However, ML models exhibit specific vulnerabilities that conventional IT systems are not subject to. As systems incorporating ML components become increasingly pervasive, the need to provide security practitioners with threat modeling tailored to the specific AI-ML pipeline is of paramount importance. Currently, there exist no well-established approach accounting for the entire ML life-cycle in the identification and analysis of threats targeting ML techniques. In this paper, we propose an asset-centered methodology-STRIDE-AI-for assessing the security of AI-ML-based systems. We discuss how to apply the FMEA process to identify how assets generated and used at different stages of the ML life-cycle may fail. By adapting Microsoft's STRIDE approach to the AI-ML domain, we map potential ML failure modes to threats and security properties these threats may endanger. The proposed methodology can assist ML practitioners in choosing the most effective security controls to protect ML assets. We illustrate STRIDE-AI with the help of a real-world use case selected from the TOREADOR H2020 project.Entities:
Keywords: artificial intelligence security; threat modeling; vulnerability assessment
Mesh:
Year: 2022 PMID: 36081121 PMCID: PMC9459912 DOI: 10.3390/s22176662
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Our reference ML life-cycle [3].
Figure 2Assets in the AI ecosystem [3].
ML data assets and the corresponding answers to FMEA asset and failure modes questions.
| ML Data Asset | Answers to FMEA Asset Questions | Answers to FMEA Failure Modes Questions |
|---|---|---|
| - Requirements support a clear understanding of the ML model’s business context and goals.
| - Unclear requirements may lead to inaccurate AI-ML models.
| |
| - Raw data constitute the primary source of information needed to achieve the model’s business goals.
| - Raw data may not be sufficiently representative of the domain or unfit the AI-ML model business goal (e.g., due to sample size and population characteristics).
| |
| - Pre-processed data create a data set suitable for ML-based analysis. | - Pre-processed data may cause incorrect estimates of the ML model’s performance if data preparation is applied before splitting the data for model evaluation.
| |
| - Labels make data useful in supervised ML setups.
| - Labeled data fail when enough items are deleted or omitted, a sufficient number of spurious labelled data is included into the data set, or enough labels are flipped.
| |
| - Validation data provides an unbiased evaluation of a model’s fitness on the training data set, while tuning the hyper-parameters.
| - Validation data may fail when labelled data items are manipulated.
| |
| - Augmented data helps to solve the problem of data deficiency by increasing the amount of data available in the training data set.
| - Augmented data sets may fail due to inconsistency with the training set they are derived from.
| |
| - The rationale for HTCs is that even if an ML model keeps showing good accuracy, its performance on specific inputs may become unacceptable. | - HTCs fail when the ML model’s accuracy metrics computed on them does not correspond to the business goals of the application.
| |
| - Inferences serve to produce actionable outputs when live data run into ML models. | - Inferences may fail by showing high entropy, i.e., conveying little information useful for the ML task at hand. |
ML model assets and possible answers to FMEA asset and failure modes questions.
| ML Model Asset | Answers to FMEA Asset Questions | Answers to FMEA Failure Modes Questions |
|---|---|---|
| - Data pre-processing has the purpose to convert incomplete or defective raw data into improved training data to provide improved AI-ML model’s performance.
| - Data pre-processing algorithms fail when errors in the definition of the data conversion generate flawed or defective training data.
| |
| - HPs are used for model-space search and identification of the best AI-ML model.
| - HPs may fail to deliver the size and configuration for the model that makes its training and operation feasible.
| |
| - Learning algorithms should estimate a target function that best maps input variables to an output variable.
| - An improper choice of the learning algorithm may adversely affect the desired accuracy.
| |
| - Parameters determine the inferences (i.e., the actual outputs) computed by the AI-ML model.
| - Parameters fail when the AI-ML model’s output computed according to them is such that the model’s performance in executing the task (classification, prediction, anomaly detection) is poor.
| |
| - Trained models perform a ML task (like regression, prediction or anomaly detection) on test input data. | - Trained models fail when the discrepancy between the training data seen during the learning process and inference is so high to cause a drop in performance in production. | |
| - Deployed models perform a task (like regression, prediction or anomaly detection) computing inferences in production. | - Deployed models fail when the AI-ML task is not performed at the desired performance (e.g., accuracy) level.
|
ML artefact assets and possible answers to FMEA asset and failure modes questions.
| ML Artefact Asset | Answers to FMEA Asset Questions | Answers to FMEA Failure Modes Questions |
|---|---|---|
| The | - The model architecture provides guidance toward the software/hardware implementation of the AI pipeline.
| - The architecture may be too vague or incomplete to guide pipeline implementation.
|
| The ML model’s | - The hardware design includes all design and optimization choices for building the hardware implementation of a ML model.
| - The hardware design choices may support physical side-channels disclosing the model parameters as well as the details of the training or inference algorithm. |
| - Data and metadata schemata support the interpretability and interoperability of data among ML models as well as back-to-back, seamless connection between the stages of a AI pipeline.
| - Metadata may not deliver human interpretability or interoperability of ML models result.
| |
| - A learned index can learn the sort order or structure of lookup data points and use this information to predict the position or existence of records.
| - Learned indexes can disclose information to the data distribution.
|
A 5-step TM process.
| Step | Description | |
|---|---|---|
| 1 | Objectives Identification |
|
| 2 | Survey |
|
| 3 | Decomposition |
|
| 4 | Threat Identification |
|
| 5 | Vulnerabilities Identifications |
|
STRIDE threats in a nutshell.
| Threat | Description |
|---|---|
| Spoofing Identity |
|
| Tampering with Data |
|
| Repudiation |
|
| Information Disclosure |
|
| Denial of Service (DoS) |
|
| Elevation of Privilege (EoP) |
|
Threats vs. properties in STRIDE-AI.
| Property | Threat |
|---|---|
| Authenticity | Spoofing |
| Integrity | Tampering |
| Non-repudiability | Repudiation |
| Confidentiality | Information Disclosure |
| Availability | Denial-of-Service (DoS) |
| Authorization | Elevation-of-Privilege (EoP) |
ML-specific hexagon.
| Property | ML-Specific Definition |
|---|---|
| Authenticity |
|
| Integrity |
|
| Non-repudiation |
|
| Confidentiality |
|
| Availability |
|
| Authorization |
|
Mapping data assets’ failure modes to hexagon.
| Data Asset | Properties | Threats | Known Attacks |
|---|---|---|---|
| Requirements | Availability | DoS |
|
| Raw Data | Authenticity, Confidentiality, Availability, Authorization | Spoofing, Disclosure, DoS, EoP |
|
| Pre-processed data | Integrity, Availability | Tampering, DoS |
|
| Labeled Data | Authenticity, Integrity | Spoofing, Tampering |
|
| Augmented Data | Integrity, Availability | Tampering, DoS |
|
| Validation Data | Integrity, Availability | Tampering, DoS |
|
| Held-Out Test Cases | Integrity, Confidentiality, Availability | Tampering, Disclosure, DoS | |
| Inferences | Authenticity, Integrity, Availability, Authorization | Spoofing, Tampering, DoS, EoP |
|
Mapping model assets’ failure modes to hexagon.
| Model Asset | Properties | Threats | Known Attacks |
|---|---|---|---|
| Data pre-processing algorithms | Integrity, Availability | Tampering, DoS |
|
| Hyper-parameters | Confidentiality, Availability, Authorization | Disclosure, DoS, EoP | |
| Learning algorithms | Integrity, Confidentiality, Availability | Tampering, Disclosure, DoS |
|
| Model parameters | Confidentiality, Availability, Authorization | Disclosure, DoS, EoP |
|
| Trained models | Integrity, Availability | Tampering, DoS |
|
| Deployed models | Authenticity, Integrity, Non-repudiation, Confidentiality, Availability, Authorization | Spoofing, Tampering, Repudiation, Disclosure, DoS, EoP |
|
Mapping artefact assets’ failure modes to hexagon.
| Artefact Asset | Properties | Threats | Known Attacks |
|---|---|---|---|
| Model architecture | Authenticity, Confidentiality | Spoofing, Disclosure |
|
| Model hardware design | Confidentiality | Disclosure |
|
| Data and metadata schemata | Confidentiality | Disclosure |
|
| Learned data indexes | Integrity | Tampering |
|
Figure 3The LIGHT-TOREADOR predictive maintenance architecture.
Figure 4The identified data and model assets.
Complete mapping for the assets identified in the use case.
| Use Case Asset | Properties | Threats |
|---|---|---|
| Authenticity, Integrity | Spoofing, Tampering | |
| Authenticity, Integrity | Spoofing, Tampering | |
| Authenticity, Integrity, Authorization | Spoofing, Tampering, EoP | |
| Authenticity, Integrity | Spoofing, Tampering | |
| Authenticity, Integrity, Non-repudiability, Authorization | Spoofing, Tampering, Repudiation, EoP | |
| Authenticity, Integrity, Non-repudiability | Spoofing, Tampering, Repudiation | |
| Authenticity, Integrity, Non-repudiability, Availability | Spoofing, Tampering, Repudiation, DoS | |
| Authenticity, Integrity, Availability, Authorization | Spoofing, Tampering, Dos, EoP |
Figure 5The Spoofing threat tree for the Training Data Stream asset [3].
Figure 6The Tampering threat tree for the Training Data Stream asset [3].
Figure 7The Tampering threat tree for the Trained Model asset.
DREAD rating for the Spoofing and Tampering threats.
| Threats | D | R | E | A | D | Average [Rating] |
|---|---|---|---|---|---|---|
| DATA: Spoofing | 5 | 2 | 4 | 2 | 7 | 4.0 [Medium Risk] |
| DATA: Tampering | 7 | 2 | 6 | 2 | 5 | 4.4 [Medium Risk] |
| MODEL: Tampering | 6 | 4 | 5 | 5 | 6 | 5.2 [Medium Risk] |