| Literature DB >> 34102034 |
Flora T Musuamba1,2,3, Ine Skottheim Rusten1,4, Raphaëlle Lesage5,6, Giulia Russo7, Roberta Bursi8, Luca Emili8, Gaby Wangorsch1,9, Efthymios Manolis1,10, Kristin E Karlsson1,11, Alexander Kulesza12, Eulalie Courcelles12, Jean-Pierre Boissel12, Cécile F Rousseau13, Emmanuelle M Voisin13, Rossana Alessandrello14, Nuno Curado15, Enrico Dall'ara16, Blanca Rodriguez17, Francesco Pappalardo7, Liesbet Geris5,6,18.
Abstract
The value of in silico methods in drug development and evaluation has been demonstrated repeatedly and convincingly. While their benefits are now unanimously recognized, international standards for their evaluation, accepted by all stakeholders involved, are still to be established. In this white paper, we propose a risk-informed evaluation framework for mechanistic model credibility evaluation. To properly frame the proposed verification and validation activities, concepts such as context of use, regulatory impact and risk-based analysis are discussed. To ensure common understanding between all stakeholders, an overview is provided of relevant in silico terminology used throughout this paper. To illustrate the feasibility of the proposed approach, we have applied it to three real case examples in the context of drug development, using a credibility matrix currently being tested as a quick-start tool by regulators. Altogether, this white paper provides a practical approach to model evaluation, applicable in both scientific and regulatory evaluation contexts.Entities:
Mesh:
Year: 2021 PMID: 34102034 PMCID: PMC8376137 DOI: 10.1002/psp4.12669
Source DB: PubMed Journal: CPT Pharmacometrics Syst Pharmacol ISSN: 2163-8306
Definition of key terms related to computer modelling and simulation in drug development
| Biological system | Complex ensemble of entities of a population or an individual that are interdependent and function as a whole. The entities and the limits of the defined system itself may be of various scale of organization (population, individual, physiological system, organ, cellular, molecular, etc.). |
|
| Abstract and simplified representation of a biological system, composed of a set of rules or algorithms describing the system’s behaviour, implemented and studied computationally. According to the mathematical nature of the rules, the behaviour of the system can be studied over time and/or space and quantitatively or qualitatively. The term in silico refers to the computational nature of the model and discerns it from its in vitro and in vivo counterparts. |
|
| Class of trials for pharmacological therapies |
| Data driven models ( black‐box models, phenomenological models | Models developed from observations or data with the aim of reconstituting a set of rules explaining those data. These models can be developed using statistical, mathematical and/or computational methods including bioinformatics, machine learning and artificial intelligence. This type of models is built to match the observation content of the data but the resulting rules do not necessarily correspond to real, physical or tangible mechanisms, which makes it more difficult to interpret, hence the term of black‐box model. |
| Mechanistic models (white box models, hypothesis‐driven models) | Set of theoretical rules and algorithms based on known mechanisms expected to reconstitute observed behaviours. Consequently, the rules describe known or hypothesized mechanisms in a lower scale of organization and the model read‐out often regards an emerging behaviour at a higher scale of organization. This type of model is essentially hypothesis‐driven and allows to test the validity of the underlying mechanisms, and to explain an observation, hence the term of white box model. |
| Agent‐based models | Agent‐based models (ABM) are an effective approach for modelling discrete, autonomous agents such as cells or bacteria. |
| Artificial Intelligence (AI) | In the field of |
| Model‐Informed Drug Discovery and Development | Quantitative framework for prediction and extrapolation, centred on knowledge and inference generated from integrated models of compound, mechanism and disease level data and aimed at improving the quality, efficiency and cost effectiveness of decision making. |
| Pharmacometrics | Pharmacometrics is the branch of science concerned with mathematical models of biology, pharmacology, disease, and physiology used to describe and quantify interactions between xeno biotics and patients, including beneficial effects and side‐effects resultant from such interfaces. |
| Population PK (PopPK) and Pharmacokinetic / Pharmacodynamic (PK/PD) models | PopPK is the study of pharmacokinetics (i.e., time course of concentration at a certain dosing regimen) at the population level, in which data from all individuals in a population are evaluated simultaneously using a nonlinear mixed‐effects model. |
| Physiologically Based Pharmacokinetic (PBPK) models | PBPK models estimate the PK profile or exposure in “a target tissue or organ after a drug dose by taking into account the rate of absorption into the body, distribution among target organs and tissues, metabolism, and excretion”. |
| Systems medicine models | This type of |
| Quantitative Systems Pharmacology (QSP) | QSP is broadly defined as an approach to translational medicine that combines computational and experimental methods to elucidate, validate and apply new pharmacological concepts to the development and use of small molecule and biologic drugs. QSP will provide an integrated “systems level” approach to determining mechanisms of action of new and existing drugs in preclinical and animal models and in patients. |
| Model uncertainty | A certain amount of contingencies and inaccuracies may arise from the model predictions/simulations and resulting decisions. These uncertainties may be due to the model structure (assumptions), parameters and/or the inputs. |
| Model uncertainty quantification | Characterization of the model uncertainty with quantitative metrics. It assesses how much the outcome of the model is impacted when some part of the system or some inputs are changed or not precisely known. By systematically identifying the sources of uncertainty, characterizing their probability distribution and analysing their impact on the model's outputs of interest, the evaluation process ensures that the uncertainty's impacts on the model predictions are understood and controlled. |
| Historical data / Legacy data | Data previously collected in a relevant context but for a different purpose. Historical data, when appropriate for the context of use and of sufficient quality, can be used for validation of new models. |
| Good simulation practice | In analogy to the ICH Good Clinical Practice or the OECD Good Laboratory Practice, GSP could be a quality standard for the designing, implementing and reporting of in silico trials in the context of the development and regulation of medical treatments. When established in concertation with the proper authorities, compliance with the GSP standard could ultimately provide public assurance that the digital evidence generated by in silico technologies is credible. |
FIGURE 1In silico Model Process flowchart
FIGURE 2UISS‐TB predicts the dynamic of the tuberculosis course with a specific vaccine administered, suggesting possible interactions to maximize the chance of success in a personalized fashion
FIGURE 3schematic overview of the Virtual Assay software platform at its main components: the Core Engine (middle), Drug Module (left) and Analysis Module (right). X: ion channel availability; h: hill coefficient; D: doses; IC50: half‐maximal inhibitory concentration; M.P.: Membrane Potential; CTRL: Control (no drug)
FIGURE 4Disease Computational Model structure. Light blue rectangles represent the submodels with the associated number of parameters, variables and reactions. Dark blue rectangles represent the major connector variables shared between submodels. Myocardium submodels are duplicated throughout 10 zones to introduce a spatial discretization of the myocardium
|
| |
|---|---|
|
| Therapeutic vaccines for pulmonary tuberculosis, such as RUTI. |
|
| Physiology based agent‐based model (ABM). |
|
|
What is the dose‐response curve of a specific vaccine for active tuberculosis in a reference population of adults affected by What is the most optimal dose to maximize the efficacy of tuberculosis vaccine? |
|
|
UISS is a physiology‐ and agent‐based model of the human immune system. UISS‐TB includes a disease model component for the infection of pulmonary tuberculosis, the treatment (the therapeutic vaccine to be tested) effect component, and is run over a virtual population, representative of the target population. The aim of the model is the dose selection for confirmatory trials, with a significant reduction of the human experimentation in the phase II dose‐response trial. Data input would include: clinical data from the phase 1 safety assessment trial clinical data from a limited scale exploratory trial: only a single arm ( |
|
|
The UISS‐TB model is informed by a set of NI = 22 inputs, named vector of features (VoFs), formed by quantities that can be measured/observed in an individual MTB patient. All 22 inputs have to be considered with their admissible minimum, maximum, and average values.
Model structure and parameter sources and values should be disclosed and justified for the disease and the drug model, as well as for the virtual population simulator.
GitLab versioning control system will be used. The following will be monitored and results provided: For the Deterministic model Absence of Round‐off errors Absence of Conservation errors. Absence of Discretisation errors. Uniqueness: repeated deterministic runs produce identical results. Smoothness: analyse lag correlation. Non‐chaoticity: Lyapunov’s exponent. Time step convergence analysis For the stochastic model verification. Convergence and consistency analysis.
GitLab QA to run regression testing, including all VV&UQ tests. Long‐term: Compliance with IEC 62304 “Medical Device Software ‐ Software Life Cycle Processes”.
The UISS model needs to be able to model to simulate and to adequately predict the key features of patients experimentally recruited in the Phase 2 study. The UISS model needs to be able to predict the distribution of immunogenicity biomarkers at the other three follow‐up time points and compare these to those observed experimentally. |
|
| Medium: modelling results are additional evidence to be complemented by data from clinical trials. |
|
| In the case of UISS‐TB‐IG, an underestimation of the optimal dose might affect the efficacy of the treatment, and an overestimation might induce adverse effects. If we assume that the final decision is the marketing authorisation of the new therapeutic vaccine, the influence of the model is low for both the final efficacy component and the safety component that will rather be informed by the results of the confirmatory Phase 3 trial. For a lower‐than‐optimal prediction, we could have an increased risk of recurrence. For the higher‐than‐optimal prediction, we could have an increase of reported adverse effects. However, typical overdosing adverse effects for TB vaccines ore mild in nature (occasional muscle spasms, pain at the site of injection, etc.). Thus, also the consequence of a model error can be considered mild. |
|
| The credibility factors (as described in Section |
|
| The dose‐response relationship was characterized for efficacy of vaccine against tuberculosis that allowed optimal dose selection for the confirmatory trial. |
|
| |
|---|---|
|
| All new drugs candidates given the regulatory requirement of assessment of |
|
| The Virtual Assay Software: human‐based cardiac electrophysiology modelling and simulation framework. |
|
| Would the drug result in risk of developing Torsades de Pointes in the human population, even in the context of positive hERG assays and multichannel effects? |
|
|
INPUT: OUTPUT: Simulations with the Virtual Assay software categorize drugs as being safe or inducing pro‐arrhythmic cardiotoxicity in human. Decision on the potential cardiotoxicity will be informed by the simulations, combined with experimental data from animal models and potentially stem‐cell derived cardiomyocytes. Mechanistic models can be helpful to rule out a positive non‐clinical signal. |
|
|
The structure of the Virtual Assay software is summarised in Figure
Virtual Assay has been developed in C++. Drug simulations in a modern laptop require approximately 5‐10 minutes for each drug concentration for a population of 100 cell models, and simulations are run in parallel on multiple cores. Verification of numerical scheme and code comparison has been conducted as explained in.
The Virtual Assay software includes documentation and benchmark verification test cases. Details on software verification are provided in.
Sensitivity analysis is incorporated in the population of models, as this consists of using the same baseline model but with key parameters varied randomly, thus generating thousands of virtual cells.
The Virtual Assay software incorporates a friendly interface, simulation software and visualisation of outputs.
The population of models approach incorporated in Virtual Assay tackles uncertainty in electrophysiology model parameters. In the case of uncertainty in input values, simulations with the most extreme cases are run and compared.
The accuracy of drug classification using Virtual Assay was requested to be superior to the classification based on hERG alone and at least 80%. The sensitivity in the prediction of cardiac toxicity of individual drugs needs to be >60% or 70%. |
|
| High regulatory impact: modelling and simulation results constitute the key source of evidence to answer the question of interest, |
|
| High clinical influence given the new Q&A Guidelines: impact on the decision to accept phase 1 to 3 trial designs, and also based on this model, waiver of intensive monitoring of electrocardiogram (ECG) in confirmatory trials. This is also crucial for the evaluation of cardiotoxicity in cancer drugs. Wrong model prediction/simulation could expose patients to risk of lethal arrhythmias, in following clinical trials due to cardiotoxic drugs. |
|
| The credibility factors (as described in Section |
|
| The drug’s pro‐arrhythmic cardiotoxicity was characterized for 62 compounds, based on their Torsade de pointe (TdP) score. Each drug could be categorized as safe or risky based on their TdP score. Subpopulations of patients at higher risk were identified for some of the drugs. |
| Credibility matrix | |
|---|---|
| Investigational product | Modulator of respiratory complex 1: inhibitor of ROS production |
| Type of model | QSP‐type disease model: based on ordinary differential equations (ODE). The model had 625 parameters and 173 ODE. |
| Scientific Question(s) of interest (QOI) | What is the target population to demonstrate the effect of C1 modulation as it would have been evaluated in a classical phase 2 clinical trial? |
| Context of use |
A mechanistic disease model describing myocardial infarction pathophysiology and effects of C1 modulation is used with a Virtual Population to identify markers that characterize responders to C1 modulation in a Phase II setting Data extracted from the scientific literature and preclinical Individual patient data from a subset of a clinical trial dataset were used for a quantitative calibration of the clinical outcomes calculated by the model combined with a Virtual Population. The model and related simulator are proposed to support an upcoming Phase III trial design aimed at confirming the drug clinical benefit. |
|
|
Code verification should include the convergence analysis of all dynamics concerning space discretization of the left ventricle. As all patients will use the same space discretization, the model needs to present qualitatively the same results by predictive visual check for two discretization schemes so that it is safe to assume that inter‐patient variability is unchanged. Calculation verification was carried out by using the simulation outputs obtained with the lowest possible solver tolerances as reference solution. The error between simulated outputs and reference solutions needs to be lower than a given threshold (1%). Further quantitative acceptability criteria on the software side are model item transparency, documentation completeness and unit checking. Model acceptability is mainly assessed on the 4 outcomes with a quantitative validation based on independent individual data extracted from a previous clinical trial dataset (placebo arm, 26 patients). In our COU, the most important capability of the model is a correct prediction of a class/individual outcome based on its descriptors (and needs to be validated as such). For trial design purposes we should thus compare virtual patient classes with real patient classes and individual patient (ranks) with individual patient (ranks), respectively. According to these two requirements two precision levels have been checked for classifying ranking and patients by the model A response classifier model should have the capacity to identify patients with a severe outcome according to Receiver Operating Characteristic (ROC), Area Under the Curve (AUC) above 0.7 for a number of classification scenarios (similar as for any predictive biomarker). A response ranking model should have a significant capacity to rank individual patient's outcome severity. This capability should be tested by a suitable statistical procedure, Qualitative acceptability criteria need to be checked for validating explorative capabilities of the disease model not covered by the quantitative input (patient descriptors) output (creatine phosphokinase (CPK), troponin I (TnI), Infarct Size (IS) and left ventricular ejection fraction (LVEF)) validation. A set of credibility factors are defined including Model form is deemed acceptable if the conceptual form is validated by a biologist, a clinician or logical modelling; if model granularity allows the answer to the QOI; if a transparency checking is allowed in the model structure. Model inputs are deemed acceptable if used assumptions are listed and their impact on model prediction explored and if a sensitivity analysis has been performed Model is deemed relevant to the context of used if the simulation protocol is delivered prior the experiment; M&S output(s) is/are biomarker(s), a surrogate or a clinical outcome; validity domain is relevant to the COU |
| Regulatory impact | Medium: modelling results are additional evidence to be complemented by data from clinical trial. |
| Risk based analysis of decision consequence | The treatment being indicated as a complement of the first line (percutaneous coronary intervention), suboptimal patient selection will not result in harm to patients. However, it may lead to a suboptimal design for the phase 3 and a suboptimal indication for market authorisation, leading to off‐label use of the drug. |
| Credibility activities results |
All credibility factors were evaluated: Model form evaluation KM validation: Acceptable (Validation by review) Relevance of Computational Model granularity: Good (Model granularity is adapted to the QOI(s)) Transparency checking: Good (Comprehensive checking) Model reuse: Good (The model or a part of the submodels has been reused from a different COU) Model inputs evaluation Uncertainty management: Poor (No uncertainty management performed yet) Sensitivity analysis: Poor (No sensitivity analysis performed yet) Relevance to the Context of Use Simulation design: Acceptable (Simulation protocol delivered prior the experiment) Relevance to clinical outcome score: Good (M&S output(s) is/are clinical outcome) Relevance to the COU: Good (Relevance of M&S output(s) of interest and validity domain to COU) The model of ischemia reperfusion was quantitatively validated on 4 outcomes. Evaluation metrics for the primary outcome (Infarct size) were the following: Spearman rank correlation: 0.51 ROC curve AUC average: 0.77 The Computational Model of myocardial ischemia reperfusion is thus validated for the anticipated use but should be completed with uncertainty analyses. |
| Model informed decision |
Two criteria were identified to characterize optimal responders: Final TIMI flow grade above 3 and Mid or Proximal lesion location. The selection of this sub‐population doubles the clinical benefit (from 5% to 10% of average infarct size reduction). These results support a subgroup analysis with the results of a potential phase 3 clinical trial evaluating C1 modulation. |