| Literature DB >> 34220187 |
Paul B Stone1, Hailey Marie Nelson1, Mary E Fendley1, Subhashini Ganapathy1.
Abstract
The purpose of this study was to develop a method for validation of cognitive models consistent with the remote working situation arising from COVID-19 restrictions in place in Spring 2020. We propose a framework for structuring validation tasks and applying a scoring system to determine initial model validity. We infer an objective validity level for cognitive models requiring no in-person observations, and minimal reliance on remote usability and observational studies. This approach has been derived from the necessity of the COVID-19 response, however, we believe this approach can lower costs and reduce timelines to initial validation in post-Covid-19 studies, enabling faster progress in the development of cognitive engineering systems. A three-stage hybrid validation framework was developed based on existing validation methods and was adapted to enable compliance with the specific limitations derived from COVID-19 response restrictions. This validation method includes elements of argument-based validation combined with a cognitive walkthrough analysis, and reflexivity assessments. We conducted a case study of the proposed framework on a developmental cognitive model of cardiovascular surgery to demonstrate application of a real-world validation task. This framework can be easily and quickly implemented by a small research team and provides a structured validation method to increase confidence in assumptions as well as to provide evidence to support validity claims in the early stages of model development.Entities:
Keywords: COVID‐19; cardiovascular surgery; cognitive modeling; model validation; validation frameworks
Year: 2021 PMID: 34220187 PMCID: PMC8239641 DOI: 10.1002/hfm.20904
Source DB: PubMed Journal: Hum Factors Ergon Manuf ISSN: 1090-8471 Impact factor: 1.722
Hybrid cognitive validation framework
| ID | Sub Task | Qualitative Validation Criteria | Objectivity (Reflexivity) | Overall Score/Evidence |
|---|---|---|---|---|
| (including responsibility) | Individual score(0–3 pts) | Individual Score (0–2 pts) | Task Score (/10) | |
| Task definition | Total task score (6 pts) | Total task score (4 pts) | Framework (/30) | |
| Framework score (/18 pts) | Framework score (/12 pts) | |||
|
| ||||
|
|
| The validation task is clearly stated and in line with the limitations of the framework. | Validation objective clarity and coherence with interpretations assessed by second researcher. | Written confirmation of assessment. |
| (Research Team) | ||||
| Define the validation assessment question and purpose including the purpose and scope of the validation. |
| |||
|
|
| Inferences are reasonable, assumptions are plausible and underpinning descriptions and evidence are clear, coherent, and complete. | Validation objective clarity and coherence with interpretations assessed by second researcher. | Written confirmation of assessment. |
| (Research Team) | ||||
| Define validation scoring interpretations framework: Establish the implications and assumptions of the validations. |
| |||
|
| ||||
|
|
|
| Scenarios confirmed by second researcher. | Detail scenarios used in the validation. |
| (Research Team) | (Subject Matter Expert) |
| Provide Cognitive Task Matrix. | |
| Generate scenarios that are | Scenarios are representative of | Scenarios approved by |
| |
| representative of model use cases to enable cognitive walkthrough. | operational situations (routine, emergency, complications). | external subject matter expert. | ||
|
| ||||
| Classes of patient and procedure types are representative and appropriate (age, underlying condition, gender). | ||||
|
| ||||
| Scenarios clearly represent the complete cognitive tasks identified in the cognitive task matrix | ||||
|
| ||||
|
|
|
| Assessment of model representativeness and cognitive walkthrough confirmed and assessed by second researcher. | Record walkthrough results and detail cognitive states actions and pathways as appropriate. |
| (Subject Matter Expert) | There are significant gaps in the representation cognitive states and pathways for many tasks. |
|
| |
| SME to Conduct cognitive walkthrough. | ||||
| if the model is applicable to specific tasks and infer generalizability to tasks not identified explicitly in the model development. | The model meets the basic requirements of representing the underlying cognitive processes but may have some gaps | |||
|
| ||||
| Cognitive states, actions and pathways were clear, complete, and coherent and representative of all surgical phases in one scenario. | ||||
| Cognitive states, actions and pathways were clear, complete, and coherent and representative in at least two scenarios. | ||||
|
| ||||
|
|
| Subject(s) has/have no experience or expertise in the field. | Assessment of subject qualifications performed by second investigator/or qualifications and experience is known. | Qualifications and experience of those contributing data is recorded. |
| (Research Team) | Subject(s) has/have some training and minimal experience—less than 2 years. | |||
| Were appropriately | ||||
| qualified subjects used in the model development? |
|
|
| |
| Subject(s) is/are experienced practitioner(s) and currently practicing—2–5 years. | ||||
| Subject(s) is/are considered expert(s) and currently practicing—more than 5 years. | ||||
|
| Was the data collection process robust and complete during model development? | Data collection covered the breadth and depth of the model and a representation can be made. | The debrief was conducted by a second researcher, not present in the data collection. | Identify any data collection techniques used along with records of data collection. |
|
|
|
| ||
| Data collection conditions were consistent across all participants. | The debrief was | |||
|
| conducted within 1 week of the data collection. | |||
| Relevant data collection was used without omission or bias. |
| |||
|
| ||||
Figure 1Cognitive model of cardiovascular surgery. Combined RPD‐Decision Ladder Cardiovascular Cognitive Model, RPD, recognition‐primed decision
Case study validation scenario definition
| Scenario phase | Description |
|---|---|
| Procedure type: | Planned, routine procedure—Stent fitting to correct narrowing of coronary artery. |
| Patient definition: | 58‐year‐old male, 6 feet tall, with a body mass index of 25 |
| Patient preparation complications: | none |
| Catheter selection complications: | The Fluoroscopic imaging of the patient reveals a narrower than expected vascular structure for a man this size and the initial expectations on catheter size and shape are violated. |
| Catheter insertion complications: | none |
| Catheter maneuver complications: | The catheter maneuver is unsuccessful due to the narrowed vascular geometry. |
Case study validation scenario walkthrough with result (in bold)
| Walkthrough phase | Description |
|---|---|
| Step 1 | A 58‐year‐old male has been diagnosed with a narrowing of a coronary artery and has been scheduled for a stent fitting to correct the condition. The patient characteristics are within the experience of the cardiovascular surgeon and planning elements are routine. |
|
| |
| Step 2 | The preparation is conducted by the anesthetist and the patient responds as expected. |
|
| |
| Step 3 | This vascular geometry is assessed creating a decision point at D1. The vascular geometry is found to be narrower than expected in line with the scenario definition, resulting in the surgeon using their experience and associated heuristics to reselect an appropriate catheter based on this new information. |
|
| |
| Step 4 | A suitable insertion point is easily found but catheter insertion task is somewhat harder than expected. The surgeon intuitively corrects and quickly achieves a successful insertion without requiring consultation with other team members. |
|
| |
| Step 5 | The maneuver of the catheter to the procedure site is unsuccessful in line with the scenario definition. The surgeon corrects position and tries again but is still unsuccessful. |
|
|
Cognitive task matrix for generic cardiovascular surgery scenariosa
| Sub‐tasks | Intuitive example | Mixed example | Analytical example |
|---|---|---|---|
| Procedure planning | The combination of patient characteristics and procedure type and complexity are familiar. | The patient and procedure type are largely familiar, but some anomalies discovered but within situations governed by analytical heuristics. | The patient characteristics and/or procedure type are unfamiliar and additional cognitive resource are assigned to develop an analytical solution. |
| Patient preparation | Patient responds to sedative and initial preparation as expected. | Some anomalous response but within experiential reference and responds as expected. | Patient response is not expected, and initial remedial measures are unsuccessful. |
| Catheter selection | Patient size, condition and vascular geometry are familiar and correspond to experience of successful catheterization | Some patient characteristics are unfamiliar but generally within expectations and can be extrapolated from experience. | Complex vascular geometry, narrowing or blockage requiring reassessment of entry or catheter choice (size shape etc.) |
| Catheter insertion | Vascular location and orientation are predictable and catheter insertion at chosen site is successful. | Some difficulty in locating appropriate insertion point but alternatives discovered and implemented. | Problems inserting catheter due to depth of vascular network. Potential bleeding occurs or site reassessment required. |
| Catheter maneuver | Catheter behaves as expected and catheterization task can be completed | Some difficulty in maneuvering the catheter to the desired site but behavior is predictable and corrected easily | Catheter does not behave as expected and anomalies cannot be understood or corrected with existing heuristics. |
Cardiovascular surgery subtasks and associated hierarchical cognitive decision examples.
Validation score interpretation framework
| Validation status | Interpretation | Criteria |
|---|---|---|
| High overall validity | The model can be said to have a high validity for an early developmental model and is suitable for implementation in initial research studies only. Model validation should continue as part of an ongoing, iterative design process. | Total VF |
| Interpretation Score ≥7 | ||
| The model has high validity scores across all validation tasks. There is good evidence that the underlying assumptions are valid, data collection techniques are sound and researcher bias has been addressed through reflexivity assessment. The model has been demonstrated to be representative. The inferences of the Interpretation framework are reasonable and the assumptions plausible and the definitions of are clear, coherent, and complete. | Representation Score ≥7 | |
| Reflexivity Score ≥7 | ||
| Min Success Criteria Score ≥2 | ||
| Min objectivity Criteria Score ≥1 | ||
| Good validity | There is evidence that this model has good validity for an early developmental model and clear, complete, and coherent interpretations were defined. This model may be useful for implementation in initial research studies, however reflexivity scores were low so there is a remaining caveat on the potential for researcher bias. Further external confirmation of the model is required to use with confidence. | Total VF Score ≥19 (max = 30) |
| Poor reflexivity | Interpretation Score ≥6 (60%) | |
| Well defined interpretation | Representation Score ≥6 (60%) | |
| Reflexivity Score ≤6 (60%) | ||
| Objectivity score ≤6 (60%) | ||
| Good validity | There is evidence that this model has good validity for an early developmental model and reflexivity assessments have been complete. This model may be useful for implementation in initial research studies, however interpretation frameworks were not provided so findings are should be treated as somewhat speculative until an interpretation framework is defined. | Total VF Score ≥19 (max = 30) |
| Good reflexivity | Interpretation Score ≤6 (60%) | |
| Poorly defined interpretation | Representation Score ≥6 (60%) | |
| Reflexivity Score ≥6 (60%) | ||
| Poor validity | Underlying data and assumptions used in the development of this model were somewhat unclear, incomplete, or incoherent. Researcher bias has been assessed and interpretations are clear, however caution should be used when implementing this model outside research team development activities. | Total VF Score ≥19 (max = 30) |
| Good reflexivity | ||
| Well defined interpretation | Interpretation Score ≥6 (60%) | |
| Representation Score ≤6 (60%) | ||
| Reflexivity Score ≥6 (60%) | ||
| Poor overall validity | There are significant problems with two or more of the validation criteria. Further validation effort is required before the model can be used, even in initial research investigations. | Total VF Score ≤19 (max = 30) |
Validation Framework (VF) score defined on page 11 of the main text.
Hybrid cognitive validation framework score summary—primary researchera
| Success criteria score | Objectivity score | Total score | |
|---|---|---|---|
| Task 1—Argument‐based analysis | 6/6 | 0/4 | 6/10 |
| Task 2—Walkthrough analysis | 5/6 | 0/4 | 5/10 |
| Task 3—Reflexivity analysis | 6/6 | 0/4 | 6/10 |
| Total (vertical sum) | 17/18 | 0/12 | 17/30 |
The preliminary validation framework scores from all three validation tasks and associated sub tasks are collated and summed to use in conjunction with the Interpretation Framework, Table 5.
Hybrid cognitive validation framework score summary—including objectivity ratinga
| Success criteria score | Objectivity score | Total score | |
|---|---|---|---|
| Task 1—Argument‐based analysis | 6/6 | 4/4 | 10/10 |
| Task 2—Walkthrough analysis | 4/6 | 4/4 | 8/10 |
| Task 3—Reflexivity analysis | 5/6 | 3/4 | 8/10 |
| Total (vertical sum) | 15/18 | 11/12 | 26/30 |
The complete validation framework scores from all three validation tasks and associated sub tasks are collated and summed to use in conjunction with the Interpretation Framework, Table 5.