| Literature DB >> 31517065 |
M Olsen1, Z Zhelev2, H Hunt2, J L Peters2, P Bossuyt1, C Hyde2.
Abstract
BACKGROUND: A variety of study designs are available to evaluate the accuracy of tests, but the terms used to describe these designs seem to lack clarity and standardization. We investigated if this was the case in the diagnostic guidance of the National Institute of Care and Health Excellence (NICE), an influential source of advice on the value of tests.Entities:
Keywords: Diagnostic test accuracy study design; Labels; NICE diagnostic guidance; Terminology
Year: 2019 PMID: 31517065 PMCID: PMC6727490 DOI: 10.1186/s41512-019-0062-9
Source DB: PubMed Journal: Diagn Progn Res ISSN: 2397-7523
Fig. 1Flow diagram of the diagnostic guidance
Labels grouped by DTA-specificity
| Label | Interpretation | Similar/synonym labels |
|---|---|---|
| 1. DTA-specific labels ( | ||
| 1.1 DTA-informative ( | ||
| Comparative performance | Comparison of minimum two index test/strategies [ | Comparative (diagnostic) accuracy studies |
| | Comparison of two index test/strategies within the same group/population (i.e. with the same set of eligibility criteria). | Head to head comparisons; within-study comparisons |
| (Prospective cohort) randomized to either the index test or comparator2 | Randomization to either the index test or reference standard/comparator. | Randomized head-to-head comparisons |
| (Prospective cohort) receiving the comparator and at least one index test with follow up3 | All participants receive both an index test and the reference standard/comparator and are followed up for a final outcome measure/status. | |
| Reference standard positive studies | Only participants with a positive reference standard test receive the index test. | |
| | Including one (single set of eligibility criteria) or two/multiple groups (separate sets of eligibility criteria) for comparison [ | Two-study cohorts |
| 1.2 DTA-descriptors ( | ||
| Single/ | Participants are sampled from a single or from multiple centres. | Large, multi-centre prospective (UK) study; two-centre |
| Primary care study | Participants are sampled from settings in the primary (vs. secondary or tertiary) care. | |
| Consecutive recruitment | Participants are sampled consecutively (vs. random or convenient sampling) [ | |
| Symptomatic study | Including only symptomatic participants. | |
| ( | Including only high risk participants. | |
| Population (-based) studies | Including/sampling participants that represent the target/study population [ | (Multi-centre) community-based study |
| | Data collection (or recruitment) was planned before (prospective study) the index test and reference standard were performed or after (retrospective study) (STARD) [ | Prospective cohort studies; prospective, consecutive cohort study; prospective, international multicentre |
| Retrospective analysis (of prospective database) | Post-hoc or non-pre-specified statistical analysis. | |
| Discordant case analysis4 | A study that excludes discordant cases in the analysis. | |
| 1.3 General DTA labels ( | ||
| ( | A diagnostic cohort study in which “individuals are enrolled before the final outcome (presence or absence of the target condition) is known” [ | (Retrospective) cohort studies |
| (Diagnostic) case-control studies/design | “(…) studies in which the disease status is already known before the index test is performed”. “(…) the reference test is applied only to a subsample of the participants with or without the target condition”. [ | |
| (Diagnostic) | Comparison of the result of the index test with that of the reference standard in the same participant at the same time. [ | |
| (Diagnostic) | A DTA-study that only make use of observational data e.g. re-interpretation/review of existing tests results. | |
| 2. Umbrella terms ( | ||
| Diagnostic accuracy studies | A study reporting diagnostic accuracy performance measures. | (Large prospective cohort) studies collecting diagnostic accuracy data |
| | A diagnostic accuracy study that directly links test results to patient outcomes (such as RCTs) [ | |
| | A study that includes follow-up of any given outcome measure/status. | |
| Pilot studies | A study that is testing/piloting a setting/feasibility, often with a small sample size. | |
| 3. Non-DTA-specific labels ( | ||
| Test-treat trials | A study in which a test guides the treatment strategy. | |
| Controlled clinical trials | A clinical trial including a control (comparator) group to the intervention group. | |
| Cluster-randomized controlled trial | A study that randomizes clusters of participants to either intervention or control group. | |
| Derivation study | A study that focuses on the development of a prediction model (contra validation of a model). | |
| Observational studies | A study that only includes observational data (non-intervention data). | |
| (Observational) cohort studies | A study using data from all cohort-members (in contrast to case-control study that uses data from an outcome-selected subset of the source population) [ | |
| Prospective, two cohorts (feasibility, validation) | A study with two separate cohorts recruited prospectively. | |
| | A study with randomization of participants to either intervention group(s) or a control (comparator) group (vs. non-randomized). | (Large multi-centre) RCT |
| Validation study | A study that focuses on validation (of an existing/developed model). | |
| 4. Unclear labels ( | ||
| Diagnostic studies with a control group | ||
| (Multi-centre) tracker study | ||
| Mixed design (of within-study comparisons) | ||
The table shows the classification of the labels identified and their most likely interpretations. The labels in italic are labels (also) used in the guidance
1Elsewhere, other possible interpretation could be comparison of index test and reference test within the same patient (i.e. fully paired) [11]
2In DTA terminology, the term “comparator” usually refers to one of two index tests (in a comparative study). However, in this case, the “comparator” seems to refer to other tests that are used for the comparison of the index tests (which is used when a commonly accepted/implemented reference standard is absent) [12]. Other possible interpretation could be randomization to one of several index tests or randomization to two different test-sequence strategies
3In other contexts of DTA-studies, “follow-up” can imply the clinical follow-up compose the reference standard itself (i.e. the final outcome, diagnosis, is established by follow-up). This is also referred to as a “delayed type cross-sectional study” [13]. Yet, it can also mean that any given reference standard is performed at a later time point (delayed) than the index test
4This could also mean studies that identify discordant cases
Single- and two-gate interpretations
| Reference | Text | Variations in interpretation |
|---|---|---|
| Rutjes et al. [ | “(…). All patients pass through a single-gate: a single set of criteria for study admission, typically defined by the clinical presentation.” “(…) We refer to this as a “two-gate design using healthy controls”. Two different sets of inclusion criteria (gates) are used: one for the diseased and another for the non-diseased participants.” | A single- and two-gate is defined by the number of eligibility criteria (sets). |
| DAR8 | “The majority of included studies comply with a single-gate design, a single sample of individuals with unknown metastatic status was assessed by both the diagnostic test under scrutiny and the reference standard.” | A single-gate study is defined by a single sample with (unknown) disease status and all participants receiving both reference standard and index test. |
| DAR20 | “Single-gate: A study design in which only patients with the target condition are recruited.” “(…), single-gate studies, that is, studies in which only patients with the target condition (suspected sepsis) were recruited.” | A single-gate study is defined by inclusion of only patients by (known) disease status. Later, participants with the target condition are described as suspected individuals. |
| DAR27 | “Single-gate study: Where a single sample of individuals is assessed by both the index test and reference standard.” “Two-gate study: Studies which employ separate sampling schemes for diseased and non-diseased participants, with both groups being assessed by the index test.” | A single-gate is defined by both (a single set of) eligibility criteria and study flow (receiving both tests). Two-gate study is defined by (two sets of) eligibility criteria, (known) disease status and study flow both groups receiving the index test. |
| DG33 | “Single-gate study: Study design where participants’ disease status is unknown and the index test result is evaluated against the reference standard to confirm the diagnosis” | A single-gate is defined by inclusion of participants by only (unknown) disease status and receiving both tests. |
The table shows variations in the interpretations of the single- and two-gate labels from included DG. The first row shows the citations from the original definition made by Rutjes et al. [9]
Fig. 2Frequency of labels used in DARs and guidance