| Literature DB >> 35426943 |
Yilu Fang1, Betina Idnay2,3, Yingcheng Sun1, Hao Liu1, Zhehuan Chen1, Karen Marder3, Hua Xu4, Rebecca Schnall2,5, Chunhua Weng1.
Abstract
OBJECTIVE: To combine machine efficiency and human intelligence for converting complex clinical trial eligibility criteria text into cohort queries.Entities:
Keywords: cohort identification; eligibility prescreening; human–computer collaboration; informatics
Mesh:
Year: 2022 PMID: 35426943 PMCID: PMC9196697 DOI: 10.1093/jamia/ocac051
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 7.942
Figure 1.The pipeline of C2Q 2.0.
Figure 2.(A) An example of using the editable user interface to generate cohort results after modification. (B) Examples for negation scope detection, temporal and value normalization.
Test set for each evaluation task
| Evaluation task | # of trials | # of entities |
|---|---|---|
| Negation scope detection | 308 | 1223 |
| Value normalization | 1010 | 1603 |
| Temporal normalization | 1010 | 1001 |
Performance of negation scope detection, value normalization, and temporal normalization modules in C2Q 1.0 and C2Q 2.0 with 95% confidence intervals using COVID-19 trials
| Evaluation task | Metric | C2Q 1.0 | C2Q 2.0 |
|---|---|---|---|
| Negation scope detection | Accuracy | 0.776 [0.751, 0.798] | 0.924 [0.907, 0.937] |
| Precision | 0.792 [0.758, 0.823] | 0.963 [0.945, 0.977] | |
| Recall | 0.759 [0.724, 0.791] | 0.884 [0.857, 0.908] | |
| F1-score | 0.775 [0.748, 0.800] | 0.922 [0.905, 0.937] | |
| Value normalization | Accuracy | 0.601 [0.576, 0.624] | 0.966 [0.955, 0.973] |
| Temporal normalization | Accuracy | 0.554 [0.522, 0.584] | 0.916 [0.896, 0.931] |
Evaluators’ clinical research backgrounda
| Characteristic | Category | Ten ( | Eight ( |
|---|---|---|---|
| Included evaluators (%) | Excluded evaluators (%) | ||
| Number of years working in clinical research | Less than 1 year | 2 (20) | 2 (25) |
| 1 year to less than 5 years | 5 (50) | 2 (25) | |
| 5 years or over | 3 (30) | 4 (50) | |
| Alzheimer’s disease clinical research experience | No experience | 2 (20) | 4 (50) |
| Less than 1 year | 5 (50) | 0 (0) | |
| 1 year or more | 3 (30) | 4 (50) | |
| Involvement in prescreening potential participants for research | No | 5 (50) | 1 (12.5) |
| Yes | 5 (50) | 7 (87.5) |
There were 18 evaluators in total, 10 of which were included.
Figure 3.Comparison of the criteria parsing result before and after modifications made by one of the evaluators for the clinical trial NCT04249869. The evaluator is with prescreening involvement and has 1 to 2 years of experience in clinical research and in AD research. The modified medical concepts are highlighted with star icons. The automatically extracted concepts removed by the evaluator are enclosed by dotted line boxes. AD: Alzheimer’s disease.
Figure 4.Average frequency of using each function for criteria parsing result’s modification.
Health-ITUES and feature-specific usability scores
| Mean (SD) | |
|---|---|
| Health-ITUES | |
| Perceived usefulness | 3.99 (0.66) |
| Perceived ease of use | 3.80 (1.06) |
| User control | 3.73 (0.89) |
| Overall score | 3.84 (0.71) |
| Feature-specific | |
| Pleasant to use | 3.90 (0.57) |
| User satisfaction: automatically generated criteria parsing result | 4.00 (0.67) |
| User satisfaction: modified criteria parsing result | 4.10 (0.74) |
| User satisfaction: concept searching | 3.60 (0.97) |
| User satisfaction: annotation dialog | 4.20 (0.63) |
| Easy to learn: add a concept | 4.50 (0.53) |
| Easy to learn: update a concept | 4.70 (0.49) |
| Easy to learn: delete a concept | 4.60 (0.52) |
| Easy to learn: delete all concepts in an eligibility criterion | 4.60 (0.52) |
| Easy to learn: select eligibility criteria | 4.30 (0.95) |
| Availability of all user engagement features | 4.30 (0.95) |
Figure 5.Score comparison of feature-specific usability scores across groups.