| Literature DB >> 32306964 |
Wen Shi1, Tom Kelsey2, Frank Sullivan1.
Abstract
BACKGROUND: Trials often struggle to achieve their target sample size with only half doing so. Some researchers have turned to Electronic Health Records (EHRs), seeking a more efficient way of recruitment. The Scottish Health Research Register (SHARE) obtained patients' consent for their EHRs to be used as a searching base from which researchers can find potential participants. However, due to the fact that EHR data is not complete, sufficient or accurate, a database search strategy may not generate the best case-finding result. The current study aims to evaluate the performance of a case-based reasoning method in identifying participants for population-based clinical studies recruiting through SHARE, and assess the difference between its resultant cohort and the original one deriving from searching EHRs.Entities:
Keywords: Artificial intelligence; Clinical studies; Electronic health record; Machine learning
Mesh:
Year: 2020 PMID: 32306964 PMCID: PMC7169032 DOI: 10.1186/s12911-020-1091-6
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Work flow for recruiting participants to clinical studies through SHARE [11].
Fig. 2Flow diagram for the selection of projects for analysis
The Area Under the ROC Curve for recruitment prediction test for each project. For each project the ROCAUC and 95% confidence intervals are given for the first and second cross validation datasets, followed by the average
| Project Acronym | Fold 1 (CI) | Fold 2 (CI) | Average (CI) |
|---|---|---|---|
| ALPHA | 0.947 (0.897–0.984) | 0.855 (0.777–0.928) | 0.901 (0.837–0.956) |
| ALLAY | 0.928 (0.903–0.955) | 0.854 (0.768–0.941) | 0.891 (0.836–0.948) |
| METFORMIN | 0.977 (0.970–0.989) | 0.980 (0.970–0.983) | 0.979 (0.970–0.986) |
| REFORM | 0.946 (0.896–0.980) | 0.957 (0.920–0.987) | 0.952 (0.908–0.984) |
| IMPOCT | 0.397 (NA) | 0.276 (NA) | 0.337 (NA) |
| TARDIS | 0.897 (0.844–0.946) | 0.887 (0.852–0.919) | 0.892 (0.848–0.933) |
| 4P | 0.952 (0.909–0.993) | 0.744 (0.447–0.978) | 0.848 (0.678–0.986) |
| HF | 0.907 (0.851–0.957) | 0.927 (0.870–0.974) | 0.917 (0.861–0.966) |
| ImmunoStat | 0.632 (0.503–0.763) | 0.605 (0.443–0.756) | 0.619 (0.473–0.760) |
NA: insufficient data to calculate an accurate CI
Fig. 3Scaled sensitivity plus specificity against prediction score cut-offs
Performance of the resulted ranking lists. For each project we give the number of participants appearing in the top 5 and top 10 of a ranking list, the mean average precision (MAP) and the mean reciprocal rank (MRR)
| List | P5 | P10 | MAP | MRR |
|---|---|---|---|---|
| ALPHA | 0 | 0 | 0.003 | 0.006 |
| ALLAY | 0 | 0 | 0.001 | 0.001 |
| METFORMIN | 0 | 0 | 0.003 | 0.002 |
| REFORM | 0 | 0 | 0.010 | 0.035 |
| IMPOCT | 0 | 0 | 0.000 | 0.000 |
| TARDIS | 0 | 0 | 0.002 | 0.005 |
| 4P | 0 | 0 | 0.002 | 0.003 |
| HF | 0 | 0 | 0.006 | 0.028 |
| ImmunoStat | 0 | 0 | 0.001 | 0.007 |
| Average of all ranking lists | 0 | 0 | 0.003 | 0.010 |
| Averaged Lower | 0 | 0 | 0.000 | 0.000 |
| Averaged Upper | 0.789 | 0.611 | 1 | 1 |
Proportion of persons both identified within the top 50 positions of the ranking list for each project
| Project acronym | Fold 1 | Fold 2 | Average |
|---|---|---|---|
| ALPHA | 0.66 | 0.32 | 0.49 |
| ALLAY | 0.54 | 0.54 | 0.54 |
| METFORMIN | 0 | 0 | 0.00 |
| REFORM | 0.32 | 0.16 | 0.24 |
| IMPOCT | 0 | 0 | 0.00 |
| TARDIS | 0.22 | 0 | 0.11 |
| 4P | 0.06 | 0.14 | 0.10 |
| HF | 0.30 | 0.42 | 0.36 |
| ImmunoStat | 0.04 | 0.08 | 0.06 |