Cong Liu1, Chi Yuan1, Alex M Butler1,2, Richard D Carvajal2, Ziran Ryan Li1, Casey N Ta1, Chunhua Weng1.
Abstract
OBJECTIVE: Information overload remains a challenge for patients seeking clinical trials. We present a novel system (DQueST) that reduces information overload for trial seekers using dynamic questionnaires.
Keywords: clinical trial search; common data model; eligibility criteria; interactive search; natural language processing
Year: 2019 PMID: 31390010 PMCID: PMC6798577 DOI: 10.1093/jamia/ocz121
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1. The pipeline architecture of the DQueST system. (A) Module 1 works offline to retrieve information from the trial repository and curate the eligibility criteria library; (B) Module 2 interacts with users and generates questions dynamically.
Table schema of the criteria library generated after extraction and mapping
| Field Name | Example | Description |
|---|---|---|
| | 12345678 | Unique ID for the record |
| | NCT01152203 | NCT ID associated with the record |
| | Cytotoxic chemotherapy | Clinical term extracted from the original text |
| | 4141762 | Standard OMOP concept ID mapped to the clinical entity |
| | Oral chemotherapy | Standard OMOP concept name |
| | 0.87 | Similarity score between the mapped OMOP concept and the clinical entity |
| | 4273629 | OMOP concept ID after clustering |
| | Chemotherapy | OMOP concept term after clustering |
| | INCLUSION | Whether this is an inclusion or exclusion criterion |
| | Procedure | The domain of the OMOP concept |
| | N/A | Standardized minimum of numeric expression associated with this concept |
| | N/A | Standardized maximum of numeric expression associated with this concept |
| | N/A | Unit of numeric expression associated with this concept |
| | − | Standardized minimum of temporal constraint associated with this concept |
| | −3 | Standardized maximum of temporal constraint associated with this concept |
| | weeks | Unit of temporal constraint associated with this concept |
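As a rough illustration of the schema above, one record of the criteria library could be modeled as a small Python dataclass. The field names did not survive in the table, so every attribute name below is a hypothetical stand-in chosen from the descriptions; only the example values come from the table. Reading the "−" temporal minimum as "unbounded" is also an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CriterionRecord:
    """One row of the criteria library (all attribute names are hypothetical)."""
    record_id: int                 # unique ID for the record
    nct_id: str                    # NCT ID associated with the record
    entity_text: str               # clinical term extracted from the original text
    concept_id: int                # standard OMOP concept ID
    concept_name: str              # standard OMOP concept name
    mapping_score: float           # similarity between entity and mapped concept
    cluster_concept_id: int        # OMOP concept ID after clustering
    cluster_concept_name: str      # OMOP concept term after clustering
    is_inclusion: bool             # True for INCLUSION, False for EXCLUSION
    domain: str                    # domain of the OMOP concept
    numeric_min: Optional[float] = None
    numeric_max: Optional[float] = None
    numeric_unit: Optional[str] = None
    temporal_min: Optional[float] = None   # None is read here as "unbounded" (assumption)
    temporal_max: Optional[float] = None
    temporal_unit: Optional[str] = None


# The example column of the table, expressed as one record.
example = CriterionRecord(
    record_id=12345678,
    nct_id="NCT01152203",
    entity_text="Cytotoxic chemotherapy",
    concept_id=4141762,
    concept_name="Oral chemotherapy",
    mapping_score=0.87,
    cluster_concept_id=4273629,
    cluster_concept_name="Chemotherapy",
    is_inclusion=True,
    domain="Procedure",
    temporal_max=-3,
    temporal_unit="weeks",
)
```

Keeping the numeric and temporal bounds optional mirrors the N/A entries in the example column.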
Question templates used for different domains
| | Question Template | Answer Template |
|---|---|---|
| | | |
| | Have you ever been diagnosed with [condition_concept]? ( | yes/no/don't know |
| | N/A | N/A |
| | Could you provide the start and end time? ( | start_date, end_date |
| | | |
| | Do you currently have or have you ever had/been [observation_concept]? ( | yes/no/don't know |
| | N/A | N/A |
| | Could you provide the start and end time? ( | start_date, end_date |
| | | |
| | Do you know your most recent [measurement_concept]? | yes/no/don't know |
| | Please enter the value: ( | value_as_number/NULL |
| | N/A | N/A |
| | | |
| | Have you ever taken or received [drug_concept]? ( | yes/no/don't know |
| | N/A | N/A |
| | Could you provide the start and end time? ( | start_date, end_date |
| | | |
| | Have you ever undergone a(n) [procedure_concept]? ( | yes/no/don't know |
| | N/A | N/A |
| | Could you provide the start and end time? ( | start_date, end_date |
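The templates above lend themselves to simple string substitution. The following is a minimal sketch, not the system's actual implementation: the dictionary layout, the `render_questions` function, and the `{concept}` placeholder syntax are all assumptions made for illustration, and the parenthetical hints that are truncated in the table are left out.

```python
# Hypothetical mapping from OMOP domain to the first question asked.
QUESTION_TEMPLATES = {
    "condition": "Have you ever been diagnosed with {concept}?",
    "observation": "Do you currently have or have you ever had/been {concept}?",
    "measurement": "Do you know your most recent {concept}?",
    "drug": "Have you ever taken or received {concept}?",
    "procedure": "Have you ever undergone a(n) {concept}?",
}

# Measurements are followed by a value prompt; the other domains in the
# table are followed by a start/end time prompt.
VALUE_FOLLOW_UP = "Please enter the value:"
TEMPORAL_FOLLOW_UP = "Could you provide the start and end time?"


def render_questions(domain: str, concept_name: str) -> tuple[str, str]:
    """Return (initial question, follow-up question) for one concept."""
    key = domain.lower()
    initial = QUESTION_TEMPLATES[key].format(concept=concept_name)
    follow_up = VALUE_FOLLOW_UP if key == "measurement" else TEMPORAL_FOLLOW_UP
    return initial, follow_up


initial, follow_up = render_questions("Procedure", "Chemotherapy")
print(initial)
print(follow_up)
```

An unknown domain raises `KeyError` here, which keeps the sketch short; a real interface would presumably handle that case more gracefully.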
Figure 2. The DQueST user interface. (A) Trial searches can be initialized by searching for keywords, demographics, and location using the API provided by . Users can use advanced search to restrict results to ongoing trials only. (B) Question-guided interactive trial searching then filters out ineligible trials dynamically. The remaining trials in will be updated and a new question will be asked once the user clicks the confirm button. (C) The remaining eligible trials are shown, and users can click the link to see more details about a specific trial in the repository. (D) Users can use the navigation panel to provide a different answer to any previous question or simply review the remaining trials at any question stage.
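The question-guided filtering in panel (B) could work roughly as sketched below. This is an assumed rule, not the documented DQueST logic: a "yes" answer drops trials that list the concept as an exclusion criterion, a "no" answer drops trials that list it as an inclusion criterion, and "don't know" leaves the list unchanged. The function name, the tuple layout, and the trial identifiers are hypothetical.

```python
from typing import Iterable, Set, Tuple

# (trial ID, concept name, True if inclusion criterion / False if exclusion)
Criterion = Tuple[str, str, bool]


def filter_trials(
    remaining: Set[str],
    criteria: Iterable[Criterion],
    concept: str,
    answer: str,
) -> Set[str]:
    """Return the trials still eligible after one answered question."""
    if answer not in {"yes", "no"}:
        # "don't know" (or anything else) filters nothing.
        return set(remaining)
    has_concept = answer == "yes"
    # A trial is dropped when the answer contradicts its criterion:
    # inclusion criterion not met, or exclusion criterion met.
    dropped = {
        trial_id
        for trial_id, crit_concept, is_inclusion in criteria
        if crit_concept == concept and is_inclusion != has_concept
    }
    return set(remaining) - dropped


criteria = [
    ("TRIAL_A", "Chemotherapy", True),   # requires prior chemotherapy
    ("TRIAL_B", "Chemotherapy", False),  # excludes prior chemotherapy
    ("TRIAL_C", "Asthma", True),
]
trials = {"TRIAL_A", "TRIAL_B", "TRIAL_C"}
print(sorted(filter_trials(trials, criteria, "Chemotherapy", "yes")))
```

A purely concept-level rule like this ignores negation and modifiers in the criterion text, so it can drop trials it should keep.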
Basic statistics for records, entities, concepts, and concept clusters
| Domain | Measurement | Condition | Drug | Observation | Procedure |
|---|---|---|---|---|---|
| Total number of entity occurrences | 811 822 | 3 187 262 | 1 010 898 | 312 375 | 882 553 |
| Total number of unique entities | 104 334 | 302 426 | 93 815 | 3977 | 30 116 |
| Total number of unique entities mapped to the OMOP CDM | 13 665 | 107 763 | 14 314 | 1385 | 10 004 |
| Total number of unique concepts | 4547 | 18 684 | 6128 | 853 | 3581 |
| Total number of unique concept clusters | N/A | 4094 | 4720 | 642 | 2193 |
Figure 3. The average percent of trials filtered out (total number of trials) vs the number of questions answered.
Abbreviations: AD, Alzheimer’s disease; ATHM, Asthma; BRCA, Breast cancer; CAD, Coronary artery disease; DEPR, Depression; DM, Diabetes mellitus; HIV, Human immunodeficiency virus infection; HTN, Hypertension; OBESE, Obesity; RA, Rheumatoid arthritis.
The top 10 concepts identified in eligibility criteria associated with the most trials
| Concept Name | Domain | Number of unique trials associated with |
|---|---|---|
| | condition | 76 719 |
| | condition | 22 832 |
| | condition | 19 175 |
| | condition | 15 745 |
| | condition | 14 065 |
| | measurement | 13 333 |
| | procedure | 12 080 |
| | condition | 10 173 |
| | condition | 8720 |
| | condition | 8708 |
Examples of trials that are erroneously filtered out
| Question | Answer | NCT_ID | Criteria and its Section | Error Reason |
|---|---|---|---|---|
| | Yes | NCT00948766 | | Inappropriate granularity |
| | Yes | NCT00477659 | | “permitted” in exclusion |
| | No | NCT02958670 | | Condition for family member |
| | No | NCT00814697 | | “must not” in exclusion |
| | Yes | NCT02968719 | | Mismatched question and criteria |
Estimated percentage of remaining trials considered to be eligible for the mock patient by physician and biomedical informatician, respectively
| Mock patient | System | Physician | Biomedical Informatician | Mean between Physician and Informatician |
|---|---|---|---|---|
| | DQueST | 20% | 50% | 35% |
| | | 10% | 30% | 20% |
| | DQueST | 50% | 60% | 55% |
| | | 40% | 40% | 40% |
| | DQueST | 30% | 40% | 35% |
| | | 20% | 30% | 25% |
| | DQueST | 20% | 40% | 30% |
| | | 20% | 20% | 20% |
| | DQueST | 60% | 70% | 65% |
| | | 50% | 50% | 50% |
User evaluation of the usability of DQueST
| Question | Mean | Std. Dev. |
|---|---|---|
| 1 I found the tool unnecessarily complex | 1.92 | 0.90 |
| 2 I thought the tool was easy to use | 4.00 | 0.43 |
| 3 I would need the support of a technical person to be able to use this system | 2.42 | 1.44 |
| 4 I would imagine that most people would learn to use this tool very quickly | 3.92 | 0.90 |
| 5 I thought the questions were clear and easy to understand | 4.67 | 0.49 |
| 6 I thought the questions were relevant to my search | 4.25 | 0.97 |
| 7 I am likely to use this tool to search for clinical trials | 4.08 | 1.00 |
| 8 I found this tool more useful than other trial search tools I have used in the past | 3.92 | 1.24 |
| 9 The number of questions to identify clinical trials was appropriate | 4.50 | 0.52 |
| 10 How likely are you to recommend this tool to others? | 9.33 | 0.78 |
| # of Questions Answered | 5.33 | 3.73 |
| Time Spent on Questionnaire (minutes) | 5.83 | 1.95 |
For questions 1–9, ‘1’ indicates ‘Strongly Disagree’ and ‘5’ indicates ‘Strongly Agree’. For question 10, ‘1’ indicates ‘No recommendation’ and ‘10’ indicates ‘Strongly recommended’.