| Literature DB >> 28815102 |
Vibha Anand1, Amos Cahan1, Soumya Ghosh1.
Abstract
ClinicalTrials.gov was established as a web-based registry for clinical trials of human participants in 2000. Mandatory registration started in 2008. Given more than a decade of registered trials, it's important to understand the "topic" areas and their evolution over time from this resource. This information may help in identifying current knowledge gaps. We use dynamic topic model (DTM) methods to discover topics and their evolution over last 17 years. Our model suggests that there are disease or organ specific trials such as 'Cardiovascular disorders', Heart & Brain conditions', or 'Breast & Prostate cancer' as well as trials registered for general health. General health trials are less likely to be FDA regulated, but both health and pain management, as well as surgical, heart, and brain trials have upward trend in recent years while advanced cancer trials have downward trended. Our model derives unique insights from metadata associated with each topic area.Entities:
Year: 2017 PMID: 28815102 PMCID: PMC5543348
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Metadata in ClinicalTrials.gov registry used in this study (n=218,618)
| Variables | Description | Count | |
|---|---|---|---|
| 1. | Title | Unique title record of trial | 217,047 |
| 2. | Conditions | Disease condition(s) for trial | 79,752 |
| 3. | First Received | Date in years | 17 |
| 4. | Gender | Both, Female, Male | |
| 5. | Overall status | Status of the trial Completed, Recruiting, Terminated, Active, Not Recruiting, Withdrawn | |
| 6. | Study Types | Interventional, Observational, Expanded Access | |
| 7. | Is Section 801 | Yes, No | |
| 8. | Is FDA Regulated | Yes, No | |
Figure 1:Dynamic Topic Model
Human interpretation based on top words from model
| Topic Number | Top words in Topic | Human Interpretation / Description | Generic(G) vs. Topical (T) |
|---|---|---|---|
| 0 | ‘study’, ‘trial’, ‘clinical’, ‘controlled’, ‘randomized’, ‘tive’, ‘patients’, ‘pilot’, ‘evaluation’ | G | |
| 1 | imaging’, ‘using’, ‘ultrasound’, ‘guided’, ‘study’, ‘monitoring’, ‘infants’, ‘sleep’, ‘atrial’ | T | |
| 2 | ‘treatment’, ‘therapy’, ‘life’, ‘disorder’, ‘depression’, ‘quality’, ‘disorders’, ‘stress’, ‘patients’ | T | |
| 3 | patients’, ‘treatment’, ‘safety’, ‘efficacy’, ‘study’, ‘severe’, ‘term’, ‘infection’, ‘hepatitis’ | T | |
| 4 | ‘patients’, ‘acute’, ‘syndrome’, ‘coronary’, ‘artery’, ‘tissue’, ‘analysis’, ‘stroke’, ‘response’ | T | |
| 5 | ‘patients’, ‘function’, ‘training’, ‘device’, ‘heart’, ‘failure’, ‘brain’, ‘exercise’, ‘disease’ | T | |
| 6 | ‘cancer’, ‘breast’, ‘patients’, ‘women’, ‘prostate’, ‘therapy’, ‘chemotherapy’, ‘treatment’, ‘ovarian’ | T | |
| 7 | ‘surgery’, ‘versus’, ‘post’, ‘patients’, ‘knee’, ‘cardiac’, ‘postoperative’, ‘block’, ‘following’ | T | |
| 8 | ‘cancer’, ‘patients’, ‘advanced’, ‘metastatic’, ‘cell’, ‘lung’, ‘tumors’, ‘combination’, ‘carcinoma’ | T | |
| 9 | disease’, ‘patients’, ‘chronic’, ‘pulmonary’, ‘refractory’, ‘relapsed’, ‘cell’, ‘lymphoma’, ‘leukemia’ | T | |
| 10 | ‘care’, ‘based’, ‘health’, ‘patient’, ‘program’, ‘management’, ‘outcomes’, ‘improve’, ‘weight’ | T | |
| 11 | ‘diabetes’, ‘intervention’, ‘activity’, ‘patients’, ‘type2’, ‘children’, ‘adults’, ‘control’, ‘insulin’ | T | |
| 12 | ‘safety’, ‘study’, ‘healthy’, ‘subjects’, ‘efficacy’, ‘evaluate’, ‘tolerability’, ‘dose’, ‘pharmacokinetics’ | T | |
| 13 | ‘risk’, ‘high’, ‘blood’, ‘cells’, ‘patients’, ‘transplantation’, ‘stem’, ‘liver’, ‘bone’ | T | |
| 14 | pain’, ‘treatment’, ‘stimulation’, ‘patients’, ‘chronic’, ‘versus’, ‘emergency’, ‘back’, ‘induced’ | T |
Figure 2:(a) Topic Proportion (b) Documents per topic ( incomplete data for 2016)
Figure 3:Diseases or Conditions in discovered “topics” (size of words reflect the frequency in metadata “conditions” field)
Figure 4:Source of trials in discovered “topics” (size of words reflect the frequency in metadata field - source)
Figure 5:Overall status of trials in each discovered “topic”