| Literature DB >> 24410735 |
Justin Doods, Florence Botteri, Martin Dugas, Fleur Fritz1.
Abstract
BACKGROUND: Clinical studies are a necessity for new medications and therapies. Many studies, however, struggle to meet their recruitment numbers in time or have problems in meeting them at all. With increasing numbers of electronic health records (EHRs) in hospitals, huge databanks emerge that could be utilized to support research. The Innovative Medicine Initiative (IMI) funded project 'Electronic Health Records for Clinical Research' (EHR4CR) created a standardized and homogenous inventory of data elements to support research by utilizing EHRs. Our aim was to develop a Data Inventory that contains elements required for site feasibility analysis.Entities:
Mesh:
Year: 2014 PMID: 24410735 PMCID: PMC3895709 DOI: 10.1186/1745-6215-15-18
Source DB: PubMed Journal: Trials ISSN: 1745-6215 Impact factor: 2.279
Data element example
| Findings/Weight | 80 kg | The weight of a subject. |
Example for the definition of data elements. Data element concepts consist of a data group and data item part. The elements also contain an example, a definition and a link to the NCI Metathesaurus referencing its UMLS concept.
Overview of the companies, the numbers and disease areas of studies used
| AMGEN | | | | 1 | |
| AstraZeneca | | | | 1 | |
| Bayer Health Care | 2 | | | | |
| GlaxoSmithKline | | | | 3 | |
| Johnson & Johnson | | 1 | | 1 | |
| MERCK KGaA | | | | | 2 |
| Novartis Pharma AG | 1 | | 1 | | 1 |
| F. Hoffmann-La Roche Ltd. | 1 | | | | |
| Sanofi-Aventis | 2 |
Figure 1Main steps of the iterative approach to create the data inventory. A list of data elements (DI (previous iteration)) is extended by data elements of simplified eligibility criteria. The availability of data elements from the extended list (Preliminary DI) then get validated through data exports (DE) at the sites, which afterwards get analyzed. Data elements which are hardly used or not available at the sites get removed from the DI and are added to the wish list. The remaining elements form a new version of the Data Inventory (DI (new version)).
Figure 2Examples from the data inventory. Each element in the data inventory contains a sequential number, the data group and item, an example of a possible value, the textual definition and the corresponding NCIm link.
Data elements of the wish list
| Findings | QTc interval, Left ventricular ejection fraction |
| Laboratory findings | MAGE-A3 status |
| Medical history | Trial title, Inclusion date, End of participation date, Current method of contraception, Vaccines, HIV status, Lactation |
| Patient characteristics | Willingness to participate in clinical trials |
| Scores/classifications | Date of score/Classifications, Karnofsky-score, Eastern Cooperative Oncology Group -performance status, TNM-classification, New York Heart Association - status, Response Evaluation Criteria in Solid Tumors, Hoehn and Yahr, scale, GRID-Hamilton Depression Rating Scale, Mini-Mental State Examination, Unified Parkinson’s Disease Rating Scale Section 1 |
The wish list contains data elements that are currently not, or very rarely available, in European EHRs, but that are frequently requested in study protocols.
Figure 3Heat map of the data exports from the data inventory current version. The first two columns describe the ISO 11179 data element concept (data group/data item). The third column shows the average usage of the data element over all sites while the following columns (site 1 to site 9) display the frequency at the individual sites. The Data Inventory is ordered by the average usage sorted in descending order from most available to least. The frequency ranges from 100% (dark green) to 0% (dark red). Data elements that are not available at a site are shown as Not Available (NA) (black).
Comparison of the Data Inventory with US cardiovascular data fields[18]
| History and physical examination elements | 8 | 24 | 5 |
| Pharmacological therapy data elements | 0 | 20 | 0 |
| Laboratory results elements | 10 | 0 | 1 |
| Diagnostic and therapeutic procedures elements | 2 | 2 | 26 |
| Outcomes data elements | 0 | 0 | 2 |
Exact matches are available in both lists; no matches means that the data field is not represented in the Data Inventory and ‘data field as value of a data element’ means that data fields can be matched to data elements because they refer to similar concepts (for example, data field ‘Diabetes’ corresponds to data elements ‘Diagnosis/Text’).
Semantic classes from Weng[12]/Luo[13]corresponding to the data groups
| Diagnosis | Disease, Symptom and signs, Neoplasm status |
| Procedures | Therapy or surgery, Diagnostic or lab results |
| Laboratory findings | Diagnostic or lab results |
| Findings | Diagnostic or lab results |
| Medical history | Pregnancy-related activity, Addictive behavior |
| Scores and Classification | Neoplasm status, Disease stage |
| Medication | Pharmaceutical substance or drug |
| Demographics | Age, Gender |
Comparison between data groups of this work and semantic classes according to Weng [12]/Luo [13]. Some of the semantic classes are listed more than once because they correspond to more than one data group. Similarly, one data group can correspond to one or more semantic class.