| Literature DB >> 31074176 |
Stanley C Ahalt1, Christopher G Chute2, Karamarie Fecho1, Gustavo Glusman3, Jennifer Hadlock3, Casey Overby Taylor2, Emily R Pfaff4, Peter N Robinson5, Harold Solbrig2, Casey Ta6, Nicholas Tatonetti6, Chunhua Weng6.
Abstract
Entities:
Mesh:
Year: 2019 PMID: 31074176 PMCID: PMC6617834 DOI: 10.1111/cts.12638
Source DB: PubMed Journal: Clin Transl Sci ISSN: 1752-8054 Impact factor: 4.689
Clinical data types, regulatory access restrictions, and applications
| Clinical data type | Brief description | Regulatory access restrictions | Applications |
|---|---|---|---|
| Fully identified clinical data sets | Observational patient data derived from paper‐based or electronic medical records | IRB approval is required; an executed data use agreement is possibly required | Clinical interpretation and scientific inference and discovery |
| HIPAA‐limited clinical data sets | Observational patient data containing only a limited set of HIPAA‐defined PHI | IRB approval is required; an executed data use agreement is possibly required | Clinical interpretation and scientific inference and discovery, but with the understanding that certain data elements have been removed from the data and/or transformed |
| Deidentified clinical data sets | Observational patient data, but with all HIPAA‐defined PHI elements removed | IRB approval is not required | Clinical interpretation and scientific inference and discovery, but with the understanding that inferences regarding time and potentially other factors cannot be made |
| HuSH+ clinical data sets |
Observational patient data, fully compliant with HIPAA Safe Harbor, but unlike deidentified clinical data sets, HuSH+ clinical data sets have been altered such that (i) real patient identifiers (including geocodes) have been replaced with random patient identifiers and (ii) dates (including birth dates) have been shifted by a random number of days (maximum of ± 50 days), with all dates for a given patient shifted by the same number of days | An executed data use agreement is required | Clinical interpretation and scientific inference and discovery, but with the understanding that any inferences based on date/time and location (geocode) cannot be made with precision, and all other inferences must consider date/time and location as potentially hidden covariates |
| Clinical profiles |
Statistical profiles of disease and associated phenotypic presentation derived from observational patient data | IRB approval is required to generate clinical profiles; no other restrictions apply | Clinical interpretation and scientific inference, but with the understanding that the data represent statistical profiles |
| Synthetic clinical data sets | Realistic, but not real, observational patient data generated statistically using population distributions of observational patient data | None | Feasibility assessments and algorithm validation; generation of clinical profiles |
| COHD |
Counts of observational clinical co‐occurrences (e.g., co‐occurrences of specific diagnoses and prescribed medications), as well as their relative frequency and observed–expected frequency ratio | None | Clinical interpretation and scientific inference, but with the understanding that the data are restricted to co‐occurrences |
| ICEES |
Patient‐level or visit‐level counts of observational patient data integrated at the patient and visit level with a variety of environmental exposures derived from multiple public data sources | IRB approval is required to generate ICEES integrated feature tables; no other restrictions apply | Clinical interpretation and scientific inference, but with the understanding that the raw data have been transformed (e.g., binned or categorized) |
COHD, Columbia Open Health Data; HIPAA, Health Insurance Portability and Accountability Act; HuSH+, HIPAA Safe Harbor Plus; ICEES, Integrated Clinical and Environmental Exposures Service; IRB, institutional review board; PHI, protected health information; UNC, University of North Carolina.
aIndividual institutions may require a secure workspace for data access and use. bWhile HIPAA and IRB regulations do not apply, institutional approvals may be required. cHuSH+ clinical data sets were conceptualized and created by UNC as part of the National Center for Advancing Translational Sciences–funded Biomedical Data Translator program. The institution requires a fully executed data use agreement for access to the data.
Figure 1Example queries, including input parameters and output, for Columbia Open Health Data (COHD) (a) and the Integrated Clinical and Environmental Exposures Service (ICEES) (b). AvgDailyPM2.5Exposure = average daily patient exposure to PM2.5 (μg/m3) over a 1‐year study period; TotalEDInpatient Vists = total number of emergency department or inpatient visits for respiratory issues during a 1‐year study period. The study period shown here is for calendar year 2010. AvgDailyPM2.5Exposure <3 range: 1.58, 9.63 μg/m3; AvgDailyPM2.5Exposure ≥3 range: 9.63, 17.33 μg/m3. ID, identifier; PM2.5, airborne particulate matter ≤2.5 μm in diameter.