| Literature DB >> 30993808 |
Helen Strongman1, Rachael Williams1, Wilhelmine Meeraus1, Tarita Murray-Thomas1, Jennifer Campbell1, Lucy Carty1, Daniel Dedman1, Arlene M Gallagher1, Jessie Oyinlola1, Antonis Kousoulis1, Janet Valentine1.
Abstract
PURPOSE: UK primary care provides a rich data source for research. The impact of proposed data collection restrictions is unknown. This study aimed to assess the impact of restricting the scope of electronic health record (EHR) data collection on the ability to conduct research. The study estimated the consequences of restricted data collection on published Clinical Practice Research Datalink studies from high impact journals or referenced in clinical guidelines.Entities:
Keywords: bias; electronic health records; pharmacoepidemiology; primary care
Mesh:
Year: 2019 PMID: 30993808 PMCID: PMC6618795 DOI: 10.1002/pds.4765
Source DB: PubMed Journal: Pharmacoepidemiol Drug Saf ISSN: 1053-8569 Impact factor: 2.890
Figure 1Flow diagram of manuscript identification
Impact on published studies due to restrictions on the time period of data collectiona
| Historical Data Available From 01/01/2010 (6 y of Data) | Historical Data Available From 01/01/2004 (12 y of Data) | |
|---|---|---|
| Total unfeasible or compromised studies according to year, % (n = 100) |
|
|
| Compromised | 55 | 14 |
| Unfeasible | 12 | 8 |
| Studies compromised because of restricted follow‐up, % (n = 100) | 65 | 18 |
| Bias introduced (eg, misclassification) | 51 | 15 |
| Unable to measure exposure‐outcome association | 25 | 1 |
| Cannot examine temporal trends | 12 | 3 |
| Study considered unfeasible | 6 | 2 |
| Studies compromised because of restricted calendar time, % (n = 100) | 10 | 8 |
| Exposure/outcome not available | 1 | 0 |
| Event of interest not available | 9 | 8 |
| Unfeasible for other reason | 1 | 0 |
Some studies are restricted by both follow‐up and calendar time and thus included in both categories.
Key overall results highlighted in bold.
Characteristics of included studies
| Reason for inclusion | % |
|---|---|
| Published in a high‐impact factor journal | 79 |
| Referenced in a UK clinical guidance document | 17 |
| Both | 4 |
| Year of publication | % |
| 2000 | 8 |
| 2001 | 5 |
| 2002 | 4 |
| 2003 | 5 |
| 2004 | 7 |
| 2005 | 6 |
| 2006 | 4 |
| 2007 | 8 |
| 2008 | 3 |
| 2009 | 6 |
| 2010 | 10 |
| 2011 | 3 |
| 2012 | 6 |
| 2013 | 11 |
| 2014 | 8 |
| 2015 | 2 |
| Journal title | % |
|
| 52 |
| Other | 21 |
|
| 10 |
|
| 8 |
|
| 5 |
|
| 3 |
|
| 1 |
| Study type | % |
| Adverse drug reaction/drug safety | 43 |
| Disease epidemiology | 31 |
| Drug effectiveness | 14 |
| Health/public health services research | 8 |
| Other | 2 |
| Drug utilisation | 1 |
| Methodological | 1 |
| Data source | % |
| CPRD primary care data only | 75 |
| CPRD primary care linked data | 17 |
| Other nonlinked UK database | 6 |
| Other non‐UK database | 2 |
| Type of CPRD primary care linked data | % |
| Any | 17 |
| HES admitted patient care | 10 |
| ONS mortality | 7 |
| Deprivation | 3 |
| HES outpatient | 2 |
| Myocardial Ischaemia National Audit Project (MINAP) | 2 |
| Cancer registry | 1 |
| Other | 1 |
| Five most frequently studied disease areas | % |
| Circulatory system diseases | 34 |
| Mental disorders | 16 |
| Neoplasms (ie, cancer related) | 13 |
| Digestive system diseases | 12 |
| Endocrine, nutritional, metabolic, and immunity disorders | 11 |
Abbreviations: CPRD, Clinical Practice Research Datalink; HES, Hospital Episode Statistics; ONS, Office for National Statistics.
Impact on published studies due to retrospective data collection restricted to specified diseases only
| Aspect of Study Affected by Restriction | |||||
|---|---|---|---|---|---|
| Study Population | Exposure | Primary Outcome | Important Covariates | Study as a Whole | |
| Total unfeasible or compromised studies, % (n = 100) | 28 | 25 | 51 | 30 |
|
| Studies with potential for modified design, % (n = 100) | 12 | 7 | 20 | 15 |
|
| Studies that could not be done, even with design changes, % (n = 100) | 16 | 18 | 31 | 15 |
|
Studies may have more than one reason for being unfeasible or compromised.
Key overall results highlighted in bold.
Impact on published studies due to restrictions on collection of sensitive data
| Aspect of Study Affected by Restriction | |||
|---|---|---|---|
| Study Population | Primary Outcome | Total | |
| Total unfeasible or compromised studies, % (n = 100) | 8 | 2 |
|
| Requiring legally restricted Read codes | 4 | 1 | 5 |
| Requiring abortion or gender Read codes | 4 | 1 | 5 |
| Studies with potential for modified design, % (n = 100) | 7 | 1 |
|
| By changing definition of study population | 5 | 1 | 6 |
| By using linked data from other sources | 2 | 0 | 2 |
| Studies that could not be done, even with design changes, % (n = 100) | 1 | 1 |
|
Key overall results highlighted in bold.