| Literature DB >> 35387634 |
Rachel Canaway1, Douglas Boyle2, Jo-Anne Manski-Nankervis1, Kathleen Gray3.
Abstract
BACKGROUND: Most people receive most of their health care in in Australia in primary care, yet researchers and policymakers have limited access to resulting clinical data. Widening access to primary care data and linking it with hospital or other data can contribute to research informing policy and provision of services and care; however, limitations of primary care data and barriers to access curtail its use. The Australian Health Research Alliance (AHRA) is seeking to build capacity in data-driven healthcare improvement; this study formed part of its workplan.Entities:
Keywords: Australia; Data curation; Data linkage; Data management; Data quality; Primary health care data; Secondary use of data
Mesh:
Year: 2022 PMID: 35387634 PMCID: PMC8988328 DOI: 10.1186/s12911-022-01830-9
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Data custodian and user perspectives on benefits and limitations of secondary use of primary care data
| Synopsis of survey respondents’ perspectives on secondary use of primary care data (n = 53) | |
|---|---|
| Intrinsic benefits of primary care data | Unique, rich, granular, ‘real world’ data with capacity to provide more population health information than any other health data source. Makes regional and remote-level information more accessible. Minimises measurement bias in research. When linked it creates systems view and triangulation, creating a ‘patient centred view’ of care pathways, patient needs and service gaps. |
| Assists policy and planning for provision of improved health services and health outcomes | Its analysis enables greater knowledge to assess and improve services in localities or broadly, service quality improvement (through competitive benchmarking), understanding of treatment outcomes and population health improvements. Provides an evidence-base for investment, interventions and efficiencies in health spending and technical infrastructure. Can inform policy and workforce planning. Contributes to a ‘Learning Healthcare System’. |
| Pragmatic research efficiencies & improvements | Data driven research can be rapid and cost effective leading to cost reductions. Big data (from large electronic medical record repositories) increases statistical power and increases research scale. Big data research can illuminate aspects of primary care otherwise not seen and can generate new research questions. |
| Patient generated data | Technologies can facilitate patient reported data collection through add-on apps, with potential to enhance primary care data. |
| Practice level | Enables providers to review their activities and make business improvements and ability to track and improve patient outcomes. |
| Technical and data capture limitations | Limitations from using or merging data from Clinicians who use paper-based rather than digital records. |
| Poor data quality, reliability | Data captured in non-standardised clinical software systems and extracted using non-standardised data capture systems leading to data inaccuracies and loss of data context. Poor use of existing field coding by clinicians and high use of free text limiting data utility (and adding to burden of data cleaning). Incomplete data fields (fields not captured by clinician or not extracted for secondary use). Difference between terms used by GPs and available SNOMED Clinical Terms prompting free text entries, and preferred terms changing over time, limiting data standardisation. |
| Data governance and access requirements limiting its usability | Requirements that data be de-identified or aggregated limits its utility. Lack of a minimum primary care/general practice data set in Australia containing data from all providers. Lack of a nationally endorsed patient unique identifier limits data linkage and identification of patient duplication. Limited permissions on what data can be linked and the ‘arduous’ nature of permissions process and cost to access data. Little incentive for primary care providers to share data leading to limited data available for secondary use. |
| Poor understanding of data complexity and context | Many data end-users unable to appropriately interpret the data because they do not understand its social and clinical context. Data collected for one purpose (clinical care) being used for purposes other than the primary purpose. |
| Unequal data representativeness | Lack of available data on priority populations including culturally and linguistically diverse, aboriginal communities, under-representation of vulnerable groups and over-representation of the ‘worried well’. |
| Privacy concerns, trust and ownership | Lack of community consultation on data use. Concern that shared data are stored off-shore or its use cannot be controlled. Unclear consent mechanisms and privacy concerns not addressed limiting clinicians sharing data. Varying ideas of who owns the data limiting the extent to which it is shared. |
| Lack of guidelines, policies, standards and ‘common data model’ | The following limiting availability and utility of secondary primary care data use, lack of: national standards for general practice data quality and evaluation, clinical data capture system interoperability (too many clinical data systems), standardised data extraction tools, standard coding and common terminology; leadership to improve data standards and a ‘common data model’. |
Adapted from [37]
Data custodian and user perspectives on barriers and enablers of secondary use of primary care data
| Synopsis of survey respondents’ perspectives on secondary use of primary care data (n = 53) | |
|---|---|
| Fear, reticence and lack of trust | GP concerns for patient Fear or |
| Leadership, governance & ethical constraints | Lack of clarity on what constitutes ‘deidentified’ data and concern about sharing ‘deidentified’ data without explicit patient consent. |
| Lack of data availability | Lack of available longitudinal patient data. Incomplete data entry by service providers. |
| Lack of access due to cost or awareness | Limited knowledge about what data are available and how to access it. Prohibitive cost to access data for research. The high costs charged by vendors to use their data extraction tools and to access extended tool applications and enhancements. |
| Lack of expertise, experience & incentive | Too few clinicians involved in planning data analyses and in reaching research conclusions. General practice staff not motivated to collect clean, accurate and complete data. Absence of shared vision/capacity to build systems to utilise current non-standardised data sources. |
| Barriers to data linkage | Inability to link patient data. Lack of a reliable individual person identification numbers for data linkage. Lack of availability of, and access to, some datasets needed for linkage (lack of stakeholder agreement and governance arrangements). |
Technical systems barriers | Data extractions tools unable to collect from all clinical software systems. Poor data quality (completeness, cleanliness, granularity) as barrier to better use. Inadequate national digital health record, lack of primary care minimum data set and lack of consensus on what a minimum dataset should include. Relatively few providers of data warehousing. |
| Health system & resource barriers | Lack of funding to collect and analyse data and to support the implementation of findings; Lack of motivation, capacity, resources and education to prioritise data input and improve data quality; Lack of research resources to interpret data (including for Primary Health Networks to interpret for planning purposes); Insufficient research and skills/capacity building funding for the academic primary care sector. |
| Qualities | Build primary care provider and public |
| Leadership | |
| Governance | Improved, ‘tighter’ or ‘clearer’ governance with: unambiguous and agreed strategic framework(s), |
| Partnerships and capacity building | Facilitate Enable consumers to understand the research value of primary care data (especially when linked to other datasets) and build on public expectation/perception that policymakers may already use linked data systems to improve services. |
| Technologies and method development | Better use of secondary data through advancement in computing hardware and software technologies, Improved portals for practice display of data to encourage continuous quality improvement in data capture. Advancement in: data storage and IT security, technical cross-sectoral capability to enable data linkage, |
| Resources | Train primary care staff and clinicians in health informatics and educate on data value and best practice data collection, quality improvement and better use of own data ( |
Adapted from [37]
Jurisdiction of respondent affiliations, N = 62
| Organisation | Total n (%) | Data custodians n (%) | Data users n (%) |
|---|---|---|---|
| Educational or Research Institute * (including universities) | 24 (38.7) | 8 (26.7) | 16 (50.0) |
| Primary Health Network | 24 (38.7) | 16 (53.3) | 8 (25.0) |
| Government | 6 (9.7) | 1 (3.3) | 5 (15.6) |
| General Practice* | 3 (4.8) | 2 (6.7) | 1 (3.1) |
| Pharmaceutical | 1 (1.6) | Nil | 1 (3.1) |
| Health insurer | Nil | Nil | Nil |
| Other (incl. software developer, non-Government/non-University data holder) | 3 (4.8) | 3 (10.0) | Nil |
| Not stated | 1 (1.6) | Nil | 1 (3.1) |
| State or Territory | |||
| Australian Capital Territory | 7 (11) | 3 | 4 |
| New South Wales | 18 (29) | 10 | 8 |
| Northern Territory | 3 (5) | 0 | 3 |
| Queensland | 8 (13) | 5 | 3 |
| South Australia | 3 (5) | 1 | 2 |
| Tasmania | 2 (3) | 0 | 2 |
| Victoria | 17 (27) | 9 | 8 |
| Western Australia | 2 (3) | 1 | 1 |
| Elsewhere (National) | 1 (2) | 1 | 0 |
| Not stated | 1 (2) | 0 | 1 |
| Total | 62 (100) | 30 (100) | 32 (100) |
Table adapted from [37]
*Six of the persons from research/education institutions also worked in general practice. The 3 GPs noted worked only in general practice
Most commonly identified primary care datasets for ‘secondary use’ and their characteristics identified by respondents
| Times mentioned | Dataset used for secondary purposes | Jurisdiction | Data extraction/collection tool | Purpose |
|---|---|---|---|---|
| > 20 | Primary Health Network (PHN) collected data (individual datasets held by 28 PHNs) | National | PenCS tools, POLAR, Primary Sense | Audit, health planning, quality improvement, sometimes research |
| 20 | NPS MedicineInsight | National | GRHANITE® and cdmNET | Post market surveillance, audit, research |
| 11 | BEACH (Bettering the Evaluation and Care of Health) data (1998–2016) | National | Paper-based data collection | Research |
| 11 | Outcome Health and POLAR data | NSW, Victoria | POLAR | Audit (used by PHNs), research |
| 11 | Medical Benefits Scheme (MBS) data | National | Administrative claims | Administrative, audit, research |
| 11 | Pharmaceutical Benefits Scheme (PBS) data | National | Administrative claims | Administrative, audit, research |
| 9 | PHN related Primary Mental Health Care Minimum Data Set | National | PenCS tools | Audit, health planning, quality improvement, |
| 7 | Patron primary care data repository/Data for Decisions (University of Melbourne) | Victoria | GRHANITE® | Research |
| 4 | Aboriginal Community Controlled Organisations/Aboriginal Medical Services | National | Administrative data | Clinical care, audit |
| 4 | Australian Institute of Health and Welfare (AIHW) held data ( | National | Administrative data | Audit, health planning |
| 3 | Australian Immunisation Register (AIR) | National | Administrative data | Audit, surveillance |
| 3 | University of NSW ePractice-Based Research Network data | NSW | GRHANITE® | Research |
| 3 | Medical Director (clinical software vendor held data) | National | Cloud-based collection | Not stated |
| 3 | 10% MBS and PBS sample data (no longer available) | National | Administrative claims | Research |
| 3 | Patient Reported Experience Measures (Australian Bureau of Statistics) | National | Household survey questionnaire | Care planning |
| 3 | My Health Record | National | Cloud-based collection | Clinical care, research |
Table adapted from [37]
NSW, New South Wales
Exemplar issues raised by interview participants on data linkage
Quotes/table taken and adapted from [37]
Notable limitations of data quality frameworks
| Lack of accessible and agreed standards: No agreed standard data quality framework (that is straightforward to apply) and no defined data coding/mapping standard |
| Shortfalls of “SNOMED Clinical Terms” in practical applications |
| Lack of resourcing and activities to support primary care providers to implement data quality improvements at point of data capture |
| The ‘resource drain’ for researchers or data custodians to implement a comprehensive data quality framework |
| Inconsistencies or lack of transparency around data transformation related to data extraction tools, leading to data quality issues including inconsistent or inaccurate results |
| Uncertainty among data custodians on types and definitions of data ‘de-identification’, leading to the possibility of secondary users re-identifying individuals in datasets |
| Technical limitations of received data structures and data tools limiting data recipients’ ability to analyse and report received data |
Adapted from [37]