| Literature DB >> 32456382 |
Seonji Kim1, Myo-Song Kim2, Seung-Hun You2, Sun-Young Jung2.
Abstract
An increasing number of studies are using healthcare claims databases to assess healthcare intervention utilization patterns or outcomes in real-world clinical settings. However, methodological issues affecting study design or data analysis can make conducting and reporting these types of studies difficult. This review presents an overview of the types of information contained in claims data, describes some advantages and limitations of using claims data for research purposes, and outlines steps for utilizing the Korea Health Insurance Review and Assessment and National Health Insurance Service databases. The study also reviews epidemiological approaches utilizing healthcare claims databases (including cross-sectional, case-control, case-crossover, and cohort designs) with respect to protocol development, analysis, and reporting of results, and introduces relevant guidelines and checklists, including the Guidelines for Good Pharmacoepidemiology Practices, the Strengthening the Reporting of Observational Studies in Epidemiology checklist, and the Risk of Bias in Nonrandomized Studies of Interventions tool.Entities:
Keywords: Bias; Epidemiology; Healthcare Administrative Claims; Research Design
Year: 2020 PMID: 32456382 PMCID: PMC7272364 DOI: 10.4082/kjfm.20.0062
Source DB: PubMed Journal: Korean J Fam Med ISSN: 2005-6443
Figure. 1.Governance of the healthcare system organization and healthcare claims databases in South Korea. HIRA, Health Insurance Review and Assessment Service; NHIS, National Health Insurance Service; NHID, National Health Information Database.
Types and contents of South Korean healthcare claims databases
| Database type | Data period | Sampling description | Size |
|---|---|---|---|
| HIRA database | Depends on data size | Total eligible Korean patients | Over 50 million people |
| HIRA-NPS | 2009–2018 | Stratified proportional sample of patients (3% of population) | 700,000 inpatients per year; approximately 400,000 outpatients per year |
| HIRA-NIS | 2009–2018 | Stratified proportional sample of patients who used inpatient services (13% of inpatients and 1% of outpatients) | 1.4 million patients overall per year |
| HIRA-APS | 2009–2018 | Annual stratified proportional sample of patients over 65 years (20%) | Approximately 1 million patients per year |
| HIRA-PPS | 2009–2018 | Annual stratified proportional sample of patients under 20 years (10%) | Approximately 1.1 million patients per year |
| NHIS-NHID | Depends on data size | Total eligible Korean population | Over 50 million people |
| NHIS-NSC | 2002–2015 | Stratified proportional sample of total eligible Korean population (2%) | Approximately 1 million people |
| NHIS-HEALS | 2002–2015 | Simple random sample of population 40 years and over (5%) | Approximately 0.51 million people |
| NHIS-senior cohort | 2002–2015 | Simple random sample of population 60 years and over (10%) | Approximately 0.55 million people |
| NHIS-FEM | 2007–2015 | Simple random sample of employed women aged 26–64 years (5%) | Approximately 0.18 million people |
| NHIS-INCHS | 2008–2015 | 5% sample of newborns by birth year between 2008 and 2012 | Approximately 0.08 million people |
HIRA, Health Insurance Review and Assessment Service; HIRA-NPS, HIRA-National Patient Sample; HIRA-NIS, HIRA-National Inpatient Sample; HIRA-APS, HIRA-Aged Population Sample; HIRA-PPS, HIRA-Pediatric Patient Sample; NHIS, National Health Insurance Service; NHIS-NHID, NHIS-National Health Information Database; NHIS-NSC, NHIS-National Sample Cohort; NHIS-HEALS, NHIS-National Health Screening Cohort; NHIS-FEM, NHIS-Female Employees; NHIS-INCHS, NHIS-Infants and Children’s Health Screening.
Databases and information available for linkage in South Korea
| Source | Database | Data period | Contents and variables[ |
|---|---|---|---|
| HIRA | HIRA database | 2007–2018 | - General specifications (billing statement identification key, age, gender, type of insurance, date of treatment, primary diagnosis, secondary diagnosis, surgery, etc.) |
| - Healthcare services (billing statement identification key, inpatient prescriptions, treatments, diagnostic tests, unit price, days of supply, etc.) | |||
| - Diagnosis (billing statement identification key, diagnostic code, department, etc.) | |||
| - Outpatient prescriptions (billing statement identification key, drug codes, unit price, days of supply, etc.) | |||
| NHIS | NHIS-NHID | 2007–2018 | - General specifications (year, age, gender, region, grade of disability, contribution amount, etc.) |
| - Health examinations - subjects (year, working type) | |||
| - Health examinations (disease history, physical activity, current medications, smoking, drinking, height, weight, blood pressure, laboratory tests, etc.) | |||
| - Medical institution (year, location, number of doctors, number of nurses, number of pharmacists, number of beds, etc.) | |||
| - Death information (death year and month) | |||
| - Cancer information (breast/colorectal/cervical/liver/gastric cancer) | |||
| - Medical examination of cancer (medical examination experience, medical history, year, family history, etc.) | |||
| NCC | KNCI DB | 2002–2016 | - Age, gender, date of diagnosis, Surveillance Epidemiology and End Results code, diagnosis code, primary cancer site, treatment, histological type, etc. |
| KCDC | KNHANES | 2007–2017 | - Age, gender, socioeconomic status, educational status, chronic disease, health status, cancer examination, cost, quality of life information, injury, height, weight, blood pressure, laboratory tests, nutritional intake, dietary supplements, nutritional knowledge, etc.) |
| KCDC | Quarantine database | 2013–2018 | - Date of quarantine, type of quarantine, site of quarantine, country of departure, transportation, number of crew, number of passengers, number of suspicious entrants, pollution, major freight |
| KCDC | KTBS system database | 2013–2018 | - Year, age, age group, gender, region, nationality, reporting public health center, reporting medical institution, date of reporting, type of tuberculosis, disease code, patient type, smear screening |
| KCDC | KoGES | 2001–2013 | - Cohort name, age, gender, chronic disease, smoking, drinking, exercise, blood pressure, height, weight, laboratory tests, etc. |
| KCDC | Immunization registry data | 2012–2018 (only NIP) | - Vaccination name, date of vaccination, medical institution, region of medical institution |
HIRA, Health Insurance Review and Assessment Service; NHIS, National Health Insurance Service; NHIS-NHID, NHIS-National Health Information Database; NCC, National Cancer Center; KNCI DB, Korea National Cancer Incidence database; KCDC, Korea Centers for Disease Control and Prevention; KNHANES, Korea National Health and Nutrition Examination Survey; KTBS, Korean Tuberculosis Surveillance; KoGES, Korean Genome and Epidemiology Study; NIP, National Immunization Program.
Information based on the Healthcare Big Data platform (https://hcdl.mohw.go.kr/BD/Portal/Enterprise/DefaultPage.bzr).
Strengths and limitations of healthcare claims databases
| Strengths | Limitations |
|---|---|
| - High generalizability for the Korean population | - Risk of confounding bias such as confounding by indication and healthy user effect |
| - Minimized risk of recall bias | - Often no measurement of potential confounders such as laboratory data, disease severity, and health behaviors |
| - Thorough cover of disease conditions | - Risk of misclassification bias (may affect internal validity) |
| - Sufficiently large sample size to retain statistical power | - Not applicable to research on healthcare services not covered by insurance |
| - Various information on healthcare utilization, diagnoses, procedures, treatment, and payments | - Insufficient information on patient adherence to treatment |
| - Relatively inexpensive to use | - Time gap between actual provision of health services and availability of the claim data for research |
| - Linkable to other databases |
Guidelines for observational studies using big data
| Guidelines | Publication year | Source | Checklist items | Link |
|---|---|---|---|---|
| Guide on Methodological Standards in Pharmacoepidemiology | 2018 (version 7) | ENCePP | Research question, study design, data sources, source and study population, definition and measurement of exposures/outcomes, bias, effect measure modification, data management, data analysis, quality control, ethical/data protection issues, communication of study results | |
| GPP | 2016 (version 4) | Public Policy Committee and International Society of Pharmacoepidemiology | Population, definition of exposures/outcomes/other risk factors, study size, statistical precision, data management, data analysis, quality assurance, quality control | |
| STROBE | 2007 (version 4) | STROBE Initiative | Introduction (background, objective), methods (study design, setting, participants, data source, bias, study size, statistical analysis), results (descriptive data, outcome, main results, other analysis), discussion (interpretation, generalizability, limitations), funding information | |
| ROBINS-I | 2016 (version 1) | Researchers, many involved with Cochrane systematic reviews | Bias related to confounding factors, selection of participants, classification of interventions, deviations from the intended interventions, missing data, measurement of outcomes, and selection of reporting results |
ENCePP, European Network of Centres for Pharmacoepidemiology and Pharmacovigilance; GPP, Guidelines on Good Pharmacoepidemiology Practices; STROBE, Strengthening the Reporting of Observational Studies in Epidemiology; ROBINS-I, Risk of Bias in Nonrandomized Studies of Interventions.