| Literature DB >> 28378543 |
Jee Ae Kim1, Seokjun Yoon2, Log Young Kim1, Dong Sook Kim3.
Abstract
Health Insurance and Review Assessment (HIRA) in South Korea, also called National Health Insurance (NHI) data, is a repository of claims data collected in the process of reimbursing healthcare providers. Under the universal coverage system, having fee-for-services covering all citizens in South Korea, HIRA contains comprehensive and rich information pertaining to healthcare services such as treatments, pharmaceuticals, procedures, and diagnoses for almost 50 million beneficiaries. This corpus of HIRA data, which constitutes a large repository of data in the healthcare sector, has enormous potential to create value in several ways: enhancing the efficiency of the healthcare delivery system without compromising quality of care; adding supporting evidence for a given intervention; and providing the information needed to prevent (or monitor) adverse events. In order to actualize this potential, HIRA data need to actively be utilized for research. Thus understanding this data would greatly enhance this potential. We introduce HIRA data as an important source for health research and provide guidelines for researchers who are currently utilizing HIRA, or interested in doing so, to answer their research questions. We present the characteristics and structure of HIRA data. We discuss strengths and limitations that should be considered in conducting research with HIRA data and suggest strategies for optimal utilization of HIRA data by reviewing published research using HIRA data.Entities:
Keywords: Claims Data; HIRA Data; Health Insurance Review and Assessment Service; Health Research; Healthcare Services; Korea; National Health Insurance
Mesh:
Year: 2017 PMID: 28378543 PMCID: PMC5383602 DOI: 10.3346/jkms.2017.32.5.718
Source DB: PubMed Journal: J Korean Med Sci ISSN: 1011-8934 Impact factor: 2.153
NHI program from 2010
| Parameters | Reported numbers by year | ||||
|---|---|---|---|---|---|
| 2010 | 2011 | 2012 | 2013 | 2014 | |
| Total population, No. (unit: 1,000) | 49,410 | 49,779 | 50,004 | 50,220 | 50,424 |
| Beneficiaries, No. (unit: 1,000) | 50,581 | 50,909 | 51,169 | 51,448 | 51,757 |
| Health insurance | 48,907 | 49,299 | 49,662 | 49,990 | 50,316 |
| Medical aid | 1,674 | 1,609 | 1,507 | 1,459 | 1,441 |
| Coverage rate, % (beneficiaries/total population) | 102.3 | 102.3 | 102.3 | 102.4 | 102.6 |
| No. of claims (unit: 1,000) | 1,307,823 | 1,327,233 | 1,420,857 | 1,418,710 | 1,453,776 |
| Inpatient | 12,491 | 13,201 | 14,338 | 15,512 | 17,491 |
| Outpatient | 1,295,332 | 1,314,032 | 1,406,519 | 1,403,199 | 1,436,285 |
| No. of providers | 81,681 | 82,948 | 83,811 | 84,971 | 86,629 |
| Tertiary hospitals | 44 | 44 | 44 | 43 | 43 |
| General hospitals | 274 | 275 | 278 | 281 | 287 |
| Hospitals | 2,182 | 2,363 | 2,524 | 2,683 | 2,811 |
| Clinics | 27,469 | 27,837 | 28,033 | 28,328 | 28,883 |
| Community health centers | 3,515 | 3,508 | 3,502 | 3,504 | 3,516 |
| Oriental clinics | 12,229 | 12,585 | 12,906 | 13,312 | 13,654 |
| Dental clinics | 14,872 | 15,257 | 15,566 | 15,930 | 16,377 |
| Pharmaceuticals | 21,096 | 21,079 | 20,958 | 20,890 | 21,058 |
NHI = National Health Insurance.
Fig. 1Flow of data generation of HIRA data.
HIRA = Health Insurance and Review Assessment, DW = data warehouse.
List of files with variables (selective)
| Files | Variables | Common variables |
|---|---|---|
| General information | - Beneficiary ID, age, gender, insurance number, type of insurance, date of review, provider ID, indicators for inpatients/outpatients, indicators for types of providers | Billing statement code, date & year of receipts |
| - Operation related to primary diagnosis | ||
| - Specialty | ||
| - Dates of treatment, dates of dispensation | ||
| - Primary diagnosis, secondary diagnosis, surgery, area of provider's practice | ||
| - No. of days undergoing care, first visit to a physician, dates of encounter, date of admission, date of discharge | ||
| - No. of days of supply for prescriptions, quantity of prescriptions, special codes for different out-of-pocket costs | ||
| Healthcare services | - Procedures, inpatient prescriptions, diagnostic tests, treatments | |
| - Operation, injection, and examination | ||
| - Unit price, quantity per day, days of supply, etc | ||
| Diagnosis | All diagnoses | |
| Outpatient prescription | Quantity per time, quantity per day, days of supply, drug code, unit price, amount, date of prescription | |
| Drug master | Drug code, date of starting (and terminating) coverage, drug name, unit, manufacturer, channel of administration, coverage, unit price, etc | |
| Providers information | Provider ID, location, zip code, name of providers, types of provider, address, date of open, no. of business, no. of beds, etc |
ID = identification.
Fig. 2Linkage of files.
ID = identification.
Research areas using HIRA (or NHI) data
| Class | Reference | Linkage to other data | Derived variables | Working definitions | Methods |
|---|---|---|---|---|---|
| Adherence and persistence, prescribing pattern | ( | No | Persistency | Yes | Kaplan-Meier survival analysis |
| ( | No | Persistency | Yes | Cox proportional hazard | |
| ( | No | MPR | Yes | Multivariate logistic regression analysis | |
| ( | No | CMA | Yes | Multiple logistic regression analysis | |
| ( | No | CMA | Yes | Multiple logistic regression analysis | |
| ( | No | MPR | Yes | Cox proportional hazard | |
| Healthcare utilization | ( | NHIC — Medicaid-aid case management center | No | No | Logistic regression analysis (multivariate) |
| ( | No | No | No | Multivariate logistic regression analysis | |
| ( | Suicides were identified by the investigator's note from National Police Agency | No | No | Multiple logistic regression analysis, repeated-measure data analysis (proc mixed procedure) | |
| ( | No | Crude surgery rate | Yes | Poisson regression model | |
| ( | No | No | No | Multivariate logistic regression analysis | |
| ( | No | No | No | Multilevel analysis (linear mixed models with random intercept) | |
| ( | No | No | No | Two-level random effect logistic regression model | |
| ( | No | No | No | Multiple regression analysis | |
| ( | No | No | No | Multiple logistic regression analysis | |
| ( | KNHANES II | No | No | Multivariate logistic regression analysis | |
| ( | No | CCI | No | Multiple logistic regression model | |
| ( | No | No | No | Descriptive summary of statistics | |
| ( | No | No | No | Descriptive summary of statistics | |
| ( | No | No | No | Multiple regression analysis | |
| ( | No | CCI | No | Descriptive summary of statistics | |
| ( | No | No | No | Descriptive summary of statistics | |
| ( | No | No | No | Multiple regression analysis | |
| ( | No | No | No | Multiple regression analysis | |
| Burden of disease | ( | No | No | No | Descriptive summary of statistics |
| Incidence or prevalence | ( | No | No | No | Descriptive summary of statistics |
| ( | No | No | No | Descriptive summary of statistics | |
| ( | No | No | No | Descriptive summary of statistics | |
| ( | No | No | No | Descriptive summary of statistics | |
| ( | No | No | No | Descriptive summary of statistics | |
| ( | No | No | No | Life table method | |
| ( | No | No | No | Descriptive summary of statistics | |
| ( | No | No | No | Descriptive summary of statistics | |
| ( | No | No | No | Descriptive summary of statistics | |
| Outcomes | ( | No | No | Yes | PSM/Cox proportional hazard |
| ( | No | Charlson score | Yes | Descriptive summary of statistics/multivariate regression | |
| Adverse event | ( | No | Charlson score | No | PSM |
| Policy evaluation | ( | No | No | No | Adjusted rate |
| ( | No | No | No | - | |
| ( | No | No | No | Descriptive summary | |
| Health informatics and others | ( | Population data from Statistics Korea | Lengthiness index, costliness index, etc | No | Clustering, decision tree, stratification |
| ( | National Cancer Center | Charlson score | No | Multivariate analysis | |
| ( | Statistics Korea | No | No | Multivariate logistic analyses |
HIRA data is also called NHI data; The list is selective and not mutually exclusive because some fall into multiple categories.
HIRA = Health Insurance and Review Assessment, NHI = National Health Insurance, NHIC = National Health Information Center, MPR = medication possession ratio, CMA = cumulative medication adherence, KNHANES = Korean National Health and Nutritional Survey, CCI = Charlson Comorbidity Index, PSM = propensity score matching.