| Literature DB >> 35656154 |
Dae-Sung Kyoung1, Hun-Sung Kim2,3.
Abstract
Almost every Korean (97%) is enrolled in the National Health Insurance program, and most receive medical treatment at least once a year. Data are collected by the Health Insurance Review and Assessment Service (HIRA), and the results of the review are sent to the National Health Insurance Service (NHIS). The data handled by NHIS and HIRA cover almost the entire population and can be used for various research purposes. NHIS and HIRA support research by making these data available to researchers. The greatest advantage of these data is that they are the only data which include virtually the entire population. Both HIRA and NHIS data are provided in the form of sample data and all (customized) data. NHIS and HIRA data are similar but exhibit minor differences. HIRA data consists of five tables, including general specification details, in-hospital treatment details, disease details, out-of-hospital prescription details, and nursing institution information. NHIS data include death records (including cause of death), some medical examination records, and the socio-economic variables of the subject, such as income, in addition to all the HIRA data. Clinical results of treatments are not recorded in NHIS or HIRA. However, because public data are used for billing purposes, actual research has thus far been limited. Therefore, researchers must develop a study design that can minimize the errors or bias occurring during the course of the study. Therefore, it is necessary to clearly understand the structure and characteristics of NHIS and HIRA data when initiating research.Entities:
Keywords: Big data; Cohort study; Data science
Year: 2021 PMID: 35656154 PMCID: PMC9133780 DOI: 10.12997/jla.2022.11.2.103
Source DB: PubMed Journal: J Lipid Atheroscler ISSN: 2287-2892
Claims data versus EMR data
| Claims data | EMR data | |
|---|---|---|
| Example | NHIS data, HIRA data | Medical chart data |
| Subject | Includes all medical use of the total national population | If a patient visits a hospital other than a specific hospital, tracking is not possible. |
| Data Structure | Consists of the same data structure | The data structure of each hospital is different |
| DQM | No DQM required | DQM required |
| Purpose of data accumulation | - Not “medical” data; rather this data is a bill for medical treatment. | - Detailed records of events occurring during the course of patient care. |
| - Not data from a medical perspective in nature. | ||
| - Data for billing to the country. | ||
| Analysis of medical practice | There is a limit to the interpretation of the results of medical practice. | All analysis from a medical perspective such as patient symptoms, examination, diagnosis, treatment, and prognosis is possible. |
| Data characteristics | - Only insured health benefits item related data is recorded. | - Any data related to the insured/non-insured health benefits item is recorded. |
| - It is impossible to analyze non-insured health benefits items. | - “Medical” data based on actual clinical practice. | |
| - In many cases, the “outcome” of treatment is not reflected. | - All test results and treatment results are reflected. | |
| - Most of the “test results” do not require description (test results). | - In some cases, there is no indication for actions taken. | |
| - There are reasons for specific medical actions that were taken (ICD-10 code, etc.). |
EMR, electronic medical records; NHIS, National Health Insurance Service; HIRA, Health Insurance Review and Assessment Service; DQM, data quality management; ICD, International Classification of Diseases.
NHIS data versus HIRA data
| NHIS data | HIRA data | ||
|---|---|---|---|
| SAMPLE data§ | |||
| Components | 2% of the total population and 4 other types | 3% of the total population and 3 other types | |
| Acquisition (purchase) of sample data | IRB-approved research protocol is required | No research protocol required* | |
| Ownership | No ownership of data | Provide ownership of data | |
| Type of preferred study | Longitudinal study preferred | Cross-sectional data preferred | |
| ALL (customized) data‖ | |||
| Components | Provides all national data that meet the conditions requested by the researcher | ||
| Ownership | No ownership of data | ||
| Death records | Includes death records (except cause of death) | Cannot be confirmed† | |
| Center visit | Researchers must visit the center for use | Researchers must visit the center for use‡ | |
Both NHIS and HIRA provide “ALL (customized)” and “SAMPLE” data. For “ALL (customized)” data, neither NHIS nor HIRA provide ownership of the data (direct visit to the center or remote access) HIRA's “SAMPLE” data is the only data that can be directly owned and processed by medical personnel or companies.
NHIS, National Health Insurance Service; HIRA, Health Insurance Review and Assessment Service; IRB, Institutional Review Board.
*Acquisition (purchase) of sample data provided by HIRA does not require research protocol or IRB approval. However, when conducting research using sample data, researchers can start the research by obtaining the research protocol and IRB approval; †In the case of death in the hospital, death information can be checked only when the medical treatment termination code is checked as death in the HIRA data. However, not all people die in the hospital, and even if they die in the hospital, there are cases where the treatment termination code is omitted; therefore, it is not possible to check death record with 100% accuracy; ‡Basically, remote access to HIRA is possible, but in some cases, remote access is restricted when the amount of data is too large or when a private company participates in the research and conducts analysis; §Only some patients are represented as a sample; ‖All the eligible citizens of the country.
Fig. 1Selection of appropriate claims data according to research purpose.
NHIS, National Health Insurance Service; HIRA, Health Insurance Review and Assessment Service.