Literature DB >> 31759388

Validation of claims data to identify death among aged persons utilizing enrollment data from health insurance unions.

M Sakai1,2,3, S Ohtera4,5, T Iwao5, Y Neff4,5, G Kato6, Y Takahashi4, T Nakayama4.   

Abstract

The identification of death is critical for epidemiological research. Despite recent developments in health insurance claims databases, the quality of death information in claims is not guaranteed because health insurance claims are collected primarily for reimbursement. We aimed to examine the usefulness and limitations of death information in claims data and to examine methods for improving the quality of death information for aged persons.We used health insurance claims data and enrollment data (as the gold standard) from September 2012 through August 2015 for nondependent persons aged 65-74 years enrolled in Japanese workplace health insurance. Overall, 3,710,538 insured persons were registered in the database during the study period. We analyzed 45,441 eligible persons. Inpatient and outpatient deaths were identified from the discharge/disease status in the claims, with sensitivities of 94.3% and 47.4%, specificities of 98.5% and 99.9%, and PPVs of 96.3% and 95.7%, respectively, using enrollment data as the gold standard. For outpatients, death defined as a combination of disease status and charge data for terminal care still indicated low sensitivity (54.7%).The validity of death information in inpatient claims was high, suggesting its potential usefulness for identifying death. However, given the low sensitivity for outpatient deaths, the use of death information obtained solely from records in outpatient claims is not recommended.

Entities:  

Keywords:  Aged; Health insurance claims; Sensitivity; Specificity; Validation

Mesh:

Year:  2019        PMID: 31759388      PMCID: PMC6875027          DOI: 10.1186/s12199-019-0819-3

Source DB:  PubMed          Journal:  Environ Health Prev Med        ISSN: 1342-078X            Impact factor:   3.674


Introduction

In Japan, an ultra-aging, high-mortality society, 33.0% of the population, were ≥ 60 years old in 2017 [1]. Deceased persons in Japan comprised approximately 1,200,000 in 2014; this is estimated to increase to approximately 1,670,000 by 2040 [2]. In constructing a sustainable end-of-life care system, several challenges for research have emerged, surveying mortality and medical care practice for aged persons at the end of life [3-5]. Thus, the identification of death is critical for epidemiological research targeting aged persons. Recent developments in the health insurance claims databases of government agencies and the private sector have transformed epidemiological research in Asia-Pacific countries [6-12]. In Japan, private companies have created health insurance claims databases for research. The National Database of Health Insurance Claims and Specific Health Checkups of Japan (NDB) that contains almost 100% of the digitized health insurance claims for the entire country was also constructed. However, health insurance claims data are collected primarily for reimbursement, rather than for research following the patient prognosis. Thus, quality of death information in claims is not guaranteed. Previous study has examined the validity of death information recorded in claims using data in 2005–2009 and indicated that the sensitivity of death information in claims was low [13]. However, to date, a method to improve the validity of death information in claims has not been established. Currently, validation studies using the latest claims database are underway [14]. As part of this effort, we aimed to examine usefulness and limitations when using death information in claims databases and to examine methods for improving the quality of death information for aged persons.

Methods

Study design

This cross-sectional study validated death information from health insurance claims against that recorded in enrollment data for the health insurance union from the same month (the latter serving as a gold standard). Because the claims data used in the present study did not include information on the date of death, the data were compared by month.

Data source

Claim validation requires the linkage of claims data and other sources of highly reliable data (gold standard). However, The Japanese Ministry of Health, Labor and Welfare (MHLW) prohibits linking data from NDB to external data. Hence, we utilized workplace health insurance’s claims database which directly link health insurance claims and enrollment data at the individual patient level with high precision [15]. We used the claims database for insured persons enrolled in workplace health insurance unions that were available through a database vendor, Japan Medical Data Center Co., Ltd. This database contains monthly claims submitted to health insurance unions, particularly for those insured in Japanese health insurance unions for employees of large companies (union-managed health insurance). As of September 2015, these comprised approximately 10% of all Japanese beneficiaries. The database does not include the data of those enrolled in health insurance unions targeting medium-sized to small businesses, seamen, public employees, self-employed individuals, and those covered by the Medical Insurance System for individuals aged ≥ 75 years. The data provided the discharge status for inpatient claims, with the following potential values: “continued,” “cure,” “death,” “termination,” and “transferred.” Values reflect the status of healthcare provision: “continued” represents continued therapy; “cure” indicates that no further healthcare was needed due to complete cure or improvement; “termination” suggests that no healthcare will be provided at least for the time being; and “transferred” indicates that the patient was transferred to another hospital. The same information was also available as the disease status in the outpatient claims database. The enrollment data recorded the month and year of the loss of insurance status as well as the reason for the loss of insured status (“retired, moved away, died, term expired, insurance premiums unpaid, transition, household separated, or other”). When an insured individual loses their insured status, the employer notifies the union, and this is reflected in the enrollment data.

Study participants

We included nondependent insured persons aged 65–74 years registered in the workplace health insurance’s claims database between October 2012 and September 2015. We analyzed only nondependent insured persons to guarantee gold standard-level accuracy in death information from enrollment data. The insurance status for dependents is sometimes misclassified (in some cases, if the insured individual dies, the dependent of that insured individual is also registered as dead) and thus dependents are excluded from analysis. To conduct validation in a cross-sectional study design that would compare claims from the same month with enrollment data from the health insurance union, we excluded patients with missing information on health insurance union enrollment status (i.e., continued enrollment/loss of insured status) for the month of the most recent claim data (i.e., claims issued most recently). We also excluded any of those who lost their insured status with unknown reasons for the loss of insured status.

Validation of death information that can be obtained from claims

Claims-based definition of death

We defined claims-based definition of death as patients for whom the discharge/disease status recorded in the most recent (last issued) claims was death (Definition 1). If multiple claims were issued in the same month, all claims were examined. If at least one claim noted death as the discharge/disease status, the patient was considered deceased. As we do not intend to develop a system by which health insurance associations confirm death from claims, we examined methods for increasing the validity of death information for outpatients by combining disease status and charge data recorded in claims. If two or more house calls or home visits are made within 14 days of death, a fee for terminal care can be charged for outpatients. Fees can also be charged when death certificates are issued at a patient’s residence. We included these 2 charges for the definition of outpatient death to improve the validity. Definition 2 applied to patients for whom no claims were issued after a fee was charged for terminal care. Definition 3 applied to patients for whom no claims were issued after a death certificate was charged. Definition 4 applied to patients for whom the outpatient disease status was recorded as death or no claims were issued after a fee was charged for terminal care, or a death certificate was issued (i.e., Definition 1 or 2 or 3).

Gold standard definition of death from the enrollment data

We defined the gold standard deceased patients as those for whom the loss of the insured status due to death was recorded in the health insurance union enrollment data. Patients, whose enrollment data listed the reason for the loss of the insured status as a cause other than death or were in a “continued” status, were not regarded as gold standard deceased patients.

Defining true positives, false negatives, false positives, and true negatives

True positives were defined as cases with any claims-based definition of death (i.e., death information can be obtained from claims) and gold standard definition of death (i.e., the reason for the loss of the insured status in the enrollment data was recorded as death). False negatives were defined as cases with no claims-based definition of death but with a gold standard definition of death. False positives were defined as cases with any claims-based definition of death but not the case with the gold standard definition of death (i.e., the reason for the loss of the insured status in the enrollment data listed a cause other than death or was in the “continued” status). True negatives were defined as cases with no claims-based definition of death and no gold standard definition of death.

Statistical analysis

Sensitivity, specificity, and positive predictive value

We calculated the sensitivity, specificity, and positive predictive value (PPV) of our claims-based definitions of death for inpatients and outpatients separately. In the present validation, only claims issued in the most recent month for each patient were analyzed; thus, inpatients and outpatients were categorized based on claims issued in the most recent month. Inpatients were those who had received inpatient medical care during the most recent month of the issued claim (i.e., patients with 1+ inpatient claims issued), and inpatient claims data were analyzed. Outpatients were those who only received outpatient medical care during the most recent month of the issued claim (i.e., those for whom only outpatient claims were issued), and their outpatient claims data were analyzed. R version 3.2.4 was used for statistical analysis. Informed consent was not obtained because our study only used data that were anonymized in an unlinkable fashion (the data were anonymized using a method that does not leave a lookup table linking a patient with an assigned code or number to prevent the identification of specific patients). The study protocol was approved by Kyoto University’s research ethics committee.

Results

Patient characteristics

Overall, 3,710,538 insured persons were registered in the database from October 2012 through September 2015. We analyzed 45,441 nondependent insured persons (43,870 outpatients and 1571 inpatients), excluding 3,584,302 persons aged < 65 years, 56,130 dependents, 5743 persons with no health insurance claims, 13,370 persons with no data on health insurance union enrollment status (i.e., continued enrollment/loss of insured status), and 5552 persons with unknown reasons for losing their insured status. Sex, age, year of enrollment in the health insurance union, enrollment period, and presence/absence of insured status in the union are shown in Table 1 for all cases subjected to analysis. We analyzed nondependent insured persons to guarantee gold standard-level accuracy in death information from enrollment data. Thus, both inpatients and outpatients comprised a relatively high proportion of males. The median enrollment duration in a health insurance union was 115 and 101 months for inpatients and outpatients, respectively. Inpatients and outpatients who lost their insured status between October 2012 and September 2015 numbered 796 (50.7%) and 13,042 (29.7%), respectively. Of these, inpatients and outpatients who lost their insured status due to death numbered 473 (30.1%) and 95 (0.2%), respectively.
Table 1

Patient characteristics

Total (N = 45,441)Inpatients (N = 1571)Outpatients (N = 43,870)
N%N%N%
Sex
 Male40,84689.9150095.539,34689.7
 Female459510.1714.5452410.3
Age, years
 65–6931,85370.192458.830,92970.5
 70–7413,58829.964741.212,94129.5
Number of claims issued in the most recent month
 132,08470.6140489.430,68069.9
 2 or more13,35729.416710.613,19030.1
Year of enrollment in a health insurance union
 2000 or earlier928220.433721.5894520.4
 2001–200511,24524.750031.810,74524.5
 2006–201014,98433.048530.914,49933.0
 2011–2015993021.924915.8968122.1
Enrollment period, months (median)89115101
Union enrollment status
 Continued enrollment31,60369.577549.330,82870.3
 Loss of insured status13,83830.579650.713,04229.7
Reason for loss of insured status
 Retirement672314.817611.2654714.9
 Relocation1040.220.11020.2
 Unpaid insurance fees11302.5120.811182.5
 Other531311.71338.5518011.8
 Death5681.247330.1950.2
Patient characteristics

Validation of claims-based definition of death

Table 2 shows results of the validation of death based on claims, with health insurance enrollment data regarded as the gold standard. Regarding the definition using the information of discharge or disease status only (Definition 1, Table 2), the sensitivity, specificity, and PPV were 94.3% (446/473), 98.5% (1081/1098), and 96.3% (446/463) for inpatients, and 47.4% (45/95), 99.9% (43,773/43,775), and 95.7% (45/47) for outpatients, respectively. Among outpatients, regarding the definition using reimbursements for terminal care (Definition 2), the sensitivity, specificity, and PPV were 37.9% (36/95), 100.0% (43,775/43,775), and 100.0% (36/36), respectively. Regarding the definition using the issuance of death certificates at home (Definition 3), the sensitivity, specificity, and PPV were 9.5% (9/95), 100.0% (43,775/43,775), and 100.0% (9/9), respectively. When the reimbursement claims for the issuance of a death certificate and terminal care were combined with the disease status (Definition 4), the cases of death were identified with a sensitivity, specificity, and PPV of 54.7% (52/95), 100.0% (43,775/43,775), and 100.0% (52/52), respectively.
Table 2

Sensitivity and specificity of the claims-based definition of death

DefinitionDescriptionTotalTrue positiveFalse positiveFalse negativeTrue negativeSensitivitySpecificityPositive predictive value (PPV)
Inpatients
 1Death as discharge status15714461727108194.3[92.2–96.4]98.5[97.7–99.2]96.3[94.6–98.0]
Outpatients
 1Death as disease status43,8704525043,77347.4[37.3–57.4]99.9[99.9–100.0]95.7[90.0–100.0]
 2Terminal care at home + claims terminated43,8703605943,77537.9[28.1–47.7]100.0[100.0–100.0]100.0[100.0–100.0]
 3Death certificate + claims terminated43,870908643,7759.5[3.6–15.4]100.0[100.0–100.0]100.0[100.0–100.0]
 41 or 2 or 343,8705204343,77554.7[44.7–64.7]100.0[100.0–100.0]100.0[100.0–100.0]
Sensitivity and specificity of the claims-based definition of death

Discussion

We examined usefulness and limitations of death information in claims and methods for improving the quality of death information for aged persons. For inpatients, both the sensitivity (94.3%) and specificity (98.5%) of death information in the claims were high. Among outpatients, however, the specificity was high (99.9%), but the sensitivity was low (47.4%) (Table 2). The addition of reimbursement claims for terminal care or the issuance of a death certificate at home to the definition of death, followed by the termination of subsequent health insurance claims, still indicated low sensitivity (54.7%) (Table 2). Despite developments in health insurance claims databases, the quality of death information in claims is not guaranteed because those data are collected primarily for reimbursement. A previous validation study using the claims data in Japan between January 2005 and August 2009 reported that the sensitivity of death was limited [13]. The increase in validity among inpatients compared with the previous study may be due to policies promoting digitization, such as the obligation to submit health insurance claims online starting in 2011 [16]. Differences in inpatient characteristics between the 2 study populations (20–74 years in Ooba et al.’s study [13] and 65–74 years in our study) also likely contributed to the discrepancy in the results. On the other hand, validity among outpatients still remained low compared with the previous study [13]. Possible reasons for the low sensitivity of outpatient death are attributable to the nature of claims. Because health insurance claims data are collected primarily for reimbursement, medical institutions are not motivated to record deaths in claims, and thus, the omission of recording of death is possible. Additionally, there is no system to follow patients’ prognoses and reflect them in the claims database. Our study contributed to promote epidemiological research using claims database by increasing understanding for the limitation and usefulness of data. Specificity of death information for both inpatients and outpatients was high, and thus, the overestimation of the number of death due to misclassification of outcomes, that is, researchers misclassify survivors as decedents, is low. High sensitivity of inpatient deaths suggests the potential usefulness for identifying death. The risk of the misclassification of outcomes, that is, researchers misclassify deceased persons as being alive due to the absence of a record of death in the claims, is low. However, it should be noted that 5.7% of deceased persons are possibly misclassified. Given the low sensitivity for outpatient deaths, there is a distinct limitation for identifying death from claims data. Although we also examined methods for increasing the sensitivity of death information by combining disease status and charge data recorded in claims (when death was defined solely by disease status or together with charge records of terminal care), the sensitivity for identifying deaths is still limited. Currently, we recommend not using death information obtained solely from records in outpatient claims. These findings should be known among researchers and health insurance societies when they use claims data. This study possessed some limitations. Regarding the generalization of our results, the target population comprised nondependent insured persons aged 65–74 years enrolled in workplace health insurance; thus, our results may not be applicable to all aged persons. Moreover, subjects were enrolled in health insurance unions insuring a fraction of all large companies. We did not include subjects enrolled in health insurance unions targeting medium-sized to small businesses, seamen, public employees, self-employed individuals, and individuals covered by the Medical Insurance System for individuals ≥ 75 years. Finally, although the PPV depends on the prevalence (i.e., prior probability of death) of the study population, we know little about the prevalence of our study population. Therefore, our findings cannot be applied to the claims data of all aged persons. Nevertheless, the claims database used in the present study represents the best available current data because validation to identify death in aged persons could be performed by the direct linkage of health insurance claims and health insurance enrollment data.

Conclusions

We examined usefulness and limitations of using death information in claims databases and examined methods to improve the quality of death information for aged persons. High sensitivity and specificity of death information in inpatient claims suggested the potential utility of identifying death. However, given the low sensitivity for outpatient deaths, the use of death information obtained solely from records in outpatient claims is not recommended.
  12 in total

1.  Nationwide Population Science: Lessons From the Taiwan National Health Insurance Research Database.

Authors:  Ann W Hsing; John P A Ioannidis
Journal:  JAMA Intern Med       Date:  2015-09       Impact factor: 21.873

2.  Brief Report: Databases in the Asia-Pacific Region: The Potential for a Distributed Network Approach.

Authors:  Edward Chia-Cheng Lai; Kenneth K C Man; Nathorn Chaiyakunapruk; Ching-Lan Cheng; Hsu-Chih Chien; Celine S L Chui; Piyameth Dilokthornsakul; N Chantelle Hardy; Cheng-Yang Hsieh; Chung Y Hsu; Kiyoshi Kubota; Tzu-Chieh Lin; Yanfang Liu; Byung Joo Park; Nicole Pratt; Elizabeth E Roughead; Ju-Young Shin; Sawaeng Watcharathanakij; Jin Wen; Ian C K Wong; Yea-Huei Kao Yang; Yinghong Zhang; Soko Setoguchi
Journal:  Epidemiology       Date:  2015-11       Impact factor: 4.822

3.  The Asian Pharmacoepidemiology Network (AsPEN): promoting multi-national collaboration for pharmacoepidemiologic research in Asia.

Authors:  Morten Andersen; Ulf Bergman; Nam-Kyong Choi; Tobias Gerhard; Cecilia Huang; Jessica Jalbert; Michio Kimura; Tomomi Kimura; Kiyoshi Kubota; Edward Chia-Cheng Lai; Nobuhiro Ooba; Byung-Joo Park; Nicole Pratt; Elizabeth E Roughead; Tsugumichi Sato; Soko Setoguchi; Ju-Young Shin; Anders Sundström; Yea-Huei Kao Yang
Journal:  Pharmacoepidemiol Drug Saf       Date:  2013-05-08       Impact factor: 2.890

4.  Intensity of Care at the End of Life Among Older Adults in Korea.

Authors:  Su Hyun Kim; Sangwook Kang; Mi-Kyung Song
Journal:  J Palliat Care       Date:  2018-01-24       Impact factor: 2.250

5.  Claims-based definition of death in Japanese claims database: validity and implications.

Authors:  Nobuhiro Ooba; Soko Setoguchi; Takashi Ando; Tsugumichi Sato; Takuhiro Yamaguchi; Mayumi Mochizuki; Kiyoshi Kubota
Journal:  PLoS One       Date:  2013-05-31       Impact factor: 3.240

6.  Development of a database of health insurance claims: standardization of disease classifications and anonymous record linkage.

Authors:  Shinya Kimura; Toshihiko Sato; Shunya Ikeda; Mitsuhiko Noda; Takeo Nakayama
Journal:  J Epidemiol       Date:  2010-08-07       Impact factor: 3.211

Review 7.  Towards Actualizing the Value Potential of Korea Health Insurance Review and Assessment (HIRA) Data as a Resource for Health Research: Strengths, Limitations, Applications, and Strategies for Optimal Use of HIRA Data.

Authors:  Jee Ae Kim; Seokjun Yoon; Log Young Kim; Dong Sook Kim
Journal:  J Korean Med Sci       Date:  2017-05       Impact factor: 2.153

8.  Analysis of the evidence-practice gap to facilitate proper medical care for the elderly: investigation, using databases, of utilization measures for National Database of Health Insurance Claims and Specific Health Checkups of Japan (NDB).

Authors:  Takeo Nakayama; Yuichi Imanaka; Yasushi Okuno; Genta Kato; Tomohiro Kuroda; Rei Goto; Shiro Tanaka; Hiroshi Tamura; Shunichi Fukuhara; Shingo Fukuma; Manabu Muto; Motoko Yanagita; Yosuke Yamamoto
Journal:  Environ Health Prev Med       Date:  2017-06-06       Impact factor: 3.674

9.  Health-care data collecting, sharing, and using in Thailand, China mainland, South Korea, Taiwan, Japan, and Malaysia.

Authors:  Syed Mohamed Aljunid; Samrit Srithamrongsawat; Wen Chen; Seung Jin Bae; Raoh-Fang Pwu; Shunya Ikeda; Ling Xu
Journal:  Value Health       Date:  2012 Jan-Feb       Impact factor: 5.725

10.  A review of accessibility of administrative healthcare databases in the Asia-Pacific region.

Authors:  Dominique Milea; Soraya Azmi; Praveen Reginald; Patrice Verpillat; Clement Francois
Journal:  J Mark Access Health Policy       Date:  2015-07-20
View more
  12 in total

1.  Elucidating variations in outcomes among older end-stage renal disease patients on hemodialysis in Fukuoka Prefecture, Japan.

Authors:  Aziz Jamal; Akira Babazono; Yunfei Li; Takako Fujita; Shinichiro Yoshida; Sung A Kim
Journal:  PLoS One       Date:  2021-05-25       Impact factor: 3.240

2.  Accuracy of algorithms to identify patients with a diagnosis of major cancers and cancer-related adverse events in an administrative database: a validation study in an acute care hospital in Japan.

Authors:  Takashi Fujiwara; Takashi Kanemitsu; Kosei Tajima; Akinori Yuri; Masahiro Iwasaku; Yasuyuki Okumura; Hironobu Tokumasu
Journal:  BMJ Open       Date:  2022-07-13       Impact factor: 3.006

3.  Data resource profile: JMDC claims database sourced from health insurance societies.

Authors:  Katsuhiko Nagai; Takashi Tanaka; Norihisa Kodaira; Shinya Kimura; Yoshimitsu Takahashi; Takeo Nakayama
Journal:  J Gen Fam Med       Date:  2021-02-14

Review 4.  Current Status, Challenges, and Future Perspectives of Real-World Data and Real-World Evidence in Japan.

Authors:  Katsutoshi Hiramatsu; Annabel Barrett; Yasuhiko Miyata
Journal:  Drugs Real World Outcomes       Date:  2021-06-19

5.  Associations of kidney tests at medical facilities and health checkups with incidence of end-stage kidney disease: a retrospective cohort study.

Authors:  Ryuichi Yoshimura; Ryohei Yamamoto; Maki Shinzawa; Rie Kataoka; Mina Ahn; Nami Ikeguchi; Natsuki Wakida; Hiroshi Toki; Toshiki Moriyama
Journal:  Sci Rep       Date:  2021-10-26       Impact factor: 4.379

6.  Medical expenditures for fragility hip fracture in Japan: a study using the nationwide health insurance claims database.

Authors:  Takahiro Mori; Jun Komiyama; Tomoko Fujii; Masaru Sanuki; Keitaro Kume; Genta Kato; Yukiko Mori; Hiroaki Ueshima; Hiroki Matsui; Nanako Tamiya; Takehiro Sugiyama
Journal:  Arch Osteoporos       Date:  2022-04-11       Impact factor: 2.617

7.  Reduction in inequalities in health insurance coverage and healthcare utilization among older adults in the Philippines after mandatory national health insurance coverage: trend analysis for 2003-2017.

Authors:  Kathryn Lizbeth Lucena Siongco; Keiko Nakamura; Kaoruko Seino
Journal:  Environ Health Prev Med       Date:  2020-06-09       Impact factor: 3.674

8.  Decreased Administration of Life-Sustaining Treatment just before Death among Older Inpatients in Japan: A Time-Trend Analysis from 2012 through 2014 Based on a Nationally Representative Sample.

Authors:  Michi Sakai; Shosuke Ohtera; Tomohide Iwao; Yukiko Neff; Tomoe Uchida; Yoshimitsu Takahashi; Genta Kato; Tomohiro Kuroda; Shuzo Nishimura; Takeo Nakayama
Journal:  Int J Environ Res Public Health       Date:  2021-03-18       Impact factor: 3.390

9.  Validity of claims-based algorithms for selected cancers in Japan: Results from the VALIDATE-J study.

Authors:  Cynthia de Luise; Naonobu Sugiyama; Toshitaka Morishima; Takakazu Higuchi; Kayoko Katayama; Sho Nakamura; Haoqian Chen; Edward Nonnenmacher; Ryota Hase; Sadao Jinno; Mitsuyo Kinjo; Daisuke Suzuki; Yoshiya Tanaka; Soko Setoguchi
Journal:  Pharmacoepidemiol Drug Saf       Date:  2021-06-01       Impact factor: 2.890

10.  Association between pre-admission anticoagulation and in-hospital death, venous thromboembolism, and major bleeding among hospitalized COVID-19 patients in Japan.

Authors:  Motohiko Adomi; Toshiki Kuno; Jun Komiyama; Yuta Taniguchi; Toshikazu Abe; Atsushi Miyawaki; Shinobu Imai; Kojiro Morita; Makoto Saito; Hiroyuki Ohbe; Tadashi Kamio; Nanako Tamiya; Masao Iwagami
Journal:  Pharmacoepidemiol Drug Saf       Date:  2022-04-07       Impact factor: 2.732

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.