Literature DB >> 28142051

Validity of diagnoses, procedures, and laboratory data in Japanese administrative data.

Hayato Yamana1, Mutsuko Moriwaki2, Hiromasa Horiguchi3, Mariko Kodan4, Kiyohide Fushimi5, Hideo Yasunaga6.   

Abstract

BACKGROUND: Validation of recorded data is a prerequisite for studies that utilize administrative databases. The present study evaluated the validity of diagnoses and procedure records in the Japanese Diagnosis Procedure Combination (DPC) data, along with laboratory test results in the newly-introduced Standardized Structured Medical Record Information Exchange (SS-MIX) data.
METHODS: Between November 2015 and February 2016, we conducted chart reviews of 315 patients hospitalized between April 2014 and March 2015 in four middle-sized acute-care hospitals in Shizuoka, Kochi, Fukuoka, and Saga Prefectures and used them as reference standards. The sensitivity and specificity of DPC data in identifying 16 diseases and 10 common procedures were identified. The accuracy of SS-MIX data for 13 laboratory test results was also examined.
RESULTS: The specificity of diagnoses in the DPC data exceeded 96%, while the sensitivity was below 50% for seven diseases and variable across diseases. When limited to primary diagnoses, the sensitivity and specificity were 78.9% and 93.2%, respectively. The sensitivity of procedure records exceeded 90% for six procedures, and the specificity exceeded 90% for nine procedures. Agreement between the SS-MIX data and the chart reviews was above 95% for all 13 items.
CONCLUSION: The validity of diagnoses and procedure records in the DPC data and laboratory results in the SS-MIX data was high in general, supporting their use in future studies.
Copyright © 2017 The Authors. Production and hosting by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Administrative data; Diagnosis; Laboratory data; Procedure; Validation

Mesh:

Year:  2017        PMID: 28142051      PMCID: PMC5602797          DOI: 10.1016/j.je.2016.09.009

Source DB:  PubMed          Journal:  J Epidemiol        ISSN: 0917-5040            Impact factor:   3.211


Introduction

Administrative databases are widely used in medical research studies.1, 2, 3, 4 Their large sample size, population representativeness, and clinical information enables large-scale studies to be conducted inexpensively.5, 6 However, the use of administrative databases for research is based on an assumption that databases convey reasonably accurate data for health status and service utilization information. Because the use of inaccurate data could produce biased results,6, 7 validation of the information recorded in administrative databases is essential. In previous validation studies, comorbidities were recorded with high specificity, but their sensitivities were low and variable across different diseases.8, 9, 10, 11, 12, 13 Studies have also shown that, despite accurate recording of major procedures, such as surgeries and invasive examinations, minor procedures not related to reimbursements were often under-reported.14, 15, 16 However, literature on validation studies is sparse compared with the widespread application of databases, and administrative database research has often used non-validated diagnostic or procedural codes. The Japanese Diagnosis Procedure Combination (DPC) database has been widely used in clinical epidemiology studies.18, 19, 20, 21 The DPC data are unique in that distinctions are made between main diagnosis, comorbidities, and complications, and unlimited numbers of procedures and medications can be recorded. In addition, the National Hospital Organization (NHO) introduced the Standardized Structured Medical Record Information Exchange (SS-MIX) standardized storage to its hospitals, enabling daily laboratory data to be recorded. However, there have been no validation studies for either the DPC or SS-MIX data. The aim of the present study was to evaluate the validity of diagnoses, procedures, and laboratory results recorded as DPC and SS-MIX data. We conducted a multicenter validation study in NHO hospitals using chart review results as reference standards.

Methods

Data source

In Japan, the DPC-based lump-sum payment system was introduced in acute-care hospitals nationwide from 2003. The DPC data used for payments include patient demographics and selected clinical information, admission and discharge statuses, diagnoses, surgeries and procedures performed, medications, and special reimbursements for specific conditions. Diagnoses are recorded using International Classification of Diseases, Tenth Revision (ICD-10) codes by the attending physicians. Suspected diagnoses are allowed to be recorded, but are designated as such. There are six categories of diagnoses, each with a limited number of recordable diseases. One diagnosis each is coded for “main diagnosis”, “admission-precipitating diagnosis”, “most resource-consuming diagnosis”, and “second most resource-consuming diagnosis”. A maximum of four diagnoses each can be coded for “comorbidities present at time of admission” and “conditions arising after admission”. All procedures performed during hospitalization are recorded according to the Japanese fee schedule for reimbursement. The NHO was established in 2004 to take over the management of the national hospitals. As of October 2014, there were 143 hospitals nationwide run by the NHO, including both general acute-care hospitals and specialized long-term-care hospitals. Fifty-four hospitals from 35 prefectures had implemented the DPC-based payment system, and the mean number of acute-care beds in these 54 hospitals was 410 (range, 135–730). All NHO hospitals provide administrative claims data to the Medical Information Analysis (MIA) databank, which is managed by the Clinical Research Center at NHO Headquarters. In NHO hospitals with implementation of the DPC-based payment system, the DPC data are also stored in the MIA databank. In addition, the NHO preliminarily introduced the SS-MIX standardized storage to its hospitals in 2013. The SS-MIX storage enables medical chart information from different vendors, including daily laboratory data, to be recorded in a standardized manner. In the SS-MIX storage, laboratory data are specified using JLAC-10 codes. The flow of data is shown in Fig. 1.
Fig. 1

Flow of data in the National Hospital Organization. DPC, Diagnosis Procedure Combination; MIA, Medical Information Analysis; NHO, National Hospital Organization; SS-MIX, Standardized Structured Medical Record Information Exchange.

Flow of data in the National Hospital Organization. DPC, Diagnosis Procedure Combination; MIA, Medical Information Analysis; NHO, National Hospital Organization; SS-MIX, Standardized Structured Medical Record Information Exchange. We conducted the present study on patients admitted to four acute-care NHO hospitals with implementation of both the DPC-based payment system and SS-MIX storage. The four hospitals were a 380-bed hospital in Shizuoka Prefecture, a 280-bed hospital in Kochi Prefecture, a 380-bed hospital in Fukuoka Prefecture, and a 420-bed hospital in Saga Prefecture. Laboratory data collected from the SS-MIX storage at each hospital and DPC data extracted from the MIA databank were compared with chart review results.

Study population and variables

Among the patients aged ≥18 years on admission who were eligible for the DPC-based payment system and admitted and discharged between April 1, 2014 and March 31, 2015, we randomly selected 100 patients from each hospital, aiming to conduct 400 chart reviews in total. The items examined in the study are presented in Table 1. The 17 diagnoses were diseases used for calculating the Charlson comorbidity index (CCI),25, 26, 27 which is widely used for risk adjustment in administrative database research studies. We also evaluated whether 10 specific procedures were performed on the day of admission. These procedures were selected from those used to calculate another index of severity, as identified by a previous study. We excluded blood examinations and drug use, as well as procedures that were rarely conducted (<5% among the 400 patients by searching the MIA databank before the chart reviews). The authors did not see the frequencies of the remaining 10 procedures until the chart reviews were complete. In addition, we examined the data of 13 laboratory tests performed on the day of admission. When multiple tests of the same item were conducted on the admission day, we selected the earliest one.
Table 1

Items examined in the study.

Diagnoses
 Myocardial infarction
 Congestive heart failure
 Peripheral vascular disease
 Cerebrovascular disease
 Dementia
 Chronic pulmonary disease
 Rheumatic disease
 Peptic ulcer disease
 Mild liver disease
 Diabetes without chronic complications
 Diabetes with chronic complications
 Hemiplegia or paraplegia
 Renal disease
 Any malignancy, including leukemia and lymphoma, except malignant neoplasm of skin
 Moderate or severe liver disease
 Metastatic solid tumor
 AIDS/HIV
Procedures [codes]
 Urine tests (general) [D000]
 Urine microscopy [D002, D002-2]
 Bacterial microscopy [D017]
 Bacterial culture [D018]
 Heart rate/respiration monitoring [D220]
 Pulse oximetry [D223]
 Radiography [E002]
 Computed tomography scan [E200]
 Peripheral intravenous infusion [G001]
 Urinary catheter insertion [J063]
Laboratory data
 White blood cell count
 Platelets
 Hemoglobin
 Prothrombin time international normalized ratio
 Na
 Aspartate transaminase
 Total bilirubin
 Creatinine
 Total cholesterol
 C-reactive protein
 Glucose
 Hemoglobin A1c
 Brain natriuretic peptide

AIDS, acquired immunodeficiency syndrome; HIV, human immunodeficiency virus.

Items examined in the study. AIDS, acquired immunodeficiency syndrome; HIV, human immunodeficiency virus.

Chart reviews

We conducted chart reviews in the four hospitals from November 2015 through February 2016. Two authors (HY1 and MM) independently conducted chart reviews of the cases and identified whether patients had each of the 17 Charlson diseases, either as a primary diagnosis or a comorbidity present on admission. When chronic complications of diabetes were not documented, diabetes was classified as “diabetes without chronic complications”. The reviewers also identified whether each of the 10 procedures were performed on patients on the day of admission. When discrepancies arose, the two reviewers re-reviewed the charts and resolved the discrepancies through discussion. The inter-reviewer agreements for the 17 diseases before discussion in the chart reviews were evaluated using kappa coefficients and categorized as near-perfect (0.81–1.00), substantial (0.61–0.80), moderate (0.41–0.60), fair (0.21–0.40), and poor (0.00–0.20). Laboratory data were obtained by one author (MK) in one hospital and an assistant in three hospitals. The chief investigator (HY1) was consulted when questions arose about which laboratory data to use. The review process took 10–15 min per patient on average.

Data extraction from databases

Patient demographics, diagnoses, and outcomes were extracted from the MIA databank. The 17 Charlson diseases were identified using the coding algorithms of Quan et al. A diagnosis was considered primary when it appeared as “main diagnosis” or “admission-precipitating diagnosis” and as a comorbidity when it appeared in “comorbidities present on admission”. Suspected diagnoses were excluded. In addition to the diagnoses, we searched for metastatic and recurrent malignancies using the TNM classification and primary/recurrent malignancy information recorded as clinical information. Records of procedures performed on the day of admission were extracted from the MIA databank using the codes presented in Table 1.

Statistical analysis

The frequencies of the 17 Charlson diseases (as either a primary disease or a comorbidity) were identified by the chart reviews and DPC data. The sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of the DPC data were calculated, accepting the chart review results as the reference standard. In addition to the original diagnoses, we combined the following diagnoses and assessed the resulting validity: diabetes (with or without chronic complications) and liver disease (mild, moderate, or severe). We also tested the validity of identifying metastatic tumors using clinical information in the DPC data (M1 in the TNM classification or recurrent cancer). The CCI values were calculated from the 17 diagnoses using the newly validated weight assignment by Quan et al, and the changes in the CCIs obtained using the different data sources were assessed. The overall false-positive and false-negative cases for the 17 Charlson diseases in this study were defined as those with one or more false-positive diagnoses and one or more false-negative diagnoses, respectively. To evaluate the effect of recordable number of comorbidities in the DPC data, we categorized patients by the number of comorbidities recorded in the DPC data (0–3 vs. 4) and compared the overall false-positive and false-negative rates between the two groups. Furthermore, we evaluated the validity of the primary diagnosis in the DPC data for identifying the Charlson diseases. Here, sensitivity was defined as the proportion of patients with one of the 17 Charlson diseases as the primary diagnosis by chart review who had the same diagnosis recorded in the DPC data. Specificity was defined as the proportion of patients without any of the 17 diseases as the primary diagnosis who also had none of these diseases recorded as the primary diagnosis in the DPC data. The sensitivity, specificity, PPV, and NPV of the procedure records in the DPC data were calculated in a similar manner to the diagnoses, placing the chart review results as the reference standard. Laboratory data in the SS-MIX data were considered accurate if the value recorded in the SS-MIX data equaled that in the chart review, or if the SS-MIX data correctly identified that no testing of the item was conducted on the admission day. Comparisons of categorical variables were conducted using the chi-square test, and a two-sided P value of <0.05 was considered significant. Statistical analyses were performed using IBM SPSS for Windows, version 21.0 (IBM Corp., Armonk, NY, USA).

Standard protocol approvals, registrations, and patient consent

The study protocol was approved by the Central Ethics Review Board of the NHO, which deemed that written informed consent from participants was unnecessary. An announcement about the study and the possibility for participants to opt out of the study was made through the Internet.

Results

Overview and patient characteristics

We were able to conduct 80 chart reviews in three hospitals and 75 chart reviews in a fourth hospital (315 patients analyzed in total) due to time constraints. Cultures for methicillin-resistant Staphylococcus aureus were performed in most patients in one hospital, and vital signs could not be obtained from the charts of another hospital. Therefore, we excluded the former hospital from analyses of bacterial culture, and the latter hospital from analyses of heart rate/respiration monitoring and pulse oximetry. No patients had human immunodeficiency virus infection. The mean age of the 315 patients was 66.8 (standard deviation [SD], 16.7) years, 183 (58.1%) were male, and 15 patients (4.8%) died during hospitalization. The inter-reviewer agreements for diagnoses before discussion by the reviewers are presented in Table 2. Agreement was near-perfect for six diagnoses, substantial for seven, moderate for two, and fair for one. Table 2 also presents the frequencies of 16 diseases in each hospital. Malignancy was the most commonly identified disease in all hospitals, and there was variation in disease frequencies across the four hospitals.
Table 2

Frequency of Charlson diseases and inter-reviewer agreements of identification in chart reviews.

DiagnosesHospital A (n = 80)
Hospital B (n = 80)
Hospital C (n = 75)
Hospital D (n = 80)
All hospitals (n = 315)
Inter-reviewer agreement (kappa)
n%n%n%n%n%
Myocardial infarction56.311.31114.767.5237.30.86
Congestive heart failure1215.022.579.31113.83210.20.82
Peripheral vascular disease911.311.31013.3911.3299.20.71
Cerebrovascular disease1113.878.856.71518.83812.10.61
Dementia78.822.534.045.0165.10.83
Chronic pulmonary disease56.356.31216.056.3278.60.82
Rheumatic disease11.322.511.333.872.20.93
Peptic ulcer disease22.511.356.711.392.90.62
Mild liver disease45.078.834.0810.0227.00.56
Diabetes without chronic complications1012.51012.5912.01721.34614.60.76
Diabetes with chronic complications67.500.068.056.3175.40.63
Hemiplegia or paraplegia67.522.534.033.8144.40.25
Renal disease45.033.856.733.8154.80.46
Malignancya1620.02531.32837.32835.09730.80.83
Moderate or severe liver disease00.045.011.311.361.90.66
Metastatic solid tumor78.8911.31216.01316.34113.00.71

Includes leukemia and lymphoma, excludes malignant neoplasm of skin.

Frequency of Charlson diseases and inter-reviewer agreements of identification in chart reviews. Includes leukemia and lymphoma, excludes malignant neoplasm of skin.

Validity of diagnosis

The frequencies of the Charlson diseases and validity indices for the DPC data are presented in Table 3. With the exception of peptic ulcer disease, the frequencies were lower in the DPC data than in the chart reviews. Sensitivity ranged from 0% (hemiplegia or paraplegia) to 83.5% (malignancy) and was lower than 50% for seven of 17 diseases, while specificity was above 96% for all diseases. PPV ranged from 23.1% (peptic ulcer disease) to 100% (dementia and moderate or severe liver disease, both with specificity of 100%), and exceeded 80% for 9 diseases. NPV was above 90% for all diseases.
Table 3

Frequencies of diagnoses and validity indices for the DPC data-based diagnosis identification.

DiagnosesFrequency (charts)
Frequency (DPC data)
Sensitivity (%)Specificity (%)PPV (%)NPV (%)
n%n%
Original Charlson diseases identified by recorded diagnoses
 Myocardial infarction237.3134.152.299.792.396.4
 Congestive heart failure3210.2299.268.897.575.996.5
 Peripheral vascular disease299.2123.834.599.383.393.7
 Cerebrovascular disease3812.1227.050.098.986.493.5
 Dementia165.161.937.5100.0100.096.8
 Chronic pulmonary disease278.6185.733.396.950.093.9
 Rheumatic disease72.251.657.199.780.099.0
 Peptic ulcer disease92.9134.133.396.723.198.0
 Mild liver disease227.0134.136.498.361.595.4
 Diabetes without chronic complications4614.63310.552.296.772.792.2
 Diabetes with chronic complications175.461.929.499.783.396.1
 Hemiplegia or paraplegia144.400.00.0100.095.6
 Renal disease154.8103.253.399.380.097.7
 Malignancya9730.88627.383.597.794.293.0
 Moderate or severe liver disease61.931.050.0100.0100.099.0
 Metastatic solid tumor4113.0288.958.598.585.794.1
Combined Charlson diseases identified by recorded diagnoses
 Liver disease (any)288.9154.839.398.673.394.3
 Diabetes (any)6320.03912.455.698.489.789.9
Disease identified by clinical information
 Metastatic tumor4113.04113.073.296.073.296.0

DPC, Diagnosis Procedure Combination; NPV, negative predictive value; PPV, positive predictive value.

Includes leukemia and lymphoma, excludes malignant neoplasm of skin.

Frequencies of diagnoses and validity indices for the DPC data-based diagnosis identification. DPC, Diagnosis Procedure Combination; NPV, negative predictive value; PPV, positive predictive value. Includes leukemia and lymphoma, excludes malignant neoplasm of skin. When combining two conditions in two diagnoses (liver disease and diabetes), we observed modest increases in sensitivity and specificity compared with the original diagnoses that only included less severe conditions. Identification of metastatic tumors using clinical information in the DPC data had higher sensitivity with slightly lower specificity compared with diagnosis-based identification. The mean CCIs calculated from the chart review results and DPC data were 2.2 (SD, 2.9) and 1.5 (SD, 2.4), respectively. Of the 315 patients, the CCIs calculated from the two data sources were equal to each other in 198 cases (62.9%), lower in the DPC data in 89 cases (28.3%), and higher in the DPC data in 28 cases (8.9%). Of the 315 patients, the number of diagnoses recorded as comorbidities present on admission was zero in 50 cases (15.9%), one in 53 cases (16.8%), two in 63 cases (20.0%), three in 43 cases (13.7%), and four in 106 cases (33.7%). Overall, there were 53 patients (16.8%) with false-positive diagnoses for at least one of the 17 diagnoses, and 128 (40.6%) with one or more false-negative diagnoses. Compared with the patients with 0–3 diagnoses recorded as comorbidities, the patients with four diagnoses recorded as comorbidities had a higher overall false-positive rate (27.4% vs. 11.5%, p < 0.001) and a higher overall false-negative rate (51.9% vs. 34.9%, p = 0.004). According to the chart reviews, there were 123 patients (39.0%) with one of the 17 Charlson diseases as the primary diagnosis, and 192 patients (61.0%) without such a diagnosis. The sensitivity, specificity, PPV, and NPV of the primary diagnosis in the DPC data were 78.9% (97/123), 93.2% (179/192), 88.2% (97/110), and 87.3% (179/205), respectively.

Validity of procedures

The frequencies of conducted procedures and validity indices for the DPC data are presented in Table 4. Sensitivity was low for four procedures: heart rate/respiration monitoring (66.7%), pulse oximetry (21.1%), peripheral intravenous infusion (72.7%), and urinary catheter insertion (65.5%). For the other six procedures, sensitivity exceeded 90%. The DPC data were also highly specific, with the lowest of the 10 procedures having a specificity of 88.5% (pulse oximetry).
Table 4

Frequencies of procedures and validity indices for the DPC data-based diagnosis identification.

ProceduresFrequency (charts)
Frequency (DPC data)
Sensitivity (%)Specificity (%)PPV (%)NPV (%)
n%n%
Urine tests (general)7423.57724.498.698.394.899.6
Urine microscopy3511.13611.497.199.394.499.6
Bacterial microscopy3511.13210.291.4100.0100.098.9
Bacterial culture3414.53314.097.1100.0100.099.5
Heart rate/respiration monitoring3012.83615.366.792.255.695.0
Pulse oximetry20988.94720.021.188.593.612.2
Radiography16151.115850.297.599.499.497.5
Computed tomography scan9329.59429.8100.099.598.9100.0
Peripheral intravenous infusion16552.412238.772.798.798.476.7
Urinary catheter insertion299.2278.665.597.270.496.5

DPC, Diagnosis Procedure Combination; NPV, negative predictive value; PPV, positive predictive value.

Frequencies of procedures and validity indices for the DPC data-based diagnosis identification. DPC, Diagnosis Procedure Combination; NPV, negative predictive value; PPV, positive predictive value.

Validity of laboratory data

The agreements for laboratory data between the chart review results and the SS-MIX data are presented in Table 5. Based on the chart review results, the frequency of the conducted tests ranged from 18% (brain natriuretic peptide) to 63% (white blood cell count, hemoglobin, and creatinine). The SS-MIX data were accurate in over 95% of cases for all 13 tests examined.
Table 5

Accuracy of laboratory data recorded in the SS-MIX storage.

Laboratory dataSame results recorded in charts and SS-MIX
No result recorded in charts or SS-MIX
Different results recorded in charts and SS-MIX
Result recorded in charts only
Result recorded in SS-MIX only
Agreement between charts and SS-MIX (%)
n%n%n%n%n%
WBC19561.911536.531.010.310.398.4
Platelets19461.611536.531.010.320.698.1
Hemoglobin19261.011536.561.910.310.397.5
PT-INR11636.819261.010.310.351.697.8
Na19060.312138.420.610.310.398.7
AST19461.611636.820.610.320.698.4
Total bilirubin18659.012539.731.010.331.097.8
Creatinine19762.511536.510.310.310.399.0
Total cholesterol8928.322270.500.020.620.698.7
CRP17555.613141.641.310.341.397.1
Glucose13041.317254.610.310.3113.595.9
Hemoglobin A1c5818.425280.000.020.631.098.4
BNP5718.125380.310.300.041.398.4

AST, aspartate transaminase; BNP, brain natriuretic peptide; CRP, C-reactive protein; PT-INR, prothrombin time international normalized ratio; SS-MIX, Standardized Structured Medical Record Information Exchange; WBC, white blood cell count.

Accuracy of laboratory data recorded in the SS-MIX storage. AST, aspartate transaminase; BNP, brain natriuretic peptide; CRP, C-reactive protein; PT-INR, prothrombin time international normalized ratio; SS-MIX, Standardized Structured Medical Record Information Exchange; WBC, white blood cell count.

Discussion

In the present study, we evaluated the validity of diagnoses, procedures, and laboratory data recorded as administrative data, using chart review results as the reference standard. As with any administrative data, there were some limitations to the recorded data. However, our results suggest that the DPC and SS-MIX data can serve as relatively accurate substitutes for clinical data in future studies. The specificity of DPC diagnosis records in identifying the Charlson diseases was high, but the sensitivity was low and varied across conditions. Within the diseases examined, there seemed to be considerable under-reporting, while coding of conditions that did not exist (i.e., up-coding) appeared uncommon. Compared with previous validation studies, the sensitivity was similar or slightly lower in the DPC data.8, 9, 10, 11, 12, 13 Two reasons could account for these present findings. First, attending physicians, who are not professional coders, are obliged to record the diagnoses and may not be aware of the specific codes used to express conditions. For example, “hemiplegia or paraplegia” may not be recognized as a diagnosis, and codes within “diabetes” (with or without chronic complications) may be disregarded. Second, only two primary diagnoses and up to four comorbidities can be recorded in the DPC data, in contrast to up to 16 diagnoses in a database in Canada.8, 9, 10, 11, 12 In the present study, patients with four recorded comorbidities had a higher overall false-negative rate compared with patients with three or less recorded comorbidities. Even when comorbidities are recognized, some of them may not be recorded when all four “slots” for comorbidities are occupied. Meanwhile, sensitivity (78.9%) and specificity (93.2%) were both high when limited to primary diagnosis. A strength of the present study is the random selection of participants and inclusion of those without specific diseases, which enabled calculations of sensitivity and specificity. From our results, PPV and NPV can be estimated by assigning disease prevalences. Under random selection, the PPV was acceptably high for most diseases, thus warranting diagnosis-based patient identification in future studies. By limiting the study population to patients with higher disease prevalences (e.g., elderly patients), the PPV would be higher. However, it should also be noted that patients identified using database diagnoses may be an underestimated or unrepresentative sample of the disease population. When we calculated the CCIs, the value derived from the DPC data tended to be lower than that derived from the chart review results. There were differences in the CCIs for 37.1% of the patients, including 28.3% with lower values when using the DPC data, which is consistent with previous studies.9, 10 An international comparison of databases also showed variation in the ability of the CCI to predict mortality, which may partly arise through the characteristics of databases. Although the CCI is a commonly used index of comorbidity, its interpretation requires caution because the values may be biased compared with clinical studies or studies using different databases. Cancer classifications were among the clinical information stored in the DPC data, and we tested their ability to correctly identify clinical conditions. A simple method of using M1 or recurrent tumor was accurate in identifying metastatic tumors (sensitivity: 73.2%; specificity: 96.0%). Such clinical information may serve as a useful substitute for diagnosis, especially when the number of recordable diagnoses is limited. Other clinical information in the DPC data included consciousness, New York Heart Association classification, and Hugh-Jones classification, which may also be used to identify clinical conditions. We also evaluated the validity of procedure records in the DPC data. Although the 10 procedures in our study were minor, the DPC data identified most of them with high accuracy. This is in contrast to a previous study, wherein the sensitivities for X-ray scans, computerized tomography scans, insertion of indwelling urinary catheter, and venous catheterization were 0%, 0.5%, 0%, and 39.6%, respectively. Only 10 procedures can be coded in the database used in the previous study, whereas there is no such limit in the DPC data. This difference could explain the different rates of under-reporting. Heart rate/respiration monitoring and pulse oximetry can only be reimbursed when patients have severe conditions under continuous monitoring, but such criteria were difficult to define in the chart reviews and we included all patients in whom these two procedures were performed. This could have resulted in the low sensitivities. The laboratory data recorded in the SS-MIX storage had excellent agreement with the chart review results. Instead of summarizing patient information for use in payment or database construction, the SS-MIX storage is designed to standardize and store electronic medical records themselves. Although standardization of laboratory results using JLAC-10 codes in the newly-introduced SS-MIX system was considered a possible challenge, the data were collected accurately. Our previous study utilized the SS-MIX storage in its preliminarily introduced hospitals, but a large-scale study remains to be conducted. SS-MIX data would be a useful source for future clinical epidemiology studies. Several limitations of the present study should be noted. First, we used chart review results as the reference standard to assess the validity of the DPC and SS-MIX data. These reviews are dependent on the quality of the charts, and unrecorded information could not be captured. An ideal reference standard should identify a condition that is truly present in a patient. However, such information is difficult to obtain, and previous studies have also regarded chart reviews as the best available references. Also, the kappa coefficients for inter-reviewer agreement were low for some diagnoses, which poses a challenge when considering the chart review results as a reference. Second, we conducted the study in four hospitals within the NHO. Although we confirmed variation in the disease prevalences across the hospitals, it remains unclear whether the results can be extrapolated to other institutions. Importantly, these hospitals were early adopters of the SS-MIX storage, and it is possible that their data collection and recording are more accurate compared with other hospitals. Furthermore, there could be some features of the NHO hospitals that make their recorded data more reliable compared with other hospitals in Japan. One such example is lectures to health information administrators that aim to improve disease coding. However, other hospitals could also be conducting similar initiatives. Third, we assessed limited numbers of diagnoses, procedures, and laboratory data, and the validity of items that were not examined cannot be determined. Last, the number of participants was small, and the sensitivity estimates may be statistically unstable for some diseases with low prevalence. The present study adds to the sparse literature of studies validating administrative data and can serve as an important basis for future studies using DPC and SS-MIX data. The results support the usefulness of diagnoses and procedure records within the DPC data, provided that the investigators using these data acknowledge their limitations and make appropriate interpretations. We also confirmed the validity of laboratory data in the newly-introduced SS-MIX storage, and the SS-MIX storage would add a considerable amount of information to future database-based studies.

Conflicts of interest

None declared.
  30 in total

1.  Co-morbidity data in outcomes research: are clinical data derived from administrative databases a reliable alternative to chart review?

Authors:  K H Humphries; J M Rankin; R G Carere; C E Buller; F M Kiely; J J Spinelli
Journal:  J Clin Epidemiol       Date:  2000-04       Impact factor: 6.437

2.  SS-MIX: a ministry project to promote standardized healthcare information exchange.

Authors:  M Kimura; K Nakayasu; Y Ohshima; N Fujita; N Nakashima; H Jozaki; T Numano; T Shimizu; M Shimomura; F Sasaki; T Fujiki; T Nakashima; K Toyoda; H Hoshi; T Sakusabe; Y Naito; K Kawaguchi; H Watanabe; S Tani
Journal:  Methods Inf Med       Date:  2011-01-05       Impact factor: 2.176

3.  Comparison of coding of heart failure and comorbidities in administrative and clinical data for use in outcomes research.

Authors:  Douglas S Lee; Linda Donovan; Peter C Austin; Yanyan Gong; Peter P Liu; Jean L Rouleau; Jack V Tu
Journal:  Med Care       Date:  2005-02       Impact factor: 2.983

4.  Comparison and validity of procedures coded With ICD-9-CM and ICD-10-CA/CCI.

Authors:  Carolyn De Coster; Bing Li; Hude Quan
Journal:  Med Care       Date:  2008-06       Impact factor: 2.983

5.  Updating and validating the Charlson comorbidity index and score for risk adjustment in hospital discharge abstracts using data from 6 countries.

Authors:  Hude Quan; Bing Li; Chantal M Couris; Kiyohide Fushimi; Patrick Graham; Phil Hider; Jean-Marie Januel; Vijaya Sundararajan
Journal:  Am J Epidemiol       Date:  2011-02-17       Impact factor: 4.897

6.  A new method of classifying prognostic comorbidity in longitudinal studies: development and validation.

Authors:  M E Charlson; P Pompei; K L Ales; C R MacKenzie
Journal:  J Chronic Dis       Date:  1987

7.  Validation of diagnostic codes within medical services claims.

Authors:  Machelle Wilchesky; Robyn M Tamblyn; Allen Huang
Journal:  J Clin Epidemiol       Date:  2004-02       Impact factor: 6.437

8.  Differences in cancer stage, treatment and in-hospital mortality between patients with and without schizophrenia: retrospective matched-pair cohort study.

Authors:  Hanako Ishikawa; Hideo Yasunaga; Hiroki Matsui; Kiyohide Fushimi; Norito Kawakami
Journal:  Br J Psychiatry       Date:  2015-11-19       Impact factor: 9.319

9.  Improved accuracy of co-morbidity coding over time after the introduction of ICD-10 administrative data.

Authors:  Jean-Marie Januel; Jean-Christophe Luthi; Hude Quan; François Borst; Patrick Taffé; William A Ghali; Bernard Burnand
Journal:  BMC Health Serv Res       Date:  2011-08-18       Impact factor: 2.655

10.  Comparison of Procedure-Based and Diagnosis-Based Identifications of Severe Sepsis and Disseminated Intravascular Coagulation in Administrative Data.

Authors:  Hayato Yamana; Hiromasa Horiguchi; Kiyohide Fushimi; Hideo Yasunaga
Journal:  J Epidemiol       Date:  2016-04-09       Impact factor: 3.211

View more
  130 in total

1.  Follow-up tests and outcomes for patients undergoing percutaneous coronary intervention: analysis of a Japanese administrative database.

Authors:  Tomotsugu Seki; Masato Takeuchi; Ryusuke Miki; Koji Kawakami
Journal:  Heart Vessels       Date:  2018-07-14       Impact factor: 2.037

2.  Early enteral nutrition for cardiogenic or obstructive shock requiring venoarterial extracorporeal membrane oxygenation: a nationwide inpatient database study.

Authors:  Hiroyuki Ohbe; Taisuke Jo; Hayato Yamana; Hiroki Matsui; Kiyohide Fushimi; Hideo Yasunaga
Journal:  Intensive Care Med       Date:  2018-07-20       Impact factor: 17.440

3.  Association between body mass index and localization of breast cancer: results from a nationwide inpatient database in Japan.

Authors:  Takaaki Konishi; Michimasa Fujiogi; Nobuaki Michihata; Kojiro Morita; Hiroki Matsui; Kiyohide Fushimi; Masahiko Tanabe; Yasuyuki Seto; Hideo Yasunaga
Journal:  Breast Cancer Res Treat       Date:  2020-09-19       Impact factor: 4.872

4.  Temporal Trends in the Practice Pattern for Venous Thromboembolism in Japan: Insight From JROAD-DPC.

Authors:  Yugo Yamashita; Takeshi Morimoto; Yusuke Yoshikawa; Hidenori Yaku; Yoko Sumita; Michikazu Nakai; Koh Ono; Takeshi Kimura
Journal:  J Am Heart Assoc       Date:  2020-01-10       Impact factor: 5.501

5.  Association between intravenous contrast media exposure and non-recovery from dialysis-requiring septic acute kidney injury: a nationwide observational study.

Authors:  Yoshihisa Miyamoto; Masao Iwagami; Shotaro Aso; Hideo Yasunaga; Hiroki Matsui; Kiyohide Fushimi; Yoshifumi Hamasaki; Masaomi Nangaku; Kent Doi
Journal:  Intensive Care Med       Date:  2019-08-26       Impact factor: 17.440

6.  Prophylactic sivelestat for esophagectomy and in-hospital mortality: a propensity score-matched analysis of claims database.

Authors:  Chikashi Takeda; Masato Takeuchi; Yohei Kawasaki; Hiroshi Yonekura; Isao Nahara; Aki Kuwauchi; Satomi Yoshida; Shiro Tanaka; Koji Kawakami
Journal:  J Anesth       Date:  2019-01-05       Impact factor: 2.078

7.  Impact of congenital heart disease on outcomes after primary repair of esophageal atresia: a retrospective observational study using a nationwide database in Japan.

Authors:  Tetsuya Ishimaru; Michimasa Fujiogi; Nobuaki Michihata; Hiroki Matsui; Kiyohide Fushimi; Hiroshi Kawashima; Jun Fujishiro; Hideo Yasunaga
Journal:  Pediatr Surg Int       Date:  2019-08-08       Impact factor: 1.827

8.  Predictors Associated with Survival Among Elderly In-Patients Who Receive Cardiopulmonary Resuscitation in Japan: An Observational Cohort Study.

Authors:  Tetsuro Hayashi; Masato Matsushima; Seiji Bito; Natsuko Kanazawa; Norihiko Inoue; Sarah Kyuragi Luthe; Christina C Wee
Journal:  J Gen Intern Med       Date:  2018-11-27       Impact factor: 5.128

9.  Cholinergic Crisis Caused by Cholinesterase Inhibitors: a Retrospective Nationwide Database Study.

Authors:  Hiroyuki Ohbe; Taisuke Jo; Hiroki Matsui; Kiyohide Fushimi; Hideo Yasunaga
Journal:  J Med Toxicol       Date:  2018-06-15

10.  Clinical features and practice patterns of gastroschisis: a retrospective analysis using a Japanese national inpatient database.

Authors:  Michimasa Fujiogi; Nobuaki Michihata; Hiroki Matsui; Kiyohide Fushimi; Hideo Yasunaga; Jun Fujishiro
Journal:  Pediatr Surg Int       Date:  2018-05-16       Impact factor: 1.827

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.