| Literature DB >> 31118821 |
Cheng-Yang Hsieh1,2, Chien-Chou Su1, Shih-Chieh Shao1,3, Sheng-Feng Sung4,5, Swu-Jane Lin6, Yea-Huei Kao Yang1, Edward Chia-Cheng Lai1,7.
Abstract
Taiwan's National Health Insurance Research Database (NHIRD) exemplifies a population-level data source for generating real-world evidence to support clinical decisions and health care policy-making. Like with all claims databases, there have been some validity concerns of studies using the NHIRD, such as the accuracy of diagnosis codes and issues around unmeasured confounders. Endeavors to validate diagnosed codes or to develop methodologic approaches to address unmeasured confounders have largely increased the reliability of NHIRD studies. Recently, Taiwan's Ministry of Health and Welfare (MOHW) established a Health and Welfare Data Center (HWDC), a data repository site that centralizes the NHIRD and about 70 other health-related databases for data management and analyses. To strengthen the protection of data privacy, investigators are required to conduct on-site analysis at an HWDC through remote connection to MOHW servers. Although the tight regulation of this on-site analysis has led to inconvenience for analysts and has increased time and costs required for research, the HWDC has created opportunities for enriched dimensions of study by linking across the NHIRD and other databases. In the near future, researchers will have greater opportunity to distill knowledge from the NHIRD linked to hospital-based electronic medical records databases containing unstructured patient-level information by using artificial intelligence techniques, including machine learning and natural language processes. We believe that NHIRD with multiple data sources could represent a powerful research engine with enriched dimensions and could serve as a guiding light for real-world evidence-based medicine in Taiwan.Entities:
Keywords: Health and Welfare Data Center of Taiwan; big data analysis; database cross-linkage; real-world data; validation
Year: 2019 PMID: 31118821 PMCID: PMC6509937 DOI: 10.2147/CLEP.S196293
Source DB: PubMed Journal: Clin Epidemiol ISSN: 1179-1349 Impact factor: 4.790
Databases in the health and welfare databases center
| A. Linkable health care databases | |
|---|---|
| National Health Insurance Research Database (NHIRD) | Longitudinal Health Insurance Database (LHID) |
| Datasets
Ambulatory care expenditures by visits Inpatient expenditures by admissions Expenditures for prescriptions dispensed at contracted pharmacies Details of ambulatory care orders Details of inpatient orders Details of prescriptions dispensed at contracted pharmacies Health services utilization of medical facilities Registry for contracted medical facilities Registry for board-certified specialists Accreditation profile of medical facilities Registry for medical personnel Registry for beneficiaries Registry for catastrophic illness | Longitudinal Health Insurance Database 2000 Longitudinal Health Insurance Database 2005 Longitudinal Health Insurance Database 2010 |
Figure 1Conceptual presentation of database cross-linkage within HWDC.
Summary of validation studies regarding the validity of diagnosis codes in the NHIRD
| Diseases/conditions | ICD-9-CM code | Sensitivity, % | Positive predictive value, PPV % | Note | References |
|---|---|---|---|---|---|
| Acute ischemic stroke | 433.xx, 434.xx | 94.5 | 97.9 | Chart review by neurologic specialist as reference standard | Pharmacoepidemiol Drug Saf 2011 |
| Epilepsy | 345.xx | 81.4 | 76.8 | Chart review by neurologic specialist as reference standard; specificity: 99.8% | Epilepsia 2012 |
| Pneumonia | 480.xx–486.xx | 92.3–94.7 | Not available | Chart review by medical doctors as reference standard | CMAJ 2014 |
| Coronary artery bypass graft postoperative surgical site infection | 996.03, 996.61, 996.72, 998.5 038.0–038.4, 038.8, 038.9, 682.6, 682.9, 780.6, 790.7, 875.0, 875.1, 891.0, 891.1, 996.03, 996.61, 996.72, 998.3, and 998.5. | 35.3 | 19.4 | Health care-associated infection surveillance data and manually reviewed medical charts as reference standard; specificity: 97.0% | BMC Med Inform Decis Mak 2014 |
| Acute myocardial infarction | 410.xx | 88.0 | 92.0 | Chart review by neurologic specialist as reference standard | J Epidemiol 2014 |
| Renal dysfunction | 250.4, 283.11, 403.x, 404.x, 580–589, 753.0, 753.1 | 38.4 | 76.0 | Taiwan Stroke Registry as reference standard; only validated in stroke patients; specificity: 94.7% | Int J Stroke 2015 |
| Acute ischemic stroke | 433.xx, 434.xx | 97.3 | 88.4 | Taiwan Stroke Registry as reference standard | J Formos Med Assoc 2015 |
| Tuberculosis contact | V01.1 with at least 1 chest radiographic examination or 795.5 | 98.3 | Not available | Chart review by pulmonologists as reference standard | Medicine 2016 |
| Hypertension | 401.x, 402.x, 403.x, 404.x, 405.x | 92.4 | 88.5 | Few conditions in patients with stroke. Taiwan Stroke Registry as reference standard | Int J Cardiol 2016 |
| Diabetes | 250.x | 90.9 | 92.0 | ||
| Hyperlipidemia | 272.x | 69.1 | 89.5 | ||
| Coronary artery disease | 410.x, 411.x, 412.x, 413.x, 414.x | 63.7 | 47.6 | ||
| Atrial fibrillation | 427.31 | 72.8 | 71.1 | ||
| Tuberculosis | 010–018 plus prescriptions of at least two anti-tuberculosis drugs | 96.3 | Not available | Chart review by pulmonologists as reference standard | Chest 2017 |
| Heart failure | 428 | Not available | 97.6 | Chart review by cardiologic specialist as reference standard | J Am Heart Assoc 2017 |
| Ischemic stroke | 433–437 | Not available | 94.2 | ||
| All cancer | 140–208 | 91.5 | 93.6 | National Cancer Registry of Taiwan as reference standard | Pharmacoepidemiol Drug Saf 2018 |
| Varicose veins | 454 | Not available | 98.0 | Chart review as reference standard | JAMA 2018 |