Yisha Ye1, Richard Hubbard2, Gloria Hoi-Yee Li3, Shun Cheong Ho1, Chor Wing Sing1, Ching Lung Cheung1,4, David Chi Leung Lam5. 1. Department of Pharmacology and Pharmacy, The University of Hong Kong, Pokfulam, Hong Kong SAR, China. 2. School of Medicine, University of Nottingham, Nottingham, UK. 3. Department of Health Technology and Informatics, Faculty of Health and Social Sciences, The Hong Kong Polytechnic University, Hong Kong SAR, China. 4. Laboratory of Data Discovery for Health (D24H), Hong Kong Science Park, Pak Shek Kok, Hong Kong SAR, China. 5. Department of Medicine, The University of Hong Kong, Pokfulam, Hong Kong SAR, China.
Abstract
OBJECTIVE: Large electronic medical record (EMR) databases can facilitate epidemiology research into uncommon diseases such as interstitial lung disease (ILD). Given the rarity and diagnostic difficulty of ILD, the validity of the coding in EMR requires clarification. We aimed to assess the validity of International Classification of Diseases, 9th Revision (ICD-9) code algorithms for identifying ILD in the territory-wide electronic medical health record system of Clinical Data Analysis and Reporting System (CDARS) in Hong Kong. METHOD: Patients who visited the Queen Mary Hospital in 2005-2018 with ILD were identified using the following ICD-9 codes: post-inflammatory pulmonary fibrosis (PPF; ICD-9: 515), idiopathic fibrosing alveolitis (IFA; ICD-9: 516.3), connective tissue disease-associated interstitial lung disease (CTD-ILD; ICD-9: 517.2, 517.8, 714.81), sarcoidosis (ICD-9: 135) and extrinsic allergic alveolitis (EAA; ICD-9: 495). A random selection was conducted in cases with diagnostic code of PPF and IFA, where a relative higher case number was identified. All the cases of CTD-ILD, sarcoidosis and EAA were included in validation for relatively small case number. RESULTS: Two hundred and sixty nine cases were validated using medical record review by a respiratory specialist. The overall positive predictive value (PPV) was 79% (95% CI, 74%-84%). In subgroup analysis, true positive case numbers of PPF, IFA, CTD-ILD, sarcoidosis and EAA were 74/100 (74%), 95/100 (95%), 11/15 (73%), 27/32 (84%) and 6/22 (27%), respectively. CONCLUSIONS: This was the first ICD-9 coding validation for ILD in Hong Kong CDARS. Our study demonstrated that using ICD-9 algorithms 515, 516.3, 517.2, 517.8, 714.81 and 135 enhanced identifications of ILDs with PPV that was reliable to support utility of CDARS database for further clinical research on ILDs. The validity is particularly high with 516.3.
OBJECTIVE: Large electronic medical record (EMR) databases can facilitate epidemiology research into uncommon diseases such as interstitial lung disease (ILD). Given the rarity and diagnostic difficulty of ILD, the validity of the coding in EMR requires clarification. We aimed to assess the validity of International Classification of Diseases, 9th Revision (ICD-9) code algorithms for identifying ILD in the territory-wide electronic medical health record system of Clinical Data Analysis and Reporting System (CDARS) in Hong Kong. METHOD: Patients who visited the Queen Mary Hospital in 2005-2018 with ILD were identified using the following ICD-9 codes: post-inflammatory pulmonary fibrosis (PPF; ICD-9: 515), idiopathic fibrosing alveolitis (IFA; ICD-9: 516.3), connective tissue disease-associated interstitial lung disease (CTD-ILD; ICD-9: 517.2, 517.8, 714.81), sarcoidosis (ICD-9: 135) and extrinsic allergic alveolitis (EAA; ICD-9: 495). A random selection was conducted in cases with diagnostic code of PPF and IFA, where a relative higher case number was identified. All the cases of CTD-ILD, sarcoidosis and EAA were included in validation for relatively small case number. RESULTS: Two hundred and sixty nine cases were validated using medical record review by a respiratory specialist. The overall positive predictive value (PPV) was 79% (95% CI, 74%-84%). In subgroup analysis, true positive case numbers of PPF, IFA, CTD-ILD, sarcoidosis and EAA were 74/100 (74%), 95/100 (95%), 11/15 (73%), 27/32 (84%) and 6/22 (27%), respectively. CONCLUSIONS: This was the first ICD-9 coding validation for ILD in Hong Kong CDARS. Our study demonstrated that using ICD-9 algorithms 515, 516.3, 517.2, 517.8, 714.81 and 135 enhanced identifications of ILDs with PPV that was reliable to support utility of CDARS database for further clinical research on ILDs. The validity is particularly high with 516.3.