Lia Jamian, Lee Wheless, Leslie J Crofford, April Barnado.
Abstract
BACKGROUND: Systemic sclerosis (SSc) is a rare disease with studies limited by small sample sizes. Electronic health records (EHRs) represent a powerful tool to study patients with rare diseases such as SSc, but validated methods are needed. We developed and validated EHR-based algorithms that incorporate billing codes and clinical data to identify SSc patients in the EHR.
Keywords: Algorithms; Bioinformatics; Electronic health records; Systemic sclerosis
Year: 2019 PMID: 31888720 PMCID: PMC6937803 DOI: 10.1186/s13075-019-2092-7
Source DB: PubMed Journal: Arthritis Res Ther ISSN: 1478-6354 Impact factor: 5.156
Fig. 1 Development of algorithms to identify patients with systemic sclerosis (SSc) in the electronic health record (EHR). At least a 1-time count of the SSc ICD-9 code (710.1) or ICD-10-CM codes (M34*) was applied to the 3 million subjects in Vanderbilt’s Synthetic Derivative, which resulted in 1899 potential SSc cases. Of these 1899 potential SSc cases, 200 were randomly selected for a training set to develop and test algorithms with various combinations of the SSc ICD-9 and ICD-10-CM codes, keyword search for Raynaud’s phenomenon, and positive ANA (≥ 1:80). The highest performing algorithm was internally validated in a set of 100 subjects who were not part of the original training set.
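The rule components named in the caption (a minimum count of SSc billing codes, a positive ANA at a titer of at least 1:80, and a keyword hit for Raynaud's phenomenon) can be illustrated with a minimal Python sketch. The `Patient` record layout, the function names, and the plain substring match for "raynaud" are assumptions made for illustration; they are not the authors' implementation or the schema of the Synthetic Derivative.

```python
import re
from dataclasses import dataclass, field
from typing import List


@dataclass
class Patient:
    # Hypothetical record layout, assumed for this sketch only.
    codes: List[str]                                      # one entry per billing-code occurrence
    ana_titers: List[str] = field(default_factory=list)   # titers written as "1:N", e.g. "1:160"
    notes: List[str] = field(default_factory=list)        # free-text clinical notes


def count_ssc_codes(codes: List[str]) -> int:
    """Count occurrences of ICD-9 710.1 or any ICD-10-CM code starting with M34."""
    return sum(1 for c in codes if c == "710.1" or c.upper().startswith("M34"))


def ana_positive(titers: List[str], threshold: int = 80) -> bool:
    """True if any titer of the form '1:N' has N >= threshold (1:80 in the caption)."""
    for t in titers:
        m = re.fullmatch(r"\s*1\s*:\s*(\d+)\s*", t)
        if m and int(m.group(1)) >= threshold:
            return True
    return False


def mentions_raynaud(notes: List[str]) -> bool:
    """Naive keyword search; it does not handle negation such as 'no Raynaud's'."""
    return any(re.search(r"raynaud", n, re.IGNORECASE) for n in notes)


def is_case(p: Patient, min_count: int,
            require_ana: bool = False, require_rp: bool = False) -> bool:
    """Apply one rule: code count >= min_count, optionally AND ANA, AND RP keyword."""
    if count_ssc_codes(p.codes) < min_count:
        return False
    if require_ana and not ana_positive(p.ana_titers):
        return False
    if require_rp and not mentions_raynaud(p.notes):
        return False
    return True


# Made-up example patient: three SSc codes, one qualifying titer, one keyword hit.
example = Patient(
    codes=["710.1", "M34.0", "M34.9", "401.9"],
    ana_titers=["1:40", "1:160"],
    notes=["History of Raynaud's phenomenon."],
)
print(is_case(example, min_count=3, require_ana=True, require_rp=True))
```

Varying `min_count` from 1 to 4 and toggling the two flags gives a family of rules of the kind the caption describes; a real pipeline would also need to decide how to treat negated or historical mentions in notes.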
Characteristics of SSc cases and non-cases in the training set
| Characteristics | SSc cases | Non-cases | P value¹ |
|---|---|---|---|
| Age, years, mean ± standard deviation | 68 ± 14 | 59 ± 20 | < 0.01 |
| Female, n (%) | 71 (83%) | 84 (89%) | 0.19 |
| White, n (%) | 65 (76%) | 70 (75%) | 0.86 |
| Number of counts of the SSc ICD-9² code (710.1), mean ± standard deviation | 10 ± 16 | 2 ± 5 | < 0.01 |
| Number of counts of the SSc ICD-10-CM³ codes (M34*), mean ± standard deviation | 6 ± 7 | 2 ± 8 | < 0.01 |
| Years of follow-up⁴, mean ± standard deviation | 7 ± 6 | 10 ± 7 | < 0.01 |
¹ Mann-Whitney U test for continuous variables and chi-square test for categorical variables
² ICD-9: International Classification of Diseases, Ninth Revision
³ ICD-10-CM: International Classification of Diseases, Tenth Revision, Clinical Modification
⁴ Years of data available in the electronic health record from first to last ICD-9 and/or ICD-10-CM codes for any conditions
Performance of electronic health record algorithms for systemic sclerosis
| Algorithm¹ | PPV (%) | Sensitivity (%) | F-score (%) |
|---|---|---|---|
| ICD-9 codes only | | | |
| ≥ 1 count of the ICD-9 code (710.1) | 52 | 100 | 81 |
| ≥ 2 counts | 63 | 88 | 74 |
| ≥ 3 counts | 79 | 72 | 75 |
| ≥ 4 counts | 86 | 67 | 75 |
| ICD-10 codes only | | | |
| ≥ 1 count of the ICD-10 codes (M34*) | 82 | 94 | 88 |
| ≥ 2 counts | 84 | 91 | 87 |
| ≥ 3 counts | 88 | 85 | 87 |
| ≥ 4 counts | 91 | 85 | 88 |
| ICD-9 or ICD-10 codes | | | |
| ≥ 1 count | 52 | 98 | 68 |
| ≥ 2 counts | 70 | 97 | 81 |
| ≥ 3 counts | 86 | 94 | 90 |
| ≥ 4 counts | 91 | 91 | 91 |
| ICD-9 code AND ANA positive² | | | |
| ≥ 1 count of the ICD-9 code AND ANA | 53 | 81 | 64 |
| ≥ 2 counts of the ICD-9 code AND ANA | 68 | 81 | 74 |
| ≥ 3 counts of the ICD-9 code AND ANA | 84 | 70 | 77 |
| ≥ 4 counts of the ICD-9 code AND ANA | 93 | 64 | 76 |
| ICD-10 codes AND ANA positive | | | |
| ≥ 1 count of the ICD-10 codes AND ANA | 95 | 53 | 68 |
| ≥ 2 counts AND ANA | 95 | 53 | 68 |
| ≥ 3 counts AND ANA | 100 | 50 | 67 |
| ≥ 4 counts AND ANA | 100 | 50 | 67 |
| ICD-9 code AND Raynaud’s phenomenon (RP) keyword | | | |
| ≥ 1 count of the ICD-9 code AND RP | 78 | 90 | 84 |
| ≥ 2 counts AND RP | 86 | 80 | 83 |
| ≥ 3 counts AND RP | 92 | 66 | 77 |
| ≥ 4 counts AND RP | 91 | 60 | 73 |
| ICD-9 code, RP, ANA positive | | | |
| ≥ 1 count of the ICD-9 code AND (ANA OR RP) | 55 | 95 | 70 |
| ≥ 2 counts AND (ANA OR RP) | 67 | 89 | 76 |
| ≥ 3 counts AND (ANA OR RP) | 85 | 77 | 81 |
| ≥ 4 counts AND (ANA OR RP) | 94 | 70 | 81 |
| ≥ 1 count AND ANA AND RP | 75 | 75 | 75 |
| ≥ 2 counts AND ANA AND RP | 87 | 75 | 80 |
| ≥ 3 counts AND ANA AND RP | 94 | 66 | 77 |
| ≥ 4 counts AND ANA AND RP | 93 | 59 | 72 |
¹ All algorithms required at least one count of the SSc ICD-9 (710.1) or ICD-10-CM (M34*) codes
² ANA positive: titer ≥ 1:80
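As a reading aid for the performance table, the sketch below shows how PPV and sensitivity are computed from chart-review counts and how they combine into an F-score. It assumes the table's final column is the F-score in its usual form, the harmonic mean of PPV and sensitivity, which is consistent with most of the tabulated rows. The function names and the example counts are illustrative and not taken from the paper.

```python
def ppv(true_pos: int, false_pos: int) -> float:
    """Positive predictive value: share of algorithm-flagged subjects who are true cases."""
    return true_pos / (true_pos + false_pos)


def sensitivity(true_pos: int, false_neg: int) -> float:
    """Share of true cases that the algorithm flags."""
    return true_pos / (true_pos + false_neg)


def f_score(ppv_value: float, sens_value: float) -> float:
    """Harmonic mean of PPV and sensitivity (assumed definition of the F-score column)."""
    total = ppv_value + sens_value
    return 2 * ppv_value * sens_value / total if total else 0.0


# Rows taken from the table, with PPV and sensitivity in percent.
print(round(f_score(91, 91)))   # ICD-9 or ICD-10 codes, >= 4 counts -> 91
print(round(f_score(95, 53)))   # ICD-10 codes AND ANA, >= 1 count  -> 68
print(round(f_score(100, 50)))  # ICD-10 codes AND ANA, >= 3 counts -> 67

# Hypothetical review counts (not from the paper): 9 true positives,
# 1 false positive, 3 false negatives.
print(ppv(9, 1), sensitivity(9, 3))  # 0.9 0.75
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a rule with 100% PPV but 50% sensitivity scores lower than one that reaches 91% on both.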