| Literature DB >> 27891528 |
Richard D Boyce1, Steven M Handler1, Jordan F Karp1, Subashan Perera1, Charles F Reynolds1.
Abstract
INTRODUCTION: A potential barrier to nursing home research is the limited availability of research quality data in electronic form. We describe a case study of converting electronic health data from five skilled nursing facilities to a research quality longitudinal dataset by means of open-source tools produced by the Observational Health Data Sciences and Informatics (OHDSI) collaborative.Entities:
Keywords: common data model; elderly; individuals who need chronic care; informatics
Year: 2016 PMID: 27891528 PMCID: PMC5108634 DOI: 10.13063/2327-9214.1252
Source DB: PubMed Journal: EGEMS (Wash DC) ISSN: 2327-9214
Figure 1.Flow Diagram Showing the Count and Reasons that Minimum Data Set 3.0 Records Were Dropped During the Loading Process
Figure 2.Overview of the CDM Data Provided by a Subset of Reports Created Using the OHDSI Achilles Data Characterization Program
Notes: A prevalence of gender of race*; B: year of birth at first observation; C: number of observations per person; D: distribution of duration of observation by age decile; E: percent of population by cumulative observation period.
*The “Unknown” category contains race/ethnic categories such as “Asian and “Hispanic or Latino” that were coded in the source MDS data but were not translated to codes in CDM. This will be corrected in future work.
Population Characteristics for Selected Variables from All Five SNF Facilities Included in the Study
| Age | ||||||
| < 65 | 87 / 16% | 238 / 21% | 310 / 22% | 217 / 19% | 48 / 16% | 900 / 20% |
| 65–74 | 76 / 14% | 182 / 16% | 282 / 20% | 190 / 17% | 56 / 19% | 786 / 17% |
| 75–84 | 171 / 31% | 311 / 28% | 413 / 29% | 322 / 28% | 99 / 33% | 1316 / 29% |
| 85+ | 219 / 40% | 380 / 34% | 403 / 29% | 417 / 36% | 98 / 33% | 1517 / 34% |
| Race | ||||||
| White | 440 / 80% | 714 / 64% | 1311 / 93% | 922 / 81% | 289 / 96% | 3676/ 81% |
| Black | 94 / 17% | 357 / 32% | 57 / 4% | 212 / 19% | 5 / 2% | 725 / 16% |
| Other | 19 / 3% | 40 / 4% | 40 / 3% | 12 / 1% | 7 / 2% | 118 / 3% |
| Gender | ||||||
| Female | 360 / 65% | 758 / 68% | 846 / 60% | 737 / 64% | 179 / 60% | 2880 / 64% |
| Male | 193 / 35% | 353 / 32% | 562 / 40% | 409 / 36% | 122 / 41% | / 36% |
| Marital Status | ||||||
| Married | 156 / 28% | 205 / 19% | 369 / 26% | 306 / 27% | 90 / 30% | 1126 / 25% |
| Divorced /Separated | 48 / 9% | 104 / 9% | 102 / 7% | 102 / 9% | 41 / 14% | 397 / 9% |
| Widowed | 228 / 41% | 257 / 23% | 409 / 29% | 420 / 37% | 119 / 40% | 1433 / 32% |
| Never married | 112 / 20% | 204 / 18% | 156 / 11% | 170 / 15% | 39 / 13% | 681 / 15% |
| Unknown | 9 / 2% | 341 / 31% | 372 / 26% | 148 / 13% | 12 / 4% | 882 / 20% |
| Cognitive impairment (BIMS summary score ≤ 12) | 106 / 19% | 206 / 19% | 258 / 18% | 208 / 18% | 82 / 27% | 860 / 19% |
| Fall since admission (any stay) | 177 / 32% | 329 / 30% | 335 / 24% | 351 / 31% | 139 / 46% | 1331 / 30% |
| Impaired functional status (ADL summary score ≥ 16) | 506 / 92% | 958 / 86% | 1310 / 93% | 1025 / 89% | 265 / 88% | 4064 / 90% |
| Impaired transfer (ADL required extensive or total assistance or occurred only once or twice) | 497 / 90% | 897 / 81% | 1283 / 91% | 1017 / 89% | 268 / 89% | 3962 / 88% |
| Alzheimer’s | 60 / 11% | 146 / 13% | 104 / 7% | 176 / 15% | 40 / 13% | 526 / 12% |
| Anemia | 222 / 40% | 482 / 43% | 415 / 29% | 598 / 52% | 165 / 55% | 1882 / 42% |
| Anxiety | 163 / 29% | 292 / 26% | 302 / 21% | 322 / 28% | 128 / 43% | 1207 / 27% |
| Aphasia | 11 / 2% | 37 / 3% | 24 / 2% | 53 / 5% | 22 / 7% | 147 / 3% |
| Arthritis | 218 / 39% | 453 / 41% | 249 / 18% | 515 / 45% | 84 /28% | 1519 / 34% |
| Arteriosclerotic heart disease | 170 / 31% | 290 / 26% | 268 / 19% | 335 / 29% | 102 / 34% | 1165 / 26% |
| Benign prostate hyperplasia | 44 / 8% | 76 / 7% | 59 / 4% | 119 / 10% | 31 / 10% | 329 / 7% |
| Cancer | 73 / 13% | 140 / 13% | 102 / 7% | 189 / 17% | 20 / 7% | 524 / 12% |
| Stroke | 46 / 8% | 158 / 14% | 114 / 8% | 273 / 24% | 62 / 21% | 653 / 14% |
| Constipation | 50 / 9% | 292 / 26% | 160 / 11% | 104 / 9% | 9 / 3% | 615 / 14% |
| Non-Alzheimer’s dementia | 165 / 30% | 241 / 22% | 218 / 15% | 253 / 22% | 144 / 48% | 1021 / 23% |
| Depression | 251 / 45% | 461 / 41% | 511 / 36% | 495 / 43% | 165 / 55% | 1883 / 42% |
| Diabetes Mellitus | 189 / 34% | 395 / 36% | 506 / 36% | 427 / 37% | 101 / 34% | 1618 / 36% |
| Embolisms | 10 / 2% | 57 / 5% | 23 / 2% | 42 / 4% | 12 / 4% | 144 / 3% |
| COPD | 138 / 25% | 323 / 29% | 336 / 24% | 366 / 32% | 85 / 28% | 1248 / 28% |
| GERD or GI Ulcer | 194 / 35% | 322 / 29% | 329 / 23% | 386 / 34% | 157 / 52% | 1388 / 31% |
| Congestive heart failure | 138 / 25% | 300 / 27% | 313 / 22% | 340 / 30% | 73 / 24% | 1164 / 26% |
| Hemiplegia | 39 / 7% | 71 / 6% | 41 / 3% | 97 / 8% | 17 / 6% | 265 / 6% |
| Hypertension | 421 / 76% | 818 / 74% | 979 / 70% | 862 / 75% | 235 / 78% | 3315 / 74% |
| Hypotension | 8 / 1% | 21 / 2% | 22 / 2% | 24 / 2% | 7 / 2% | 82 / 2% |
| Hypoosmolar hyponatremia | 5 / 1% | 20 / 2% | 15 / 1% | 26 / 2% | 13 / 4% | 79 / 2% |
| Bipolar disorder | 27 / 5% | 61 / 5% | 61 / 4% | 28 / 2% | 10 / 3% | 187 / 4% |
| Infection due to resistant organism (MDRO) | 21 / 4% | 25 / 2% | 108 / 8% | 30 / 3% | 16 / 5% | 200 / 5% |
| Multiple sclerosis | 6 / 1% | 9 / 1% | 12 / 1% | 23 / 2% | 4 / 1% | 54 / 1% |
| Neurogenic bladder | 17 / 3% | 17 / 2% | 19 / 1% | 45 / 4% | 8 / 3% | 106 / 2% |
| Osteoporosis | 102 / 18% | 170 / 15% | 93 / 7% | 213 / 19% | 72 / 24% | 650 / 14% |
| Parkinson’s disease | 25 / 5% | 50 / 5% | 49 / 3% | 63 / 5% | 24 / 8% | 211 / 5% |
| Pneumonia | 67 / 12% | 118 / 11% | 109 / 8% | 165 / 14% | 28 / 9% | 487 / 11% |
| Schizophrenia | 5 / 1% | 35 / 3% | 14 / 1% | 21 / 2% | 10 / 3% | 85 / 2% |
| Seizure | 36 / 7% | 90 / 8% | 56 / 4% | 107 / 9% | 45 / 15% | 334 / 7% |
| Septicemia | 16 / 3% | 6 / 1% | 22 / 2% | 22 / 2% | 5 / 2% | 71 / 2% |
| Thyroid disorder | 118 / 21% | 184 / 17% | 231 / 16% | 262 / 23% | 78 / 26% | 873 / 19% |
| UTI | 116 / 21% | 130 / 12% | 179 / 13% | 235 / 21% | 61 / 20% | 721 / 16% |
| Open wound without complication | 25 / 5% | 32 / 3% | 32 / 2% | 56 / 5% | 6 / 2% | 151 / 3% |
| Psychosis | 60 / 11% | 57 / 5% | 30 / 2% | 82 / 7% | 139 / 46% | 368 / 8% |
Notes:
Age and Marital status values were from the first admission or quarterly MDS report for a patient and might not reflect changes in status throughtout a stay.
Achilles Heel Errors Identified and How Addressed
| Issues with how codes from the standard vocabulary were used | Number of persons with at least one procedure occurrence, by procedure_concept_id; 2 concepts in data are not in correct vocabulary (CPT4/HCPCS/ICD9P) | The only “procedures” in our data set were MDS reports that we consider administrative procedures. These break down into those done at admission and those done for existing patients. Our review of CPT4/HCPCS/ICD9P did not find any relevant codes for these two types of procedures. Therefore, we ignored this error and continued to use the two SNOMED CT codes we had chosen (108221006: “Evaluation AND/ OR management - established patient” and 108220007: “Evaluation AND/OR management - new patient”). |
| Issues with how codes from the standard vocabulary were used | Number of persons with at least one observation occurrence, by observation_concept_id; 1 concepts in data are not in correct vocabulary (LOINC) | This was an expected issue because we coded fall incidents using the OHDSI concept identifier for the MedDRA preferred term for “Fall” since no appropriate LOINC code could be found. |
| Date patient data are collected falls outside a valid observation period | Number of procedure occurrence records outside valid observation period; count (n=9,767) should not be > 0 | Our original algorithm for generating observation periods from MDS data did not correctly implement all the business rules provided by the MDS 3 Quality Measures user manual. |
| Date patient data are collected falls outside a valid observation period; drug exposure periods with invalid date values | Number of drug exposure records outside valid observation period; count (n=2,195) should not be > 0 | The reasons for these errors were the same as those that caused the procedure occurrence records outside a valid observation period (see above). Once those issues were corrected, we changed the load procedure to check that a drug order fell within an observation period before adding the records to the Drug Exposure and Drug Era tables. |
| Date patient data are collected falls outside a valid observation period | Number of observation records outside valid observation period; count (n=77,898) should not be > 0 | The same error that caused the procedure occurrence records to fall outside a valid observation period (see above) was also responsible for most counts of this error. Changing the load procedure to check that an observation fell within an observation period before adding the records to the Observation table corrected all but 26,209 cases. Analysis of these cases found that these occur in two cases where we had intentionally dropped records: (1) when a patient had a single day stay, or (2) a patient’s only MDS 3 records occurred before the study start date. We took no action on these remaining cases because all analyses should use correct observation periods to identify valid exposures and observations. |
| Observation records with invalid values | Number of observation records with no value (numeric, string, or concept); count (n=162,271) should not be > 0 | This error exposed a bug in the translation and loading procedure whereby data from validated MDS scales (e.g., the BIMS) were not being loaded properly. Correcting this issue removed the problem. |
| Drug exposure periods with invalid date values | Number of drug eras with end date < start date; count (n=179) should not be > 0 | There were two causes for this error: (1) the source data had a small number of drug records with incorrect start and end dates; and (2) the same issues with the business rules for generating observation periods that affected procedure, drug, and observations above caused the code creating drug eras to create erroneous drug eras for some patients. Both issues were addressed and this error was no longer triggered. |
Comparison of CDM Drug Dispensing Data to Medication Administration Records with Respect to the Prevalence of Exposure to Specific Drugs During the First Week of Each Quarter
| A | 11 | 8.6 | 1.0 | 3.3 | 4.6 | 4.3(9) | 31.9 | 0.0 | 9.7 | 17.0 | 18.0(11) | 50.6 | 0.4 | 5.2 | 8.0 | 4.1(11) | 10.1 | 1.2 | 3.8 | 9.7 | 5.4(9) | 25.0 | 4.0 | 9.1 | 16.8 | 20.4(11) | <0.1 | 1.0 | 2.0 | 6.4 | – | 32.1 | 0.2 | 1.8 | 13.1 | 3.1(11) |
| B | 9 | 15.6 | 0.3 | 0.5 | 2.5 | 0.7(9) | 20.8 | 1.5 | 4.2 | 10.0 | 9.8(9) | 56.6 | 0.5 | 3.4 | 5.0 | 2.6(9) | 6.0 | 0.8 | 3.6 | 4.8 | 12.1(9) | 29.0 | 1.3 | 7.2 | 9.1 | 13.7(9) | 0.1 | 0.7 | 1.4 | 6.8 | – | 26.2 | 0.4 | 1.2 | 5.5 | 1.5(9) |
| C | 4 | 19.8 | 0.3 | 0.55 | 1.8 | 0.2(4) | 17.1 | 6.3 | 6.6 | 8.9 | 9.1(4) | 53.6 | 0.2 | 1.0 | 3.9 | 0.6(4) | 12.9 | 2.3 | 6.7 | 8.8 | 9.7(4) | 40.7 | 2.2 | 5.4 | 5.8 | 2.8(4) | <0.1 | 4.3 | 5.9 | 6.4 | – | 35.1 | 1.7 | 2.8 | 3.7 | 1.1(4) |
| D | 8 | 16.3 | 0.0 | 0.7 | 2.3 | 0.7(8) | 14.6 | 0.0 | 0.6 | 2.8 | 18(8) | 52.7 | 0.5 | 1.5 | 2.3 | 0.6(8) | 11.3 | 0.6 | 2.2 | 3.5 | 3.6(8) | 36.5 | 0.2 | 1.7 | 2.7 | 0.8(8) | <0.0 | 2.5 | 4.8 | 6.2 | 1.3(1) | 34.6 | 0.2 | 1.1 | 2.4 | 0.5(8) |
| E | 6 | 11.1 | 0.2 | 0.7 | 1.2 | 0.2(5) | 38.8 | 0.0 | 2.8 | 8.9 | 3.6(5) | 54.8 | 10.3 | 12.1 | 13.2 | 17(5) | 21.6 | 0.6 | 1.9 | 2.6 | 0.7(5) | 48.6 | 3.2 | 4.5 | 6.1 | 2.7(5) | 0.1 | 0.0 | 0.8 | 1.1 | – | 23.2 | 8.6 | 9.4 | 10.7 | 18.4(5) |
Notes:
The data shown are the following: (1) median percent prevalence of administration of the drugs within class according to the pharmacy services provider (PSP) across the measurable quarters; and (2) minimum, median, and maximum absolute difference in percentage prevalence of exposure between CDM and PSP across all quarters for which electronic MAR data were available in the facilities. Also shown are the results of an X2 goodness of fit test with the number of quarters for which sufficient data were available (df).
P ≤ 0.005;
P ≤ 0.001
Comparison of Selected Nursing Home Compare (NHC) Quality Measures Between the CDM Data Set and Data that Were Publicly Reported by the Centers for Medicare and Medicaid Services
| A | 31.3 | 0.3 | 3.4 | 7.8 | 4(7) | 2.9 | 0.3 | 1.8 | 7.8 | – | 2.1 | 0.0 | 1.8 | 4.3 | – | 2.8 | 0.0 | 1.3 | 3.9 | – | 20.5 | 0.6 | 3.3 | 5.8 | 5.3(11) | 31.7 | 0.4 | 4.7 | 19.5 | 6.2(10) | 4.3 | 0.3 | 2.5 | 5.1 | 7.4(3) |
| B | 18.8 | 0.5 | 2.1 | 5.1 | 3.7(7) | 3.1 | 0.3 | 4.1 | 7.5 | 80.4(5) | 1.0 | 0.0 | 0.8 | 1.6 | 0.3(2) | 1.7 | 0.0 | 0.7 | 3.1 | 8.1(2) | 22.1 | 0.3 | 1.6 | 6.2 | 6.2(11) | 39.4 | 0.6 | 4.8 | 8.9 | 15.3(11) | 1.8 | 0.0 | 0.7 | 1.9 | 2.6(2) |
| C | 16.3 | 0.0 | 2.7 | 4.5 | 4.5(7) | 3.0 | 0.5 | 1.2 | 5.4 | 23.4(6) | 0 | 0.0 | 0.9 | 1.8 | – | 2.1 | 0.7 | 0.9 | 3.6 | 27.5(7) | 14.1 | 0.8 | 2.9 | 7.1 | 14.9(11) | 20.5 | 2.2 | 4.9 | 17.2 | 60.7(11) | 4.0 | 0.1 | 1.2 | 5.1 | 3.9(5) |
| D | 13.4 | 0.0 | 0.6 | 2.8 | 1.1(7) | 1.0 | 2.0 | 4.0 | 4.7 | 52.1(3) | 2.5 | 0.0 | 0.6 | 4.9 | 12.2(3) | 2.2 | 0.8 | 2.0 | 5.1 | 32.8(10) | 20.9 | 0.3 | 2.6 | 8.9 | 12.5(11) | 32.9 | 0.0 | 2.2 | 6.4 | 7.3(11) | 3.6 | 0.3 | 0.7 | 1.8 | 1.6(6) |
| E | 34.6 | 0.6 | 3.2 | 7.5 | 4.0(7) | 2.7 | 0.0 | 2.0 | 8.0 | – | 11.7 | 0.1 | 1.35 | 6.2 | 5.1(7) | 5.2 | 0.5 | 1.7 | 5.3 | 2.7(3) | 19.7 | 0.3 | 2.7 | 14.5 | 3.3(8) | 29.9 | 0.0 | 6.5 | 21.5 | 7.5(7) | 4.8 | 0.5 | 1.9 | 3.3 | 1.5(2) |
Notes:
The comparison covers 11 quarters, from the second quarter of 2011 through the fourth quarter of 2013—except for the two antipsychotic medication measures, for which data were available for only 7 quarters across all facilities, and all other measures for Facility E for which MDS 3 data were available for 8 quarters. The data shown are the following: (1) the median percent prevalence of the measure according to NHC across the measurable quarters, (2) the minimum, median, and maximum absolute difference in percent prevalence across all quarters for which electronic MAR data were available in the facilities; and (3) the results of a X2 goodness of fit test with the number of quarters for which sufficient data were available (df).
P ≤ 0.05;
P ≤ 0.01