Yuying Chen1, Huacong Wen1, Russel Griffin2, Mary Joan Roach3,4, Michael L Kelly5. 1. Department of Physical Medicine and Rehabilitation, University of Alabama at Birmingham, Birmingham, Alabama. 2. Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, Alabama. 3. Department of Physical Medicine and Rehabilitation, Case Western Reserve University School of Medicine, Cleveland, Ohio. 4. Center for Health Research & Policy, MetroHealth Medical System, Cleveland, Ohio. 5. Department of Neurosurgery, Case Western Reserve University School of Medicine, MetroHealth Medical Center, Cleveland, Ohio.
Abstract
BACKGROUND: Linking records from the National Spinal Cord Injury Model Systems (SCIMS) database to the National Trauma Data Bank (NTDB) provides a unique opportunity to study early variables in predicting long-term outcomes after traumatic spinal cord injury (SCI). The public use data sets of SCIMS and NTDB are stripped of protected health information, including dates and zip code. OBJECTIVES: To develop and validate a probabilistic algorithm linking data from an SCIMS center and its affiliated trauma registry. METHOD: Data on SCI admissions 2011-2018 were retrieved from an SCIMS center (n = 302) and trauma registry (n = 723), of which 202 records had the same medical record number. The SCIMS records were divided equally into two data sets for algorithm development and validation, respectively. We used a two-step approach: blocking and weight generation for linking variables (race, insurance, height, and weight). RESULTS: In the development set, 257 SCIMS-trauma pairs shared the same sex, age, and injury year across 129 clusters, of which 91 records were true-match. The probabilistic algorithm identified 65 of the 91 true-match records (sensitivity, 71.4%) with a positive predictive value (PPV) of 80.2%. The algorithm was validated over 282 SCIMS-trauma pairs across 127 clusters and had a sensitivity of 73.7% and PPV of 81.1%. Post hoc analysis shows the addition of injury date and zip code improved the specificity from 57.9% to 94.7%. CONCLUSION: We demonstrate the feasibility of probabilistic linkage between SCIMS and trauma records, which needs further refinement and validation. Gaining access to injury date and zip code would improve record linkage significantly.
BACKGROUND: Linking records from the National Spinal Cord Injury Model Systems (SCIMS) database to the National Trauma Data Bank (NTDB) provides a unique opportunity to study early variables in predicting long-term outcomes after traumatic spinal cord injury (SCI). The public use data sets of SCIMS and NTDB are stripped of protected health information, including dates and zip code. OBJECTIVES: To develop and validate a probabilistic algorithm linking data from an SCIMS center and its affiliated trauma registry. METHOD: Data on SCI admissions 2011-2018 were retrieved from an SCIMS center (n = 302) and trauma registry (n = 723), of which 202 records had the same medical record number. The SCIMS records were divided equally into two data sets for algorithm development and validation, respectively. We used a two-step approach: blocking and weight generation for linking variables (race, insurance, height, and weight). RESULTS: In the development set, 257 SCIMS-trauma pairs shared the same sex, age, and injury year across 129 clusters, of which 91 records were true-match. The probabilistic algorithm identified 65 of the 91 true-match records (sensitivity, 71.4%) with a positive predictive value (PPV) of 80.2%. The algorithm was validated over 282 SCIMS-trauma pairs across 127 clusters and had a sensitivity of 73.7% and PPV of 81.1%. Post hoc analysis shows the addition of injury date and zip code improved the specificity from 57.9% to 94.7%. CONCLUSION: We demonstrate the feasibility of probabilistic linkage between SCIMS and trauma records, which needs further refinement and validation. Gaining access to injury date and zip code would improve record linkage significantly.
Authors: Matthew R Kesinger; Shannon B Juengst; Hillary Bertisch; Janet P Niemeier; Jason W Krellman; Mary Jo Pugh; Raj G Kumar; Jason L Sperry; Patricia M Arenth; Jesse R Fann; Amy K Wagner Journal: Arch Phys Med Rehabil Date: 2016-03-14 Impact factor: 3.966
Authors: Matthew Ryan Kesinger; Raj Gopalan Kumar; Anne Connelly Ritter; Jason Lee Sperry; Amy Kathleen Wagner Journal: Am J Phys Med Rehabil Date: 2017-01 Impact factor: 2.159
Authors: Yuying Chen; Anne Deutsch; Michael J DeVivo; Kurt Johnson; Claire Z Kalpakjian; Gregory Nemunaitis; David Tulsky Journal: Arch Phys Med Rehabil Date: 2011-03 Impact factor: 3.966
Authors: Raj G Kumar; Zhensheng Wang; Matthew R Kesinger; Mark Newman; Toan T Huynh; Janet P Niemeier; Jason L Sperry; Amy K Wagner Journal: Am J Phys Med Rehabil Date: 2018-04 Impact factor: 2.159