BACKGROUND: Manual chart review is labor-intensive and requires specialized knowledge possessed by highly trained medical professionals. Natural language processing (NLP) tools are distinctive in their ability to extract critical information from raw text in electronic health records (EHRs). As a proof of concept for the potential application of this technology, we examined the ability of NLP to correctly identify common elements described by surgeons in operative notes for total hip arthroplasty (THA). METHODS: We evaluated primary THAs that had been performed at a single academic institution from 2000 to 2015. A training sample of operative reports was randomly selected to develop prototype NLP algorithms, and additional operative reports were randomly selected as the test sample. Three separate algorithms were created with rules aimed at capturing (1) the operative approach, (2) the fixation method, and (3) the bearing surface category. The algorithms were applied to operative notes to evaluate the language used by 29 different surgeons at our center and were applied to EHR data from outside facilities to determine external validity. Accuracy statistics were calculated with use of manual chart review as the gold standard. RESULTS: The operative approach algorithm demonstrated an accuracy of 99.2% (95% confidence interval [CI], 97.1% to 99.9%). The fixation technique algorithm demonstrated an accuracy of 90.7% (95% CI, 86.8% to 93.8%). The bearing surface algorithm demonstrated an accuracy of 95.8% (95% CI, 92.7% to 97.8%). Additionally, the NLP algorithms applied to operative reports from other institutions yielded comparable performance, demonstrating external validity. CONCLUSIONS: NLP-enabled algorithms are a promising alternative to the current gold standard of manual chart review for identifying common data elements from orthopaedic operative notes. The present study provides a proof of concept for use of NLP techniques in clinical research studies and registry-development endeavors to reliably extract data of interest in an expeditious and cost-effective manner.
BACKGROUND: Manual chart review is labor-intensive and requires specialized knowledge possessed by highly trained medical professionals. Natural language processing (NLP) tools are distinctive in their ability to extract critical information from raw text in electronic health records (EHRs). As a proof of concept for the potential application of this technology, we examined the ability of NLP to correctly identify common elements described by surgeons in operative notes for total hip arthroplasty (THA). METHODS: We evaluated primary THAs that had been performed at a single academic institution from 2000 to 2015. A training sample of operative reports was randomly selected to develop prototype NLP algorithms, and additional operative reports were randomly selected as the test sample. Three separate algorithms were created with rules aimed at capturing (1) the operative approach, (2) the fixation method, and (3) the bearing surface category. The algorithms were applied to operative notes to evaluate the language used by 29 different surgeons at our center and were applied to EHR data from outside facilities to determine external validity. Accuracy statistics were calculated with use of manual chart review as the gold standard. RESULTS: The operative approach algorithm demonstrated an accuracy of 99.2% (95% confidence interval [CI], 97.1% to 99.9%). The fixation technique algorithm demonstrated an accuracy of 90.7% (95% CI, 86.8% to 93.8%). The bearing surface algorithm demonstrated an accuracy of 95.8% (95% CI, 92.7% to 97.8%). Additionally, the NLP algorithms applied to operative reports from other institutions yielded comparable performance, demonstrating external validity. CONCLUSIONS: NLP-enabled algorithms are a promising alternative to the current gold standard of manual chart review for identifying common data elements from orthopaedic operative notes. The present study provides a proof of concept for use of NLP techniques in clinical research studies and registry-development endeavors to reliably extract data of interest in an expeditious and cost-effective manner.
Authors: Sunghwan Sohn; Cheryl Clark; Scott R Halgrim; Sean P Murphy; Christopher G Chute; Hongfang Liu Journal: J Am Med Inform Assoc Date: 2014-03-17 Impact factor: 4.497
Authors: Joshua C Denny; Marylyn D Ritchie; Melissa A Basford; Jill M Pulley; Lisa Bastarache; Kristin Brown-Gentry; Deede Wang; Dan R Masys; Dan M Roden; Dana C Crawford Journal: Bioinformatics Date: 2010-03-24 Impact factor: 6.937
Authors: Hilal Maradit Kremers; Dirk R Larson; Cynthia S Crowson; Walter K Kremers; Raynard E Washington; Claudia A Steiner; William A Jiranek; Daniel J Berry Journal: J Bone Joint Surg Am Date: 2015-09-02 Impact factor: 5.284
Authors: Kavishwar B Wagholikar; Kathy L MacLaughlin; Michael R Henry; Robert A Greenes; Ronald A Hankey; Hongfang Liu; Rajeev Chaudhry Journal: J Am Med Inform Assoc Date: 2012-04-29 Impact factor: 4.497
Authors: Hongfang Liu; Suzette J Bielinski; Sunghwan Sohn; Sean Murphy; Kavishwar B Wagholikar; Siddhartha R Jonnalagadda; K E Ravikumar; Stephen T Wu; Iftikhar J Kullo; Christopher G Chute Journal: AMIA Jt Summits Transl Sci Proc Date: 2013-03-18
Authors: Sunyang Fu; Guilherme S Lopes; Sandeep R Pagali; Bjoerg Thorsteinsdottir; Nathan K LeBrasseur; Andrew Wen; Hongfang Liu; Walter A Rocca; Janet E Olson; Jennifer St Sauver; Sunghwan Sohn Journal: J Gerontol A Biol Sci Med Sci Date: 2022-03-03 Impact factor: 6.053
Authors: Aditya V Karhade; Jacobien H F Oosterhoff; Olivier Q Groot; Nicole Agaronnik; Jeffrey Ehresman; Michiel E R Bongers; Ruurd L Jaarsma; Santosh I Poonnoose; Daniel M Sciubba; Daniel G Tobert; Job N Doornberg; Joseph H Schwab Journal: Clin Orthop Relat Res Date: 2022-04-12 Impact factor: 4.755
Authors: Gyuseon Song; Su Jin Chung; Ji Yeon Seo; Sun Young Yang; Eun Hyo Jin; Goh Eun Chung; Sung Ryul Shim; Soonok Sa; Moongi Simon Hong; Kang Hyun Kim; Eunchan Jang; Chae Won Lee; Jung Ho Bae; Hyun Wook Han Journal: J Clin Med Date: 2022-05-24 Impact factor: 4.964
Authors: Elham Sagheb; Taghi Ramazanian; Ahmad P Tafti; Sunyang Fu; Walter K Kremers; Daniel J Berry; David G Lewallen; Sunghwan Sohn; Hilal Maradit Kremers Journal: J Arthroplasty Date: 2020-10-10 Impact factor: 4.757
Authors: Olivier Q Groot; Michiel E R Bongers; Paul T Ogink; Joeky T Senders; Aditya V Karhade; Jos A M Bramer; Jorrit-Jan Verlaan; Joseph H Schwab Journal: Clin Orthop Relat Res Date: 2020-12 Impact factor: 4.755
Authors: Sunyang Fu; Cody C Wyles; Douglas R Osmon; Martha L Carvour; Elham Sagheb; Taghi Ramazanian; Walter K Kremers; David G Lewallen; Daniel J Berry; Sunghwan Sohn; Hilal Maradit Kremers Journal: J Arthroplasty Date: 2020-08-05 Impact factor: 4.757
Authors: Sunyang Fu; Guilherme S Lopes; Sandeep R Pagali; Bjoerg Thorsteinsdottir; Nathan K LeBrasseur; Andrew Wen; Hongfang Liu; Walter A Rocca; Janet E Olson; Jennifer St Sauver; Sunghwan Sohn Journal: J Gerontol A Biol Sci Med Sci Date: 2022-03-03 Impact factor: 6.053