Vincent Damotte1, Antoine Lizée2, Matthew Tremblay3, Alisha Agrawal1, Pouya Khankhanian4, Adam Santaniello1, Refujia Gomez1, Robin Lincoln1, Wendy Tang1, Tiffany Chen1, Nelson Lee5, Pablo Villoslada6, Jill A Hollenbach1, Carolyn D Bevan1, Jennifer Graves1, Riley Bove1, Douglas S Goodin1, Ari J Green1, Sergio E Baranzini1, Bruce Ac Cree1, Roland G Henry1, Stephen L Hauser1, Jeffrey M Gelfand1, Pierre-Antoine Gourraud2. 1. MS Genetics, Department of Neurology, School of Medicine, University of California San Francisco (UCSF), San Francisco, CA, USA. 2. MS Genetics, Department of Neurology, School of Medicine, University of California San Francisco (UCSF), San Francisco, CA, USA/Université de Nantes, INSERM, UMR 1064, ATIP-Avenir, Equipe 5 Centre de Recherche en Transplantation et Immunologie, Nantes, France. 3. MS Genetics, Department of Neurology, School of Medicine, University of California San Francisco (UCSF), San Francisco, CA, USA/Department of Neurology, John Dempsey Hospital, University of Connecticut Health Center, Farmington, CT, USA. 4. MS Genetics, Department of Neurology, School of Medicine, University of California San Francisco (UCSF), San Francisco, CA, USA/Center for Neuroengineering and Therapeutics, University of Pennsylvania, Philadelphia, PA, USA. 5. Information Technology, University of California San Francisco (UCSF), San Francisco, CA, USA. 6. MS Genetics, Department of Neurology, School of Medicine, University of California San Francisco (UCSF), San Francisco, CA, USA/IDIBAPS-Hospital Clinic of Barcelona, Barcelona, Spain.
Abstract
BACKGROUND: Electronic medical records (EMR) data are increasingly used in research, but no studies have yet evaluated similarity between EMR and research-quality data and between characteristics of an EMR multiple sclerosis (MS) population and known natural MS history. OBJECTIVES: To (1) identify MS patients in an EMR system and extract clinical data, (2) compare EMR-extracted data with gold-standard research data, and (3) compare EMR MS population characteristics to expected MS natural history. METHODS: Algorithms were implemented to identify MS patients from the University of California San Francisco EMR, de-identify the data and extract clinical variables. EMR-extracted data were compared to research cohort data in a subset of patients. RESULTS: We identified 4142 MS patients via search of the EMR and extracted their clinical data with good accuracy. EMR and research values showed good concordance for Expanded Disability Status Scale (EDSS), timed-25-foot walk, and subtype. We replicated several expected MS epidemiological features from MS natural history including higher EDSS for progressive versus relapsing-remitting patients and for male versus female patients and increased EDSS with age at examination and disease duration. CONCLUSION: Large real-world cohorts algorithmically extracted from the EMR can expand opportunities for MS clinical research.
BACKGROUND: Electronic medical records (EMR) data are increasingly used in research, but no studies have yet evaluated similarity between EMR and research-quality data and between characteristics of an EMR multiple sclerosis (MS) population and known natural MS history. OBJECTIVES: To (1) identify MS patients in an EMR system and extract clinical data, (2) compare EMR-extracted data with gold-standard research data, and (3) compare EMR MS population characteristics to expected MS natural history. METHODS: Algorithms were implemented to identify MS patients from the University of California San Francisco EMR, de-identify the data and extract clinical variables. EMR-extracted data were compared to research cohort data in a subset of patients. RESULTS: We identified 4142 MS patients via search of the EMR and extracted their clinical data with good accuracy. EMR and research values showed good concordance for Expanded Disability Status Scale (EDSS), timed-25-foot walk, and subtype. We replicated several expected MS epidemiological features from MS natural history including higher EDSS for progressive versus relapsing-remitting patients and for male versus female patients and increased EDSS with age at examination and disease duration. CONCLUSION: Large real-world cohorts algorithmically extracted from the EMR can expand opportunities for MS clinical research.
Entities:
Keywords:
Electronic medical records; natural language processing
Authors: Beata Fonferko-Shadrach; Arron S Lacey; Angus Roberts; Ashley Akbari; Simon Thompson; David V Ford; Ronan A Lyons; Mark I Rees; William Owen Pickrell Journal: BMJ Open Date: 2019-04-01 Impact factor: 2.692
Authors: Valerie J Block; Erica A Pitsch; Arpita Gopal; Chao Zhao; Mark J Pletcher; Gregory M Marcus; Jeffrey E Olgin; Jill Hollenbach; Riley Bove; Bruce A C Cree; Jeffrey M Gelfand Journal: J Neurol Date: 2021-08-17 Impact factor: 4.849
Authors: Jeffrey A Cohen; Maria Trojano; Ellen M Mowry; Bernard Mj Uitdehaag; Stephen C Reingold; Ruth Ann Marrie Journal: Mult Scler Date: 2019-11-28 Impact factor: 6.312
Authors: Milena A Gianfrancesco; Laura Trupin; Charles E McCulloch; Stephen Shiboski; Gabriela Schmajuk; Jinoos Yazdany Journal: ACR Open Rheumatol Date: 2019-04-10