Nicolas Garcelon1, Antoine Neuraz2, Rémi Salomon3, Hassan Faour4, Vincent Benoit4, Arthur Delapalme4, Arnold Munnich5, Anita Burgun2, Bastien Rance6. 1. Institut Imagine, Paris Descartes Université Paris Descartes-Sorbonne Paris Cité, Paris, France; INSERM, Centre de Recherche des Cordeliers, UMR 1138 Equipe 22, Université Paris Descartes, Sorbonne Paris Cité, Paris, France. Electronic address: nicolas.garcelon@institutimagine.org. 2. INSERM, Centre de Recherche des Cordeliers, UMR 1138 Equipe 22, Université Paris Descartes, Sorbonne Paris Cité, Paris, France; Department of Medical informatics, Hôpital Necker-Enfant Malades, Assistance Publique des Hôpitaux de Paris, Paris, France. 3. Institut Imagine, Paris Descartes Université Paris Descartes-Sorbonne Paris Cité, Paris, France; Service de Néphrologie Pédiatrique, Hôpital Necker-Enfants Malades, Assistance Publique-Hôpitaux de Paris (AP-HP), Université Paris Descartes, Sorbonne Paris Cité, France. 4. Institut Imagine, Paris Descartes Université Paris Descartes-Sorbonne Paris Cité, Paris, France. 5. Institut Imagine, Paris Descartes Université Paris Descartes-Sorbonne Paris Cité, Paris, France; Département de génétique médicale, Hôpital Necker-Enfants Malades, Assistance Publique-Hôpitaux de Paris (AP-HP), Université Paris Descartes, Sorbonne Paris Cité, France; Centre de Référence des Maladies Osseuses Constitutionnelles, INSERM UMR 1163, Laboratoire de bases moléculaires et physiopathologiques de l'ostéochondrodysplasie, Paris Descartes-Sorbonne Paris Cité University, AP-HP, Institut Imagine, 75015 Paris, France. 6. INSERM, Centre de Recherche des Cordeliers, UMR 1138 Equipe 22, Université Paris Descartes, Sorbonne Paris Cité, Paris, France; Hôpital Européen Georges Pompidou, Assistance Publique-Hôpitaux de Paris (AP-HP), Université Paris Descartes, Sorbonne Paris Cité, France.
Abstract
INTRODUCTION: Clinical data warehouses are often oriented toward integration and exploration of coded data. However narrative reports are of crucial importance for translational research. This paper describes Dr. Warehouse®, an open source data warehouse oriented toward clinical narrative reports and designed to support clinicians' day-to-day use. METHOD: Dr. Warehouse relies on an original database model to focus on documents in addition to facts. Besides classical querying functionalities, the system provides an advanced search engine and Graphical User Interfaces adapted to the exploration of text. Dr. Warehouse is dedicated to translational research with cohort recruitment capabilities, high throughput phenotyping and patient centric views (including similarity metrics among patients). These features leverage Natural Language Processing based on the extraction of UMLS® concepts, as well as negation and family history detection. RESULTS: A survey conducted after 6 months of use at the Necker Children's Hospital shows a high rate of satisfaction among the users (96.6%). During this period, 122 users performed 2837 queries, accessed 4,267 patients' records and included 36,632 patients in 131 cohorts. The source code is available at this github link https://github.com/imagine-bdd/DRWH. A demonstration based on PubMed abstracts is available at https://imagine-plateforme-bdd.fr/dwh_pubmed/.
INTRODUCTION: Clinical data warehouses are often oriented toward integration and exploration of coded data. However narrative reports are of crucial importance for translational research. This paper describes Dr. Warehouse®, an open source data warehouse oriented toward clinical narrative reports and designed to support clinicians' day-to-day use. METHOD: Dr. Warehouse relies on an original database model to focus on documents in addition to facts. Besides classical querying functionalities, the system provides an advanced search engine and Graphical User Interfaces adapted to the exploration of text. Dr. Warehouse is dedicated to translational research with cohort recruitment capabilities, high throughput phenotyping and patient centric views (including similarity metrics among patients). These features leverage Natural Language Processing based on the extraction of UMLS® concepts, as well as negation and family history detection. RESULTS: A survey conducted after 6 months of use at the Necker Children's Hospital shows a high rate of satisfaction among the users (96.6%). During this period, 122 users performed 2837 queries, accessed 4,267 patients' records and included 36,632 patients in 131 cohorts. The source code is available at this github link https://github.com/imagine-bdd/DRWH. A demonstration based on PubMed abstracts is available at https://imagine-plateforme-bdd.fr/dwh_pubmed/.
Keywords:
Computational biology; Data warehouse; Electronic health records; Information storage and retrieval; Method; Rare diseases; Software; Text-mining
Authors: G Pouliquen; L Fillon; V Dangouloff-Ros; M Kuchenbuch; C Bar; N Chemaly; R Levy; C-J Roux; A Saitovitch; J Boisgontier; R Nabbout; N Boddaert Journal: AJNR Am J Neuroradiol Date: 2022-09-22 Impact factor: 4.966
Authors: Anna E Mason; Ethan S Sen; Agnieszka Bierzynska; Elizabeth Colby; Maryam Afzal; Guillaume Dorval; Ania B Koziell; Maggie Williams; Olivia Boyer; Gavin I Welsh; Moin A Saleem Journal: Clin J Am Soc Nephrol Date: 2020-04-21 Impact factor: 8.237
Authors: Alison Callahan; Vladimir Polony; José D Posada; Juan M Banda; Saurabh Gombar; Nigam H Shah Journal: J Am Med Inform Assoc Date: 2021-07-14 Impact factor: 4.497
Authors: Michelle M Clark; Amber Hildreth; Sergey Batalov; Yan Ding; Shimul Chowdhury; Kelly Watkins; Katarzyna Ellsworth; Brandon Camp; Cyrielle I Kint; Calum Yacoubian; Lauge Farnaes; Matthew N Bainbridge; Curtis Beebe; Joshua J A Braun; Margaret Bray; Jeanne Carroll; Julie A Cakici; Sara A Caylor; Christina Clarke; Mitchell P Creed; Jennifer Friedman; Alison Frith; Richard Gain; Mary Gaughran; Shauna George; Sheldon Gilmer; Joseph Gleeson; Jeremy Gore; Haiying Grunenwald; Raymond L Hovey; Marie L Janes; Kejia Lin; Paul D McDonagh; Kyle McBride; Patrick Mulrooney; Shareef Nahas; Daeheon Oh; Albert Oriol; Laura Puckett; Zia Rady; Martin G Reese; Julie Ryu; Lisa Salz; Erica Sanford; Lawrence Stewart; Nathaly Sweeney; Mari Tokita; Luca Van Der Kraan; Sarah White; Kristen Wigby; Brett Williams; Terence Wong; Meredith S Wright; Catherine Yamada; Peter Schols; John Reynders; Kevin Hall; David Dimmock; Narayanan Veeraraghavan; Thomas Defay; Stephen F Kingsmore Journal: Sci Transl Med Date: 2019-04-24 Impact factor: 19.319
Authors: Hyo Soung Cha; Jip Min Jung; Seob Yoon Shin; Young Mi Jang; Phillip Park; Jae Wook Lee; Seung Hyun Chung; Kui Son Choi Journal: Int J Environ Res Public Health Date: 2019-06-28 Impact factor: 3.390
Authors: Sandra Brasil; Carlota Pascoal; Rita Francisco; Vanessa Dos Reis Ferreira; Paula A Videira; And Gonçalo Valadão Journal: Genes (Basel) Date: 2019-11-27 Impact factor: 4.096