George Cernile1, Trevor Heritage1, Neil J Sebire2, Ben Gordon3, Taralyn Schwering1, Shana Kazemlou1, Yulia Borecki1.
Abstract
INTRODUCTION: Numerous scientific journal articles related to COVID-19 have been rapidly published, making navigation and understanding of relationships difficult.
METHODS: A graph network was constructed from the publicly available COVID-19 Open Research Dataset (CORD-19) of COVID-19-related publications using an engine leveraging medical knowledge bases to identify discrete medical concepts and an open-source tool (Gephi) to visualise the network.
RESULTS: The network shows connections between diseases, medications and procedures identified from the title and abstract of 195 958 COVID-19-related publications (CORD-19 Dataset). Connections between terms with few publications, those unconnected to the main network and those irrelevant were not displayed. Nodes were coloured by knowledge base and the size of the node related to the number of publications containing the term. The data set and visualisations were made publicly accessible via a webtool.
CONCLUSION: Knowledge management approaches (text mining and graph networks) can effectively allow rapid navigation and exploration of entity inter-relationships to improve understanding of diseases such as COVID-19. © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.
INTRODUCTION: Numerous scientific journal articles related to COVID-19 have been rapidly published, making navigation and understanding of relationships difficult.
METHODS: A graph network was constructed from the publicly available COVID-19 Open Research Dataset (CORD-19) of COVID-19-related publications using an engine leveraging medical knowledge bases to identify discrete medical concepts and an open-source tool (Gephi) to visualise the network.
RESULTS: The network shows connections between diseases, medications and procedures identified from the title and abstract of 195 958 COVID-19-related publications (CORD-19 Dataset). Connections between terms with few publications, those unconnected to the main network and those irrelevant were not displayed. Nodes were coloured by knowledge base and the size of the node related to the number of publications containing the term. The data set and visualisations were made publicly accessible via a webtool.
CONCLUSION: Knowledge management approaches (text mining and graph networks) can effectively allow rapid navigation and exploration of entity inter-relationships to improve understanding of diseases such as COVID-19. © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.
Entities:
Keywords:
BMJ health informatics; health care; information science; medical informatics
Year: 2021
PMID: 33419870 DOI: 10.1136/bmjhci-2020-100254
Source DB: PubMed Journal: BMJ Health Care Inform ISSN: 2632-1009