Literature DB >> 25122604

Identifying Similar Cases in Document Networks Using Cross-Reference Structures.

Taxiarchis Botsis, John Scott, Emily Jane Woo, Robert Ball.   

Abstract

Our objective was to explore the creation of document networks based on different thresholds of shared information and different clustering algorithms on those networks to identify document clusters describing similar clinical cases. We created networks from vaccine adverse event report sets using seven approaches for linking reports. We then applied three clustering algorithms [visualization of similarities (VOS), Louvain, k-means] to these networks and evaluated their ability to identify known clusters. The report sets included one simulated set and three sets from the Vaccine Adverse Event Reporting System; each was split into training and testing subsets. Training subsets were used to estimate parameter values for the clustering algorithms and testing subsets to evaluate clusters. We created the networks by linking reports based on shared information in the form either of individual Medical Dictionary for Regulatory Activities Preferred Terms (PTs) or of dyads, triplets, quadruplets, quintuplets, and sextuplets of PTs; we created another network by weighting the single PT network connections by Lin's information theoretic approach to similarity. We then repeated this entire process using networks based on text mining output rather than structured data. We evaluated report clustering using recall, precision, and f-measure. The VOS algorithm outperformed Louvain and k-means in general. The best weighting scheme appeared to be related to the complexity of the known cluster. For example, singleton weighting performed best for an intussusception cluster driven by a single PT. We observed marginal differences between the code- and textual-based clustering. In conclusion, our approach supported identification of similar nodes in a document network.

Entities:  

Mesh:

Year:  2014        PMID: 25122604     DOI: 10.1109/JBHI.2014.2345873

Source DB:  PubMed          Journal:  IEEE J Biomed Health Inform        ISSN: 2168-2194            Impact factor:   5.772


  3 in total

1.  Application of Natural Language Processing and Network Analysis Techniques to Post-market Reports for the Evaluation of Dose-related Anti-Thymocyte Globulin Safety Patterns.

Authors:  Taxiarchis Botsis; Matthew Foster; Nina Arya; Kory Kreimeyer; Abhishek Pandey; Deepa Arya
Journal:  Appl Clin Inform       Date:  2017-04-26       Impact factor: 2.342

2.  "Artificial Intelligence" for Pharmacovigilance: Ready for Prime Time?

Authors:  Robert Ball; Gerald Dal Pan
Journal:  Drug Saf       Date:  2022-05-17       Impact factor: 5.228

3.  Monitoring biomedical literature for post-market safety purposes by analyzing networks of text-based coded information.

Authors:  Taxiarchis Botsis; Matthew Foster; Kory Kreimeyer; Abhishek Pandey; Richard Forshee
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2017-07-26
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.