Literature DB >> 33411774

Two-stage topic modelling of scientific publications: A case study of University of Nairobi, Kenya.

Leacky Muchene1, Wende Safari2.   

Abstract

Unsupervised statistical analysis of unstructured data has gained wide acceptance especially in natural language processing and text mining domains. Topic modelling with Latent Dirichlet Allocation is one such statistical tool that has been successfully applied to synthesize collections of legal, biomedical documents and journalistic topics. We applied a novel two-stage topic modelling approach and illustrated the methodology with data from a collection of published abstracts from the University of Nairobi, Kenya. In the first stage, topic modelling with Latent Dirichlet Allocation was applied to derive the per-document topic probabilities. To more succinctly present the topics, in the second stage, hierarchical clustering with Hellinger distance was applied to derive the final clusters of topics. The analysis showed that dominant research themes in the university include: HIV and malaria research, research on agricultural and veterinary services as well as cross-cutting themes in humanities and social sciences. Further, the use of hierarchical clustering in the second stage reduces the discovered latent topics to clusters of homogeneous topics.

Entities:  

Year:  2021        PMID: 33411774      PMCID: PMC7790388          DOI: 10.1371/journal.pone.0243208

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  10 in total

1.  Finding scientific topics.

Authors:  Thomas L Griffiths; Mark Steyvers
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-10       Impact factor: 11.205

2.  Applications of text mining within systematic reviews.

Authors:  James Thomas; John McNaught; Sophia Ananiadou
Journal:  Res Synth Methods       Date:  2011-04-11       Impact factor: 5.273

3.  Capturing the trend of mHealth research using text mining.

Authors:  Hyejin Park; Min Sook Park
Journal:  Mhealth       Date:  2019-10-11

Review 4.  Utilization of open source electronic health record around the world: A systematic review.

Authors:  Farzaneh Aminpour; Farahnaz Sadoughi; Maryam Ahamdi
Journal:  J Res Med Sci       Date:  2014-01       Impact factor: 1.852

5.  Decomposing biodiversity data using the Latent Dirichlet Allocation model, a probabilistic multivariate statistical method.

Authors:  Denis Valle; Benjamin Baiser; Christopher W Woodall; Robin Chazdon
Journal:  Ecol Lett       Date:  2014-10-17       Impact factor: 9.492

6.  Topic modeling for cluster analysis of large biological and medical datasets.

Authors:  Weizhong Zhao; Wen Zou; James J Chen
Journal:  BMC Bioinformatics       Date:  2014-10-21       Impact factor: 3.169

7.  Discovering health topics in social media using topic models.

Authors:  Michael J Paul; Mark Dredze
Journal:  PLoS One       Date:  2014-08-01       Impact factor: 3.240

8.  Application of dynamic topic models to toxicogenomics data.

Authors:  Mikyung Lee; Zhichao Liu; Ruili Huang; Weida Tong
Journal:  BMC Bioinformatics       Date:  2016-10-06       Impact factor: 3.169

Review 9.  Why sub-Saharan Africa lags in electronic health record adoption and possible strategies to increase its adoption in this region.

Authors:  Florence Femi Odekunle; Raphael Oluseun Odekunle; Srinivasan Shankar
Journal:  Int J Health Sci (Qassim)       Date:  2017 Sep-Oct

10.  Implementing an Open Source Electronic Health Record System in Kenyan Health Care Facilities: Case Study.

Authors:  Naomi Muinga; Steve Magare; Jonathan Monda; Onesmus Kamau; Stuart Houston; Hamish Fraser; John Powell; Mike English; Chris Paton
Journal:  JMIR Med Inform       Date:  2018-04-18
  10 in total
  2 in total

1.  Discovering Thematically Coherent Biomedical Documents Using Contextualized Bidirectional Encoder Representations from Transformers-Based Clustering.

Authors:  Khishigsuren Davagdorj; Ling Wang; Meijing Li; Van-Huy Pham; Keun Ho Ryu; Nipon Theera-Umpon
Journal:  Int J Environ Res Public Health       Date:  2022-05-12       Impact factor: 4.614

Review 2.  Applications of natural language processing in ophthalmology: present and future.

Authors:  Jimmy S Chen; Sally L Baxter
Journal:  Front Med (Lausanne)       Date:  2022-08-08
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.