| Literature DB >> 35257868 |
Xiaobei Zhou1, Miao Zhou1, Desheng Huang2, Lei Cui3.
Abstract
The co-occurrence analysis of Medical Subject Heading (MeSH) terms extracted from the PubMed database is popularly used in bibliometrics. Practically for making the result interpretable, it is necessary to apply a certain filter procedure of co-occurrence matrix for removing the low-frequency items due to their low representativeness. Unfortunately, there is rare research referring to determine a critical threshold to remove the noise of co-occurrence matrix. Here, we proposed a probabilistic model for co-occurrence analysis that can provide statistical inferences about whether the paired items co-occur randomly. With help of this model, the dimensionality of co-occurrence matrix could be reduced according to the selected threshold. The conceptual model framework, simulation and practical applications are illustrated in the manuscript. Further details (including all reproducible codes) can be downloaded from the project website: https://github.com/xizhou/co-occurrence-analysis.git.Entities:
Keywords: Bibliometrics; Co-occurrence analysis; MeSH term; Probabilistic model; Simulation
Mesh:
Year: 2022 PMID: 35257868 DOI: 10.1016/j.jbi.2022.104047
Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 8.000