| Literature DB >> 32993795 |
Patricio Wolff1, Sebastián Ríos1, David Clavijo1, Manuel Graña2, Miguel Carrasco3.
Abstract
BACKGROUND: Medical knowledge is accumulated in scientific research papers along time. In order to exploit this knowledge by automated systems, there is a growing interest in developing text mining methodologies to extract, structure, and analyze in the shortest time possible the knowledge encoded in the large volume of medical literature. In this paper, we use the Latent Dirichlet Allocation approach to analyze the correlation between funding efforts and actually published research results in order to provide the policy makers with a systematic and rigorous tool to assess the efficiency of funding programs in the medical area.Entities:
Keywords: Data science; Healthcare management; Latent Dirichlet allocation; Machine learning; Strategy
Mesh:
Year: 2020 PMID: 32993795 PMCID: PMC7523397 DOI: 10.1186/s13326-020-00226-w
Source DB: PubMed Journal: J Biomed Semantics
Monthly and annual distribution of the Research Articles downloaded from RevMed (2012-2015)
| 16 | 15 | 16 | 15 | 17 | 15 | 14 | 12 | 10 | 16 | 13 | 11 | ||
| 14 | 15 | 16 | 12 | 13 | 13 | 12 | 13 | 14 | 14 | 14 | 15 | ||
| 13 | 15 | 13 | 14 | 14 | 13 | 8 | 13 | 14 | 13 | 14 | 14 | ||
| 13 | 12 | 11 | 13 | 14 | 13 | 14 | 14 | 12 | 12 | 12 | 10 | ||
Fig. 1Methodological steps for the analysis of medical research funding from the literature
Fig. 2LDA’s model graphic representation
Fig. 3Plot of the metrics for the identification of the optimal number of topics for the analysis. The upper part corresponds to the metrics that are minimized, the lower part for the metric that is maximized
Fig. 4Distribution of the number of documents per topic for LDA and LSA algorithms
Fig. 5Visualization of the 50 most relevant topics found by LDA in the corpus. Circle diameter is proportional to topic relevance
Fig. 6The 30 most frequent terms of the corpus
Fig. 7Visualization of topic grouping done by the team of experts based on semantic grounds
Fig. 8Nature of funding sources for each group of topics