Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Finding scientific topics.

Literature DB >> 14872004

Finding scientific topics.

Abstract

A first step in identifying the content of a document is determining which topics that document addresses. We describe a generative model for documents, introduced by Blei, Ng, and Jordan [Blei, D. M., Ng, A. Y. & Jordan, M. I. (2003) J. Machine Learn. Res. 3, 993-1022], in which each document is generated by choosing a distribution over topics and then choosing each word in the document from a topic selected according to this distribution. We then present a Markov chain Monte Carlo algorithm for inference in this model. We use this algorithm to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics. We show that the extracted topics capture meaningful structure in the data, consistent with the class designations provided by the authors of the articles, and outline further applications of this analysis, including identifying "hot topics" by examining temporal dynamics and tagging abstracts to illustrate semantic content.

Mesh：

Year: 2004 PMID： 14872004 PMCID： PMC387300 DOI： 10.1073/pnas.0307752101

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN： 0027-8424 Impact factor: 11.205

2 in total

1. Stochastic relaxation, gibbs distributions, and the bayesian restoration of images.

Authors: S Geman; D Geman
Journal: IEEE Trans Pattern Anal Mach Intell Date: 1984-06 Impact factor: 6.226

2. Fundamental theorem of natural selection under gene-culture transmission.

Authors: C S Findlay
Journal: Proc Natl Acad Sci U S A Date: 1991-06-01 Impact factor: 11.205

2 in total

193 in total

Finding scientific topics.

1. Stochastic relaxation, gibbs distributions, and the bayesian restoration of images.

2. Fundamental theorem of natural selection under gene-culture transmission.

1. Mapping subsets of scholarly information.

2. From paragraph to graph: latent semantic analysis for information visualization.

3. Mixed-membership models of scientific publications.

4. Mapping knowledge domains: characterizing PNAS.

5. Mapping annotations with textual evidence using an scLDA model.

6. A LDA-based approach to promoting ranking diversity for genomics information retrieval.

7. Reconceptualizing the classification of PNAS articles.

8. Effects of event knowledge in processing verbal arguments.

9. Pairwise Latent Semantic Association for Similarity Computation in Medical Imaging.

10. Structured Correspondence Topic Models for Mining Captioned Figures in Biological Literature.