Literature DB >> 15020766

Mixed-membership models of scientific publications.

Elena Erosheva1, Stephen Fienberg, John Lafferty.   

Abstract

PNAS is one of world's most cited multidisciplinary scientific journals. The PNAS official classification structure of subjects is reflected in topic labels submitted by the authors of articles, largely related to traditionally established disciplines. These include broad field classifications into physical sciences, biological sciences, social sciences, and further subtopic classifications within the fields. Focusing on biological sciences, we explore an internal soft-classification structure of articles based only on semantic decompositions of abstracts and bibliographies and compare it with the formal discipline classifications. Our model assumes that there is a fixed number of internal categories, each characterized by multinomial distributions over words (in abstracts) and references (in bibliographies). Soft classification for each article is based on proportions of the article's content coming from each category. We discuss the appropriateness of the model for the PNAS database as well as other features of the data relevant to soft classification.

Mesh:

Year:  2004        PMID: 15020766      PMCID: PMC387299          DOI: 10.1073/pnas.0307760101

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  7 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation.

Authors:  P Tamayo; D Slonim; J Mesirov; Q Zhu; S Kitareewan; E Dmitrovsky; E S Lander; T R Golub
Journal:  Proc Natl Acad Sci U S A       Date:  1999-03-16       Impact factor: 11.205

3.  Genetic structure of human populations.

Authors:  Noah A Rosenberg; Jonathan K Pritchard; James L Weber; Howard M Cann; Kenneth K Kidd; Lev A Zhivotovsky; Marcus W Feldman
Journal:  Science       Date:  2002-12-20       Impact factor: 47.728

4.  Finding scientific topics.

Authors:  Thomas L Griffiths; Mark Steyvers
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-10       Impact factor: 11.205

5.  The PNAS way back then.

Authors:  S Mac Lane
Journal:  Proc Natl Acad Sci U S A       Date:  1997-06-10       Impact factor: 11.205

6.  Mathematical typology: a grade of membership technique for obtaining disease definition.

Authors:  M A Woodbury; J Clive; A Garson
Journal:  Comput Biomed Res       Date:  1978-06

7.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

  7 in total
  20 in total

1.  From paragraph to graph: latent semantic analysis for information visualization.

Authors:  Thomas K Landauer; Darrell Laham; Marcia Derr
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-22       Impact factor: 11.205

2.  Mapping knowledge domains: characterizing PNAS.

Authors:  Kevin W Boyack
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-12       Impact factor: 11.205

3.  Reconceptualizing the classification of PNAS articles.

Authors:  Edoardo M Airoldi; Elena A Erosheva; Stephen E Fienberg; Cyrille Joutard; Tanzy Love; Suyash Shringarpure
Journal:  Proc Natl Acad Sci U S A       Date:  2010-11-15       Impact factor: 11.205

4.  Beyond prediction: A framework for inference with variational approximations in mixture models.

Authors:  T Westling; T H McCormick
Journal:  J Comput Graph Stat       Date:  2019-06-26       Impact factor: 2.302

5.  A general population-genetic model for the production by population structure of spurious genotype-phenotype associations in discrete, admixed or spatially distributed populations.

Authors:  Noah A Rosenberg; Magnus Nordborg
Journal:  Genetics       Date:  2006-04-02       Impact factor: 4.562

6.  mStruct: inference of population structure in light of both genetic admixing and allele mutations.

Authors:  Suyash Shringarpure; Eric P Xing
Journal:  Genetics       Date:  2009-04-10       Impact factor: 4.562

7.  DESCRIBING DISABILITY THROUGH INDIVIDUAL-LEVEL MIXTURE MODELS FOR MULTIVARIATE BINARY DATA.

Authors:  Elena A Erosheva; Stephen E Fienberg; Cyrille Joutard
Journal:  Ann Appl Stat       Date:  2007       Impact factor: 2.083

8.  Topic model for Chinese medicine diagnosis and prescription regularities analysis: case on diabetes.

Authors:  Xiao-Ping Zhang; Xue-Zhong Zhou; Hou-Kuan Huang; Qi Feng; Shi-Bo Chen; Bao-Yan Liu
Journal:  Chin J Integr Med       Date:  2011-04-21       Impact factor: 1.978

9.  GLAD: a mixed-membership model for heterogeneous tumor subtype classification.

Authors:  Hachem Saddiki; Jon McAuliffe; Patrick Flaherty
Journal:  Bioinformatics       Date:  2014-09-29       Impact factor: 6.937

10.  Longitudinal Mixed Membership Trajectory Models for Disability Survey Data.

Authors:  Daniel Manrique-Vallier
Journal:  Ann Appl Stat       Date:  2014-12       Impact factor: 2.083

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.