Literature DB >> 17600104

A document clustering and ranking system for exploring MEDLINE citations.

Yongjing Lin1, Wenyuan Li, Keke Chen, Ying Liu.   

Abstract

OBJECTIVE: A major problem faced in biomedical informatics involves how best to present information retrieval results. When a single query retrieves many results, simply showing them as a long list often provides poor overview. With a goal of presenting users with reduced sets of relevant citations, this study developed an approach that retrieved and organized MEDLINE citations into different topical groups and prioritized important citations in each group.
DESIGN: A text mining system framework for automatic document clustering and ranking organized MEDLINE citations following simple PubMed queries. The system grouped the retrieved citations, ranked the citations in each cluster, and generated a set of keywords and MeSH terms to describe the common theme of each cluster. MEASUREMENTS: Several possible ranking functions were compared, including citation count per year (CCPY), citation count (CC), and journal impact factor (JIF). We evaluated this framework by identifying as "important" those articles selected by the Surgical Oncology Society.
RESULTS: Our results showed that CCPY outperforms CC and JIF, i.e., CCPY better ranked important articles than did the others. Furthermore, our text clustering and knowledge extraction strategy grouped the retrieval results into informative clusters as revealed by the keywords and MeSH terms extracted from the documents in each cluster.
CONCLUSIONS: The text mining system studied effectively integrated text clustering, text summarization, and text ranking and organized MEDLINE retrieval results into different topical groups.

Entities:  

Mesh:

Year:  2007        PMID: 17600104      PMCID: PMC1975797          DOI: 10.1197/jamia.M2215

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  23 in total

1.  Network analysis. The structure of the Web.

Authors:  J Kleinberg; S Lawrence
Journal:  Science       Date:  2001-11-30       Impact factor: 47.728

2.  Text categorization models for high-quality article retrieval in internal medicine.

Authors:  Yindalon Aphinyanaphongs; Ioannis Tsamardinos; Alexander Statnikov; Douglas Hardin; Constantin F Aliferis
Journal:  J Am Med Inform Assoc       Date:  2004-11-23       Impact factor: 4.497

Review 3.  A survey of current work in biomedical text mining.

Authors:  Aaron M Cohen; William R Hersh
Journal:  Brief Bioinform       Date:  2005-03       Impact factor: 11.622

4.  Text mining biomedical literature for discovering gene-to-gene relationships: a comparative study of algorithms.

Authors:  Ying Liu; Shamkant B Navathe; Jorge Civera; Venu Dasigi; Ashwin Ram; Brian J Ciliax; Ray Dingledine
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2005 Jan-Mar       Impact factor: 3.710

5.  Text analysis of MEDLINE for discovering functional relationships among genes: evaluation of keyword extraction weighting schemes.

Authors:  Ying Liu; Shamkant B Navathe; Alex Pivoshenko; Venu G Dasigi; Ray Dingledine; Brian J Ciliax
Journal:  Int J Data Min Bioinform       Date:  2006       Impact factor: 0.667

Review 6.  How well do physicians use electronic information retrieval systems? A framework for investigation and systematic review.

Authors:  W R Hersh; D H Hickam
Journal:  JAMA       Date:  1998-10-21       Impact factor: 56.272

Review 7.  IGF-I physiology and breast cancer.

Authors:  M Pollak
Journal:  Recent Results Cancer Res       Date:  1998

8.  Anti-vascular endothelial growth factor receptor-1 antagonist antibody as a therapeutic agent for cancer.

Authors:  Yan Wu; Zhaojing Zhong; James Huber; Rajiv Bassi; Bridget Finnerty; Erik Corcoran; Huiling Li; Elizabeth Navarro; Paul Balderes; Xenia Jimenez; Henry Koo; Venkata R M Mangalampalli; Dale L Ludwig; James R Tonra; Daniel J Hicklin
Journal:  Clin Cancer Res       Date:  2006-11-01       Impact factor: 12.531

9.  GoPubMed: exploring PubMed with the Gene Ontology.

Authors:  Andreas Doms; Michael Schroeder
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

10.  Mining microarray expression data by literature profiling.

Authors:  Damien Chaussabel; Alan Sher
Journal:  Genome Biol       Date:  2002-09-13       Impact factor: 13.583

View more
  17 in total

1.  Automatic summarization of MEDLINE citations for evidence-based medical treatment: a topic-oriented evaluation.

Authors:  Marcelo Fiszman; Dina Demner-Fushman; Halil Kilicoglu; Thomas C Rindflesch
Journal:  J Biomed Inform       Date:  2008-11-05       Impact factor: 6.317

2.  UMLS-Interface and UMLS-Similarity : open source software for measuring paths and semantic similarity.

Authors:  Bridget T McInnes; Ted Pedersen; Serguei V S Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

3.  U-path: An undirected path-based measure of semantic similarity.

Authors:  Bridget T McInnes; Ted Pedersen; Ying Liu; Genevieve B Melton; Serguei V Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

4.  Mining MEDLINE for problems associated with vitamin D.

Authors:  Dina Demner-Fushman; James G Mork; Alan R Aronson
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

5.  Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness.

Authors:  Zhiguo Yu; Byron C Wallace; Todd Johnson; Trevor Cohen
Journal:  Stud Health Technol Inform       Date:  2017

6.  Aggregator: a machine learning approach to identifying MEDLINE articles that derive from the same underlying clinical trial.

Authors:  Weixiang Shao; Clive E Adams; Aaron M Cohen; John M Davis; Marian S McDonagh; Sujata Thakurta; Philip S Yu; Neil R Smalheiser
Journal:  Methods       Date:  2014-11-20       Impact factor: 3.608

7.  Towards Transforming Expert-based Content to Evidence-based Content.

Authors:  Soheil Moosavinasab; Majid Rastegar-Mojarad; Hongfang Liu; Siddhartha R Jonnalagadda
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2014-04-07

8.  Mining Health Social Media with Sentiment Analysis.

Authors:  Fu-Chen Yang; Anthony J T Lee; Sz-Chen Kuo
Journal:  J Med Syst       Date:  2016-09-23       Impact factor: 4.460

Review 9.  A systematic review of the quality and impact of anxiety disorder meta-analyses.

Authors:  Jonathan C Ipser; Dan J Stein
Journal:  Curr Psychiatry Rep       Date:  2009-08       Impact factor: 5.285

10.  Health-related hot topic detection in online communities using text clustering.

Authors:  Yingjie Lu; Pengzhu Zhang; Jingfang Liu; Jia Li; Shasha Deng
Journal:  PLoS One       Date:  2013-02-15       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.