Literature DB >> 16845111

HubMed: a web-based biomedical literature search interface.

Alfred D Eaton1.   

Abstract

HubMed is an alternative search interface to the PubMed database of biomedical literature, incorporating external web services and providing functions to improve the efficiency of literature search, browsing and retrieval. Users can create and visualize clusters of related articles, export citation data in multiple formats, receive daily updates of publications in their areas of interest, navigate links to full text and other related resources, retrieve data from formatted bibliography lists, navigate citation links and store annotated metadata for articles of interest. HubMed is freely available at http://www.hubmed.org/.

Entities:  

Mesh:

Year:  2006        PMID: 16845111      PMCID: PMC1538859          DOI: 10.1093/nar/gkl037

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


BACKGROUND

NCBI's PubMed (), a biomedical literature database incorporating MEDLINE, is the primary source of peer-reviewed biomedical information for scientific researchers, practising health professionals and the general public. Rapid response times from the search engine Entrez and integration with other NCBI-hosted databases such as GenBank allow PubMed to provide broad, up-to-date and curated search results. However, this breadth of coverage and functionality for a wide variety of users, ranging from those researching the results of clinical trials to those examining the composition of DNA sequences, means that Entrez/PubMed is unable to optimize its interface and functions for researchers that need to search and browse large volumes of literature covering their specific area of interest. The PubMed interface also lacks integration with web-based resources outside the NCBI. Availability of the PubMed database via a web services API (), launched in 2002, opened up the possibility for external developers to take advantage of the NCBI's databases and processing power to provide alternative representations of the biomedical literature; e.g. analysing and extracting meaning from abstracts and MESH headings (1) or providing interfaces that add specialized functions (2).

FUNCTIONS PROVIDED BY HUBMED

HubMed () is one such tool based around the Entrez Programming Utilities web service API. HubMed provides a dynamic and intuitive interface that transforms data from PubMed and integrates it with data from other sources, with the aim of improving the ability of researchers to find and manage biomedical literature related to their research. For the last three years, HubMed has been providing daily updates of new arrivals to the MEDLINE database in a variety of XML (Extensible Markup Language) feed formats [currently Atom (), RSS 1.0 (RDF) and RSS 2.0 ()]. Subscribing to a feed of new matches for any search query is free and requires no registration, enabling tools such as Onfolio () and Kebberfegg () to dynamically generate feed subscriptions on demand, that can then be processed by desktop or web-based feed aggregators (see for more details). Each item in a feed is linked via a unique identifier—the PubMed ID (PMID)—to HubMed's display of the most useful metadata available for that article, from where users can carry out a variety of functions, some of which are described below. As most publications are not generally made available to researchers in a metadata-rich interchange format, the full text PDF of an article remains the most fundamental part of a researcher's digital library: an important link out of HubMed is therefore to the full online text of a paper. Users can proceed to the full text of an article using any of four overlapping options: through PubMed's ELink service () that leads to the document on the publisher's website; via Ex Libris' demonstration SFX server (). that provides a range of alternate full text services (often based on either the PMID or Digital Object Identifier (DOI, ) of an article); through Google Scholar (), that carries out a full text search of selected web documents; or via activation of embedded COinS metadata () which allows anyone with a COinS-activating web browser extension (available from ) or proxy server to receive links to a full text resolver (based on the OpenURL linking standard, ) appropriate for their location or institutional affiliation. While searching, browsing and reading articles, researchers are able to use HubMed to build a store of metadata for the papers that they find the most useful or interesting, as well as generating a taxonomy for these collections, by affixing tags (a synonym for keywords or labels) and annotations to each article. The Tag Storage service (), which requires a free registration, facilitates the recall and browsing of articles collected by each user or user group. HubMed also works fluently with other academic- and science-targeted social bookmarking tools such as CiteULike () and Connotea (), both of which are able to automatically retrieve metadata for items stored using a PMID. Once articles are stored inside HubMed's Tag Storage, users can arrange them into lists, view weighted visualizations of their tag usage frequency and export their stored data as RDF (), for use with other tools. This RDF/XML export feature is also available from any HubMed search result page, providing a basis for the use of information harvesting and management tools, such as SIMILE's Piggy Bank (), an extension for the Firefox web browser that can be used to store, manipulate, browse and visualize data collected from any RDF data-exporting source. The possibilities enabled by this kind of semantic data store are numerous, such as inferring conflicts or agreements between networks of biomedical research publications (e.g. ). As illustrated in Figure 1, HubMed also provides direct export of article metadata in a range of other formats, including RIS (, for use with Endnote, RefDB and many other bibliographic tools), BibTeX (, for use in TeX documents), MODS (, for use with XML document formats) and a direct link to send citation data to the online bibliographic library manager RefWorks (). HubMed maintains Unicode (UTF-8) characters throughout all its processes, so can provide the option to either include these accented characters in exported citation data or convert them to their Latin equivalents for use with older, Unicode-incompatible tools.
Figure 1

A HubMed page displaying the abstract for a single article along with action links and options for a variety of export formats.

To aid researchers wishing to browse the bibliography lists of papers published online in PDF format, HubMed can extract bibliographic data from text copied and pasted from PDF documents. The Citation Finder, available at , extracts each reference, parses the citation string and converts it into a PubMed search; the results are then displayed in HubMed as standard search results, allowing users to continue to read and work with the referenced articles. This citation parsing algorithm is based on a modified version of the ParaTools Perl modules () produced by the Open Citation Project (). To help users better understand jargon, acronyms and specialized scientific terms found within articles, HubMed's ‘Terms’ function, which accompanies each abstract, passes the abstract text through two web service filters in order to identify important keywords. The first, Whatizit (), is provided by the EBI and identifies Gene Ontology terms, along with protein and drug names in the text, adding links from each term to the Gene Ontology (3), UniProt (4) and MedlinePlus (), respectively. The second filter compares all words to a database of Wikipedia page titles (available from ) and adds links to the appropriate Wikipedia pages () from words for which information is available. HubMed also aids search result browsing by extracting and displaying sentences from the abstract text in which the query terms occur. Additionally, searches are augmented both by the use of PubMed's ESpell web service (), which provides alternative spelling suggestions for queries which return few or no results, and by a display of the MeSH categories () matched by each query, which can be deselected or augmented as desired to refine the search query. There are a number of tools in HubMed for exploring connections between related papers. Citation links can be explored directly for papers that are deposited in PubMed Central (data available from , including those from Open Access publisher BioMed Central), and there are also links to Elsevier's subscription service Scopus (), which allows in-depth exploration of citation and co-citation data. Articles related by co-occurrence of keywords can be explored directly as with normal search results using the relatedness score calculated by PubMed (described at ); these connections can be visualized as a dynamic force-directed graph using a TouchGraph Java applet (used with permission from ). Articles can also be ranked by order of relatedness to multiple articles using HubMed's ‘Rank Relations’ feature, which allows an iterative refinement of clustered articles providing a more focused view of a topic than standard keyword searches. This is similar to a previously published process used for automatically updating bibliographies using ranking of related articles (5). In conjunction with browsing articles related by keywords and citation links, it would be useful to be able to browse the network of collaborations between authors of scientific papers (6), but this is currently precluded by a lack of unique author identifiers in the MEDLINE database, making it difficult to disambiguate multiple researchers who share the same name.

CONCLUSIONS

For future development, HubMed will continue to incorporate the functions of external web services as they become available (so far, all the mentioned web services have used simple Representational State Transfer (REST)-based interfaces), as well as augmenting built-in functions that improve search efficiency and user-friendliness. Personalization of searches and recommendations, based on patterns of user attention and implied interests, may also improve the accuracy of search results. The role of HubMed in providing building blocks for semantic life sciences data management will continue to adapt to new developments and the needs of researchers in this area.
  6 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  UniProt: the Universal Protein knowledgebase.

Authors:  Rolf Apweiler; Amos Bairoch; Cathy H Wu; Winona C Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J Martin; Darren A Natale; Claire O'Donovan; Nicole Redaschi; Lai-Su L Yeh
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Update on XplorMed: A web server for exploring scientific literature.

Authors:  Carolina Perez-Iratxeta; Antonio J Pérez; Peer Bork; Miguel A Andrade
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

4.  Updating a bibliography using the related articles function within PubMed.

Authors:  X Liu; R B Altman
Journal:  Proc AMIA Symp       Date:  1998

5.  GoPubMed: exploring PubMed with the Gene Ontology.

Authors:  Andreas Doms; Michael Schroeder
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

6.  PubNet: a flexible system for visualizing literature derived networks.

Authors:  Shawn M Douglas; Gaetano T Montelione; Mark Gerstein
Journal:  Genome Biol       Date:  2005-08-16       Impact factor: 13.583

  6 in total
  16 in total

1.  SEACOIN--an investigative tool for biomedical informatics researchers.

Authors:  Eva K Lee; Hee-Rin Lee; Alexander Quarshie
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Semantic Predications for Complex Information Needs in Biomedical Literature.

Authors:  Delroy Cameron; Ramakanth Kavuluru; Olivier Bodenreider; Pablo N Mendes; Amit P Sheth; Krishnaprasad Thirunarayan
Journal:  IEEE Int Conf Bioinform Biomed Workshops       Date:  2011

3.  Natural language query in the biochemistry and molecular biology domains based on cognition search™.

Authors:  Elizabeth J Goldsmith; Saurabh Mendiratta; Radha Akella; Kathleen Dahlgren
Journal:  Summit Transl Bioinform       Date:  2009-03-01

4.  Studying PubMed usages in the field for complex problem solving: Implications for tool design.

Authors:  Barbara Mirel; Jean Song; Jennifer Steiner Tonks; Fan Meng; Weijian Xuan; Rafiqa Ameziane
Journal:  J Am Soc Inf Sci Technol       Date:  2013-05-01

Review 5.  PubMed and beyond: a survey of web tools for searching biomedical literature.

Authors:  Zhiyong Lu
Journal:  Database (Oxford)       Date:  2011-01-18       Impact factor: 3.451

6.  CDAPubMed: a browser extension to retrieve EHR-based biomedical literature.

Authors:  David Perez-Rey; Ana Jimenez-Castellanos; Miguel Garcia-Remesal; Jose Crespo; Victor Maojo
Journal:  BMC Med Inform Decis Mak       Date:  2012-04-05       Impact factor: 2.796

7.  eTBLAST: a web server to identify expert reviewers, appropriate journals and similar publications.

Authors:  Mounir Errami; Jonathan D Wren; Justin M Hicks; Harold R Garner
Journal:  Nucleic Acids Res       Date:  2007-04-22       Impact factor: 16.971

Review 8.  Defrosting the digital library: bibliographic tools for the next generation web.

Authors:  Duncan Hull; Steve R Pettifer; Douglas B Kell
Journal:  PLoS Comput Biol       Date:  2008-10-31       Impact factor: 4.475

9.  Anne O'Tate: A tool to support user-driven summarization, drill-down and browsing of PubMed search results.

Authors:  Neil R Smalheiser; Wei Zhou; Vetle I Torvik
Journal:  J Biomed Discov Collab       Date:  2008-02-15

10.  Knowledge-Based Query Construction Using the CDSS Knowledge Base for Efficient Evidence Retrieval.

Authors:  Muhammad Afzal; Maqbool Hussain; Taqdir Ali; Jamil Hussain; Wajahat Ali Khan; Sungyoung Lee; Byeong Ho Kang
Journal:  Sensors (Basel)       Date:  2015-08-28       Impact factor: 3.576

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.