| Literature DB >> 29368755 |
Bernd Müller1, Christoph Poley1, Jana Pössel1, Alexandra Hagelstein1, Thomas Gübitz1.
Abstract
The explosive growth of literature and data in the life sciences challenges researchers to keep track of current advancements in their disciplines. Novel approaches in the life science like the One Health paradigm require integrated methodologies in order to link and connect heterogeneous information from databases and literature resources. Current publications in the life sciences are increasingly characterized by the employment of trans-disciplinary methodologies comprising molecular and cell biology, genetics, genomic, epigenomic, transcriptional and proteomic high throughput technologies with data from humans, plants, and animals. The literature search engine LIVIVO empowers retrieval functionality by incorporating various literature resources from medicine, health, environment, agriculture and nutrition. LIVIVO is developed in-house by ZB MED - Information Centre for Life Sciences. It provides a user-friendly and usability-tested search interface with a corpus of 55 Million citations derived from 50 databases. Standardized application programming interfaces are available for data export and high throughput retrieval. The search functions allow for semantic retrieval with filtering options based on life science entities. The service oriented architecture of LIVIVO uses four different implementation layers to deliver search services. A Knowledge Environment is developed by ZB MED to deal with the heterogeneity of data as an integrative approach to model, store, and link semantic concepts within literature resources and databases. Future work will focus on the exploitation of life science ontologies and on the employment of NLP technologies in order to improve query expansion, filters in faceted search, and concept based relevancy rankings in LIVIVO.Entities:
Keywords: Data Mining; Information Retrieval; Knowledge Discovery; LIVIVO; Life Sciences; Literature Search; Text Mining
Year: 2017 PMID: 29368755 PMCID: PMC5750838 DOI: 10.1007/s13222-016-0245-2
Source DB: PubMed Journal: Datenbank Spektrum ISSN: 1618-2162
Fig. 1Service oriented architecture of LIVIVO with the four layers for the web interface, the portal software, the query standardization, and the base layer of the search engine
Fig. 2The ZB MED Knowledge Environment (KE) built upon literature resources with meta-data and fulltexts that are annotated using a UIMA-based text and data mining workflow. Dictionaries with concepts as well as relations of concepts are derived from MeSH, AGROVOC, and DrugBank. Big data analysis is conducted on the ZB MED KE that is transferred into LIVIVO as potentially novel visualization techniques