Literature DB >> 30848457

A Guide to Dictionary-Based Text Mining.

Helen V Cook1,2, Lars Juhl Jensen3.   

Abstract

PubMed contains more than 27 million documents, and this number is growing at an estimated 4% per year. Even within specialized topics, it is no longer possible for a researcher to read any field in its entirety, and thus nobody has a complete picture of the scientific knowledge in any given field at any time. Text mining provides a means to automatically read this corpus and to extract the relations found therein as structured information. Having data in a structured format is a huge boon for computational efforts to access, cross reference, and mine the data stored therein. This is increasingly useful as biological research is becoming more focused on systems and multi-omics integration. This chapter provides an overview of the steps that are required for text mining: tokenization, named entity recognition, normalization, event extraction, and benchmarking. It discusses a variety of approaches to these tasks and then goes into detail on how to prepare data for use specifically with the JensenLab tagger. This software uses a dictionary-based approach and provides the text mining evidence for STRING and several other databases.

Keywords:  Automated text processing; Dictionary-based approach; Named entity recognition; PubMed; Structured information; Text mining; Text normalization

Mesh:

Year:  2019        PMID: 30848457     DOI: 10.1007/978-1-4939-9089-4_5

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  2 in total

1.  Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining.

Authors:  Lejun Gong; Zhifei Zhang; Shiqi Chen
Journal:  J Healthc Eng       Date:  2020-11-24       Impact factor: 2.682

2.  TWIRLS, a knowledge-mining technology, suggests a possible mechanism for the pathological changes in the human host after coronavirus infection via ACE2.

Authors:  Xiaoyang Ji; Wenting Tan; Chunming Zhang; Yubo Zhai; Yiching Hsueh; Zhonghai Zhang; Chunli Zhang; Yanqiu Lu; Bo Duan; Guangming Tan; Renhua Na; Guohong Deng; Gang Niu
Journal:  Drug Dev Res       Date:  2020-07-13       Impact factor: 5.004

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.