Literature DB >> 34174885

Large-scale literature mining to assess the relation between anti-cancer drugs and cancer types.

Chris Bauer1, Ralf Herwig2, Matthias Lienhard2, Paul Prasse3, Tobias Scheffer3, Johannes Schuchhardt4.   

Abstract

BACKGROUND: There is a huge body of scientific literature describing the relation between tumor types and anti-cancer drugs. The vast amount of scientific literature makes it impossible for researchers and physicians to extract all relevant information manually.
METHODS: In order to cope with the large amount of literature we applied an automated text mining approach to assess the relations between 30 most frequent cancer types and 270 anti-cancer drugs. We applied two different approaches, a classical text mining based on named entity recognition and an AI-based approach employing word embeddings. The consistency of literature mining results was validated with 3 independent methods: first, using data from FDA approvals, second, using experimentally measured IC-50 cell line data and third, using clinical patient survival data.
RESULTS: We demonstrated that the automated text mining was able to successfully assess the relation between cancer types and anti-cancer drugs. All validation methods showed a good correspondence between the results from literature mining and independent confirmatory approaches. The relation between most frequent cancer types and drugs employed for their treatment were visualized in a large heatmap. All results are accessible in an interactive web-based knowledge base using the following link: https://knowledgebase.microdiscovery.de/heatmap .
CONCLUSIONS: Our approach is able to assess the relations between compounds and cancer types in an automated manner. Both, cancer types and compounds could be grouped into different clusters. Researchers can use the interactive knowledge base to inspect the presented results and follow their own research questions, for example the identification of novel indication areas for known drugs.

Entities:  

Keywords:  Anti-cancer drugs; Database; Literature mining; Tumor types; Word embeddings

Year:  2021        PMID: 34174885     DOI: 10.1186/s12967-021-02941-z

Source DB:  PubMed          Journal:  J Transl Med        ISSN: 1479-5876            Impact factor:   5.531


  13 in total

1.  ChemSpot: a hybrid system for chemical named entity recognition.

Authors:  Tim Rocktäschel; Michael Weidlich; Ulf Leser
Journal:  Bioinformatics       Date:  2012-04-12       Impact factor: 6.937

2.  A comparison of word embeddings for the biomedical natural language processing.

Authors:  Yanshan Wang; Sijia Liu; Naveed Afzal; Majid Rastegar-Mojarad; Liwei Wang; Feichen Shen; Paul Kingsbury; Hongfang Liu
Journal:  J Biomed Inform       Date:  2018-09-12       Impact factor: 6.317

3.  Unsupervised word embeddings capture latent knowledge from materials science literature.

Authors:  Vahe Tshitoyan; John Dagdelen; Leigh Weston; Alexander Dunn; Ziqin Rong; Olga Kononova; Kristin A Persson; Gerbrand Ceder; Anubhav Jain
Journal:  Nature       Date:  2019-07-03       Impact factor: 49.962

4.  The GNAT library for local and remote gene mention normalization.

Authors:  Jörg Hakenberg; Martin Gerner; Maximilian Haeussler; Illés Solt; Conrad Plake; Michael Schroeder; Graciela Gonzalez; Goran Nenadic; Casey M Bergman
Journal:  Bioinformatics       Date:  2011-08-03       Impact factor: 6.937

5.  GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains.

Authors:  Chih-Hsuan Wei; Hung-Yu Kao; Zhiyong Lu
Journal:  Biomed Res Int       Date:  2015-08-25       Impact factor: 3.411

6.  Extraction of Protein-Protein Interaction from Scientific Articles by Predicting Dominant Keywords.

Authors:  Shun Koyabu; Thi Thanh Thuy Phan; Takenao Ohkawa
Journal:  Biomed Res Int       Date:  2015-12-10       Impact factor: 3.411

7.  Cancer Hallmarks Analytics Tool (CHAT): a text mining approach to organize and evaluate scientific literature on cancer.

Authors:  Simon Baker; Imran Ali; Ilona Silins; Sampo Pyysalo; Yufan Guo; Johan Högberg; Ulla Stenius; Anna Korhonen
Journal:  Bioinformatics       Date:  2017-12-15       Impact factor: 6.937

8.  BioWordVec, improving biomedical word embeddings with subword information and MeSH.

Authors:  Yijia Zhang; Qingyu Chen; Zhihao Yang; Hongfei Lin; Zhiyong Lu
Journal:  Sci Data       Date:  2019-05-10       Impact factor: 6.444

9.  iTextMine: integrated text-mining system for large-scale knowledge extraction from the literature.

Authors:  Jia Ren; Gang Li; Karen Ross; Cecilia Arighi; Peter McGarvey; Shruti Rao; Julie Cowart; Subha Madhavan; K Vijay-Shanker; Cathy H Wu
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

10.  A systematic analysis of FDA-approved anticancer drugs.

Authors:  Jingchun Sun; Qiang Wei; Yubo Zhou; Jingqi Wang; Qi Liu; Hua Xu
Journal:  BMC Syst Biol       Date:  2017-10-03
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.