Literature DB >> 34888523

Supervised learning with word embeddings derived from PubMed captures latent knowledge about protein kinases and cancer.

Vida Ravanmehr1, Hannah Blau1, Luca Cappelletti2, Tommaso Fontana2, Leigh Carmody1, Ben Coleman1, Joshy George1, Justin Reese3, Marcin Joachimiak3, Giovanni Bocci4, Peter Hansen1, Carol Bult5, Jens Rueter5, Elena Casiraghi2, Giorgio Valentini2, Christopher Mungall3, Tudor I Oprea4, Peter N Robinson1.   

Abstract

Inhibiting protein kinases (PKs) that cause cancers has been an important topic in cancer therapy for years. So far, almost 8% of >530 PKs have been targeted by FDA-approved medications, and around 150 protein kinase inhibitors (PKIs) have been tested in clinical trials. We present an approach based on natural language processing and machine learning to investigate the relations between PKs and cancers, predicting PKs whose inhibition would be efficacious to treat a certain cancer. Our approach represents PKs and cancers as semantically meaningful 100-dimensional vectors based on word and concept neighborhoods in PubMed abstracts. We use information about phase I-IV trials in ClinicalTrials.gov to construct a training set for random forest classification. Our results with historical data show that associations between PKs and specific cancers can be predicted years in advance with good accuracy. Our tool can be used to predict the relevance of inhibiting PKs for specific cancers and to support the design of well-focused clinical trials to discover novel PKIs for cancer therapy.
© The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2021        PMID: 34888523      PMCID: PMC8652379          DOI: 10.1093/nargab/lqab113

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  40 in total

Review 1.  Diagnosing the decline in pharmaceutical R&D efficiency.

Authors:  Jack W Scannell; Alex Blanckley; Helen Boldon; Brian Warrington
Journal:  Nat Rev Drug Discov       Date:  2012-03-01       Impact factor: 84.694

2.  Drug target identification using side-effect similarity.

Authors:  Monica Campillos; Michael Kuhn; Anne-Claude Gavin; Lars Juhl Jensen; Peer Bork
Journal:  Science       Date:  2008-07-11       Impact factor: 47.728

3.  Sorafenib in radioactive iodine-refractory, locally advanced or metastatic differentiated thyroid cancer: a randomised, double-blind, phase 3 trial.

Authors:  Marcia S Brose; Christopher M Nutting; Barbara Jarzab; Rossella Elisei; Salvatore Siena; Lars Bastholt; Christelle de la Fouchardiere; Furio Pacini; Ralf Paschke; Young Kee Shong; Steven I Sherman; Johannes W A Smit; John Chung; Christian Kappeler; Carol Peña; István Molnár; Martin J Schlumberger
Journal:  Lancet       Date:  2014-04-24       Impact factor: 79.321

4.  Unsupervised word embeddings capture latent knowledge from materials science literature.

Authors:  Vahe Tshitoyan; John Dagdelen; Leigh Weston; Alexander Dunn; Ziqin Rong; Olga Kononova; Kristin A Persson; Gerbrand Ceder; Anubhav Jain
Journal:  Nature       Date:  2019-07-03       Impact factor: 49.962

5.  node2vec: Scalable Feature Learning for Networks.

Authors:  Aditya Grover; Jure Leskovec
Journal:  KDD       Date:  2016-08

6.  Unexplored therapeutic opportunities in the human genome.

Authors:  Tudor I Oprea; Cristian G Bologa; Søren Brunak; Allen Campbell; Gregory N Gan; Anna Gaulton; Shawn M Gomez; Rajarshi Guha; Anne Hersey; Jayme Holmes; Ajit Jadhav; Lars Juhl Jensen; Gary L Johnson; Anneli Karlson; Andrew R Leach; Avi Ma'ayan; Anna Malovannaya; Subramani Mani; Steven L Mathias; Michael T McManus; Terrence F Meehan; Christian von Mering; Daniel Muthas; Dac-Trung Nguyen; John P Overington; George Papadatos; Jun Qin; Christian Reich; Bryan L Roth; Stephan C Schürer; Anton Simeonov; Larry A Sklar; Noel Southall; Susumu Tomita; Ilinca Tudose; Oleg Ursu; Dušica Vidovic; Anna Waller; David Westergaard; Jeremy J Yang; Gergely Zahoránszky-Köhalmi
Journal:  Nat Rev Drug Discov       Date:  2018-03-23       Impact factor: 84.694

7.  The ClinicalTrials.gov results database--update and key issues.

Authors:  Deborah A Zarin; Tony Tse; Rebecca J Williams; Robert M Califf; Nicholas C Ide
Journal:  N Engl J Med       Date:  2011-03-03       Impact factor: 91.245

Review 8.  Kinase Atlas: Druggability Analysis of Potential Allosteric Sites in Kinases.

Authors:  Christine Yueh; Justin Rettenmaier; Bing Xia; David R Hall; Andrey Alekseenko; Kathryn A Porter; Krister Barkovich; Gyorgy Keseru; Adrian Whitty; James A Wells; Sandor Vajda; Dima Kozakov
Journal:  J Med Chem       Date:  2019-07-05       Impact factor: 7.446

9.  Computational Drug Repositioning for Gastric Cancer using Reversal Gene Expression Profiles.

Authors:  In-Wha Kim; Jung Mi Oh; Hayoung Jang; Jae Hyun Kim; Myeong Gyu Kim; Sangsoo Kim
Journal:  Sci Rep       Date:  2019-02-25       Impact factor: 4.379

10.  New drug candidates for treatment of atypical meningiomas: An integrated approach using gene expression signatures for drug repurposing.

Authors:  Zsolt Zador; Andrew T King; Nophar Geifman
Journal:  PLoS One       Date:  2018-03-20       Impact factor: 3.240

View more
  1 in total

1.  Why was this cited? Explainable machine learning applied to COVID-19 research literature.

Authors:  Lucie Beranová; Marcin P Joachimiak; Tomáš Kliegr; Gollam Rabby; Vilém Sklenák
Journal:  Scientometrics       Date:  2022-04-09       Impact factor: 3.801

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.