Literature DB >> 31419838

Cancer Informatics in 2018: The Mysteries of the Cancer Genome Continue to Unravel, Deep Learning Approaches the Clinic, and Passive Data Collection Demonstrates Utility.

Jeremy L Warner1, Debra Patt2.   

Abstract

OBJECTIVE: To summarize significant research contributions on cancer informatics published in 2018.
METHODS: An extensive search using PubMed/Medline, Google Scholar, and manual review was conducted to identify the scientific contributions published in 2018 that address topics in cancer informatics. The selection process comprised three steps: (i) 15 candidate best papers were first selected by the two section editors, (ii) external reviewers from internationally renowned research teams reviewed each candidate best paper, and (iii) the final selection of four best papers was conducted by the editorial board of the International Medical Informatics Association (IMIA) Yearbook.
RESULTS: The four selected best papers present studies addressing many facets of cancer informatics, with immediate applicability in the translational and clinical domains.
CONCLUSION: Cancer informatics is a broad and vigorous subfield of biomedical informatics. Progress in cancer genomics, artificial intelligence, and passively collected data is especially notable in 2018. Georg Thieme Verlag KG Stuttgart.

Entities:  

Mesh:

Year:  2019        PMID: 31419838      PMCID: PMC6697504          DOI: 10.1055/s-0039-1677931

Source DB:  PubMed          Journal:  Yearb Med Inform        ISSN: 0943-4747


Introduction

The field of cancer informatics intends to take full advantage of the many data streams generated in the course of cancer prevention, diagnosis, care, and survivorship with several fundamental goals: 1) organizing the data in ways that are comprehensible and meaningful to clinicians, researchers, and patients; 2) using the data to advance the treatment of cancer; 3) bringing new data streams, such as person-generated data, into the mix; and 4) manipulating the data, such as through visualization, to yield new insights. In this second year of the Cancer Informatics section, we continue to focus on translational and clinical cancer informatics. While there is no survey paper this year, progress continues to be rapid, most notably in the area of cancer genomics. In 2019, the selection of papers in cancer informatics intends to illuminate the current progress of research with a focus on efforts to translate research towards immediate clinical applicability.

Paper Selection Method

Two electronic databases were searched, PubMed/MEDLINE and Google Scholar. Searches were performed in January 2019 to identify peer-reviewed journal articles. A PubMed search using the MeSH terms “Neoplasms” and “Medical Informatics Applications” for papers in English language published between Oct 31 2017 and Dec 31 2018 returned too many results for practical review (3,479 results). Similarly, a Google Scholar search for “cancer informatics” limited to 2017 & 2018 returned 27,700 results. Thus, we performed searches of PubMed-indexed well-known informatics journals and proceedings (i.e., JAMIA, Applied Clinical Informatics, Bioinformatics, Journal of Biomedical Informatics, Journal of Medical Internet Research, Pacific Symposium on Biocomputing) using the search phrase (“J Am Med Inform Assoc”[Journal]) AND neoplasms[MeSH Major Topic] (as shown, for JAMIA). Additionally, the contents of the journal JCO Clinical Cancer Informatics , which was not PubMed-indexed in 2018, were hand-searched. For candidate articles that were PubMed-indexed, we also searched for additional relevant articles using PubMed’s “Similar articles” service. We also searched the proceedings of AMIA 2018 and the 2018 AMIA Joint Summits. One of the two section editors performed the searches. Given the vast results, we focused on identifying articles with translational or clinical applications, as opposed to papers describing more fundamental bioinformatics methodologies. Then, the two section editors undertook independently the initial screening of titles and abstracts to identify papers relevant to the field of interest. Both section editors classified the papers into three categories: definitely include, possibly include, or exclude. They then reviewed in detail the possibly include full-text articles to finally reach a mutual list of 15 candidate best papers. Papers were considered according to their originality, innovativeness, scientific and/or practical impact, and scientific quality. In accordance with the IMIA Yearbook selection process 1 , the 15 candidate best papers were evaluated by the two section editors and by additional external reviewers (at least four reviewers per paper). Four papers were finally selected as best papers ( Table 1 ). A content summary of the selected best papers can be found in the appendix of this synopsis.
Table 1

Best paper selection of articles for the IMIA Yearbook of Medical Informatics 2019 in the section 'Cancer Informatics'. The articles are listed in alphabetical order of the first author’s surname.

SectionCancer Informatics
▪ Bailey MH, Tokheim C, Porta-Pardo E, Sengupta S, Bertrand D, Weerasinghe A, Colaprico A, Wendl MC, Kim J, Reardon B, Ng PK, Jeong KJ, Cao S, Wang Z, Gao J, Gao Q, Wang F, Liu EM, Mularoni L, Rubio-Perez C, Nagarajan N, Cortes-Ciriano I, Zhou DC, Liang WW, Hess JM, Yellapantula VD, Tamborero D, Gonzalez-Perez A, Suphavilai C, Ko JY, Khurana E, Park PJ, Van Allen EM, Liang H; MC3 Working Group; Cancer Genome Atlas Research Network, Lawrence MS, Godzik A, Lopez-Bigas N, Stuart J, Wheeler D, Getz G, Chen K, Lazar AJ, Mills GB, Karchin R, Ding L. Comprehensive characterization of cancer driver genes and mutations. Cell 2018 Apr 5;173(2):371-385.e18.
▪ Hosny A, Parmar C, Coroller TP, Grossmann P, Zeleznik R, Kumar A, Bussink J, Gillies RJ, Mak RH, Aerts HJWL. Deep learning for lung cancer prognostication: A retrospective multi-cohort radiomics study. PLoS Med 2018 Nov 30;15(11):e1002711.
▪ Low CA, Dey AK, Ferreira D, Kamarck T, Sun W, Bae S, Doryab A. Estimation of symptom severity during chemotherapy from passively sensed data: Exploratory study. J Med Internet Res 2017 Dec 19;19(12):e420.
▪ Tamborero D, Rubio-Perez C, Deu-Pons J, Schroeder MP, Vivancos A, Rovira A, Tusquets I, Albanell J, Rodon J, Tabernero J, de Torres C, Dienstmann R, Gonzalez-Perez A, Lopez-Bigas N. Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations. Genome Med 2018 Mar 28;10(1):25.

Conclusions and Outlook

The four selected best papers are representative of three distinct subdomains of cancer informatics: cancer genomics; deep learning applications; and passive data collection. Bailey, et al. , 2 described a very substantial international effort to catalogue a list of cancer driver genes and their mutations. Generally speaking, somatic mutations observed in cancers are felt to either be drivers of the cancer or passive passengers ; most drug development is focused on disabling drivers. Despite much work in this area, automated algorithms often do not agree on candidate driver genes and mutations, requiring expert manual curation. This broad application of 26 bioinformatic software tools to 10,000 TCGA tumor samples (representing 33 cancer types) is the most comprehensive discovery of cancer driver genes and mutations to date. The data generated lay the groundwork for years of basic, translational, and clinical efforts. Hosny, et al. , 3 conducted a multi-site retrospective study of lung cancer prognostication using radiomics. Despite being the deadliest cancer, there are scant prognostic tools to determine lung cancer prognosis outside of the traditional anatomic staging systems. This study was an integrative analysis on seven independent radiographic datasets across five institutions, using a 3D convolution neural network. The authors successfully predicted survival for lung cancer patients with an AUC of 0.70. While the performance is only modest, radiography is standard of care for lung cancer and this study has immediate clinical applicability for a very common and highly lethal disease. Over the coming years, we expect that such approaches will become more comprehensive and more accurate, such as the recently published 4 approach to lung cancer screening by the Google Artificial Intelligence (AI) group. Low, et al. , 5 undertook a prospective pilot study of passively collected data from patients undergoing chemotherapy treatment. The aim of this study was to explore whether passively sensed mobile phone and Fitbit ¯ data could be used to estimate daily symptom burden during chemotherapy. Passively collected data matched patient-reported symptom burden with an accuracy of 88%. This was a small study (n=14), but proof of concept that mobile phone accelerometer and usage and Fitbit ¯ -assessed activity and sleep were related to daily symptom burden during chemotherapy. These findings highlight opportunities for long-term monitoring of cancer patients during chemotherapy, including the possibility of obviating invasive and tedious surveys. The interested reader is also referred to the recent American Society of Clinical Oncology educational book chapter 6 on the topic. Tamborero, et al. , 7 presented Cancer Genome Interpreter (https://www.cancerge- nomeinterpreter.org/home), a software tool that streamlines and automates the process of identifying and annotating variants. The process is critical to interpretation and contextualization of tumor sequencing results, in particular calling out those that may have clinical actionability. The tool accepts several data formats and provides a user-friendly output. Also described is a new knowledge base of 5,314 validated mutations (the Catalog of Validated Oncogenic Mutations). As a proof of concept, 72% of AACR Project GENIE 8 tumors (~17k) have at least one biomarker of drug response in the system. This proportion is much higher than what has been reported in older studies of genomically-informed treatment decisions, suggesting that the match between mutation and drug continues to improve. The other candidate best papers are in the same line with innovative and/or effective cancer informatics approaches. Two of the papers are in the general domain of data extraction and data mining. Baker, et al. , 9 described the Cancer Hallmarks Analytics Tool (CHAT), which extracts cancer-relevant literature from Pubmed. Gianni, et al. , 10 described an effort that seeks to find new combination treatments for cancer using genomics and clinical datasets. These complementary approaches of literature search and clinical data mining should become increasingly intertwined in the future. Three of the candidate best papers 11 12 13 are in the cancer genomics domain, similar to Bailey, et al. , 2 and Tamborero, et al. , 7 . Bertrand, et al. , 11 introduced ConsensusDriver, an algorithmic approach to adjudicating the discrepancies in driver mutation ascertainment mentioned above. Sun, et al. , 12 tackled an important related problem - determining the difference between germline and somatic mutations in the absence of matched normal tissue. Pineiro-Yanez, et al. , 13 developed PanDrugs, a method that aims to prioritize drugs by genomic findings in the cancer specimen. Four of the candidate best papers 4 5 6 7 8 9 10 11 12 13 14 15 16 17 are concerned with knowledge management and ontologies. Cario, et al. , 14 and Warner, et al. , 15 described Orchid and SMART Cancer Navigator respectively, both of which are frameworks for the management of the knowledge needed to practice precision cancer medicine. Malty, et al., 16 described an OWL-based ontology to represent chemotherapy regimens. Pecora, et al. , 17 described a barcode-like approach to classification of cancer. Finally, several additional candidate papers described the uses of information technology for patient and caregiver engagement. Gupta, et al. , 18 carried out a feasibility study of using physical activity monitors as a surrogate for clinician-ascertained performance status. Gustafson, et al. , 19 conducted two sizeable randomized clinical trials examining the utility of a caregiver e-alert system to reduce patients’ distress from symptoms.
  19 in total

1.  Toward a formalization of the process to select IMIA Yearbook best papers.

Authors:  J-B Lamy; B Séroussi; N Griffon; G Kerdelhué; M-C Jaulent; J Bouaud
Journal:  Methods Inf Med       Date:  2014-11-14       Impact factor: 2.176

2.  Comprehensive Characterization of Cancer Driver Genes and Mutations.

Authors:  Matthew H Bailey; Collin Tokheim; Eduard Porta-Pardo; Sohini Sengupta; Denis Bertrand; Amila Weerasinghe; Antonio Colaprico; Michael C Wendl; Jaegil Kim; Brendan Reardon; Patrick Kwok-Shing Ng; Kang Jin Jeong; Song Cao; Zixing Wang; Jianjiong Gao; Qingsong Gao; Fang Wang; Eric Minwei Liu; Loris Mularoni; Carlota Rubio-Perez; Niranjan Nagarajan; Isidro Cortés-Ciriano; Daniel Cui Zhou; Wen-Wei Liang; Julian M Hess; Venkata D Yellapantula; David Tamborero; Abel Gonzalez-Perez; Chayaporn Suphavilai; Jia Yu Ko; Ekta Khurana; Peter J Park; Eliezer M Van Allen; Han Liang; Michael S Lawrence; Adam Godzik; Nuria Lopez-Bigas; Josh Stuart; David Wheeler; Gad Getz; Ken Chen; Alexander J Lazar; Gordon B Mills; Rachel Karchin; Li Ding
Journal:  Cell       Date:  2018-04-05       Impact factor: 41.582

3.  ConsensusDriver Improves upon Individual Algorithms for Predicting Driver Alterations in Different Cancer Types and Individual Patients.

Authors:  Denis Bertrand; Sibyl Drissler; Burton K Chia; Jia Yu Koh; Chenhao Li; Chayaporn Suphavilai; Iain Beehuat Tan; Niranjan Nagarajan
Journal:  Cancer Res       Date:  2017-12-19       Impact factor: 12.701

4.  Orchid: a novel management, annotation and machine learning framework for analyzing cancer mutations.

Authors:  Clinton L Cario; John S Witte
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

5.  AACR Project GENIE: Powering Precision Medicine through an International Consortium.

Authors: 
Journal:  Cancer Discov       Date:  2017-06-01       Impact factor: 39.397

6.  Reducing Symptom Distress in Patients With Advanced Cancer Using an e-Alert System for Caregivers: Pooled Analysis of Two Randomized Clinical Trials.

Authors:  David H Gustafson; Lori L DuBenske; Amy K Atwood; Ming-Yuan Chih; Roberta A Johnson; Fiona McTavish; Andrew Quanbeck; Roger L Brown; James F Cleary; Dhavan Shah
Journal:  J Med Internet Res       Date:  2017-11-14       Impact factor: 5.428

7.  Estimation of Symptom Severity During Chemotherapy From Passively Sensed Data: Exploratory Study.

Authors:  Carissa A Low; Anind K Dey; Denzil Ferreira; Thomas Kamarck; Weijing Sun; Sangwon Bae; Afsaneh Doryab
Journal:  J Med Internet Res       Date:  2017-12-19       Impact factor: 5.428

8.  Cancer Hallmarks Analytics Tool (CHAT): a text mining approach to organize and evaluate scientific literature on cancer.

Authors:  Simon Baker; Imran Ali; Ilona Silins; Sampo Pyysalo; Yufan Guo; Johan Högberg; Ulla Stenius; Anna Korhonen
Journal:  Bioinformatics       Date:  2017-12-15       Impact factor: 6.937

9.  Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations.

Authors:  David Tamborero; Carlota Rubio-Perez; Jordi Deu-Pons; Michael P Schroeder; Ana Vivancos; Ana Rovira; Ignasi Tusquets; Joan Albanell; Jordi Rodon; Josep Tabernero; Carmen de Torres; Rodrigo Dienstmann; Abel Gonzalez-Perez; Nuria Lopez-Bigas
Journal:  Genome Med       Date:  2018-03-28       Impact factor: 11.117

10.  A computational approach to distinguish somatic vs. germline origin of genomic alterations from deep sequencing of cancer specimens without a matched normal.

Authors:  James X Sun; Yuting He; Eric Sanford; Meagan Montesion; Garrett M Frampton; Stéphane Vignot; Jean-Charles Soria; Jeffrey S Ross; Vincent A Miller; Phil J Stephens; Doron Lipson; Roman Yelensky
Journal:  PLoS Comput Biol       Date:  2018-02-07       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.