A novel framework for horizontal and vertical data integration in cancer studies with application to survival time prediction models.

Iliyan Mihaylov, Maciej Kańduła, Milko Krachunov, Dimitar Vassilev.

Abstract

BACKGROUND: Recently, high-throughput technologies have been used extensively alongside clinical tests to study various types of cancer. Data generated in such large-scale studies are heterogeneous, spanning different types and formats. In the absence of effective integration strategies, novel models are needed for efficient and operational data integration, in which clinical and molecular information can be joined effectively for storage, access and ease of use. Such models, combined with machine learning methods for accurate prediction of survival time in cancer studies, can yield novel insights into disease development and lead to precise personalized therapies.
RESULTS: We developed an approach for intelligent data integration of two cancer datasets (breast cancer and neuroblastoma) provided in the CAMDA 2018 'Cancer Data Integration Challenge', and compared models for prediction of survival time. We developed a novel semantic network-based data integration framework that utilizes NoSQL databases, in which we combined clinical and expression profile data, using both raw data records and external knowledge sources. Using the integrated data, we introduced the Tumor Integrated Clinical Feature (TICF), a new feature for accurate prediction of patient survival time. Finally, we applied and validated several machine learning models for survival time prediction.
CONCLUSION: We developed a framework for semantic integration of clinical and omics data that can borrow information across multiple cancer studies. By linking data with external domain knowledge sources, our approach facilitates enrichment of the studied data by discovery of internal relations. The proposed and validated machine learning models for survival time prediction yielded accurate results.
REVIEWERS: This article was reviewed by Eran Elhaik, Wenzhong Xiao and Carlos Loucera.

Keywords:  Breast cancer; Machine learning; Neuroblastoma; Semantic data integration; Survival time prediction

Year: 2019   PMID: 31752974   PMCID: PMC6868770   DOI: 10.1186/s13062-019-0249-6

Source DB: PubMed   Journal: Biol Direct   ISSN: 1745-6150   Impact factor: 4.540


