GVES: machine learning model for identification of prognostic genes with a small dataset.

Soohyun Ko, Jonghwan Choi, Jaegyoon Ahn.

Abstract

Machine learning may be a powerful approach to more accurately identifying genes that can serve as prognosticators of cancer outcomes from various types of omics data. To date, however, machine learning approaches have shown limited accuracy in predicting cancer outcomes, primarily owing to small sample numbers and a relatively large number of features. In this paper, we describe GVES (Gene Vector for Each Sample), a proposed machine learning model that can be applied efficiently even with a small sample size to increase the accuracy of identifying genes with prognostic value. GVES, an adaptation of the continuous bag of words (CBOW) model, generates vector representations of all genes for all samples by leveraging gene expression and biological network data. GVES clusters samples using their gene vectors and identifies the genes that divide samples into good and poor outcome groups for the prediction of cancer outcomes. Because GVES generates gene vectors for each sample, the sample size effect is reduced. We applied GVES to six cancer types and demonstrated that it outperformed existing machine learning methods, particularly for cancer datasets with a small number of samples. Moreover, the genes identified as prognosticators were shown to reside within a number of significant prognostic genetic pathways associated with pancreatic cancer.
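
To make the clustering step more concrete, the following is a minimal sketch of how per-sample gene vectors could be used to score genes by how well they split samples into outcome groups. It is not the authors' implementation: the abstract does not name the clustering algorithm or the scoring rule, so plain 2-means and a simple agreement fraction are used here as stand-ins. The function names (`two_means`, `outcome_agreement`, `rank_genes`) are hypothetical, and the vectors are synthetic rather than learned with a CBOW-style model.

```python
import numpy as np


def two_means(points, n_iter=20):
    """Plain 2-means clustering (an assumed stand-in); returns a 0/1 label per row."""
    points = np.asarray(points, dtype=float)
    # Deterministic initialisation: the first point and the point farthest from it.
    first = points[0]
    far = points[np.argmax(np.linalg.norm(points - first, axis=1))]
    centers = np.stack([first, far])
    labels = np.zeros(len(points), dtype=int)
    for _ in range(n_iter):
        # Distance from every point to both centers, shape (n_points, 2).
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = np.argmin(dists, axis=1)
        for k in range(2):
            if np.any(labels == k):
                centers[k] = points[labels == k].mean(axis=0)
    return labels


def outcome_agreement(cluster_labels, outcomes):
    """Fraction of samples whose cluster matches their outcome, up to a label swap."""
    match = float(np.mean(cluster_labels == outcomes))
    return max(match, 1.0 - match)


def rank_genes(gene_vectors, outcomes):
    """Score each gene by clustering samples on that gene's vectors.

    gene_vectors has shape (n_samples, n_genes, dim); outcomes is 0/1 per sample.
    Returns (gene indices sorted best first, per-gene scores).
    """
    scores = np.array([
        outcome_agreement(two_means(gene_vectors[:, g, :]), outcomes)
        for g in range(gene_vectors.shape[1])
    ])
    return np.argsort(-scores, kind="stable"), scores


# Synthetic toy data: 6 samples, 2 genes, 2-dimensional vectors.
rng = np.random.default_rng(0)
outcomes = np.array([0, 0, 0, 1, 1, 1])
vectors = rng.normal(scale=0.1, size=(6, 2, 2))
vectors[outcomes == 1, 0, :] += 5.0  # gene 0 is built to separate the outcome groups

order, scores = rank_genes(vectors, outcomes)
print("genes ranked best first:", order)
print("agreement scores:", scores)
```

In this toy setup gene 0 is constructed so that its vectors separate the two outcome groups, while gene 1 is pure noise. Scoring one gene at a time on per-sample vectors is only one way to read "genes that divide samples into good and poor outcome groups"; the paper itself should be consulted for the actual procedure.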

Year:  2021        PMID: 33431999      PMCID: PMC7801384          DOI: 10.1038/s41598-020-79889-5

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


