CGPS: A machine learning-based approach integrating multiple gene set analysis tools for better prioritization of biologically relevant pathways.

Chen Ai, Lei Kong.

Abstract

Gene set enrichment (GSE) analyses play an important role in the interpretation of large-scale transcriptome datasets. Because the plethora of GSE tools and their discrepant performances make it challenging to obtain optimal results, multiple GSE tools can be integrated into a single method. Several existing ensemble methods produce different scores for sorting pathways as integrated results, and it is difficult for users to choose a single ensemble score that yields optimal final results. Here, we develop a machine learning-based ensemble method, Combined Gene set analysis incorporating Prioritization and Sensitivity (CGPS), that integrates the results of nine prominent GSE tools into a single ensemble score (R score) for sorting pathways. Moreover, to the best of our knowledge, CGPS is the first GSE ensemble method built on a priori knowledge of pathways and phenotypes. In an evaluation of 120 simulated datasets and 45 real datasets, we demonstrate that sorting pathways by the R score prioritizes relevant pathways better than 10 widely used individual methods and five types of ensemble scores from two ensemble methods. Additionally, CGPS is applied to expression data involving the drug panobinostat, an anticancer treatment for multiple myeloma. The results identify cell processes associated with cancer, such as the p53 signaling pathway (hsa04115); by contrast, two ensemble methods (EnrichmentBrowser and EGSEA) rank this pathway below the top 20, which may cause users to miss it in their analyses. We show that this method, which is based on a priori knowledge, can capture valuable biological information from numerous types of gene set collections, such as KEGG pathways, GO terms, Reactome, and BioCarta. CGPS is publicly available as standalone source code at ftp://ftp.cbi.pku.edu.cn/pub/CGPS_download/cgps-1.0.0.tar.gz.
Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.
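
To make the idea of integrating several GSE tools into one pathway ordering concrete, the following is a minimal sketch of a generic mean-rank ensemble. It is not the CGPS R score, whose construction the abstract does not describe; the function names, the tool names, and the p-values are all hypothetical and invented purely for illustration.

```python
from statistics import mean


def ranks_from_p_values(p_values):
    """Return 1-based ranks (smallest p-value first); ties share their average rank."""
    ordered = sorted(p_values.items(), key=lambda kv: kv[1])
    ranks = {}
    i = 0
    while i < len(ordered):
        j = i
        # Extend j to cover the run of pathways tied with position i.
        while j + 1 < len(ordered) and ordered[j + 1][1] == ordered[i][1]:
            j += 1
        average_rank = ((i + 1) + (j + 1)) / 2
        for k in range(i, j + 1):
            ranks[ordered[k][0]] = average_rank
        i = j + 1
    return ranks


def mean_rank_ensemble(tool_results):
    """Order pathways by their mean rank across tools (a generic ensemble, not CGPS)."""
    per_tool = [ranks_from_p_values(result) for result in tool_results.values()]
    # Only pathways reported by every tool can be compared fairly.
    shared = set.intersection(*(set(r) for r in per_tool))
    scores = {p: mean(r[p] for r in per_tool) for p in shared}
    # Break equal scores by pathway ID so the ordering is deterministic.
    return sorted(scores, key=lambda p: (scores[p], p))


# Toy input: hypothetical tools and invented p-values, not data from the paper.
toy_results = {
    "tool_a": {"hsa04115": 0.001, "hsa04110": 0.02, "hsa03030": 0.30},
    "tool_b": {"hsa04115": 0.04, "hsa04110": 0.01, "hsa03030": 0.50},
    "tool_c": {"hsa04115": 0.002, "hsa04110": 0.03, "hsa03030": 0.03},
}

if __name__ == "__main__":
    for position, pathway in enumerate(mean_rank_ensemble(toy_results), start=1):
        print(position, pathway)
```

In this toy input one tool ranks hsa04115 second, yet the averaged ordering still places it first, which illustrates why a single combined score can be easier to act on than several disagreeing per-tool rankings.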

Keywords:  Differential expression; Gene expression; Gene set enrichment; Support vector machine

Year:  2018        PMID: 30292791     DOI: 10.1016/j.jgg.2018.08.002

Source DB:  PubMed          Journal:  J Genet Genomics        ISSN: 1673-8527            Impact factor:   4.275

