
Graph Algorithms for Condensing and Consolidating Gene Set Analysis Results.

Sara R Savage, Zhiao Shi, Yuxing Liao, Bing Zhang.

Abstract

Gene set analysis plays a critical role in the functional interpretation of omics data. Although this is typically done for one omics experiment at a time, there is an increasing need to combine gene set analysis results from multiple experiments performed on the same or different omics platforms, such as in multi-omics studies. Integrating results from multiple experiments is challenging, and annotation redundancy between gene sets further obscures clear conclusions. We propose to use a weighted set cover algorithm to reduce redundancy of gene sets identified in a single experiment. Next, we use affinity propagation to consolidate similar gene sets identified from multiple experiments into clusters and to automatically determine the most representative gene set for each cluster. Using three examples from over-representation analysis and gene set enrichment analysis, we showed that weighted set cover outperformed a previously published set cover method and reduced the number of gene sets by 52-77%. Focusing on overlapping genes between the list of input genes and the enriched gene sets in over-representation analysis and leading-edge genes in gene set enrichment analysis further reduced the number of gene sets. A use case combining enrichment analysis results from RNA-Seq and proteomics data comparing basal and luminal A breast cancer samples highlighted the known difference in proliferation and DNA damage response. Finally, we used these algorithms for a pan-cancer survival analysis. Our analysis clearly revealed prognosis-related pathways common to multiple cancer types or specific to individual cancer types, as well as pathways associated with prognosis in different directions in different cancer types. We implemented these two algorithms in an R package, Sumer, which generates tables and static and interactive plots for exploration and publication. Sumer is publicly available at https://github.com/bzhanglab/sumer.
© 2019 Savage et al.
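
The redundancy-reduction step described in the abstract can be illustrated with a generic greedy weighted set cover. The sketch below is not the Sumer implementation (Sumer is an R package). The function name, the toy gene sets, and the weights are all hypothetical, and the weights are assumed to be costs where a lower value marks a more preferred gene set (for example, one derived from enrichment significance). At each step the sketch picks the gene set with the lowest cost per newly covered gene.

```python
from typing import Dict, List, Set


def greedy_weighted_set_cover(
    gene_sets: Dict[str, Set[str]],
    weights: Dict[str, float],
) -> List[str]:
    """Greedily pick gene sets until every gene in the union is covered.

    Each round selects the set with the lowest cost per newly covered
    gene, i.e. weight / (number of still-uncovered genes it contains).
    """
    uncovered: Set[str] = set().union(*gene_sets.values())
    remaining = dict(gene_sets)
    selected: List[str] = []
    while uncovered and remaining:
        best_name = None
        best_cost = float("inf")
        # Iterate in sorted order so ties are broken deterministically.
        for name in sorted(remaining):
            gain = len(remaining[name] & uncovered)
            if gain == 0:
                continue  # this set adds nothing new
            cost = weights[name] / gain
            if cost < best_cost:
                best_name, best_cost = name, cost
        if best_name is None:
            break  # no remaining set covers any uncovered gene
        selected.append(best_name)
        uncovered -= remaining.pop(best_name)
    return selected


# Hypothetical enriched gene sets (restricted to overlapping input genes)
# and hypothetical costs; none of these values come from the paper.
gene_sets = {
    "cell cycle": {"CDK1", "CCNB1", "MKI67", "TOP2A"},
    "mitotic cell cycle": {"CDK1", "CCNB1", "MKI67"},
    "DNA repair": {"BRCA1", "RAD51", "TOP2A"},
    "double-strand break repair": {"BRCA1", "RAD51"},
}
weights = {
    "cell cycle": 1.0,
    "mitotic cell cycle": 1.5,
    "DNA repair": 1.0,
    "double-strand break repair": 2.0,
}

selected = greedy_weighted_set_cover(gene_sets, weights)
print(selected)  # ['cell cycle', 'DNA repair']
```

In this toy input, the two broader sets cover all six genes, so the two narrower, largely overlapping sets are dropped. The consolidation step across experiments (affinity propagation) is a separate clustering procedure and is not shown here.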

Keywords:  Algorithms; Bioinformatics software; Breast cancer; Cancer Biology*; Computational Biology; Data evaluation; Networks*; Omics; Pathway Analysis

Year:  2019        PMID: 31142576      PMCID: PMC6692773          DOI: 10.1074/mcp.TIR118.001263

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  6 in total

1.  Proteomics Is Not an Island: Multi-omics Integration Is the Key to Understanding Biological Systems.

Authors:  Bing Zhang; Bernhard Kuster
Journal:  Mol Cell Proteomics       Date:  2019-08-09       Impact factor: 5.911

2.  Maternal age affects equine day 8 embryo gene expression both in trophoblast and inner cell mass.

Authors:  Emilie Derisoud; Luc Jouneau; Cédric Dubois; Catherine Archilla; Yan Jaszczyszyn; Rachel Legendre; Nathalie Daniel; Nathalie Peynot; Michèle Dahirel; Juliette Auclair-Ronzaud; Laurence Wimel; Véronique Duranthon; Pascale Chavatte-Palmer
Journal:  BMC Genomics       Date:  2022-06-15       Impact factor: 4.547

3.  Proteogenomic insights into the biology and treatment of HPV-negative head and neck squamous cell carcinoma.

Authors:  Chen Huang; Lijun Chen; Sara R Savage; Rodrigo Vargas Eguez; Yongchao Dou; Yize Li; Felipe da Veiga Leprevost; Eric J Jaehnig; Jonathan T Lei; Bo Wen; Michael Schnaubelt; Karsten Krug; Xiaoyu Song; Marcin Cieślik; Hui-Yin Chang; Matthew A Wyczalkowski; Kai Li; Antonio Colaprico; Qing Kay Li; David J Clark; Yingwei Hu; Liwei Cao; Jianbo Pan; Yuefan Wang; Kyung-Cho Cho; Zhiao Shi; Yuxing Liao; Wen Jiang; Meenakshi Anurag; Jiayi Ji; Seungyeul Yoo; Daniel Cui Zhou; Wen-Wei Liang; Michael Wendl; Pankaj Vats; Steven A Carr; D R Mani; Zhen Zhang; Jiang Qian; Xi S Chen; Alexander R Pico; Pei Wang; Arul M Chinnaiyan; Karen A Ketchum; Christopher R Kinsinger; Ana I Robles; Eunkyung An; Tara Hiltke; Mehdi Mesri; Mathangi Thiagarajan; Alissa M Weaver; Andrew G Sikora; Jan Lubiński; Małgorzata Wierzbicka; Maciej Wiznerowicz; Shankha Satpathy; Michael A Gillette; George Miles; Matthew J Ellis; Gilbert S Omenn; Henry Rodriguez; Emily S Boja; Saravana M Dhanasekaran; Li Ding; Alexey I Nesvizhskii; Adel K El-Naggar; Daniel W Chan; Hui Zhang; Bing Zhang
Journal:  Cancer Cell       Date:  2021-01-07       Impact factor: 31.743

4.  Multiomic analysis identifies CPT1A as a potential therapeutic target in platinum-refractory, high-grade serous ovarian cancer.

Authors:  Dongqing Huang; Shrabanti Chowdhury; Hong Wang; Sara R Savage; Richard G Ivey; Jacob J Kennedy; Jeffrey R Whiteaker; Chenwei Lin; Xiaonan Hou; Ann L Oberg; Melissa C Larson; Najmeh Eskandari; Davide A Delisi; Saverio Gentile; Catherine J Huntoon; Uliana J Voytovich; Zahra J Shire; Qing Yu; Steven P Gygi; Andrew N Hoofnagle; Zachary T Herbert; Travis D Lorentzen; Anna Calinawan; Larry M Karnitz; S John Weroha; Scott H Kaufmann; Bing Zhang; Pei Wang; Michael J Birrer; Amanda G Paulovich
Journal:  Cell Rep Med       Date:  2021-12-21

Review 5.  Application of Proteomics in Cancer: Recent Trends and Approaches for Biomarkers Discovery.

Authors:  Yang Woo Kwon; Han-Seul Jo; Sungwon Bae; Youngsuk Seo; Parkyong Song; Minseok Song; Jong Hyuk Yoon
Journal:  Front Med (Lausanne)       Date:  2021-09-22

Review 6.  A Detailed Catalogue of Multi-Omics Methodologies for Identification of Putative Biomarkers and Causal Molecular Networks in Translational Cancer Research.

Authors:  Efstathios Iason Vlachavas; Jonas Bohn; Frank Ückert; Sylvia Nürnberg
Journal:  Int J Mol Sci       Date:  2021-03-10       Impact factor: 5.923
