Gregory W Gundersen1, Matthew R Jones1, Andrew D Rouillard1, Yan Kou1, Caroline D Monteiro2, Axel S Feldmann1, Kevin S Hu1, Avi Ma'ayan1. 1. Department of Pharmacology and Systems Therapeutics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA, BD2K-LINCS Data Coordination and Integration Center, Mount Sinai Knowledge Management Center for Illuminating the Druggable Genome and 367 Airport Sector, Goiania, Goias 74075, Brazil Department of Pharmacology and Systems Therapeutics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA, BD2K-LINCS Data Coordination and Integration Center, Mount Sinai Knowledge Management Center for Illuminating the Druggable Genome and 367 Airport Sector, Goiania, Goias 74075, Brazil Department of Pharmacology and Systems Therapeutics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA, BD2K-LINCS Data Coordination and Integration Center, Mount Sinai Knowledge Management Center for Illuminating the Druggable Genome and 367 Airport Sector, Goiania, Goias 74075, Brazil. 2. Department of Pharmacology and Systems Therapeutics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA, BD2K-LINCS Data Coordination and Integration Center, Mount Sinai Knowledge Management Center for Illuminating the Druggable Genome and 367 Airport Sector, Goiania, Goias 74075, Brazil.
Abstract
MOTIVATION: Identification of differentially expressed genes is an important step in extracting knowledge from gene expression profiling studies. The raw expression data from microarray and other high-throughput technologies is deposited into the Gene Expression Omnibus (GEO) and served as Simple Omnibus Format in Text (SOFT) files. However, to extract and analyze differentially expressed genes from GEO requires significant computational skills. RESULTS: Here we introduce GEO2Enrichr, a browser extension for extracting differentially expressed gene sets from GEO and analyzing those sets with Enrichr, an independent gene set enrichment analysis tool containing over 70 000 annotated gene sets organized into 75 gene-set libraries. GEO2Enrichr adds JavaScript code to GEO web-pages; this code scrapes user selected accession numbers and metadata, and then, with one click, users can submit this information to a web-server application that downloads the SOFT files, parses, cleans and normalizes the data, identifies the differentially expressed genes, and then pipes the resulting gene lists to Enrichr for downstream functional analysis. GEO2Enrichr opens a new avenue for adding functionality to major bioinformatics resources such GEO by integrating tools and resources without the need for a plug-in architecture. Importantly, GEO2Enrichr helps researchers to quickly explore hypotheses with little technical overhead, lowering the barrier of entry for biologists by automating data processing steps needed for knowledge extraction from the major repository GEO. AVAILABILITY AND IMPLEMENTATION: GEO2Enrichr is an open source tool, freely available for installation as browser extensions at the Chrome Web Store and FireFox Add-ons. Documentation and a browser independent web application can be found at http://amp.pharm.mssm.edu/g2e/. CONTACT: avi.maayan@mssm.edu.
MOTIVATION: Identification of differentially expressed genes is an important step in extracting knowledge from gene expression profiling studies. The raw expression data from microarray and other high-throughput technologies is deposited into the Gene Expression Omnibus (GEO) and served as Simple Omnibus Format in Text (SOFT) files. However, to extract and analyze differentially expressed genes from GEO requires significant computational skills. RESULTS: Here we introduce GEO2Enrichr, a browser extension for extracting differentially expressed gene sets from GEO and analyzing those sets with Enrichr, an independent gene set enrichment analysis tool containing over 70 000 annotated gene sets organized into 75 gene-set libraries. GEO2Enrichr adds JavaScript code to GEO web-pages; this code scrapes user selected accession numbers and metadata, and then, with one click, users can submit this information to a web-server application that downloads the SOFT files, parses, cleans and normalizes the data, identifies the differentially expressed genes, and then pipes the resulting gene lists to Enrichr for downstream functional analysis. GEO2Enrichr opens a new avenue for adding functionality to major bioinformatics resources such GEO by integrating tools and resources without the need for a plug-in architecture. Importantly, GEO2Enrichr helps researchers to quickly explore hypotheses with little technical overhead, lowering the barrier of entry for biologists by automating data processing steps needed for knowledge extraction from the major repository GEO. AVAILABILITY AND IMPLEMENTATION: GEO2Enrichr is an open source tool, freely available for installation as browser extensions at the Chrome Web Store and FireFox Add-ons. Documentation and a browser independent web application can be found at http://amp.pharm.mssm.edu/g2e/. CONTACT: avi.maayan@mssm.edu.
Authors: Neil R Clark; Kevin S Hu; Axel S Feldmann; Yan Kou; Edward Y Chen; Qiaonan Duan; Avi Ma'ayan Journal: BMC Bioinformatics Date: 2014-03-21 Impact factor: 3.169
Authors: Edward Y Chen; Christopher M Tan; Yan Kou; Qiaonan Duan; Zichen Wang; Gabriela Vaz Meirelles; Neil R Clark; Avi Ma'ayan Journal: BMC Bioinformatics Date: 2013-04-15 Impact factor: 3.169
Authors: Kathleen M Jagodnik; Simon Koplev; Sherry L Jenkins; Lucila Ohno-Machado; Benedict Paten; Stephan C Schurer; Michel Dumontier; Ruben Verborgh; Alex Bui; Peipei Ping; Neil J McKenna; Ravi Madduri; Ajay Pillai; Avi Ma'ayan Journal: J Biomed Inform Date: 2017-05-10 Impact factor: 6.317
Authors: Alejandro A Pezzulo; Rosarie A Tudas; Carley G Stewart; Luis G Vargas Buonfiglio; Brian D Lindsay; Peter J Taft; Nicholas D Gansemer; Joseph Zabner Journal: J Clin Invest Date: 2019-01-14 Impact factor: 14.808
Authors: Maxim V Kuleshov; Matthew R Jones; Andrew D Rouillard; Nicolas F Fernandez; Qiaonan Duan; Zichen Wang; Simon Koplev; Sherry L Jenkins; Kathleen M Jagodnik; Alexander Lachmann; Michael G McDermott; Caroline D Monteiro; Gregory W Gundersen; Avi Ma'ayan Journal: Nucleic Acids Res Date: 2016-05-03 Impact factor: 16.971
Authors: Neil R Clark; Maciej Szymkiewicz; Zichen Wang; Caroline D Monteiro; Matthew R Jones; Avi Ma'ayan Journal: Proceedings (IEEE Int Conf Bioinformatics Biomed) Date: 2015-11
Authors: Lu Zhang; Zichen Wang; Ruijie Liu; Zhengzhe Li; Jennifer Lin; Megan L Wojciechowicz; Jiyi Huang; Kyung Lee; Avi Ma'ayan; John Cijiang He Journal: Diabetes Date: 2020-10-16 Impact factor: 9.461
Authors: Zhuorui Xie; Allison Bailey; Maxim V Kuleshov; Daniel J B Clarke; John E Evangelista; Sherry L Jenkins; Alexander Lachmann; Megan L Wojciechowicz; Eryk Kropiwnicki; Kathleen M Jagodnik; Minji Jeon; Avi Ma'ayan Journal: Curr Protoc Date: 2021-03
Authors: John Erol Evangelista; Daniel J B Clarke; Zhuorui Xie; Alexander Lachmann; Minji Jeon; Kerwin Chen; Kathleen M Jagodnik; Sherry L Jenkins; Maxim V Kuleshov; Megan L Wojciechowicz; Stephan C Schürer; Mario Medvedovic; Avi Ma'ayan Journal: Nucleic Acids Res Date: 2022-05-07 Impact factor: 19.160
Authors: Sathya Neelature Sriramareddy; Fernanda Faião-Flores; Michael F Emmons; Biswarup Saha; Srikumar Chellappan; Clayton Wyatt; Inna Smalley; Jonathan D Licht; Michael A Durante; J William Harbour; Keiran S M Smalley Journal: Cancer Gene Ther Date: 2022-03-24 Impact factor: 5.854