Literature DB >> 23257199

InCroMAP: integrated analysis of cross-platform microarray and pathway data.

Clemens Wrzodek1, Johannes Eichner, Finja Büchel, Andreas Zell.   

Abstract

SUMMARY: Microarrays are commonly used to detect changes in gene expression between different biological samples. For this purpose, many analysis tools have been developed that offer visualization, statistical analysis and more sophisticated analysis methods. Most of these tools are designed specifically for messenger RNA microarrays. However, today, more and more different microarray platforms are available. Changes in DNA methylation, microRNA expression or even protein phosphorylation states can be detected with specialized arrays. For these microarray technologies, the number of available tools is small compared with mRNA analysis tools. Especially, a joint analysis of different microarray platforms that have been used on the same set of biological samples is hardly supported by most microarray analysis tools. Here, we present InCroMAP, a tool for the analysis and visualization of high-level microarray data from individual or multiple different platforms. Currently, InCroMAP supports mRNA, microRNA, DNA methylation and protein modification datasets. Several methods are offered that allow for an integrated analysis of data from those platforms. The available features of InCroMAP range from visualization of DNA methylation data over annotation of microRNA targets and integrated gene set enrichment analysis to a joint visualization of data from all platforms in the context of metabolic or signalling pathways. AVAILABILITY: InCroMAP is freely available as Java™ application at www.cogsys.cs.uni-tuebingen.de/software/InCroMAP, including a comprehensive user's guide and example files.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 23257199      PMCID: PMC3570209          DOI: 10.1093/bioinformatics/bts709

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Typical workflows for the analysis of microarray data involve several steps, namely, the preparation of samples and arrays, their hybridization to arrays, scanning the array and processing the image to read out the raw probe intensities. Depending on the array type, several quality control and low-level data analysis steps are then performed in silico. These steps mostly include normalization, annotation of gene identifiers and the calculation of diverse measures of differential probe-level intensities (such as P-values, fold changes or log ratios). Mostly, these tasks are performed in R, a statistical programming language (www.r-project.org) or by using derived applications with a graphical user interface (e.g. Mayday; Dietzsch ). The processed datasets can then be used in various high-level data analysis tools for further evaluation and data mining. A popular example is the commercial Ingenuity Pathway Analysis software (www.ingenuity.com), which links processed microarray datasets with pathway analysis. However, most of these high-level analysis tools are specialized on single platforms, and only a few approaches are available for an integrated analysis of high-throughput data from heterogenous platforms. Furthermore, not many software tools are freely available that offer suitable and easy-to-use analysis and visualization techniques for microarray platforms, other than mRNA expression arrays. Therefore, we developed InCroMAP, a user-friendly and interactive application with a graphical user interface that is specialized on an integrated analysis of cross-platform microarray and pathway data. InCroMAP supports DNA methylation, messenger RNA, microRNA and protein modification datasets. Besides these platforms, it is possible to import data from any platform that contains expression values that can somehow be assigned to genes. A special emphasis has been put on the usability of the application. Hence, all required files, for example, for mapping gene identifiers to gene symbols, annotating mRNA targets to microRNAs or pathways to visualize, are either directly included in the application or downloaded dynamically in the background.

2 RESULTS

To integrate data from multiple platforms, a common denominator must be established. The vast majority of all data are somehow associated with genes. Hence, integration of multiple data types is performed by mapping each probe to a gene. This procedure is straightforward for protein or mRNA datasets. DNA methylation datasets are region based and can be mapped onto genes by defining a window upstream and downstream of each gene’s transcription start site. InCroMAP proposes a window of −2000 and +500 bp as default region, but users may change these values. Integration of microRNA data is performed by annotating the genes of the mRNA targets to each microRNA. For this task, the user can choose between three microRNA target databases that contain experimentally verified targets and three databases with predicted targets (listed in Fig. 1B; databases reviewed in Alexiou ).
Fig. 1.

Different views of InCroMAP. (A) The pop-up menu shows different methods that are provided for a joint analysis of heterogeneous microarray platforms. (B) MicroRNA datasets can be annotated with three experimental and three predicted microRNA target databases directly from within the application. In the background, the result of the ‘integrate heterogeneous data’ procedure is shown. (C) Integrated pathway-based visualization of heterogenous microarray datasets allows to visualize up to four different platforms in a single pathway (here: excerpt from the ‘MAPK signalling’ pathway). Pathway nodes can be selected to get more detailed information, including various plots for all assigned expression values (here: DNA methylation in the promoter region of Egfr)

Different views of InCroMAP. (A) The pop-up menu shows different methods that are provided for a joint analysis of heterogeneous microarray platforms. (B) MicroRNA datasets can be annotated with three experimental and three predicted microRNA target databases directly from within the application. In the background, the result of the ‘integrate heterogeneous data’ procedure is shown. (C) Integrated pathway-based visualization of heterogenous microarray datasets allows to visualize up to four different platforms in a single pathway (here: excerpt from the ‘MAPK signalling’ pathway). Pathway nodes can be selected to get more detailed information, including various plots for all assigned expression values (here: DNA methylation in the promoter region of Egfr) A first approach to integratively investigate data from any two platforms is the ‘data-pairing’ procedure. This procedure shows two datasets next to each other, thus, simplifying common lookup task, such as investigating the effect of a differentially methylated promoter on mRNA level. Further, this view is especially suitable to inspect the effect of microRNA expression on target mRNAs. An arbitrary amount of data from different platforms can be inspected, using the ‘integrate heterogenous data’ procedure. To keep the clarity, only the most relevant information, that is, the expression values (as fold changes or P-values) are shown. Therefore, one row is created for each gene and one column for each platform. A hierarchical representation of the table allows for expanding nodes to get more information, such as all microRNAs targeting this gene’s mRNA (Fig. 1B). A popular method for a generic analysis of expression data is performing a gene set enrichment. We have extended this procedure to an integrated gene set enrichment that is able to perform enrichments across multiple platforms. The user can choose the datasets and thresholds for each dataset to calculate a P-value, using a hypergeometric test for each predefined gene set (Backes ). InCroMAP supports gene sets from the KEGG PATHWAY database (Kanehisa ), Gene Ontology and any gene set from the molecular signatures database (www.broadinstitute.org/gsea/msigdb/). Furthermore, BioPAX Level 2 and Level 3 pathways can be imported for visualization in InCroMAP. The results of a pathway enrichment can further be visualized in metapathways (e.g. the ‘metabolic pathways’ map), together with mRNA expression data and enriched sub-pathways. All pathways are visualized using KEGGtranslator (Wrzodek ), and InCroMAP extends these pathways by visualizing expression data from each single platform therein. Therefore, node colour is changed according to mRNA expression, and small boxes are added and coloured according to each protein modification’s expression value. MicroRNAs are added as small coloured triangles to the graph and are connected to their targets with edges. DNA methylation data are indicated with a black bar that shows the maximum differential peak in each gene’s promoter (stretching from the middle to the left to indicate hypomethylation and to the right for hypermethylation). This is an interactive graph, therefore, allowing users to modify the layout and selecting nodes to get more detailed information and plots of the associated expression data. Besides those integrated analysis methods, InCroMAP allows plotting region-based DNA methylation data in a genome plot with boxes for gene bodies, which in turn can be coloured according to mRNA expression. Further, all enrichments can also be performed on any single dataset, which is straightforward for mRNA or protein datasets, but implementations that can also handle DNA methylation or microRNA data are less common.
  5 in total

1.  Mayday--a microarray data analysis workbench.

Authors:  Janko Dietzsch; Nils Gehlenborg; Kay Nieselt
Journal:  Bioinformatics       Date:  2006-02-24       Impact factor: 6.937

Review 2.  Lost in translation: an assessment and perspective for computational microRNA target identification.

Authors:  Panagiotis Alexiou; Manolis Maragkakis; Giorgos L Papadopoulos; Martin Reczko; Artemis G Hatzigeorgiou
Journal:  Bioinformatics       Date:  2009-09-29       Impact factor: 6.937

3.  KEGGtranslator: visualizing and converting the KEGG PATHWAY database to various formats.

Authors:  Clemens Wrzodek; Andreas Dräger; Andreas Zell
Journal:  Bioinformatics       Date:  2011-06-23       Impact factor: 6.937

4.  From genomics to chemical genomics: new developments in KEGG.

Authors:  Minoru Kanehisa; Susumu Goto; Masahiro Hattori; Kiyoko F Aoki-Kinoshita; Masumi Itoh; Shuichi Kawashima; Toshiaki Katayama; Michihiro Araki; Mika Hirakawa
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

5.  GeneTrail--advanced gene set enrichment analysis.

Authors:  Christina Backes; Andreas Keller; Jan Kuentzer; Benny Kneissl; Nicole Comtesse; Yasser A Elnakady; Rolf Müller; Eckart Meese; Hans-Peter Lenhof
Journal:  Nucleic Acids Res       Date:  2007-05-25       Impact factor: 16.971

  5 in total
  10 in total

1.  Bone healing in an aged murine fracture model is characterized by sustained callus inflammation and decreased cell proliferation.

Authors:  John H Hebb; Jason W Ashley; Lee McDaniel; Luke A Lopas; John Tobias; Kurt D Hankenson; Jaimo Ahn
Journal:  J Orthop Res       Date:  2017-10-09       Impact factor: 3.494

2.  The integrative analysis of DNA methylation and mRNA expression profiles confirmed the role of selenocompound metabolism pathway in Kashin-Beck disease.

Authors:  Ping Li; Yujie Ning; Weizhuo Wang; Xiong Guo; Blandine Poulet; Xi Wang; Yan Wen; Jing Han; Jingcan Hao; Xiao Liang; Li Liu; Yanan Du; Bolun Cheng; Shiqiang Cheng; Lu Zhang; Mei Ma; Xin Qi; Chujun Liang; Cuiyan Wu; Sen Wang; Hongmou Zhao; Guanghui Zhao; Mary B Goldring; Feng Zhang; Peng Xu
Journal:  Cell Cycle       Date:  2020-08-20       Impact factor: 4.534

Review 3.  Strategies for Integrated Analysis of Genetic, Epigenetic, and Gene Expression Variation in Cancer: Addressing the Challenges.

Authors:  Louise B Thingholm; Lars Andersen; Enes Makalic; Melissa C Southey; Mads Thomassen; Lise Lotte Hansen
Journal:  Front Genet       Date:  2016-02-01       Impact factor: 4.599

4.  Integrating genome-wide DNA methylation and mRNA expression profiles identified different molecular features between Kashin-Beck disease and primary osteoarthritis.

Authors:  Yan Wen; Ping Li; Jingcan Hao; Chen Duan; Jing Han; Awen He; Yanan Du; Li Liu; Xiao Liang; Feng Zhang; Xiong Guo
Journal:  Arthritis Res Ther       Date:  2018-03-07       Impact factor: 5.156

Review 5.  Advances in metabolome information retrieval: turning chemistry into biology. Part II: biological information recovery.

Authors:  Abdellah Tebani; Carlos Afonso; Soumeya Bekri
Journal:  J Inherit Metab Dis       Date:  2017-08-25       Impact factor: 4.982

6.  The arginine methyltransferase PRMT7 promotes extravasation of monocytes resulting in tissue injury in COPD.

Authors:  Gizem Günes Günsel; Thomas M Conlon; Aicha Jeridi; Rinho Kim; Zeynep Ertüz; Niklas J Lang; Meshal Ansari; Mariia Novikova; Dongsheng Jiang; Maximilian Strunz; Mariia Gaianova; Christine Hollauer; Christina Gabriel; Ilias Angelidis; Sebastian Doll; Jeanine C Pestoni; Stephanie L Edelmann; Marlene Sophia Kohlhepp; Adrien Guillot; Kevin Bassler; Hannelore P Van Eeckhoutte; Özgecan Kayalar; Nur Konyalilar; Tamara Kanashova; Sophie Rodius; Carolina Ballester-López; Carlos M Genes Robles; Natalia Smirnova; Markus Rehberg; Charu Agarwal; Ioanna Krikki; Benoit Piavaux; Stijn E Verleden; Bart Vanaudenaerde; Melanie Königshoff; Gunnar Dittmar; Ken R Bracke; Joachim L Schultze; Henrik Watz; Oliver Eickelberg; Tobias Stoeger; Gerald Burgstaller; Frank Tacke; Vigo Heissmeyer; Yuval Rinkevich; Hasan Bayram; Herbert B Schiller; Marcus Conrad; Robert Schneider; Ali Önder Yildirim
Journal:  Nat Commun       Date:  2022-03-14       Impact factor: 17.694

7.  LncRNA Ctcflos orchestrates transcription and alternative splicing in thermogenic adipogenesis.

Authors:  Andrea Bast-Habersbrunner; Christoph Kiefer; Peter Weber; Tobias Fromme; Anna Schießl; Petra C Schwalie; Bart Deplancke; Yongguo Li; Martin Klingenspor
Journal:  EMBO Rep       Date:  2021-05-31       Impact factor: 8.807

8.  Evaluation of toxicogenomics approaches for assessing the risk of nongenotoxic carcinogenicity in rat liver.

Authors:  Johannes Eichner; Clemens Wrzodek; Michael Römer; Heidrun Ellinger-Ziegelbauer; Andreas Zell
Journal:  PLoS One       Date:  2014-05-14       Impact factor: 3.240

9.  ZBIT Bioinformatics Toolbox: A Web-Platform for Systems Biology and Expression Data Analysis.

Authors:  Michael Römer; Johannes Eichner; Andreas Dräger; Clemens Wrzodek; Finja Wrzodek; Andreas Zell
Journal:  PLoS One       Date:  2016-02-16       Impact factor: 3.240

Review 10.  Beyond genomics: understanding exposotypes through metabolomics.

Authors:  Nicholas J W Rattray; Nicole C Deziel; Joshua D Wallach; Sajid A Khan; Vasilis Vasiliou; John P A Ioannidis; Caroline H Johnson
Journal:  Hum Genomics       Date:  2018-01-26       Impact factor: 4.639

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.