Literature DB >> 21546551

iPath2.0: interactive pathway explorer.

Takuji Yamada1, Ivica Letunic, Shujiro Okuda, Minoru Kanehisa, Peer Bork.   

Abstract

iPath2.0 is a web-based tool (http://pathways.embl.de) for the visualization and analysis of cellular pathways. Its primary map summarizes the metabolism in biological systems as annotated to date. Nodes in the map correspond to various chemical compounds and edges represent series of enzymatic reactions. In two other maps, iPath2.0 provides an overview of secondary metabolite biosynthesis and a hand-picked selection of important regulatory pathways and other functional modules, allowing a more general overview of protein functions in a genome or metagenome. iPath2.0's main interface is an interactive Flash-based viewer, which allows users to easily navigate and explore the complex pathway maps. In addition to the default pre-computed overview maps, iPath offers several data mapping tools. Users can upload various types of data and completely customize all nodes and edges of iPath2.0's maps. These customized maps give users an intuitive overview of their own data, guiding the analysis of various genomics and metagenomics projects.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21546551      PMCID: PMC3125749          DOI: 10.1093/nar/gkr313

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Genomes contain a variety of genes of high functional divergence. Interpretation of these huge datasets often requires an overview of various functional traits encoded by the genes (1). Metagenomics or recent pan-genomics projects (2) increase those demands even further. The KEGG database provides hand-curated pathway diagrams, and recently made their overview pathway diagram for metabolism publicly available in SVG (scalable vector graphics) format (3,4). Several tools have been developed since to utilize this diagram and the underlying database (5,6). In our previous study, we have developed a web-based tool that provides a simple interface to navigate and customize those overview pathways, iPath (7). Here, we report iPath version 2.0 (hereafter iPath2.0) with a considerably expanded amount of underlying data, numerous changes to its data mapping capabilities and a completely overhauled interactive user interface. The underlying global pathway map, which was originally constructed using ∼120 KEGG metabolic pathways in the previous version, has been greatly extended in the current version. iPath2.0 gives overviews of (i) the complete central metabolism in biological systems, (ii) secondary metabolite biosynthesis pathways and (iii) regulatory pathways and functional modules. In total, the three overview pathway diagrams currently cover 172 pathways or functional modules. Nodes in the map correspond to various chemical compounds and edges represent series of enzymatic reactions or protein complexes. This upgrade considerably extends its usefulness in various genome, metagenome, transcriptome or proteome analysis projects.

FEATURES

iPath2.0 is an online tool, accessible using any modern web browser. Pathway diagrams are displayed through an interactive environment developed in Adobe Flex (http://www.adobe.com/products/flex/). Data mapping and customization of various maps are performed on our web server, using a set of Perl scripts and a PostgreSQL-based relational database, thus considerably reducing the local CPU needs of users.

USER INTERFACE AND BASIC FUNCTIONS

iPath2.0′s main interface allows users to easily navigate and explore the complex pathway maps. The viewer provides zooming and panning controls, with different levels of map details corresponding to various zoom levels. Clicking on nodes and edges in the map displays a popup window with detailed information about the associated data, such as enzymes, reactions and compounds involved. Names and identifiers of these associated data can be searched using the built in keyword search engine, allowing users to quickly identify map elements of interest.

UNDERLYING DATA SET

iPath contains 172 pathways or functional modules and 3733 protein orthologous groups defined in KEGG (KOs) based on sequence similarity and manual curation, which are mapped to the respective 4392 clusters of orthologous groups (COGs) (8) and other ortholgous groups derived from the eggNOG database for cross reference (9). These orthologous groups represent enzymes in metabolic or other pathways, parts protein complexes or other components of functional modules. The content of iPath2.0 is summarized in three separate overview maps. The first one represents the central metabolism, composed of 145 pathways (2130 reactions); for example, glycolysis or amino acid metabolism. The second map gives an overview of 58 metabolic pathways mainly for secondary metabolite biosynthesis (53 pathways are shared with central metabolism; however, 5 unique pathways contains 656 reactions for this overview pathway diagram), such as polyketide biosynthesis, which has high evolutionary diversity. The third one contains 22 regulatory pathways or functional modules such as ribosome or transport systems. In addition to the default overview maps, iPath offers species-specific pathways for 933 fully sequenced genomes derived from their orthologous protein information defined in KEGG. In the current version, iPath’s maps do not cover all genes of an organism. For example, Escherichia coli has 2549 annotated COGs/NOGs in the eggNOG database corresponding to 4493 of its genes, with only 970 currently covered in iPath as many genes are functionally ill defined or without functional context (859 COGs for 1149 genes in E. coli are classified into ‘poorly characterized’ category), thus the vast majority of well-understood function is already in the map.

PATHWAY CUSTOMIZATION

In addition to the default pre-computed overview maps, iPath2.0 offers a number of useful data mapping tools and extensive customization options. Users can upload various types of data associated with genes, proteins or compounds to generate custom representations of any overview or species-specific pathways map (for examples see below).

INPUT USER DATA

Users can upload query data in plain text, and can define colors, opacity and width for various nodes and edges in the map. Detailed explanation of parameters and example customizations are available in the iPath online help pages. The following types of data can be used to specify parts of the map to customize: KEGG pathways, KEGG compounds, KEGG KOs, KEGG proteins, enzyme EC numbers, COGs, eggNOG orthologous groups, KEGG modules (3) and STRING proteins (10). iPath also contains species information as described in the section on the underlying data sets, and, using a NCBI taxonomy ID or three-letter KEGG organism code, allows users to display only customized versions of species-specific pathways. Correct taxonomy IDs can be selected using the built-in species search engine. iPath can store map customizations within its database, allowing users to simply reload them in future visits.

DATA EXPORT

Customized maps are displayed in the interactive viewer by default, providing the same functionality available for default maps. However, customized maps can also be exported into several graphical formats, both vector and bitmap, for direct inclusion into publications or other documents. Currently supported formats are Scalable Vector Graphics (svg), Portable Network Graphics (png), Encapsulated Postscript (eps), Postscript (ps) and Portable Document Format (pdf).

ILLUSTRATIVE EXAMPLES

Customized maps generated by iPath allow users to digest their own data in the context of genomic or metagenomic projects. Here, we show an example of the mapping describing the enzymatic activity in human gut microbiota (Figure 1). Using the SmashCommunity pipeline for phylogenetic and functional annotation (11), metagenomic sequence reads from fecal samples of 13 Japanese individuals (12) were mapped to orthologous groups (KOs) defined in the KEGG database via the STRING database (10), and the abundance of those KOs in each sample was calculated. As highly abundant gene orthologous groups across samples might encode crucial functions for human gut microbiome (13), we computed the average abundance of each KOs in 13 metagenomic samples and projected them as widths of edges in the overview pathway diagram (Figure 1). Several pathways, such as Glycerolipid metabolism or the Sec-dependent pathway, show overrepresentation in comparison to the other pathways detected in the data set. Further analysis is required to reveal the detailed associations between abundant pathways and human gut, however, iPath easily provides a general overview of the functionality.
Figure 1.

Enzymatic activity in the human gut microbiome: the nodes colored in red represent enzymes or proteins detected in metagenomic data derived from human fecal samples (12). Users can easily get an overview of complete metabolic and other pathways. In this example, enzymes in glycerolipid metabolism (center left) or proteins in protein export systems (center right) are overrepresented compared to other pathways in the data set. Line widths in the map are proportional to the level of overrepresentation.

Enzymatic activity in the human gut microbiome: the nodes colored in red represent enzymes or proteins detected in metagenomic data derived from human fecal samples (12). Users can easily get an overview of complete metabolic and other pathways. In this example, enzymes in glycerolipid metabolism (center left) or proteins in protein export systems (center right) are overrepresented compared to other pathways in the data set. Line widths in the map are proportional to the level of overrepresentation. In addition to visualizing the metabolic capacity of the human microbiome, iPath2.0 has already been proven as a useful tool in a number of other genomic or metagenomic analyses (1,14–16).

COMPARISON WITH OTHER TOOLS

In order to quantify functionality of iPath2.0, we listed four functional categories [(i) User interface, (ii) Customization capability, (iii) Data mapping and (iv) Functional coverage] and subdivided them into 15 detailed features. Using these features, we compared iPath2.0 with KEGG Atlas (4), Pathway Projector (6) and the previous version of iPath (7) (see Supplementary Table). All tools provide integrated pathways and zooming/panning capability. Compared to the original version, keyword search and mouse over popups have been implemented in iPath2.0. iPath2.0 enables users to map COG/eggNOG, UniProt and STRING IDs directly into pathway diagrams in addition to KEGG IDs, strengthening its advantage in customization capabilities and data mapping. In addition, iPath2.0 is the only tool that provides an overview for regulatory pathways, which is a vital point for functional coverage. Taken together, iPath2.0 has the advantage in making customized maps. Although multiple conditional data is not covered by iPath2.0 yet, it will be provided in the next version.

FUTURE DIRECTIONS

The current version of iPath provides powerful visualization and customization of cellular pathway diagrams. However, a significant amount of manual intervention and analysis is still required to identify interesting pathways in a particular data set. To simplify this process, we are planning to develop an API which will enable programmatic access to iPath by end users and other software packages. Combined with other tools developed by our group, such as iTOL(17) and SmashCommunity (11), iPath will become an integral part of a Pathway-analysis-suite and will also contain more functionality, for example, to display differences between two data sets.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR online.

FUNDING

Funding for open access charge: European Molecular Biology Laboratory. Conflict of interest statement. None declared.
  17 in total

1.  Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation.

Authors:  Ivica Letunic; Peer Bork
Journal:  Bioinformatics       Date:  2006-10-18       Impact factor: 6.937

2.  Integration of biological networks and gene expression data using Cytoscape.

Authors:  Melissa S Cline; Michael Smoot; Ethan Cerami; Allan Kuchinsky; Nerius Landys; Chris Workman; Rowan Christmas; Iliana Avila-Campilo; Michael Creech; Benjamin Gross; Kristina Hanspers; Ruth Isserlin; Ryan Kelley; Sarah Killcoyne; Samad Lotia; Steven Maere; John Morris; Keiichiro Ono; Vuk Pavlovic; Alexander R Pico; Aditya Vailaya; Peng-Liang Wang; Annette Adler; Bruce R Conklin; Leroy Hood; Martin Kuiper; Chris Sander; Ilya Schmulevich; Benno Schwikowski; Guy J Warner; Trey Ideker; Gary D Bader
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

3.  iPath: interactive exploration of biochemical pathways and networks.

Authors:  Ivica Letunic; Takuji Yamada; Minoru Kanehisa; Peer Bork
Journal:  Trends Biochem Sci       Date:  2008-02-13       Impact factor: 13.807

4.  Quantifying environmental adaptation of metabolic pathways in metagenomics.

Authors:  Tara A Gianoulis; Jeroen Raes; Prianka V Patel; Robert Bjornson; Jan O Korbel; Ivica Letunic; Takuji Yamada; Alberto Paccanaro; Lars J Jensen; Michael Snyder; Peer Bork; Mark B Gerstein
Journal:  Proc Natl Acad Sci U S A       Date:  2009-01-22       Impact factor: 11.205

5.  The ecoresponsive genome of Daphnia pulex.

Authors:  John K Colbourne; Michael E Pfrender; Donald Gilbert; W Kelley Thomas; Abraham Tucker; Todd H Oakley; Shinichi Tokishita; Andrea Aerts; Georg J Arnold; Malay Kumar Basu; Darren J Bauer; Carla E Cáceres; Liran Carmel; Claudio Casola; Jeong-Hyeon Choi; John C Detter; Qunfeng Dong; Serge Dusheyko; Brian D Eads; Thomas Fröhlich; Kerry A Geiler-Samerotte; Daniel Gerlach; Phil Hatcher; Sanjuro Jogdeo; Jeroen Krijgsveld; Evgenia V Kriventseva; Dietmar Kültz; Christian Laforsch; Erika Lindquist; Jacqueline Lopez; J Robert Manak; Jean Muller; Jasmyn Pangilinan; Rupali P Patwardhan; Samuel Pitluck; Ellen J Pritham; Andreas Rechtsteiner; Mina Rho; Igor B Rogozin; Onur Sakarya; Asaf Salamov; Sarah Schaack; Harris Shapiro; Yasuhiro Shiga; Courtney Skalitzky; Zachary Smith; Alexander Souvorov; Way Sung; Zuojian Tang; Dai Tsuchiya; Hank Tu; Harmjan Vos; Mei Wang; Yuri I Wolf; Hideo Yamagata; Takuji Yamada; Yuzhen Ye; Joseph R Shaw; Justen Andrews; Teresa J Crease; Haixu Tang; Susan M Lucas; Hugh M Robertson; Peer Bork; Eugene V Koonin; Evgeny M Zdobnov; Igor V Grigoriev; Michael Lynch; Jeffrey L Boore
Journal:  Science       Date:  2011-02-04       Impact factor: 47.728

6.  KEGG Atlas mapping for global analysis of metabolic pathways.

Authors:  Shujiro Okuda; Takuji Yamada; Masami Hamajima; Masumi Itoh; Toshiaki Katayama; Peer Bork; Susumu Goto; Minoru Kanehisa
Journal:  Nucleic Acids Res       Date:  2008-05-13       Impact factor: 16.971

7.  Comparative metagenomics revealed commonly enriched gene sets in human gut microbiomes.

Authors:  Ken Kurokawa; Takehiko Itoh; Tomomi Kuwahara; Kenshiro Oshima; Hidehiro Toh; Atsushi Toyoda; Hideto Takami; Hidetoshi Morita; Vineet K Sharma; Tulika P Srivastava; Todd D Taylor; Hideki Noguchi; Hiroshi Mori; Yoshitoshi Ogura; Dusko S Ehrlich; Kikuji Itoh; Toshihisa Takagi; Yoshiyuki Sakaki; Tetsuya Hayashi; Masahira Hattori
Journal:  DNA Res       Date:  2007-10-03       Impact factor: 4.458

8.  KEGG for representation and analysis of molecular networks involving diseases and drugs.

Authors:  Minoru Kanehisa; Susumu Goto; Miho Furumichi; Mao Tanabe; Mika Hirakawa
Journal:  Nucleic Acids Res       Date:  2009-10-30       Impact factor: 16.971

9.  Pathway projector: web-based zoomable pathway browser using KEGG atlas and Google Maps API.

Authors:  Nobuaki Kono; Kazuharu Arakawa; Ryu Ogawa; Nobuhiro Kido; Kazuki Oshita; Keita Ikegami; Satoshi Tamaki; Masaru Tomita
Journal:  PLoS One       Date:  2009-11-11       Impact factor: 3.240

10.  The COG database: an updated version includes eukaryotes.

Authors:  Roman L Tatusov; Natalie D Fedorova; John D Jackson; Aviva R Jacobs; Boris Kiryutin; Eugene V Koonin; Dmitri M Krylov; Raja Mazumder; Sergei L Mekhedov; Anastasia N Nikolskaya; B Sridhar Rao; Sergei Smirnov; Alexander V Sverdlov; Sona Vasudevan; Yuri I Wolf; Jodie J Yin; Darren A Natale
Journal:  BMC Bioinformatics       Date:  2003-09-11       Impact factor: 3.169

View more
  134 in total

1.  Rhizosphere microbiome assemblage is affected by plant development.

Authors:  Jacqueline M Chaparro; Dayakar V Badri; Jorge M Vivanco
Journal:  ISME J       Date:  2013-11-07       Impact factor: 10.302

2.  Aldolase B-Mediated Fructose Metabolism Drives Metabolic Reprogramming of Colon Cancer Liver Metastasis.

Authors:  Pengcheng Bu; Kai-Yuan Chen; Kun Xiang; Christelle Johnson; Scott B Crown; Nikolai Rakhilin; Yiwei Ai; Lihua Wang; Rui Xi; Inna Astapova; Yan Han; Jiahe Li; Bradley B Barth; Min Lu; Ziyang Gao; Robert Mines; Liwen Zhang; Mark Herman; David Hsu; Guo-Fang Zhang; Xiling Shen
Journal:  Cell Metab       Date:  2018-04-26       Impact factor: 27.287

3.  A catalog of microbial genes from the bovine rumen unveils a specialized and diverse biomass-degrading environment.

Authors:  Junhua Li; Huanzi Zhong; Yuliaxis Ramayo-Caldas; Nicolas Terrapon; Vincent Lombard; Gabrielle Potocki-Veronese; Jordi Estellé; Milka Popova; Ziyi Yang; Hui Zhang; Fang Li; Shanmei Tang; Fangming Yang; Weineng Chen; Bing Chen; Jiyang Li; Jing Guo; Cécile Martin; Emmanuelle Maguin; Xun Xu; Huanming Yang; Jian Wang; Lise Madsen; Karsten Kristiansen; Bernard Henrissat; Stanislav D Ehrlich; Diego P Morgavi
Journal:  Gigascience       Date:  2020-06-01       Impact factor: 6.524

4.  Putting the pieces together: high-performance LC-MS/MS provides network-, pathway-, and protein-level perspectives in Populus.

Authors:  Paul Abraham; Richard J Giannone; Rachel M Adams; Udaya Kalluri; Gerald A Tuskan; Robert L Hettich
Journal:  Mol Cell Proteomics       Date:  2012-10-16       Impact factor: 5.911

Review 5.  Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review.

Authors:  Peter Csermely; Tamás Korcsmáros; Huba J M Kiss; Gábor London; Ruth Nussinov
Journal:  Pharmacol Ther       Date:  2013-02-04       Impact factor: 12.310

6.  Modeling ecological drivers in marine viral communities using comparative metagenomics and network analyses.

Authors:  Bonnie L Hurwitz; Anton H Westveld; Jennifer R Brum; Matthew B Sullivan
Journal:  Proc Natl Acad Sci U S A       Date:  2014-07-07       Impact factor: 11.205

7.  A graphical and computational modeling platform for biological pathways.

Authors:  Alessandra Livigni; Laura O'Hara; Marta E Polak; Tim Angus; Derek W Wright; Lee B Smith; Tom C Freeman
Journal:  Nat Protoc       Date:  2018-03-15       Impact factor: 13.491

8.  Data processing, multi-omic pathway mapping, and metabolite activity analysis using XCMS Online.

Authors:  Erica M Forsberg; Tao Huan; Duane Rinehart; H Paul Benton; Benedikt Warth; Brian Hilmers; Gary Siuzdak
Journal:  Nat Protoc       Date:  2018-03-01       Impact factor: 13.491

9.  Metatranscriptomics analysis of cyanobacterial aggregates during cyanobacterial bloom period in Lake Taihu, China.

Authors:  Zhenzhu Chen; Junyi Zhang; Rui Li; Fei Tian; Yanting Shen; Xueying Xie; Qinyu Ge; Zuhong Lu
Journal:  Environ Sci Pollut Res Int       Date:  2017-12-03       Impact factor: 4.223

10.  Microaerophilic alkane degradation in Pseudomonas extremaustralis: a transcriptomic and physiological approach.

Authors:  Paula M Tribelli; Leticia Rossi; Martiniano M Ricardi; Maria Gomez-Lozano; Søren Molin; Laura J Raiger Iustman; Nancy I Lopez
Journal:  J Ind Microbiol Biotechnol       Date:  2017-11-07       Impact factor: 3.346

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.