| Literature DB >> 25462153 |
Lv Jin1, Xiao-Yu Zuo2, Wei-Yang Su3, Xiao-Lei Zhao1, Man-Qiong Yuan4, Li-Zhen Han2, Xiang Zhao1, Ye-Da Chen1, Shao-Qi Rao5.
Abstract
Genetic studies are traditionally based on single-gene analysis. The use of these analyses can pose tremendous challenges for elucidating complicated genetic interplays involved in complex human diseases. Modern pathway-based analysis provides a technique, which allows a comprehensive understanding of the molecular mechanisms underlying complex diseases. Extensive studies utilizing the methods and applications for pathway-based analysis have significantly advanced our capacity to explore large-scale omics data, which has rapidly accumulated in biomedical fields. This article is a comprehensive review of the pathway-based analysis methods-the powerful methods with the potential to uncover the biological depths of the complex diseases. The general concepts and procedures for the pathway-based analysis methods are introduced and then, a comprehensive review of the major approaches for this analysis is presented. In addition, a list of available pathway-based analysis software and databases is provided. Finally, future directions and challenges for the methodological development and applications of pathway-based analysis techniques are discussed. This review will provide a useful guide to dissect complex diseases.Entities:
Keywords: Algorithms; Complex disease; Pathway-based analysis; Software and databases
Mesh:
Year: 2014 PMID: 25462153 PMCID: PMC4411419 DOI: 10.1016/j.gpb.2014.10.002
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Algorithms and their applications in pathway-based analysis
| Over-representation analysis | Fisher’s exact test | SNP | |
| Gene set-based scoring | GSEA | Microarray/SNP | |
| GSA | SNP | ||
| SRT | SNP | ||
| LRpath | Microarray | ||
| Multivariate approaches | A two-stage approach | SNP | |
| SPCA | SNP | ||
| Logistic kernel machine regression | Microarray/SNP | ||
| Topological-based analysis | SPIA | Microarray | |
| CliPPER | Microarray |
Note: GSEA, gene set enrichment analysis; GSA, gene set analysis; SRT, SNP-ratio test; SPCA, supervised principal component analysis; SPIA, signaling pathway impact analysis.
Pathway databases
| Metabolic pathways | Primarily contain a series of biochemical reactions, especially the chemical modifications of the small molecule substrates of enzymes | KEGG | |
| BioCyc | |||
| BIOPATH | |||
| EMP | |||
| Signal transduction pathways | Describe the information spread from one part or sub-process of the cell to another, generally through a series of covalent modifications of protein | CSNDB | |
| SPAD | |||
| TransPath | |||
| BBID | |||
| Protein–protein interaction pathways | Focus on interactions between proteins, most of which are derived from various large-scale experimental methods | CYGD | |
| DIP | |||
| GRID | |||
| HPRD | |||
| Transcriptional regulation pathway | Mainly concern the relationships between transcription factors and the corresponding genes they regulate | STKE | |
| BTITE | |||
| TRANSFAC | |||
| CST |
The common software and web platforms used for microarray data analysis
| PathMAPA | A tool for displaying gene expression and performing statistical tests on metabolic pathways at multiple levels for Arabidopsis, based on expression data | Local databases | |
| MetaCore | Based on a high-quality, manually-curated database, MetaCore is an integrated software suite for functional analysis of microarray, metabolic, SAGE, proteomics, NGS, copy number variation, siRNA, microRNA and screening data | MetaRodent, MetaLink, MetaSearch | |
| Ingenuity Pathway Analysis (IPA) | A comprehensive software/database search tool for finding functions and pathways for specific biological states | GO, KEGG, BIND | |
| ePath3D | An easy-to-use and powerful software for creating and managing illustrated 3D pathways for publications and presentations | eProtein, ePathway | |
| Pathway Builder | An online pathway drawing tool which is the fastest and easiest method of creating signal transduction pathways, enabling the users to design their own project or use pre-made pathway templates to help get them started | GenBank, Uniprot/Swiss-Prot, TrEMBL, KEGG, ENZYME, | |
| Interactive Pathways Explorer (iPath) | A web-based tool for the visualization, analysis and customization of various pathways maps from KEGG. The recently-released version 2 could deal with metabolic pathway, regulatory pathway and biosynthesis of secondary metabolites | KEGG | |
| GSEA-P & R-GSEA | GSEA-P is a desktop application for Gene Set Enrichment Analysis, with a friendly graphic interface. R-GSEA is provided as a standalone R program. | MSigDB, Gene Set Cards, GEO | |
| DAVID | A tool for augmenting and integrating functional annotations from other databases | KEGG,GO | |
| MetaCyc | Applications include serving as an encyclopedia of metabolism, providing a reference data set for the computational prediction of metabolic pathways in sequenced organisms, supporting metabolic engineering and helping to compare biochemical networks | KEGG, BioCyc, EcoCyc | |
| Reactome | Intuitive bioinformatics tools for the visualization, interpretation and analysis of pathway knowledge to support basic research, genome analysis, modeling, systems biology and education | KEGG | |
| GenMAPP | Designed to visualize gene expression and other genomic data on maps representing biological pathways and groupings of genes | GenMAPP, GO | |
| FunCluster | An integrative tool for analyzing gene co-expression networks from microarray expression data; the analytic model implemented in the library involves two abstraction layers: transcriptional and functional (biological roles) | GO, KEGG | |
| Graphite web | A novel web tool for pathway analyses, consisting of topological-based analysis and network visualization for gene expression data of both microarray and RNA-seq experiments | KEGG, Reactome |
The available analytical tools and web servers designed for pathway analysis on GWAS data
| INRICH | A pathway-based genome-wide association (GWA) analysis tool that tests for enriched association signals of predefined gene sets across independent genomic intervals | KEGG, GO | |
| GeSBAP | A simple implementation tool of the GSA strategy for the analysis of GWA studies, provides a significant increase in the power testing for this type of studies | GO, BioCarta, KEGG | |
| GSEA-SNP | A program for the identification of disease-associated SNPs and pathways, the understanding of the underlying biological mechanisms, and the identification of markers with weak effects, undetectable in association studies without pathway consideration | ||
| GSA-SNP | A standalone software that implements three widely-used GSA methods: Z-statistic method, Restandardized GSA and GSEA for pathway analysis, providing a fast computation and an easy-to-use interface | dbSNP, GO, MSigDB, GeneCards | |
| dmGWAS | Designed to identify significant protein–protein interaction (PPI) modules and the candidate genes, specifically adapted for GWAS datasets, including data preparation, integration, searching and validation in GWAS permutation data | GO,MINT, IntAct, DIP, BioGRID, HPRD,MIPS/MPact | |
| PLINK | A whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally-efficient manner | ||
| Path | A valuable tool for investigating gene–gene interactions in large genetic association studies, designed to help researchers interface their data with biological information from several bioinformatics resources | NCBI, OMIM, KEGG, UCSC dbSNP, IIDB, PharmGKB, | |
| Pathway-PDT | A new tool to perform pathway analysis using the framework of Pedigree Disequilibrium Test (PDT) for general family data, combining more information from raw genotypes in general nuclear families | KEGG | |
| ALIGATOR | A program for testing for Gene Ontology categories over-represented on a list of significant SNPs from a GWA analysis, defining significant SNPs by prespecified | GO, dbSNP, NCBI | http://x004.psycm.uwcm.ac.uk/≃peter/ |
| GenGen | A suite of free software tools to facilitate the analysis of high-throughput genomics data sets | KEGG, BioCarta, GO | |
| SNP ratio test | Assess significance of enrichment of significant associations in GWA studies and can be applied to pathways such as KEGG/GO or user-defined pathways to test specific hypotheses | KEGG, GO | |
| i-GSEA4GWAS | A tool for SNP label permutation, assign SNPs to genes, calculate modified GSEA enrichment score | KEGG, BioCarta, GO | |
| ICSNPathway | A tool for comprehensive interpretation of GWAS data by integrating LD analysis, functional SNP annotation and pathway-based analysis | KEGG, BioCarta, GO |