| Literature DB >> 32599927 |
Xiangrui Zeng1, Wei Zong2, Chien-Wei Lin3, Zhou Fang2, Tianzhou Ma4, David A Lewis5, John F Enwright5, George C Tseng2.
Abstract
Pathway enrichment analysis provides a knowledge-driven approach to interpret differentially expressed genes associated with disease status. Many tools have been developed to analyze a single study. However, when multiple studies of different conditions are jointly analyzed, novel integrative tools are needed. In addition, pathway redundancy introduced by combining multiple public pathway databases hinders interpretation and knowledge discovery. We present a meta-analytic integration tool, Comparative Pathway Integrator (CPI), to address these issues using adaptively weighted Fisher's method to discover consensual and differential enrichment patterns, a tight clustering algorithm to reduce pathway redundancy, and a text mining algorithm to assist interpretation of the pathway clusters. We applied CPI to jointly analyze six psychiatric disorder transcriptomic studies to demonstrate its effectiveness, and found functions confirmed by previous biological studies as well as novel enrichment patterns. CPI's R package is accessible online on Github metaOmics/MetaPath.Entities:
Keywords: meta-analysis; pathway; text mining
Year: 2020 PMID: 32599927 PMCID: PMC7348908 DOI: 10.3390/genes11060696
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Workflow of Comparative Pathway Integrator (CPI).
Figure 2Workflow of noun phrase extraction.
Figure 3Heatmap of kappa statistics of pair-wise pathways in all clusters.
Figure 4Heatmap of log10-scale pathway enrichment p-values of pathways annotated by eight pathway clusters (I-VIII) and a scattered pathway set (black).
Figure 5Hierarchical clustering of psychiatric studies in each cluster with distance defined by the log10-scale pathway enrichment p-values.
Ten significant keywords with q-value < 0.05 in each pathway cluster.
| Cluster | Keywords |
|---|---|
| I | insulin, NGF, focal adhesion, BDNF, neurotrophins, Trk tyrosine kinase receptor, insulin receptor substrate, insulin receptor tyrosine kinase, Ras MAPK pathway, FAK |
| II | neuron |
| III | transcription, nucleoplasm, chromosome, nuclear content, nucleolus, RNA |
| IV | metabolism, mRNA, ribosome, replication, chemical reaction, cRNA, vRNA, viral protein, NUMB, nucleus |
| V | cell death, apoptotic process, activation, endogenous cellular process, programmed cell death, apoptosis |
| VI | mitochondrion, organelle, mitochondrial envelope, organelle envelope, lipid bilayer, inner elumen facing lipid bilayer, semiautonomous self replicating organelle, tissue respiration, virtually eukaryotic cell, cytoplasm |
| VII | degradation, APC/C, apoptosis, CDC20, CDH1, mitotic protein, MHC, multiubiquitination, ubiquitin 26s proteasome system, exogenous antigen |
| VIII | respiratory electron transport, ATP synthesis, inner mitochondrial membrane, chemiosmotic gradient, brown fat, rotenone, FAD, mitochondrial matrix, body temperature, NAD |
Abbreviation. NGF: Nerve growth factor, BDNF: Brain-derived neurotrophic factor, MAPK: Mitogen-activated protein kinase, FAK: Focal adhesion kinase, RNA: Ribonucleic acid, APC/C: Anaphase-promoting complex, MHC: Major histocompatibility complex, ATP: Adenosine triphosphate, FAD: Flavin adenine dinucleotide, NAD: Nicotinamide adenine dinucleotide, NUMB, CDC20, CDH1: Gene names.