| Literature DB >> 31031796 |
Wenyi Qin1,2,3, Xujun Wang4, Hongyu Zhao4,5, Hui Lu1,2,4,5.
Abstract
Motivation: Gene set enrichment analysis is a widely accepted expression analysis tool which aims at detecting coordinated expression change within a pre-defined gene sets rather than individual genes. The benefit of gene set analysis over individual differentially expressed (DE) gene analysis includes more reproducible and interpretable results and detecting small but consistent change among gene set which could not be detected by DE gene analysis. There have been many successful gene set analysis applications in human diseases. However, when the sample size of a disease study is small and no other public data sets of the same disease are available, it will lead to lack of power to detect pathways of importance to the disease.Entities:
Keywords: EM algorithm; cross disease transcriptome; gene expression; gene set enrichment analysis; mixture model; public data integration
Year: 2019 PMID: 31031796 PMCID: PMC6473067 DOI: 10.3389/fgene.2019.00293
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Simulation parameter setup under different scenarios.
| (0,0) | 45 | 0 | 0 | 5 |
| (EP,0) | 40 | 0 | 5 | 5 |
| (0,EP) | 40 | 5 | 0 | 5 |
| (EP,EP) | 40 | 0 | 0 | 10 |
| (0,0) | 45 | 0 | 0 | 5 |
| (EP,0) | 40 | 5 | 10 | 0 |
| (0,EP) | 40 | 10 | 5 | 0 |
| (EP,EP) | 40 | 0 | 0 | 10 |
| (0,0) | 45 | 0 | 0 | 5 |
| (EP,0) | 35 | 0 | 5 | 10 |
| (0,EP) | 35 | 5 | 0 | 10 |
| (EP,EP) | 35 | 0 | 0 | 15 |
| (0,0) | 45 | 0 | 0 | 5 |
| (EP,0) | 30 | 5 | 15 | 0 |
| (0,EP) | 30 | 15 | 5 | 0 |
| (EP,EP) | 20 | 15 | 15 | 0 |
EP: Enriched Pathway
Figure 1Overview of the proposed joint gene set enrichment frameworks.
Figure 2AUC comparison among different methods under different parameter setup.
Figure 3Venn diagram of identified enriched pathways by JointNormKS and single data set analysis in lung and colorectal adenocarcinoma data sets. FDR cutoff is set to 0.1.
Pathways exclusively identified by JointNormKS in lung adenocarcinoma data set.
| KEGG_BASE_EXCISION_REPAIR | 0.1011 | 0.0797 |
| KEGG_BLADDER_CANCER | 0.1144 | 0.0988 |
| BIOCARTA_MCM_PATHWAY | 0.1144 | 0.0999 |
| BIOCARTA_COMP_PATHWAY | 0.1004 | 0.0797 |
| BIOCARTA_CELLCYCLE_PATHWAY | 0.1011 | 0.0912 |
| PID_MYC_ACTIV_PATHWAY | 0.1093 | 0.0961 |
| PID_AURORA_A_PATHWAY | 0.1035 | 0.0961 |
| REACTOME_MUSCLE_CONTRACTION | 0.1144 | 0.0978 |
| REACTOME_SYNTHESIS_OF_DNA | 0.1011 | 0.0867 |
| REACTOME_METABOLISM_OF_CARBOHYDRATES | 0.1144 | 0.0961 |
| REACTOME_COMPLEMENT_CASCADE | 0.1011 | 0.0797 |
| NABA_ECM_AFFILIATED | 0.1144 | 0.0961 |
Figure 4Venn diagram of identified enriched pathways by JointNormKS and single data set analysis in AD and HD data sets. FDR cutoff is set to 0.1.
Pathways exclusively identified by JointNormKS in AD data set.
| KEGG_APOPTOSIS | 0.0117 | 0.0087 |
| KEGG_ADIPOCYTOKINE_SIGNALING_PATHWAY | 0.0125 | 0.0090 |
| BIOCARTA_CERAMIDE_PATHWAY | 0.0125 | 0.0096 |
| BIOCARTA_PDGF_PATHWAY | 0.0116 | 0.0081 |
| ST_JNK_MAPK_PATHWAY | 0.0128 | 0.0092 |
| REACTOME_DEVELOPMENTAL_BIOLOGY | 0.0268 | 0.0081 |
| REACTOME_NEURONAL_SYSTEM | 0.0128 | 0.0091 |
| REACTOME_MRNA_PROCESSING | 0.0106 | 0.0087 |
| REACTOME_AXON_GUIDANCE | 0.0241 | 0.0091 |
| REACTOME_REGULATION_OF_MITOTIC_CELL_CYCLE | 0.0116 | 0.0087 |
| REACTOME_METABOLISM_OF_LIPIDS_AND_LIPOPROTEINS | 0.0445 | 0.0100 |
| REACTOME_APC_C_CDC20_MEDIATED_DEGRADATION_OF_MITOTIC_PROTEINS | 0.0129 | 0.0081 |
| REACTOME_ACTIVATED_TLR4_SIGNALLING | 0.0129 | 0.0055 |