| Literature DB >> 30984382 |
Michael Prummer1,2.
Abstract
Differential gene expression (DGE) studies often suffer from poor interpretability of their primary results, i.e., thousands of differentially expressed genes. This has led to the introduction of gene set analysis (GSA) methods that aim at identifying interpretable global effects by grouping genes into sets of common context, such as, molecular pathways, biological function or tissue localization. In practice, GSA often results in hundreds of differentially regulated gene sets. Similar to the genes they contain, gene sets are often regulated in a correlative fashion because they share many of their genes or they describe related processes. Using these kind of neighborhood information to construct networks of gene sets allows to identify highly connected sub-networks as well as poorly connected islands or singletons. We show here how topological information and other network features can be used to filter and prioritize gene sets in routine DGE studies. Community detection in combination with automatic labeling and the network representation of gene set clusters further constitute an appealing and intuitive visualization of GSA results. The RICHNET workflow described here does not require human intervention and can thus be conveniently incorporated in automated analysis pipelines.Entities:
Keywords: GSEA; differential gene expression analysis; enrichment analysis; gene set analysis; network analyis
Year: 2019 PMID: 30984382 PMCID: PMC6446501 DOI: 10.12688/f1000research.17824.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Graphical representation of the initial gene set network.
Node colors indicate whether the member genes of a set are predominantly up or down regulated or whether there is no preferential direction (mixed).
Figure 4. Disjoint clusters after community detection and splitting.
List of all singletons, i.e., genesets without sufficient overlap with any other geneset.
| name | NGenes | Direction | FDR |
|---|---|---|---|
| KEGG GLYCOLYSIS GLUCONEOGENESIS | 58 | Mixed | 0.04900 |
| KEGG GLYCINE SERINE AND THREONINE METABOLISM | 29 | Mixed | 0.00020 |
| KEGG ARGININE AND PROLINE METABOLISM | 52 | Up | 0.04100 |
| KEGG GLUTATHIONE METABOLISM | 43 | Mixed | 0.00210 |
| KEGG O GLYCAN BIOSYNTHESIS | 26 | Mixed | 0.01300 |
| KEGG ARACHIDONIC ACID METABOLISM | 48 | Mixed | 0.04300 |
| KEGG NICOTINATE AND NICOTINAMIDE METABOLISM | 23 | Mixed | 0.00290 |
| KEGG CHEMOKINE SIGNALING PATHWAY | 160 | Mixed | 0.02700 |
| KEGG P53 SIGNALING PATHWAY | 67 | Mixed | 0.01400 |
| KEGG APOPTOSIS | 78 | Mixed | 0.02800 |
| KEGG TGF BETA SIGNALING PATHWAY | 82 | Mixed | 0.00140 |
| KEGG ADHERENS JUNCTION | 72 | Up | 0.02900 |
| KEGG LEUKOCYTE TRANSENDOTHELIAL MIGRATION | 99 | Mixed | 0.00360 |
| KEGG PROGESTERONE MEDIATED OOCYTE MATURATION | 80 | Mixed | 0.04900 |
| KEGG ADIPOCYTOKINE SIGNALING PATHWAY | 62 | Up | 0.00180 |
| KEGG PATHOGENIC ESCHERICHIA COLI INFECTION | 53 | Mixed | 0.04000 |
| BIOCARTA AGR PATHWAY | 33 | Mixed | 0.01300 |
| BIOCARTA ATM PATHWAY | 20 | Mixed | 0.00790 |
| BIOCARTA BCELLSURVIVAL PATHWAY | 15 | Up | 0.03400 |
| BIOCARTA LAIR PATHWAY | 14 | Mixed | 0.00910 |
| BIOCARTA EPONFKB PATHWAY | 11 | Mixed | 0.01300 |
| BIOCARTA GABA PATHWAY | 6 | Mixed | 0.04800 |
| BIOCARTA P53HYPOXIA PATHWAY | 22 | Mixed | 0.02600 |
| BIOCARTA EGFR SMRTE PATHWAY | 11 | Mixed | 0.01600 |
| BIOCARTA PPARA PATHWAY | 52 | Mixed | 0.00091 |
| BIOCARTA RAC1 PATHWAY | 20 | Mixed | 0.00440 |
| BIOCARTA NKCELLS PATHWAY | 14 | Mixed | 0.02800 |
| REACTOME METABOLISM OF VITAMINS AND COFACTORS | 50 | Mixed | 0.03100 |
| REACTOME IL 7 SIGNALING | 10 | Mixed | 0.00062 |
| REACTOME SULFUR AMINO ACID METABOLISM | 24 | Up | 0.00530 |
| REACTOME SPHINGOLIPID DE NOVO BIOSYNTHESIS | 30 | Mixed | 0.00970 |
| REACTOME SIGNALING BY HIPPO | 20 | Mixed | 0.00110 |
| REACTOME GASTRIN CREB SIGNALLING PATHWAY VIA PKC AND MAPK | 171 | Mixed | 0.05000 |
| REACTOME PLATELET ADHESION TO EXPOSED COLLAGEN | 10 | Mixed | 0.00910 |
| REACTOME VEGF LIGAND RECEPTOR INTERACTIONS | 10 | Mixed | 0.05000 |
| REACTOME METABOLISM OF AMINO ACIDS AND DERIVATIVES | 182 | Mixed | 0.03100 |
| REACTOME TRANSMISSION ACROSS CHEMICAL SYNAPSES | 161 | Mixed | 0.01900 |
| REACTOME INTEGRATION OF ENERGY METABOLISM | 104 | Mixed | 0.02900 |
| REACTOME CYTOSOLIC TRNA AMINOACYLATION | 24 | Down | 0.03000 |
| REACTOME OLFACTORY SIGNALING PATHWAY | 65 | Mixed | 0.00220 |
| REACTOME SEMA3A PLEXIN REPULSION SIGNALING BY INHIBITING INTEGRIN
| 13 | Mixed | 0.04000 |
| REACTOME NA CL DEPENDENT NEUROTRANSMITTER TRANSPORTERS | 12 | Down | 0.03700 |
| REACTOME SYNTHESIS AND INTERCONVERSION OF NUCLEOTIDE DI AND
| 18 | Up | 0.04700 |
| REACTOME ROLE OF DCC IN REGULATING APOPTOSIS | 10 | Mixed | 0.00420 |
| REACTOME NETRIN1 SIGNALING | 35 | Mixed | 0.00940 |
| REACTOME NEPHRIN INTERACTIONS | 17 | Up | 0.03000 |
| REACTOME RAP1 SIGNALLING | 15 | Mixed | 0.00900 |
| REACTOME ETHANOL OXIDATION | 10 | Up | 0.00012 |
| REACTOME HORMONE SENSITIVE LIPASE HSL MEDIATED TRIACYLGLYCEROL
| 11 | Mixed | 0.01000 |
Figure 2. Gene set network with singletons removed.
The color scheme is the same as above. The node size corresponds to the number of genes in a set.
List of binary clusters as indicated by the id column.
| id | name | NGenes | Direction | FDR |
|---|---|---|---|---|
| 3 | KEGG ALANINE ASPARTATE AND GLUTAMATE METABOLISM | 28 | Mixed | 4.9e-03 |
| 3 | REACTOME AMINO ACID SYNTHESIS AND INTERCONVERSION
| 16 | Mixed | 3.6e-04 |
| 6 | KEGG INOSITOL PHOSPHATE METABOLISM | 49 | Mixed | 2.0e-04 |
| 6 | KEGG PHOSPHATIDYLINOSITOL SIGNALING SYSTEM | 69 | Mixed | 1.5e-04 |
| 17 | BIOCARTA ARF PATHWAY | 16 | Mixed | 1.2e-02 |
| 17 | BIOCARTA CTCF PATHWAY | 22 | Mixed | 3.8e-04 |
| 18 | REACTOME PLATELET ACTIVATION SIGNALING AND AGGREGATION | 178 | Mixed | 2.9e-03 |
| 18 | REACTOME RESPONSE TO ELEVATED PLATELET CYTOSOLIC CA2 | 72 | Mixed | 3.9e-02 |
| 19 | REACTOME NEUROTRANSMITTER RELEASE CYCLE | 28 | Up | 1.4e-02 |
| 19 | REACTOME NOREPINEPHRINE NEUROTRANSMITTER RELEASE CYCLE | 10 | Up | 4.5e-02 |
| 20 | REACTOME AMINO ACID AND OLIGOPEPTIDE SLC TRANSPORTERS | 40 | Mixed | 2.0e-02 |
| 20 | REACTOME AMINO ACID TRANSPORT ACROSS THE PLASMA MEMBRANE | 29 | Mixed | 8.7e-03 |
| 22 | REACTOME MUSCLE CONTRACTION | 42 | Up | 3.4e-05 |
| 22 | REACTOME SMOOTH MUSCLE CONTRACTION | 23 | Up | 0.0e+00 |
| 23 | REACTOME ACTIVATION OF GENES BY ATF4 | 24 | Mixed | 7.8e-03 |
| 23 | REACTOME PERK REGULATED GENE EXPRESSION | 27 | Mixed | 2.0e-02 |
Figure 3. Gene set network with singletons and binary clusters removed.
Colored according to disjoint subnetworks.
Figure 5. Geneset cluster with machine-generated titles.
Only the first 16 connected subnets are shown. Geneset labels are omitted for clusters with more than 5 members.