| Literature DB >> 29659714 |
Abstract
Motivation: Gene set testing, or pathway analysis, has become a critical tool for the analysis of high-dimensional genomic data. Although the function and activity of many genes and higher-level processes is tissue-specific, gene set testing is typically performed in a tissue agnostic fashion, which impacts statistical power and the interpretation and replication of results.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29659714 PMCID: PMC6129311 DOI: 10.1093/bioinformatics/bty217
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Conceptual representation of the proposed approach for computing and using tissue-specific gene set weights. The target gene set collection, e.g. one of the collections from the MSigDB, is represented as a matrix of indicator variables with rows representing gene sets, columns representing genes and elements set to 1 if an annotation exists between the corresponding gene and gene set. Using information from the HPA regarding gene activity in different human tissues, tissue-specific weights are computed according to the process detailed in Section 2.2 for all of the gene sets in the collection. Potential applications of these weights include the functional characterization of human tissues, tissue-specific gene set testing and multi-tissue analyses
Analyzed MSigDB gene set collections
| ID | Collection name | # Sets |
|---|---|---|
| H | Hallmark gene sets | 50 |
| C1 | Positional gene sets | 326 |
| C2.CGP | Chemical and genetic perturbations | 3402 |
| C3.MIR | microRNA targets | 212 |
| C3.TFT | Transcription factor targets | 615 |
| C4.CGN | Cancer gene neighborhoods | 427 |
| C4.CM | Cancer modules | 431 |
| C5.CC | GO cellular component | 580 |
| C5.MF | GO molecular function | 901 |
| C6 | Oncogenic signatures | 189 |
| C7 | Immunologic signatures | 4872 |
Note: The 13 MSigDB version 6.0 collections for which tissue-specific gene set weights were computed. The collections marked in bold (C2.CP and C5.BP) were used to generate the analysis results in Section 3.
Analyzed HPA tissue types
| Adipose tissue | Gallbladder | Seminal vesicle |
| Adrenal gland | ||
| Appendix | Kidney | Skin |
| Bone marrow | ||
| Breast | Smooth muscle | |
| Lymph node | Spleen | |
| Cervix, uterine | Ovary | Stomach |
| Testis | ||
| Duodenum | Parathyroid gland | Thyroid gland |
| Endometrium | Placenta | Tonsil |
| Epididymis | Prostate | Urinary bladder |
| Rectum | ||
| Fallopian tube | Salivary gland |
Note: The 37 HPA tissue types for which tissue-specific gene sets weights were computed. The tissue types in bold were used to generate the analysis results in Section 3.
Multi-tissue analysis for T2D
| C2.CP | C5.BP | ||
|---|---|---|---|
| Gene set | Minimum | Gene set | Minimum |
| weight | weight | ||
| REACTOME_METABOLISM_OF_CARBOHYDRATES | 2 | GO_GLUCOSE_METABOLIC_PROCESS | 7.7 |
| REACTOME_AMINO_ACID_SYNTHESIS_AND_ INTERCONVERSION_… | 1.1 | GO_HEXOSE_METABOLIC_PROCESS | 6.5 |
| KEGG_TYPE_II_DIABETES_MELLITUS | 1.1 | GO_NEGATIVE_REGULATION_OF_ CARBOHYDRATE_METABOLIC_P… | 5.4 |
| KEGG_INSULIN_SIGNALING_PATHWAY | 1 | GO_MONOSACCHARIDE_ BIOSYNTHETIC_PROCESS | 5.4 |
| KEGG_ALANINE_ASPARTATE_AND_GLUTAMATE_ METABOLISM | 0.92 | GO_CARBOHYDRATE_METABOLIC_ PROCESS | 4.7 |
| REACTOME_MITOCHONDRIAL_FATTY_ACID_ BETA_OXIDATION | 0.69 | GO_MONOSACCHARIDE_METABOLIC_ PROCESS | 4.7 |
| REACTOME_BRANCHED_CHAIN_AMINO_ ACID_CATABOLISM | 0.65 | GO_SMALL_MOLECULE_METABOLIC_ PROCESS | 4.2 |
| KEGG_PROXIMAL_TUBULE_BICARBONATE_ RECLAMATION | 0.64 | GO_REGULATION_OF_CARBOHYDRATE_ METABOLIC_PROCESS | 3.7 |
| BIOCARTA_SARS_PATHWAY | 0.64 | GO_REGULATION_OF_GLUCOSE_ METABOLIC_PROCESS | 3.3 |
| KEGG_VALINE_LEUCINE_AND_ISOLEUCINE_ DEGRADATION | 0.62 | GO_SMALL_MOLECULE_BIOSYNTHETIC_ PROCESS | 2.9 |
Note: The 10 MSigDB gene sets from the curated canonical (C2.CP) and GO biological process (C5.BP) collections that have the largest minimum tissue-specific weight across four tissues significantly impacted by T2D (adipose tissue, liver, pancreas and skeletal muscle).
Tissue-specific gene set testing results
| Age | BMI | Cerebrovascular | COPD | Depression | Gender | Heart | Hyper | Liver | T2D | |
|---|---|---|---|---|---|---|---|---|---|---|
| Disease | Disease | -Tension | Disease | |||||||
| Adipose tissue (subcutaneous) | 4/7 | 3/6 | 0/1 | 0/1 | 2/2 | |||||
| Cerebral cortex | 0/3 | 0/2 | ||||||||
| Colon (transverse) | ||||||||||
| Esophagus mucosa | 1/1 | |||||||||
| Heart (left ventricle) | 24/46 | |||||||||
| Liver | ||||||||||
| Lung | 0/1 | |||||||||
| Pancreas | 0/1 | |||||||||
| Skeletal muscle | ||||||||||
| Small intestine (terminal ileum) |
Note: Number of discoveries at an FDR q-value ≤ 0.2 (weighted discoveries/unweighted discoveries) from a gene set testing analysis of GTEx gene expression data from 10 different tissues relative to 10 different phenotypes using the MSigDB v6.0 C2.CP collection. Tissue and phenotype combinations with no discoveries are blank. If the weighted analysis yielded more discoveries than the unweighted analysis, the cell text is bold.
Significant pathways in GTEx liver relative to T2D
| Gene set | Weight | P-value | FDR | wFDR | Support for TD2 association |
|---|---|---|---|---|---|
| KEGG_SNARE_INTERACTIONS_IN_VESICULAR_TRANSPORT | 3.7 | 0.00047 | 0.62 | 0.08 | ( |
| REACTOME_PROTEOLYTIC_CLEAVAGE_OF_SNARE_COMPLEX_PROTEINS | 23 | 0.0052 | 0.69 | 0.08 | ( |
| REACTOME_FGFR1_LIGAND_BINDING_AND_ACTIVATION | 11 | 0.0025 | 0.62 | 0.08 | ( |
| REACTOME_AQUAPORIN_MEDIATED_TRANSPORT | 26 | 0.0063 | 0.76 | 0.08 | ( |
| REACTOME_SIGNALING_BY_ACTIVATED_POINT_MUTANTS_OF_FGFR1 | 10 | 0.0033 | 0.63 | 0.085 | ( |
| REACTOME_G_BETA_GAMMA_SIGNALLING_THROUGH_PI3KGAMMA | 10 | 0.004 | 0.67 | 0.089 | ( |
| PID_HDAC_CLASSII_PATHWAY | 66 | 0.048 | 1 | 0.14 | ( |
| REACTOME_PI3K_CASCADE | 12 | 0.01 | 0.87 | 0.14 | ( |
| REACTOME_MEIOTIC_SYNAPSIS | 59 | 0.057 | 1 | 0.14 | ( |
| REACTOME_FACILITATIVE_NA_INDEPENDENT_GLUCOSE_TRANSPORTERS | 47 | 0.056 | 1 | 0.16 | ( |
Note: Top 10 MSigDB canonical pathways whose gene expression values in GTEx liver samples are most significantly associated with T2D status.
MSigDB genes sets specific to adipose tissue, heart muscle and liver
| C2.CP | C5.BP | |||
|---|---|---|---|---|
| Tissue | Gene set | Weight | Gene set | Weight |
| Adipose tissue | REACTOME_HORMONE_SENSITIVE_ LIPASE_HSL_MEDIATED_TRI… | 133 | GO_REGULATION_OF_ LIPID_STORAGE | 135 |
| REACTOME_TRANSCRIPTIONAL_ REGULATION_OF_WHITE_ADIPO… | 102 | GO_REGULATION_OF_ SEQUESTERING_OF_TRIGLYCERIDE | 125 | |
| KEGG_PPAR_SIGNALING_PATHWAY | 93 | GO_LIPID_STORAGE | 102 | |
| REACTOME_LIPID_DIGESTION_ MOBILIZATION_AND_TRANSPOR… | 59 | GO_LOW_DENSITY_ LIPOPROTEIN_PARTICLE_CLEARANCE | 85 | |
| REACTOME_TRIGLYCERIDE_ BIOSYNTHESIS | 48 | GO_BROWN_FAT_CELL_ DIFFERENTIATION | 83 | |
| KEGG_ADIPOCYTOKINE_SIGNALING_ PATHWAY | 42 | GO_POSITIVE_REGULATION_ OF_LIPID_STORAGE | 81 | |
| REACTOME_FATTY_ACID_ TRIACYLGLYCEROL_AND_KETONE_BOD… | 32 | GO_REGULATION_OF_LIPID_ CATABOLIC_PROCESS | 79 | |
| REACTOME_METABOLISM_OF_LIPIDS_ AND_LIPOPROTEINS | 29 | GO_TRIGLYCERIDE_CATABOLIC_ PROCESS | 75 | |
| NABA_ECM_GLYCOPROTEINS | 23 | GO_REGULATION_OF_LIPID_METABOLIC_PROCESS | 72 | |
| BIOCARTA_LEPTIN_PATHWAY | 23 | GO_NEGATIVE_REGULATION_ OF_LIPID_STORAGE | 68 | |
| Heart muscle | REACTOME_STRIATED_MUSCLE_CONTRACTION | 304 | GO_HEART_PROCESS | 565 |
| KEGG_DILATED_CARDIOMYOPATHY | 203 | GO_STRIATED_MUSCLE_CONTRACTION | 557 | |
| KEGG_HYPERTROPHIC_CARDIOMYOPATHY_HCM | 196 | GO_CARDIAC_MUSCLE_TISSUE_ MORPHOGENESIS | 515 | |
| KEGG_CARDIAC_MUSCLE_CONTRACTION | 179 | GO_CARDIAC_MUSCLE_TISSUE_ DEVELOPMENT | 511 | |
| REACTOME_MUSCLE_CONTRACTION | 169 | GO_MUSCLE_CONTRACTION | 464 | |
| BIOCARTA_ALK_PATHWAY | 110 | GO_MYOFIBRIL_ASSEMBLY | 460 | |
| REACTOME_TCA_CYCLE_AND_RESPIRATORY_ ELECTRON_TRANSP… | 39 | GO_MUSCLE_SYSTEM_PROCESS | 459 | |
| REACTOME_RESPIRATORY_ELECTRON_ TRANSPORT_ATP_SYNTHE… | 36 | GO_CARDIAC_CELL_DEVELOPMENT | 434 | |
| KEGG_PARKINSONS_DISEASE | 36 | GO_ACTIN_MEDIATED_CELL_CONTRACTION | 410 | |
| REACTOME_RESPIRATORY_ELECTRON_ TRANSPORT | 32 | GO_MUSCLE_ORGAN_MORPHOGENESIS | 402 | |
| Liver | KEGG_RETINOL_METABOLISM | 280 | GO_EPOXYGENASE_P450_PATHWAY | 247 |
| KEGG_DRUG_METABOLISM_ CYTOCHROME_P450 | 252 | GO_DRUG_METABOLIC_PROCESS | 244 | |
| REACTOME_BIOLOGICAL_OXIDATIONS | 215 | GO_MONOCARBOXYLIC_ACID_ METABOLIC_PROCESS | 234 | |
| KEGG_COMPLEMENT_AND_COAGULATION_ CASCADES | 212 | GO_ORGANIC_ACID_ METABOLIC_PROCESS | 234 | |
| KEGG_METABOLISM_OF_XENOBIOTICS_BY_ CYTOCHROME_P450 | 201 | GO_ACUTE_PHASE_RESPONSE | 228 | |
| REACTOME_BILE_ACID_AND_BILE_SALT_ METABOLISM | 191 | GO_STEROID_METABOLIC_PROCESS | 224 | |
| REACTOME_PHASE1_FUNCTIONALIZATION_ OF_COMPOUNDS | 187 | GO_SMALL_MOLECULE_ METABOLIC_PROCESS | 192 | |
| REACTOME_XENOBIOTICS | 185 | GO_EXOGENOUS_DRUG_ CATABOLIC_PROCESS | 183 | |
| REACTOME_RECYCLING_OF_BILE_ACIDS_ AND_SALTS | 155 | GO_PROTEIN_ACTIVATION_CASCADE | 177 | |
| REACTOME_CYTOCHROME_P450_ARRANGED_ BY_SUBSTRATE_TYP… | 152 | GO_BILE_ACID_METABOLIC_PROCESS | 177 | |
Note: 10 MSigDB gene sets from the canonical pathways (C2.CP) and GO biological process (C5.BP) collections with the largest tissue-specific weights for adipose tissue, heart muscle and liver.