| Literature DB >> 35625151 |
Amin Mortazavi1, Amir Rashidi1, Mostafa Ghaderi-Zefrehei2, Parham Moradi3, Mohammad Razmkabir1, Ikhide G Imumorin4, Sunday O Peters5, Jacqueline Smith6.
Abstract
Bayesian gene networks are powerful for modelling causal relationships and incorporating prior knowledge for making inferences about relationships. We used three algorithms to construct Bayesian gene networks around genes expressed in the bovine uterus and compared the efficacies of the algorithms. Dataset GSE33030 from the Gene Expression Omnibus (GEO) repository was analyzed using different algorithms for hub gene expression due to the effect of progesterone on bovine endometrial tissue following conception. Six different algorithms (grow-shrink, max-min parent children, tabu search, hill-climbing, max-min hill-climbing and restricted maximum) were compared in three higher categories, including constraint-based, score-based and hybrid algorithms. Gene network parameters were estimated using the bnlearn bundle, which is a Bayesian network structure learning toolbox implemented in R. The results obtained indicated the tabu search algorithm identified the highest degree between genes (390), Markov blankets (25.64), neighborhood sizes (8.76) and branching factors (4.38). The results showed that the highest number of shared hub genes (e.g., proline dehydrogenase 1 (PRODH), Sam-pointed domain containing Ets transcription factor (SPDEF), monocyte-to-macrophage differentiation associated 2 (MMD2), semaphorin 3E (SEMA3E), solute carrier family 27 member 6 (SLC27A6) and actin gamma 2 (ACTG2)) was seen between the hybrid and the constraint-based algorithms, and these genes could be recommended as central to the GSE33030 data series. Functional annotation of the hub genes in uterine tissue during progesterone treatment in the pregnancy period showed that the predicted hub genes were involved in extracellular pathways, lipid and protein metabolism, protein structure and post-translational processes. The identified hub genes obtained by the score-based algorithms had a role in 2-arachidonoylglycerol and enzyme modulation. In conclusion, different algorithms and subsequent topological parameters were used to identify hub genes to better illuminate pathways acting in response to progesterone treatment in the bovine uterus, which should help with our understanding of gene regulatory networks in complex trait expression.Entities:
Keywords: Bayesian network; DNA microarray; algorithms; bovine; endometrium; implantation
Year: 2022 PMID: 35625151 PMCID: PMC9138007 DOI: 10.3390/ani12101305
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 3.231
Estimation of structural Bayesian network parameters using different algorithms.
| Parameters | CBA | SBA | HA | |||
|---|---|---|---|---|---|---|
| GS | MMPC | HC | TS | MMHC | RSMAX | |
| No. of nodes | 89 | 89 | 89 | 89 | 89 | 89 |
| No. of arcs (edges) | 89 | 77 | 388 | 390 | 76 | 70 |
| Undirected edges | 13 | 0 | 0 | 0 | 0 | 0 |
| Directed arcs | 76 | 77 | 388 | 390 | 76 | 70 |
| MB | 2.63 | 1.73 | 25.3 | 25.6 | 1.8 | 1.6 |
| NS | 2 | 1.73 | 8.7 | 8.7 | 1.7 | 1.5 |
| BF | 0.85 | 0 | 4.3 | 4.3 | 0.85 | 0.79 |
| No. of Tests | 43,903 | 55,335 | 43,338 | 43,338 | 55,601 | 23,662 |
Abbreviations: CBA: constraint-based algorithm; SBA: score-based algorithm; HA: hybrid algorithm; GS: grow-shrink; MMPC: max-min parent children; HC: hill-climbing; TS: tabu search; MMHC: max-min hill-climbing; RSMAX: restricted maximize; MB: Markov blanket; NS: neighborhood size; BF: branching factor.
Topological parameters of Bayesian network by different implemented algorithms.
| Parameters | CBA | SBA | HA | |||
|---|---|---|---|---|---|---|
| GS | MMPC | HC | TS | MMHC | RSMAX | |
| Betweenness | 0.008507684 | 0.04031794 | 0.009298721 | 0.009213601 | 0.001715626 | 0.0003272751 |
| Eccentricity | 10.95506 | 10.48315 | 3.505618 | 3.505618 | 10.48315 | 4.033708 |
| Degree | 1.93258427 | 2.674157 | 7.228571 | 7.011236 | 1.348315 | 1.191011 |
| Closeness | 0.04408607 | 0.02552964 | 0.4348416 | 0.4358546 | 0.0255294 | 0.01263703 |
| Centrality | 3.404494 | 6.438202 | 4470240 | 5490951 | 6.438202 | 3.404494 |
| Strength | 2.292135 | 3.460674 | 8.719101 | 8.764045 | 1.707865 | 1.573034 |
| CC | 0.005 | 0.019 | 0.047 | 0.048 | 0 | 0 |
| SP | 685 (8%) | 1777 (22%) | 4736 (60%) | 4736 (60%) | 256 (3%) | 107 (1%) |
| CPL | 4.749 | 6.023 | 3.322 | 3.319 | 3.636 | 1.748 |
| NC | 2.884684685 | 3.529701 | 9.47985 | 9.550446 | 2.318357 | 2.191799 |
Abbreviations: CBA: constraint-based algorithm; SBA: score-based algorithm; HA: hybrid algorithm; GS: grow-shrink; MMPC: max-min parent children; HC: hill-climbing; TS: tabu search; MMHC: max-min hill-climbing; RSMAX: restricted maximize; CC: cluster coefficient; SP: shortest path; CPL: characteristic path length; NC: neighborhood connectivity.
Hub genes identified by the various Bayesian network algorithms used in this study.
| CBA | SBA | HA | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| GS | D | MMPC | D | HC | D | TS | D | MMHC | D | RSMAX | D |
| LPL | 6 | SEMA3E | 6 | CSTB | 19 | CSTB | 19 | BOLA | 5 | ISG15 | 4 |
| PRODH | 5 | ACTG2 | 6 | DGKI | 16 | DGKI | 16 | SPDEF | 4 | PRODH | 4 |
|
| 5 | BOLA | 6 | SEMA3E | 16 | SEMA3E | 16 |
| 4 | MMD2 | 4 |
| BOLA | 4 | SLC13A5 | 6 | DAPL1 | 15 | DAPL1 | 15 | SEMA3E | 4 |
| 3 |
|
| 4 | SPDEF | 6 |
| 13 | LPL | 13 | ACTG2 | 4 | ACTG2 | 3 |
|
| 4 | MMD2 | 6 | LPL | 13 | DKK1 | 13 | ISG15 | 4 |
| 3 |
| SPDEF | 4 |
| 5 |
| 13 | LOC783399 | 13 | SLC13A5 | 3 |
| 3 |
|
| 4 |
| 5 | LOC783399 | 13 |
| 13 |
| 3 | BOLA | 3 |
Abbreviations: CBA: constraint-based algorithm; SBA: score-based algorithm; HA: hybrid algorithm; GS: grow-shrink; D: degree criterion; MMPC: max-min parent children; HC: hill-climbing; TS: tabu search; MMHC: max-min hill-climbing; RSMAX: restricted maximum. The genes in bolded format represent unique genes among all the algorithmic categories.
Figure 1Venn diagram of commonly identified hub genes from all three algorithmic groups. The parentheses represent the subclasses of the overlapped algorithms. GS—grow-shrink; MMPC—max-min parent children; HC—hill-climbing; TS—tabu search; MMHC—max-min hill-climbing; RSMAX—restricted maximum.
Functional annotation of identified hub genes.
| Term | Count | FE | Bonferroni | Benjamini | FDR | |
|---|---|---|---|---|---|---|
| CBA | ||||||
| Extracellular space | 4 | 0.04 | 3.2 | 0.8 | 0.8 | 48 |
| Alternative splicing | 8 | 0.009 | 1.9 | 0.3 | 0.3 | 8 |
|
| ||||||
| Extracellular space | 4 | 0.01 | 7.7 | 0.22 | 0.2 | 5 |
| Secreted | 4 | 0.01 | 5.9 | 0.50 | 0.5 | 12 |
| Glycerolipid metabolism | 2 | 0.03 | 47.6 | 0.28 | 0.2 | 18 |
| Protease binding | 2 | 0.04 | 47.7 | 0.68 | 0.6 | 26 |
| Disulfide bond | 4 | 0.04 | 3.9 | 0.77 | 0.7 | 32 |
|
| ||||||
| Isopeptide bond | 3 | 0.05 | 6.8 | 0.93 | 0.9 | 40 |
| Alternative splicing | 7 | 0.02 | 1.7 | 0.97 | 0.8 | 51 |
| Arginine and proline metabolism | 2 | 0.0 | 1.7 | 0.1 | 0.1 | 9 |
Abbreviations: FE—fold enrichment; FDR—false discovery rate; CBA—constraint-based algorithm; SBA—score-based algorithm; HA—hybrid algorithm.
Figure 2Biological processes of identified hub genes from CBAs (a), SBAs (b) and HAs (c). CBAs—constraint-based algorithms; SBAs—score-based algorithms; HAs—hybrid algorithms.
Figure 3Molecular functions of identified hub genes from CBAs (a), SBAs (b) and HAs (c). CBAs— constraint-based algorithms; SBAs—score-based algorithms; HAs—hybrid algorithms.
Figure 4Cellular components of identified hub genes from CBAs (a), SBAs (b) and HAs (c). CBAs—constraint-based algorithms; SBAs—score-based algorithms; HAs—hybrid algorithms.
Figure 5Pathways associated with hub genes from (a) CBA, (b) SBA and (c) HA algorithms. CBAs—constraint-based algorithms; SBAs—score-based algorithms; HAs—hybrid algorithms.
Over-represented Reactome pathways.
| Gene Set | Description | Size | Expect | Ratio | FDR | |
|---|---|---|---|---|---|---|
| WP216 | Striated muscle contraction | 45 | 1.33 | 12.782 | 3.22 × 10−15 | 1.63 × 10−12 |
| WP447 | Adipogenesis genes | 134 | 3.9604 | 4.2924 | 3.84 × 10−7 | 9.7015 × 10−5 |
| mmu04972 | Pancreatic secretion | 103 | 3.0442 | 3.6134 | 0.00022 | 0.037012 |
| WP2872 | White fat cell differentiation | 32 | 0.94578 | 6.344 | 0.000299 | 0.037763 |
| WP4344 | Sphingolipid metabolism (general overview) | 25 | 0.73889 | 6.7669 | 0.000711 | 0.059801 |
| WP512 | Id signaling pathway | 51 | 1.5073 | 4.644 | 0.000693 | 0.059801 |
| WP4690 | Sphingolipid metabolism (integrated pathway) | 26 | 0.76845 | 6.5066 | 0.000859 | 0.061947 |
| mmu00600 | Sphingolipid metabolism | 48 | 1.4187 | 4.2293 | 0.002729 | 0.17224 |
| WP2084 | SREBF and miR33 in cholesterol and lipid homeostasis | 11 | 0.32511 | 9.2276 | 0.003533 | 0.19822 |
| WP1596 | Iron homeostasis | 15 | 0.44333 | 6.7669 | 0.008926 | 0.38976 |
Figure 6Biological pathways identified from Ingenuity Pathway Analysis (IPA) of hub genes.
Predicted upstream regulators of differentially expressed genes in this study.
| Upstream Regulator | Molecule Type | Target Molecules in Dataset | |
|---|---|---|---|
| progesterone | chemical—endogenous mammalian | 8.44 × 10−10 | ADAMDEC1, CFTR, CYP26A1, DKK1, DPP4, EDN3, GNMT, IGFBP1, LPL, LTF, NPL, PDZK1IP1, PRSS35, PTGS2, SFRP4, STAT5A |
| JAK | group | 6.32 × 10−9 | FBXO32, IFIT1, ISG15, PTGS2, RSAD2, STAT5A |
| STAT3 | transcription regulator | 2.38 × 10−8 | CHEK1, CYP26A1, DKK1, DPP4, HLA-DQA1, IFIT1, IGFBP1, ISG15, LTF, MAP2, PTGS2, RSAD2, TFF3, TRPM3 |
| ACOX1 | enzyme | 2.95 × 10−8 | CSTB, CYP26A1, DPP4, GNMT, HLA-DQA1, IGFBP1, LPL, UCP2 |
| mir-96 | microRNA | 0.000000319 | IFIT1, IGFBP1, ISG15, RSAD2, SAMD9 |
| dexamethasone | chemical drug | 0.000000765 | ACTG2, ANPEP, CHEK1, CHGA, CYP26A1, DGAT2, EDN3, FBXO32, GGT1, IFIT1, IGFBP1, ISG15, ITGB5, LOC102724788/PRODH, LPL, MAP2, MSTN, OR51E1, PTGS2, RSAD2, STAT5A, TOP2A, UCP2 |
| mir-183 | microRNA | 0.000000841 | IFIT1, IGFBP1, ISG15, RSAD2, SAMD9 |
| miR-182-5p (and other miRNAs w/seed UUGGCAA) | mature microRNA | 0.00000239 | IFIT1, IGFBP1, ISG15, RSAD2, SAMD9 |
| beta-estradiol | chemical—endogenous mammalian | 0.0000033 | ADAMDEC1, ANPEP, BLOC1S6, CFTR, CHEK1, CHGA, CSTB, DKK1, HLA-DQA1, IGFBP1, ISG15, LPL, LTF, MEDAG, PDZK1IP1, PRSS35, PTGS2, SFRP4, SLC6A20, SPDEF, STAT5A, TFF3, TOP2A |
| miR-199a-5p (and other miRNAs w/seed CCAGUGU) | mature microRNA | 0.00000526 | ACTG2, CSTB, ISG15, PTGS2, RSAD2 |
| estrogen | chemical drug | 0.00000567 | CFTR, CRABP1, IGFBP1, LPL, LTF, PTGS2, STAT5A, TOP2A |
| mir-15 | microRNA | 0.00000867 | CHEK1, IFIT1, ISG15, PTGS2, UCP2 |
| ciprofibrate | chemical drug | 0.00000957 | CSTB, CYP26A1, DPP4, GNMT, IGFBP1, LPL |
| GALNT6 | Enzyme | 0.0000127 | ANPEP, DPP4, TFF3 |
| 2-methoxyestradiol | chemical—endogenous mammalian | 0.0000155 | ITGB5, LOC102724788/PRODH, LTF, PTGS2 |