| Literature DB >> 35197999 |
Rafiatu Azumah1, Katja Hummitzsch1, Monica D Hartanti1,2, Justin C St John1, Richard A Anderson3, Raymond J Rodgers1.
Abstract
Polycystic Ovary Syndrome (PCOS) is a multifactorial syndrome with reproductive, endocrine, and metabolic symptoms, affecting about 10% women of reproductive age. Pathogenesis of the syndrome is poorly understood with genetic and fetal origins being the focus of the conundrum. Genetic predisposition of PCOS has been confirmed by candidate gene studies and Genome-Wide Association Studies (GWAS). Recently, the expression of PCOS candidate genes across gestation has been studied in human and bovine fetal ovaries. The current study sought to identify potential upstream regulators and mechanisms associated with PCOS candidate genes. Using RNA sequencing data of bovine fetal ovaries (62-276 days, n = 19), expression of PCOS candidate genes across gestation was analysed using Partek Flow. A supervised heatmap of the expression data of all 24,889 genes across gestation was generated. Most of the PCOS genes fell into one of four clusters according to their expression patterns. Some genes correlated negatively (early genes; C8H9orf3, TOX3, FBN3, GATA4, HMGA2, and DENND1A) and others positively (late genes; FDFT1, LHCGR, AMH, FSHR, ZBTB16, and PLGRKT) with gestational age. Pathways associated with PCOS candidate genes and genes co-expressed with them were determined using Ingenuity pathway analysis (IPA) software as well as DAVID Bioinformatics Resources for KEGG pathway analysis and Gene Ontology databases. Genes expressed in the early cluster were mainly involved in mitochondrial function and oxidative phosphorylation and their upstream regulators included PTEN, ESRRG/A and MYC. Genes in the late cluster were involved in stromal expansion, cholesterol biosynthesis and steroidogenesis and their upstream regulators included TGFB1/2/3, TNF, ERBB2/3, VEGF, INSIG1, POR, and IL25. These findings provide insight into ovarian development of relevance to the origins of PCOS, and suggest that multiple aetiological pathways might exist for the development of PCOS.Entities:
Keywords: fetal ovary; mitochondrial dysfunction; polycystic ovary syndrome; steroidogenesis; stromal expansion; upstream regulators
Year: 2022 PMID: 35197999 PMCID: PMC8860493 DOI: 10.3389/fgene.2021.762177
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
The three distinct expression patterns of PCOS candidate genes observed during fetal ovary development according to Hartanti, et al. (2020), Liu, et al. (2020); early, late and throughout gestation.
| Early | Late | Throughout |
|---|---|---|
| Fibrillin 3 ( | Insulin receptor ( | Thyroid adenoma associated ( |
| GATA binding protein 4 ( | Follicle stimulating hormone receptor ( | Erb-B2 receptor tyrosine kinase 4 ( |
| High mobility group AT-hook 2 ( | Plasminogen receptor with a C-terminal lysine | DNA repair protein ( |
| TOX high mobility group box family member 3 ( | Zinc finger and BTB domain containing 16 ( | Chromosome 9 open reading frame 3 ( |
| DENN domain-containing 1A ( | Interferon regulatory factor 1 ( | Yes associated protein 1 ( |
| Luteinising hormone/chorionic gonadotrophin receptor | Transforming growth factor beta 1 induced transcript 1 ( | Ras-related protein ( |
| Follicle stimulating hormone beta subunit ( | Luteinizing hormone/choriogonadotropin receptor ( | Sulphite oxidase ( |
| Erb-B2 receptor tyrosine kinase 3 ( | Anti-mullerian hormone ( | Ca2+/calmodulin-dependent protein kinase ( |
| — | Androgen receptor ( | ADP ribosylation factor like GTPase 14 effector protein ( |
| — | — | Farnesyl-diphosphate farnesyltransferase 1 ( |
| — | — | Nei like DNA glycosylase 2 ( |
| — | — | Microtubule associated protein RP/EB family member 1 ( |
Characteristics of bovine fetal ovaries and RNA quality index (RQI).
| Sample ID | Identical samples | Crown-rump length [cm] | Gestational age [days] | Gestational period | RQI |
|---|---|---|---|---|---|
| 15/R12t | Y | 7.7 | 62 | Early I | 9.0 |
| 15/R85t | Y | 9 | 66 | Early I | 8.5 |
| 15/R86t | N | 12 | 76 | Early I | 8.9 |
| 15/R74t | Y | 14 | 82 | Early I | 9.1 |
| 15/R41t | N | 17 | 91 | Early II | 9.3 |
| 15/R57t | N | 19 | 98 | Early II | 9.2 |
| 15/R42t | Y | 24 | 113 | Early II | 9.6 |
| 15/R51t | Y | 28 | 124 | Early II | 9.6 |
| 15/R43t | Y | 32 | 135 | Early II | 9.5 |
| 15/R2t | Y | 39 | 154 | Late | 9.8 |
| 15/R44t | Y | 45 | 170 | Late | 9.3 |
| 15/R1t | Y | 58 | 201 | Late | 9.5 |
| 15/R45t | Y | 74 | 234 | Late | 9.6 |
| 15/R33t | Y | 80 | 245 | Late | 9.6 |
| 15/R47t | N | 86 | 255 | Late | 9.4 |
| 15/R35t | Y | 88 | 258 | Late | 9.5 |
| 15/R38t | Y | 91 | 263 | Late | 9.3 |
| 15/R49t | N | 93 | 266 | Late | 9.7 |
| 15/R50t | Y | 100 | 276 | Late | 9.5 |
Identical samples refer to samples also analysed in Hartanti, et al. (2020), Liu, et al. (2020).
FIGURE 1Principal Component analysis of bovine ovarian samples analysed in the RNA sequencing. The samples are grouping into three stages; early I (red; 62–82 days, n = 4), early II (blue; 91–135 days, n = 5) and late (orange; 154–276 days, n = 10).
Pearson’s correlation coefficients (R) between PCOS candidate genes mRNA expression levels and gestational age (n = 19) in bovine fetal ovaries.
|
|
Positive and negative correlations are marked in pink and blue, respectively. The colour intensity corresponds with the strength of the correlation. p-values: a <0.05; b < 0.01; c < 0.001; d < 0.0001.
FIGURE 2Location of PCOS candidate genes and their respective clusters on the supervised heatmap of 24,889 genes identified in the fetal ovaries (62–276 days, n = 19) using RNA sequencing. Each cluster is made up of PCOS candidate genes as well as genes co-expressed with them; detailed lists of each cluster can be found in Supplementary Tables S2–S5.
FIGURE 3Top canonical pathways associated with cluster 1 (“early” genes) using (A) Ingenuity Pathway Analysis (IPA) and (B) Gene Ontology (GO) biological processes from DAVID database. “Pop Hits” refers to the total number of genes associated with each of the pathway in the database. The bar graphs represent the percentage of genes from the data set that map to each canonical pathway whilst the orange line shows the p-value of overlap between genes in each cluster and a given pathway.
FIGURE 4Top canonical pathways associated with cluster 2 (“late” genes) using (A) Ingenuity Pathway Analysis (IPA) and (B) Gene Ontology (GO), biological processes from DAVID database. “Pop Hits” refers to the total number of genes associated with each of the pathway in the database. The bar graphs represent the percentage of genes from the data set that map to each canonical pathway whilst the orange line shows the p-value of overlap between genes in each cluster and a given pathway.
FIGURE 5Top canonical pathways associated with cluster 4 (“late” genes) using (A) Ingenuity Pathway Analysis (IPA) and (B) Gene Ontology (GO), biological processes from DAVID database. “Pop Hits” refers to the total number of genes associated with each of the pathway in the database. The bar graphs represent the percentage of genes from the data set that map to each canonical pathway whilst the orange line shows the p-value of overlap between genes in each cluster and a given pathway.
FIGURE 6Top canonical pathways associated with cluster 3 (“throughout” genes) using (A) Ingenuity Pathway Analysis (IPA) and (B) Gene Ontology (GO), biological processes from DAVID database. “Pop Hits” refers to the total number of genes associated with each of the pathway in the database. The bar graphs represent the percentage of genes from the data set that map to each canonical pathway whilst the orange line shows the p-value of overlap between genes in each cluster and a given pathway.
Top biological upstream regulators and their respective activation z-score as well as p-value of association for the strong clusters, cluster 1 and 4.
| Upstream regulator | Name | z-score |
| |
|---|---|---|---|---|
| Cluster 1 |
| Caseinolytic mitochondrial matrix peptidase proteolytic subunit | 4.899 | 1.11E-11 |
|
| Phosphatase and tensin homolog | −0.143 | 1.46E-10 | |
|
| Hepatocyte nuclear factor 4 alpha | −3.972 | 1.90E-10 | |
|
| TLE family member 3, Transcriptional corepressor | — | 2.69E-10 | |
|
| Death associated protein 3 | −3.162 | 1.05E-09 | |
|
| Aryl hydrocarbon receptor nuclear translocator | −3.689 | 2.98E-09 | |
|
| Lon peptidase 1, Mitochondrial | −1.457 | 3.71E-09 | |
|
| Estrogen related receptor gamma | −2.006 | 6.71E-09 | |
|
| Firre intergenic repeating RNA element | −4.796 | 1.41E-08 | |
|
| Amyloid beta precursor protein | −2.107 | 3.01E-08 | |
|
| MYC proto-oncogene, BHLH transcription factor | −7.632 | 3.11E-08 | |
|
| Serine/Threonine kinase 11 | −4.041 | 3.31E-08 | |
|
| AlkB homolog 1, Histone H2A dioxygenase | −2.646 | 8.62E-08 | |
|
| NOP2/Sun RNA methyltransferase 3 | -2.646 | 8.62E-08 | |
|
| Presenilin 1 | −0.649 | 1.64E-07 | |
|
| Lysine demethylase 5A | 4.914 | 2.45E-07 | |
|
| Microtubule associated protein tau | — | 3.58E-07 | |
|
| RPTOR independent companion of MTOR complex 2 | 4.854 | 5.14E-07 | |
|
| DEAD-box helicase 5 | −3.638 | 2.46E-06 | |
|
| Tumor protein P53 | −1.511 | 2.90E-06 | |
|
| Estrogen related receptor alpha | −1.849 | 3.18E-06 | |
| Cluster 4 |
| Sterol regulatory element binding transcription factor 2 | 4.586 | 1.41E-22 |
|
| Insulin induced gene 1 | −4.447 | 5.74E-21 | |
|
| Mitogen-activated protein kinase 5 | 4.088 | 3.78E-18 | |
|
| SREBF chaperone | 4.269 | 9.03E-18 | |
|
| Cytochrome P450 oxidoreductase | −3.776 | 4.93E-16 | |
|
| Sterol regulatory element binding transcription factor 1 | 4.508 | 3.27E-14 | |
|
| Major facilitator superfamily domain containing 2A | −3.138 | 8.14E-13 | |
|
| ATPase copper transporting beta | 3.464 | 1.46E-12 | |
|
| Mitogen-activated protein kinase 7 | 3.873 | 2.30E-12 | |
|
| Natriuretic peptide B | −3.283 | 3.41E-12 | |
|
| SH3 domain and tetratricopeptide repeats 2 | 3.162 | 2.27E-11 | |
|
| Prominin 1 | −2.828 | 3.28E-11 | |
|
| Peroxisome proliferator activated receptor alpha | 1.112 | 6.45E-11 | |
|
| Nuclear receptor subfamily 5 group A member 1 | 3.446 | 7.22E-11 | |
|
| Complement component 4 binding protein | 2.449 | 5.69E-10 | |
|
| Regulatory associated protein of MTOR complex 1 | 3.2 | 1.43E-09 | |
|
| Kinesin family member 3A | −2.596 | 1.47E-09 | |
|
| Insulin induced gene 2 | −2.586 | 1.97E-09 | |
|
| Transforming growth factor beta 1 | 2.172 | 3.90E-09 | |
|
| Sirtuin 2 | 1.913 | 6.77E-09 |
Top biological upstream regulators and their respective activation z-score as well as p-value of association for the weak clusters, cluster 2 and cluster 3.
| Upstream regulator | Name | z-score |
| |
|---|---|---|---|---|
| Cluster 2 |
| Transforming growth factor beta 1 | 7.017 | 1.77E-21 |
|
| HRas proto-oncogene, GTPase | −0.699 | 1.60E-14 | |
|
| Erb-B2 receptor tyrosine kinase 2 | −0.667 | 7.95E-13 | |
|
| Coagulation factor II, Thrombin | 5.485 | 9.38E-13 | |
| Alpha catenin | Alpha catenin group | −5.256 | 1.76E-12 | |
|
| Transforming growth factor beta 2 | 3.062 | 3.71E-12 | |
|
| Tumor necrosis factor | 3.597 | 1.00E-11 | |
|
| Mitogen-activated protein kinase 1 | 3.218 | 1.23E-11 | |
|
| Collagen like tail subunit of asymmetric acetylcholinesterase | 1.511 | 1.29E-11 | |
|
| Myocardin related transcription factor B | 5.14 | 1.60E-11 | |
|
| Vascular endothelial growth factor | 5.581 | 9.26E-11 | |
|
| Tumor protein P53 | 4.493 | 9.90E-11 | |
|
| Transforming growth factor beta | 4.093 | 2.28E-10 | |
|
| Angiotensinogen | 3.331 | 3.01E-10 | |
|
| Fibroblast growth factor 2 | 4.223 | 4.14E-10 | |
|
| Integrin subunit beta 1 | −1.223 | 5.19E-10 | |
|
| Sp1 transcription factor | 5.601 | 1.01E-09 | |
|
| Erb-B2 receptor tyrosine kinase 3 | 0.325 | 1.71E-09 | |
|
| Tumor protein P63 | 1.448 | 1.80E-09 | |
|
| Transforming growth factor beta receptor 2 | 3.079 | 3.79E-09 | |
|
| Transforming growth factor beta 3 | 3.759 | 4.62E-09 | |
| Cluster 3 |
| Achaete-scute family BHLH transcription factor 1 | 2.543 | 7.44E-05 |
|
| Forkhead box Q1 | — | 5.54E-04 | |
|
| Dystonin | — | 5.54E-04 | |
|
| NOBOX oogenesis homeobox | — | 9.83E-04 | |
|
| HNF1 homeobox A | 2.88 | 1.51E-03 | |
|
| R-spondin 2 | — | 3.22E-03 | |
|
| TATA-box binding protein like 1 | — | 3.22E-03 | |
|
| Atonal BHLH transcription factor 1 | — | 3.48E-03 | |
|
| GATA binding protein 2 | −0.128 | 4.78E-03 | |
|
| Synuclein alpha | −1.673 | 4.83E-03 | |
|
| Glutamate ionotropic receptor NMDA type subunit 3A | −2.828 | 5.61E-03 | |
|
| Sterile alpha motif domain containing 4A | — | 7.81E-03 | |
|
| Presenilin enhancer, gamma-secretase subunit | — | 1.08E-02 | |
|
| Protein tyrosine phosphatase non-receptor type 11 | — | 1.09E-02 | |
|
| RE1 silencing transcription factor | −2.19 | 1.14E-02 | |
|
| POU class 5 homeobox 1-nanog homeobox | — | 1.41E-02 | |
|
| Interleukin 25 | 0.277 | 1.63E-02 | |
|
| Aph-1 homolog A, gamma-secretase subunit | — | 1.79E-02 | |
|
| Agrin | — | 1.79E-02 |
FIGURE 7Important networks connected with cluster 1. Network 3 (A) and network 12 (B) are associated with mitochondrial functions. Red color of molecules represents downregulation in the second half of gestation and the intensity of each color shows the strength of regulation.
FIGURE 9Important networks of genes associated with cluster 4. (A) Network 1 is associated with the effects of the transcription factor MYC. (B) Network 2 is connected with steroidogenesis. Red represents downregulation and blue upregulation in the second half of gestation. The strength of regulation is shown by the intensity of each color.
FIGURE 8Important networks associated with cluster 2. (A) Network 16 contains β-catenin as central player, whereas (B) network 24 is associated with the components of extracellular matrix, such as collagens and fibronectin. Blue color of molecules represents upregulation in the second half of gestation and the intensity of each color shows the strength of regulation.