Literature DB >> 35282430

Identifying Environmental Endocrine Disruptors Associated With the Age at Menarche by Integrating a Transcriptome-Wide Association Study With Chemical-Gene-Interaction Analysis.

Mengnan Lu1, Ruoyang Feng2, Yujie Qin1, Hongyang Deng1, Biyao Lian1, Chunyan Yin1, Yanfeng Xiao1.   

Abstract

Menarche is the first occurrence of menstrual bleeding and one of the most important events of female puberty. Alarmingly, over the last several decades, the mean age at menarche (AAM) has decreased. Environmental endocrine disruptors (EEDs) are chemicals that may interfere with the endocrine system, resulting in adverse developmental, immunological, neurological, and reproductive effects in humans. Thus, the effects of EEDs on fertility and reproduction are growing concerns in modern societies. In this study, we aimed to determine the influence of genetic and environmental factors on AAM. We used data from an AAM genome-wide association study of 329,345 women to conduct a transcriptome-wide association study (TWAS) with FUSION software. As references, we determined the gene-expression levels in the hypothalamus, pituitary gland, ovaries, uterus, and whole blood. We performed Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses using the significantly dysregulated genes identified by the TWAS. Using the STRING database, we also generated a protein-protein-interaction network to analyze common AAM-specific genes identified by the TWAS with different tissues. We performed chemical-related gene set enrichment analysis (CGSEA) and identified significant TWAS genes to uncover relationships between different chemicals and AAM. The TWAS identified 9,848 genes; among these, 1580 genes were significant (P < 0.05), and 11 genes were significant among the hypothalamus, pituitary, ovary, uterus, and whole blood. CGSEA identified 1,634 chemicals, including 120 chemicals significantly correlated with AAM. In summary, we performed a TWAS (for genetic factors) and CGSEA (for environmental factors) focusing on AAM and identified several AAM-associated genes and EEDs. The results of this study expand our understanding of genetic and environmental factors related to the onset of female puberty.
Copyright © 2022 Lu, Feng, Qin, Deng, Lian, Yin and Xiao.

Entities:  

Keywords:  CGSEA; GWAS; TWAS; environmental endocrine disruptor; menarche; puberty

Mesh:

Substances:

Year:  2022        PMID: 35282430      PMCID: PMC8907571          DOI: 10.3389/fendo.2022.836527

Source DB:  PubMed          Journal:  Front Endocrinol (Lausanne)        ISSN: 1664-2392            Impact factor:   5.555


Introduction

Puberty is a complex process occurring between childhood and adulthood, producing internal and external physical changes that promote the development of primary and secondary sexual characteristics important for sexual reproduction (1). Sex hormones are responsible for the physical manifestations of female puberty, including thelarche, pubarche, and menarche (2). Menarche is the first menstrual bleeding and one of the most important events of female puberty. Over the last several decades, the mean age at menarche (AAM) has declined, which is concerning (3). Epidemiological evidence suggests that the onset of puberty is advancing in humans through undetermined mechanisms (4). Environmental endocrine disruptors (EEDs) are chemicals that interfere with the endocrine system and evoke adverse developmental, immunological, neurological, and reproductive effects in humans (5). EEDs are common in human living environments and include pesticides, plasticizers, industrial by-products, drugs, and some naturally occurring phytochemicals (6). These exogenous chemicals can interfere with the complex endocrine system, causing adverse health effects, such as reproductive disorders, metabolic diseases, and various cancers (7). Chronic exposure to EEDs may play a role in accelerating or delaying the onset of menarche, and extensive research has shown that pesticides, phenols, polycyclic aromatic hydrocarbons, phthalates, and some heavy metals are responsible for hormone metabolism disorders occurring during puberty (8). Consequently, the effects of EEDs on fertility and reproduction are a growing concern in modern societies (9). Recent genome-wide association studies (GWASs) have identified thousands of genetic variants associated with complex phenotypes and have provided insights into their genetic architectures. GWASs are also extremely well-suited for identifying common single-nucleotide polymorphism (SNP)-based variants (10). GWAS analysis has been conducted to research early puberty and identify the genetic characteristics of idiopathic central precocious puberty and validate the polygenic risk for early puberty (11). Felix R Day et al. have identified 389 independent, genome-wide significant signals for AAM, which explained ~7.4% of the population variance in AAM and corresponded to ~25% of the estimated heritability (11). Genetic loci cause trait variations, ranging from growth and fitness in simple organisms to disease in humans. Determining the genetics of gene-expression differences has emerged as a key approach for linking DNA-sequence variations to phenotypes (12). Transcriptome-wide association study (TWAS) analysis has been used to identify significant expression-trait associations by integrating genotypes, gene-expression levels, and phenotypes in order to gain insights into the genetic basis of complex traits (13). In a recent study, a TWAS was performed to discover transcriptome differences that affect the age of natural menopause (ANM), and 34 ANM-associated genes were reported (14). The hypothalamic–pituitary–gonadal axis controls puberty and reproduction (15). Therefore, investigating the associated gene-regulation relationships may help identify important genes that are co-expressed in all tissue types during AAM. Here, we aimed to determine the influences of genetic and environmental factors on AAM by performing a large-scale TWAS for AAM based on a GWAS data set. We investigated gene-expression levels in the hypothalamus, pituitary gland, ovaries, uterus, and whole blood. We also reevaluated the expression of TWAS-identified genes, functionally explored the genes, and identified AAM-associated EEDs.

Methods

Summary of the AAM GWAS Data Used in This Study

We used published GWAS summary data for female AAM (16). Briefly, Day et al. performed a meta-analysis of pooled GWAS data from multiple studies of 329,345 women of European ancestry, including 40 studies from the ReproGen consortium (N = 179,117), 23andMe (N = 76,831), and the United Kingdom Biobank (N = 73,397). The results were grouped into the relatively sparse HapMap 2 reference panel or attributed to the 1000 Genomes Project reference panel using gene-centric arrays. In each study, the associations of SNPs with AAM were based on a two-tailed additive linear-regression model and several factors, including the age at the study visit and other study-specific covariates. Day et al. performed an expanded genomic analysis of AAM in women that was nearly three times larger and used denser genomic data than previous studies. Information on the subjects, genotypes, responsibilities, and quality control were detailed in the published study (16). In addition, ethical approval was not applicable for this study as publicly available data were used for the analysis.

TWAS Analysis

We used FUSION (13) software (http://gusevlab.org/projects/fusion/) to analyze the GWAS summary data for the previous meta-analysis of AAM. The most popular TWAS methods, such as PrediXcan, TWAS-Fusion, and SMR, test causal relationships between gene-expression levels and complex traits (17), among which, the TWAS-Fusion method is used more often. Briefly, Bayesian sparse linear-mixed models (18) were used to calculate SNP expression weights for specific genes at the 1-Mb cis position and estimate the association of predicted expression levels with AAM using the following formula: Ztwas = w + Z/(w × [Lw]1/2) (13), where w denotes the weight, Z denotes the Z-score, and L denotes the SNP correlation matrix (definition, LD). We used the gene-expression weights for the hypothalamus (N samples = 108; N features = 2,315), pituitary (N samples = 157; N features = 4,402), ovary (N samples = 122; N features = 2,809), uterus (N samples = 101; N features = 2,135), and whole blood (N samples = 1,264; N features = 4,701) as references, which are available for download from the FUSION website (http://gusevlab.org/projects/fusion/). We estimated the transcriptome-wide significance as P = 5.08 × 10−6 (0.05/9,848) using the Bonferroni correction (14). Manhattan plot was made by “CMplot”(v. 3.6.2) in R package.

Functional Exploration of Genes

We performed Kyoto Encyclopedia of Genes and Genomes (KEGG) (19) and Gene Ontology (GO) (20) enrichment analyses to identify and confirm related biological processes. KEGG and GO enrichment were performed using the R packages “org.Hs.eg.db” and “clusterProfiler” (R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/).

Interaction-Network Analysis

We generated a protein–protein-interaction (PPI) network using the STRING database, v11.5 (STRING, https://string-db.org), requiring a confidence score of 0.15 and “active interaction sources” based on a previous study (21). Cytoscape (22) was used to visualize all interaction networks, and the Molecular Complex Detection (MCODE) plugin (23) was used for module analysis.

Chemical Gene Expression Annotation Data Set

The chemical-related gene-expression annotation data set used in this study was downloaded from the Comparative Toxicology Genomics Database (CTD) (http://ctdbase.org/downloads/). The CTD mainly provides four data sets, including a chemical–gene-interaction function, a chemical–disease association, a genetic disease association, and a chemical element-phenotypic association. The CTD integrates the four data sets to automatically construct a hypothetical chemical–gene phenotype disease network to illustrate the molecular mechanisms underlying diseases that affect the environment (24). Cheng et al. downloaded and used 1,788,149 chemical–gene pair annotation terms for humans and mice, generating 11,190 chemical substance-related gene sets (25). We also used that data set to perform our chemical-related gene set enrichment analysis (CGSEA).

CGSEA

CGSEA is a flexible tool for assessing associations between chemicals and complex diseases. Briefly, the software uses genome-wide summary data (e.g., a summary of the TWAS data and messenger RNA [mRNA]-expression profiles) to explore functional relationships among chemical substances and diseases from a genomics perspective, for many complex diseases and characteristics. We used the CTD Chemical Gene Interaction Network and TWAS Expression Association to test the AAM statistics and weighted Kolmogorov–Smirnov running sum statistics to explore the relationships between chemicals and AAM, as described previously in greater detail (26). Specifically, in this study, we performed 10,000 permutations to obtain the empirical distribution of the gene set enrichment analysis (GSEA) statistical data for each chemical and then calculated the P-value of each chemical based on the empirical distribution of the CGSEA statistical data. Based on previous findings, we excluded gene sets containing <10 or >200 genes to limit the influence of gene set sizes on the results (27). To avoid deviations from expression correlations between genes, a package of lme4qtl software (28) was used to adapt the mixed-model regression of the TWAS Z-score based on the number of members in each gene set, in order to consider the correlation of the Z-score between genes caused by LD (28). A detailed description of the analytical method used was provided previously (25).

Results

TWAS Analysis of AAM

TWAS analysis identified 9,848 genes from the GWAS summary data and of those 1,580 genes expression was associated with AAM (P <0.05) while 64 showed a significant association (P <5.08 × 10−6); 2,289 genes tested in hypothalamus and of 322 genes expression was associated with AAM (P <0.05) while 8 showed a significant association (P <5.08 × 10-6; ), 4,362 genes tested in pituitary and of 568 genes expression was associated with AAM (P <0.05) while 26 showed a significant association (P <5.08 × 10−6; ), 2,768 genes tested in ovary and of 366 genes expression was associated with AAM (P <0.05) while 14 showed a significant association (P <5.08 × 10−6; ), 2,104 genes tested in uterus and of 253 genes expression was associated with AAM (P <0.05) while 7 showed a significant association (P <5.08 × 10−6; ), and 4,671 genes tested in whole blood and of 604 genes expression was associated with AAM (P <0.05) while 9 showed a significant association (P <5.08 × 10−6; ), respectively ().
Figure 1

Manhattan plots of the association results from the AAM TWAS. The dashed horizontal lines represent P = 5.00 × 10−2. The solid horizontal lines represent P = 5.00 × 10−6 (Bonferroni correction). Each dot represents the genetically predicted expression of one specific gene in the hypothalamus, pituitary gland, ovary, uterus, and whole blood tissues. The X axis represents the chromosome (Chr) encoding the corresponding gene, and the Y axis represents the negative logarithm of the association P value. (A) Gene-expression weights for the hypothalamus. (B) Gene-expression weights for the pituitary gland. (C) Gene-expression weights for the ovaries. (D) Gene-expression weights for the uterus. (E) Gene-expression weights for whole blood.

Manhattan plots of the association results from the AAM TWAS. The dashed horizontal lines represent P = 5.00 × 10−2. The solid horizontal lines represent P = 5.00 × 10−6 (Bonferroni correction). Each dot represents the genetically predicted expression of one specific gene in the hypothalamus, pituitary gland, ovary, uterus, and whole blood tissues. The X axis represents the chromosome (Chr) encoding the corresponding gene, and the Y axis represents the negative logarithm of the association P value. (A) Gene-expression weights for the hypothalamus. (B) Gene-expression weights for the pituitary gland. (C) Gene-expression weights for the ovaries. (D) Gene-expression weights for the uterus. (E) Gene-expression weights for whole blood.

Functional Exploration of TWAS-Identified Genes Associated With AAM

Tissues have unique gene-expression profiles. Thus, we performed an overlap analysis of the significant genes in different tissues to identify the most representative genes. shows the resulting Venn diagram, which indicates the number of genes expressed in one or more tissues. Overall, 163 TWAS-identified significant AAM-specific genes were associated with the hypothalamus: 38 significant genes were associated with the hypothalamus and pituitary gland; 7 significant genes were associated with the hypothalamus, pituitary, and ovary; and 11 significant genes were associated with the hypothalamus, pituitary, ovary, uterus, and whole blood. The 11 novel TWAS-significant AAM-susceptible genes identified in all five tissues were RBM6 (RNA-binding motif protein 6; chromosome 3), PILRB (paired immunoglobin-like type 2 receptor beta; chromosome 7), CPSF1 (cleavage and polyadenylation-specific factor 1; chromosome 8), INPP5E (inositol polyphosphate-5-phosphatase E; chromosome 9), MRPL43 (mitochondrial ribosomal protein L43; chromosome 10), HSD17B12 (hydroxysteroid-(17-β)-dehydrogenase 12; chromosome 11), TIPIN (TIMELESS-interacting protein; chromosome 15), FLYWCH1 (FLYWCH-type zinc finger 1; chromosome 16), EXOSC6 (exosome component 6; chromosome 16), ADORA2B (adenosine A2b receptor; chromosome 17), and SPATA20 (spermatogenesis-associated 20; chromosome 17). presents detailed information regarding these 11 genes, including the rsIDs of the most significant (i.e., best) GWAS SNPs in the locus (i.e., BEST.GWAS.ID) and the TWAS P-value (i.e., P ).
Figure 2

Functional exploration of the TWAS-identified genes associated with AAM. (A) Venn diagram revealing the overlap of TWAS-significant genes in different tissues. Red, hypothalamus; green, pituitary gland; purple, ovaries orange, uterus; blue, whole blood. (B) Bar plot of enriched GO terms for the overlapping genes.

Table 1

Significant TWAS-identified genes associated with AAM in all five tissues studied.

GeneBEST.GWAS.ID P TWAS
HypothalamusPituitary glandOvaryUterusWhole blood
RBM6rs39053300.002710.000090.000410.000410.00303
PILRBrs29505200.028870.034000.029000.038000.03621
CPSF1rs352533560.005280.011880.010560.022240.01915
INPP5Ers104483400.035030.027000.012990.041000.01770
MRPL43rs111909010.000140.000170.000300.000380.00070
HSD17B12rs64854430.002720.000610.003160.000890.00368
TIPINrs21136880.004200.009020.001470.004200.02590
FLYWCH1rs18340260.015000.015000.007720.012100.02630
EXOSC6rs71968420.000190.000750.000030.000010.00002
ADORA2Brs1788370.017400.000270.000270.000270.00110
SPATA20rs9891280.022100.049200.035600.006390.00593
Functional exploration of the TWAS-identified genes associated with AAM. (A) Venn diagram revealing the overlap of TWAS-significant genes in different tissues. Red, hypothalamus; green, pituitary gland; purple, ovaries orange, uterus; blue, whole blood. (B) Bar plot of enriched GO terms for the overlapping genes. Significant TWAS-identified genes associated with AAM in all five tissues studied. We subjected the TWAS-identified genes to GO analysis (). Five enriched GO terms belonged to the biological process (BP) category, including cell cycle DNA replication, long-chain fatty-acyl-CoA metabolic process, DNA replication checkpoints, estrogen biosynthetic processes, and positive regulation of immune effector processes. Three significantly enriched GO terms belonged to the cellular component (CC) category, including the mRNA cleavage factor complex, mRNA cleavage and polyadenylation specificity factor complex, and nuclear exosome (RNase complex). In terms of the molecular function (MF) category, the enriched GO terms primarily involved estrogen metabolism (such as steroid dehydrogenase activity or steroid dehydrogenase activity acting on the CH-OH group of donors), nicotinamide adenine dinucleotide or nicotinamide adenine dinucleotide phosphate as an acceptor, and estradiol 17-beta-dehydrogenase activity.

PPI Network of the TWAS-Identified Genes

We used 1,580 TWAS-significant AAM-associated genes for PPI analysis and successfully transformed 1,056 protein-coding genes (). To effectively find densely connected regions of the PPI network, we formed seven MCODE clusters with the PPI network genes (). The hub genes identified using the MCODE plugin were further analyzed for functional exploration. MCODE cluster 1 (MCODE1) was related to ribosome biogenesis, female pregnancy, and blastocyst development. MCODE2 was characterized by genes in the human leukocyte antigen (HLA) family associated with the immune process. MCODE3, MCODE4, and MCODE5 were related to tRNA, energy metabolism processes, and biosynthetic processes. MCODE6 was associated with lipid oxidation and maternal processes involved in female pregnancy. MCODE7 contained some important significantly enriched terms, including fatty acid metabolic, gonadotropin-releasing hormone (GnRH), estrogen, oxytocin, and PI3K-Akt signaling pathways and breast cancer.
Figure 3

PPI network and significant modules. Red and blue circles indicate upregulated and downregulated TWAS-identified genes. (A) PPI network of the TWAS-identified genes. (B-H) Significant modules 1 to 7 of the PPI network and their functional exploration. GeneRatio: the ratio of the number of genes associated with a term linked to an enriched gene to the total number of enriched genes.

PPI network and significant modules. Red and blue circles indicate upregulated and downregulated TWAS-identified genes. (A) PPI network of the TWAS-identified genes. (B-H) Significant modules 1 to 7 of the PPI network and their functional exploration. GeneRatio: the ratio of the number of genes associated with a term linked to an enriched gene to the total number of enriched genes.

CGSEA of the TWAS-Identified Genes

We performed CGSEA to investigate environmental factors that influence the onset of puberty, which identified 1,634 chemicals, including 120 chemicals that correlated significantly with AAM. These significant chemicals included some drugs (e.g., fluoxetine), pesticides (e.g., ametryne), plant extracts (e.g., isoflavones), nutrients (e.g., cholesterol), phenols (e.g., cannabidiol), phthalates (e.g., monobutyl phthalate), heavy metals (e.g., uranium), and atmospheric pollutants (e.g., phosgene). illustrates our constructed network of EEDs and their target genes based on the TWAS-identified genes. Chemical with absolute normalize enrichment score (NES) values of >1 were considered significantly enriched according to GSEA. We identified 77 significantly enriched chemicals with an |NES| value of >1 and a P-value of <0.05 ().
Figure 4

CGSEA results. Network of EEDs and their target genes, based on the TWAS-identified genes.

Table 2

Significant chemicals identified by CGSEA of the significant TWAS-identified AAM-associated genes.

Chemical nameNESPChemical nameNESP
Chlorine12.354590.00050Acetaldehyde3.245240.01399
Fluoxetine8.681680.00050Heliotrine2.508180.01449
Proton pump inhibitors63.994500.00050Pseudocumene3.385920.01499
Fexofenadine18.122490.00100Chlordecone2.989020.01499
Torcetrapib7.282640.00150Gefitinib1.741580.01549
Methylmethacrylate6.888440.00250Cholesterol2.326180.01599
Chromous chloride8.169880.00350N-nitroso-tris-chloroethylurea2.923140.01649
Thalidomide4.839890.00350Methylmercury cysteine1.125540.01799
Pentosan sulfuric polyester5.335290.00400Pyrene2.665450.01799
Anacardic acid4.531820.00450Dehydroxymethylepoxyquinomicin1.587570.01799
Calcimycin5.926280.00450Casticin2.551380.01899
Palbociclib2.058210.00550Squalestatin 12.607880.01949
Temozolomide4.661280.00550Periodate-oxidized adenosine2.808210.01999
Uranyl acetate3.305520.00600Isoflavones2.199510.02049
Catechol4.401090.00650Melphalan2.674010.02099
Cannabidiol3.317960.00650Clomipramine1.576210.02149
Amitriptyline4.157780.00700Uranium2.493940.02199
Ibuprofen3.810670.00700Monobutyl phthalate2.525500.02249
Uric acid2.270660.00700Rimonabant2.719400.02349
Methylparaben5.839280.00750Toxaphene1.215010.02399
Oleoyl-estrone1.505550.00750Ursodeoxycholic Acid2.449360.02399
2-(4-nitrophenyl)-4-(4-fluorophenyl)-5-(4-pyridinyl)-1H-imidazole1.722540.00750Letrozole1.831990.02449
Benzbromarone3.935960.007504-toluidine2.477430.02499
Ethionine4.067890.00750Carcinogens2.273750.02649
SK&F 839593.568730.00800Okadaic acid2.351050.02649
Potassium perchlorate3.559620.00850Sulfasalazine1.980580.02749
Phenanthrene3.098490.00900Pyrazolanthrone2.257820.02899
SNX 21121.696290.00900Pravastatin1.712240.03098
Aripiprazole1.824750.00950Piperonyl butoxide2.137140.03298
Asbestos3.888680.00950Potassium chloride1.234050.03298
S-Adenosylmethionine2.341650.00950Bucladesine1.808930.03498
Reserpine2.095260.010493-(2-hydroxy-4-(2-methylnonan-2-yl) phenyl) cyclohexan-1-ol2.166250.03598
Ciprofibrate3.596980.01099Caffeic acid1.439150.04248
Probenecid2.549750.01099Methimazole1.971810.04248
Pyrogallol3.565960.01099Methyl cellosolve1.864380.04298
Homocysteine3.546620.01149Estrogens1.476290.04448
Phosgene3.015410.01199Ciglitazone1.267010.04598
Tebuconazole2.515050.01349Hyaluronic acid1.841930.04698
Paraoxon2.582330.01349
CGSEA results. Network of EEDs and their target genes, based on the TWAS-identified genes. Significant chemicals identified by CGSEA of the significant TWAS-identified AAM-associated genes.

Discussion

Puberty is a developmental period affecting the body and behavior of children developing secondary sexual characteristics (29). Pubertal hormones play important roles in the adrenal, gonadal, and growth axes (30). Pubertal timing is determined by the activation of pulsatile hypothalamic GnRH secretion when pituitary gonadotropin secretion and downstream gonadal maturation are also initiated (31). Thus, various tissue types are affected by AAM, including the hypothalamus, pituitary gland, ovaries, uterus, and whole blood. Up to now, previous studies have identified several genes associated with AAM, these results are similar to our studies (). Here, we performed a comprehensive TWAS to evaluate the relationship between AAM and predicted genes found in the hypothalamus, pituitary gland, ovaries, uterus, and whole blood. We identified 11 genes whose genetically predicted expression was associated with AAM (P ≤ 0.05), including 10 novel genes (RBM6, PILRB, CPSF1, INPP5E, MRPL43, TIPIN, FLYWCH1, EXOSC6, ADORA2B, and SPATA20) and one gene (HSD17B12) linked to AAM in a previous GWAS. The 11 protein-coding genes identified in our study have been implicated in estrogen metabolism. We also identified enriched fatty acid-metabolism pathways, similar to the findings of a recent study (36), indicating that different types of fatty acids may influence puberty timing. Additionally, individual fatty acids might have different physiological and metabolic effects, including the progression of pubertal development. Therefore, our study provides new information that improves our understanding of the genetics and etiology of AAM.
Table 3

Previously reported significant AAM-related genes.

AuthorsSNPGenePa PTWASb
John RB Perry, et al. (32)rs10148448MEG32.10E-042.52E-07
Diana L. Cousminer, et al. (33)rs12917823MAPK3P < 5.00E-021.22E-03
rs17046434ADCY3P < 5.00E-022.42E-02
rs7759938LIN28BP < 5.00E-026.37E-49
Cathy E. Elks, et al. (34).rs7759938LIN28B1.60E-586.37E-49
rs10148448BEGAIN1.70E-102.55E-06
rs4929923TRIM662.40E-081.38E-09
rs3905330RBM61.40E-098.60E-05
rs633715SEC16B1.50E-094.07E-03
rs4955420KLHDC8B1.80E-096.13E-06
rs16938437PHF21A1.40E-099.98E-09
rs2687729EEFSEC1.00E-082.51E-11
rs8104651OLFM24.60E-101.10E-07
rs939317ECE22.30E-094.89E-03
rs12641981GNPDA28.70E-083.85E-02
rs12056794MSRA2.40E-021.00E-02
Nicholas Mancuso, et al. (35).rs6580698CCDC65P < 5.00E-087.30E-05
rs7330016COG6P < 5.00E-084.40E-02
rs12917823INO80EP < 5.00E-082.47E-05
rs3761919NUCKS1P < 5.00E-084.86E-08
rs4717903PMS2P5P < 5.00E-086.58E-04
rs3761919RAB7L1P < 5.00E-089.47E-07
rs4717903STAG3L2P < 5.00E-084.37E-08
rs2274351TMEM180P < 5.00E-082.24E-04

P values of previously reported AAM-related genes.

Significant TWAS-identified genes associated with AAM in present research.

Previously reported significant AAM-related genes. P values of previously reported AAM-related genes. Significant TWAS-identified genes associated with AAM in present research. HSD17B12 encodes 17β-hydroxysteroid dehydrogenase (17β-HSD), which is crucial for converting estrone into estradiol and fatty acid elongation. Our results support those of Kemiläinen et al. (37), who found that HSD17B12 played a vital role in female fertility through arachidonic acid metabolism. Specifically, an investigation of the 17β-HSD enzyme expression in human and mouse ovaries revealed that female HSD17B+/- mice more often had frequent dysfunctional oogenesis and ovulation, leading to less frequent births. Under extreme circumstances, haploinsufficiency of the HSD17B12 gene in female mice resulted in subfertility. HSD17B12 can affect fatty acid elongation and ceramide accumulation in the serum (38). Previous data showed that central ceramides participate in the timing of female puberty (39), suggesting that HSD17B12 could be a candidate gene for female puberty. Notably, HSD17B12 was also related to endometrial (40), ovarian (41), and breast (42) cancers. Previous results have shown that INPP5E is concentrated at the cilia base, where it helps control phosphoinositide metabolism (43). Primary cilia are present on virtually all cell types. Cilia appear to exert a crucial modulatory role in appropriate axonal wiring due to INPP5E-dependent activation of the PI3K–AKT signaling pathway, triggering an axonal Ca2+ wave (44). GnRH neurons in adults are multi-ciliated, and the percentage of GnRH neurons possessing multiple Kiss1r-positive cilia increases during puberty, correlating with sexual maturation (45). ADORA2B encodes an adenosine receptor and is involved in axon elongation. A previous report showed that ADORA2B transcripts were significantly downregulated in GnRH neurons during proestrus (46), while it has not been studied in pituitary. Therefore, these newly defined genes may influence the electrophysiology of puberty in hypothalamus and pituitary gland. TIPIN is part of the replisome complex and binds the replication fork-protection complex TIMELESS, which is involved in circadian rhythm regulation (47). Circadian rhythms are well known to play key roles in animal reproduction (48). The existing body of research suggests that during the chronotype and the circadian timekeeping system change during the puberty, including endogenous rhythm period and sensitivity to environmental time cues (49). The results of several studies have established that melatonin promotes follicle-stimulating hormone in the pituitary gland and increases serum estrogen levels, thereby accelerating the onset of puberty (50). Those studies indirectly support a potential role of TIPIN in AAM. CPSF recognizes the AAUAAA signal in pre-mRNA and interacts with other factors to facilitate RNA cleavage and the poly(A) addition. CPSF1 is the largest subunit of the CPSF complex. Previous data established that CPSF1 may promote ovarian cancer (51), cell proliferation, and triple-negative breast cancer (52). However, women have a higher risk for developing autoimmune diseases than men, which is attributed to sweeping endocrinological changes during puberty that considerably affect the immune system (53). CPSF1 was previously found to have a high affinity for HLA molecules (54). Thus, CPSF1 may be linked to the immune response in adolescent girls. We extended the well-established GSEA approach to detect associations between environmental chemicals and AAM using published GWAS summary data sets and identified 120 chemicals, including drugs, organic compounds, inorganic compounds, plant extracts, nutrients, phenols, plasticizers, pesticides, herbicides, pollutants, and heavy metals. Aromatase is a rate-limiting enzyme in the conversion of androgens to estrogens, and letrozole is a selective aromatase inhibitor (55). Letrozole has been used to treat McCune–Albright syndrome, which has been associated with precocious puberty in girls (56) and delayed growth and puberty in boys (57). Further, fluoxetine is drug that is commonly used to treat adults with depressive disorders (58). The effect of fluoxetine on puberty has been a controversial issue. The results of one study demonstrated that fluoxetine exposure in utero delays puberty onset in female rats (59), whereas other data suggested that fluoxetine exposure during gestation did not alter plasma estrogen concentrations in peripubertal offspring (60). Some plant extracts act as EEDs, as they are among a group of secondary metabolites with chemical structures similar to those of endogenous hormones. Isoflavones are phytoestrogens that are mainly produced in soybeans and can promote advancement of the vaginal opening in female rats after exogenous supplementation (61). Galangin is a naturally occurring flavonoid that inhibits the effects of flavonoids on human cytochrome P450 (62). Galangin was found to inhibit the aryl hydrocarbon receptor and is considered a potential drug for treating breast cancer (63, 64). Notable health benefits have also been reported for galangin (65), and previous data suggest that galangin is useful for treating precocious puberty. Zearalenone (ZEN) is a non-steroidal mycoestrogen that can exert adverse endocrine effects in mammals (66). To date, there is little agreement regarding the effect of ZEN on puberty. One group found that ZEN might trigger central precocious puberty (CPP) development in girls (67), whereas another group found an association between ZEN and normal pubertal development in adolescent girls (68). A growing body of literature has associated being overweight or obese with early puberty (69); overweight and obese girls undergo menarche earlier than normal-weight girls (70). Several cross-sectional data have suggested that a substantial proportion of girls with CPP have hypertriglyceridemia (71). Cholesterol is an endogenous ligand of estrogen-related receptor alpha (72) that plays a role in activating estrogen receptors (73). Based on these data, we can infer that a high-cholesterol diet may also be a risk factor for precocious puberty. Plasticizers are additives used to produce or promote plasticity and flexibility in plastics, which are commonly used in everyday life. Further, parabens have been added to personal care products as antimicrobial preservatives (74), and methylparaben exposure increases glandular tissue sizes during critical developmental windows (75). A large survey showed that elevated phthalate metabolite levels pose potentially high health risks to children (76). In addition, monobutyl phthalate exposure might be associated with a risk for sexual precocity in girls (77). Animal experiments have demonstrated that two alkylphenols (4-nonylphenol and 4-tert-octylphenol) modify hormone biosynthesis and delay the onset of puberty (78). This study has some limitations. First, the pooled GWAS data were obtained from the United Kingdom Biobank, and the study subjects were predominantly from European populations; these aspects affect the extrapolation of the results. Therefore, our results should be used cautiously when studying AAM in other populations. Second, although some of the genes we screened have been confirmed in other studies, our enrichment analysis used genes that did not pass correction for multiple testing (P <0.05), which may introduce some false positives so it requires caution when using our results. And some genes related to AAM susceptibility identified here have not been verified via molecular biology experiments, which should be performed in future studies. Further, some EEDs identified in this study were previously demonstrated to play a role in AAM, whereas others are not yet validated, which will require more clinical observations and cohort studies. However, to the best of our knowledge, this is the first large study using CGSEA to identify candidate EEDs related to AAM. Our TWAS analysis detected AAM-associated genes at the DNA level, and our CGSEA extended the classic GSEA approach to detect associations between environmental chemicals and AAM.

Conclusion

In this study, we aimed to determine the influences of genetic and environmental factors on AAM. Therefore, we performed a TWAS and CGSEA related to AAM and identified multiple AAM-associated genes and EEDs. The results of this study expand our understanding of the genetic and environmental factors affecting the timing of female puberty.

Data Availability Statement

The original contributions presented in the study are included in the article/. Further inquiries can be directed to the corresponding author.

Author Contributions

Author ML and RF collected and processed the data, as well as wrote this article. YQ and HD provided language help and writing assistance. BL proofreaded the article. CY and YX designed the study. All authors contributed to the article and approved the submitted version.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  78 in total

1.  Extension and integration of the gene ontology (GO): combining GO vocabularies with external vocabularies.

Authors:  David P Hill; Judith A Blake; Joel E Richardson; Martin Ringwald
Journal:  Genome Res       Date:  2002-12       Impact factor: 9.043

2.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

3.  Integrative approaches for large-scale transcriptome-wide association studies.

Authors:  Alexander Gusev; Arthur Ko; Huwenbo Shi; Gaurav Bhatia; Wonil Chung; Brenda W J H Penninx; Rick Jansen; Eco J C de Geus; Dorret I Boomsma; Fred A Wright; Patrick F Sullivan; Elina Nikkola; Marcus Alvarez; Mete Civelek; Aldons J Lusis; Terho Lehtimäki; Emma Raitoharju; Mika Kähönen; Ilkka Seppälä; Olli T Raitakari; Johanna Kuusisto; Markku Laakso; Alkes L Price; Päivi Pajukanta; Bogdan Pasaniuc
Journal:  Nat Genet       Date:  2016-02-08       Impact factor: 38.330

4.  Prenatal exposure to mixtures of persistent endocrine disrupting chemicals and early menarche in a population-based cohort of British girls.

Authors:  Kristin J Marks; Penelope P Howards; Melissa M Smarr; W Dana Flanders; Kate Northstone; Johnni H Daniel; Antonia M Calafat; Andreas Sjödin; Michele Marcus; Terryl J Hartman
Journal:  Environ Pollut       Date:  2021-02-09       Impact factor: 8.071

5.  Superresolution Microscopy Reveals Distinct Phosphoinositide Subdomains Within the Cilia Transition Zone.

Authors:  Sarah E Conduit; Elizabeth M Davies; Alex J Fulcher; Viola Oorschot; Christina A Mitchell
Journal:  Front Cell Dev Biol       Date:  2021-04-30

6.  Urinary mycoestrogens and age and height at menarche in New Jersey girls.

Authors:  Zorimar Rivera-Núñez; Emily S Barrett; Elizabeth A Szamreta; Sue A Shapses; Bo Qin; Yong Lin; Helmut Zarbl; Brian Buckley; Elisa V Bandera
Journal:  Environ Health       Date:  2019-03-22       Impact factor: 5.984

7.  Genetic factors of idiopathic central precocious puberty and their polygenic risk in early puberty.

Authors:  Wei-De Lin; Chi-Fung Cheng; Chung-Hsing Wang; Wen-Miin Liang; Chien-Hsiun Chen; Ai-Ru Hsieh; Mu-Lin Chiu; Ting-Hsu Lin; Chiu-Chu Liao; Shao-Mei Huang; Chang-Hai Tsai; Cherry Yin-Yi Chang; Ying-Ju Lin; Fuu-Jen Tsai
Journal:  Eur J Endocrinol       Date:  2021-08-27       Impact factor: 6.664

8.  Polygenic modeling with bayesian sparse linear mixed models.

Authors:  Xiang Zhou; Peter Carbonetto; Matthew Stephens
Journal:  PLoS Genet       Date:  2013-02-07       Impact factor: 5.917

9.  Childhood overweight and obesity and timing of puberty in boys and girls: cohort and sibling-matched analyses.

Authors:  Nis Brix; Andreas Ernst; Lea Lykke Braskhøj Lauridsen; Erik Thorlund Parner; Onyebuchi A Arah; Jørn Olsen; Tine Brink Henriksen; Cecilia Høst Ramlau-Hansena
Journal:  Int J Epidemiol       Date:  2020-06-01       Impact factor: 7.196

10.  Melatonin Administration Accelerates Puberty Onset in Mice by Promoting FSH Synthesis.

Authors:  Chan Yang; Zaohong Ran; Guoshi Liu; Rong Hou; Changjiu He; Qinghua Liu; Yingjun Chen; Yuliang Liu; Xiaodong Wang; Chuqi Ling; Fang Fang; Xiang Li
Journal:  Molecules       Date:  2021-03-09       Impact factor: 4.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.