Literature DB >> 31206546

Integrating genome-wide association and eQTLs studies identifies the genes associated with age at menarche and age at natural menopause.

Gang Wang1, Jian Lv2, Xiaoxin Qiu3, Yujun An4.   

Abstract

OBJECTIVE: An early onset of menarche and, later, menopause are well-established risk factors for the development of breast cancer and endometrial cancer. Although the largest GWASs have identified 389 independent signals for age at menarche (AAM) and 44 regions for age at menopause (ANM), GWAS can only identify the associations between variants and traits. The aim of this study was to identify genes whose expression levels were associated with AAM or ANM due to pleiotropy or causality by integrating GWAS data with genome-wide expression quantitative trait loci (eQTLs) data. We also aimed to identify the pleiotropic genes that influenced both phenotypes.
METHOD: We employed GWAS data of AAM and ANM and genome-wide eQTL data from whole blood. The summary data-based Mendelian randomization method was used to prioritize the associated genes for further study. The colocalization analysis was used to identify the pleiotropic genes associated with both phenotypes.
RESULTS: We identified 31 genes whose expression was associated with AAM and 24 genes whose expression was associated with ANM due to pleiotropy or causality. Two pleiotropic genes were identified to be associated with both phenotypes.
CONCLUSION: The results point out the most possible genes which were responsible for the association. Our study prioritizes the associated genes for further functional mechanistic study of AAM and ANM and illustrates the benefit of integrating different omics data into the study of complex traits.

Entities:  

Mesh:

Year:  2019        PMID: 31206546      PMCID: PMC6576755          DOI: 10.1371/journal.pone.0213953

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Menarche is the first menstrual cycle and signals the possibility of fertility. An early onset of menarche is associated with risks for obesity, type 2 diabetes, cardiovascular disease, breast cancer and all-cause mortality [1]. Menopause is defined as the permanent cessation of menses due to the loss of ovarian follicular activity. Younger age at natural menopause (ANM) is associated with low risk of breast cancer and ovarian cancer, but higher risks of osteoporosis, cardiovascular disease and type 2 diabetes [1]. A Mendelian randomization study has found that later ANM causally increased the risk of breast cancer [2]. These two traits also mark the beginning and the end of a woman’s reproductive life [3]. Genome-wide association studies (GWAS) are capable of identifying the association between target phenotypes and millions of genetic variants. GWAS of age at menarche (AAM) identified 106 loci containing 389 independent signals [4]. GWAS of ANM has successfully identified dozens of significantly associated loci [2, 5, 6]. Most of these loci encode proteins that appear to be involved in DNA repair, immune response and breast cancer processes [2, 5]. However, GWAS can only identify those SNPs strongly associated with the target phenotypes, without pinpointing the target genes and the underlying biological mechanism. For example, the largest GWAS of ANM identified 44 loci containing at least one common variant significantly associated with ANM [2]. However, the significant SNPs in 21 loci were annotated to more than one gene in each locus. It suggested that the specific causal genes remain mostly unidentified. A large number of genetic variants influences the target phenotypes by causal regulatory effect rather than directly influencing the structure of the protein [7]. Expression quantitative trait locus (eQTL), which is a genetic variant influencing a target gene’s expression, is often used to explain the underlying biological mechanism of significant SNPs identified by GWAS. Previous studies have suggested that in the significant loci, those SNPs which were also eQTLs were more likely to be functional SNPs [8]. Zhu et al. proposed a summary-based Mendelian randomization (SMR) analysis to combine GWAS and eQTL data into a single analysis [7]. SMR integrates GWAS and eQTL data identified from the whole blood tissue to identify potential functionally relevant genes at the significant loci identified in GWAS. Previous studies have shown that whole blood can be a proxy of relevant tissues for various phenotypes and diseases [7, 9]. In this study, we identified genes whose expression levels were associated with AAM or ANM due to pleiotropy or causality, by integrating ANM GWAS data with eQTL data. We conducted a colocalization analysis to identify significant SNPs causally associated with both phenotypes.

Materials and methods

AAM GWAS summary dataset

The largest AAM GWAS meta-analysis contained data from ReproGen consortium studies UK Biobank and 23andMe study. Using the 1000 Genomes Project–imputed genotype data in up to ~370,000 European women, 389 independent signals (P < 5 × 10−8) were identified for age at menarche [4]. The summary data were downloaded from the following website (http://www.reprogen.org).

ANM GWAS summary dataset

The largest-scale GWAS meta-analysis summary data of ANM was used in this study [2]. The GWAS meta-analysis conducted with a total sample of 69,360 individuals of European descent from 21 studies identified 44 significant loci. SNPs with the minor allele frequency (MAF) no less than 0.01 and the imputation quality larger than 0.4 were included in the meta-analysis. The summary data were downloaded from the following website (http://www.reprogen.org).

eQTL dataset

Because the Westra eQTL data [9] had a low coverage of human genes (5,967), in this study we used the Consortium for the Architecture of Gene Expression (CAGE) eQTL data which contained 11,829 unique probes to perform the SMR test [10]. The CAGE study was performed to investigate the genetic architecture of gene expression in peripheral blood in 2,765 European individuals [10]. We set the p-value threshold to be 5×10−8 to select the top associated eQTL for the SMR test. After removing those probes where the p-value of the top eQTL was larger than 5×10−8, there were 8,144 probes left in the eQTL summary data. The binary summary data can be downloaded from http://cnsgenomics.com/software/smr/#DataResource.

Genetic correlation

We used stratified linkage disequilibrium score regression (LDSC) to estimate the genetic correlation between AAM and ANM using GWAS summary statistics [11].

SMR analysis

The method of SMR was fully described in the previous paper [7]. In this study, the phenotypic trait is the outcome (Y), gene expression is the exposure (X), and the top cis-eQTL that is strongly associated with gene expression is used as the instrumental variable (Z). SMR method assumes that the eQTL has an effect on the trait through the gene expression. In brief, there were three models including causality (Z - > X - > Y), pleiotropy (Z - > X and Z - > Y) and linkage (Z1 - > X, Z2 - > Y, and Z1 and Z2 are two variants in linkage disequilibrium (LD) in the cis-eQTL region). Since the SMR analysis assumes that the instrument (top cis-eQTL) has a strong effect on the exposure (gene’s expression level), only probes with at least one cis-eQTL at PeQTL (a p-value from the eQTL study indicating the significance of the eQTL associated with the gene expression) smaller than 5×10−8 in the cis-eQTL region were included in the eQTL summary data (hg19) [7]. We excluded cis-eQTL with MAF < 0.01 and cis-eQTL in the MHC region because of the complexity of LD patterns in this region [7]. In this study, we tried to identify those genes with causal or pleiotropic effect on AAM or ANM. To distinguish the causality and pleiotropy models from the linkage model, we conducted the heterogeneity in dependent instruments (HEIDI) test [7]. The HEIDI test considers the pattern of associations using all the SNPs that are significantly associated with gene expression (eQTLs) in the cis-eQTL region (±500kb from the center of the gene probe). The null hypothesis is that there is a single variant affecting trait and gene expression (pleiotropy or causality). The alternative hypothesis is that gene expression and trait are affected by two distinct variants. Under Hardy-Weinberg equilibrium and linkage disequilibrium (LD), bXY (the effect of the gene expression on the trait) estimated at the top associated cis-eQTL (bXY(top)) will be equal to that estimated at any of the cis-SNPs in LD that is associated with gene expression. If we define di = bXY(i)−bXY(top), with bXY(i) being the bXY value of the i-th cis-eQTL, then it is equal to test whether di = 0. If the number of significant eQTL (excluding the top cis-eQTL) in the cis-eQTL region is m, then we can have a normal vector zd(i) = {zd(1),zd(2),…,zd(m)}, where . To test against di = 0, we can use the HEIDI test statistic T = zdIzdT, with zdT being the transposed vector of zd, and I being an identity matrix, as it is estimated as [7]. SNPs in the cis-eQTL region with a PeQTL > 1.6 × 10−3 (equivalent to χ2 < 10) were removed to avoid weak instrumental variables according to the original paper [7]. We used PHEIDI > 0.05 to exclude those genes belonging to the linkage model [7]. The SMR software was downloaded from http://cnsgenomics.com/software/smr/#Download. It was impossible to give a causal conclusion based on only one instrumental variable. In Mendelian randomization studies, multiple uncorrelated instrumental variables (for example, the trans-eQTLs and/or uncorrelated cis-eQTLs) were needed to identify the causality. However, multiple uncorrelated instrumental variables (IVs) were not available in the CAGE eQTL data. In this study, we did not distinguish the causality model from the pleiotropy model.

Colocalization analysis

Colocalization analysis was used to identify the genetic variants affecting both phenotypes. The method was detailed in the previous paper [12]. In brief, the method based on a hierarchical Bayesian model which can be used to find the region containing a variant that influences both phenotypes. According to the previous paper, there were four models that a given genomic region either 1) contains a genetic variant that influences the first trait, 2) contains a genetic variant that influences the second trait, 3) contains a genetic variant that influences both traits, or 4) contains both a genetic variant that influences the first trait and a separate genetic variant that influences the second trait [12]. It estimated the posterior probability of each model. The threshold of posterior probability equal to 0.9 was used to control the false discovery rate at level 0.1 [12].

Results

The genetic correlation between AAM and ANM was 0.0079 (p-value = 0.0032). The genome-wide significant level for SMR analysis was Psmr < 6.14×10−6 (0.05/8,144, Bonferroni test). We identified 98 gene-trait associations with Psmr < 6.14×10−6. After the application of the HEIDI test, this reduced to 54 gene-trait associations (PHEIDI > 0.05). Those genes which did not pass the HEIDI test may be associated with AAM or ANM due to linkage. We identified 31 genes associated with AAM (Table 1) and 24 genes associated with ANM due to pleiotropy or causality (Table 2). Three (ATP1B3, NAAA, and GRTP1) among of the 31 genes associated with AAM can be considered as novel genes, i.e. no previously identified SNP reported as genome-wide significant in the primary GWAS paper in the cis-eQTL region of the probes (Fig 1). Among the 24 genes associated with ANM, seven genes (MSH6, TLK1, SYCP2L, BRCA1, PGAP3, DIDO1, and DDX17) were previously annotated to be responsible for the association based on distance, biological function, eQTL effect and non-synonymous SNP in high LD. We also identified 6 new genes (AK125462, MSL2, CLSTN3, TRAPPC2L, DDX5, and CPNE1) where there was no significant SNP (p < 5×10−8) in the cis-eQTL region of the probes (Fig 2). C17orf46 was the only gene identified to be associated with both phenotypes.
Table 1

Genes identified by SMR analysis for AAM.

probeIDChrGenetopSNPtopSNP_bpp_GWASp_eQTLb_SMRp_SMRp_HEIDIGeneaGenec
ILMN_18691091NUCKS1rs8230942056898074.83E-083.39E-38-0.0648.24E-070.24NUCKS1NUCKS1/ RAB7L1/ PM20D1
ILMN_17650612OXER1rs12617390429853951.25E-074.61E-310.0732.45E-060.46OXER1OXER1
ILMN_16598542PRPF40Ars75926691535506687.80E-122.08E-100.163.75E-060.98PRPF40ANA
ILMN_17833043ATP1B3rs21159351416161980.53.82E-23-0.0103.82E-231.00NEWbNA
ILMN_17324523MAPKAPK3rs13096264506782803.26E-105.31E-47-0.0701.37E-081.00DOCK3MAPKAPK3
ILMN_17526313CGGBP1rs9814057882144724.67E-091.86E-107-0.0432.75E-080.98C3orf38CGGBP1
ILMN_17444713ZNF654rs7653652881893413.56E-098.55E-79-0.0503.17E-080.95C3orf38CGGBP1
ILMN_22855684NAAArs4859572768573880.51.90E-1460.00401.90E-1460.51NEWbNA
ILMN_17494096HLA-Frs3870968296471491.16E-094.51E-370.0736.40E-080.09HCG4NA
ILMN_16973097NCF1rs2267812741381211.87E-138.72E-42-0.0881.54E-100.13GTF2INA
ILMN_21129887NCF1Crs2267812741381211.87E-132.41E-31-0.107.10E-100.16GTF2INA
ILMN_20833337PMS2L5rs3846966741131413.23E-123.35E-22-0.112.10E-080.09GTF2INA
ILMN_17883849C9orf5rs126867361118887391.45E-141.59E-74-0.0622.24E-120.13TMEM245TMEM245
ILMN_21919299C9orf6rs8748641117287185.66E-136.02E-240.116.09E-090.11TMEM245TMEM245
ILMN_21633069FAM120Ars1055710962149281.35E-082.83E-150.125.46E-060.86FAM120ANA
ILMN_237782910NANOS1rs6717361208110734.38E-083.49E-88-0.0432.37E-070.16EIF3ANANOS1
ILMN_166596411GAB2rs901105779246073.51E-138.11E-130.163.99E-070.78GAB2GAB2
ILMN_176764211C11orf46rs7926666303631016.58E-088.15E-34-0.0661.31E-060.24C11orf46C11ORF46
ILMN_209410611HSD17B12rs7118906438173203.67E-071.07E-820.0401.62E-060.05MIR129-2HSD17B12
ILMN_169558512RPS26rs1131017564359293.25E-0700.0197.70E-070.36RPS26NA
ILMN_214235313GRTP1rs49076161140087440.51.52E-120.0161.52E-120.87NEWbNA
ILMN_172727114WARSrs15703051008081552.63E-091.08E-2260.0299.21E-090.11WDR25WARS
ILMN_208076015SNX22rs12102207646074722.15E-109.29E-17-0.125.98E-070.49CSNK1G1TRIP4
ILMN_172440616INO80Ers4787491300153376.98E-134.48E-180.134.14E-080.06TBX6MVP/ KCTD13/ INO80E
ILMN_171756516CLEC18Ars3748388699744483.75E-143.89E-090.203.71E-060.10NFAT5WWP2
ILMN_205668717C17orf56rs1048775792023295.78E-084.60E-380.0669.46E-070.19SLC38A10C17ORF56/ AZI1
ILMN_170739117STXBP4rs244293532307222.26E-134.87E-090.205.35E-060.93STXBP4NA
ILMN_171596819MLL4rs17638853362346521.89E-091.41E-12-0.125.89E-060.23KMT2BCOX6B1/ SNX26
ILMN_177618820MAP1LC3Ars4564863331793671.94E-088.12E-110-0.0389.50E-080.25GGT7MAP1LC3A
ILMN_178122522C22orf27rs5753373312837191.42E-111.76E-780.0573.57E-100.95OSBP2MORC2/ FLJ35801/ OSBP2
ILMN_210359122MORC2rs7284474313901876.47E-117.08E-24-0.115.99E-080.10OSBP2MORC2/ FLJ35801/ OSBP2

topSNP was the most significant SNP in the cis-region of the probe. topSNP_bp was the position of the most significant SNP. p_GWAS was p-value from GWAS. p_eQTL was p-value from eQTL study. b_smr was effect size from SMR test. p_smr was p-value from SMR test. p_HEIDI was p-value from HEIDI test.

a: These genes were annotated by previous AAM GWAS.

b: These genes were considered as novel genes. No SNP in the cis-eQTL region of the probes was identified to be significantly associated with AAM according to the primary GWAS.

c: Genes were identified by previous SMR analysis in the same locus. NA meant that no gene was identified in this locus.

Table 2

Genes identified by SMR analysis for ANM.

probeIDChrGenetopSNP_bptopSNPp_GWASp_eQTLb_SMRp_SMRp_HEIDIGenea
ILMN_18109151FAAH46747301rs121422406.60E-095.31E-280.382.24E-085.54E-02RAD54L
ILMN_17160041NSUN446806703rs104897698.20E-089.38E-770.211.14E-083.25E-01RAD54L
ILMN_17934611AK125462149848885rs12602461.60E-063.92E-790.215.91E-069.53E-01NEWb
ILMN_17328102SNX1727644464rs17289221.20E-145.12E-230.605.06E-106.58E-02BRE / GTF3C2/ EIFB4
ILMN_16700962NRBP127584666rs75866012.30E-141.59E-100.945.85E-071.82E-01BRE / GTF3C2/ EIFB4
ILMN_17290512MSH648018081rs18009323.20E-114.44E-470.301.34E-073.86E-01MSH6
ILMN_18110292TLK1171871997rs130042735.90E-171.67E-101.011.89E-075.00E-01TLK1 / GAD1
ILMN_17880532SLC25A12172704291rs46684142.30E-078.11E-21-0.424.40E-072.27E-01TLK1 / GAD1
ILMN_17668593MSL2136518670rs134336831.10E-052.00E-440.233.08E-079.18E-01NEWb
ILMN_17797436SYCP2L10895260rs68996762.20E-193.03E-101.081.14E-066.02E-01SYCP2L / MAK
ILMN_17988046SRPK135809776rs177050201.70E-067.09E-340.283.79E-065.41E-02MSH5 / HLA
ILMN_176764211C11orf4630363101rs79266661.90E-118.15E-34-0.391.99E-086.12E-02FSHB
ILMN_173402112CLSTN37284301rs21672851.20E-069.69E-20-0.442.53E-065.06E-02NEWb
ILMN_165442112MPHOSPH9123634122rs8845486.80E-072.39E-18-0.447.57E-075.77E-01KNTC1 / PITPNM
ILMN_185990816TRAPPC2L88927221rs38260612.80E-063.93E-198-0.118.13E-077.80E-02NEWb
ILMN_180563617PGAP337833035rs29415062.00E-092.98E-57-0.281.75E-096.23E-01STARD3/ PGAP3/ CDK12
ILMN_231108917BRCA141215825rs30929942.80E-101.18E-660.288.81E-111.58E-01BRCA1
ILMN_170069017VAT141215825rs30929942.80E-104.47E-15-0.611.77E-073.21E-01BRCA1
ILMN_180534417DDX562502435rs19914019.60E-072.73E-840.229.84E-098.01E-02NEWb
ILMN_180205319ZNF9123545004rs2960921.50E-065.82E-1180.158.78E-086.96E-01ZNF729
ILMN_230702520CPNE134221155rs60605241.40E-055.28E-2440.107.62E-071.00E+00NEWb
ILMN_181293420DIDO161558775rs9108317.10E-102.65E-160.553.19E-084.00E-01SLCO4A1 / DIDO1
ILMN_237159022DDX1739021522rs57571871.30E-121.88E-28-0.499.01E-111.00E-01DMC1/ DDX17
ILMN_166853522JOSD139065172rs37885453.60E-123.06E-120.781.46E-077.48E-01DMC1 / DDX17

topSNP was the most significant SNP in the cis-region of the probe. topSNP_bp was the position of the most significant SNP. p_GWAS was p-value from GWAS. p_eQTL was p-value from the eQTL study. b_smr was effect size from SMR test. p_smr was p-value from SMR test. p_HEIDI was p-value from HEIDI test.

a: These genes were annotated by previous ANM GWAS.

b: These genes were considered as novel genes. No SNP in the cis-eQTL region of the probes was identified to be significantly associated with ANM according to the primary GWAS.

Fig 1

The SMR plots of novel genes associated with AAM.

(A) The SMR result at ATP1B3 locus. (B) The SMR result at NAAA locus. (C) The SMR result at GRTP1 locus. In the top plot, black dots represent the p values for the SNPs from the latest GWAS meta-analysis for AAM (Y-axis), diamonds represent the p values for probes from the SMR test. In the bottom plot, the eQTL p values of the SNPs were from the eQTL study (Y-axis).

Fig 2

The SMR plots of novel genes associated with ANM.

(A) The SMR result at AK125462 locus. (B) The SMR result at MSL2 locus. (C) The SMR result at CLSTN3 locus. (D) The SMR result at TRAPPC2L locus. (E) The SMR result at DDX5 locus. (F) The SMR result at CPNE1 locus. In the top plot, black dots represent the p values for the SNPs from the latest GWAS meta-analysis for AAM (Y-axis), diamonds represent the p values for probes from the SMR test. In the bottom plot, the eQTL p values of the SNPs were from the eQTL study (Y-axis).

The SMR plots of novel genes associated with AAM.

(A) The SMR result at ATP1B3 locus. (B) The SMR result at NAAA locus. (C) The SMR result at GRTP1 locus. In the top plot, black dots represent the p values for the SNPs from the latest GWAS meta-analysis for AAM (Y-axis), diamonds represent the p values for probes from the SMR test. In the bottom plot, the eQTL p values of the SNPs were from the eQTL study (Y-axis).

The SMR plots of novel genes associated with ANM.

(A) The SMR result at AK125462 locus. (B) The SMR result at MSL2 locus. (C) The SMR result at CLSTN3 locus. (D) The SMR result at TRAPPC2L locus. (E) The SMR result at DDX5 locus. (F) The SMR result at CPNE1 locus. In the top plot, black dots represent the p values for the SNPs from the latest GWAS meta-analysis for AAM (Y-axis), diamonds represent the p values for probes from the SMR test. In the bottom plot, the eQTL p values of the SNPs were from the eQTL study (Y-axis). topSNP was the most significant SNP in the cis-region of the probe. topSNP_bp was the position of the most significant SNP. p_GWAS was p-value from GWAS. p_eQTL was p-value from eQTL study. b_smr was effect size from SMR test. p_smr was p-value from SMR test. p_HEIDI was p-value from HEIDI test. a: These genes were annotated by previous AAM GWAS. b: These genes were considered as novel genes. No SNP in the cis-eQTL region of the probes was identified to be significantly associated with AAM according to the primary GWAS. c: Genes were identified by previous SMR analysis in the same locus. NA meant that no gene was identified in this locus. topSNP was the most significant SNP in the cis-region of the probe. topSNP_bp was the position of the most significant SNP. p_GWAS was p-value from GWAS. p_eQTL was p-value from the eQTL study. b_smr was effect size from SMR test. p_smr was p-value from SMR test. p_HEIDI was p-value from HEIDI test. a: These genes were annotated by previous ANM GWAS. b: These genes were considered as novel genes. No SNP in the cis-eQTL region of the probes was identified to be significantly associated with ANM according to the primary GWAS. To identify more pleiotropic SNPs and genes associated with both phenotypes, we conducted a colocalization analysis. One region was identified to contain a variant influencing both phenotypes with the posterior probability of 0.92 (Table 3). Thirteen regions were considered to influence the two phenotypes through different variants (Table 3). rs3136249, with the largest posterior probability (0.37), was considered to be the causal SNP influencing both phenotypes.
Table 3

Colocalization analysis results of AAM and ANM.

chunkchrstspPPA_3PPA_4
162chr247318990482125620.920.064
1419chr1588370262904736902.5E-171.00
1579chr1961496720980151.1E-081.00
360chr31354582941373700764.1E-061.00
653chr628918936297378463.3E-051.00
799chr792500845939660365.4E-051.00
1637chr2032819871349602016.7E-050.99
656chr631572333326824293.5E-030.97
655chr630798697315684696.1E-040.94
1682chr2219913726223556404.0E-040.94
128chr12415826682420707312.5E-040.93
1644chr2042680811448381121.9E-030.93
1693chr2237570784393066303.0E-040.92
1213chr1255665948575435723.7E-040.91

Chunk was the internal numerical identifier for the segment. chr: chromosome. st: star position. sp: end position. PPA_3 was the posterior probability of model 3. PPA_4 was the posterior probability of model 4.

Chunk was the internal numerical identifier for the segment. chr: chromosome. st: star position. sp: end position. PPA_3 was the posterior probability of model 3. PPA_4 was the posterior probability of model 4.

Discussion

In this study, we identified 31 genes whose expressions were associated with AAM and 24 genes whose expressions were associated with ANM due to pleiotropy or causality. In total, we identified 9 new genes where there was no significant SNP in the cis-eQTL region of the gene probe. Many of these genes participated in DNA repair, immune response, and breast cancer process [2, 5]. C17orf46 was identified to be associated with both phenotypes by integrating GWAS and eQTLs data. We also found one region with a pleiotropic effect influencing two phenotypes through the colocalization analysis. Although the previous study performed the SMR analysis with Westra eQTL data which had a low coverage of human genes (5,967) compared to CAGE eQTL data (8,144). Thus the previous study may omit many potential genes. Eleven genes of the 31 genes associated with AAM were successfully identified by using CAGE eQTL data (Table 1). SMR demonstrated that it was useful to prioritize genes associated with AAM or ANM. SMR tests reduced the multiple hypothesis burdens by testing tens of thousands of genes instead of millions of SNPs [13]. It suggested that SMR was useful in identifying novel genes associated with AAM or ANM. In this study, we identified 3 novel genes associated with AAM and 6 novel genes associated with ANM. One novel gene DDX5, which is also known as p68, was identified to be associated with ANM. DDX5 is a prototypic member of the DEAD box family of RNA helicases that encompasses multiple functions. DDX5 was highly expressed in a high proportion of breast cancers. Patients with a detectable level of both DDX5 and polo-like kinase-1 (pLK1) often had a poor prognosis [14]. In the significant loci, we redefined the functional genes which were more likely to play important roles in the process of menarche or natural menopause. We presented a list of genes to be followed up in future functional validation experiments. For example, HSD17B12 coding a 17beta-hydroxysteroid dehydrogenase transforms estrone (E1) into estradiol (E2) [15]. E2 is involved in the regulation of the estrous and menstrual female reproductive cycles. However, the previous study annotated the significant SNP to MIR129-2 (Table 1). Fatty acid amide hydrolase (FAAH), the enzyme that breaks down the endocannabinoid anandamide and controls its levels, is regulated by estrogen [16]. The previous study annotated the significant SNP in this locus to RAD54L (Table 2), however, we found that FAAH was more likely to be the causal gene in this locus. Another example is SRPK1, encoding the splicing factor kinase SRSF protein kinase 1, which was highly expressed in basal breast cancer cells [17]. The knockdown of SRPK1 significantly suppressed metastasis of breast cancer cells [18]. Despite the common belief that multiple genes are responsible for controlling the timing of menarche and natural menopause, very few genes have been identified that contain common genetic variants associated with AAM and ANM. In this study, we identified two genes (MSH6 and C11orf46) associated with both traits. rs3136249 is located in the intronic region of MSH6. MSH6, which is a mismatch repair gene, was found to be associated with ANM by the previous study [19]. The colorization analysis showed C11orf46 locus may be associated with both traits with the posterior probability of 0.79. This may be caused by the relatively large posterior probability of model 4 (0.21). However, the function of C11orf46 is unknown, further studies are needed to prove this result. The present study may have some limitations that should be considered. Although we redefined the functional genes in the significant loci, these genes may be associated with age at natural menopause due to pleiotropy which meant that some of these genes may be not the causal genes. Due to the limitation of the method, we did not distinguish those pleiotropic genes from causal genes. So, further works are warranted to confirm the functional genes and explore the underlying mechanism. In conclusion, we highlighted the putative functional genes in the significant loci for AAM and ANM. Our study prioritizes the associated genes for further functional mechanistic study of AAM and ANM and illustrates the benefit of integrating different omics data into the study of complex traits. Our study may help to understand the ovarian function and benefit for women’s reproductive health.
  18 in total

1.  Genetics of reproductive lifespan.

Authors:  Patricia Hartge
Journal:  Nat Genet       Date:  2009-06       Impact factor: 38.330

2.  LD Score regression distinguishes confounding from polygenicity in genome-wide association studies.

Authors:  Brendan K Bulik-Sullivan; Po-Ru Loh; Hilary K Finucane; Stephan Ripke; Jian Yang; Nick Patterson; Mark J Daly; Alkes L Price; Benjamin M Neale
Journal:  Nat Genet       Date:  2015-02-02       Impact factor: 38.330

Review 3.  Dissecting the genetics of complex traits using summary association statistics.

Authors:  Bogdan Pasaniuc; Alkes L Price
Journal:  Nat Rev Genet       Date:  2016-11-14       Impact factor: 53.242

4.  A serine kinase regulates intracellular localization of splicing factors in the cell cycle.

Authors:  J F Gui; W S Lane; X D Fu
Journal:  Nature       Date:  1994-06-23       Impact factor: 49.962

5.  Tumor cell migration screen identifies SRPK1 as breast cancer metastasis determinant.

Authors:  Wies van Roosmalen; Sylvia E Le Dévédec; Ofra Golani; Marcel Smid; Irina Pulyakhina; Annemieke M Timmermans; Maxime P Look; Di Zi; Chantal Pont; Marjo de Graauw; Suha Naffar-Abu-Amara; Catherine Kirsanova; Gabriella Rustici; Peter A C 't Hoen; John W M Martens; John A Foekens; Benjamin Geiger; Bob van de Water
Journal:  J Clin Invest       Date:  2015-03-16       Impact factor: 14.808

6.  DNA mismatch repair gene MSH6 implicated in determining age at natural menopause.

Authors:  John R B Perry; Yi-Hsiang Hsu; Daniel I Chasman; Andrew D Johnson; Cathy Elks; Eva Albrecht; Irene L Andrulis; Jonathan Beesley; Gerald S Berenson; Sven Bergmann; Stig E Bojesen; Manjeet K Bolla; Judith Brown; Julie E Buring; Harry Campbell; Jenny Chang-Claude; Georgia Chenevix-Trench; Tanguy Corre; Fergus J Couch; Angela Cox; Kamila Czene; Adamo Pio D'adamo; Gail Davies; Ian J Deary; Joe Dennis; Douglas F Easton; Ellen G Engelhardt; Johan G Eriksson; Tõnu Esko; Peter A Fasching; Jonine D Figueroa; Henrik Flyger; Abigail Fraser; Montse Garcia-Closas; Paolo Gasparini; Christian Gieger; Graham Giles; Pascal Guenel; Sara Hägg; Per Hall; Caroline Hayward; John Hopper; Erik Ingelsson; Sharon L R Kardia; Katherine Kasiman; Julia A Knight; Jari Lahti; Debbie A Lawlor; Patrik K E Magnusson; Sara Margolin; Julie A Marsh; Andres Metspalu; Janet E Olson; Craig E Pennell; Ozren Polasek; Iffat Rahman; Paul M Ridker; Antonietta Robino; Igor Rudan; Anja Rudolph; Andres Salumets; Marjanka K Schmidt; Minouk J Schoemaker; Erin N Smith; Jennifer A Smith; Melissa Southey; Doris Stöckl; Anthony J Swerdlow; Deborah J Thompson; Therese Truong; Sheila Ulivi; Melanie Waldenberger; Qin Wang; Sarah Wild; James F Wilson; Alan F Wright; Lina Zgaga; Ken K Ong; Joanne M Murabito; David Karasik; Anna Murray
Journal:  Hum Mol Genet       Date:  2013-12-19       Impact factor: 6.150

7.  Meta-analyses identify 13 loci associated with age at menopause and highlight DNA repair and immune pathways.

Authors:  Lisette Stolk; John R B Perry; Daniel I Chasman; Chunyan He; Massimo Mangino; Patrick Sulem; Maja Barbalic; Linda Broer; Enda M Byrne; Florian Ernst; Tõnu Esko; Nora Franceschini; Daniel F Gudbjartsson; Jouke-Jan Hottenga; Peter Kraft; Patrick F McArdle; Eleonora Porcu; So-Youn Shin; Albert V Smith; Sophie van Wingerden; Guangju Zhai; Wei V Zhuang; Eva Albrecht; Behrooz Z Alizadeh; Thor Aspelund; Stefania Bandinelli; Lovorka Barac Lauc; Jacques S Beckmann; Mladen Boban; Eric Boerwinkle; Frank J Broekmans; Andrea Burri; Harry Campbell; Stephen J Chanock; Constance Chen; Marilyn C Cornelis; Tanguy Corre; Andrea D Coviello; Pio d'Adamo; Gail Davies; Ulf de Faire; Eco J C de Geus; Ian J Deary; George V Z Dedoussis; Panagiotis Deloukas; Shah Ebrahim; Gudny Eiriksdottir; Valur Emilsson; Johan G Eriksson; Bart C J M Fauser; Liana Ferreli; Luigi Ferrucci; Krista Fischer; Aaron R Folsom; Melissa E Garcia; Paolo Gasparini; Christian Gieger; Nicole Glazer; Diederick E Grobbee; Per Hall; Toomas Haller; Susan E Hankinson; Merli Hass; Caroline Hayward; Andrew C Heath; Albert Hofman; Erik Ingelsson; A Cecile J W Janssens; Andrew D Johnson; David Karasik; Sharon L R Kardia; Jules Keyzer; Douglas P Kiel; Ivana Kolcic; Zoltán Kutalik; Jari Lahti; Sandra Lai; Triin Laisk; Joop S E Laven; Debbie A Lawlor; Jianjun Liu; Lorna M Lopez; Yvonne V Louwers; Patrik K E Magnusson; Mara Marongiu; Nicholas G Martin; Irena Martinovic Klaric; Corrado Masciullo; Barbara McKnight; Sarah E Medland; David Melzer; Vincent Mooser; Pau Navarro; Anne B Newman; Dale R Nyholt; N Charlotte Onland-Moret; Aarno Palotie; Guillaume Paré; Alex N Parker; Nancy L Pedersen; Petra H M Peeters; Giorgio Pistis; Andrew S Plump; Ozren Polasek; Victor J M Pop; Bruce M Psaty; Katri Räikkönen; Emil Rehnberg; Jerome I Rotter; Igor Rudan; Cinzia Sala; Andres Salumets; Angelo Scuteri; Andrew Singleton; Jennifer A Smith; Harold Snieder; Nicole Soranzo; Simon N Stacey; John M Starr; Maria G Stathopoulou; Kathleen Stirrups; Ronald P Stolk; Unnur Styrkarsdottir; Yan V Sun; Albert Tenesa; Barbara Thorand; Daniela Toniolo; Laufey Tryggvadottir; Kim Tsui; Sheila Ulivi; Rob M van Dam; Yvonne T van der Schouw; Carla H van Gils; Peter van Nierop; Jacqueline M Vink; Peter M Visscher; Marlies Voorhuis; Gérard Waeber; Henri Wallaschofski; H Erich Wichmann; Elisabeth Widen; Colette J M Wijnands-van Gent; Gonneke Willemsen; James F Wilson; Bruce H R Wolffenbuttel; Alan F Wright; Laura M Yerges-Armstrong; Tatijana Zemunik; Lina Zgaga; M Carola Zillikens; Marek Zygmunt; Alice M Arnold; Dorret I Boomsma; Julie E Buring; Laura Crisponi; Ellen W Demerath; Vilmundur Gudnason; Tamara B Harris; Frank B Hu; David J Hunter; Lenore J Launer; Andres Metspalu; Grant W Montgomery; Ben A Oostra; Paul M Ridker; Serena Sanna; David Schlessinger; Tim D Spector; Kari Stefansson; Elizabeth A Streeten; Unnur Thorsteinsdottir; Manuela Uda; André G Uitterlinden; Cornelia M van Duijn; Henry Völzke; Anna Murray; Joanne M Murabito; Jenny A Visser; Kathryn L Lunetta
Journal:  Nat Genet       Date:  2012-01-22       Impact factor: 38.330

8.  Genomic analyses identify hundreds of variants associated with age at menarche and support a role for puberty timing in cancer risk.

Authors:  Felix R Day; Deborah J Thompson; Hannes Helgason; Daniel I Chasman; Hilary Finucane; Patrick Sulem; Katherine S Ruth; Sean Whalen; Abhishek K Sarkar; Eva Albrecht; Elisabeth Altmaier; Marzyeh Amini; Caterina M Barbieri; Thibaud Boutin; Archie Campbell; Ellen Demerath; Ayush Giri; Chunyan He; Jouke J Hottenga; Robert Karlsson; Ivana Kolcic; Po-Ru Loh; Kathryn L Lunetta; Massimo Mangino; Brumat Marco; George McMahon; Sarah E Medland; Ilja M Nolte; Raymond Noordam; Teresa Nutile; Lavinia Paternoster; Natalia Perjakova; Eleonora Porcu; Lynda M Rose; Katharina E Schraut; Ayellet V Segrè; Albert V Smith; Lisette Stolk; Alexander Teumer; Irene L Andrulis; Stefania Bandinelli; Matthias W Beckmann; Javier Benitez; Sven Bergmann; Murielle Bochud; Eric Boerwinkle; Stig E Bojesen; Manjeet K Bolla; Judith S Brand; Hiltrud Brauch; Hermann Brenner; Linda Broer; Thomas Brüning; Julie E Buring; Harry Campbell; Eulalia Catamo; Stephen Chanock; Georgia Chenevix-Trench; Tanguy Corre; Fergus J Couch; Diana L Cousminer; Angela Cox; Laura Crisponi; Kamila Czene; George Davey Smith; Eco J C N de Geus; Renée de Mutsert; Immaculata De Vivo; Joe Dennis; Peter Devilee; Isabel Dos-Santos-Silva; Alison M Dunning; Johan G Eriksson; Peter A Fasching; Lindsay Fernández-Rhodes; Luigi Ferrucci; Dieter Flesch-Janys; Lude Franke; Marike Gabrielson; Ilaria Gandin; Graham G Giles; Harald Grallert; Daniel F Gudbjartsson; Pascal Guénel; Per Hall; Emily Hallberg; Ute Hamann; Tamara B Harris; Catharina A Hartman; Gerardo Heiss; Maartje J Hooning; John L Hopper; Frank Hu; David J Hunter; M Arfan Ikram; Hae Kyung Im; Marjo-Riitta Järvelin; Peter K Joshi; David Karasik; Manolis Kellis; Zoltan Kutalik; Genevieve LaChance; Diether Lambrechts; Claudia Langenberg; Lenore J Launer; Joop S E Laven; Stefania Lenarduzzi; Jingmei Li; Penelope A Lind; Sara Lindstrom; YongMei Liu; Jian'an Luan; Reedik Mägi; Arto Mannermaa; Hamdi Mbarek; Mark I McCarthy; Christa Meisinger; Thomas Meitinger; Cristina Menni; Andres Metspalu; Kyriaki Michailidou; Lili Milani; Roger L Milne; Grant W Montgomery; Anna M Mulligan; Mike A Nalls; Pau Navarro; Heli Nevanlinna; Dale R Nyholt; Albertine J Oldehinkel; Tracy A O'Mara; Sandosh Padmanabhan; Aarno Palotie; Nancy Pedersen; Annette Peters; Julian Peto; Paul D P Pharoah; Anneli Pouta; Paolo Radice; Iffat Rahman; Susan M Ring; Antonietta Robino; Frits R Rosendaal; Igor Rudan; Rico Rueedi; Daniela Ruggiero; Cinzia F Sala; Marjanka K Schmidt; Robert A Scott; Mitul Shah; Rossella Sorice; Melissa C Southey; Ulla Sovio; Meir Stampfer; Maristella Steri; Konstantin Strauch; Toshiko Tanaka; Emmi Tikkanen; Nicholas J Timpson; Michela Traglia; Thérèse Truong; Jonathan P Tyrer; André G Uitterlinden; Digna R Velez Edwards; Veronique Vitart; Uwe Völker; Peter Vollenweider; Qin Wang; Elisabeth Widen; Ko Willems van Dijk; Gonneke Willemsen; Robert Winqvist; Bruce H R Wolffenbuttel; Jing Hua Zhao; Magdalena Zoledziewska; Marek Zygmunt; Behrooz Z Alizadeh; Dorret I Boomsma; Marina Ciullo; Francesco Cucca; Tõnu Esko; Nora Franceschini; Christian Gieger; Vilmundur Gudnason; Caroline Hayward; Peter Kraft; Debbie A Lawlor; Patrik K E Magnusson; Nicholas G Martin; Dennis O Mook-Kanamori; Ellen A Nohr; Ozren Polasek; David Porteous; Alkes L Price; Paul M Ridker; Harold Snieder; Tim D Spector; Doris Stöckl; Daniela Toniolo; Sheila Ulivi; Jenny A Visser; Henry Völzke; Nicholas J Wareham; James F Wilson; Amanda B Spurdle; Unnur Thorsteindottir; Katherine S Pollard; Douglas F Easton; Joyce Y Tung; Jenny Chang-Claude; David Hinds; Anna Murray; Joanne M Murabito; Kari Stefansson; Ken K Ong; John R B Perry
Journal:  Nat Genet       Date:  2017-04-24       Impact factor: 38.330

9.  Systematic identification of trans eQTLs as putative drivers of known disease associations.

Authors:  Harm-Jan Westra; Marjolein J Peters; Tõnu Esko; Hanieh Yaghootkar; Claudia Schurmann; Johannes Kettunen; Mark W Christiansen; Bruce M Psaty; Samuli Ripatti; Alexander Teumer; Timothy M Frayling; Andres Metspalu; Joyce B J van Meurs; Lude Franke; Benjamin P Fairfax; Katharina Schramm; Joseph E Powell; Alexandra Zhernakova; Daria V Zhernakova; Jan H Veldink; Leonard H Van den Berg; Juha Karjalainen; Sebo Withoff; André G Uitterlinden; Albert Hofman; Fernando Rivadeneira; Peter A C 't Hoen; Eva Reinmaa; Krista Fischer; Mari Nelis; Lili Milani; David Melzer; Luigi Ferrucci; Andrew B Singleton; Dena G Hernandez; Michael A Nalls; Georg Homuth; Matthias Nauck; Dörte Radke; Uwe Völker; Markus Perola; Veikko Salomaa; Jennifer Brody; Astrid Suchy-Dicey; Sina A Gharib; Daniel A Enquobahrie; Thomas Lumley; Grant W Montgomery; Seiko Makino; Holger Prokisch; Christian Herder; Michael Roden; Harald Grallert; Thomas Meitinger; Konstantin Strauch; Yang Li; Ritsert C Jansen; Peter M Visscher; Julian C Knight
Journal:  Nat Genet       Date:  2013-09-08       Impact factor: 38.330

10.  Detection and interpretation of shared genetic influences on 42 human traits.

Authors:  Joseph K Pickrell; Tomaz Berisa; Jimmy Z Liu; Laure Ségurel; Joyce Y Tung; David A Hinds
Journal:  Nat Genet       Date:  2016-05-16       Impact factor: 38.330

View more
  2 in total

1.  Pubertal Growth, IGF-1, and Windows of Susceptibility: Puberty and Future Breast Cancer Risk.

Authors:  Frank M Biro; Bin Huang; Halley Wasserman; Catherine M Gordon; Susan M Pinney
Journal:  J Adolesc Health       Date:  2020-09-01       Impact factor: 5.012

Review 2.  Genetic Regulation of Physiological Reproductive Lifespan and Female Fertility.

Authors:  Isabelle M McGrath; Sally Mortlock; Grant W Montgomery
Journal:  Int J Mol Sci       Date:  2021-03-04       Impact factor: 5.923

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.