Literature DB >> 31170924

Identification of female-specific genetic variants for metabolic syndrome and its component traits to improve the prediction of metabolic syndrome in females.

Sokanha Kong1, Yoon Shin Cho2.   

Abstract

BACKGROUND: Metabolic syndrome (MetS), defined as a cluster of metabolic risk factors including dyslipidemia, insulin-resistance, and elevated blood pressure, has been known as partly heritable. MetS effects the lives of many people worldwide, yet females have been reported to be more vulnerable to this cluster of risks.
METHODS: To elucidate genetic variants underlying MetS specifically in females, we performed a genome-wide association study (GWAS) for MetS as well as its component traits in a total of 9932 Korean female subjects (including 2276 MetS cases and 1692 controls). To facilitate the prediction of MetS in females, we calculated a genetic risk score (GRS) combining 14 SNPs detected in our GWA analyses specific for MetS.
RESULTS: GWA analyses identified 14 moderate signals (Pmeta < 5X10- 5) specific to females for MetS. In addition, two genome-wide significant female-specific associations (Pmeta < 5X10- 8) were detected for rs455489 in DSCAM for fasting plasma glucose (FPG) and for rs7115583 in SIK3 for high-density lipoprotein cholesterol (HDLC). Logistic regression analyses (adjusted for area and age) between the GRS and MetS in females indicated that the GRS was associated with increased prevalence of MetS in females (P = 5.28 × 10- 14), but not in males (P = 3.27 × 10- 1). Furthermore, in the MetS prediction models using GRS, the area under the curve (AUC) of the receiver operating characteristics (ROC) curve was higher in females (AUC = 0.85) than in males (AUC = 0.57).
CONCLUSION: This study highlights new female-specific genetic variants associated with MetS and its component traits and suggests that the GRS of MetS variants is a likely useful predictor of MetS in females.

Entities:  

Keywords:  Genetic risk score (GRS); Genome-wide association study (GWAS); Metabolic syndrome (MetS); Sex specific genetic variants; Single nucleotide polymorphism (SNP)

Mesh:

Year:  2019        PMID: 31170924      PMCID: PMC6555714          DOI: 10.1186/s12881-019-0830-y

Source DB:  PubMed          Journal:  BMC Med Genet        ISSN: 1471-2350            Impact factor:   2.103


Background

Metabolic syndrome (MetS), defined as a cluster of metabolic risk factors including dyslipidemia, insulin-resistance, and elevated blood pressure, is known to increase the prevalence of other non-communicable diseases such as type 2 diabetes (T2D) and cardiovascular diseases (CVD) [1]. In addition, recent studies reported MetS to be associated with various types of cancer including pancreatic, liver, breast, and colon cancer [2]. The prevalence of MetS is approximately 20 to 30% of the general world population and it is associated with increased morbidity and mortality rates worldwide [2, 3]. Prevalence of MetS is known to increase with age in females; furthermore, females are more prone to MetS due to various cultural factors including stress and low socioeconomic status [3]. The prevalence of cardiovascular diseases that are associated with MetS was shown to be higher than in males, as determined via various meta-analysis studies [4]. A recent systematic review reported that the prevalence of MetS in most countries in Asia-pacific region was higher in females, supported by the data from South Korea in 2007 which described that the prevalence of MetS was higher in females (32.9%) than in males (29%) [5]. Therefore, it is important to thoroughly understand risk factors underlying pathophysiological mechanisms of MetS and the nature of the interaction between gender and the prevalence of this disease. Until recently, MetS was thought to be a risk factor independent of sex, despite significant research interest in the scientific community; association with various diseases and treatment options have been evaluated irrespective of sex [3]. Since MetS is influenced by not only environmental but also genetic factors, numerous genetic studies have been conducted to gain insight into the genetic basis of MetS and its component traits [6-8]. To the best of our knowledge, however, no female-specific genetic association studies of MetS and its component traits have been conducted in any of ethnic groups including East-Asian as well as European populations. In this study, we sought to identify female-specific genetic variants associated with MetS and its component traits in Korean females by utilizing GWA analysis. In addition, by combining 14 single nucleotide polymorphisms (SNPs) detected from the MetS association analysis, we specifically aimed to investigate whether a genetic risk score (GRS) composed of these SNPs might assist the prediction of MetS in females of South Korean origin.

Methods

Subjects

Subjects for the discovery stage were recruited from the KARE (Korea Association Resource) study cohort, a population-based cohort of 8842 participants. The KARE cohort consist of two population-based studies, the Ansung and Ansan cohorts which are located in Gyeonggi Province, close to Seoul, the capital of the Republic of Korea. Details of the participant’s recruitment criteria and the study design are provided elsewhere [9]. A total of 4659 females were present in the KARE study cohort, including 1211 MetS cases and 639 normal controls. Participants included in the studies used for the replication stage included unrelated Korean participants from the Rural1816, Rural3667, and HEXA (Health Examinee shared control study) cohorts. The Rural1816 cohort includes 1816 subjects, of which 957 are female; of these, 404 MetS cases and 21 normal controls were included in this study. The Rural1816 study combines subjects from the Wonju, Pyeong Chang, Gangneung, Geumsan, and Naju regional cohorts in Korea. The Rural3667 cohort consists of 3667 subjects of which 2265 females were included in our study; of these, 424 and 330 female subjects were MetS cases and normal controls, respectively. The Rural3667 study combines subjects from the Yangpyeong, Namwon, and Goryeong regional cohorts in Korea. The study design and cohort characteristics of both Rural cohorts have been described previously [9]. A total of 2051 female subjects including 237 MetS cases and 702 normal controls were selected from 3700 subjects of HEXA study that has been described elsewhere [7]. All subjects included for the discovery and replication stages in this study provided written informed consent approved by the local review board. Clinical characteristics of each study cohort are summarized in Table 1.
Table 1

Clinical characteristics female subjects in study cohorts. Numbers indicate average and standard deviation for each trait

KARERural1816Rural3667HEXA
aN (MetS/contol)4659 (1211/639)957 (404/21)2265 (424/330)2051 (237/702)
Age (years)52.6 (±9.0)60.8 (±6.4)58.8 (±10.0)51.6 (±7.7)
Waist circumference (cm)81.6 (±10.1)85.1 (±8.5)82.1 (±8.9)79.2 (±8.4)
WHR0.86 (±0.36)0.89 (±0.06)0.89 (±0.07)0.84 (±0.06)
BMI (kg/m2)24.9 (±3.26)25.2 (±3.4)24.0 (±3.1)23.7 (±3.0)
FPG (mg/dL)82.3 (±25.5)114.8 (±43.3)93.0 (±9.3)91.3 (±26.1)
HDLC (mg/dL)45.6 (±10.1)45.4 (±10.6)46.3 (±10.4)57.6 (±13.3)
LDLC (mg/dL)114.4 (±35.4)133.3 (±35.2)129.0 (±31.9)120.4 (±31.5)
TG (mg/dL)149.0 (±88.9)169.2 (±110.2)129.1 (±70.9)106.6 (±69.4)
TCHL (mg/dL)191.4 (±35.7)210.9 (±41.4)200.6 (±35.7)198.8 (±35.1)
SBP (mmHg)124.2 (±20.0)133.0 (±18.2)114.7 (±11.7)119.1 (±14.4)
DBP (mmHg)81.3 (±11.9)83.4 (±11.2)74.4 (±7.5)75.1 (±9.8)
ALT (IU/L)23.3 (±17.3)25.9 (±19.5)21.1 (±11.4)20.4 (±12.9)
AST (IU/L)27.2 (±13.6)28.2 (±21.2)25.0 (±8.6)22.6 (±9.2)
GTP (IU/L)19.3 (±24.0)27.7 (±50.6)17.9 (±16.9)21.0 (±17.6)
Albumin (g/dL)4.2 (±0.3)4.6 (±0.3)4.4 (±0.2)4.6 (±0.3)

aThese numbers indicate the number of subjects used for the association analysis

WHR waist-hip ration, BMI body mass index, FPG fasting plasma glucose, HDLC high density lipoprotein cholesterol, LDLC low density lipoprotein cholesterol, TG triglyceride, TCHL total cholesterol, SBP systolic blood presure, DBP diastolic blood presure, ALT aspartate transaminase, AST alanine transaminase, GTP Glutamyl Transpeptidase

Clinical characteristics female subjects in study cohorts. Numbers indicate average and standard deviation for each trait aThese numbers indicate the number of subjects used for the association analysis WHR waist-hip ration, BMI body mass index, FPG fasting plasma glucose, HDLC high density lipoprotein cholesterol, LDLC low density lipoprotein cholesterol, TG triglyceride, TCHL total cholesterol, SBP systolic blood presure, DBP diastolic blood presure, ALT aspartate transaminase, AST alanine transaminase, GTP Glutamyl Transpeptidase

Genotyping and quality control

For the KARE study subjects, genotyping was carried out using an Affymetrix Genome-Wide Human SNP array 5.0. Details on genotyping quality control (QC) for genotype data have been described previously [9]. For KARE genotype data, SNP imputation was performed to increase the coverage of common variants employing the IMPUTE program (http://mathgen.stats.ox.ac.uk/impute/impute_v2.html) with International HapMap data (phase 2, release 22) of 90 JPT and CHB individuals as the imputation reference panel. Imputed SNPs of poor imputation quality (imputation info score < 0.5, missing gene call rate > 1%, minor allele frequency (MAF) < 0.01, and Hardy-Weinberg equilibrium (HWE) test P-value < 1 × 10− 7) were excluded [7]. Genotyping of Rural3667 subjects in the replication stage was conducted using Illumina HumanOmni I-Quad vI array. Samples with a genotype missing call rate > 1% and heterozygosity > 30% were excluded from the sample pool. Markers with a missing SNP call rate > 5%, with MAF < 0.01, and with HWE test P-value < 1 х 10− 6 were eliminated. After this QC process, the remaining 3667 subjects and 747,297 SNPs were included for subsequent analyses. For the Rural1816 and HEXA study subjects, SNP genotyping was carried out using an Affymetrix Genome-Wide Human SNP array 6.0. Details on genotyping QC for genotype data have been described previously [7].

Phenotype measurements

In this study, we assessed MetS cases according to International Diabetes Foundation (IDF) criteria. IDF criteria of Korean MetS includes the waist circumference ≥ 85 cm in females (≥ 90 cm in males) plus any two of the following factors: (1) TG ≥ 150 mg/dL, (2) HDLC < 50 mg/dL in females (< 40 mg/dL in males), (3) SBP/DBP ≥ 130/85 mmHg, and (4) FPG ≥ 100 mg/dL [10, 11]. The MetS control group in this study consists of individuals with traits that do not fall into any of the MetS range (i.e., for females, waist circumference < 80 cm, TG < 150 mg/dL, HDLC ≥50 mg/dL, SBP/DBP < 130/85 mmHg, and FPG < 100 mg/dL). Individuals with missing phenotypic values necessary for MetS assessment were not included in the MetS case-control analysis (Table 1). The phenotypic data of six MetS component traits including waist circumference, fasting plasma glucose (FPG), systolic blood pressure (SBP), diastolic blood pressure (DBP), high density lipoprotein cholesterol (HDLC) and triglycerides (TG) were obtained from the KARE, Rural1816, Rural3667 and HEXA studies (Table 1) [7, 9].

Statistical analyses

The associations of genetic variants with MetS or six MetS component traits (waist circumference, FPG, SBP, DBP, HDLC, and TG) were carried out after adjustments for age and participants’ areas of recruitment. Logistic (for MetS) or linear (for MetS component traits) regression analyses with the above mentioned adjustments were performed via an additive model using PLINK version 1.07 software (http://zzz.bwh.harvard.edu/plink/) [12]. Because the KARE cohort for the discovery stage as well as both Rural cohorts for the replication stage combines subjects from several regional cohorts, participants’ areas of recruitment were adjusted for the relevant analyses. However, we did not apply principal components for the adjustment of association analyses in this study because previous studies demonstrated that the population stratification is negligible in KARE subjects used in our GWAS discovery stage [7, 9]. Genetic variants with an association P-value < 0.01 in the discovery stage were further analyzed in the replication stage, utilizing participants from the Rural1816, Rural3667 and HEXA studies. Meta-analyses were performed to combine discovery stage and replication stage analysis results. This meta-analysis was carried out based on an inverse-variance weighting method using the METAL Program (http://csg.sph.umich.edu/abecasis/metal/index.html) [13] (Fig. 1).
Fig. 1

Overall study work flow of two stage genome-wide association analysis for MetS and its component traits

Overall study work flow of two stage genome-wide association analysis for MetS and its component traits

Generation of the genetic risk score (GRS)

A genetic risk score (GRS) was generated by combining risk alleles of 14 SNPs identified by association analyses for MetS in this study. The risk score of each SNP was attributed based on the risk allele numbers (0, 1, or 2) presented. According to an additive genetic model, each SNP is assumed to be independently associated with risk [14]. Therefore, the GRS of individuals is calculated by summing the risk allele numbers of each of the 14 SNPs. In order to reduce bias and noise, only individuals with no missing genotypes of 14 SNPs were included in analyses employing GRS.

Construction of a MetS prediction model

The association between the GRS and MetS was tested using logistical regression analysis in females, males, and all subjects combining females and males in the KARE study. To predict the prevalence of MetS in subjects from the KARE study, the logistic regression model was constructed comprising not only GRS, but also two additional variables such as area and age. The constructed model was evaluated by receiver operating characteristics (ROC) analysis and area under the curve (AUC) calculations. Logistical regression analyses were performed using R software, version 3.3.2 (GNU General Public License). The AUC calculation and ROC plot visualization were executed using R’s ModelGood package. A P-value of < 0.05 was considered to be significant.

Results

Identification of genetic loci for MetS and its component traits

To discover female-specific genetic loci for MetS, we conducted two stage sex-stratified GWAS. In the stage 1, discovery stage, SNPs across the whole genome were tested for their association with MetS in 1211 cases and 639 controls of KARE female subjects. All SNPs showing association P-value < 0.01 (15,482 SNPs) from the stage 1 logistic analysis (adjusted for area and age) were taken forward to the stage 2 replication study (Additional file 5: Figure S1). A total of 1065 cases and 1053 controls were included for the replication analysis of female subjects from the Rural1816, Rural3667, and HEXA study cohorts (Fig. 1). Meta-analysis was conducted for SNPs that were validated for their association with MetS (P-value < 0.05) in the replication stage by combining association results from discovery and replication stages. Our meta-analysis identified 14 SNPs that fulfill our arbitrary cut-off P-value as evidence of moderate association (5 × 10− 8 < Pmeta < 5 × 10− 5) (Additional file 5: Figure S2). The association of these 14 female MetS signals was not detected in male subjects (Table 2 & Additional file 1: Table S1).
Table 2

Metabolic syndrome (MetS) associated loci in females. Discovery stage was GWAS for MetS in each sex-stratified group of KARE study. Overall association results (Pmeta) were obtained from meta-analyses combining Discover (KARE) and Replication (Rural1816, Rural3667 and HEXA) stages

CHRSNPBP (GRCh37)Candidate geneEAEAFFemaleMaleCombining female & male
DiscoveryReplicationOverall
KARERural1816Rural3667HEXA
ORPKAREORPRural1816ORPRural3667ORPHEXAORPmetaNORPmetaNORPmetaN
1rs2209363187,163,851 LINC01036 C0.180.675.32E-040.721.27E-020.692.11E-0524961.083.64E-0118700.884.24E-024366
2rs768072160,233,383 BAZ2B T0.250.727.66E-040.721.08E-020.722.41E-0525960.965.72E-0119220.822.34E-044518
2rs284544217,309,111 SMARCAL1 A0.211.511.65E-041.361.77E-021.441.06E-0525720.977.25E-0119071.151.01E-024479
2rs284541217,368,839 RPL37A T0.201.512.56E-041.342.90E-021.432.66E-0525310.923.28E-0118791.124.59E-024410
2rs2012243217,412,087 LOC101928180 A0.211.501.77E-041.303.98E-021.412.77E-0525990.933.59E-0119241.116.45E-024523
6rs1094764636,881,535 C6orf89 G0.020.332.66E-040.354.97E-020.343.46E-0531961.489.18E-0223530.781.28E-015549
8rs228311317,880,243 PCM1 A0.471.321.42E-031.311.15E-021.314.77E-0525241.073.10E-0118691.183.61E-044393
9rs169232495,592,145 PDCD1LG2-RIC1 A0.030.444.26E-040.392.89E-020.433.46E-0532141.029.29E-0123600.731.28E-025574
13rs956855851,810,953 FAM124A G0.261.334.77E-031.391.11E-021.241.15E-011.324.98E-0534360.998.60E-0124661.139.69E-035902
13rs951641695,103,694 DCT C0.130.682.96E-030.683.71E-010.552.16E-030.642.04E-0531661.009.64E-0123250.824.17E-035491
13rs6492111109,055,846 TNFSF13B-MYO16 C0.020.353.83E-030.969.66E-010.311.74E-030.353.47E-0531760.752.91E-0123250.547.51E-045501
16rs407261720,178,590 GPR139 A0.191.363.83E-031.414.70E-011.363.02E-021.542.91E-031.412.84E-0639540.976.36E-0128571.154.09E-036811
19rs810727437,285,393 ZNF790 C0.034.565.27E-042.261.34E-022.914.92E-0525790.854.67E-0119081.402.94E-024487
21rs282797624,600,783 LOC105372747 G0.190.695.47E-040.611.50E-010.723.74E-020.701.99E-0532050.976.87E-0123570.838.28E-045562

Information for the SNP ID and chromosomal position is based on NCBI genome build 37/hg19

The ‘-’ sign indicates data not available

CHR chromosome, BP Physical position (base-pair), EA effect allele, EAF effect allele frequency, OR Odds Ratio, N sample size of meta-analysis combining cases and controls

Metabolic syndrome (MetS) associated loci in females. Discovery stage was GWAS for MetS in each sex-stratified group of KARE study. Overall association results (Pmeta) were obtained from meta-analyses combining Discover (KARE) and Replication (Rural1816, Rural3667 and HEXA) stages Information for the SNP ID and chromosomal position is based on NCBI genome build 37/hg19 The ‘-’ sign indicates data not available CHR chromosome, BP Physical position (base-pair), EA effect allele, EAF effect allele frequency, OR Odds Ratio, N sample size of meta-analysis combining cases and controls In order to understand the molecular basis of MetS in females, we also performed sex-stratified GWAS for six MetS component traits including waist circumference, TG, HDLC, SBP, DBP, and FPG. For each trait, genome-wide scan data from a total of 4659 KARE female subjects were tested via linear regression analysis adjusted for area and age in the discovery stage. Next, 5273 female subjects from three studies including Rural1816, Rural 3667, and HEXA were included in the replication stage to validate the signals selected in the discovery stage (Pdiscovery < 0.01). SNPs validated in the replication stage (P-value < 0.05) were used in the subsequent meta-analysis. Our meta-analyses combining the discovery and replication stages detected two female specific genome-wide significant loci (Pmeta < 5 × 10− 8) (Fig. 2) as well as 33 female specific loci showing moderate evidence of association for a given trait (5 × 10− 8 < Pmeta < 10− 5) (Table 3, Additional file 2: Table S2 & Additional file 5: Figure S3).
Fig. 2

Regional association plots of newly discovered loci for FPG (a) and HDLC (b) in females. The SNP positions are shown at the top. The trend test -log10P values are shown for SNPs distributed in a 0.8-Mb genomic region centered on the most strongly associated signal, which is depicted as a purple diamond for the combined results from discovery and replication stages. The locations of known genes in the region are shown at the bottom. The genetic information was obtained from the Human Genome build hg 19; the LD structure was based on the 1000 Genomes ASN data

Table 3

Genetic association results for metabolic syndrome component traits in females. Overall association results were obtained from meta-analyses combining KARE, Rural1816, Rural3667, and HEXA studies

TraitCHRSNPBP (GRCh37)Candidate geneEAEAFOverall meta (female)Overall meta (male)Overall meta (male+female)Association with MetS
BETASE P meta P Het NBETASE P meta BETASE P meta P KARE-female
Genetic loci showing strong evidence of association for MetS component traits in females (Pmeta < 5X10−8)
 FPG21rs45548941,939,569 DSCAM C0.026.761.142.92E-090.5374420.511.427.18E-014.030.919.14E-067.07E-02
 HDLC11rs7115583116,784,376 SIK3 T0.151.320.243.58E-080.2069050.460.278.79E-020.940.182.30E-071.61E-01
Genetic loci showing moderate evidence of association for MetS component traits in females (5X10− 8 < Pmeta < 10−5)
 WC1rs6683379223,906,421 CAPN2 A0.48−0.650.148.05E-060.8866900.090.145.11E-01− 0.350.119.53E-043.44E-01
8rs92144934,967,179 LOC105379369 A0.320.580.139.29E-060.749819−0.080.135.31E-010.270.106.65E-036.88E-01
20rs602981840,409,163 CHD6 A0.240.630.148.47E-060.8597790.040.147.58E-010.300.114.43E-033.20E-01
 FPG2rs1017137777,437,279 LRRTM4 G0.31−1.130.258.19E-060.546556−0.160.326.28E-01− 0.680.206.98E-046.93E-01
3rs6801331124,918,754 SLC12A8 T0.111.590.368.45E-060.8797860.300.465.16E-011.040.292.97E-041.10E-02
8rs782149277,823,083 ZFHX4/PEX2 C0.028.021.562.77E-070.1372022.061.762.42E-015.331.197.33E-065.50E-01
10rs1278300165,091,110 JMJD1C C0.026.811.505.36E-060.5574811.371.694.16E-014.321.141.52E-048.35E-01
11rs571911126,481,440 KIRREL3 C0.42−1.760.371.94E-060.507497−0.280.435.13E-01−1.130.297.87E-058.41E-01
12rs1712561062,392,548 FAM19A2 A0.102.590.589.33E-060.957453−0.260.707.11E-011.250.466.23E-036.18E-01
 SBP5rs1047430689,069,875 LOC105379076 G0.141.700.389.37E-060.727531−0.290.394.60E-010.780.285.19E-031.29E-01
5rs31622496,414,272 LIX1-AS1 A0.291.300.299.72E-060.767660−0.170.305.61E-010.650.212.06E-033.90E-01
6rs3936285,109,119 LYRM4 T0.091.970.447.75E-060.8366570.110.488.19E-011.200.332.67E-044.14E-01
7rs441975479,337,748 LOC105375370 C0.05−2.860.587.30E-070.666700−0.530.623.91E-01−1.780.433.20E-051.94E-01
15rs1697323682,235,114 LOC102724001 G0.171.690.387.20E-060.6266680.230.405.68E-010.990.283.72E-046.21E-02
17rs388865864,473,272 PRKCA A0.31−1.300.297.17E-061.0076570.370.302.09E-01−0.510.211.45E-025.46E-01
 DBP12rs73155329,483,063 LOC105369649 C0.370.790.189.89E-060.817654−0.110.195.65E-010.400.132.17E-034.30E-01
14rs241584144,650,411 LINC02307 C0.051.820.392.21E-060.826647−0.090.438.36E-011.040.294.06E-044.17E-01
16rs136412083,740,126 CDH13 G0.32−0.770.176.30E-060.806904−0.020.209.10E-01−0.430.131.06E-032.21E-01
 HDLC11rs11216315117,080,640 PCSK7 G0.131.170.266.87E-060.7876490.390.281.64E-010.820.191.84E-056.30E-01
13rs488383969,720,694naT0.280.850.197.87E-060.2976330.140.204.92E-010.510.142.71E-041.53E-02
16rs649737319,460,842 TMC5 A0.16−1.040.238.45E-060.4676500.020.259.31E-01−0.610.174.15E-045.78E-01
18rs1108276647,132,464 LIPG T0.270.960.196.89E-070.2776490.240.212.45E-010.600.143.39E-056.26E-01
19rs650897441,733,145 AXL C0.370.870.181.09E-060.637652−0.060.197.39E-010.440.138.21E-043.74E-03
20rs604629519,727,038 SLC24A3/RIN2 A0.051.580.354.40E-060.119819−0.340.393.89E-010.660.261.12E-021.13E-01
 TG1rs376623547,049,747 MKNK1 C0.0317.813.956.57E-060.3068933.466.866.15E-0111.453.722.08E-037.11E-01
1rs1120800463,145,439 DOCK7 A0.17−8.101.829.13E-060.426730−5.562.986.19E-02−7.301.671.27E-057.06E-01
5rs109291310,467,702 ROPN1L G0.297.541.462.33E-070.3276581.692.264.54E-015.081.341.49E-042.09E-01
6rs313272229,867,174 LOC105375010 T0.266.901.547.61E-060.8166294.632.516.48E-026.171.421.37E-052.60E-01
9rs138115124,957,550naA0.129.142.058.24E-060.566876−0.753.358.24E-014.841.891.02E-028.20E-01
12rs71069870,369,918 MYRFL G0.306.851.494.34E-060.5266643.582.441.42E-015.621.374.27E-051.88E-01
14rs1712478052,425,506 GNG2 G0.29−6.731.245.52E-080.339920−1.092.025.87E-01−4.351.151.55E-046.47E-01
16rs17960413,814,473 SHISA9/ERCC4 T0.49−5.891.339.47E-060.1676550.762.057.12E-01−2.731.222.48E-021.58E-01
22rs106531433,258,288SYN3/TIMP3C0.0614.582.822.41E-070.4467380.624.558.92E-018.122.571.61E-033.48E-01

Information for the SNP ID and chromosomal position is based on NCBI genome build 37/hg19

CHR chromosome, BP Physical position (base-pair), EA effect allele, EAF effect allele frequency, SE standard error, PHet heterogeneity p-value, N sample size, WC waist circumference, FPG fasting plasma glucose, SBP systolic blood presure, DBP diastolic blood presure, HDLC high density lipoprotein cholesterol, TG triglyceride

Regional association plots of newly discovered loci for FPG (a) and HDLC (b) in females. The SNP positions are shown at the top. The trend test -log10P values are shown for SNPs distributed in a 0.8-Mb genomic region centered on the most strongly associated signal, which is depicted as a purple diamond for the combined results from discovery and replication stages. The locations of known genes in the region are shown at the bottom. The genetic information was obtained from the Human Genome build hg 19; the LD structure was based on the 1000 Genomes ASN data Genetic association results for metabolic syndrome component traits in females. Overall association results were obtained from meta-analyses combining KARE, Rural1816, Rural3667, and HEXA studies Information for the SNP ID and chromosomal position is based on NCBI genome build 37/hg19 CHR chromosome, BP Physical position (base-pair), EA effect allele, EAF effect allele frequency, SE standard error, PHet heterogeneity p-value, N sample size, WC waist circumference, FPG fasting plasma glucose, SBP systolic blood presure, DBP diastolic blood presure, HDLC high density lipoprotein cholesterol, TG triglyceride Of two loci reaching genome-wide significance, rs455489 located in an intron of the DSCAM gene, was significantly associated with FPG levels in females (P = 2.92 × 10− 9, β = 6.76 ± 1.14), but not in males (P = 0.72) (Fig. 2a). The other genome-wide significant locus, rs7115583, located in an intron of the SIK3 gene, was identified for its association with HDLC levels (P = 3.58 × 10− 8, β = 1.32 ± 0.24) (Fig. 2b); however, this SNP was shown to have no effect on HDLC levels in males in this study (P = 0.09) (Table 3). Of a total of 33 moderate signals, 3 were associated with waist circumference, 6 with FPG, 6 with SBP, 3 with DBP, 6 with HDLC, and 9 with TG (Table 3 & Additional file 2: Table S2). The association of these loci was only detected in females, not in males.

Generation of a GRS and a MetS prediction model

Since each genetic variant only confers a modest effect on MetS in females, there is a limitation to MetS prediction in females by applying single variants. To avoid this limitation, we calculated a genetic risk score (GRS) combining risk alleles of 14 SNPs moderately associated with MetS in females. Distribution of the GRS in 912 MetS cases and 473 controls derived from the KARE female subjects indicated that individuals with a high GRS tend to have higher susceptibility to MetS as compared to those with a low GRS (Fig. 3).
Fig. 3

Combined effect of risk alleles on MetS. On the x-axis, each genetic risk score (GRS) category was shown. The y-axis in the histogram indicates the relative number of female subjects in each GRS category. Grey and orange bars represent MetS cases and controls, respectively

Combined effect of risk alleles on MetS. On the x-axis, each genetic risk score (GRS) category was shown. The y-axis in the histogram indicates the relative number of female subjects in each GRS category. Grey and orange bars represent MetS cases and controls, respectively In a logistic regression model, GRS was found to be significantly associated with the prevalence of MetS in female subjects from the KARE study (P = 4.70 × 10− 11). Binary logistic regression adjusted by area and age further increased the association strength between GRS and female MetS (P = 5.28 × 10− 14). This prediction model for MetS comprising area, age, and GRS as predictor variables implies that a one unit increase in GRS increases the odds of developing MetS in females by a factor of 1.27. On the other hand, GRS was not associated with MetS in male subjects from the KARE study in this model (P = 0.33). Although the GRS calculated by this model showed significant association with MetS in all KARE subjects including males and females (P = 1.69 × 10− 5), the odds of developing MetS was almost negligible (OR = 1.09) (Table 4).
Table 4

Logistic regression models for MetS using female-specific GRS

Prediction model for MetSStudySample size (case/control)GroupLogistic PGRS-valueOR (95% CI)AUC
AREA + AGE + GRSdiscovery (KARE)1385 (912/473)female5.28E-141.27 (1.19–1.35)0.85
AREA + AGE1385 (912/473)femalenana0.83
GRS1385 (912/473)female4.70E-111.19 (1.13–1.25)0.60
AREA + AGE + GRS1076 (503/573)male3.27E-010.97 (0.92–1.03)0.57
AREA + AGE + GRS2461 (1415/1046)all1.69E-051.09 (1.05–1.13)0.72

AUC area under the curve, GRS genetic risk score, na not available

Logistic regression models for MetS using female-specific GRS AUC area under the curve, GRS genetic risk score, na not available To evaluate the MetS prediction model, we conducted receiver operating characteristics (ROC) curve analysis employing R’s ModelGood package. In female subjects from the KARE study, the area under the curve (AUC) measured from the ROC curve of a model comprising area, age, and the GRS as predictor variables showed higher values (AUCarea + age + GRS = 0.85) compared to those of other models adjusted with area and age (AUCarea + age = 0.83) or with only GRS (AUCGRS = 0.60) (Table 4 & Fig. 4a). The model adjusted with area, age, and GRS further effectively predicted MetS in females (AUCfemale = 0.85) but not in males (AUCmale = 0.57) or all subjects (AUCall = 0.72) (Table 4 & Fig. 4b). Taken together, the GRS generated from our association analysis for MetS was a likely predictor for MetS in all female but not in male-containing populations.
Fig. 4

Receiver operating characteristic (ROC) curves for models to predict MetS in the discovery stage (KARE study). (a) In KARE females, AUC measured from the ROC curve of a model comprising area, age, and female-specific GRS as predictor variables showed higher value (black) compared to those of other models adjusted with area and age (red) or with only female-GRS (green). (b) In a model comprising area, age, and female-GRS, AUC showed higher value in females (black) than in males (green) or all subjects (red)

Receiver operating characteristic (ROC) curves for models to predict MetS in the discovery stage (KARE study). (a) In KARE females, AUC measured from the ROC curve of a model comprising area, age, and female-specific GRS as predictor variables showed higher value (black) compared to those of other models adjusted with area and age (red) or with only female-GRS (green). (b) In a model comprising area, age, and female-GRS, AUC showed higher value in females (black) than in males (green) or all subjects (red)

Discussion

Several studies have been previously conducted to identify genetic loci influencing MetS or its component traits [15-19]. However, the total heritability of these traits is not yet fully understood [20]. To explain the missing heritability of these traits, more detailed investigations have been proposed, including analyses of less common variants, sequence-level data, epigenetic data, and environmental exposures [21, 22]. It has been suggested that sex-specific genetic architecture also influences many complex diseases [23, 24]. Thus, genetic studies should consider sex-specific effects in their design and interpretation in order not to fail to detect a significant proportion of the genes that contribute to risk for complex diseases. Although Zabaneh et al. [15] reported a genetic association study for MetS in Indian Asian men, most previous genetic studies for MetS or its component traits were analyzed without consideration of sex-specific effects. Looking for ways to treat MetS that are more effective for female patients, we aimed to elucidate the female-specific genetic basis for MetS and its component traits in this study. Genome-wide case-control logistic regression analyses in Korean female populations detected 14 genetic variants showing moderate evidence of association for MetS specifically in females, but not in males (Table 2 & Additional file 1: Table S1). To gain statistical power to detect genome-wide significant signals, the recruitment of a larger number of female subjects may be essential. Gene ontology analysis using the DAVID 6.8 functional annotation tool (https://david.ncifcrf.gov/home.jsp) indicated that the function of genes corresponding to the 14 loci associated with MetS were enriched for the positive regulation of T cell proliferation or T cell co-stimulation (Table 5). Further studies will be necessary to gain insight into the functional relevance of these genes to the development of MetS in females.
Table 5

Functional annotation results of 16 genes for MetS using DAVID Bioinformatics Resources 6.8

TraitNo of genesCategoryTermCountGenesP-value
MetS16GOTERM_BP_DIRECTGO:0042102~positive regulation of T cell proliferation2 TNFSF13B, PDCD1LG2 0.035
GOTERM_BP_DIRECTGO:0031295~T cell costimulation2 TNFSF13B, PDCD1LG2 0.046
GOTERM_CC_DIRECTGO:0005829~cytosol5 DCT, MYO16, RPL37A, PCM1, RIC1 0.091

GO Gene ontology, BP Biological process, CC Cellular component

Functional annotation results of 16 genes for MetS using DAVID Bioinformatics Resources 6.8 GO Gene ontology, BP Biological process, CC Cellular component From genome-wide association analysis for 6 MetS component traits in female populations, we were able to detect two signals reaching genome-wide significance. One genetic variant, rs455489 showing association with FPG (Pmeta-female = 2.92 × 10− 9, β = 6.76 ± 1.14), is located in an intron of the DSCAM (DS Cell Adhesion Molecule) gene (Table 3 & Fig. 2a). This result implies that 1 C allele increase in this SNP has an influence on about 6.76 mg/dL rise of FPG in female populations. It has been known that the coding product of the DSCAM gene is a member of the immunoglobulin superfamily of cell adhesion molecules (Ig-CAMs) and is involved in human central and peripheral nervous system development [25]. Although this gene is a candidate for Down syndrome and congenital heart disease [26], its involvement in FPG in females needs to be elucidated from further studies. Another SNP, rs7115583, showed genome-wide significant association with HDLC (Pmeta-female = 3.58 × 10− 8, β = 1.32 ± 0.24) in females (Table 3 & Fig. 2b). This result indicates that 1 T allele increase of this SNP results in a 0.24 mg/dL rise in HDLC in females. SNP rs7115583 is located in intron of SIK3 (SIK Family Kinase 3). The stronger associations with HDLC appeared in the APOA5 (rs2075291) and BUD13 (rs11216129) genes as compared to the SIK3 gene. These variants, however, showed strong associations with HDLC in male subjects as well (Pmeta-male = 4.60 × 10− 13 for rs2075291; Pmeta-male = 8.42 × 10− 8 for rs11216129). Genetic loci in or near APOA5 and BUD13 have been associated with HDLC levels in large-scale GWASs [7, 27]. About 164 kb from these loci (APOA5 and BUD13), our results suggest that the SIK3 locus is the only independent (r2 < 0.2) female-specific HDLC signal in this region. It has been reported that the expression of SIK3 is related to ovarian cancer development [28]. The molecular biological function of this gene has yet to be elucidated. In addition to the two genome-wide significant signals, 33 loci showed the moderate evidence of association (5 × 10− 8 < Pmeta < 10− 5) with a given MetS component trait in females. Among those, SNP rs1364120 was found to be moderately associated with DBP and is located in an intron of the CDH13 (Cadherin 13) gene, encoding a member of the cadherin superfamily. This CDH13 locus has been previously identified as a susceptibility locus influencing blood pressure by genome-wide study in two European populations [29]. It is known that the CDH13 encoded protein protects vascular endothelial cells from apoptosis due to oxidative stress and is associated with resistance to atherosclerosis [30]. In this study, the minor allele of rs1364120 has a protective effect on DBP (Table 3). The other SNP, rs11082766, showing moderate association with HDLC is located in an intron of the LIPG (Lipase G, Endothelial Type) gene. The protein encoded by LIPG may be involved in lipoprotein metabolism and vascular biology. It is known that this protein has phospholipase and triglyceride lipase activity by hydrolyzing high density lipoproteins (HDL) more efficiently than other lipoproteins [31]. Another SNP, rs3766235 located in an intron of the MKNK1 (MAP Kinase Interacting Serine/Threonine Kinase 1) gene, shows a moderate association with TG. The encoded protein of MKNK1, a Ser/Thr protein kinase, is known to be involved in the p38 MAPK signaling pathway as well as regulation of lipid metabolism and insulin signaling cascades, implying its possible role in TG metabolism [32]. The functional relevance of the other genes corresponding to remaining 30 loci moderately associated with traits of interest will be further investigated in future studies. Since six traits such as waist circumference, FPG, SBP, DBP, HDLC, and TG are used to define MetS; there is speculation that genetic regulation of these traits is closely related to the development of MetS. To gain insight into this hypothesis, we tested the association between a total of 35 loci (2 genome-wide significant signals and 33 moderately associated signals) for MetS component traits and MetS. Except for three SNPs (rs10171377 for FPG, rs4883839 for HDLC, and rs6508974 for HDLC), most signals did not show association with MetS (Table 3). These results indicate that simply combining genetic factors associated with MetS component traits could not predict the development of MetS. In this study, each allele of risk variants for MetS only confers a modest effect on the risk in females (Table 2). Thus, applying single variants has probably a limited ability for MetS prediction. It has been suggested that a genetic risk score (GRS) combining multiple loci might improve prediction of target disease [33]. In this regard, we specifically aimed to investigate whether the GRS could reliably predict the prevalence of MetS in females. To the best of our knowledge, our study is the first to test the prediction model for MetS using GRS combining single variants associated with MetS specifically in females of East-Asian populations. We constructed the GRS based on only subjects from the KARE study since some genotype data from the 14 MetS SNPs detected in this study were not available in other studies (including Rural1667, Rural3667, and HEXA). In order to reduce bias and noise, only female individuals with no missing genotypes of the 14 SNPs were included in the construction of the MetS prediction model employing GRS (912 cases and 473 controls). Our logistic model indicated that a GRS combining the 14 MetS SNPs is strongly associated with MetS prevalence in females; an increase in one unit of the GRS accounts for a 1.19 increase in the prevalence of MetS (P = 4.70 × 10− 11). When this model was adjusted for area (of subject recruitment) and age, the association strength and effect were further increased (P = 5.28 × 10− 14, OR = 1.27). The AUC of this model, evaluated using receiver operating characteristic (ROC) curve analysis, was 0.85 (AUCarea + age + GRS = 0.85) implying that this model comprising area, age, and GRS as predictor variables might be useful for the prediction of MetS in females (Table 4 & Fig. 4a). On the other hand, GRS was not associated with MetS in males (P = 0.33), but substantially associated with MetS in populations containing both male and female subjects (P = 1.69 × 10− 5) (Table 4 & Fig. 4b). To evaluate if the sex-specific GRS is stronger and more powerful to predict MetS in a group of same sex, we also performed a new meta-analysis for MetS in male participants and identified 10 SNPs showing suggestive evidence of association with MetS in males (5 × 10− 8 < Pmeta < 5 × 10− 5). The association of these 10 SNPs was validated in replication analyses (Additional file 3: Table S3). Using risk alleles for MetS detected in males, we calculated male-specific GRS. Our logistic analyses demonstrate that male-GRS was strongly associated with MetS in male participants (P = 2 × 10− 16, OR = 1.35), but just nominally associated in female participants (P = 0.013,, OR = 1.08) (Additional file 4: Table S4). To predict MetS in males and females using male-GRS, we carried out ROC analyses and generated AUCs. Our analyses indicate that ROC-AUCs were 0.66 and 0.54 in male and female participants, respectively (Additional file 4: Table S4 and Additional file 5: Figure S4). Taken together, these results strongly imply that sex-specific GRS is a highly effective indicator to predict the MetS development in a population having the same sex.

Conclusion

In this study, we were able to detect new female-specific genetic variants influencing MetS and its component traits in Korean populations. Our findings including genome-wide significant signals such as DSCAM and SIK3 loci for FPG and HDLC, respectively, as well as several signals moderately associated with traits of interest provide new insights into the underlying sex stratified causes of this cluster of risks. In addition, our study suggests that the GRS based on 14 SNPs detected in our GWA analyses for MetS is a useful predictor of MetS in female but not in male populations. Table S1. Metabolic syndrome associated loci in females. Association results of these loci were shown in the discovery stage (KARE), replication stage (Rural1816, Rural3667, and HEXA), and overall meta-analysis, respectively. (XLSX 13 kb) Table S2. Genetic loci associated with MetS component traits in females. Association results of these loci were shown in the discovery stage (KARE), replication stage (Rural1816, Rural3667, and HEXA), and overall meta-analysis, respectively. (XLSX 18 kb) Table S3. Metabolic syndrome associated loci in males. Association results of these loci were shown in the discovery stage (KARE), replication stage (Rural1816, Rural3667, and HEXA), and overall meta-analysis, respectively. (XLSX 15 kb) Table S4. Logistic regression model for MetS using male-specific GRS. (XLSX 10 kb) Figure S1. The Manhattan plot (A) and the QQ plot (B) of discovery stage GWAS for MetS in females. Both plots were generated using the qqman package in R. Figure S2. Regional association plots of newly discovered loci for MetS in females. Figure S3. Regional association plots of newly discovered loci for MetS component traits in females. Figure S4. Receiver operating characteristic (ROC) curves for models to predict MetS in male (A) and female (B) participants of the discovery stage (KARE study). Each AUC was measured from the ROC curve of a model comprising male-specific GRS in KARE males and females, respectively. (PPTX 4229 kb)
  32 in total

Review 1.  A comprehensive definition for metabolic syndrome.

Authors:  Paul L Huang
Journal:  Dis Model Mech       Date:  2009 May-Jun       Impact factor: 5.758

2.  Metabolic and cardiovascular traits: an abundance of recently identified common genetic variants.

Authors:  Karen L Mohlke; Michael Boehnke; Gonçalo R Abecasis
Journal:  Hum Mol Genet       Date:  2008-10-15       Impact factor: 6.150

Review 3.  The metabolic syndrome.

Authors:  Marc-Andre Cornier; Dana Dabelea; Teri L Hernandez; Rachel C Lindstrom; Amy J Steig; Nicole R Stob; Rachael E Van Pelt; Hong Wang; Robert H Eckel
Journal:  Endocr Rev       Date:  2008-10-29       Impact factor: 19.871

Review 4.  Metabolic syndrome: a closer look at the growing epidemic and its associated pathologies.

Authors:  S O'Neill; L O'Driscoll
Journal:  Obes Rev       Date:  2014-11-18       Impact factor: 9.213

Review 5.  Metabolic Syndrome: Sex-Related Cardiovascular Risk and Therapeutic Approach.

Authors:  Francesca Santilli; Damiano D'Ardes; Maria Teresa Guagnano; Giovanni Davi
Journal:  Curr Med Chem       Date:  2017       Impact factor: 4.530

6.  Mitogen-activated protein kinases activate the serine/threonine kinases Mnk1 and Mnk2.

Authors:  A J Waskiewicz; A Flynn; C G Proud; J A Cooper
Journal:  EMBO J       Date:  1997-04-15       Impact factor: 11.598

7.  DSCAM: a novel member of the immunoglobulin superfamily maps in a Down syndrome region and is involved in the development of the nervous system.

Authors:  K Yamakawa; Y K Huot; M A Haendelt; R Hubert; X N Chen; G E Lyons; J R Korenberg
Journal:  Hum Mol Genet       Date:  1998-02       Impact factor: 6.150

8.  A genome-wide association study of the metabolic syndrome in Indian Asian men.

Authors:  Delilah Zabaneh; David J Balding
Journal:  PLoS One       Date:  2010-08-04       Impact factor: 3.240

9.  Susceptibility loci for metabolic syndrome and metabolic components identified in Han Chinese: a multi-stage genome-wide association study.

Authors:  Yimin Zhu; Dandan Zhang; Dan Zhou; Zhenli Li; Zhiqiang Li; Le Fang; Min Yang; Zhongyan Shan; Hong Li; Jianhua Chen; Xianghai Zhou; Wei Ye; Senhai Yu; Huabin Li; Libin Cai; Chengguo Liu; Jie Zhang; Lixin Wang; Yaxin Lai; Liansheng Ruan; Zhanhang Sun; Shuai Zhang; Hao Wang; Yi Liu; Yuyang Xu; Jie Ling; Chunxiao Xu; Yan Zhang; Duo Lv; Zheping Yuan; Jing Zhang; Yingqi Zhang; Yongyong Shi; Maode Lai
Journal:  J Cell Mol Med       Date:  2017-03-30       Impact factor: 5.310

10.  Genome-wide scan identifies CDH13 as a novel susceptibility locus contributing to blood pressure determination in two European populations.

Authors:  Elin Org; Susana Eyheramendy; Peeter Juhanson; Christian Gieger; Peter Lichtner; Norman Klopp; Gudrun Veldre; Angela Döring; Margus Viigimaa; Siim Sõber; Kärt Tomberg; Gertrud Eckstein; Piret Kelgo; Tiina Rebane; Sue Shaw-Hawkins; Philip Howard; Abiodun Onipinla; Richard J Dobson; Stephen J Newhouse; Morris Brown; Anna Dominiczak; John Connell; Nilesh Samani; Martin Farrall; Mark J Caulfield; Patricia B Munroe; Thomas Illig; H-Erich Wichmann; Thomas Meitinger; Maris Laan
Journal:  Hum Mol Genet       Date:  2009-03-20       Impact factor: 6.150

View more
  7 in total

1.  Identification of Genetic Variants for Female Obesity and Evaluation of the Causal Role of Genetically Defined Obesity in Polycystic Ovarian Syndrome.

Authors:  Yeongseon Ahn; Hyejin Lee; Yoon Shin Cho
Journal:  Diabetes Metab Syndr Obes       Date:  2020-11-12       Impact factor: 3.168

Review 2.  "Big Data" Approaches for Prevention of the Metabolic Syndrome.

Authors:  Xinping Jiang; Zhang Yang; Shuai Wang; Shuanglin Deng
Journal:  Front Genet       Date:  2022-04-27       Impact factor: 4.772

3.  The Variant rs1784042 of the SIDT2 Gene is Associated with Metabolic Syndrome through Low HDL-c Levels in a Mexican Population.

Authors:  Guadalupe León-Reyes; Berenice Rivera-Paredez; Juan Carlos Fernandez López; Eric G Ramírez-Salazar; Arnoldo Aquino-Gálvez; Katia Gallegos-Carrillo; Edgar Denova-Gutiérrez; Jorge Salmerón; Rafael Velázquez-Cruz
Journal:  Genes (Basel)       Date:  2020-10-14       Impact factor: 4.096

4.  Ancestry specific associations of a genetic risk score, dietary patterns and metabolic syndrome: a longitudinal ARIC study.

Authors:  Dale S Hardy; Susan B Racette; Jane T Garvin; Hirut T Gebrekristos; Tesfaye B Mersha
Journal:  BMC Med Genomics       Date:  2021-05-01       Impact factor: 3.622

5.  A nine-hub-gene signature of metabolic syndrome identified using machine learning algorithms and integrated bioinformatics.

Authors:  Guanzhi Liu; Sen Luo; Yutian Lei; Jianhua Wu; Zhuo Huang; Kunzheng Wang; Pei Yang; Xin Huang
Journal:  Bioengineered       Date:  2021-12       Impact factor: 3.269

6.  Construction of Xinjiang metabolic syndrome risk prediction model based on interpretable models.

Authors:  Yan Zhang; Jaina Razbek; Deyang Li; Lei Yang; Liangliang Bao; Wenjun Xia; Hongkai Mao; Mayisha Daken; Xiaoxu Zhang; Mingqin Cao
Journal:  BMC Public Health       Date:  2022-02-08       Impact factor: 3.295

7.  Metabolic Syndrome Prediction Models Using Machine Learning and Sasang Constitution Type.

Authors:  Ji-Eun Park; Sujeong Mun; Siwoo Lee
Journal:  Evid Based Complement Alternat Med       Date:  2021-02-08       Impact factor: 2.629

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.