Literature DB >> 28674662

BMI prediction within a Korean population.

Jin Sol Lee^1,2, Hyun Sub Cheong³, Hyoung-Doo Shin^1,2,3.

Abstract

BACKGROUND: Body Mass Index (BMI) is widely regarded as an important clinical trait for obesity and other diseases such as Type 2 diabetes, coronary heart disease, and osteoarthritis.
METHODS: This study uses 6,011 samples of genotype data from ethnic Korean subjects. The data was retrieved from the Korea Association Resource. To identify the BMI-related markers within the Korean population, we collected genome-wide association study (GWAS) markers using a GWAS catalog and also obtained other markers from nearby regions. Of the total 6,011 samples, 5,410 subjects were used as part of a single nucleotide polymorphism (SNP) selection set in order to identify the overlapping BMI-associated SNPs within a 10-fold cross validation.
RESULTS: We selected nine SNPs (rs12566985 (FPGT-TNNI3K), rs6545809 (ADCY3), rs2943634 (located near LOC646736), rs734597 (located near TFAP2B), rs11030104 (BDNF), rs7988412 (GTF3A), rs2241423 (MAP2K5), rs7202116 (FTO), and rs6567160 (located near LOC105372152) to assist in BMI prediction. The calculated weighted genetic risk scores based on the selected 9 SNPs within the SNP selection set were applied to the final validation set consisting of 601 samples. Our results showed upward trends in the BMI values (P < 0.0001) within the 10-fold cross validation process for R2 > 0.22. These trends were also observed within the validation set for all subjects, as well as within the validation sets divided by gender (P < 0.0001, R2 > 0.46). DISCUSSION: The set of nine SNPs identified in this study may be useful for prospective predictions of BMI.

Entities: Disease Gene Mutation Species

Keywords: BMI; Prediction; wGRS

Year: 2017 PMID： 28674662 PMCID： PMC5493974 DOI： 10.7717/peerj.3510

Source DB: PubMed Journal: PeerJ ISSN： 2167-8359 Impact factor: 2.984

Introduction

Body Mass Index (BMI) is widely used as a diagnostic measurement of obesity which in turn is related to various diseases such as heart disease, Type 2 diabetes, hypertension, and osteoarthritis (1998). Because of the important role BMI plays, numerous genome-wide association studies (GWAS) have been conducted to identify BMI-associated single nucleotide polymorphisms (SNPs). From these studies, researchers have identified several significant genes found to be related to BMI such as FTO, BDNF, and MC4R, among others (Felix et al., 2016; Locke et al., 2015; Speliotes et al., 2010). Further replication studies across various populations have also reported significant associations between these genes and BMI (Bonaccorso et al., 2015; Munoz-Yanez et al., 2016; Neocleous et al., 2016). Using these results, several studies have developed prediction models for BMI and obesity among various ethnic groups (Bae et al., 2016; Hung et al., 2015; Peterson et al., 2014). Although several BMI or obesity prediction models have been developed, these models have not been applicable to the Korean population. One issue is the significant genetic difference between the Korean population and other populations. The significance of many BMI-associated SNPs in GWAS conducted on other populations have not been replicated in the Korean population. Moreover, although SNPs in the prediction models have shown significant associations with BMI, the effective size difference of these prediction models are unsuitable for the Korean population. These results suggest a need for new BMI prediction methods that can be applied to the Korean population. There are several methods for predicting specific diseases or clinical traits using multiple loci. The weighted genetic risk score (wGRS) is one simple and effective method for constructing SNP sets integrating the multiplied risk allele number of each SNP with the regression coefficient. Several previous studies have shown the effectiveness of using the wGRS to build a prediction model for various diseases (Chen et al., 2011; Palmieri et al., 2017; Thanassoulis et al., 2012). In the present study, we aimed to identify BMI-associated SNPs for the Korean population using genotype data obtained from the Korea Association Resource (KARE) (Cho et al., 2009). To increase the validity of the study, we used the significant SNPs identified in previous BMI-related GWA-studies. Furthermore, we conducted SNP selection based on 10-fold cross validation used in conjunction with the wGRS on a SNP selection set consisting of 5,410 samples from Korean subjects. The wGRS of the selected SNPs was applied to an independent validation set consisting of 601 samples.

Method

Study subjects

The genotype data used in the present study were obtained from the KARE project (Cho et al., 2009). This study approved by Public Institutional Bioethics Committee designated by the Ministry of Health and Welfare (P01-201502-31-002). To ensure data quality, we eliminated samples and SNPs with a call rate of less than 98%. SNPs with a MAF of less than 0.05 were also excluded from our data set. A total of 6,011 samples (2,903 male and 3,108 female) were used for statistical analyses. The 6,011 samples were divided into one set of 5,410 samples (2,613 male and 2,797 female) to be used as a SNP selection set for 10-fold cross-validation and one set of 601 samples (290 male and 311 female) to be used as a validation set. The G*Power Version 3.1 software (Universität Kiel, Germany) (Faul et al., 2009) was used to calculate statistical powers. The software found both the test set (n = 541) and validation set (n = 601) to be at over 95%. Details on the number of samples are summarized in Table S1.

SNP pruning for statistical analyses

In order to identify reliable SNPs for BMI prediction, we first identified the significant SNPs reported in prior BMI-related GWAS which had been validated by at least one secondary replication study. Then, we obtained the genotype data for the collected GWAS markers including other markers from nearby regions (±10 kb from the GWAS markers) from the KARE data (10,568 SNPs). To avoid the issue of high linkage disequilibrium (LD) found in the wGRS method, the LD coefficients (r2 > 0.2) of all pairs of SNPs were calculated using the Haploview software (Barrett et al., 2005). Finally, we obtained a set of 193 SNPs which showed significant relationships with BMI in the previous GWA-studies. We also identified SNPs in the regions near the reported SNPs. The P-values obtained from regression analyses conducted on the training set (n = 4, 869) were used to identify the most significant SNPs. The regression analysis was conducted using the GoldenHelix SVS8 software (Bozeman, MT, USA).

SNP selection for BMI prediction

From the SNP selection set (5,410 samples), 10-fold cross validation was conducted on the genotype data (training set of 4,869 subjects and test set of 541 subjects) to identify the SNPs to be used for BMI prediction. The natural log-transformed BMI values were used for statistical analyses. SNPs were selected as tagging SNPs for each training set only where their P-value was less than 0.05 and where they were identified to be significant SNPs with the same LD. The wGRS was calculated as the sum of the number of BMI-increasing alleles multiplied by the regression slope across all variants in each set as previously described (; n = number of SNP, Weight: regression slope value of SNP) (Hung et al., 2015). Then, we divided the wGRS of each set into five sections in a subject number-dependent manner and calculated the average BMI values of the sections to obtain the trend lines. After 10-fold cross validation, we selected nine SNPs which overlapped across all training sets (Table S2). Detailed information and analysis of the selected SNPs are detailed in Table 1 and Fig. 1, respectively. The application process of wGRS using the selected nine SNPs to the independent validation set was the same as described above. The overall P-values for the trend lines were calculated using GraphPad Software (La Jolla, CA, USA) for all BMI values of each wGRS section.

Table 1

Results of regression analysis and allele information using SNP selection set including additionally constructed sets.

				P-values		Allele information				Genotype Count with BMI average
Markers	Gene	Location	Position	Present study	GWAS catalog	BMI increasing	Minor	Major	MAF	C/C (BMI log.)	C/R (BMI log.)	R/R (BMI log.)	LD (r2>0.8)	GWAS catalog
rs12566985	FPGT- TNNI3K	1:74536509	Intron	0.001 − 0.03	2.00 × 10⁻¹⁰	G	A	G	0.11	4,260 (3.197)	1,061 (3.187)	89 (3.169)	–	Felix et al. (2016)
rs6545809	ADCY3	2:24903846	Intron	0.002 − 0.02	6.00 × 10⁻⁹	T	T	C	0.44	1,705 (3.191)	2,660 (3.193)	1,045 (3.204)	rs10182181	Locke et al. (2015)
rs2943634	–	2:226203364	Intergenic	0.0002 − 0.02	2.00 × 10⁻¹⁴	C	A	C	0.08	4,577 (3.197)	805 (3.182)	28 (3.184)	–	Manning et al. (2012)
rs734597	–	6:50868566	Intergenic	0.0007 − 0.04	3.00 × 10⁻²⁰	A	A	G	0.19	3,547 (3.192)	1,680 (3.199)	183 (3.202)	rs987237	Speliotes et al. (2010)
rs11030104	BDNF	11:27662970	Intron	0.0004 − 0.006	5.00 × 10⁻¹⁹	A	G	A	0.45	1,617 (3.200)	2,708 (3.195)	1,085 (3.184)	–	Locke et al. (2015)
rs7988412	GTF3A	13:27426145	Intron	0.001 − 0.02	2.00 × 10⁻⁷	T	T	C	0.13	4,053 (3.192)	1,275 (3.203)	82 (3.197)	rs12016871	Locke et al. (2015)
rs2241423	MAP2K5	15:67794500	Intron	0.008 − 0.04	1.00 × 10⁻¹⁸	G	G	A	0.37	2,162 (3.192)	2,508 (3.193)	740 (3.205)	–	Speliotes et al. (2010)
rs7202116	FTO	16:53787703	Intron	0.00004 − 0.0006	2.00 × 10⁻¹⁰	G	G	A	0.13	4,143 (3.191)	1,162 (3.206)	105 (3.202)	–	Yang et al. (2012)
rs6567160	–	18:60161902	Intergenic	0.00006 − 0.002	5.00 × 10⁻³⁰	C	C	T	0.24	3,101 (3.190)	1,979 (3.198)	330 (3.212)	–	Locke et al. (2015)

Notes.

Gene name and location and position of the SNPs were obtained from the NCBI database. The P-values were calculated using logistic regression on the SNP selection set as well as on the additional re-constructed sets (10 sets). Hyphens(–) Indicate that there was no data available or it was not applicable. C/C, C/R, and R/R represent the homozygote of the major allele, and the heterozygote and homozygote of the minor allele, respectively.

Body Mass Index

Minor allele frequency

Linkage Disequilibrium

Figure 1

Analysis flow chart used in the present study.

Notes. Gene name and location and position of the SNPs were obtained from the NCBI database. The P-values were calculated using logistic regression on the SNP selection set as well as on the additional re-constructed sets (10 sets). Hyphens(–) Indicate that there was no data available or it was not applicable. C/C, C/R, and R/R represent the homozygote of the major allele, and the heterozygote and homozygote of the minor allele, respectively. Body Mass Index Minor allele frequency Linkage Disequilibrium

Results

The average age and average BMI values were slightly higher among female subjects than male subjects across the complete set of samples, the SNP selection set, and the validation set. Detailed information of the subjects is given in Fig. 1 and Table S1. An analysis flow chart for the present study is displayed in Fig. 1. The results of each set of 10-fold cross-validations show that the BMI values increased in all training sets and corresponding test sets with respect to wGRS for P-values of <0.0005 and R2 of >0.2. The detailed P-values of the SNPs and BMI trend lines for the training set and test set are detailed in Table S2 and Fig. S1, respectively. Of the 28 SNPs applied to the cross-validation process, we selected only nine SNPs which overlapped across all 10-fold cross-validations conducted for BMI prediction. The nine SNPs are listed in Table 1 with their location, allele information, and genotype data as obtained from the SNP selection set. We calculated the wGRS using the nine SNPs based on the SNP selection set (n = 5, 410) and applied the wGRS to final validation set (n = 601). As expected, the results of the SNP selection set showed upward BMI trends across all three groups with P-values of 0.002 for the complete set, 0.01 for males, and 0.008 for females, respectively (data not shown). To confirm the upward BMI trends and significance of SNPs, we randomly re-constructed an additional nine sets which had the same SNP selection set size (n = 5, 410) and final validation set size (n = 601). The results showed that the nine SNPs were significantly related to BMI across all additionally created sets (P < 0.05). The detailed range of P-values is listed in Table 1. The standard curves for the SNP selection sets also showed significant association with BMI (P < 0.0001) with R2 values of 0.9729 for the complete set, 0.9296 for males, and 0.9598 for females (Fig. 2).

Figure 2

Analysis results applying the wGRS of the nine selected SNPs to the SNP selection set and to the final validation set.

Analysis results applying the wGRS of the nine selected SNPs to the SNP selection set and to the final validation set.

The BMI trends of each analysis group are displayed using their standard curve with the overall P-values and R2. The black closed and punctured circles represent the BMI values of each wGRS section in the SNP selection and validation set, respectively. The black and red dashed lines are the standard curves for the SNP selection and validation set. (A) BMI trends of SNP selection set and validation set using total subjects. (B) BMI trends of SNP selection set and validation set using male samples. (C) BMI trends of SNP selection set and validation set using female samples. Application results of the wGRS to the final validation sets showed increasing trends with a P-value of <0.0001 and R2 value of 0.6766 (Fig. 2A). This trend was also observed in the analyses for male (P-value of <0.0001 with R2 of 0.5359) and female populations (P-value of <0.0001 with R2 of 0.4632) (Figs. 2B and 2C).

Discussion

After the first large-scale GWAS was conducted using the KARE data with respect to several clinical traits (Cho et al., 2009), numerous follow-up studies have also been performed on independent cohorts. Of these studies, one reported that one SNP ( in LOC729076) was found to be significantly associated with BMI after applying Bonferroni correction with a P-value of 1.45 × 10−7 (Bae et al., 2016). Several SNPs across various genes including FTO, BDNF, and MC4R were also found to be associated with BMI. Additional replication studies have consistently confirmed the significance of these SNPs in genes (Cha et al., 2011; Cha et al., 2008; Hong et al., 2012). In the present study, we aimed to identify BMI-related SNPs using the KARE data in order to build a SNP set for BMI prediction. It is widely accepted that the influence of SNPs on BMI is small. Therefore, our study focused on increase of validity of using SNPs to understand BMI. We applied the wGRS to various test sets including validation sets and consistently found an increase in BMI trends. The R2-values of the standard curves in the final validations (>0.4) suggest that the nine selected SNPs were not perfect indicators of BMI, but were significant enough to be considered for BMI prediction within the Korean population. Our results also indicated that the selected markers were not only associated to BMI within other populations but also within the Korean population. There have been only a few studies conducted on BMI and/or obesity prediction in adulthood. We sought to find whether the 9 SNPs used in our models had been included in previous studies. From our research, we found that in MAK2K5 had been selected as one of SNPs for an obesity model for Caucasian populations (Hung et al., 2015). In addition, in BDNF had been used in one study for BMI prediction within Korean populations (Bae et al., 2016). Two other BDNF SNPs ( and ) had been used in prediction models and had been found to be in high LD (r2 > 0.8) with . Similarly, SNPs in FTO ( or , r2 > 0.8) and ADCY3 ( or , r2 > 0.8) were used in other models (Bae et al., 2016; Hung et al., 2015; Sandholt et al., 2010). These results serve as evidence to support the validity of our results. In addition, comparison of two studies conducted on Korean populations suggest that at least 3 SNPs ( in BDNF, in ADCY3, and in FTO) might play crucial roles in BMI prediction for Korean populations. There have been numerous reports which showing significant associations between BMI and SNPs in BDNF, ADCY3, and FTO. The functional role of several SNPs of various genes have also been revealed. Of the three SNPs , , and which had been selected in both previous and the present study using Korean populations, in BDNF was found to influence eating behavior, causing lower satiety responsiveness in children (Monnereau et al., 2017). In addition, a previous meta-analysis reported that in ADCY3 was associated with BMI in East Asian populations (Wen et al., 2012). Although the functional role of in BMI has not yet been fully demonstrated, one bioinformatics study reported that was an expression quantitative trait loci of the ADCY3 gene (Yang et al., 2010). Further, it was demonstrated that in FTO had an impact on weight stabilization (Woehning et al., 2013). One limitation in this study is that there is no further confirmation from other independent subjects. To rectify this limitation, we set aside 601 samples from the total 6,011 samples as a final validation set. Because of this limitation, this study could not identify specific markers for the Korean population, a step which would require additional validation sets. Future studies to build a more appropriate BMI prediction model using Korean-specific markers should be considered. In summary, we identified nine BMI-related SNPs in a Korean population using the KARE data. Our results showed upward BMI trends across the samples using a 10-fold cross validation process. Application of our BMI prediction set to a final validation set showed a similar increasing BMI trend when using the wGRS. Although our study has some limitation as described above, the results from the present study might be useful for further BMI-related research. Click here for additional data file.

Supplementary Figure 1

BMI trends of training sets and test sets. Each set was constructed using only 5,410 samples (4,869 for the training sets and 541 for the test sets). The overall P-values and R2 of each trend line are displayed. The black circle and red punctured circle represent the BMI values for the training set and test set, respectively, using 10-fold cross-validation. The black and red dashed lines are the standard curves for the complete training set and test set. (A) BMI trends of training set and test set using all samples. (B) BMI trends of training set and test set using male samples. (C) BMI trends of training set and test set using female samples. Click here for additional data file. Click here for additional data file.

26 in total

1. A novel MC4R deletion coexisting with FTO and MC1R gene variants, causes severe early onset obesity.

Authors: Vassos Neocleous; Christos Shammas; Marie M Phelan; Pavlos Fanis; Maria Pantelidou; Nicos Skordis; Christos Mantzoros; Leonidas A Phylactou; Meropi Toumba
Journal: Hormones (Athens) Date: 2016-07 Impact factor: 2.885

2. The brain-derived neurotrophic factor (BDNF) Val66Met polymorphism is associated with increased body mass index and insulin resistance measures in bipolar disorder and schizophrenia.

Authors: Stefania Bonaccorso; Monsheel Sodhi; Jiang Li; William V Bobo; Yuejin Chen; Mevhibe Tumuklu; Christos Theleritis; Karuna Jayathilake; Herbert Y Meltzer
Journal: Bipolar Disord Date: 2015-04-15 Impact factor: 6.744

Review 3. Clinical Guidelines on the Identification, Evaluation, and Treatment of Overweight and Obesity in Adults--The Evidence Report. National Institutes of Health.

Authors:
Journal: Obes Res Date: 1998-09

4. A genetic risk score is associated with incident cardiovascular disease and coronary artery calcium: the Framingham Heart Study.

Authors: George Thanassoulis; Gina M Peloso; Michael J Pencina; Udo Hoffmann; Caroline S Fox; L Adrienne Cupples; Daniel Levy; Ralph B D'Agostino; Shih-Jen Hwang; Christopher J O'Donnell
Journal: Circ Cardiovasc Genet Date: 2012-01-10

5. Influence of genetic variants associated with body mass index on eating behavior in childhood.

Authors: Claire Monnereau; Pauline W Jansen; Henning Tiemeier; Vincent W V Jaddoe; Janine F Felix
Journal: Obesity (Silver Spring) Date: 2017-02-28 Impact factor: 5.002

6. Meta-analysis identifies common variants associated with body mass index in east Asians.

Authors: Wanqing Wen; Yoon-Shin Cho; Wei Zheng; Rajkumar Dorajoo; Norihiro Kato; Lu Qi; Chien-Hsiun Chen; Ryan J Delahanty; Yukinori Okada; Yasuharu Tabara; Dongfeng Gu; Dingliang Zhu; Christopher A Haiman; Zengnan Mo; Yu-Tang Gao; Seang-Mei Saw; Min-Jin Go; Fumihiko Takeuchi; Li-Ching Chang; Yoshihiro Kokubo; Jun Liang; Mei Hao; Loïc Le Marchand; Yi Zhang; Yanling Hu; Tien-Yin Wong; Jirong Long; Bok-Ghee Han; Michiaki Kubo; Ken Yamamoto; Mei-Hsin Su; Tetsuro Miki; Brian E Henderson; Huaidong Song; Aihua Tan; Jiang He; Daniel P-K Ng; Qiuyin Cai; Tatsuhiko Tsunoda; Fuu-Jen Tsai; Naoharu Iwai; Gary K Chen; Jiajun Shi; Jianfeng Xu; Xueling Sim; Yong-Bing Xiang; Shiro Maeda; Rick T H Ong; Chun Li; Yusuke Nakamura; Tin Aung; Naoyuki Kamatani; Jian-Jun Liu; Wei Lu; Mitsuhiro Yokota; Mark Seielstad; Cathy S J Fann; Jer-Yuarn Wu; Jong-Young Lee; Frank B Hu; Toshihiro Tanaka; E Shyong Tai; Xiao-Ou Shu
Journal: Nat Genet Date: 2012-02-19 Impact factor: 38.330

7. Prediction of Quantitative Traits Using Common Genetic Variants: Application to Body Mass Index.

Authors: Sunghwan Bae; Sungkyoung Choi; Sung Min Kim; Taesung Park
Journal: Genomics Inform Date: 2016-12-30

8. Crohn's Disease Localization Displays Different Predisposing Genetic Variants.

Authors: Orazio Palmieri; Fabrizio Bossa; Maria Rosa Valvano; Giuseppe Corritore; Tiziana Latiano; Giuseppina Martino; Renata D'Incà; Salvatore Cucchiara; Maria Pastore; Mario D'Altilia; Daniela Scimeca; Giuseppe Biscaglia; Angelo Andriulli; Anna Latiano
Journal: PLoS One Date: 2017-01-04 Impact factor: 3.240

9. On the association of common and rare genetic variation influencing body mass index: a combined SNP and CNV analysis.

Authors: Roseann E Peterson; Hermine H Maes; Peng Lin; John R Kramer; Victor M Hesselbrock; Lance O Bauer; John I Nurnberger; Howard J Edenberg; Danielle M Dick; Bradley T Webb
Journal: BMC Genomics Date: 2014-05-14 Impact factor: 3.969

10. Polymorphisms FTO rs9939609, PPARG rs1801282 and ADIPOQ rs4632532 and rs182052 but not lifestyle are associated with obesity related-traits in Mexican children.

Authors: C Muñoz-Yáñez; R Pérez-Morales; H Moreno-Macías; E Calleros-Rincón; G Ballesteros; R A González; J Espinosa
Journal: Genet Mol Biol Date: 2016-07-14 Impact factor: 1.771

5 in total

1. Genotype-expression interactions for BDNF across human brain regions.

Authors: Patrick Devlin; Xueyuan Cao; Ansley Grimes Stanfill
Journal: BMC Genomics Date: 2021-03-23 Impact factor: 3.969

2. Molecular modelling of novel ADCY3 variant predicts a molecular target for tackling obesity.

Authors: Meropi Toumba; Pavlos Fanis; Dimitrios Vlachakis; Vassos Neocleous; Leonidas A Phylactou; Nicos Skordis; Christos S Mantzoros; Maria Pantelidou
Journal: Int J Mol Med Date: 2021-11-25 Impact factor: 4.101

3. Body mass index but not genetic risk is longitudinally associated with altered structural brain parameters.

Authors: Anne Tüngler; Sandra Van der Auwera; Katharina Wittfeld; Stefan Frenzel; Jan Terock; Nele Röder; Georg Homuth; Henry Völzke; Robin Bülow; Hans Jörgen Grabe; Deborah Janowitz
Journal: Sci Rep Date: 2021-12-20 Impact factor: 4.379

4. Fine Mapping of the MAP2K5 Region Identified rs7175517 as a Causal Variant Related to BMI in China and the United Kingdom Populations.

Authors: Ce Lu; Hai-Jun Wang; Jie-Yun Song; Shuo Wang; Xue-Ying Li; Tao Huang; Hui Wang
Journal: Front Genet Date: 2022-03-16 Impact factor: 4.599

5. Prediction of cholesterol ratios within a Korean population.

Authors: Jin Sol Lee; Hyun Sub Cheong; Hyoung Doo Shin
Journal: R Soc Open Sci Date: 2018-01-17 Impact factor: 2.963

5 in total