Literature DB >> 29545823

Genome-Wide Association Study of Serum 25-Hydroxyvitamin D in US Women.

Katie M O'Brien1, Dale P Sandler2, Min Shi1, Quaker E Harmon2, Jack A Taylor2, Clarice R Weinberg1.   

Abstract

Genetic factors likely influence individuals' concentrations of 25-hydroxyvitamin D [25(OH)D], a biomarker of vitamin D exposure previously linked to reduced risk of several chronic diseases. We conducted a genome-wide association study of serum 25(OH)D (assessed using liquid chromatography-tandem mass spectrometry) and 386,449 single nucleotide polymorphisms (SNPs). Our sample consisted of 1,829 participants randomly selected from the Sister Study, a cohort of women who had a sister with breast cancer but had never had breast cancer themselves. 19,741 SNPs were associated with 25(OH)D (p < 0.05). We re-assessed these hits in an independent sample of 1,534 participants who later developed breast cancer. After pooling, 32 SNPs had genome-wide significant associations (p < 5 × 10-8). These were located in or near GC, the vitamin D binding protein, or CYP2R1, a cytochrome P450 enzyme that hydroxylates vitamin D to form 25(OH)D. The top hit was rs4588, a missense GC polymorphism associated with a 3.5 ng/mL decrease in 25(OH)D per copy of the minor allele (95% confidence interval [CI]: -4.1, -3.0; p = 4.5 × 10-38). The strongest SNP near CYP2R1 was rs12794714, a synonymous variant (p = 3.8 × 10-12; β = 1.8 ng/mL decrease in 25(OH)D per minor allele [CI: -2.2, -1.3]). Serum 25(OH)D concentrations from samples collected from some participants 3-10 years after baseline (811 cases, 780 non-cases) were also strongly associated with both loci. These findings augment our understanding of genetic influences on 25(OH)D and the possible role of vitamin D binding proteins and cytochrome P450 enzymes in determining measured levels. These results may help to identify individuals genetically predisposed to vitamin D insufficiency.

Entities:  

Keywords:  25-dihydroxy vitamin D; CYP2R1; genome-wide association study; single nucleotide polymorphism; vitamin D binding protein

Year:  2018        PMID: 29545823      PMCID: PMC5838824          DOI: 10.3389/fgene.2018.00067

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

While randomized clinical trials of have failed to establish direct links between vitamin D supplementation and health (Bjelakovic et al., 2011, 2014; Avenell et al., 2014), observational studies have demonstrated that high levels of 25-hydroxyvitamin D [25(OH)D], a vitamin D biomarker that can be measured in blood, are associated with lower mortality (Garland et al., 2014) and reduced risk of many chronic diseases, including heart disease and some cancers (Gandini et al., 2011; Autier et al., 2014; Zhang et al., 2017). Determinants of 25(OH)D include ultraviolet-B radiation, dietary supplements, and certain foods, including fish and fortified dairy products. Genetic factors are also thought to play a role in determining blood concentrations. Vitamin D sufficiency is typically assessed by measuring serum or plasma concentrations of 25(OH)D, which is a stable precursor to the active form of vitamin D, 1,25(OH)2D. The Institute of Medicine guidelines suggest that 25(OH)D concentrations above 20 ng/mL are sufficient for bone health (Ross et al., 2011), but higher levels may provide additional health benefits (Garland and Gorham, 2016; Durazo-Arvizu et al., 2017; O'Brien et al., 2017b; Zhang et al., 2017). Previous genome-wide association studies (GWAS) have identified several single nucleotide polymorphisms (SNPs) associated with 25(OH)D concentrations. To date, eight such GWAS have been published (Benjamin et al., 2007; Ahn et al., 2010; Engelman et al., 2010; Wang et al., 2010; Lasky-Su et al., 2012; Anderson et al., 2014; Sapkota et al., 2016; Jiang et al., 2018), identifying a total of 17 SNPs in 7 chromosomal locations (in or near GC, NADSYN1/DHCR7, CYP2R1/RRAS2/PDE3B, CYP24A1, SSTR4/FOXA2, AMDHD1, and SEC23A) with association p-values < 5 × 10−8 (PheGen I: Phenotype-Genotype Integrator-National Center for Biotechnology Information, 2014; MacArthur et al., 2017). Here, we introduce the first vitamin D GWAS to use the current gold-standard measure of 25(OH)D, liquid chromatography/tandem mass spectrometry (LC/MS). LC/MS has been shown to outperform other commonly-utilized methods (Farrell et al., 2012) and is capable of measuring concentrations of 3-epi-25(OH)D3, which is thought to play a similar biological role to 25(OH)D3 (Cashman et al., 2014). We conducted this work using serum collected from a random sample of women participating in a large prospective cohort study. We re-assessed the top SNPs in an independent set of serum samples from women in the same cohort who later developed breast cancer, and again in serum samples collected from some of the same cases (post-diagnosis) and non-cases 3–10 years after baseline. We also performed haplotype analyses of key genetic regions.

Materials and methods

This work was conducted using data from the Sister Study, a prospective cohort of 50,884 women who had a full or half-sister with a history of breast cancer, but who had never had breast cancer themselves at the time they enrolled in the study. Participants aged 35–74 and living in the United States or Puerto Rico joined the study between 2003 and 2009. They were visited in their homes by a trained examiner, who obtained written informed consent and collected the blood samples needed for 25(OH)D and genotype analyses. A subset of participants, including some who had developed breast cancer and some who had not, were asked to provide a second blood sample and other biospecimens in 2013–2014, 3–10 years after baseline. Further details on the study protocol, which also included extensive questionnaires and additional biospecimens, are available elsewhere (Sandler et al., 2017). Approval and oversight for the Sister Study is provided by the Institutional Review Boards of the National Institute of Environmental Health Sciences and the Copernicus group. Analyses were completed using data release 4.1 (July 2014). Participants for the vitamin D and genetics sub-studies were selected using a case-cohort design that included a random sample from the full cohort (n = 1,829, including 67 women who went on to develop breast cancer) and all remaining women diagnosed with invasive breast cancer or ductal carcinoma in situ within 5 years of their baseline blood draw (n = 1,534 additional cases, for 1,601 total cases) (O'Brien et al., 2017b). This included 28 pairs of sisters, who were treated as independent observations despite their genetic similarities [between-sister R2 for 25(OH)D levels = 0.03]. Blood samples from these women were assayed for 533,631 SNPs using the Infinium OncoArray genotyping panel (Illumina Inc.) (Amos et al., 2016). This panel includes a full GWAS backbone, as well as ancestry informative markers (AIMs) and genes presumably or possibly linked to cancer or cancer-related factors. Serum samples were assessed for 25(OH)D concentrations using LC/MS at Heartland Assays, Inc. Measured concentrations were adjusted for batch effects and season of blood draw and thus approximate average annual 25(OH)D. Further details on both the SNP and 25(OH)D analysis can be found elsewhere (Amos et al., 2016; O'Brien et al., 2017a,b). After excluding SNPs that did not meet quality control standards (n = 41,664, as described previously, Amos et al., 2016) or that had a minor allele frequency (MAF) less than 2% in the sub-cohort (n = 105,518), 386,449 SNPs remained. We calculated Hardy-Weinberg equilibrium p-values and examined our top hits for evidence of disequilibrium, but did not exclude SNPs based on these results. We regressed 25(OH)D on the number of copies of the minor allele for each of these SNPs using linear least-squares. The values of 25(OH)D looked normally distributed. We adjusted each model for age at blood draw (in years), self-reported race/ethnicity [Non-Hispanic White (n = 1,576), Black (n = 134), Hispanic (n = 81), or other (n = 38)] and genetic ancestry (proportion CEU, YRI, or CHB) (O'Brien et al., 2017a) and calculated the genomic control inflation factor (λ) to test for evidence of uncontrolled confounding due to population stratification. We also conducted sensitivity analyses adjusting for estimated total vitamin D intake at baseline (dietary plus supplement) and hours spent outdoors per year, as these both contribute to variations in measured 25(OH)D concentrations. Analyses were conducted using SAS (v9.3), R (v3.2.1), or PLINK (v1.07). Locus plots were made using LocusZoom (Pruim et al., 2011). Primary analyses used the baseline 25(OH)D measurements from the randomly selected sub-cohort. For replication analyses, we also analyzed associations in the set of women who later developed breast cancer, assessing the relationship between 25(OH)D and any SNPs for which the p-value had been < 0.05 in the sub-cohort. We excluded the 67 cases selected into the sub-cohort. In analyses pooling the sub-cohort and cases we defined genome-wide significance as p < 5 × 10−8. The pooled analysis included additional adjustment for future case status. We also examined the association between these SNPs and 25(OH)D concentrations in blood samples collected 3–10 years after enrollment. In 2013, a total of 3,762 women who were originally selected for participation in the case-cohort sample were asked to provide secondary biospecimens. We collected second blood samples for 1,227 women who had been diagnosed with breast cancer while on study [assaying 811 for 25(OH)D] and 1,203 who had remained breast cancer-free [assaying 780 for 25(OH)D]. Measurements were based on LC/MS (Heartland Assays, Inc.). We also conducted haplotype analyses of the top two regions using baseline data from the sub-cohort and cases. These were conducted separately in Non-Hispanic whites and African-Americans due to concerns about population stratification and racial differences in linkage disequilibrium (Stram and Seshan, 2012). We used expectation-maximization software (“hapassoc” package in R; Burkett et al., 2004, 2006) to estimate haplotype frequencies and the association between 25(OH)D and each copy of the index haplotype, relative to the most common haplotype in non-Hispanic whites. We assessed all haplotypes with estimated frequency of at least 2%, pooling the rare haplotypes into a single category. Models were adjusted for age at blood draw, ancestry, and future case status.

Results

Women in the sub-cohort were 55.3 years of age, on average, when they joined the study and had average serum 25(OH)D concentrations of 31.8 ng/mL (Table 1). Most participants were non-Hispanic white (86%). Women who later developed breast cancer were 57.4 years of age at baseline and had average 25(OH)D concentration of 31.0 ng/mL. On average, 7.8 years passed between baseline and follow-up blood collection. At that time, serum 25(OH)D concentrations were 40.4 ng/mL in non-cases and 43.5 ng/mL in post-diagnosis cases. Non-Hispanic white women comprised 89% the follow-up group.
Table 1

Description of study sample.

BaselineSecond blood draw 3–10 years after baseline (“Sisters Changing Lives”)
Sub-cohort (n = 1,829)bCases (n = 1,534)c,dNon-cases (n = 780)Cases (n = 811)d
Age at blood draw; mean (std)55.3 (8.9)57.4 (8.9)63.1 (8.7)65.5 (8.8)
25(OH)D (ng/mL)a; mean (std)31.8 (10.5)31.0 (10.1)40.4 (13.7)43.5 (14.3)
Self-reported Race/Ethnicity; n (%)
Non-hispanic white1576 (86)1307 (85)693 (89)722 (89)
African-American134 (7)120 (8)46 (6)38 (5)
Hispanic81 (4)59 (4)27 (3)27 (3)
Other38 (2)47 (3)14 (2)23 (3)

Adjusted for batch and season of blood draw.

Includes 67 women who went on to develop breast cancer.

Excludes cases selected as part of the random sub-cohort.

1 case missing self-reported race/ethnicity.

Description of study sample. Adjusted for batch and season of blood draw. Includes 67 women who went on to develop breast cancer. Excludes cases selected as part of the random sub-cohort. 1 case missing self-reported race/ethnicity. Results from preliminary analyses conducted within the sub-cohort are shown as a Manhattan plot (Figure 1A) and quantile-quantile plot (Figure 1B). There was no evidence of residual confounding due to population stratification (λ = 1.007). 19,741 SNPs were associated with 25(OH)D at p < 0.05. The top hit was rs4588 (p = 6.8 × 10−23; Table 2), located in the vitamin D binding protein (VDBP) gene (GC) on chromosome 4. SNPs in or near CYP2R1, which encodes a cytochrome P450 vitamin D hydroxylase, also showed evidence of an association with 25(OH)D, with rs117913124 having the lowest p-value in the sub-cohort (p = 1.3 × 10−10).
Figure 1

(A) Manhattan plot for 25(OH)D in the sub-cohort (n = 1,829). (B) Quantile-quantile plot for 25(OH)D in the sub-cohort (n = 1,829).

Table 2

Single nucleotide polymorphism associated with serum 25(OH)D levels at p < 5 × 10−8 in the Sister Study (2003–2009).

SNP-rare alleleLocationaMAF, sub-cohortHWE p-value, subcohortP-value, sub-cohortP-value, casesβb (95% CI), pooled samplecP-value, pooled samplecP-value in follow-up studyc,d
CHROMOSOME 4, GC
rs4588-A726183230.270.556.8 × 10−238.9 × 10−17−3.5 (−4.1, −3.0)4.5 × 10−385.8 × 10−10
rs2282679-C726083830.260.289.6 × 10−233.1 × 10−16−3.5 (−4.0, −3.0)1.9 × 10−379.0 × 10−10
rs1155563-G726434880.270.051.4 × 10−192.9 × 10−11−3.0 (−3.5, −2.5)9.6 × 10−296.6 × 10−7
rs705120-A726141400.420.609.5 × 10−121.6 × 10−11−2.4 (−2.8, −1.9)4.5 × 10−226.2 × 10−5
rs2201124-A725970090.310.709.0 × 10−123.5 × 10−7−2.2 (−2.8, −1.7)3.4 × 10−173.2 × 10−6
rs4694105-A725922140.310.621.7 × 10−112.2 × 10−7−2.2 (−2.8, −1.7)3.5 × 10−174.7 × 10−6
rs1526692-G725787240.400.312.7 × 10−96.8 × 10−8−2.0 (−2.5, −1.5)8.8 × 10−161.9 × 10−5
rs6837549-A725968210.490.326.7 × 10−71.1 × 10−61.7 (1.2, 2.2)4.6 × 10−120.003
rs13113067-A727390980.350.265.6 × 10−70.001−1.5 (−2.0, −1.0)3.8 × 10−90.14
rs12639968-A727128720.180.537.7 × 10−70.001−1.8 (−2.4, −1.2)7.7 × 10−90.01
rs962227-A727075170.290.173.0 × 10−70.005−1.5 (−2.1, −1.0)1.1 × 10−80.03
rs10033936-G727434740.250.263.8 × 10−67.5 × 10−4−1.6 (−2.2, −1.0)2.2 × 10−80.09
rs12512631-G726013310.350.723.7 × 10−60.0041.4 (0.9, 1.9)3.8 × 10−80.003
CHROMOSOME 11, CYP2R1
rs12794714-A149135750.420.187.9 × 10−79.4 × 10−7−1.8 (−2.2, −1.3)3.8 × 10−124.1 × 10−4
rs201473898-A148937040.420.239.4 × 10−71.2 × 10−6−1.7 (−2.2, −1.2)6.5 × 10−123.5 × 10−4
rs11023227-G144590870.360.277.1 × 10−72.2 × 10−6−1.7 (−2.2, −1.2)1.5 × 10−110.02
rs10832275-C144782240.360.211.3 × 10−61.6 × 10−6−1.7 (−2.2, −1.2)1.6 × 10−110.02
rs7121171-G144464200.360.169.9 × 10−72.3 × 10−6−1.7 (−2.2, −1.2)1.6 × 10−110.02
rs12295888-G144505310.360.219.9 × 10−72.0 × 10−6−1.7 (−2.2, −1.2)1.7 × 10−110.02
rs72261784-D144460600.360.221.4 × 10−62.2 × 10−6−1.7 (−2.2, −1.2)2.5 × 10−110.02
rs11023223-G144571120.360.191.1 × 10−62.6 × 10−6−1.7 (−2.2, −1.2)2.5 × 10−110.02
rs10832294-A147474270.410.274.1 × 10−61.5 × 10−6−1.7 (−2.2, −1.2)4.2 × 10−113.8 × 10−4
rs10832268-A144650680.360.191.9 × 10−63.0 × 10−6−1.7 (−2.2, −1.2)4.7 × 10−110.01
rs11023332-G147841100.420.311.0 × 10−56.6 × 10−7−1.7 (−2.2, −1.2)5.1 × 10−117.5 × 10−4
rs10832269-A144650690.360.192.0 × 10−63.3 × 10−6−1.7 (−2.1, −1.2)5.6 × 10−110.01
rs11023246-G145369560.360.294.9 × 10−61.7 × 10−6−1.7 (−2.2, −1.2)6.2 × 10−110.03
rs117913124-A149009310.021.001.3 × 10−100.01−5.1 (−6.7, −3.6)1.2 × 10−104.2 × 10−6
rs2305305-A145409420.360.261.1 × 10−51.8 × 10−6−1.6 (−2.1, −1.1)1.5 × 10−100.03
rs10766188-G146608260.340.381.8 × 10−43.9 × 10−6−1.5 (−2.0, −1.0)6.1 × 10−90.003
rs1993116-A149102340.380.691.4 × 10−51.3 × 10−41.4 (0.9, 1.9)9.7 × 10−96.3 × 10−4
rs10741657-A149148780.370.734.2 × 10−52.2 × 10−41.4 (0.9, 1.9)4.7 × 10−80.002
rs11023203-A144098150.330.342.8 × 10−53.3 × 10−4−1.4 (−1.9, −0.9)4.9 × 10−80.03

All models adjusted for age at blood draw, self-reported race/ethnicity, and estimated ancestry proportions; MAF, minor allele frequency; HWE, Hardy-Weinberg equilibrium.

GRch37/hg19.

Change in 25(OH)D (in ng/mL) per copy of the minor allele.

Adjusting for case status.

Serum 25(OH)D levels measured in blood collected 3–10 years after baseline for the “Sisters Changing Lives” follow-up study.

Bold values indicate SNP with lowest p-value for the region (based on pooled sample).

(A) Manhattan plot for 25(OH)D in the sub-cohort (n = 1,829). (B) Quantile-quantile plot for 25(OH)D in the sub-cohort (n = 1,829). Single nucleotide polymorphism associated with serum 25(OH)D levels at p < 5 × 10−8 in the Sister Study (2003–2009). All models adjusted for age at blood draw, self-reported race/ethnicity, and estimated ancestry proportions; MAF, minor allele frequency; HWE, Hardy-Weinberg equilibrium. GRch37/hg19. Change in 25(OH)D (in ng/mL) per copy of the minor allele. Adjusting for case status. Serum 25(OH)D levels measured in blood collected 3–10 years after baseline for the “Sisters Changing Lives” follow-up study. Bold values indicate SNP with lowest p-value for the region (based on pooled sample). When we re-assessed the top 19,741 SNPs in the independent sample of women who later developed breast cancer, 1,121 were statistically-significant at p < 0.05. For the pooled sample, which included both cases and sub-cohort members, we identified 32 SNPs in two regions that were associated with 25(OH)D at p < 5 × 10−8 (Table 2). The top SNP for the GC region was again rs4588 (p = 4.5 × 10−38), with each copy of the variant A allele associated with an estimated 3.5 ng/mL decrease in 25(OH)D (95% confidence interval [CI]: −4.1, −3.0). This SNP also had a strong association in the smaller follow-up sample (p = 5.8 × 10−10). Twelve other SNPs in the region surrounding rs4588 were also strongly associated with 25(OH)D in the pooled baseline sample (Table 2 and Figure 2) and, with a few exceptions, also in the follow-up sample.
Figure 2

Fine-mapping of region surrounding rs4588 (chromosome 4), all participants.

Fine-mapping of region surrounding rs4588 (chromosome 4), all participants. A more common CYP2R1 SNP, rs12794714, replaced rs117913124 as the top hit in the region for the pooled sample (p = 3.8 × 10−12 and MAF = 0.42 for rs12794714 vs. p = 1.2 × 10−10 and MAF = 0.02 for rs117913124), with each copy of the A allele associated with a 1.8 ng/mL decrease in 25(OH)D (95% CI: −2.2, −1.3). Seventeen other SNPs in this region also met criteria for genome-wide statistical significance in the pooled sample (Table 2). Though most of these were in moderate to high linkage disequilibrium with rs12794714, they spanned several gene regions, including COPB1, PSMA1, PDE3B, and RRAS2 (Figure 3). A second peak appeared ~454 kb away from rs12794714 at rs11023227 in COPB1, but the signal was not independent (original p = 1.5 × 10−11; p = 0.007 after adjusting for rs12794714; r2 = 0.33). All 19 of the genome-wide significant hits in this region were also associated with 25(OH)D concentrations from the follow-up visit (all p ≤ 0.03). None of the genome-wide significant hits showed evidence of Hardy-Weinberg disequilibrium.
Figure 3

Fine-mapping plot for the region surrounding rs12794714 (chromosome 11), all participants.

Fine-mapping plot for the region surrounding rs12794714 (chromosome 11), all participants. Fifteen other SNPs that had false discovery rate q-values < 0.10 in the pooled sample are described in Supplementary Table 1. Three were from the chromosome 4 locus and four from the chromosome 11 locus (one on COPB1 and two on PDE3B). The remaining eight were independent signals, only two of which represented known genes (rs4951247 on ELK4 and rs360157 on MYO9B). The associations between these SNPs and 25(OH)D measured during follow-up were mostly consistent with those observed for baseline 25(OH)D. When we additionally adjusted for both estimated vitamin D intake at baseline (measured using a food frequency questionnaire plus self-reported supplement use) and self-reported average hours spent outdoors per year, the results were very similar, with all but one of 32 SNPs reported here showing genome-wide significant associations (Supplementary Table 2). While the rankings shuffled around for the chromosome 11 locus, the only notable change was that while rs117913124 was the top ranked SNP for both the sub-cohort specific and pooled analysis, rs12794714 dropped down to 11th with the additional adjustments. There was little change to any of the effect estimates or p-values. For haplotype analyses (Table 3), which were based on the pooled baseline sample, we selected SNPs with very small p-values (p < 1 × 10−10 for the chromosome 4 region and p < 5 × 10−8 for the chromosome 11 region) that were not in high linkage disequilibrium with any other, more strongly associated SNP (r2 < 0.80 in our sample). We excluded rs117913124 due to its low MAF. This resulted in a 6-SNP set including rs4588 and a 5-SNP set including rs12794714. For non-Hispanic whites, there were seven common (frequency >2%) haplotypes for both chromosomal sets. For chromosome 4, each copy of the “GCAAAG” haplotype (frequency = 21%) was associated with a 3.9 ng/mL decrease in 25(OH)D (95% CI: −4.6, −3.2; p = 9.8 × 10−30) in non-Hispanic whites, relative to the most common haplotype (AAGCCA, frequency = 43%). In African-Americans, only “GCACCA” (frequency = 9%) was significantly associated with 25(OH)D levels, with an estimated 4.6 ng/mL increase in 25(OH)D (95% CI: 1.2, 8.0; p = 0.008), per copy, relative to “AAGCCA” (frequency = 17%). For chromosome 11, “AGGGA” (frequency = 26%) was associated with an estimated 2.1 ng/mL decrease per copy in non-Hispanic whites (95% CI: −2.8, −1.5; p = 6.1 × 10−10), relative to the most common “GAAAG” haplotype (frequency = 34%). Haplotype distributions for the chromosome 11 locus were very different for African-Americans, with “GAAGG” being the most common haplotype (frequency = 53%) and no haplotypes showing a statistically-significant difference in 25(OH)D concentrations compared to “GAAAG” (frequency = 26%). Correlation matrices for SNPs in both of these regions are included as Supplementary Figures 1, 2.
Table 3

Haplotype analysis.

HaplotypeNon-Hispanic whites (n = 2,883)African-Americans (n = 254)
Haplotype frequencyβa (95% CI), pooled sampleeP-value, pooled sampleeHaplotype frequencyβa (95% CI), pooled sampleeP-value, pooled samplee
CHROMOSOME 4 (GC): rs1526692, rs6837549, rs2201124, rs705120, rs4588, rs1155563
AAGCCAa0.430.00.170.0
GCAAAGb0.21−3.9 (−4.6, −3.2)9.8 × 10−300.052.7 (−1.8, 7.1)0.24
ACGACAc0.07−1.0 (−2.0, 0.0)0.050.101.1 (−2.5, 4.7)0.54
GCACCAa0.07−0.7 (−1.7, 0.3)0.170.094.6 (1.2, 8.0)8.5 × 10−3
GCGACAc0.07−1.3 (−2.4, −0.3)0.010.122.1 (−1.2, 5.5)0.21
AAGAAGb0.03−4.6 (−6.2, −3.1)3.2 × 10−9
ACGCCAa0.03−0.7 (−2.4, 0.9)0.400.16−1.0 (−5.2, 2.1)0.52
GAGCCAa0.10−1.4 (−5.1, 2.3)0.47
GAGACAc0.06−1.9 (−6.5, 2.7)0.43
AAGACAc0.041.0 (−4.9, 6.9)0.74
Rare pooledd0.08−2.3 (−3.3, −1.3)5.4 × 10−60.11−1.9 (−5.2, 1.4)0.26
CHROMOSOME 11 (CYP2R1): rs11023203, rs11023227, rs10766188, rs1993116, rs12794714
GAAAG0.340.00.260.0
AGGGA0.26−2.1 (−2.8, −1.5)6.1 × 10−100.09−2.4 (−5.8, 1.0)0.17
GAAGG0.15−0.7 (−1.5, 0.1)0.070.53−1.8 (−7.2, 3.7)0.53
GAAGA0.06−1.4 (−2.5, −0.2)0.020.03−1.6 (−3.9, 0.6)0.15
GGGGA0.05−2.0 (−3.2, −0.8)1.5 × 10−3
GAGGA0.03−1.8 (−3.3, −0.3)0.02
AGAGA0.03−2.9 (−4.6, −1.3)4.0 × 10−4
Rare pooledd0.08−1.0 (−2.1, 0.0)0.050.11−0.8 (−3.9, 2.2)0.59

All models adjusted for age at blood draw, estimated ancestry proportions, and breast cancer case status.

Corresponds to the “Gc1s” vitamin D binding protein variant in Non-Hispanic whites (based on genotypes for rs705120 and rs4588).

Corresponds to the “Gc2” vitamin D binding protein variant in Non-Hispanic whites (based on genotypes for rs705120 and rs4588).

Corresponds to the “Gc1f” vitamin D binding protein variant in Non-Hispanic whites (based on genotypes for rs705120 and rs4588).

Rare haplotypes (frequency < 2%) pooled together.

Change in 25(OH)D (in ng/mL) per copy of the index haplotype relative to the most common haplotype (“AAGCCA” for chromosome 4, “GAAAG” for chromosome 11), controlling for all other haplotypes.

Bold values indicate SNP with lowest independently measured p-value (based on pooled sample).

Haplotype analysis. All models adjusted for age at blood draw, estimated ancestry proportions, and breast cancer case status. Corresponds to the “Gc1s” vitamin D binding protein variant in Non-Hispanic whites (based on genotypes for rs705120 and rs4588). Corresponds to the “Gc2vitamin D binding protein variant in Non-Hispanic whites (based on genotypes for rs705120 and rs4588). Corresponds to the “Gc1f” vitamin D binding protein variant in Non-Hispanic whites (based on genotypes for rs705120 and rs4588). Rare haplotypes (frequency < 2%) pooled together. Change in 25(OH)D (in ng/mL) per copy of the index haplotype relative to the most common haplotype (“AAGCCA” for chromosome 4, “GAAAG” for chromosome 11), controlling for all other haplotypes. Bold values indicate SNP with lowest independently measured p-value (based on pooled sample).

Discussion

In this GWAS of vitamin D serum levels, we identified two regions strongly associated with serum 25(OH)D—one on chromosome 4 surrounding the GC gene and the second on chromosome 11 including SNPs from CYP2R1, COPB1, PSMA1, and PDE3B. The identified loci replicated in an independent sample of women selected because they later developed breast cancer and the SNPs were also strongly associated with 25(OH)D concentrations measured in second blood samples from the same participants collected 3–10 years after baseline. To our knowledge, this was the first GWAS to use the gold-standard LC/MS methods to measure total 25(OH)D and the first to examine haplotypes. The GC gene encodes the VDBP, a member of the albumin family that stores and transports both 25(OH)D and the active form of vitamin D, 1,25(OH)2D (Speeckaert et al., 2006). It seems quite plausible that a variant that affects VDBP could directly impact measured serum 25(OH)D. CYP2R1 polymorphisms also have the capacity to directly impact 25(OH)D concentrations, as this gene encodes a cytochrome P450 enzyme responsible for hydroxylating vitamin D and converting it to 25(OH)D (Shinkyo et al., 2004). Our findings augment the results of previous GWAS. Those that reported hits in GC (Ahn et al., 2010; Wang et al., 2010; Lasky-Su et al., 2012; Anderson et al., 2014; Jiang et al., 2018) observed the smallest p-values for rs1155563, rs17467825, rs2282679, and rs3755967. We did not assess rs17467825 or rs3755967, but rs2282679 and rs1155563 had the second and third smallest p-values in our study. However, they both are synonymous substitutions, while rs4588 is a missense substitution in an exon (exon 12; amino acid change Thr to Lys at position 436). Therefore, given their high correlations with rs4588 (r2 = 0.97 and 0.77, respectively), these previously identified SNPs may just be tags for rs4588, which may be a variant that causally impacts 25(OH)D concentrations. This possibility is supported by a number of candidate gene studies that have reported strong associations between rs4588 and measured 25(OH)D (Engelman et al., 2008; Lu et al., 2012; Perna et al., 2013; Robien et al., 2013; Li et al., 2014; Nissen et al., 2014). Of note, rs4588 was not associated with gene expression in quantitative trait loci analyses, but five other nearby genome-wide significant GC SNPs (rs1155563, rs13113067, rs12639968, rs962227, and rs10033936) were associated with expression of VDBP in stomach tissue. Another putative causal variant is rs7041, which also results in a missense substitution in exon 12 (amino acid change Asp to Glu, position 432). We did not measure this SNP, but it is highly correlated with rs705120 in whites (r2 = 0.97 in the CEU sample of 1,000 genomes) (Johnson et al., 2008; 1000 Genomes Project Consortium et al., 2015), and rs705120 had a p-value of 9.5 × 10−12 in our sample. Together, the genotypes for rs4588 and rs705120/rs7041 determine individual's VDBP variants (“AA” = Gc2, “AC” = Gc1f, or “CC” = Gc1s) (Powe et al., 2013). The combination of these variants determine an individual's phenotype, where each phenotype has a different glycosylation pattern and binding affinity (Braun et al., 1992; Abbas et al., 2008; Powe et al., 2013). In our sample, carriers of the Gc2-coding haplotypes had the lowest 25(OH)D concentrations (Table 3). This is consistent with the results of one previous study, which showed that individuals homozygous for Gc2-coding haplotypes (Gc2/Gc2 phenotype) had significantly lower serum concentrations of 25(OH)D than those with Gc1S/Gc1S, Gc1S/Gc1F, or Gc1s/Gc2 phenotypes (Sollid et al., 2016). The same study reported that individuals with the Gc2/Gc2 phenotype had lower VDBP than all other phenotypes. However, the links between rs4588, rs705120/rs7041, GC haplotypes, and concentrations of VDBP and 25(OH)D remain uncertain, as other studies, including some with large samples of African-Americans, have not observed these same associations (Powe et al., 2013; Batai et al., 2014; Denburg et al., 2016; Yao et al., 2017). The role of the other SNPs included in this haplotype analysis is also unclear, though we note that any haplotypes that contained “A” alleles for both rs4588 and rs705120 also contained the “G” allele for rs1155563, a synonymous polymorphism identified in a previous GWAS (Anderson et al., 2014). In our sample, African-Americans were less likely to carry the minor allele at rs4588 (MAF = 0.11 vs. 0.28 in non-Hispanic whites; Supplementary Table 3) and the SNP was not associated with 25(OH)D in African-Americans, though our sample size was quite small (data not shown). In general, most of the GWAS and candidate gene studies that observed positive associations between 25(OH)D and SNPs in GC were conducted in populations of European or Chinese descent (Engelman et al., 2008, 2010; Ahn et al., 2010; Wang et al., 2010; Jorde et al., 2012; Lasky-Su et al., 2012; Lu et al., 2012; Perna et al., 2013; Robien et al., 2013; Zhang et al., 2013; Anderson et al., 2014; Batai et al., 2014; Li et al., 2014; Nissen et al., 2014; Clendenen et al., 2015), with less consistent results seen for African-American populations (Engelman et al., 2008; Batai et al., 2014; Yao et al., 2017). As rs705120 is not correlated with rs7041 in African-Americans (r2 = 0.10 in YRI sample of 1,000 genomes), we cannot determine the VDBP variants of our African-American participants, but prior studies have reported that VDBP variant distributions differ markedly by race, with Gc1F being much more common in African-Americans than whites (Powe et al., 2013; Denburg et al., 2016). Our inability to capture phenotype-relevant haplotypes in African-Americans may explain why we observed no clear associations between GC haplotypes and 25(OH)D concentrations in this group. The positive association between 25(OH)D and “GCACCA” vs. the referent “AAGCCA” haplotype may indicate a more influential role of the first three SNPs (rs1526692, rs6837549, rs2201124) or other nearby correlated SNPs in African-American women, but we again note that our African-American-specific results are based on very small numbers. Larger studies of African-American are needed to help disentangle these complicated relationships. The relationship between the SNPs on chromosome 11 and 25(OH)D is also complex. Our top hit for the region, rs12794714, is a synonymous substitution in exon 1 of CYP2R1. The rare SNP with the strongest association in the sub-cohort, rs117913124 (MAF = 0.02), also results in a synonymous substitution in CYP2R1 (exon 4) and in a recent whole genome sequencing analysis (Manousaki et al., 2017), this variant was strongly associated with vitamin D even after adjusting for more common, previously-identified SNPs in the same region. Earlier GWAS for serum 25(OH)D identified 5 genome-wide significant SNPs on chromosome 11 near CYP2R1 (rs10741657, rs2060793, rs11023332, rs12287212, rs1007392) (Ahn et al., 2010; Wang et al., 2010; Anderson et al., 2014; Jiang et al., 2018). Of these previous GWAS hits, we assessed rs10741657 and rs11023332 (p = 4.9 × 10−8 and p = 1.5 × 10−11, respectively, in the pooled sample). The former is close to rs12794714 (1.3 kb upstream), but is intergenic and the two are only loosely correlated (r2 = 0.40 in our sample). The latter is further away from rs12794714 (129 kb downstream), but is more strongly correlated (r2 = 0.90) and results in a silent substitution in PDE3B. None of the other previously identified hits is an obvious candidate for a causal association: rs1007392 is an intron variant of PDE3B, rs2060793 is located upstream of CYP2R1, and rs12287212 is intergenic. Though many candidate gene studies have assessed the association between 25(OH)D and rs12794714 or other highly correlated SNPs, there is no clear consensus as to the likely causal variant(s) (Ramos-Lopez et al., 2007; Zhang et al., 2012, 2013; Robien et al., 2013; Batai et al., 2014; Ordóñez-Mena et al., 2016). Previously documented ancestral heterogeneity in allele frequencies and measured associations contribute additional uncertainty (Batai et al., 2014; Elkum et al., 2014). Assessments of expression quantitative trait loci showed that the genome-wide significant SNPs in this region were only associated with expression of genes in the direct vicinity (CYP2R1, RRAS2, COPB1, CALCB, and PDE3B). We did not see genome-wide significant association between 25(OH)D and any of the SNPs in DHCR7/NADSYN1 or SSTR4/FOXA2 identified in previous GWAS (Wang et al., 2010; Sapkota et al., 2016; Jiang et al., 2018), though one of the DHCR7/NADSYN1 hits, rs12785878, had a very low p-value (0.0007 in the sub-cohort and 5.9 × 10−5 in the pooled sample) with the minor allele associated with decreased 25(OH)D in both their report (Wang et al., 2010) and our sample (β = −1.1 ng/mL per allele, 95% CI: −1.5, −0.6 for our study). None of the previously reported SSTR4/FOXA2 SNPs showed evidence of an association in our data (all uncorrected p > 0.3), though we note that these SNPs were originally identified in a sample of Punjabi Sikhs (Sapkota et al., 2016). As none of the SNPs identified in a recent genome-wide meta-analysis of men and women of European descent (Jiang et al., 2018) were directly genotyped in our sample (rs17216707 in CYP24A1, rs10745742 in AMDHD1, or rs8018720 in SEC23A), we looked for signals in nearby SNPs, finding none with low p-values (all >0.05). This failure to replicate could be due to chance, lack of power in our study, which was much smaller than the meta-analysis, or to sex-specific differences in the association. Although our sample size was somewhat limited, we were able to replicate our results in an independent sample of women who later developed breast cancer, and to examine associations between the top SNPs and 25(OH)D concentrations in the same individuals at a later time point. All 25(OH)D measures were based on LC/MS, the current gold standard because of its improved precision (Farrell et al., 2012) and its ability to capture 25(OH)D2, 25(OH)D3, and epi-25(OH)D3, where the latter is an epimer of 25(OH)D3 that is thought to have nearly identical functionality (Cashman et al., 2014). These results may not be generalizable to women of all races, particularly our findings for the GC gene and VDBP variants. Additionally, because our sample included women who had a sister with breast cancer, our effect estimates could be inflated for SNPs that interact with one or more breast cancer-related variants in their influence on vitamin D. However, we saw little difference in our results when we assessed women who later became cases vs. the random sub-cohort and our top hits and their effect estimates are consistent with findings of previous studies. In this sample of women enrolled in the Sister Study, SNPs in GC and CYP2R1 were strongly associated with serum 25(OH)D concentrations measured using LC/MS. Although these loci had been identified in earlier GWAS, these findings extend our understanding by pointing to possible roles for specific SNPs within these regions and further elucidating the importance of VDBP and cytochrome P450 enzymes in determining 25(OH)D concentrations. They may also help to identify individuals who are genetically predisposed to lower 25(OH)D and would most benefit from interventions to improve their circulating vitamin D levels.

Author contributions

KO conceived and designed the research project, analyzed the data, wrote the paper, and has primary responsibility for final content. DS developed the overall research plan, oversaw data collection, and conducted study oversight. She also contributed to writing the manuscript. MS performed statistical analyses and contributed to the writing of the manuscript. QH helped with the interpretation of the results and the writing of the manuscript. JT helped to develop the overall research plan and write the paper. CW oversaw the overall research plan, the study design and statistical analysis, and helped to write the paper.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  51 in total

1.  SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap.

Authors:  Andrew D Johnson; Robert E Handsaker; Sara L Pulit; Marcia M Nizzari; Christopher J O'Donnell; Paul I W de Bakker
Journal:  Bioinformatics       Date:  2008-10-30       Impact factor: 6.937

2.  Serum 25-hydroxyvitamin D and the risk of cardiovascular disease: dose-response meta-analysis of prospective studies.

Authors:  Runhua Zhang; Bohong Li; Xiang Gao; Rui Tian; Yuesong Pan; Yong Jiang; Hongqiu Gu; Yilong Wang; Yongjun Wang; Gaifen Liu
Journal:  Am J Clin Nutr       Date:  2017-03-01       Impact factor: 7.045

3.  Genetic Variants in the Vitamin D Pathway, 25(OH)D Levels, and Mortality in a Large Population-Based Cohort Study.

Authors:  José Manuel Ordóñez-Mena; Haifa Maalmi; Ben Schöttker; Kai-Uwe Saum; Bernd Holleczek; Thomas J Wang; Barbara Burwinkel; Hermann Brenner
Journal:  J Clin Endocrinol Metab       Date:  2017-02-01       Impact factor: 5.958

Review 4.  Biological and clinical aspects of the vitamin D binding protein (Gc-globulin) and its polymorphism.

Authors:  Marijn Speeckaert; Guangming Huang; Joris R Delanghe; Youri E C Taes
Journal:  Clin Chim Acta       Date:  2006-05-12       Impact factor: 3.786

5.  Genetic and environmental predictors of serum 25-hydroxyvitamin D concentrations among middle-aged and elderly Chinese in Singapore.

Authors:  Kim Robien; Lesley M Butler; Renwei Wang; Kenneth B Beckman; Dinesha Walek; Woon-Puay Koh; Jian-Min Yuan
Journal:  Br J Nutr       Date:  2012-05-15       Impact factor: 3.718

6.  CYP2R1 (vitamin D 25-hydroxylase) gene is associated with susceptibility to type 1 diabetes and vitamin D levels in Germans.

Authors:  Elizabeth Ramos-Lopez; Patrick Brück; Thomas Jansen; Jürgen Herwig; Klaus Badenhoop
Journal:  Diabetes Metab Res Rev       Date:  2007-11       Impact factor: 4.876

7.  Low-Frequency Synonymous Coding Variation in CYP2R1 Has Large Effects on Vitamin D Levels and Risk of Multiple Sclerosis.

Authors:  Despoina Manousaki; Tom Dudding; Simon Haworth; Yi-Hsiang Hsu; Ching-Ti Liu; Carolina Medina-Gómez; Trudy Voortman; Nathalie van der Velde; Håkan Melhus; Cassianne Robinson-Cohen; Diana L Cousminer; Maria Nethander; Liesbeth Vandenput; Raymond Noordam; Vincenzo Forgetta; Celia M T Greenwood; Mary L Biggs; Bruce M Psaty; Jerome I Rotter; Babette S Zemel; Jonathan A Mitchell; Bruce Taylor; Mattias Lorentzon; Magnus Karlsson; Vincent V W Jaddoe; Henning Tiemeier; Natalia Campos-Obando; Oscar H Franco; Andre G Utterlinden; Linda Broer; Natasja M van Schoor; Annelies C Ham; M Arfan Ikram; David Karasik; Renée de Mutsert; Frits R Rosendaal; Martin den Heijer; Thomas J Wang; Lars Lind; Eric S Orwoll; Dennis O Mook-Kanamori; Karl Michaëlsson; Bryan Kestenbaum; Claes Ohlsson; Dan Mellström; Lisette C P G M de Groot; Struan F A Grant; Douglas P Kiel; M Carola Zillikens; Fernando Rivadeneira; Stephen Sawcer; Nicholas J Timpson; J Brent Richards
Journal:  Am J Hum Genet       Date:  2017-07-27       Impact factor: 11.043

8.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

9.  Common variants in CYP2R1 and GC genes predict vitamin D concentrations in healthy Danish children and adults.

Authors:  Janna Nissen; Lone Banke Rasmussen; Gitte Ravn-Haren; Elisabeth Wreford Andersen; Bettina Hansen; Rikke Andersen; Heddie Mejborn; Katja Howarth Madsen; Ulla Vogel
Journal:  PLoS One       Date:  2014-02-27       Impact factor: 3.240

10.  The Sister Study Cohort: Baseline Methods and Participant Characteristics.

Authors:  Dale P Sandler; M Elizabeth Hodgson; Sandra L Deming-Halverson; Paula S Juras; Aimee A D'Aloisio; Lourdes M Suarez; Cynthia A Kleeberger; David L Shore; Lisa A DeRoo; Jack A Taylor; Clarice R Weinberg
Journal:  Environ Health Perspect       Date:  2017-12-20       Impact factor: 9.031

View more
  13 in total

Review 1.  Genetic variants of mineral metabolism in health and disease.

Authors:  Cassianne Robinson-Cohen
Journal:  Curr Opin Nephrol Hypertens       Date:  2020-07       Impact factor: 2.894

2.  The Case for Case-Cohort: An Applied Epidemiologist's Guide to Reframing Case-Cohort Studies to Improve Usability and Flexibility.

Authors:  Katie M O'Brien; Kaitlyn G Lawrence; Alexander P Keil
Journal:  Epidemiology       Date:  2022-05-01       Impact factor: 4.860

3.  Vitamin D concentrations and breast cancer incidence among Black/African American and non-Black Hispanic/Latina women.

Authors:  Katie M O'Brien; Quaker E Harmon; Chandra L Jackson; Mary V Diaz-Santana; Jack A Taylor; Clarice R Weinberg; Dale P Sandler
Journal:  Cancer       Date:  2022-04-25       Impact factor: 6.921

4.  Influence of vitamin D on cancer risk and treatment: Why the variability?

Authors:  M Rita I Young; Ying Xiong
Journal:  Trends Cancer Res       Date:  2018

5.  Influences of Vitamin D Levels and Vitamin D-Binding Protein Polymorphisms on Nonalcoholic Fatty Liver Disease Risk in a Chinese Population.

Authors:  Min Wang; Minxian Wang; Ru Zhang; Chao Shen; Liuxin Zhang; Yajie Ding; Zongzhe Tang; Hongliang Wang; Wei Zhang; Yue Chen; Jie Wang
Journal:  Ann Nutr Metab       Date:  2022-01-31       Impact factor: 5.923

Review 6.  Vitamin D Deficiency in the Gulf Cooperation Council: Exploring the Triad of Genetic Predisposition, the Gut Microbiome and the Immune System.

Authors:  Parul Singh; Manoj Kumar; Souhaila Al Khodor
Journal:  Front Immunol       Date:  2019-05-10       Impact factor: 7.561

7.  Genetic variation in GC and CYP2R1 affects 25-hydroxyvitamin D concentration and skeletal parameters: A genome-wide association study in 24-month-old Finnish children.

Authors:  Anders Kämpe; Maria Enlund-Cerullo; Saara Valkama; Elisa Holmlund-Suila; Jenni Rosendahl; Helena Hauta-Alus; Minna Pekkinen; Sture Andersson; Outi Mäkitie
Journal:  PLoS Genet       Date:  2019-12-16       Impact factor: 5.917

8.  Vitamin D moderates the interaction between 5-HTTLPR and childhood abuse in depressive disorders.

Authors:  Sarah Bonk; Johannes Hertel; Helena U Zacharias; Jan Terock; Deborah Janowitz; Georg Homuth; Matthias Nauck; Henry Völzke; Henriette Meyer Zu Schwabedissen; Sandra Van der Auwera; Hans Jörgen Grabe
Journal:  Sci Rep       Date:  2020-12-28       Impact factor: 4.379

9.  Genetic Association between Serum 25-Hydroxyvitamin D Levels and Lung Function in Korean Men and Women: Data from KNHANES 2011⁻2012.

Authors:  Clara Yongjoo Park; So-Young Kwak; Garam Jo; Min-Jeong Shin
Journal:  Nutrients       Date:  2018-09-23       Impact factor: 5.717

10.  Association of SNPs in GC and CYP2R1 with total and directly measured free 25-hydroxyvitamin D in multi-ethnic postmenopausal women in Saudi Arabia.

Authors:  Shatha Alharazy; Muhammad Imran Naseer; Eman Alissa; M Denise Robertson; Susan Lanham-New; Mohammad H Alqahtani; Adeel G Chaudhary
Journal:  Saudi J Biol Sci       Date:  2021-05-04       Impact factor: 4.219

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.