| Literature DB >> 23754956 |
Niels Grarup1, Patrick Sulem, Camilla H Sandholt, Gudmar Thorleifsson, Tarunveer S Ahluwalia, Valgerdur Steinthorsdottir, Helgi Bjarnason, Daniel F Gudbjartsson, Olafur T Magnusson, Thomas Sparsø, Anders Albrechtsen, Augustine Kong, Gisli Masson, Geng Tian, Hongzhi Cao, Chao Nie, Karsten Kristiansen, Lise Lotte Husemoen, Betina Thuesen, Yingrui Li, Rasmus Nielsen, Allan Linneberg, Isleifur Olafsson, Gudmundur I Eyjolfsson, Torben Jørgensen, Jun Wang, Torben Hansen, Unnur Thorsteinsdottir, Kari Stefánsson, Oluf Pedersen.
Abstract
Genome-wide association studies have mainly relied on common HapMap sequence variations. Recently, sequencing approaches have allowed analysis of low frequency and rare variants in conjunction with common variants, thereby improving the search for functional variants and thus the understanding of the underlying biology of human traits and diseases. Here, we used a large Icelandic whole genome sequence dataset combined with Danish exome sequence data to gain insight into the genetic architecture of serum levels of vitamin B(12) (B12) and folate. Up to 22.9 million sequence variants were analyzed in combined samples of 45,576 and 37,341 individuals with serum B(12) and folate measurements, respectively. We found six novel loci associating with serum B(12) (CD320, TCN2, ABCD4, MMAA, MMACHC) or folate levels (FOLR3) and confirmed seven loci for these traits (TCN1, FUT6, FUT2, CUBN, CLYBL, MUT, MTHFR). Conditional analyses established that four loci contain additional independent signals. Interestingly, 13 of the 18 identified variants were coding and 11 of the 13 target genes have known functions related to B(12) and folate pathways. Contrary to epidemiological studies we did not find consistent association of the variants with cardiovascular diseases, cancers or Alzheimer's disease although some variants demonstrated pleiotropic effects. Although to some degree impeded by low statistical power for some of these conditions, these data suggest that sequence variants that contribute to the population diversity in serum B(12) or folate levels do not modify the risk of developing these conditions. Yet, the study demonstrates the value of combining whole genome and exome sequencing approaches to ascertain the genetic and molecular architectures underlying quantitative trait associations.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23754956 PMCID: PMC3674994 DOI: 10.1371/journal.pgen.1003530
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Schematic overview of the study.
Novel and previously reported genomic loci that associate with serum B12 levels at P<2.2×10−9.
| SNV name | Locus | Chr. | Position (build 36/hg18) | Annotation | Alleles | EAF | Icelandic | Danish – Inter99 | Danish – Health2006 | Combined | |||||
| Effect |
| Effect |
| Effect |
| N |
|
| |||||||
|
| |||||||||||||||
| rs2336573 |
| 19 | 8,273,709 | G220R | T/ | 0.031 | 0.32 | 1.1×10−51 | 0.22 | 0.0057 | 0.31 | 1.7×10−8 | 45,575 | 8.4×10−59 | 41 (0.033) |
| rs1131603 |
| 22 | 29,348,975 | L376S | C/ | 0.055 | 0.19 | 4.3×10−28 | 0.33 | 1.8×10−9 | 0.33 | 5.3×10−17 | 45,575 | 4.9×10−49 | 62 (0.0050) |
| rs3742801 |
| 14 | 73,828,759 | E368K | T/ | 0.294 | 0.045 | 5.3×10−8 | 0.093 | 7.6×10−4 | 0.083 | 4.5×10−5 | 45,571 | 1.7×10−13 | 0 (0.20) |
| rs2270655 |
| 4 | 146,795,868 | Q363H |
| 0.941 | 0.066 | 3.5×10−5 | 0.30 | 2.8×10−7 | 0.25 | 5.8×10−8 | 45,576 | 2.2×10−13 | 79 (7.1×10−5) |
| rs12272669 |
| 1 | 45,747,242 | R206Q | A/ | 0.0022 | 0.51 | 3.0×10−9 | - | - | - | - | - | - | - |
|
| |||||||||||||||
| rs34324219 |
| 11 | 59,379,954 | D301Y |
| 0.881 | 0.21 | 8.8×10−71 | 0.40 | 3.2×10−23 | 0.30 | 3.5×10−24 | 45,576 | 1.1×10−111 | 70 (0.001) |
| rs7788053 |
| 19 | 5,783,209 | P124S | A/ | 0.254 | 0.046 | 2.1×10−7 | 0.050 | 0.076 | 0.070 | 0.00072 | 45,575 | 1.7×10−10 | 0 (0.64) |
|
| |||||||||||||||
| rs602662 |
| 19 | 53,898,797 | G258S | A/ | 0.596 | 0.16 | 4.1×10−96 | 0.19 | 3.5×10−13 | 0.23 | 1.9×10−34 | 45,568 | 2.4×10−139 | 0 (0.14) |
| rs1801222 |
| 10 | 17,196,157 | F253S | G/ | 0.593 | 0.11 | 1.1×10−52 | 0.14 | 7.6×10−8 | 0.17 | 2.9×10−18 | 45,576 | 3.3×10−75 | 0 (0.48) |
| rs41281112 |
| 13 | 99,316,635 | R259X |
| 0.948 | 0.17 | 9.6×10−27 | 0.24 | 0.0013 | 0.29 | 2.5×10−7 | 45,576 | 8.9×10−35 | 0 (0.90) |
| rs1141321 |
| 6 | 49,520,392 | R532H |
| 0.627 | 0.061 | 1.4×10−16 | 0.12 3 | 1.4×10−5 | 0.11 | 1.4×10−7 | 45,574 | 3.6×10−26 | 0 (0.24) |
Association results for serum B12 in Icelandic and Danish study samples separately and combined. The effect allele is the allele associated with increased serum B12 levels. The effect is on a quantile normalized scale. Data were combined in fixed effect meta-analyses based on P-value and direction of effect adjusted for the number of individuals in each sample. Values of I 2 are percentages.
Chr., chromosome; EAF, effect allele frequency; HET, heterogeneity; SNV, single nucleotide variant.
The annotation is based on the RefSeq hg18.
The reference alleles based on Build 36 hg18 are shown in bold.
In the Icelandic data the strongest signal at the FUT6 locus is for rs708686 located 5′ to the FUT6 gene (see Table S3).
Danish data are given for the perfect proxy rs4267943 (1000 Genomes data: r 2 = 1.0).
Novel and previously reported genomic loci that associate with serum folate levels at P<2.2×10−9.
| SNV name | Locus | Chr. | Position (build 36) | Annotation | Alleles | EAF | Icelandic | Danish – Inter99 | Danish – Health2006 | Combined | |||||
| Effect |
| Effect |
| Effect |
| N |
|
| |||||||
|
| |||||||||||||||
| rs652197 |
| 11 | 71,527,389 | Intron |
| 0.179 | 0.069 | 2.5×10−10 | 0.066 | 0.011 | 0.071 | 0.043 | 37,465 | 1.4×10−12 | 0 (0.77) |
|
| |||||||||||||||
| rs1801133 |
| 1 | 11,778,965 | A222V |
| 0.668 | 0.096 | 1.0×10−28 | 0.18 | 1.1×10−10 | 0.18 | 5,9×10−19 | 37,337 | 9.5×10−53 | 62 (0.005) |
Association results for serum folate in Icelandic and Danish study samples separately and combined. The effect allele is the allele associated with increased serum folate levels. The effect is on a quantile normalized scale. Data were combined in fixed effect meta-analyses based on P-value and direction of effect adjusted for the number of individuals in each sample. Values of I 2 are percentages. Association between serum folate levels and MTHFR rs1801133 in the Inter99 cohort has been published previously [14]. Chr., chromosome; EAF, effect allele frequency; HET, heterogeneity; SNV, single nucleotide variant.
The annotation is based on the RefSeq hg18.
The reference allele based on Build 36 hg18 is shown in bold.
In the Icelandic data a 2 bp INDEL in exon 3 of FOLR3 associated more strongly with serum folate levels. As only SNVs were analyzed in the Danish data this data was not available for the Danish samples.
The rs652197 variant was initially discovered in the Icelandic samples but subsequently genotyped in Danish samples to confirm the association.
Figure 2Regional plots illustrating conditional analyses of loci with more than one independent association signal for serum B12 (CUBN, TCN1 and TCN2) or serum folate (MTHFR).
Genotyped and imputed SNVs passing quality control measures in the Icelandic data are plotted with their P-values (as −log10 values) as a function of genomic position (NCBI Build 36). Only SNVs with P<10−5 in at least one of the models are shown. The analyses were performed in 25,960 and 20,717 chip-genotyped Icelanders for B12 and folate, respectively. Data points illustrated by open circles represent unconditional analyses (M0); blue dots are results of analyses conditional on the most significant SNV in M0 (M1) and orange dots are results of analyses conditional on most significant SNVs in M0 and M1. Estimated recombination rates (HapMap CEU) are plotted to reflect the local LD structure. Gene annotations were obtained from RefGene.
Novel secondary association signals at the serum B12 or serum folate loci that associate at P<5×10−8.
| SNV # | SNV name | Chr. | Position (build 36) | Gene | Annotation | Alleles (effect/other) | EAF | Effect |
| LD with SNV #1 ( | LD with SNV #2 ( |
|
| |||||||||||
| 1 | rs1801222 | 10 | 17,196,157 |
| F253S | G/A | 0.593 | 0.11 | 2.3×10−42 | ||
| 2 | rs56077122 | 10 | 17,247,021 |
| intronic | A/C | 0.335 | 0.087 | 4.8×10−21 | 0.033 | |
|
| |||||||||||
| 1 | rs34324219 | 11 | 59,379,954 |
| D301Y | C/A | 0.889 | 0.21 | 9.8×10−62 | ||
| 2 | rs34528912 | 11 | 59,388,111 |
| R35H | T/C | 0.0361 | 0.17 | 2.1×10−15 | 0.0040 | |
| 3 | rs117456053 | 11 | 59,373,407 | Near | Intergenic | G/A | 0.976 | 0.16 | 1.9×10−9 | 0.0035 | 0.0011 |
|
| |||||||||||
| 1 | rs1131603 | 22 | 29,348,975 |
| L376S | C/T | 0.055 | 0.17 | 1.1×10−21 | ||
| 2 | rs5753231 | 22 | 29,333,069 |
| 5′ | C/T | 0.79 | 0.064 | 7.5×10−10 | 0.014 | |
|
| |||||||||||
| 1 | rs1801133 | 1 | 11,778,965 |
| A222V | G/A | 0.668 | 0.10 | 3.4×10−27 | ||
| 2 | rs17421511 | 1 | 11,780,375 |
| Intronic | G/A | 0.827 | 0.098 | 1.8×10−15 | 0.11 | |
Conditional analyses were performed using imputed sequence data from chip-genotyped Icelanders with information on serum B12 or folate levels. Results for SNV #1 (lead SNVs) at each loci are unconditional on other SNVs. Analysis of SNV #2 is conditional on SNV #1 and SNV #3 is conditional on SNV #1 and #2. The LD between the SNVs at each locus was estimated from the sequence information of the 1,179 whole genome sequenced Icelanders.
Figure 3Genes that associate with serum B12 and folate levels are in pathways affecting their metabolism.
Genes previously identified to harbor variants regulating serum levels of B12 are shown in green. In blue are novel genes identified in the present study. In red, genes containing variants previously suggested to associate with serum folate and in purple are novel genes for serum folate. B12: vitamin B12; HC: Heptocorrin (TCN1); IF: Intrinsic factor; R-A-P: Receptor-Associated-Protein; CUBN: cubilin (intrinsic factor-cobalamin receptor); TCII: Transcobalamin II (TCN2); TCII-R: Transcobalamin II receptor (CD320); MMACHC: methylmalonic aciduria (cobalamin deficiency) cblC type, with homocystinuria; MMAA: methylmalonic aciduria (cobalamin deficiency) cblA type; ABDC4: ATP-binding cassette, sub-family D (ALD), member 4; LMBD1: LMBR1 domain containing 1; FOLR1–3: folate receptors 1–3; Ado-B12: Adenosyl-cobalamin; Me-B12: Methyl-cobalamin; Me-mal-CoA: Methyl-malonyl-CoenzymeA; Suc-CoA: Succinyl-CoenzymeA; MUT: methylmalonyl-CoA mutase; H. pylori: Helicobacter pylori; DHFR: Dihydrofolate reductase; MS: methionin synthase; THF: Tetrahydrofolate; 5,10-MTHF: 5,10-Methyl-tetrahydrofolate; Hcy: Homocysteine MTHFR: 5-methyl-tetrahydrafolate reductase.