| Literature DB >> 21118518 |
Olivier Harismendy1, Vikas Bansal, Gaurav Bhatia, Masakazu Nakano, Michael Scott, Xiaoyun Wang, Colette Dib, Edouard Turlotte, Jack C Sipe, Sarah S Murray, Jean Francois Deleuze, Vineet Bafna, Eric J Topol, Kelly A Frazer.
Abstract
BACKGROUND: Targeted re-sequencing of candidate genes in individuals at the extremes of a quantitative phenotype distribution is a method of choice to gain information on the contribution of rare variants to disease susceptibility. The endocannabinoid system mediates signaling in the brain and peripheral tissues involved in the regulation of energy balance, is highly active in obese patients, and represents a strong candidate pathway to examine for genetic association with body mass index (BMI).Entities:
Mesh:
Substances:
Year: 2010 PMID: 21118518 PMCID: PMC3156957 DOI: 10.1186/gb-2010-11-11-r118
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1BMI distribution in the CRESCENDO cohort (grey) and in the selected controls (147 samples with BMI ≤ 30 kg/m.
Figure 2Sequence coverage distribution. (a,b) Genome Browser tracks showing locations of the 40 LR-PCR amplicons (black rectangles), the number of samples with coverage below 20× (blue histogram, 100-bp windows) and GC percent (red histogram, 10-bp windows) along the FAAH (a) and MGLL (b) re-sequenced intervals. The ends of the intervals have lower coverage due to the fact they were amplified by a single amplicon. The 5' end of the FAAH gene was successfully amplified but coverage is low due to difficulty sequencing high GC content regions. The high GC content at the 5' end of the MGLL gene resulted in an inability to successfully design PCR primer pairs despite several attempts. (c) Distribution of the fraction of bases (y-axis) sequenced at increasing usable coverage (x-axis) for sequence-based association studies. Usable coverage is defined at each base as the minimum coverage reached by 90% or more of the samples.
Coding sequence variants in the two genes and SIFT analysis
| Coordinate | Alleles | Gene | Codon changea | Amino acid change | dbSNP | Coding type | SIFT prediction | SIFT score | MAF | Number observed |
|---|---|---|---|---|---|---|---|---|---|---|
| Chr1_46643348 | C/A | CCA-aCA | P129T | rs324420 | Non-synonymous | Tolerated | 0.46 | 0.216 | 123 | |
| Chr1_46643944 | G/A | GGG-aGG | G226R | Novel | Non-synonymous | Tolerated | 0.15 | 0.002 | 1 | |
| Chr1_46643960 | C/T | CCC-CtC | P231L | Novel | Non-synonymous | Tolerated | 0.63 | 0.002 | 1 | |
| Chr1_46643996 | G/A | CGC-CaC | R243H | Novel | Non-synonymous | Damaging | 0 | 0.003 | 2 | |
| Chr1_46644333 | G/A | GAG-GAa | E274E | Novel | Synonymous | Tolerated | 0.96 | 0.052 | 30 | |
| Chr1_46644573 | T/C | TGT-TGc | C299C | rs324419 | Synonymous | Tolerated | 1 | 0.176 | 101 | |
| Chr1_46646834 | G/A | GCG-GCa | A356A | rs45476901 | Synonymous | Tolerated | 1 | 0.002 | 1 | |
| Chr3_128893754 | C/T | GCA-aCA | A307T | Novel | Non-synonymous | Tolerated | 0.55 | 0.003 | 2 | |
| Chr3_128893854 | A/C | ATT-ATg | I273M | Novel | Non-synonymous | Tolerated | 0.13 | 0.002 | 1 | |
| Chr3_128896571 | T/C | CTA-CTg | L251L | rs4881 | Synonymous | Tolerated | 1 | 0.073 | 42 | |
| Chr3_128922669 | C/T | GCA-aCA | A143T | Novel | Non-synonymous | Tolerated | 0.36 | 0.002 | 1 | |
| Chr3_128983328 | C/T | GAC-aAC | D86N | Novel | Non-synonymous | Damaging | 0 | 0.002 | 1 | |
| Chr3_129023325 | C/T | CGG-CGa | R19R | rs11538698 | Synonymous | Tolerated | 0.86 | 0.052 | 30 | |
| Chr3_129023335 | G/A | TCC-TtC | S16F | Novel | Non-synonymous | Damaging | 0.01 | 0.002 | 1 |
aChanging nucleotide indicated as lower case.
Concordance of the sequence-derived genotype calls with genotypes from the MassARRAY genotyping for 19 SNPs
| Hardy-Weinberg statistic | |||||||
|---|---|---|---|---|---|---|---|
| SNP rsID | Number matching genotype | Number of under-callsa | Number of over-callsb | Number of N/Nc | Sequencing | MassARRAY | Hidden variant |
| rs594323 | 253 | 1 | 33 | 2 | 1.2 | 4.2 | SNP at 22 bp |
| rs9759081 | 272 | 15 | 2 | 0 | 0 | 8.6 | SNP at 19 bp |
| rs9852837 | 276 | 0 | 13 | 0 | 0.2 | 0.01 | Indel at 38 bp |
| rs4141964 | 287 | 1 | 0 | 1 | 1.6 | 1.2 | - |
| rs324419 | 287 | 2 | 0 | 0 | 0.7 | 0.9 | - |
| rs17203666 | 289 | 0 | 0 | 0 | 0.6 | 0.6 | - |
| rs11715363 | 286 | 3 | 0 | 0 | 0.1 | 0 | - |
| rs17203659 | 288 | 1 | 0 | 0 | 0.2 | 0 | - |
| rs6778770 | 287 | 0 | 1 | 0 | 0.2 | 0.3 | - |
| rs17282181 | 283 | 4 | 2 | 0 | 0 | 0.2 | - |
| rs497897 | 287 | 1 | 1 | 0 | 1.2 | 0.2 | - |
| rs567384 | 283 | 4 | 2 | 0 | 2.3 | 0.6 | - |
| rs3773155 | 286 | 3 | 0 | 0 | 0 | 0 | - |
| rs3773159 | 286 | 1 | 2 | 0 | 2.3 | 2.1 | - |
| rs13076593 | 288 | 1 | 0 | 0 | 0.8 | 0.1 | - |
| rs936839 | 288 | 1 | 0 | 0 | 1.1 | 1.2 | - |
| rs13066225 | 289 | 0 | 0 | 0 | 0.1 | 0.1 | - |
| rs324420 | 289 | 0 | 0 | 0 | 0 | 0 | - |
| rs7652615 | 289 | 0 | 0 | 0 | 0.4 | 0.4 | - |
aGenotype called as reference homozygote by sequencing and heterozygote by MassARRAY or heterozygote by sequencing and alternative homozygote by MassARRAY. bGenotype called as heterozygote by sequencing and reference homozygote by MassARRAY or alternative homozygote by sequencing and heterozygote by MassARRAY. cUncalled genotype or tri-allelic in one of the two.
Figure 3Quality control of SNV identification. Distribution of the matching status of 1,697 genotypes obtained from the 9 replicated samples.
List of variants associated with high BMI by single marker tests
| MAF | Chi-square | Permutation | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| LD block | SNV-ID | Chr 3 coordinate | Gene location | Minor/major alleles | Cohort | Cases | Controls | OR | ||
| Left | rs16830415 | 128956957 | Intron3 | C/T | 0.028 | 0.045 | 0.010 | 9.95E-03 | 4.59 | 8.00E-03 |
| Chr3_128957192 | 128957192 | Intron3 | G/T | 0.028 | 0.045 | 0.010 | 9.95E-03 | 4.59 | 8.00E-03 | |
| Chr3_128958587 | 128958587 | Intron3 | C/T | 0.043 | 0.066 | 0.021 | 6.70E-03 | 3.39 | 5.00E-03 | |
| Chr3_128958866 | 128958866 | Intron3 | -/T | 0.08 | 0.049 | 0.1103 | 7.07E-03 | 0.41 | 7.40E-03 | |
| rs9832418 | 128961356 | Intron3 | C/T | 0.028 | 0.045 | 0.010 | 9.95E-03 | 4.59 | 8.00E-03 | |
| rs547801a | 128964929 | Intron3 | T/C | 0.029 | 0.049 | 0.010 | 5.93E-03 | 4.96 | 5.00E-03 | |
| rs520154a | 128965687 | Intron3 | A/G | 0.028 | 0.049 | 0.007 | 2.04E-03 | 7.46 | 1.20E-03 | |
| rs60963555 | 128967982 | Intron3 | T/C | 0.026 | 0.045 | 0.007 | 3.52E-03 | 6.91 | 1.60E-03 | |
| rs684358b | 128969940 | Intron3 | G/T | 0.028 | 0.049 | 0.007 | 2.04E-03 | 7.46 | 1.20E-03 | |
| rs9852837 | 128973744 | Intron3 | A/G | 0.028 | 0.045 | 0.010 | 9.95E-03 | 4.59 | 1.16E-02 | |
| Right | rs9289319 | 129009856 | Intron2 | G/A | 0.192 | 0.138 | 0.243 | 1.42E-03 | 0.50 | 1.80E-03 |
| rs9289320 | 129010946 | Intron2 | G/C | 0.192 | 0.143 | 0.240 | 3.27E-03 | 0.53 | 6.00E-03 | |
| rs9289321 | 129011459 | Intron2 | A/G | 0.165 | 0.123 | 0.206 | 7.84E-03 | 0.54 | 9.80E-03 | |
| rs9877819c | 129012220 | Intron2 | A/G | 0.164 | 0.122 | 0.206 | 7.03E-03 | 0.54 | 7.40E-03 | |
| rs28753886 | 129013477 | Intron2 | A/G | 0.163 | 0.119 | 0.206 | 4.79E-03 | 0.52 | 5.60E-03 | |
| rs35948688 | 129014938 | Intron2 | C/T | 0.159 | 0.112 | 0.206 | 2.10E-03 | 0.49 | 2.40E-03 | |
| rs874546c | 129021102 | Intron2 | G/A | 0.183 | 0.140 | 0.226 | 8.99E-03 | 0.56 | 1.00E-02 | |
| rs2011138 | 129026619 | Upstream | A/C | 0.352 | 0.412 | 0.295 | 3.18E-03 | 1.68 | 4.40E-03 | |
| Chr3_129026621 | 129026621 | Upstream | A/G | 0.049 | 0.021 | 0.075 | 2.65E-03 | 0.27 | 4.00E-03 | |
| Chr3_129029015 | 129029015 | Upstream | A/G | 0.336 | 0.398 | 0.276 | 1.98E-03 | 1.74 | 3.60E-03 | |
aPresent on the Affymetrix 500 k genotyping array. bAlso part of a locus-variant associated with high BMI using the collapsed marker test RareCover (Table S5 in Additional file 1). cPresent on the Illumina HumanHap300 genotyping array. LD, linkage disequilibrium.
Figure 4Association with BMI. (a) Significance of the association with BMI identified by single marker tests (-log10(chi-square P-value)) for all SNVs located in the MGLL interval (x-axis, NCBI36 coordinates). SNPs with a P-value < 0.01 are highlighted in red. The recombination rate [56] in the HapMap CEU population for this region is indicated by a blue line and measured on the right axis. (b,c) Significance of the association with BMI for all locus-variants identified by RareCover (see Materials and methods) in the MGLL (a) and FAAH (b) sequenced intervals. For both genes, locus-variants with a P-value < 0.01 are highlighted in red. The MGLL and FAAH gene structures are aligned based on their genomic positions.
Figure 5Functional annotation of the associated variants in the . Track A: variants identified by the single marker tests (blue bars) or the merge of the locus-variants identified by the collapsed marker test RareCover (red boxes) are indicated. Track B: predicted intervals for promoters (green) or transcriptional enhancers (orange) in HeLa cells [45]. Tracks C and D: the distribution of chromatin binding proteins from the ENCODE data obtained from the UCSC genome browser (15 November 2009). Track C: Broad/MGH ENCODE group chromatin signatures corresponding to enhancers (H3K4me1, H3K27ac) and promoters (H3K4me1+3, Pol2) in various cell types (GM12878, HUVEC, K562, Keratinocytes) [46]. Track D: Yale/UCD/Harvard ENCODE group identified AP2α and γ, c-Myc, Max, c-Fos and Pol2 binding sites in HeLa cells, and STAT1 binding sites in HeLa treated with IFNγ [46].
Average endocannabinoid levels of FAAH and MGLL rare variant carrier groups at the most significant locus-variants in the three intervals
| AEA | 2-AG | |||||||
|---|---|---|---|---|---|---|---|---|
| N | Average (pmol/ml) | SD | N | Average (pmol/ml) | SD | |||
| 0.05 | 0.10 | |||||||
| Carrier | ||||||||
| Cases | 14 | 17.11 | 5.79 | 14 | 10.55 | 10.67 | ||
| Controls | 0 | NA | NA | 0 | NA | NA | ||
| Non-carrier | ||||||||
| Cases | 80 | 15.13 | 5.34 | 67 | 5.63 | 3.04 | ||
| Controls | 48 | 13.76 | 5.51 | 35 | 6.98 | 4.15 | ||
| 0.49 | 0.80 | |||||||
| Carrier | ||||||||
| Cases | 5 | 15.6 | 6.35 | 4 | 6.43 | 2.08 | ||
| Controls | 0 | NA | NA | 0 | NA | NA | ||
| Non-carrier | ||||||||
| Cases | 89 | 15.24 | 5.44 | 77 | 6.49 | 5.57 | ||
| Controls | 48 | 13.76 | 5.52 | 35 | 6.99 | 4.16 | ||
| 0.36 | 0.37 | |||||||
| Carrier | ||||||||
| Cases | 26 | 15.09 | 5.6 | 24 | 6.31 | 3.06 | ||
| Controls | 3 | 13.16 | 5.38 | 3 | 4.41 | 4.46 | ||
| Non-carrier | ||||||||
| Cases | 68 | 15.56 | 5.43 | 57 | 6.56 | 6.2 | ||
| Controls | 45 | 13.8 | 5.58 | 32 | 7.22 | 4.11 | ||
aTwo-tailed t-test P-value between carriers/cases and non-carriers/controls. NA, not available.