| Literature DB >> 34419169 |
Anna Díez-Villanueva1,2,3, Mireia Jordà4, Robert Carreras-Torres1,2,3, Henar Alonso1,2,3,5, David Cordero1,2,3, Elisabet Guinó1,2,3, Xavier Sanjuan2,6, Cristina Santos2,7,8, Ramón Salazar2,5,7,8, Rebeca Sanz-Pamplona9,10,11, Victor Moreno12,13,14,15.
Abstract
BACKGROUND: DNA methylation is involved in the regulation of gene expression and phenotypic variation, but the inter-relationship between genetic variation, DNA methylation and gene expression remains poorly understood. Here we combine the analysis of genetic variants related to methylation markers (methylation quantitative trait loci: mQTLs) and gene expression (expression quantitative trait loci: eQTLs) with methylation markers related to gene expression (expression quantitative trait methylation: eQTMs), to provide novel insights into the genetic/epigenetic architecture of colocalizing molecular markers.Entities:
Keywords: DNA methylation; Epigenetic regulation; Gene expression; Genetic and epigenetic control; Genetics; eQTLs; eQTMs; mQTLs
Mesh:
Year: 2021 PMID: 34419169 PMCID: PMC8380335 DOI: 10.1186/s13148-021-01148-9
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Fig. 1Scheme of the performed analysis. SNPs filt: Only SNPs in autosomal chromosomes and with a MAF between 0.05 and 0.95 have been considered. Duplicated SNPs and SNPs with more than 10% of missing values have been removed. CpGs filt: SNPs CpGs or CpGs with missing values have been removed. Only CpGs in autosomal chromosomes have been included. CpGs with a standard deviation greater than 0.05 have been filtered in. Genes filt: Only genes in autosomal chromosomes and with a standard deviation greater than 0.05 have been considered. All the analysis has been adjusted by sex, age and site and the maximum distance between elements was 1 Mb
Fig. 2Causal relationship models of the 12,720 meQTLs in normal tissues
Number of significant associations identified
| Normal ( | Adjacent ( | Tumor ( | |
|---|---|---|---|
| 439,043 | 227,934 | 56,666 | |
| independent mQTLs | 8195 | 4229 | 840 |
| CpGs | 6713 | 4167 | 850 |
| CpGs blocks | 4524 | 2845 | 645 |
| SNPs | 246,758 | 141,207 | 38,751 |
| SNPs blocks | 8064 | 4138 | 840 |
| 557 | 290 | 1732 | |
| independent eQTMs | 165 | 78 | 490 |
| Genes | 158 | 78 | 487 |
| CpGs | 482 | 253 | 1563 |
| CpGs blocks | 155 | 75 | 466 |
| 32,446 | 17,274 | 8530 | |
| independent eQTLs | 658 | 279 | 82 |
| Genes | 374 | 220 | 80 |
| SNPs | 31,482 | 17,070 | 8395 |
| SNPs blocks | 650 | 274 | 79 |
| 13,987 | 5517 | 1926 | |
| independent meQTLs | 54 | 19 | 6 |
| Genes | 52 | 19 | 6 |
| CpGs | 114 | 45 | 16 |
| CpGs blocks | 51 | 19 | 6 |
| SNPs | 6850 | 2720 | 1231 |
| SNPs blocks | 54 | 19 | 6 |
Fig. 3Methylation distribution in Normal tissues. Distribution of methylation median of A 211,268 variable CpGs, B 6713 CpGs in mQTLs, C 482 CpGs in eQTMs and D 114 CpGs in meQTLs
Fig. 4CpG distribution by gene region context. Proportion of CpGs by gene region context. A 211,268 variable CpGs associated with their nearest gene, B 6713 CpGs in mQTLs associated with their nearest gene, C 482 CpGs in eQTMs associated with their nearest gene, D 482 CpGs in eQTMs associated with the correlated gene, E 114 CpGs in meQTLs associated with their nearest gene and F 114 CpGs in meQTLs associated with the correlated gene
Fig. 5Boxplot of the correlation between gene and CpG by gene region context. A 557 eQTMs, B 13,987 eQTMs in meQTLs and C 119 unique eQTMs in meQTLs
haploReg results
| eQTLs | mQTLs | meQTLs | Random SNPs | Association | Name | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total number | 31,482 | 246,758 | 6850 | 150,000 | – | – | – | – | – | – | – | – | |
| Conservation | GERP | 1049 (3.66%) | 8112 (3.59%) | 148 (2.35%) | 6349 (4.65%) | 5.04e−14 | 2.36e−55 | 8.57e−21 | – | – | – | – | – |
| Siphy | 716 (2.5%) | 5305 (2.35%) | 116 (1.84%) | 3619 (2.65%) | 1.43e−01 | 1.29e−08 | 4.42e−05 | – | – | – | – | – | |
| Chromatin states | 1_TssA | 195 (0.68%) | 1057 (0.47%) | 32 (0.51%) | 222 (0.16%) | 1.03e−43 | 8.55e−57 | 1.19e−07 | 1.74e−42 | 1.45e−55 | 2.03e−06 | Promoter-associated | Active transcription start site |
| 2_PromU | 203 (0.71%) | 1598 (0.71%) | 62 (0.98%) | 766 (0.56%) | 3.78e−03 | 9.90e−08 | 8.23e−05 | 6.42e−02 | 1.68e−06 | 1.40e−03 | Promoter-associated | Promoter upstream transcription start site | |
| 3_PromD1 | 138 (0.48%) | 841 (0.37%) | 68 (1.08%) | 375 (0.27%) | 6.39e−08 | 7.11e−07 | 5.56e−19 | 1.09e−06 | 1.21e−05 | 9.46e−18 | Promoter-associated | Promoter downstream transcription start site 1 | |
| 4_PromD2 | 129 (0.45%) | 1195 (0.53%) | 31 (0.49%) | 300 (0.22%) | 1.01e−10 | 5.71e−49 | 1.23e−04 | 1.72e−09 | 9.71e−48 | 2.10e−03 | Promoter-associated | Promoter downstream transcription start site 2 | |
| 22_PromP | 90 (0.31%) | 924 (0.41%) | 29 (0.46%) | 306 (0.22%) | 6.32e−03 | 1.71e−21 | 6.99e−04 | 1.08e−01 | 2.91e−20 | 1.19e−02 | Promoter-associated | Poised promoter | |
| 23_PromBiv | 218 (0.76%) | 2482 (1.1%) | 47 (0.75%) | 1227 (0.9%) | 2.33e−02 | 5.59e−09 | 2.18e−01 | 3.96e−01 | 9.51e−08 | 1.00e + 00 | Promoter-associated | Bivalent promoter | |
| 10_TxEnh5 | 299 (1.04%) | 2394 (1.06%) | 83 (1.32%) | 542 (0.4%) | 1.08e−36 | 4.22e−114 | 8.44e−19 | 1.83e−35 | 7.17e−113 | 1.44e−17 | Enhancer-associated | Transcribed 5′preferential and enhancer | |
| 11_TxEnh3 | 417 (1.46%) | 2332 (1.03%) | 92 (1.46%) | 474 (0.35%) | 7.80e−92 | 5.23e−129 | 5.68e−27 | 1.33e−90 | 8.89e−128 | 9.66e−26 | Enhancer-associated | Transcribed 3′preferential and enhancer | |
| 12_TxEnhW | 171 (0.6%) | 840 (0.37%) | 21 (0.33%) | 259 (0.19%) | 5.81e−28 | 2.28e−23 | 1.86e−02 | 9.87e−27 | 3.87e−22 | 3.15e−01 | enhancer-associated | Transcribed and weak enhancer | |
| 13_EnhA1 | 219 (0.76%) | 1165 (0.52%) | 111 (1.76%) | 365 (0.27%) | 4.68e−31 | 9.22e−31 | 1.30e−47 | 7.96e−30 | 1.57e−29 | 2.21e−46 | Enhancer-associated | Active enhancer 1 | |
| 14_EnhA2 | 124 (0.43%) | 1554 (0.69%) | 54 (0.86%) | 388 (0.28%) | 8.46e−05 | 6.00e−64 | 2.32e−11 | 1.44e−03 | 1.02e−62 | 3.95e−10 | Enhancer-associated | Active enhancer 2 | |
| 15_EnhAF | 139 (0.49%) | 1421 (0.63%) | 30 (0.48%) | 434 (0.32%) | 3.16e−05 | 1.78e−39 | 4.04e−02 | 5.38e−04 | 3.02e−38 | 6.87e−01 | enhancer-associated | Active enhancer flank | |
| 16_EnhW1 | 155 (0.54%) | 1136 (0.5%) | 23 (0.37%) | 271 (0.2%) | 7.73e−21 | 9.90e−51 | 9.47e−03 | 1.31e−19 | 1.68e−49 | 1.61e−01 | Enhancer-associated | Weak enhancer 1 | |
| 17_EnhW2 | 247 (0.86%) | 1963 (0.87%) | 26 (0.41%) | 580 (0.43%) | 8.48e−19 | 3.79e−58 | 1.00e + 00 | 1.44e−17 | 6.44e−57 | 1.00e + 00 | Enhancer-associated | Weak enhancer 2 | |
| 18_EnhAc | 110 (0.38%) | 767 (0.34%) | 32 (0.51%) | 284 (0.21%) | 2.27e−07 | 3.67e−13 | 1.43e−05 | 3.86e−06 | 6.24e−12 | 2.43e−04 | Enhancer-associated | Primary H3K27ac–possible enhancer | |
| 9_TxReg | 129 (0.45%) | 1274 (0.56%) | 21 (0.33%) | 807 (0.59%) | 3.24e−03 | 2.97e−01 | 6.42e−03 | 5.50e−02 | 1.00e + 00 | 1.09e−01 | Promoter/enhancer associated | Transcribed and regulatory | |
| 19_DNase | 110 (0.38%) | 948 (0.42%) | 38 (0.6%) | 404 (0.3%) | 1.68e−02 | 2.31e−09 | 1.06e−04 | 2.85e−01 | 3.93e−08 | 1.80e−03 | Promoter/enhancer associated | Primary DNase | |
| Histone marks | H3K4me3_Pro | 3322 (11.6%) | 27,570 (12.21%) | 775 (12.3%) | 7896 (5.79%) | 7.20e−242 | 0.00e + 00 | 2.25e−79 | 2.88e−241 | 0.00e + 00 | 9.00e−79 | Promoter-associated | Histone H3 lysine 4 trimethylation |
| H3K9ac_Pro | 4356 (15.21%) | 30,036 (13.3%) | 1207 (19.16%) | 10,517 (7.71%) | 7.45e−313 | 0.00e + 00 | 1.21e−175 | 2.98e−312 | 0.00e + 00 | 4.83e−175 | Promoter-associated | Histone H3 lysine 9 acetylation | |
| H3K4me1_Enh | 3019 (10.54%) | 22,816 (10.1%) | 794 (12.6%) | 5877 (4.31%) | 0.00e + 00 | 0.00e + 00 | 7.46e−146 | 0.00e + 00 | 0.00e + 00 | 2.98e−145 | Enhancer-associated | Histone H3 lysine 4 monomethylation | |
| H3K27ac_Enh | 5179 (18.08%) | 43,529 (19.27%) | 1097 (17.41%) | 14,640 (10.73%) | 1.83e−241 | 0.00e + 00 | 5.42e−54 | 7.31e−241 | 0.00e + 00 | 2.17e−53 | Enhancer-associated | Histone H3 lysine 27 acetylation | |
| eQTLs | eQTLs | 20,199 (70.53%) | 52,029 (23.04%) | 5370 (85.22%) | 6452 (4.73%) | 0.00e + 00 | 0.00e + 00 | 0.00e + 00 | – | – | – | – | – |
SNPs involved in meQTLs found in the GWAS catalog
| GWAS | Normal | Adjacent | Tumor | |
|---|---|---|---|---|
| Traits | 1722 | 48 (3%) | 19 (1%) | 15 (1%) |
| SNPs | 48,881 | 64 (0.1%) | 26 (0%) | 15 (0%) |
| Age at menopause | 84 | 1 (1%) | 1 (1%) | 0 (0%) |
| Alzheimer's disease (late onset) | 55 | 1 (2%) | 1 (2%) | 1 (2%) |
| Asthma | 206 | 2 (1%) | 0 (0%) | 1 (0%) |
| Asthma or allergic disease (pleiotropy) | 36 | 1 (3%) | 0 (0%) | 0 (0%) |
| Blood metabolite levels | 195 | 1 (1%) | 1 (1%) | 0 (0%) |
| Blood protein levels | 2772 | 10 (0%) | 7 (0%) | 2 (0%) |
| Blood urea nitrogen levels | 111 | 1 (1%) | 0 (0%) | 0 (0%) |
| Childhood ear infection | 19 | 2 (11%) | 2 (11%) | 2 (11%) |
| Chronic lymphocytic leukemia | 73 | 1 (1%) | 0 (0%) | 1 (1%) |
| Circulating chemerin levels | 4 | 4 (100%) | 0 (0%) | 0 (0%) |
| Colorectal cancer | 116 | 3 (3%) | 1 (1%) | 1 (1%) |
| Colorectal cancer or advanced adenoma | 94 | 1 (1%) | 0 (0%) | 0 (0%) |
| Drug-induced liver injury (amoxicillin-clavulanate) | 2 | 1 (50%) | 1 (50%) | 0 (0%) |
| Educational attainment (MTAG) | 1320 | 1 (0%) | 0 (0%) | 0 (0%) |
| Eosinophil percentage of granulocytes | 179 | 1 (1%) | 1 (1%) | 1 (1%) |
| Hair color | 449 | 1 (0%) | 0 (0%) | 0 (0%) |
| Heart rate response to exercise | 20 | 1 (5%) | 1 (5%) | 1 (5%) |
| Heel bone mineral density | 2262 | 1 (0%) | 0 (0%) | 0 (0%) |
| High density lipoprotein cholesterol levels | 306 | 2 (1%) | 0 (0%) | 0 (0%) |
| Highest math class taken (MTAG) | 1084 | 1 (0%) | 0 (0%) | 0 (0%) |
| Intraocular pressure | 512 | 2 (0%) | 0 (0%) | 0 (0%) |
| Liver enzyme levels | 9 | 1 (11%) | 0 (0%) | 0 (0%) |
| Liver enzyme levels (gamma-glutamyl transferase) | 26 | 2 (8%) | 2 (8%) | 1 (4%) |
| Lumiracoxib-related liver injury | 1 | 1 (100%) | 0 (0%) | 0 (0%) |
| Mean platelet volume | 323 | 1 (0%) | 0 (0%) | 0 (0%) |
| Medication use (adrenergics, inhalants) | 55 | 1 (2%) | 0 (0%) | 0 (0%) |
| Menopause (age at onset) | 63 | 1 (2%) | 1 (2%) | 0 (0%) |
| Metabolite levels | 66 | 2 (3%) | 0 (0%) | 0 (0%) |
| Metabolite levels (small molecules and protein measures) | 32 | 1 (3%) | 0 (0%) | 0 (0%) |
| Multiple sclerosis | 158 | 1 (1%) | 0 (0%) | 0 (0%) |
| Multiple sclerosis (OCB status) | 6 | 2 (33%) | 1 (17%) | 1 (17%) |
| Oligoclonal band status in multiple sclerosis | 1 | 1 (100%) | 1 (100%) | 1 (100%) |
| Plasma homocysteine levels (post-methionine load test) | 5 | 1 (20%) | 1 (20%) | 0 (0%) |
| Platelet count | 323 | 1 (0%) | 0 (0%) | 0 (0%) |
| Plateletcrit | 258 | 1 (0%) | 0 (0%) | 0 (0%) |
| Pulse pressure | 747 | 1 (0%) | 0 (0%) | 0 (0%) |
| Red cell distribution width | 821 | 1 (0%) | 0 (0%) | 0 (0%) |
| S-phenylmercapturic acid levels in smokers | 1 | 1 (100%) | 0 (0%) | 0 (0%) |
| Serum metabolite levels | 76 | 1 (1%) | 1 (1%) | 0 (0%) |
| Systemic lupus erythematosus | 195 | 1 (1%) | 1 (1%) | 1 (1%) |
| Systolic blood pressure | 1393 | 1 (0%) | 0 (0%) | 0 (0%) |
| Triglyceride levels in current drinkers | 39 | 1 (3%) | 0 (0%) | 0 (0%) |
| Triglyceride levels x alcohol consumption (drinkers vs non-drinkers) interaction | 55 | 1 (2%) | 0 (0%) | 0 (0%) |
| Triglyceride levels x alcohol consumption (regular vs non-regular drinkers) interaction | 55 | 1 (2%) | 0 (0%) | 0 (0%) |
| Type 1 diabetes | 87 | 1 (1%) | 0 (0%) | 1 (1%) |
| Type 2 diabetes | 352 | 1 (0%) | 1 (0%) | 1 (0%) |
| Urinary 1,3-butadiene metabolite levels in smokers | 2 | 2 (100%) | 1 (50%) | 1 (50%) |
| White blood cell count | 854 | 2 (0%) | 1 (0%) | 0 (0%) |
Fig. 6Examples of the models obtained from the Bayesian networks. First plot shows the diagram of the model and second plot shows the dotplot between methylation and gene expression taking genotypes as color legend
Fig. 7Common mQTLs, eQTLs, eQTMs and meQTLs between normal tissues and tumor samples. Venn diagrams of Normal tissue (healthy mucosae and adjacent normal tissue combined), Adjacent normal tissue alone and Tumor tissue