| Literature DB >> 28122634 |
Roby Joehanes1,2,3, Xiaoling Zhang1,4,5, Tianxiao Huan1, Chen Yao1, Sai-Xia Ying2, Quang Tri Nguyen2, Cumhur Yusuf Demirkale2, Michael L Feolo6, Nataliya R Sharopova6, Anne Sturcke6, Alejandro A Schäffer6, Nancy Heard-Costa7, Han Chen8,4, Po-Ching Liu9, Richard Wang9, Kimberly A Woodhouse9, Kahraman Tanriverdi10, Jane E Freedman10, Nalini Raghavachari11, Josée Dupuis1,4, Andrew D Johnson1, Christopher J O'Donnell1,12, Daniel Levy13, Peter J Munson14,15.
Abstract
BACKGROUND: Identification of single nucleotide polymorphisms (SNPs) associated with gene expression levels, known as expression quantitative trait loci (eQTLs), may improve understanding of the functional role of phenotype-associated SNPs in genome-wide association studies (GWAS). The small sample sizes of some previous eQTL studies have limited their statistical power. We conducted an eQTL investigation of microarray-based gene and exon expression levels in whole blood in a cohort of 5257 individuals, exceeding the single cohort size of previous studies by more than a factor of 2.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28122634 PMCID: PMC5264466 DOI: 10.1186/s13059-016-1142-6
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Demographic characteristics
| Characteristic | Offspring cohort | Generation 3 cohort |
|
|---|---|---|---|
| Males (%) | 45.1% | 46.8% | 0.2291 |
| Age, in years | 66.4 ± 9.0 | 46.4 ± 8.8 | 1.36E-895 |
| White blood cell count (× 103/mL)b | 6.2 ± 1.3 | 6.0 ± 1.5 | 2.57E-7 |
| Neutrophil (%)b | 59.7 ± 7.9 | 58.7 ± 7.7 | 8.63E-7 |
| Lymphocyte (%)b | 27.1 ± 7.5 | 28.8 ± 6.9 | 2.20E-17 |
| Monocyte (%)b | 9.2 ± 1.9 | 8.6 ± 2.0 | 5.79E-22 |
| Eosinophil (%)b | 3.3 ± 1.6 | 3.1 ± 1.9 | 0.0039 |
| Basophil (%)b | 0.8 ± 0.2 | 0.8 ± 0.3 | 0.0019 |
| Platelet count (× 103/mL)b | 253.0 ± 36.5 | 247.5 ± 51.5 | 6.34E-6 |
*P values are from two-sample t-tests. For sex phenotype, the P value is from Fisher’s exact test
bCBC values are imputed based on actual measurements of 2274 samples within the Generation 3 cohort
Number of independent, significant eQTL-gene pairs with number of unique eQTLs or unique genes with P value corresponding to indicated FDR cutoff
| Pair type | eQTL-TranscriptCluster pairs | Unique eQTLs | Independent pairs | Independent unique lead eQTLs | Unique genesa(% available) |
|
|---|---|---|---|---|---|---|
| Nominal FDR < 0.05 | ||||||
| Cis | 4,285,456 | 2,221,013 | 19,613 | 19,239 | 10,327 (58%) | 1.00 E-4* |
| Trans | 216,169 | 91,559 | 6741 | 5749 | 4958 (28%) | 1.41 E-7 |
| Nominal FDR < 0.0005 | ||||||
| Cis | 3,698,429 | 2,008,734 | 17,452 | 17,119 | 9232 (52%) | 1.78E-5 |
| Trans | 116,960 | 52,426 | 1464 | 888 | 1025 (6%) | 8.82E-10 |
| Available | 1.521 E11 | 8,510,936 | 1.521 E11 | 8,510,936 | 17,873 (100%) | |
*P value cutoff corresponding to FDR. Upper bound P value for pairs retained in computation was 1E-4, therefore highest attained FDR for cis-eQTLs was 0.0024
aTranscript cluster ID is used as a proxy for genes. Only 244 genes were represented by more than one Transcript cluster IDs. Approximately 270 Transcript cluster IDs could not be assigned to an Entrez Gene entry
Top 25 non-redundant gene level cis-eQTL and top 25 trans-eQTL-transcript cluster pairs
| eQTL marker position | Rs ID | Transcript cluster ID | Trans Chr | Gene symbol | R2 | Beta | Cluster number |
|---|---|---|---|---|---|---|---|
| Top | |||||||
| 12:10118747 | rs12231872 | 3404530 R | 12 |
| 57% | −0.15 [G] | |
| 15:48596713 | rs74011998 G | 3593065 | 15 |
| 56% | −0.15 [T] | |
| 5:96252589 | rs2910686 H G | 2821347 R | 5 |
| 55% | −0.11 [T] | |
| 1:207280764 | rs12063500 G | 2377165 R | 1 |
| 54% | −0.32 [C] | |
| 6:31238135 | rs1050317 G | 2948887 | 6 | 50% | −0.3 [A] | ||
| 6:32576341 | rs9271093 G | 4048241 R | 6 |
| 50% | −0.45 [A] | |
| 22:42498204 | rs12157818 | 3947310 R | 22 |
| 49% | 0.16 [C] | |
| 6:26354100 | rs67509210 G | 2899333 R | 6 |
| 48% | 0.19 [G] | |
| 7:150480007 | rs1985881 | 3079172 R | 7 |
| 48% | −0.17 [C] | |
| 4:6697822 | rs3822260 H G | 2717078 | 4 |
| 48% | 0.24 [C] | |
| 22:45744854 | rs8136319 | 3948543 R | 22 |
| 48% | −0.2 [G] | |
| 6:167382449 | rs434093 G | 2984884 R | 6 |
| 48% | 0.12 [T] | 21 |
| 1:17421764 | rs2076613 | 2398820 | 1 |
| 47% | 0.08 [T] | |
| X:109206541 | rs2499412 | 3987029 | X |
| 47% | 0.11 [G] | |
| 4:47858518:ATAG_ | 2768273 | 4 |
| 47% | −0.1 [R] | ||
| 5:64858687 | rs432206 H | 2859667 R | 5 |
| 47% | −0.14 [C] | |
| 7:150478052 | rs6464101 H | 3031624 R | 7 |
| 47% | −0.15 [G] | |
| 1:109706880 | rs647294 H | 2350489 R | 1 |
| 47% | 0.14 [G] | |
| 6:32575544 | rs9271061 G | 4048265 R | 6 |
| 46% | −0.55 [T] | |
| 5:102118794 | rs2431321 H | 2822215 R | 5 |
| 46% | −0.08 [T] | |
| 6:32603854 | rs9272302 G | 2903219 R | 6 |
| 46% | −0.77 [C] | |
| 7:26952139 | rs2960785 H | 3042610 R | 7 |
| 46% | 0.08 [C] | |
| 1:43265985 | rs2816599 | 2409069 R | 1 |
| 45% | −0.31 [C] | |
| 9:132588337 | rs7470675 | 3227121 S | 9 |
| 45% | 0.17 [T] | |
| 7:75247329 | rs1186222 | 3057370 R | 7 |
| 45% | 0.09 [C] | |
| Top | |||||||
| 7:50427982 | rs6592965 G | 4007437 R | X |
| 22% | 0.09 [G] | 25 |
| 17:44026739 | rs242562 H G | 3767230 | 17 |
| 20% | 0.16 [G] | |
| 3:30722412 | rs3773654 | 3638566 | 15 |
| 16% | –0.18 [G] | |
| 1:58992071 | rs7520008 | 2414558 | 1 |
| 12% | 0.04 [T] | |
| 3:50078541 | rs9814664 H G | 3981931 | X |
| 12% | 0.11 [T] | 9 |
| 7:50427982 | rs6592965 G | 2520069 R | 2 |
| 11% | 0.1 [G] | 25 |
| 1:248039451 | rs3811444 H G | 3357237 R | 11 |
| 11% | 0.08 [C] | 4 |
| 3:49978069 | rs6772095 H G | 4024310 | X |
| 11% | 0.06 [C] | 9 |
| 16:69973655 | rs4985461 | 2876543 | 5 |
| 11% | –0.08 [G] | |
| 7:50427982 | rs6592965 G | 3724505 | 17 |
| 9% | 0.08 [G] | 25 |
| 21:44473062 | rs11700748 H G | 3416019 | 12 |
| 9% | 0.1 [C] | 56 |
| 5:50106439 | rs32396 H G | 2359431 | 1 |
| 8% | 0.08 [G] | |
| 11:617537 | rs2740380 G | 3864107 | 19 |
| 8% | 0.13 [T] | |
| 3:49971514 | rs7613875 H G | 3693591 | 16 |
| 8% | 0.06 [C] | 9 |
| 3:50008566 | rs6446189 G | 3939154 | 22 |
| 8% | 0.05 [A] | 9 |
| 3:56849749 | rs1354034 H G | 3724545 R | 17 |
| 8% | –0.08 [T] | 10 |
| 11:108623805:CTAT_ | 3504617 | 13 |
| 8% | –0.1 [R] | ||
| 17:33875262 | rs8073060 H | 3089102 R | 8 |
| 7% | –0.05 [T] | 51 |
| 3:56849749 | rs1354034 H G | 2735759 | 4 |
| 7% | –0.07 [T] | 10 |
| 11:55113534 | rs75905900 G | 3329983 | 11 |
| 7% | –0.05 [A] | |
| 6:170847101 | rs75159687 | 2390976 | 1 |
| 7% | –0.13 [T] | |
| 22:50027210 | rs5769712 | 3581090 | 14 |
| 7% | 0.06 [C] | 58 |
| 7:50427982 | rs6592965 G | 3714729 R | 17 |
| 7% | –0.05 [G] | 25 |
| 22:50027210 | rs5769712 | 3203199 | 9 |
| 6% | 0.07 [C] | 58 |
| 3:56849749 | rs1354034 H G | 2476510 | 2 |
| 6% | –0.06 [T] | 10 |
R2 - Percentage variance explained
Marker position is annotated as chromosome number:location in hg19 coordinate
Beta is regression estimate (log base 2 expression difference per dosage of effect allele), with effect allele in brackets. [R] refers to the reference allele for an indel polymorphism
H = HapMap SNPs
G = in LD with GWAS SNPs as recorded in the NHGRI GWAS catalog
R = In LD with SNP replicated in at least one of the databases associated with references GTEx [1], Westra et al. [5], Liang et al. [6], Kirsten et al. [8]
All 25 cis-eQLTs and 25 trans-eQLTs are internally validated and have consistent sign of expression change
S = SNP-in-probe problem likely inflates R2
Fig. 1Genomic eQTL location vs. transcript cluster location for highly significant eQTL-gene pairs (FDR < 1E-8). Bubble size is inversely proportional to the FDR. The largest bubble indicates FDR < 1E-100
Number of independent, significant pairs validated in previous studies
| Pair type | Comparison study | Eligible lead eQTL-gene pairsa | Validated pairsa (rate) | Expected pairs (rate) |
|---|---|---|---|---|
| Cis | Multiple studies [ | 10,584 | 5700 (54%) | 90 (0.8%) |
| Battle et al. [ | 11,466 | 2911 (25%) | 10 (0.08%) | |
| Kirsten et al. [ | 11,179 | 6503 (58%) | 919 (8%) | |
| Trans | Multiple studies [ | 1777 | 40 (2%) | 0.0007 (0%) |
| Battle et al. [ | 2596 | 102 (4%) | 0.0001 (0%) | |
| Kirsten et al. [ | 2337 | 135 (6%) | 0.03 (0%) |
aSee “Methods” and Additional file 1: Supplementary Methods for details
P values (comparing Validated to Expected pairs are based on Poisson distribution) are all <1E-200
Fig. 2Box plots of very strong cis-eQTL or trans-eQTL-transcript cluster pairs. a rs2499412 – TMEM164 (cis, R2 = 47%). b rs3773654 – PEX11A (trans, R2 = 16%). y-axis: expression level in RMA units; x-axis: imputed major allele count
Clusters of trans-eQTLs with many targets
| Cluster | Cluster address (hg19) | Width | N | Ng | Tr | Tr-max | Tr-Ext | Ret | GWAS traits | Top GSEA categories |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | chr1:118158079-118174222 | 16,144 | 8 | 0 | 10 | 8 | 0 | 1 | Platelet | |
| 2 | chr1:158810312-158827365 | 17,054 | 2 | 0 | 7 | 7 | 0 | 0 | ||
| 3 | chr1:205040943-205268803 | 227,861 | 114 | 4 | 14 | 11 | 0 | 5 | Mean platelet volume, Platelet counts | . |
| 4 | chr1:248038210-248047688 | 9479 | 4 | 3 | 15 | 12 | 2 | 1 | Mean corpuscular volume, Platelet counts, Red blood cell traits | . |
| 5 | chr2:8740722-8766192 | 25,471 | 23 | 0 | 24 | 21 | 0 | 2 | Catabolic process | |
| 6 | chr2:60708597-60727629 | 19,033 | 19 | 24 | 18 | 15 | 7 | 1 | Beta thalassemia/hemoglobin E disease, F-cell distribution, Fetal hemoglobin levels, Sickle cell anemia (hemolysis) | . |
| 7 | chr2:85715315-85809703 | 94,389 | 51 | 1 | 15 | 12 | 0 | 0 | Prostate cancer | Neutrophil |
| 8 | chr2:160326049-160674656 | 348,608 | 381 | 0 | 26 | 21 | 1 | 0 | Liver cancer progression, fibroblast cell response, Neutrophil, MicroRNA targets | |
| 9 | chr3:49734040-50308718 | 574,679 | 435 | 2 | 38 | 34 | 0 | 0 | HDL cholesterol, Menarche (age at onset) | Dendritic cells response |
| 10 | chr3:56829892-56953738 | 123,847 | 185 | 9 | 128 | 127 | 85 | 3 | Mean platelet volume, Platelet counts | Platelet, hemostasis, Neutrophil |
| 11 | chr3:100905910-101550022 | 644,113 | 544 | 0 | 37 | 33 | 17 | 0 | Genes in band chr19q13, transcription pathway | |
| 12 | chr3:121350573-121471367 | 120,795 | 35 | 0 | 8 | 8 | 0 | 1 | ||
| 13 | chr3:176869498-176928657 | 59,160 | 104 | 0 | 9 | 8 | 0 | 0 | ||
| 14 | chr4:103391275-103540762 | 149,488 | 77 | 1 | 7 | 7 | 0 | 0 | Ulcerative colitis | . |
| 15 | chr5:148198999-148213506 | 14,508 | 42 | 0 | 35 | 32 | 0 | 0 | Neutrophil, kidney glomeruli cell response | |
| 16 | chr5:173287763-173363889 | 76,127 | 9 | 2 | 14 | 13 | 0 | 5 | Crohn’s disease, Waist–hip ratio | . |
| 17 | chr6:135402339-135459837 | 57,499 | 27 | 24 | 26 | 23 | 7 | 6 | Beta thalassemia/hemoglobin E disease, Cholesterol, total, F-cell distribution, HbA2 levels, Hematocrit, Hematological parameters, Hematology traits, Hodgkin's lymphoma, Mean corpuscular hemoglobin, Mean corpuscular hemoglobin concentration, Mean corpuscular volume, Mean platelet volume, Other erythrocyte phenotypes, Platelet counts, Red blood cell traits, White blood cell count, White blood cell types | CD71+, MicroRNA targets |
| 18 | chr6:139764573-139844429 | 79,857 | 48 | 9 | 73 | 50 | 13 | 17 | Adiponectin levels, HDL cholesterol, Mean corpuscular hemoglobin, Mean corpuscular volume | CD71+, Neutrophil, MicroRNA targets |
| 19 | chr6:144190433-144674657 | 484,225 | 172 | 0 | 43 | 34 | 1 | 0 | Neutrophil, MicroRNA targets | |
| 20 | chr6:159498130-159539213 | 41,084 | 14 | 0 | 10 | 9 | 0 | 0 | Hematopoetic cells response, Neutrophil | |
| 21 | chr6:167362976-167403400 | 40,425 | 6 | 3 | 7 | 6 | 0 | 0 | Crohn’s disease, Graves’ disease, Inflammatory bowel disease | Hematopoetic cells response |
| 22 | chr7:28716154-28874761 | 158,608 | 11 | 0 | 19 | 17 | 0 | 0 | Neutrophil | |
| 23 | chr7:33035342-33096644 | 61,303 | 5 | 0 | 12 | 11 | 0 | 5 | . | |
| 24a | chr7:50366637-50417632 | 50,996 | 3 | 0 | 40 | 40 | 2 | 2 | Dendritic cell vs monocyte response, Neutrophil | |
| 25a | chr7:50367656-50671350 | 303,695 | 292 | 6 | 93 | 82 | 19 | 17 | Acute lymphoblastic leukemia (B-cell precursor), Acute lymphoblastic leukemia (childhood), Mean corpuscular volume, Red blood cell traits | CD71+, MicroRNA targets |
| 26 | chr7:106367604-106373718 | 6115 | 8 | 6 | 6 | 6 | 0 | 1 | Mean platelet volume | . |
| 27 | chr8:61595671-61658073 | 62,403 | 10 | 0 | 6 | 6 | 0 | 0 | ||
| 28 | chr9:99087217-99192919 | 105,703 | 12 | 1 | 7 | 7 | 0 | 0 | Height | Bone marrow progenitor cells response |
| 29 | chr9:126971204-127002414 | 31,211 | 20 | 0 | 20 | 16 | 1 | 0 | Kidney glomeruli cells response, Neutrophil, MicroRNA targets | |
| 30 | chr9:131561110-131645659 | 84,550 | 58 | 0 | 10 | 10 | 0 | 0 | Hematopoetic cells response, Neutrophil | |
| 31 | chr9:140655551-140696524 | 40,974 | 14 | 0 | 7 | 6 | 0 | 0 | . | |
| 32 | chr10:37976990-38692432 | 715,443 | 221 | 0 | 9 | 7 | 0 | 0 | . | |
| 33 | chr10:65016174-65104500 | 88,327 | 18 | 6 | 6 | 6 | 1 | 0 | Mean platelet volume, Platelet counts, Triglycerides | Platelet |
| 34 | chr11:212408-247028 | 34,621 | 15 | 1 | 6 | 6 | 1 | 0 | Platelet counts | Coagulation, Platelet |
| 35 | chr11:108021205-108367453 | 346,249 | 248 | 3 | 8 | 7 | 3 | 0 | Melanoma, Response to metformin, Response to metformin in type 2 diabetes (glycemic) | Neutrophil |
| 36 | chr12:54584330-54584330 | 1 | 1 | 0 | 7 | 7 | 0 | 0 | Genes with certain promoter motif | |
| 37a | chr12:54622328-54733728 | 111,401 | 6 | 2 | 7 | 6 | 0 | 0 | Mean platelet volume, Platelet counts | Genes in band chr14q24 |
| 38a | chr12:54668471-54761931 | 93,461 | 27 | 3 | 29 | 14 | 2 | 0 | Mean platelet volume, Platelet counts | Platelet |
| 39 | chr12:111833788-112896839 | 1,063,052 | 25 | 35 | 17 | 15 | 7 | 0 | Beta-2 microglobulin plasma levels, Blood pressure, Celiac disease, Celiac disease and Rheumatoid arthritis, Cholesterol, total, Chronic kidney disease, Coronary artery disease or ischemic stroke, Coronary artery disease or large artery stroke, Coronary heart disease, Diastolic blood pressure, Eosinophil counts, Hematocrit, Hematological parameters, Hemoglobin, Hypothyroidism, Ischemic stroke, LDL cholesterol, Mean platelet volume, Platelet counts, Red blood cell traits, Retinal vascular caliber, Rheumatoid arthritis, Systolic blood pressure, Tetralogy of Fallot, Type 1 diabetes, Type 1 diabetes autoantibodies, Upper aerodigestive tract cancers, Urate levels, Vitiligo | Interferon signaling, Cytokine signaling, Immune system |
| 40 | chr12:122216910-122366612 | 149,703 | 3 | 5 | 12 | 12 | 0 | 0 | Mean platelet volume, Platelet counts | Platelet, Neutrophil |
| 41 | chr12:129277164-129305613 | 28,450 | 56 | 1 | 17 | 16 | 0 | 0 | Systemic lupus erythematosus | . |
| 42 | chr14:24888248-24904602 | 16,355 | 19 | 0 | 13 | 13 | 0 | 3 | ||
| 43 | chr14:35372518-35847784 | 475,267 | 360 | 2 | 18 | 17 | 1 | 0 | Psoriasis | Pancreatic cancer cell response, Neutrophil, MicroRNA targets |
| 44 | chr16:57045349-57070617 | 25,269 | 31 | 0 | 11 | 11 | 1 | 0 | IFNG responsive | |
| 45 | chr16:87749634-87770469 | 20,836 | 6 | 0 | 9 | 9 | 0 | 0 | ||
| 46a | chr16:89802396-89992238 | 189,843 | 4 | 14 | 11 | 8 | 0 | 0 | Freckling, Hair color, Homocysteine levels, Vitiligo | . |
| 47a | chr16:89919081-89993206 | 74,126 | 102 | 9 | 9 | 8 | 0 | 0 | Basal cell carcinoma, Blond vs. brown hair color, Freckles, Hair color, Non-melanoma skin cancer, Red vs non-red hair color, Skin sensitivity to sun, Sunburns, Tanning | . |
| 48 | chr17:15899871-16239832 | 339,962 | 137 | 0 | 9 | 7 | 1 | 0 | Beta catenin signaling, Neutrophil | |
| 49 | chr17:26887271-27323322 | 436,052 | 252 | 0 | 44 | 41 | 0 | 11 | PBMC response, CD71+, Neutrophil, MicroRNA targets | |
| 50 | chr17:33763678-33819401 | 55,724 | 22 | 0 | 23 | 21 | 0 | 1 | PBMC response, MicroRNA targets | |
| 51a | chr17:33823254-34067892 | 244,639 | 360 | 2 | 141 | 95 | 8 | 26 | Platelet counts, Mean platelet volume | PBMC response, CD71+, Platelet, MicroRNA targets |
| 52a | chr17:33885467-33899846 | 14,380 | 3 | 2 | 11 | 9 | 0 | 0 | Platelet counts, Mean platelet volume | Platelet, PBMC vs. Tcells |
| 53 | chr18:43716475-43856354 | 139,880 | 121 | 0 | 10 | 9 | 0 | 2 | ||
| 54 | chr20:4139757-4188267 | 48,511 | 70 | 0 | 40 | 33 | 0 | 8 | PBMC response, CD71+ | |
| 55 | chr20:62529985-62565833 | 35,849 | 54 | 0 | 9 | 9 | 0 | 1 | Glioma cell response | |
| 56 | chr21:44471469-44492843 | 21,375 | 7 | 3 | 6 | 6 | 1 | 0 | Blood trace element (Se levels), Homocysteine levels, Obesity-related traits | . |
| 57 | chr22:32311619-32326021 | 14,403 | 17 | 0 | 8 | 8 | 0 | 0 | Hematopoetic cell response, Neutrophil | |
| 58 | chr22:49967868-50069539 | 101,672 | 128 | 0 | 18 | 16 | 0 | 0 | ||
| 59 | chrX:40802981-40847852 | 44,872 | 52 | 0 | 31 | 30 | 0 | 7 | CD71+ |
aOverlapping clusters
N number of significant eQTLs in cluster, Ng number of GWAS SNPs in cluster, Tr number of transcript clusters targeted by any trans-eQTL in cluster, Tr-max max number of transcript clusters targeted by single trans-eQTL in cluster, Tr-Ext maximum number of transcripts validated in one or more of five previous results [5–8, 59–62], Ret number of early reticulocyte specific transcripts targeted by trans-eQTLs in cluster, GSEA Category selected GSEA themes over-represented in cluster targets, GSEA gene set enrichment analysis
Fig. 3Number of transcript clusters targeted by each trans-eQTL. eQTLs targeting six or more extrachromosomal transcript clusters fall into color-coded clusters. a Number of extra-chromosomal trans targets. b Number of intrachromosomal trans targets. Note presence of unlabeled clusters on Chr 6, the HLA region, and on Chr 8
Fig. 4Screenshot of NCBI molecular QTL browser