| Literature DB >> 33313492 |
Elyse Geoffroy1, Isabelle Gregga2, Heather E Wheeler1,2.
Abstract
Most genome-wide association studies (GWAS) and transcriptome-wide association studies (TWAS) focus on European populations; however, these results cannot always be accurately applied to non-European populations due to genetic architecture differences. Using GWAS summary statistics in the Population Architecture using Genomics and Epidemiology study, which comprises ∼50,000 Hispanic/Latinos, African Americans, Asians, Native Hawaiians, and Native Americans, we perform TWAS to determine gene-trait associations. We compared results using three transcriptome prediction models derived from Multi-Ethnic Study of Atherosclerosis populations: the African American and Hispanic/Latino (AFHI) model, the European (EUR) model, and the African American, Hispanic/Latino, and European (ALL) model. We identified 240 unique significant trait-associated genes. We found more significant, colocalized genes that replicate in larger cohorts when applying the AFHI model than the EUR or ALL model. Thus, TWAS with population-matched transcriptome models have more power for discovery and replication, demonstrating the need for more transcriptome studies in diverse populations.Entities:
Keywords: Genetics; Genomics; Human Genetics; Population
Year: 2020 PMID: 33313492 PMCID: PMC7721644 DOI: 10.1016/j.isci.2020.101850
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Population Architecture Using Genomics and Epidemiology (PAGE) Phenotypes Tested in TWAS and the Significant Gene Counts for Each Phenotype and Transcriptome Prediction Model
| Trait | Total N or N Cases/N Controls | Mean or % Cases | SD of Mean | TWAS with AFHI Count | TWAS with EUR Count | TWAS with all Count |
|---|---|---|---|---|---|---|
| Inflammatory traits | ||||||
| C-reactive protein (CRP) (mg/L) | 28,520 | 4.114 | 4.836 | 9 | 8 | 9 |
| White blood cell (WBC) count (109 cells/L) | 28,608 | 6.253 | 1.943 | 78 | 34 | 91 |
| Mean corpuscular hemoglobin concentration (MCHC) (g/dL) | 19,803 | 32.909 | 1.249 | 1 | 2 | 2 |
| Platelets (per mcL) | 29,328 | 246.783 | 64.273 | 4 | 4 | 3 |
| Lipid traits | ||||||
| HDL cholesterol (mg/dL) | 33,063 | 50.738 | 15.372 | 11 | 5 | 12 |
| LDL cholesterol (mg/dL) | 32,221 | 137.777 | 40.945 | 4 | 5 | 3 |
| Triglycerides (mg/dL) | 33,096 | 137.830 | 92.125 | 9 | 9 | 15 |
| Total Cholesterol (mg/dL) | 33,185 | 214.864 | 46.452 | 9 | 7 | 11 |
| Lifestyle traits | ||||||
| Cigarettes/day exclude nonsmokers | 15,862 | 12.507 | 9.088 | 0 | 0 | 0 |
| Coffee (cups/day) | 35,902 | 0.893 | 1.130 | 0 | 0 | 0 |
| Glycemic traits | ||||||
| HbA1c (mmol/mol) | 11,178 | 36.823 | 4.520 | 0 | 0 | 0 |
| Fasting insulin (pmol/L) | 21,551 | 10.233 | 7.979 | 0 | 0 | 0 |
| Fasting glucose (mmol/L) | 23,911 | 5.050 | 0.633 | 1 | 1 | 0 |
| Type 2 diabetes (cases/controls) | 14,042/31,683 | 30.7% | 1 | 0 | 2 | |
| Electrocardiogram traits | ||||||
| QT interval (ms) | 17,348 | 410.678 | 30.580 | 3 | 3 | 3 |
| QRS interval (ms) | 17,046 | 89.023 | 9.596 | 0 | 1 | 2 |
| PR interval (ms) | 17,422 | 158.909 | 22.364 | 3 | 1 | 2 |
| Blood Pressure traits | ||||||
| Systolic blood pressure (mm Hg) | 35,433 | 132.150 | 22.243 | 0 | 0 | 0 |
| Diastolic blood pressure (mm Hg) | 35,433 | 80.681 | 13.827 | 0 | 0 | 0 |
| Hypertension (cases/controls) | 27,123/22,018 | 55.2% | 0 | 0 | 0 | |
| Anthropometric traits | ||||||
| WHR-females | 24,838 | 0.855 | 0.082 | 0 | 0 | 0 |
| WHR-males | 9,066 | 0.952 | 0.066 | 0 | 0 | 0 |
| WHR | 33,904 | NA | NA | 0 | 0 | 0 |
| Height (cm) | 49,796 | 163.893 | 9.568 | 19 | 11 | 21 |
| BMI (kg/m2) | 49,335 | 29.333 | 6.285 | 0 | 0 | 0 |
| Kidney traits | ||||||
| Chronic kidney disease (cases/controls) | 4,154/41,573 | 10.0% | 0 | 0 | 0 | |
| End-stage renal disease (cases/controls) | 602/32,459 | 1.9% | 0 | 0 | 0 | |
| eGFR (mL/min) | 27,900 | 90.548 | 21.880 | 0 | 0 | 0 |
Phenotype information and GWAS sample sizes were taken from Table S1 in Wojcik et al., 2019. Wojcik et al., 2019 had a combined Nmax = 49,839.
SD = standard deviation; WHR = waist-to-hip ratio; HbA1c = hemoglobin A1c; eGFR = estimated glomerular filtration rate; CRP = c-reactive protein; MCHC = mean corpuscular hemoglobin concentration; BMI = body mass index; AFHI = African American and Hispanic/Latino transcriptome prediction model; EUR = European transcriptome model; ALL = African American, Hispanic/Latino, and European transcriptome model; MESA = Multi-Ethnic Study of Atherosclerosis; PAGE = Population Architecture using Genomics and Epidemiology study.
Traits have been adjusted for medications by adding a constant.
Traits have been adjusted for BMI.
Estimated glomerular filtration rate (eGFR) was calculated using the CKD-EPI (Chronic Kidney Disease Epidemiology Collaboration) formula from Levey et al., 2009. See Wojcik et al., 2019 for details.
Figure 1Z score Comparison of TWAS Significant Genes Identified by AFHI and EUR MESA Transcriptome Prediction Models in PAGE
Gene-trait pairs that were identified as significant (P < 0.05/n, n = the number of genes in the transcriptome model tested in S-PrediXcan) by either model are displayed. The Pearson correlation of displayed gene-trait pairs is shown in the upper left corner (R = 0.63). AFHI = African American and Hispanic/Latino transcriptome prediction model; EUR = European transcriptome prediction model; MESA = Multi-Ethnic Study of Atherosclerosis; PAGE = Population Architecture using Genomics and Epidemiology study.
Figure 2Manhattan Plot of the 14 of 28 PAGE Phenotypes Tested that Returned Significant TWAS Gene-Trait Pairs Using the AFHI, EUR, and ALL MESA Gene Expression Prediction Models
Each point represents the -log10(p) of a gene association test and gene chromosomal position colored by phenotype. Only significant gene-trait pairs are shown (P < 0.05/n, n = the number of genes in the transcriptome model tested in S-PrediXcan). The dotted line is at the more conservative significance threshold calculated using all tests (P < 1.1 × 10−7). 11 phenotypes have gene associations that meet this more stringent threshold. Using the AFHI, EUR, and ALL models, we identified 95, 46, and 121 significant gene-trait pairs, respectively, at this threshold. Gene-trait pairs with P < 1e-50 are displayed at P = 1e-50 for readability. AFHI = African American and Hispanic/Latino transcriptome prediction model; EUR = European transcriptome model; ALL = African American, Hispanic/Latino, and European transcriptome model; MCHC = mean corpuscular hemoglobin concentration; CRP levels = c-reactive protein levels; WBC count = white blood cell count; MESA = Multi-Ethnic Study of Atherosclerosis; PAGE = Population Architecture using Genomics and Epidemiology study.
S-PrediXcan Significant Genes in PAGE with Colocalization Probability (P4) > 0.5 that Replicated in Independent Studies in PhenomeXcan
| Gene Name | Z Score | Effect Size | P | CHR | P3 | P4 | Model | Phenotype | Best PhenomeXcan P | RCP |
|---|---|---|---|---|---|---|---|---|---|---|
| −18 | −12 | 4.2 × 10−73 | 16 | 2.3 × 10−3 | 1 | AFHI | HDL cholesterol | 6.1 × 10−97 | NA | |
| −4.8 | −17 | 1.7 × 10−6 | 11 | 7.1 × 10−3 | 0.95 | AFHI | HDL cholesterol | 1.6 × 10−6 | NA | |
| 4.7 | −9.7 | 2.3 × 10−6 | 5 | 0.19 | 0.80 | AFHI | Height | 9.6 × 10−17 | 0.57 | |
| 4.5 | −7.7 | 5.7 × 10−6 | 3 | 6.5 × 10−2 | 0.92 | AFHI | Height | 2.1 × 10−105 | NA | |
| 5.4 | 9.4 | 2.7 × 10−8 | 17 | 0.23 | 0.77 | AFHI | Height | 4.5 × 10−48 | NA | |
| 4.8 | 0.09 | 1.3 × 10−6 | 14 | 0.03 | 0.97 | AFHI | Height | 5.8 × 10−25 | NA | |
| −5.0 | −0.05 | 5.3 × 10−7 | 5 | 0.17 | 0.81 | AFHI | Height | 6.2 × 10−47 | NA | |
| −6.6 | 0.16 | 3.1 × 10−11 | 8 | 0.10 | 0.90 | AFHI | MCHC | 2.8 × 10−23 | 0.58 | |
| −11 | 0.02 | 2.6E-30 | 6 | 4.4 × 10−3 | 1 | AFHI | Platelet count | 2.6 × 10−149 | 0.97 | |
| −4.5 | −0.06 | 6.0 × 10−6 | 11 | 1.8 × 10−2 | 0.98 | AFHI | Platelet count | 6.9 × 10−60 | 0.81 | |
| 9.7 | −0.05 | 3.9 × 10−22 | 1 | 2.2 × 10−2 | 0.95 | AFHI | WBC count | 5.8 × 10−6 | NA | |
| 4.9 | −0.11 | 1.2 × 10−6 | 3 | 1.7 × 10−2 | 0.98 | EUR | Height | 9.5 × 10−150 | 0.58 | |
| −4.9 | −2.6 | 8.0 × 10−7 | 16 | 6.7 × 10−3 | 0.99 | EUR | Height | 1.9 × 10−32 | NA | |
| −4.4 | 0.08 | 9.8 × 10−6 | 5 | 4.8 × 10−2 | 0.95 | EUR | Height | 6.2 × 10−47 | NA | |
| −12 | 0.08 | 2.8 × 10−32 | 6 | 2.5 × 10−3 | 1 | EUR | Platelet count | 2.6 × 10−149 | 0.97 | |
| −5.7 | 0.11 | 1.4 × 10−6 | 12 | 3.3 × 10−3 | 1 | EUR | Platelet count | 3.9 × 10−47 | NA | |
| −12 | −13 | 7.0 × 10−34 | 6 | 3.9 × 10−3 | 1 | ALL | Platelet count | 2.6 × 10−149 | 0.97 | |
| 7.5 | 0.74 | 6.7 × 10−14 | 6 | 0.21 | 0.54 | ALL | Height | 9.0 × 10−132 | NA | |
| −20 | −7.7 | 4.2 × 10−73 | 16 | 2.3 × 10−3 | 1 | ALL | HDL cholesterol | 6.1 × 10−97 | NA | |
| −7.1 | −3.7 | 1.4 × 10−12 | 16 | 0.31 | 0.66 | ALL | HDL cholesterol | 2.0 × 10−65 | NA | |
| −4.5 | −0.04 | 5.6 × 10−6 | 16 | 1.3 × 10−2 | 0.95 | ALL | Height | 1.9 × 10−32 | NA | |
| 4.6 | 0.02 | 4.3 × 10−6 | 5 | 0.19 | 0.80 | ALL | Height | 9.6 × 10−17 | 0.57 | |
| −4.5 | −0.25 | 7.9 × 10−6 | 8 | 0.32 | 0.68 | ALL | MCHC | 7.3 × 10−21 | 0.51 | |
| −4.7 | −0.05 | 2.4 × 10−6 | 5 | 0.10 | 0.89 | ALL | Height | 6.2 × 10−47 | NA | |
| 8.8 | 0.08 | 1.2 × 10−18 | 1 | 0.27 | 0.69 | ALL | WBC count | 5.8 × 10−6 | NA | |
| 6.7 | 0.18 | 2.6 × 10−11 | 3 | 8.3 × 10−3 | 0.99 | ALL | Height | 9.5 × 10−150 | 0.58 |
Details of the studies used in PhenomeXcan are in Table S2.
P3 = COLOC probability eQTL and GWAS signals are independent; P4 = COLOC probability eQTL and GWAS signals are colocalized; AFHI = African American and Hispanic/Latino transcriptome prediction model; EUR = European transcriptome model; ALL = African American, Hispanic/Latino, and European transcriptome model; MESA = Multi-Ethnic Study of Atherosclerosis; PAGE = Population Architecture using Genomics and Epidemiology study; RCP = PhenomeXcan regional colocalization probability.
Figure 3SMIM19 GWAS and eQTL Signals are Colocalized in AFHI, but not EUR
LocusCompare (Liu et al., 2019) plots for mean corpuscular hemoglobin concentration (MCHC) PAGE GWAS p values compared to (A) AFHI MESA eQTL p values and (B) EUR MESA eQTL p values of SNPs in the SMIM19 prediction models. When most points are located on the diagonal, it indicates the GWAS and eQTL signals are likely colocalized. The lead SNP in the AFHI eQTL and PAGE GWAS, rs2923403, is located among the top signals and in the upper right corner, supporting the COLOC evidence for colocalization AFHI (P4 = 0.90). When using EUR eQTL data in COLOC, the GWAS and eQTL signals did not colocalize (EUR P4 = 0.047). Points are colored according to the pairwise LD r2 with rs2923403 in (A) AMR and (B) EUR 1000 Genomes populations.