| Literature DB >> 31798965 |
C S Park1, T De1, Y Xu1,2, Y Zhong1, E Smithberger1,3, C Alarcon1, E R Gamazon4,5,6, M A Perera1.
Abstract
African Americans (AAs) are an admixed population with widely varying proportion of West African ancestry (WAA). Here we report the correlation of WAA to gene expression and DNA methylation in AA-derived hepatocytes, a cell type important in disease and drug response. We perform mediation analysis to test whether methylation is a mediator of the effect of ancestry on expression. GTEx samples and a second cohort are used as validation. One hundred and thirty-one genes are associated with WAA (FDR < 0.10), 28 of which replicate and represent 220 GWAS phenotypes. Among PharmGKB pharmacogenes, VDR, PTGIS, ALDH1A1, CYP2C19, and P2RY1 nominally associate with WAA (p < 0.05). We find 1037 WAA-associated, differentially methylated regions (FDR < 0.05), with hypomethylated genes enriched in drug-response pathways. In conclusion, WAA contributes to variability in hepatocyte expression and DNA methylation with identified genes previously implicated for diseases disproportionately affecting AAs, including cardiovascular (PTGIS, PLAT) and renal (APOL1) disease, and drug response (CYP2C19).Entities:
Keywords: Gene expression; Population genetics
Year: 2019 PMID: 31798965 PMCID: PMC6877651 DOI: 10.1038/s41525-019-0102-y
Source DB: PubMed Journal: NPJ Genom Med ISSN: 2056-7944 Impact factor: 8.617
Fig. 1Flowchart outlining each cohort use in the analyses. a Hepatocyte Discovery Cohort from AA primary hepatocyte cultures were RNA-sequenced and assessed for DNA methylation. b The GTEx Liver Replication Cohort was comprised of AA liver samples with available genotype and gene expression data. c Innocenti Liver Validation Cohort was comprised of liver samples from both EAs and AAs obtained through GEO (accession GSE124076), in which differential gene expression was used to validate findings from the Hepatocyte Discovery Cohort
Demographics and clinical characteristics of hepatocyte/liver cohorts
| Hepatocyte Discovery Cohort | GTEx Liver Cohort | Innocenti Validation Liver Cohort | ||
|---|---|---|---|---|
| RNA-Seq cohort | Methylation cohort | |||
| Variable | AA ( | AA ( | AA ( | AAa and EAb ( |
| Percentage of AA subjects (%) | 100 | 100 | 100 | 11.1 |
| Age, yearsc (mean ± SD) | 39 ± 20.5 | 46 ± 12.2 | 54.5 ± 6.6 | 46 ± 22 |
| Sex (female %)d | 48.3 | 52.3 | 20 | 36.4 |
| West Africane ancestry (%) (mean ± SD) | 78.17 ± 12.86 | 79.83 ± 11.86 | 84.04 ± 7.13 | NA |
aAA: African Americans >40% WAA
bEA: European Americans with <1% WAA
cp = 0.0280 for age (between groups, one-way ANOVA), p = 0.0409 (between RNA-Seq and GTEx Liver cohorts)
dp = 0.0439 for sex
ep = 0.2331 for ancestry
Fig. 2Gene expression traits and methylation patterns associated with West African ancestry in hepatocytes. a Enhanced volcano plot of gene expression traits associated with increasing WAA plotted against −log10 p values of all 18,854 genes expressed in hepatocytes and 131 genes significantly associated with WAA represented as red and blue circles (red circles: FDR < 0.05 and logFC > 1.5 and <−1.5; blue circles: FDR < 0.05 and logFC < 1.5 and >−1.5; green circles: FDR > 0.05 and logFC > 1.5 and <−1.5, gray circles: FDR > 0.05 and logFC < 1.5 and >−1.5) and b enhanced volcano plot of gene expression traits associated with increasing WAA plotted against −log10 p values within the subset of 64 PharmGKB “very important genes” (red circles: p < 0.05 and logFC > 1.0; blue circles: p value < 0.05 and logFC < 1.0; gray circles: p value > 0.05). c Correlation of 1034 unique genes containing DMRs significantly associated with WAA (mean Beta fold change from DMRcate) with coefficient of gene expression at each gene (indicating the direction of association to ancestry). Each point represents a gene, with gray triangles representing hypomethylated genes (Pearson’s r = −0.014, p = 0.009) and maroon red circles representing hypermethylated genes (Pearson’s r = 0.036, p = 0.55). Gray and maroon hash marks on the x- and y-axis represent genes plotted with both expression and methylation values. Gray and maroon shading around each regression line represents the C.I. 95%. d Q–Q plot of the observed versus expected −log10 p values in the replication cohort (n = 206). Each point represents a gene with the colored lines representing different FDR thresholds of significance
Significant and replicated DE genes between AAs and EAs (p < 0.05) from a genome-wide discovery of genes associated with West African ancestry (FDR < 0.10)
| Geneb | AA Hepatocyte Cohort | DE in AA vs EAa | ||
|---|---|---|---|---|
| Effect size |
| FDR < 0.10 | ||
|
| 3.152 | 7.25E−05 | 0.0471 | 0.00000126 |
|
| −2.848 | 3.83E−05 | 0.0380 | 0.000202 |
|
| −2.246 | 4.91E−05 | 0.0421 | 0.000502 |
|
| 0.437 | 1.86E−04 | 0.0773 | 0.000696 |
|
| −1.945 | 2.93E−04 | 0.0788 | 0.00167 |
|
| −0.306 | 6.73E−04 | 0.0988 | 0.00249 |
|
| −1.007 | 1.89E−04 | 0.0773 | 0.00266 |
|
| −0.929 | 5.29E−04 | 0.0882 | 0.00268 |
|
| 2.192 | 1.62E−04 | 0.0751 | 0.00278 |
|
| 1.131 | 2.57E−05 | 0.0380 | 0.00282 |
|
| −2.191 | 6.86E−04 | 0.0988 | 0.00327 |
|
| −2.986 | 3.17E−04 | 0.0790 | 0.00385 |
|
| −1.454 | 6.36E−04 | 0.0967 | 0.00445 |
|
| −0.505 | 3.04E−04 | 0.0790 | 0.00605 |
|
| −1.316 | 3.08E−04 | 0.0790 | 0.00706 |
|
| −0.833 | 2.37E−04 | 0.0788 | 0.00743 |
|
| −1.453 | 4.58E−05 | 0.0411 | 0.0123 |
|
| −1.271 | 6.86E−04 | 0.0988 | 0.0125 |
|
| −1.567 | 9.64E−06 | 0.0307 | 0.013 |
|
| −1.643 | 1.06E−04 | 0.0583 | 0.0134 |
|
| 2.640 | 3.94E−04 | 0.0854 | 0.0137 |
| MKNK2 | −1.545 | 3.44E−04 | 0.0811 | 0.0262 |
| MGRN1 | −1.010 | 2.76E−04 | 0.0788 | 0.0313 |
| C3orf33 | 0.506 | 6.45E−04 | 0.0973 | 0.0314 |
| PTPN4 | 0.484 | 5.37E−04 | 0.0882 | 0.0321 |
| NPR2 | 1.235 | 6.11E−05 | 0.0471 | 0.0444 |
| MSX1 | 1.432 | 2.90E−04 | 0.0788 | 0.0448 |
| PDK1 | −2.038 | 6.30E−04 | 0.0966 | 0.0498 |
aReplication in differentially expressed (DE) genes between 23 African Americans (AAs) and 183 European Americans (EAs)
bGenes with FDR < 0.1 in the Replication Cohort are shown in bold
cReplication p value (p < 0.05) based on 131 discovery genes (Supplemental Data S1)
Fig. 3A Circos plot of significantly associated CpGs, DMRs, and gene expression traits associated with West African ancestry and GWAS catalog traits associated with replicated gene expression traits. The innermost ring represents the 220 GWAS catalog traits associated with the 28 genes replicated in the Innocenti et al. dataset (p < 0.05) with each purple circle representing the scaled −log(p value) of a study from the GWAS catalog. The second ring represents 83 genes associated with decreased expression with increased WAA (red bars represent fold change, ranging from 0 to −6). The third ring represents 48 genes associated with increased expression with increased WAA (blue bars represent fold change >1.5, ranging from 0 to +6). The fourth ring represents 1037 DMRs significantly associated with WAA (purple tiles represent DMRs that stack when overlapping). The fifth ring represents 23,317 significant differentially methylated CpGs (black squares represent 7913 hypomethylated CpG sites; orange circles represent 15,404 hypermethylated CpG sites; not all CpGs are depicted due to reduced crowding implemented in the Circos program). The next ring represents the karyotype of the human genome (reference hg38) and the outermost ring corresponds to the gene names of the 131 WAA-associated gene expression traits identified
Fig. 4Enrichment of biological processes and molecular functions of differentially methylated genes associated with West African ancestry and corresponding gene expression traits. a Gene ontology terms that are enriched for biological processes (BP) and molecular functions (MF) for 432 genes annotated to differentially hypermethylated regions, b 602 genes annotated to differentially hypomethylated regions, and c 131 genes with gene expression traits associated with WAA (FDR < 0.10). p Values are BH-adjusted p values obtained from gProfiler