| Literature DB >> 33600772 |
Hari K Somineni1, Sini Nagpal2, Suresh Venkateswaran3, David J Cutler4, David T Okou3, Talin Haritunians5, Claire L Simpson6, Ferdouse Begum7, Lisa W Datta7, Antonio J Quiros8, Jenifer Seminerio9, Emebet Mengesha5, Jonathan S Alexander10, Robert N Baldassano11, Sharon Dudley-Brown12, Raymond K Cross13, Themistocles Dassopoulos14, Lee A Denson15, Tanvi A Dhere16, Heba Iskandar16, Gerald W Dryden17, Jason K Hou18, Sunny Z Hussain19, Jeffrey S Hyams20, Kim L Isaacs21, Howard Kader22, Michael D Kappelman23, Jeffry Katz24, Richard Kellermayer25, John F Kuemmerle26, Mark Lazarev27, Ellen Li28, Peter Mannon29, Dedrick E Moulton30, Rodney D Newberry31, Ashish S Patel32, Joel Pekow33, Shehzad A Saeed34, John F Valentine35, Ming-Hsi Wang36, Jacob L McCauley37, Maria T Abreu38, Traci Jester39, Zarela Molle-Rios40, Sirish Palle41, Ellen J Scherl42, John Kwon43, John D Rioux44, Richard H Duerr45, Mark S Silverberg46, Michael E Zwick4, Christine Stevens47, Mark J Daly47, Judy H Cho48, Greg Gibson2, Dermot P B McGovern5, Steven R Brant49, Subra Kugathasan50.
Abstract
Whether or not populations diverge with respect to the genetic contribution to risk of specific complex diseases is relevant to understanding the evolution of susceptibility and origins of health disparities. Here, we describe a large-scale whole-genome sequencing study of inflammatory bowel disease encompassing 1,774 affected individuals and 1,644 healthy control Americans with African ancestry (African Americans). Although no new loci for inflammatory bowel disease are discovered at genome-wide significance levels, we identify numerous instances of differential effect sizes in combination with divergent allele frequencies. For example, the major effect at PTGER4 fine maps to a single credible interval of 22 SNPs corresponding to one of four independent associations at the locus in European ancestry individuals but with an elevated odds ratio for Crohn disease in African Americans. A rare variant aggregate analysis implicates Ca2+-binding neuro-immunomodulator CALB2 in ulcerative colitis. Highly significant overall overlap of common variant risk for inflammatory bowel disease susceptibility between individuals with African and European ancestries was observed, with 41 of 241 previously known lead variants replicated and overall correlations in effect sizes of 0.68 for combined inflammatory bowel disease. Nevertheless, subtle differences influence the performance of polygenic risk scores, and we show that ancestry-appropriate weights significantly improve polygenic prediction in the highest percentiles of risk. The median amount of variance explained per locus remains the same in African and European cohorts, providing evidence for compensation of effect sizes as allele frequencies diverge, as expected under a highly polygenic model of disease.Entities:
Keywords: CALB2; PTGER4; differential genetic architecture; polygenic risk scores; rare variants; trans-ethnic comparative analysis; understudied populations; whole-genome sequencing of African Americans
Mesh:
Substances:
Year: 2021 PMID: 33600772 PMCID: PMC8008495 DOI: 10.1016/j.ajhg.2021.02.001
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.025
Figure 1Regional plots of the PTGER4 region in the discovery whole-genome sequence data
Crohn disease (top); ulcerative colitis (middle); inflammatory bowel disease (bottom). Dashed red lines indicate genome-wide significance (p = 5 × 10−8). Variants are color coded to show linkage disequilibrium structure relative to a variant, rs6896969, that showed the strongest association with Crohn disease in African Americans in our previous GWAS.
Figure 2QQ plot of rare-variant gene-level association analysis of ulcerative colitis
Observed negative log p values are from the SKAT-O test.
Figure 3Comparison of effect sizes across populations
(A) Statistical power of the discovery whole-genome sequencing dataset to replicate previously known variants at p < 0.05. The known variants that showed an association (p < 0.05) with Crohn disease, ulcerative colitis, or inflammatory bowel disease in African Americans are colored to denote their p value of association.
(B–D) Comparison of effect sizes at known variants in African Americans and Europeans. Each variant is colored to denote the p value of association for Crohn disease (B), ulcerative colitis (C), or inflammatory bowel disease (D) in African Americans from the discovery whole-genome sequencing cohort. The red line shows the linear regression fit to indicate the general trend. Significance and goodness-of-fit are shown. ADCY7 was not included while computing goodness-of-fit in (C) (see subjects and methods).
Figure 4Median proportion of variance explained by inflammatory bowel disease loci in African Americans and Europeans via effects estimated from the two populations
(A and B) Median proportion of variance explained (PVE) by the known loci (n = 215 SNPs) assessed via allele frequencies and effect sizes estimated in African American pseudo-controls and UK Biobank datasets measured on lnOR and liability scales across five discovery sets. PVE is computed as 2p.q.ln(OR)2 on lnOR scale (A); 2.p.alpha12 + 2.q.alpha22, where alpha1 and alpha2 are liability estimates on liability scale (B).
Figure 5Comparison of proportion of variance explained (PVE) per locus in African Americans and Europeans at known loci (n = 215 SNPs) via effects estimated on the lnOR and liability scales
(A) PVE is computed as 2p.q.ln(OR)2.
(B) PVE is computed as 2.p.alpha12 + 2.q.alpha22, where alpha1 and alpha2 are liability scale estimates.
Figure 6Polygenic risk scores (PRSs) in African Americans and Europeans as a function of different discovery ancestry groups
PRSs in African Americans and UK Biobank individuals derived via 215 of the known disease SNPs and liability scale effects estimated from the African American discovery cohort (AA_liability, red) or the UK Biobank discovery cohort (UKBB_liability, green).
(A and B) Prevalence of IBD (%) versus percentile of PRS in each cohort.
(C and D) Mean proportion of variance explained (PVE) by each PRS. Standard error bars are from five test sets.