| Literature DB >> 33057025 |
Xutong Zhao1, Dandi Qiao2, Chaojie Yang3, Silva Kasela4,5, Wonji Kim2, Yanlin Ma3, Nick Shrine6, Chiara Batini6, Tamar Sofer7,8, Sarah A Gagliano Taliun1, Phuwanat Sakornsakolpat2, Pallavi P Balte9, Dmitry Prokopenko2, Bing Yu10, Leslie A Lange11, Josée Dupuis12, Brian E Cade7,8, Jiwon Lee8, Sina A Gharib13, Michelle Daya11, Cecelia A Laurie14, Ingo Ruczinski15, L Adrienne Cupples12,16, Laura R Loehr17, Traci M Bartz14, Alanna C Morrison18, Bruce M Psaty19,20, Ramachandran S Vasan16,21, James G Wilson22, Kent D Taylor23, Peter Durda24, W Craig Johnson14, Elaine Cornell24, Xiuqing Guo23, Yongmei Liu25, Russell P Tracy24, Kristin G Ardlie26, François Aguet26, David J VanDenBerg27, George J Papanicolaou28, Jerome I Rotter23, Kathleen C Barnes11, Deepti Jain14, Deborah A Nickerson29, Donna M Muzny30, Ginger A Metcalf30, Harshavardhan Doddapaneni30, Shannon Dugan-Perez30, Namrata Gupta26, Stacey Gabriel26, Stephen S Rich3, George T O'Connor31, Susan Redline7,8,32, Robert M Reed33, Cathy C Laurie14, Martha L Daviglus34, Liana K Preudhomme35, Kristin M Burkart9, Robert C Kaplan36,37, Louise V Wain6,38, Martin D Tobin6,38, Stephanie J London39, Tuuli Lappalainen4,5, Elizabeth C Oelsner9, Goncalo R Abecasis1, Edwin K Silverman2, R Graham Barr9, Michael H Cho40, Ani Manichaikul41.
Abstract
Chronic obstructive pulmonary disease (COPD), diagnosed by reduced lung function, is a leading cause of morbidity and mortality. We performed whole genome sequence (WGS) analysis of lung function and COPD in a multi-ethnic sample of 11,497 participants from population- and family-based studies, and 8499 individuals from COPD-enriched studies in the NHLBI Trans-Omics for Precision Medicine (TOPMed) Program. We identify at genome-wide significance 10 known GWAS loci and 22 distinct, previously unreported loci, including two common variant signals from stratified analysis of African Americans. Four novel common variants within the regions of PIAS1, RGN (two variants) and FTO show evidence of replication in the UK Biobank (European ancestry n ~ 320,000), while colocalization analyses leveraging multi-omic data from GTEx and TOPMed identify potential molecular mechanisms underlying four of the 22 novel loci. Our study demonstrates the value of performing WGS analyses and multi-omic follow-up in cohorts of diverse ancestry.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33057025 PMCID: PMC7598941 DOI: 10.1038/s41467-020-18334-7
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Summary of the 19,996 study-participants included in analyses.
| Stratum | Study | Sample size (COPD cases) | |||
|---|---|---|---|---|---|
| Non Hispanic White | African American | Other* | All combined | ||
| Population-and family-based | Atherosclerosis risk in communities (ARIC) | 3075 (554) | 181 (16) | — | 3256 (570) |
| Cleveland Family Study (CFS) | 373 (24) | 346 (34) | — | 719 (58) | |
| Cardiovascular Health Study (CHS) | 39 (8) | — | 8 (2) | 47 (10) | |
| Framingham Heart Study (FHS) | 1835 (187) | — | — | 1835 (187) | |
| Jackson Heart Study (JHS) | — | 2,388 (121) | — | 2388 (121) | |
| Multi-Ethnic Study of Atherosclerosis (MESA) | 1224 (173) | 804 (84) | 1224 (76) | 3252 (333) | |
| Total | 6546 (946) | 3719 (255) | 1232 (78) | 11,497 (1279) | |
| COPD-enriched | Genetic Epidemiology of COPD (COPDGene) | 5713 (2416) | 2731 (717) | — | 8444 (3133) |
| Boston Early Onset COPD (EOCOPD) | 55 (54) | — | — | 55 (54) | |
| Total | 5768 (2470) | 2731 (717) | — | 8499 (3187) | |
| Combined | Total | 12,314 (3416) | 6450 (972) | 1232 (78) | 19,996 (4466) |
*The total number of CHS participants includes 8 African American individuals who were not included in stratified analysis of African Americans only due to the small number.
Fig. 1Overview of workflow for the study.
Whole genome sequence analysis of lung function and COPD was carried out in TOPMed participants from population- and family-based studies, as well as in COPD-enriched studies. We performed gene-based analysis of pLOF variants as well as single variant analysis. Genetic variants and loci identified by single variant analysis were further examined for colocalization with gene expression (eQTL) and methylation (mQTL) traits, as well as through replication and phenome-wide association studies (Phewas). Note: Novel loci demonstrating evidence of colocalization with eQTL are labeled according to the corresponding gene expression targets. All other loci are labeled using the nearest gene as indicated in Tables 2 and 3.
Four distinct novel variants at three distinct signals* with replication evidence.
| TOPMed Discovery variant: rsid Chr:Pos (effect/other allele) | EAF (TOPMed stratum, race/ethnicity) | Trait (direction) | HC Beta (SE) P-value | Annotation | UK Biobank European ancestry replication**: EffHC Beta (SE) | Additional supporting evidence |
|---|---|---|---|---|---|---|
| rs17308514 15:68020833 (G/A) | 0.384 (Combined, All) | FVC (decreased) | HC = 8092 −0.04 (0.01) | 5′ of | EffHC = 160039 −0.007 (0.002 | Colocalized methylation sites: cg00154119, cg20631419 |
| rs7188378 16:53872940 (C/T) | 0.475 (Combined, White) | FEV1/FVC (decreased) | HC = 6026 −0.79 (0.14) | EffHC = 158908 −0.009 (0.002) | Previous GWAS variant*** rsid, Chr:Pos (effect/other allele): rs35420030 16:53901495 (C/T) EAF (discovery population): 0.052 (UK Biobank European ancestry) Trait (direction): FEV1/FVC (increased) Phenome-wide association**** trait (direction): unspecified diffuse connective tissue disease (increased) | |
| rs12556310 X:47087005 (G/C) | 0.440 (COPD- enriched, All) | FVC (increased) | HC = 1901 0.05 (0.01) | EffHC = 136386 0.006 (0.002) | Colocalized gene expression traits: | |
| rs35917906 X:47100766 (T/C) | 0.489 (Combined, All) | FVC (increased) | HC = 3818 0.03 (0.01) | 3′ of | EffHC = 119997 0.009 (0.002) | Colocalized gene expression traits: |
Variants are reported based on genome-wide significance threshold of P = 5 × 10−8. All variant positions are presented based on Human Genome Build 38; EAF = effect allele frequency; HC = heterozygosity count; EffHC = effective heterozygosity count Genetic variant effects (betas) in TOPMed are reported for phenotypes under the heterogeneous variance model[56] such that the effect estimates reflect the scale of variance for FEV1 (in L), FVC (in L) and FEV1/FVC ratio (in %). P-values for genetic association in TOPMed as reported based on the SAIGE score test55.
*In determining the number of distinct signals, we grouped together two variants (rs12556310 and rs35917906) on chromosome X that were in LD with R-squared = 0.71 and 0.44 based on White and all race/ethnicities in our TOPMed sample, respectively.
**Genetic variant effects for replication in UK Biobank are reported for inverse normal transformed residualized lung function traits, following the model used in Shrine et al17. P-values are reported based on the BOLT-LMM genetic association test[66].
***Information on prior GWAS association reported based on result from Shrine et al.17.
****The phenome-wide association P-value is reported based on the SAIGE score test[55].
Additional genome-wide significant results with suggestive supporting evidence.
| TOPMed variant rsid Chr:Pos (effect/other allele) | EAF (TOPMed stratum, race/ethnicity) | Trait (direction) | HC Beta (SE) P-value | Annotation | Supporting evidence type | Supporting evidence details |
|---|---|---|---|---|---|---|
| rs9295345 6:166400303 (T/G) | 0.725 (Combined, White) | FEV1 (decreased) | HC = 4970 −0.08 (0.01) | 3′ of | Colocalized gene expression traits: | |
| rs5953026 X:47317317 (G/A) | 0.602 (Combined, White) | FVC (increased) | HC = 2923 0.04 (0.01) | 3′ of | Colocalized gene expression trait: | |
| rs184101688 7:7140556 (C/A) | 0.001 (Combined, All) | FEV1/FVC (decreased) | HC = 41 −9.42 (1.66) | 5′ of | Overlap with known GWAS region | Previous GWAS variant* rsid, Chr:Pos (effect/other allele): rs4318980, 7:7216859 (A/G) EAF (discovery population): 0.415 (UK Biobank European ancestry) Trait (direction): FEV1/FVC (decreased) |
| rs74469188 16:81611365 (C/T) | 0.150 (COPD-enriched, African American) | FVC (increased) | HC = 714 0.14 (0.03) | Association in UK Biobank with inconsistent direction of effect | UK Biobank European ancestry** EffHC = 64854 Beta (SE) = −0.017 (0.004) | |
| rs371740347 1:196989333 (C/T) | 0.006 (COPD-enriched, All) | FVC (increased) | HC = 104 0.38 (0.07) | Phenome-wide association evidence with inconsistent direction of effect | Phenome-wide association*** trait (direction): respiratory failure, insufficiency, arrest (increased) |
Variants are reported based on genome-wide significance threshold of P = 5 × 10−8. All variant positions are presented based on Human Genome Build 38; EAF = effect allele frequency; HC = heterozygosity count; EffHC = effective heterozygosity count Genetic variant effects (betas) in TOPMed are reported for phenotypes under the heterogeneous variance model[56] such that the effect estimates reflect the scale of variance for FEV1 (in L), FVC (in L) and FEV1/FVC ratio (in %). P-values for genetic association in TOPMed as reported based on the SAIGE score test[55].
*Information on prior GWAS association reported based on result from Shrine et al.[17].
**Genetic variant effects for association in UK Biobank are reported for inverse normal transformed residualized lung function traits, following the model used in Shrine et al.[17]. P-values are reported based on the BOLT-LMM genetic association test[66].
***The phenome-wide association P-value is reported based on the SAIGE score test[55].