Literature DB >> 21946350

Genome-wide association and large-scale follow up identifies 16 new loci influencing lung function.

María Soler Artigas1, Daan W Loth, Louise V Wain, Sina A Gharib, Ma'en Obeidat, Wenbo Tang, Guangju Zhai, Jing Hua Zhao, Albert Vernon Smith, Jennifer E Huffman, Eva Albrecht, Catherine M Jackson, David M Evans, Gemma Cadby, Myriam Fornage, Ani Manichaikul, Lorna M Lopez, Toby Johnson, Melinda C Aldrich, Thor Aspelund, Inês Barroso, Harry Campbell, Patricia A Cassano, David J Couper, Gudny Eiriksdottir, Nora Franceschini, Melissa Garcia, Christian Gieger, Gauti Kjartan Gislason, Ivica Grkovic, Christopher J Hammond, Dana B Hancock, Tamara B Harris, Adaikalavan Ramasamy, Susan R Heckbert, Markku Heliövaara, Georg Homuth, Pirro G Hysi, Alan L James, Stipan Jankovic, Bonnie R Joubert, Stefan Karrasch, Norman Klopp, Beate Koch, Stephen B Kritchevsky, Lenore J Launer, Yongmei Liu, Laura R Loehr, Kurt Lohman, Ruth J F Loos, Thomas Lumley, Khalid A Al Balushi, Wei Q Ang, R Graham Barr, John Beilby, John D Blakey, Mladen Boban, Vesna Boraska, Jonas Brisman, John R Britton, Guy G Brusselle, Cyrus Cooper, Ivan Curjuric, Santosh Dahgam, Ian J Deary, Shah Ebrahim, Mark Eijgelsheim, Clyde Francks, Darya Gaysina, Raquel Granell, Xiangjun Gu, John L Hankinson, Rebecca Hardy, Sarah E Harris, John Henderson, Amanda Henry, Aroon D Hingorani, Albert Hofman, Patrick G Holt, Jennie Hui, Michael L Hunter, Medea Imboden, Karen A Jameson, Shona M Kerr, Ivana Kolcic, Florian Kronenberg, Jason Z Liu, Jonathan Marchini, Tricia McKeever, Andrew D Morris, Anna-Carin Olin, David J Porteous, Dirkje S Postma, Stephen S Rich, Susan M Ring, Fernando Rivadeneira, Thierry Rochat, Avan Aihie Sayer, Ian Sayers, Peter D Sly, George Davey Smith, Akshay Sood, John M Starr, André G Uitterlinden, Judith M Vonk, S Goya Wannamethee, Peter H Whincup, Cisca Wijmenga, O Dale Williams, Andrew Wong, Massimo Mangino, Kristin D Marciante, Wendy L McArdle, Bernd Meibohm, Alanna C Morrison, Kari E North, Ernst Omenaas, Lyle J Palmer, Kirsi H Pietiläinen, Isabelle Pin, Ozren Pola Sbreve Ek, Anneli Pouta, Bruce M Psaty, Anna-Liisa Hartikainen, Taina Rantanen, Samuli Ripatti, Jerome I Rotter, Igor Rudan, Alicja R Rudnicka, Holger Schulz, So-Youn Shin, Tim D Spector, Ida Surakka, Veronique Vitart, Henry Völzke, Nicholas J Wareham, Nicole M Warrington, H-Erich Wichmann, Sarah H Wild, Jemma B Wilk, Matthias Wjst, Alan F Wright, Lina Zgaga, Tatijana Zemunik, Craig E Pennell, Fredrik Nyberg, Diana Kuh, John W Holloway, H Marike Boezen, Debbie A Lawlor, Richard W Morris, Nicole Probst-Hensch, Jaakko Kaprio, James F Wilson, Caroline Hayward, Mika Kähönen, Joachim Heinrich, Arthur W Musk, Deborah L Jarvis, Sven Gläser, Marjo-Riitta Järvelin, Bruno H Ch Stricker, Paul Elliott, George T O'Connor, David P Strachan, Stephanie J London, Ian P Hall, Vilmundur Gudnason, Martin D Tobin.   

Abstract

Pulmonary function measures reflect respiratory health and are used in the diagnosis of chronic obstructive pulmonary disease. We tested genome-wide association with forced expiratory volume in 1 second and the ratio of forced expiratory volume in 1 second to forced vital capacity in 48,201 individuals of European ancestry with follow up of the top associations in up to an additional 46,411 individuals. We identified new regions showing association (combined P < 5 × 10(-8)) with pulmonary function in or near MFAP2, TGFB2, HDAC4, RARB, MECOM (also known as EVI1), SPATA9, ARMC2, NCR3, ZKSCAN3, CDC123, C10orf11, LRP1, CCDC38, MMP15, CFDP1 and KCNE2. Identification of these 16 new loci may provide insight into the molecular mechanisms regulating pulmonary function and into molecular targets for future therapy to alleviate reduced lung function.

Entities:  

Mesh:

Year:  2011        PMID: 21946350      PMCID: PMC3267376          DOI: 10.1038/ng.941

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Introduction

Pulmonary function, reliably measurable by spirometry, is a heritable trait reflecting the physiological state of the airways and lungs[1]. Pulmonary function measures are important predictors of population morbidity and mortality[2-4], and are used in the diagnosis of chronic obstructive pulmonary disease (COPD), which ranks among the leading causes of death in developed and developing countries[5,6]. A reduced ratio of forced expiratory volume in 1 second (FEV1) to forced vital capacity (FVC) is used to define airway obstruction, and a reduced FEV1 is used to grade the severity of airway obstruction[7]. Recently, two large genome-wide association studies (GWAS), each comprising discovery sets of more than 20,000 individuals of European ancestry identified novel loci for lung function[8,9]. Recognizing the need for larger datasets to increase the power to detect loci of individually modest effect size, we conducted a meta-analysis of 23 lung function GWAS comprising a total of 48,201 individuals of European ancestry (Stage 1) and followed-up potentially novel loci in 17 further studies comprising up to 46,411 individuals (Stage 2). We identified 16 additional novel loci for lung function, and provided evidence corroborating association of loci previously associated with lung function[8-11]. Our findings implicate a number of different mechanisms underlying regulation of lung function and highlight loci shared with complex traits and diseases, including height, lung cancer, and myocardial infarction.

Results

Genome-wide analysis (stage 1)

Meta-analyses for cross-sectional lung function measures were undertaken for approximately 2.5 million genotyped or imputed SNPs across 23 studies with a combined sample size of 48,201 adult individuals of European ancestry. Characteristics of the cohort participants and the genotyping are shown in Supplementary Tables 1A and 1B. FEV1 and FEV1/FVC were adjusted for ancestry principal components, age, age2, sex, and height as covariates. Association testing of the inverse-normal transformed residuals for FEV1 and FEV1/FVC assumed an additive genetic model and was stratified by ever-smoking (versus never-smoking) status. Meta-analyses of the smoking strata within study, and the study-specific results, were undertaken using inverse variance weighting (the inverse of the standard error squared was used as the weight). We applied genomic control twice at study level (to each smoking stratum separately and to the study level pooled estimates) and also at meta-analysis level to avoid inflation of test statistics due to cryptic population structure or relatedness (see Supplementary Table 1A for study level estimates). Our application of genomic control at the three stages is likely to be overly conservative because it has recently been shown that in large meta-analyses, test statistics are expected to be elevated under polygenic inheritance even when there is no population structure[12]. Test statistic inflation (λGC) prior to applying genomic control at meta-analysis level was 1.12 for FEV1 and 1.09 for FEV1/FVC. Genomic inflation estimates increase with sample size, as has been shown for other traits[13-15]; standardised estimates to a sample of 1000 individuals (λGC_1000) were 1.002 for FEV1 and 1.002 for FEV1/FVC. Plots of meta-analysis P-values for FEV1 and FEV1/FVC against a uniform distribution of P values expected under the null hypothesis showed deviations which were attenuated, but persisted, after removal of SNPs in loci reported previously, consistent with additional loci being associated with lung function (Supplementary Figure 1A).

Follow-up analysis (stage 2)

Twenty-nine new loci showing evidence of association with lung function (P<3×10−6) in Stage 1 were followed up in Stage 2 by utilizing in silico data from seven studies, and by undertaking additional genotyping in 10 studies for the 10 highest-ranked SNPs (Figure 1). Full details of the SNP selection are given in the Online Methods. Inverse variance weighting meta-analysis was performed across Stages 1 and 2, and two-sided p-values were obtained for the pooled estimates. Sixteen new loci reached genome-wide significance (P<5×10−8) and showed consistent direction of effects in both stages, comprising 12 new loci for FEV1/FVC, 3 new loci for FEV1, and one new locus reaching genome-wide significance for both traits (Figure 2, Table 1). To assess the heterogeneity across studies included in Stage 1 and Stage 2, Chi-square tests were undertaken for all 16 SNPs; and none of them was statistically significant after applying a Bonferroni correction for 16 tests. The sentinel SNPs at these loci were in or near the genes MFAP2 (1p36.13), TGFB2/LYPLAL1 (1q41), HDAC4/FLJ43879 (2q37.3), RARB (3p24.2), MECOM (EVI1) (3q26.2), SPATA9/RHOBTB3 (5q15), ARMC2 (6q21), NCR3/AIF1 (6p21.33), ZKSCAN3 (6p22.1), CDC123 (10p13), C10orf11 (10q22.3), LRP1 (12q13.3), CCDC38 (12q22), MMP15 (16q13), CFDP1 (16q23.1) and KCNE2/C21orf82 (21q22.11) (Supplementary Figures 1B and 1C). The strongest signals in AGER (rs2070600)[8,9] and two of the novel signals (rs6903823 in ZKSCAN3 and rs2857595, upstream of NCR3) lie within a~3.8Mb interval at 6p21.32-22.1 which is characterised by long-range linkage disequilibrium. Nevertheless, the leading SNPs in these regions which are within the major histocompatibility complex (MHC) were statistically independent (Supplementary Note).
Figure 1

Study design. A total of 34 SNPs showing novel evidence of association (P<3×10−6) with FEV1 and/or FEV1/FVC in a meta-analysis of the Stage 1 studies were followed up in Stage 2. Studies with a combined total of 24,737 individuals undertook genotyping and association testing of the top 10 SNPs. Seven studies (*) with a combined total of 11,275 individuals had genome-wide association data and provided results for up to 34 SNPs. #GS:SFHS undertook genotyping on a 32-SNP multiplex genotyping platform and so included the 32 top ranking SNPs (including proxies and both SNPs from regions which showed association with both FEV1 and FEV1/FVC). This assay failed for one SNP (rs3769124) which was subsequently replaced with the 33rd SNP (rs4762767). SNP rs2284746 was excluded due to poor clustering. Although rs3743563 was chosen as proxy for rs12447804 which had N effective <80% in the Stage 1 meta-analysis, BHS2 were unable to genotype rs3743563 and so undertook genotyping for rs12447804 instead. See Table 1 for definitions of abbreviations.

Figure 2

Manhattan plots of association results for (a) FEV1/FVC and (b) FEV1. Manhattan plots ordered by chromosome position. SNPs for which −log10P>5 are indicated in red. Novel regions which reached genome-wide significance after Stage 1 + Stage 2 are labelled.

Table 1

Loci associated with lung function. Shown are FEV1 and FEV1/FVC results for the leading SNPs, ordered by chromosome and position, for each independent locus associated (P <5×10−8) with FEV1 or FEV1/FVC in a joint analysis of up to 94,612 individuals of European ancestry from the SpiroMeta-CHARGE GWAS (Stage 1) and follow-up (Stage 2). Two-sided p-values are given for Stage 1, Stage 2 and the joint meta-analysis of all stages. P-values reaching genome-wide significance (P<5×10−8) in the joint meta-analysis of all stages are indicated in bold. SNPs reaching independent replication in Stage 2 (P=0.05/34 = 1.47×10−3) are indicated with their Stage 2 p-value in bold. The sample sizes (N) shown are the effective sample sizes. Effective sample size within each study is the product of sample size and imputation quality metric. Joint meta-analysis includes data from Stage 1 and Stage 2. Beta values reflect effect-size estimates on an inverse-normal transformed scale after adjustments for age, age2, sex, height and ancestry principal components. The estimated proportion of the variance explained by each SNP can be found in Supplementary Table 6.

Chr.SNP_ID(NCBI36 position), functionCodedallelefreq.MeasureStage 1Stage 2Joint meta-analysis ofall stages
Beta (s.e.m.)PCodedallele freq.NBeta (s.e.m.)PCodedallele freq.NBeta (s.e.m.)P
1rs2284746(17179262), MFAP2(intron)GFEV1/FVC−0.042 (0.007)2.47E-090.51645944−0.038 (0.007)2.64E-070.52235371−0.04 (0.005)7.50E-16
FEV10.008 (0.007)2.78E-010.006 (0.007)3.70E-010.007 (0.005)1.48E-01
1rs993925(216926691), TGFB2(downstream)TFEV1/FVC0.04 (0.007)2.54E-070.308424020.023 (0.01)1.76E-020.348214140.034 (0.006)1.16E-08
FEV10.025 (0.007)1.51E-030.003 (0.007)7.29E-010.014 (0.005)8.71E-03
2rs12477314(239542085), HDAC4(downstream)TFEV1/FVC0.052 (0.008)4.48E-090.202455850.031 (0.008)8.41E-050.206458210.041 (0.006)1.68E-12
FEV10.032 (0.008)2.77E-040.025 (0.007)1.82E-040.028 (0.005)1.02E-07
3rs1529672(25495586), RARB(intron)CFEV1/FVC−0.06 (0.009)7.75E-100.82940624−0.038 (0.009)1.16E-050.83145466−0.048 (0.006)3.97E-14
FEV1−0.037 (0.009)1.78E-04−0.011 (0.007)9.33E-02−0.02 (0.006)2.16E-04
3rs1344555(170782913), MECOM(intron)TFEV1/FVC−0.019 (0.008)2.61E-020.20546067−0.017 (0.012)1.55E-010.20921313−0.018 (0.007)6.65E-03
FEV1−0.042 (0.008)1.91E-06−0.025 (0.009)6.44E-03−0.034 (0.006)2.65E-08
5rs153916(95062456), SPATA9(upstream)TFEV1/FVC−0.033 (0.007)2.06E-060.55247530−0.025 (0.009)6.67E-030.53521647−0.031 (0.005)2.12E-08
FEV1−0.001 (0.007)8.91E-010.004 (0.007)6.22E-010.001 (0.005)8.20E-01
6rs6903823(28430275), ZKSCAN3(intron)/ZNF323(intron)GFEV1/FVC−0.027 (0.008)2.28E-030.20947057−0.013 (0.011)2.34E-010.24621489−0.021 (0.007)1.19E-03
FEV1−0.046 (0.008)2.00E-07−0.029 (0.008)4.75E-04−0.037 (0.006)2.18E-10
6rs2857595(31676448), NCR3(upstream)GFEV1/FVC0.049 (0.009)7.86E-080.809455400.028 (0.008)5.36E-040.796461070.037 (0.006)2.28E-10
FEV10.04 (0.009)1.46E-050.017 (0.007)9.41E-030.025 (0.005)1.30E-06
6rs2798641(109374743), ARMC2(intron)TFEV1/FVC−0.047 (0.009)2.81E-070.18346369−0.03 (0.012)1.57E-020.17921173−0.041 (0.007)8.35E-09
FEV1−0.046 (0.009)5.39E-07−0.009 (0.01)3.35E-01−0.03 (0.006)4.69E-06
10rs7068966(12317998), CDC123(intron)TFEV1/FVC0.045 (0.007)1.28E-100.519470850.023 (0.006)3.86E-040.518460670.033 (0.005)6.13E-13
FEV10.04 (0.007)1.19E-080.022 (0.005)3.56E-050.029 (0.004)2.82E-12
10rs11001819(77985230), C10orf11(intron)GFEV1/FVC−0.019 (0.007)6.50E-030.52245546−0.006 (0.006)3.17E-010.50645932−0.012 (0.005)7.58E-03
FEV1−0.041 (0.007)1.42E-08−0.022 (0.005)3.10E-05−0.029 (0.004)2.98E-12
12rs11172113(55813550), LRP1(intron)TFEV1/FVC−0.035 (0.007)1.36E-060.60745387−0.026 (0.01)5.83E-030.5920509−0.032 (0.006)1.24E-08
FEV1−0.021 (0.007)3.55E-03−0.003 (0.007)6.94E-01−0.013 (0.005)1.19E-02
12rs1036429(94795559), CCDC38(intron)TFEV1/FVC0.049 (0.008)1.24E-080.2478140.028 (0.008)3.35E-040.214463110.038 (0.006)2.30E-11
FEV10.01 (0.008)2.67E-010.004 (0.006)5.38E-010.006 (0.005)2.26E-01
16rs12447804(56632783), MMP15(intron)TFEV1/FVC−0.053 (0.009)7.12E-080.20835123−0.021 (0.01)4.20E-020.22224398−0.038 (0.007)3.59E-08
FEV1−0.017 (0.009)8.02E-020.004 (0.007)5.71E-01−0.004 (0.006)4.73E-01
16rs2865531(73947817), CFDP1(intron)TFEV1/FVC0.039 (0.007)2.30E-080.418475940.024 (0.006)1.94E-040.409463040.031 (0.005)1.77E-11
FEV10.024 (0.007)6.30E-040.011 (0.005)3.89E-020.016 (0.004)1.09E-04
21rs9978142(34574109), KCNE2(upstream)TFEV1/FVC−0.048 (0.009)8.23E-070.15644577−0.031 (0.013)1.75E-020.14920944−0.043 (0.008)2.65E-08
FEV1−0.012 (0.009)2.47E-01−0.015 (0.01)1.35E-01−0.013 (0.007)5.57E-02

Gene expression

We investigated mRNA expression of the nearest gene for each of the 16 novel loci in human lung tissue and in a range of human primary cells including lung, brain, airway smooth muscle cells and bronchial epithelial cells. Transcripts were detected for all selected genes in lung tissue except CCDC38 and transcripts for most genes were also detected in airway smooth muscle cells and in bronchial epithelial cells (Table 2). As we were unable to detect expression of CCDC38 in any tissue, we also examined expression of SNPRF, which is the adjacent gene (Table 2), and found expression in all four cell types. TGFB2, MFAP2, EVI1 and MMP15 were expressed in one or more lung cell types but not in peripheral blood mononuclear cells providing evidence that these genes may exhibit tissue-specific expression.
Table 2

Expression profiling of candidate genes in the lung and periphery. (+) indicates the gene is expressed in the cell type used and (−) indicates the gene expression at mRNA level is not detected following 40 cycles of PCR. PCR profiling of gene transcripts in the human lung demonstrates expression of all candidates except CCDC38, for which two sets of primers were designed and tested under different optimization conditions. None of these assays detected expression of CCDC38 in the cell types analysed. The SNRPF gene neighbouring CCDC38 and harbouring SNPs with strong LD with CCDC38’s sentinel SNP was assayed instead. All PCR products were sequence verified. Glyceraldehyde-3-phosphate dehydrogenase gene (GAPDH) was used as a positive control for the cDNA and was expressed in all tissues. HASM: Human airway smooth muscle. HBEC: Human bronchial epithelial cells. PBMC: peripheral blood mononuclear cells.

Sentinel SNP(relationshipto gene)Chr.GenePutative Function of encodedproteinTissue
LungHASMHBECPBMC
rs993925(intron)1TGFB2Cytokine with roles in pro-fibroticcytokine modulating epithelial repairmechanisms and extracellular matrixhomeostasis including collagendeposition[40]++
rs2284746(intron)1MFAP2Major antigen of elastin-associatedmicrofibrils17 and a candidate forinvolvement in the etiology of inheritedconnective tissue diseases.+++
rs12477314(downstream)2HDAC4Deacetylase of histone surrounding DNAthus influencing transcription factoraccess to the DNA possibly repressinggene transcription.++++
rs1344555(intron)3EVI1Zinc finger transcription factor, encodedas part of MDS1/EVI1 complex locus(MECOM).+++
rs1529672(intron)3RARBNuclear retinoic acid receptor responsiveto retinoic acid, a vitamin A derivativeand which also controls cell proliferationand differentiation.++++
rs153916(intron)5SPATA9Initially identified as a mediator ofspermatogenesis, other family membersmay have a role in pancreaticdevelopment and β-cell proliferation[41]++++
rs2798641(intron)6ARMC2Function unknown although other familymembers have been identified as havingroles in cell signalling, proteindegradation and cytoskeleton functions[42]++++
rs2857595(upstream)6NCR3Required for efficient cytotoxicityresponses by natural killer cells againstnormal cells and tumours[43]++
rs6903823(intron)6ZKSCAN3Transcription factor involved in cellgrowth/cell cycle/signal transduction++++
rs7068966(intron)10CDC123Homologue in yeast shown to be acritical control protein modulatingEukaryotic initiation factor 2 in times ofcell stress++++
rs11001819(intron)10C10orf11Function unknown++++
rs11172113(intron)12LRP1Potentially diverse roles including cellsignalling and migration[44]++++
rs1036429(intron)12CCDC38Function unknown although other familymembers involved in a diverse array offunctions skeletal and motor function[45]
rs1036429(r2=0.96 withrs4762633 inSNRPF)12SNRPFSmall nuclear ribonucleoprotein F++++
rs12447804(intron)16MMP15Member of a large protease family withdiverse functional roles via proteaseactivity and specificity including; tissueremodelling, wound healing,angiogenesis, and tumor invasion.+++
rs2865531(intron)16CFDP1Craniofacial Development Protein 1++++
rs9978142(upstream)21KCNE2KCNQ1-KCNE2 K+ channels maymodulate transepithelial anion secretionin Calu3 airway epithelial cells[46].++
Referencegene12GAPDH++++
We assessed whether SNPs in these new regions, or their proxies (r2>0.6), were associated with gene expression using a database of expression-associated SNPs in lymphoblastoid cell lines[16]. Four loci showed regional (cis) effects on expression (P<1×10−7, Supplementary Note). A proxy for our sentinel SNP in CFDP1, rs2865531, coincided with the peak of the expression signal for CFDP1 and the strongest proxy for rs6903823 in ZKSCAN3 coincided with the peak of expression for ZSCAN12.

Plausible pathways for lung function involving new loci

The putative function of the genes within, or closest to, the association peaks identify a range of plausible mechanisms for impacting lung function. The most statistically significant new signal for FEV1/FVC (P=7.5×10−16) was in the gene encoding MFAP2, an antigen of elastin-associated microfibrils[17], although correlated SNPs in the region potentially implicate other genes that could plausibly influence lung function, such as CROCC, which encodes rootletin, a component of cilia[18]. Our second strongest new signal, also for FEV1/FVC, was in the gene encoding the retinoic acid receptor beta (RARB). Rarb-null knockout mice exhibit premature alveolar septation[19]. The third most statistically significant new signal for FEV1/FVC, and the most statistically significant new signal for FEV1, was in CDC123. This was the only novel region to show genome-wide association with both traits. CDC123 encodes a homologue of a yeast cell division cycle protein which plays a critical role in modulating Eukaryotic initiation factor 2 in times of cell stress[20]. The fourth signal for FEV1/FVC is downstream of HDAC4 which encodes a histone deacetylase; reductions in the expression of other histone deacetylases (specifically HDAC2, HDAC5 and HDAC8) have been noted in COPD[21]. The regions we observed in the MHC are much more difficult to localize with multiple genes being tagged by the top SNP, including non-synonymous SNPs in ZKSCAN3, PGBD1, ZSCAN12, ZNF323, TCF19, LTA, C6orf15 and GPANK1 (also known as BAT4) (Supplementary Table 2). At 6p21.33, the strongest association with lung function was observed for rs2857595, which is in linkage disequilibrium (LD, r2=0.47) with a non-synonymous SNP in LTA (encoding lymphotoxin alpha) and with a SNP in the upstream promoter region of TNFA (encoding tumour necrosis factor alpha, r2=0.86), both of which are plausible candidates[22,23]. Our top SNP in MMP15 is in strong LD (r2=1) with a non-synonymous SNP (rs3743563, which has an association with FEV1/FVC at P=1.8×10−7) within the same gene. Plausible mechanisms implicated by the other novel signals of association with lung function reported here include TGF-beta signalling; TGFB2 expression is upregulated in bronchial epithelial cells in asthma[24]. The putative function of key genes (as defined by LD with the leading SNP) in each of the 16 loci, and relevant findings from animal models, are summarised in Table 2 and detailed in Supplementary Table 2.

Associations with lung function in children

Alleles representing 11 of the 16 novel loci showed directionally consistent effects on lung function in 6,281 children (7 to 9 years of age) (Supplementary Table 3A) suggesting that genetic determination of lung function in adults may in part act via effects on lung development, or alternatively, that some genetic determinants of lung growth and lung function decline are shared.

Association of lung function loci with other traits

Although we stratified for ever-smoking versus never-smoking, we did not adjust for the amount smoked. In order to investigate the possibility that the associations at any of our 16 novel regions were driven by an effect of the SNP on smoking behaviour, we evaluated in silico data for associations with smoking amount from the Ox-GSK consortium[25] for the leading SNPs in these 16 regions. None of these 16 SNPs showed statistically significant association with the number of cigarettes smoked per day (Supplementary Table 3B). In addition, in our Stage 1 and Stage 2 datasets combined, we assessed whether the estimated effect sizes of the variants on lung function phenotypes differed substantially between ever-smokers and never-smokers (Supplementary Table 4) across the 16 loci. For the most strongly associated trait at each locus, we tested the SNP interaction with ever-smoking (versus never-smoking). None of the 16 novel loci showed a significant interaction (Bonferroni corrected threshold for 16 independent SNPs P=0.003125). These analyses suggest that the genetic effects we have identified underlie lung function variability irrespective of smoking exposure. Our lung function associations were adjusted for height, but there are some overlaps between loci associated with height and those associated with lung function. Therefore, we evaluated in silico data for height associations of our novel regions in the GIANT consortium[14] dataset. The G allele of rs2284746 (MFAP2, intron), which was associated with decreased FEV1/FVC was associated with increased height (Supplementary Table 3C). Given reported associations between lung cancer and either COPD or lung function decline, we also assessed in silico data for sentinel or proxy SNPs in these 16 regions for associations with lung cancer in the International Lung Cancer Consortium (ILCCO) GWAS meta-analysis[26]. Alleles associated with reduced lung function were associated with risk of lung cancer at the strongest available proxy SNP for rs2857595 (upstream of NCR3) at 6p21.33 (rs3099844, r2=0.67), and the strongest proxy SNP for rs6903823 (SNP in intron of ZKSCAN3 and ZNF323) at 6p22.1 (rs209181, r2=0.69) (lung cancer associations, P=2.2×10−7 and P=3.4×10−5, respectively, Supplementary Table 3D). No significant associations with lung cancer were seen at the other new loci (proxy SNPs were available for 15 of the 16 loci, Bonferroni corrected P<0.0033). In addition to the effects on height, smoking and lung cancer described above, we examined the literature for evidence for associations with other traits for each of the 16 new loci (detailed in Supplementary Table 2). Genome-wide significant associations (P<5×10−8) have been reported in KCNE2 with myocardial infarction[27], and at 6p21.33 near NCR3/AIF1, with neonatal lupus[28] and with systemic lupus erythematosus[29]. Other significant complex disease associations have also been noted in the regions of CDC123 (type 2 diabetes[30]), CFDP1 (type 1 diabetes[31]) and MECOM (blood pressure[32,33]), but with weaker LD (r2<0.3) between the reported SNP and the sentinel SNP for lung function in the region (Supplementary Table 2).

Proportion of variance explained by loci discovered to date

Associations in 10 loci previously reported for lung function[8,9] reached genome-wide significance (P<5×10−8) in our Stage 1 data, namely loci in or near TNS1, FAM13A, GSTCD/NPNT, HHIP, HTR4, ADAM19, AGER, GPR126, PTCH1, and TSHD4 (Supplementary Table 5A). Thus, a total of 26 regions showed genome-wide significant association with lung function in our study. In aggregate, variants at these 26 regions explain approximately 3.2% of the additive polygenic variance for FEV1/FVC and 1.5% for FEV1 (see Supplementary Note). Following the approach described by Park et al.[34] we estimated that there is a total of 102 (95% CI 57-155) independent variants with similar effect sizes to the 26 variants we report. In combination these 102 variants, comprising 26 discovered variants and 76 putative undiscovered variants, collectively explain around 7.5% of the additive polygenic variance for FEV1/FVC and 3.4% for FEV1 (see Supplementary Table 6, Online Methods and Supplementary Note).

Discussion

In meta-analysis of 23 studies comprising 48,201 individuals of European ancestry and follow-up in 17 studies comprising up to 46,411 individuals, we report genome-wide significant associations with an additional 12 regions for FEV1/FVC, an additional three regions for FEV1 and one additional region associated with both FEV1 and FEV1/FVC. We also confirm genome-wide association with 10 regions previously associated with lung function, bringing to 26 the total number of loci associated with lung function in these data. Most of the new loci are in regions not previously suspected to have been involved in lung development, the control of pulmonary function or risk of developing COPD. Elucidating the mechanisms through which these regions influence lung function should lead to a more complete understanding of lung function regulation and the pathogenesis of COPD. Four of the new loci (MFAP2, ZKSCAN3, near NCR3 and near KCNE2) we show to be associated with lung function are also associated with other complex traits and diseases (P<5×10−8 for the other trait at a SNP with r2>0.3 with the top lung function SNP in the region). Understanding the intermediates underlying these pleiotropic effects could also reveal crucial insights into the pathophysiology of lung disease. One potential explanation is that these loci underlie control of the mechanisms regulating the development and resolution of inflammation and subsequent tissue remodelling in a range of tissues. The effect sizes of the variants in the 26 loci associated with lung function explain a modest proportion of the additive genetic variance in FEV1/FVC and in FEV1, even after accounting for putative undetected variants with a similar distribution of effect sizes[34]. Our findings are consistent with those from other common complex traits, where it is thought that many as yet unidentified common and rare sequence variants, and potentially structural variants could explain the remaining heritability[35]. That our study more than doubled the number of loci known to be associated with lung function underlines the utility of large sample sizes to achieve the power to detect common variants associated with complex traits. Nevertheless, it is likely that additional variants with similar effect sizes remain undiscovered[14]. In addition, our study was not designed to detect rare variants or structural variants associated with lung function. Identification of rare variants associated with lung function could be helpful in narrowing the scope of ongoing functional work to those genes most likely to be causally related to the association signals we detected. Our study focused on cross-sectional measures of lung function. Adult lung function at a particular time point is influenced by the peak lung function achieved by 25-35 years of age, as well as the rate of decline of lung function after that peak[36]. The 26 loci now confirmed to be associated with lung function could affect either pre- or post-natal lung development and growth or decline in lung function during adulthood, or both. We showed consistent directions of estimated effects on lung function between adults and children at 7-9 years of age for SNPs at 11 of the 16 new loci, and eight of 10 previously reported loci (Supplementary Table 3A). The results we show for lung function in children provide some indication that these loci affect lung function development, although studies in larger populations of children would provide greater clarity for SNPs in the new loci. Further investigations will be required in large populations with longitudinal data to delineate the influence of these variants on the rates of development of, and decline in, lung function and on the risk of developing COPD. Of the sentinel SNPs at the 16 new loci associated with lung function, only rs2284746 (MFAP2) was associated with height in the GIANT consortium[14] dataset. The G allele of rs2284746 was associated with both increased height and reduced lung function. A similar relationship between lung function and height was previously reported for the G allele of rs3817928 in GPR126[8,14], which is associated with decreased height, but with increased FEV1/FVC. A further three of the 180 loci found to be associated with height[14] showed association (for 180 loci, Bonferroni corrected threshold P=2.8×10−4) with either FEV1 (CLIC4 and BMP6) or FEV1/FVC (PIP4K2B) (Supplementary Table 3E). In each case, the allele associated with an increase in height was associated with a decrease in lung function. This is not the case for the association of rs1032296 near HHIP, which has shown consistent directions of effects on lung function and height[14,11]. However, the strongest SNP associated with height in the HHIP region lies within an intron of HHIP but shows no association with FEV1 or FEV1/FVC. Furthermore, while height is an important predictor of FEV1, this is not true for its ratio to FVC[37]. These observations argue against the associations with lung function at these loci being simply due to incomplete adjustment for height. We stratified by ever- and never-smoker status in our analyses and in our investigation of amount smoked in the Ox-GSK consortium[25] none of the sentinel SNPs in the 16 new regions showed association with the number of cigarettes smoked per day. Additionally, none of these regions was associated with ever-smoking in the Ox-GSK consortium data (Supplementary Table 3B). Thus the SNP associations with lung function we observed are unlikely to have arisen simply as a consequence of inadequate adjustment for smoking. We did not observe any interactions with ever-smoking for any of the sentinel SNPs in the 16 new regions that exceeded a Bonferroni-corrected significance level (for 16 SNPs). Thus, the effects on lung function of the novel variants we identified are apparent in both ever-smokers and in never-smokers, and the effects of smoking and of these genetic variants may be independent and additive. In other common complex diseases, follow up studies that incorporate common genetic risk variants into models to predict disease have not been shown to add substantially to existing risk models, particularly when such models already include family history[38,39]. The same may also prove to be true for the 26 genetic variants described in this paper, as the effect size of any individual variant is small, but further work is required in this area. The major utility of our findings will be in the knowledge they provide about previously unknown pathways underlying lung function. Elucidating the mechanisms that these genes are involved in will lead to improved understanding of the regulation of lung function and potentially to new therapeutic targets for COPD.
  53 in total

1.  Performance of common genetic variants in breast-cancer risk models.

Authors:  Sholom Wacholder; Patricia Hartge; Ross Prentice; Montserrat Garcia-Closas; Heather Spencer Feigelson; W Ryan Diver; Michael J Thun; David G Cox; Susan E Hankinson; Peter Kraft; Bernard Rosner; Christine D Berg; Louise A Brinton; Jolanta Lissowska; Mark E Sherman; Rowan Chlebowski; Charles Kooperberg; Rebecca D Jackson; Dennis W Buckman; Peter Hui; Ruth Pfeiffer; Kevin B Jacobs; Gilles D Thomas; Robert N Hoover; Mitchell H Gail; Stephen J Chanock; David J Hunter
Journal:  N Engl J Med       Date:  2010-03-18       Impact factor: 91.245

Review 2.  Forced expiratory volume in one second: not just a lung function test but a marker of premature death from all causes.

Authors:  R P Young; R Hopkins; T E Eaton
Journal:  Eur Respir J       Date:  2007-10       Impact factor: 16.671

3.  A genome-wide association study of global gene expression.

Authors:  Anna L Dixon; Liming Liang; Miriam F Moffatt; Wei Chen; Simon Heath; Kenny C C Wong; Jenny Taylor; Edward Burnett; Ivo Gut; Martin Farrall; G Mark Lathrop; Gonçalo R Abecasis; William O C Cookson
Journal:  Nat Genet       Date:  2007-09-16       Impact factor: 38.330

4.  Ventilatory function, height, and mortality among lifelong non-smokers.

Authors:  D P Strachan
Journal:  J Epidemiol Community Health       Date:  1992-02       Impact factor: 3.710

5.  Decreased histone deacetylase activity in chronic obstructive pulmonary disease.

Authors:  Kazuhiro Ito; Misako Ito; W Mark Elliott; Borja Cosio; Gaetano Caramori; Onn Min Kon; Adam Barczyk; Shizu Hayashi; Ian M Adcock; James C Hogg; Peter J Barnes
Journal:  N Engl J Med       Date:  2005-05-12       Impact factor: 91.245

6.  Retinoic acid receptor-beta: an endogenous inhibitor of the perinatal formation of pulmonary alveoli.

Authors:  G D Massaro; D Massaro; W Y Chan; L B Clerch; N Ghyselinck; P Chambon; R A Chandraratna
Journal:  Physiol Genomics       Date:  2000-11-09       Impact factor: 3.107

Review 7.  Beyond endocytosis: LRP function in cell migration, proliferation and vascular permeability.

Authors:  A P Lillis; I Mikhailenko; D K Strickland
Journal:  J Thromb Haemost       Date:  2005-08       Impact factor: 5.824

8.  Transforming growth factor-beta2 induces bronchial epithelial mucin expression in asthma.

Authors:  Hong Wei Chu; Silvana Balzar; Gregory J Seedorf; Jay Y Westcott; John B Trudeau; Phil Silkoff; Sally E Wenzel
Journal:  Am J Pathol       Date:  2004-10       Impact factor: 4.307

9.  The natural history of chronic airflow obstruction revisited: an analysis of the Framingham offspring cohort.

Authors:  Robab Kohansal; Pablo Martinez-Camblor; Alvar Agustí; A Sonia Buist; David M Mannino; Joan B Soriano
Journal:  Am J Respir Crit Care Med       Date:  2009-04-02       Impact factor: 21.405

10.  Meta-analyses of genome-wide association studies identify multiple loci associated with pulmonary function.

Authors:  Dana B Hancock; Mark Eijgelsheim; Jemma B Wilk; Sina A Gharib; Laura R Loehr; Kristin D Marciante; Nora Franceschini; Yannick M T A van Durme; Ting-Hsu Chen; R Graham Barr; Matthew B Schabath; David J Couper; Guy G Brusselle; Bruce M Psaty; Cornelia M van Duijn; Jerome I Rotter; André G Uitterlinden; Albert Hofman; Naresh M Punjabi; Fernando Rivadeneira; Alanna C Morrison; Paul L Enright; Kari E North; Susan R Heckbert; Thomas Lumley; Bruno H C Stricker; George T O'Connor; Stephanie J London
Journal:  Nat Genet       Date:  2009-12-13       Impact factor: 41.307

View more
  211 in total

1.  The Rotterdam Study: 2016 objectives and design update.

Authors:  Albert Hofman; Guy G O Brusselle; Sarwa Darwish Murad; Cornelia M van Duijn; Oscar H Franco; André Goedegebure; M Arfan Ikram; Caroline C W Klaver; Tamar E C Nijsten; Robin P Peeters; Bruno H Ch Stricker; Henning W Tiemeier; André G Uitterlinden; Meike W Vernooij
Journal:  Eur J Epidemiol       Date:  2015-09-19       Impact factor: 8.082

2.  A Genome-Wide Association Study of Post-bronchodilator Lung Function in Children with Asthma.

Authors:  John M Brehm; Sze Man Tse; Damien C Croteau-Chonka; Erick Forno; Augusto A Litonjua; Benjamin A Raby; Wei Chen; Qi Yan; Nadia Boutaoui; Edna Acosta-Pérez; Lydiana Avila; Scott T Weiss; Manuel Soto-Quiros; Michelle M Cloutier; Donglei Hu; Maria Pino-Yanes; Sally E Wenzel; Melissa L Spear; Jay K Kolls; Esteban G Burchard; Glorisa Canino; Juan C Celedón
Journal:  Am J Respir Crit Care Med       Date:  2015-09-01       Impact factor: 21.405

3.  Genomic analysis of snub-nosed monkeys (Rhinopithecus) identifies genes and processes related to high-altitude adaptation.

Authors:  Li Yu; Guo-Dong Wang; Jue Ruan; Yong-Bin Chen; Cui-Ping Yang; Xue Cao; Hong Wu; Yan-Hu Liu; Zheng-Lin Du; Xiao-Ping Wang; Jing Yang; Shao-Chen Cheng; Li Zhong; Lu Wang; Xuan Wang; Jing-Yang Hu; Lu Fang; Bing Bai; Kai-Le Wang; Na Yuan; Shi-Fang Wu; Bao-Guo Li; Jin-Guo Zhang; Ye-Qin Yang; Cheng-Lin Zhang; Yong-Cheng Long; Hai-Shu Li; Jing-Yuan Yang; David M Irwin; Oliver A Ryder; Ying Li; Chung-I Wu; Ya-Ping Zhang
Journal:  Nat Genet       Date:  2016-07-11       Impact factor: 38.330

4.  On the simultaneous association analysis of large genomic regions: a massive multi-locus association test.

Authors:  Dandi Qiao; Michael H Cho; Heide Fier; Per S Bakke; Amund Gulsvik; Edwin K Silverman; Christoph Lange
Journal:  Bioinformatics       Date:  2013-11-20       Impact factor: 6.937

5.  The Rotterdam Study: 2014 objectives and design update.

Authors:  Albert Hofman; Sarwa Darwish Murad; Cornelia M van Duijn; Oscar H Franco; André Goedegebure; M Arfan Ikram; Caroline C W Klaver; Tamar E C Nijsten; Robin P Peeters; Bruno H Ch Stricker; Henning W Tiemeier; André G Uitterlinden; Meike W Vernooij
Journal:  Eur J Epidemiol       Date:  2013-11-21       Impact factor: 8.082

Review 6.  Of pleiotropy and trajectories: Does the TGF-β pathway link childhood asthma and chronic obstructive pulmonary disease?

Authors:  Avery DeVries; Donata Vercelli
Journal:  J Allergy Clin Immunol       Date:  2018-04-27       Impact factor: 10.793

7.  Genetic Control of Fatty Acid β-Oxidation in Chronic Obstructive Pulmonary Disease.

Authors:  Zhiqiang Jiang; Nelson H Knudsen; Gang Wang; Weiliang Qiu; Zun Zar Chi Naing; Yan Bai; Xingbin Ai; Chih-Hao Lee; Xiaobo Zhou
Journal:  Am J Respir Cell Mol Biol       Date:  2017-06       Impact factor: 6.914

Review 8.  Lung functional development and asthma trajectories.

Authors:  Fabienne Decrue; Olga Gorlanova; Jakob Usemann; Urs Frey
Journal:  Semin Immunopathol       Date:  2020-01-27       Impact factor: 9.623

9.  Secreted phosphoprotein 1 is a determinant of lung function development in mice.

Authors:  Koustav Ganguly; Timothy M Martin; Vincent J Concel; Swapna Upadhyay; Kiflai Bein; Kelly A Brant; Leema George; Ankita Mitra; Tania A Thimraj; James P Fabisiak; Louis J Vuga; Cheryl Fattman; Naftali Kaminski; Holger Schulz; George D Leikauf
Journal:  Am J Respir Cell Mol Biol       Date:  2014-11       Impact factor: 6.914

Review 10.  Risk of Lung Disease in PI MZ Heterozygotes. Current Status and Future Research Directions.

Authors:  Edwin K Silverman
Journal:  Ann Am Thorac Soc       Date:  2016-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.