Literature DB >> 30175238

Meta-analysis of exome array data identifies six novel genetic loci for lung function.

Victoria E Jackson1, Jeanne C Latourelle2, Louise V Wain1,3, Albert V Smith4,5, Megan L Grove6, Traci M Bartz7, Ma'en Obeidat8, Michael A Province9, Wei Gao10, Beenish Qaiser11, David J Porteous12, Patricia A Cassano13,14, Tarunveer S Ahluwalia15,16, Niels Grarup15, Jin Li17, Elisabeth Altmaier18, Jonathan Marten19, Sarah E Harris20,21, Ani Manichaikul22, Tess D Pottinger23,24, Ruifang Li-Gao25, Allan Lind-Thomsen26, Anubha Mahajan27, Lies Lahousse28,29, Medea Imboden30,31, Alexander Teumer32, Bram Prins33, Leo-Pekka Lyytikäinen34,35, Gudny Eiriksdottir5, Nora Franceschini36, Colleen M Sitlani37, Jennifer A Brody37, Yohan Bossé38, Wim Timens39,40, Aldi Kraja9, Anu Loukola11, Wenbo Tang13,41, Yongmei Liu42, Jette Bork-Jensen15, Johanne M Justesen16, Allan Linneberg43,44,45, Leslie A Lange46, Rajesh Rawal18, Stefan Karrasch47,48, Jennifer E Huffman19, Blair H Smith49, Gail Davies20,50, Kristin M Burkart23, Josyf C Mychaleckyj22, Tobias N Bonten51,52, Stefan Enroth26, Lars Lind53, Guy G Brusselle28,54,55, Ashish Kumar27,30,31,56, Beate Stubbe57, Mika Kähönen58,59, Annah B Wyss60, Bruce M Psaty61,62, Susan R Heckbert63, Ke Hao64,65, Taina Rantanen66, Stephen B Kritchevsky67, Kurt Lohman42, Tea Skaaby43, Charlotta Pisinger43, Torben Hansen15, Holger Schulz47,68, Ozren Polasek69, Archie Campbell12, John M Starr20,70, Stephen S Rich22, Dennis O Mook-Kanamori25,52, Åsa Johansson26, Erik Ingelsson71,72, André G Uitterlinden54,73, Stefan Weiss74,75, Olli T Raitakari76,77, Vilmundur Gudnason4,5, Kari E North78, Sina A Gharib79, Don D Sin8,80, Kent D Taylor81, George T O'Connor82,83, Jaakko Kaprio11,84,85, Tamara B Harris86, Oluf Pederson15, Henrik Vestergaard15,16, James G Wilson87, Konstantin Strauch88,89, Caroline Hayward19, Shona Kerr19, Ian J Deary20,50, R Graham Barr23,90, Renée de Mutsert25, Ulf Gyllensten26, Andrew P Morris27,91, M Arfan Ikram54,92,93, Nicole Probst-Hensch30,31, Sven Gläser57,94, Eleftheria Zeggini33, Terho Lehtimäki34,35, David P Strachan95, Josée Dupuis10, Alanna C Morrison6, Ian P Hall96, Martin D Tobin1,3, Stephanie J London60.   

Abstract

Background: Over 90 regions of the genome have been associated with lung function to date, many of which have also been implicated in chronic obstructive pulmonary disease.
Methods: We carried out meta-analyses of exome array data and three lung function measures: forced expiratory volume in one second (FEV 1), forced vital capacity (FVC) and the ratio of FEV 1 to FVC (FEV 1/FVC). These analyses by the SpiroMeta and CHARGE consortia included 60,749 individuals of European ancestry from 23 studies, and 7,721 individuals of African Ancestry from 5 studies in the discovery stage, with follow-up in up to 111,556 independent individuals.
Results: We identified significant (P<2·8x10 -7) associations with six SNPs: a nonsynonymous variant in RPAP1, which is predicted to be damaging, three intronic SNPs ( SEC24C, CASC17 and UQCC1) and two intergenic SNPs near to LY86 and FGF10. Expression quantitative trait loci analyses found evidence for regulation of gene expression at three signals and implicated several genes, including TYRO3 and PLAU. Conclusions: Further interrogation of these loci could provide greater understanding of the determinants of lung function and pulmonary disease.

Entities:  

Keywords:  COPD; GWAS; Lung function; exome array; respiratory

Year:  2018        PMID: 30175238      PMCID: PMC6081985          DOI: 10.12688/wellcomeopenres.12583.3

Source DB:  PubMed          Journal:  Wellcome Open Res        ISSN: 2398-502X


Introduction

Measures of lung function act as predictors of mortality and morbidity and form the basis for the diagnosis of several diseases, most notably chronic obstructive pulmonary disease (COPD), one of the leading causes of death globally [1]. Environmental factors, including smoking and exposure to air pollution play a significant role in lung function; however there has also been shown to be a genetic component, with estimates of the narrow sense heritability ranging between 39–66% [2– 5]. Genome-wide association studies (GWAS) of lung function have identified associations between single nucleotide polymorphisms (SNPs) and lung function at over 150 independent loci to date [6– 14]. Associations have also been identified in GWAS of COPD [15– 19]; however, the identification of disease associated SNPs has been restricted by limited sample sizes. Many signals first identified in powerful studies of quantitative lung function traits, have been found to be associated with risk of COPD, highlighting the potential clinical usefulness of comprehensive identification of lung function associated SNPs [13]. Low frequency (minor allele frequency (MAF) 1–5%) and rare (MAF<1%) variants have been largely underexplored by GWAS to date. Exome arrays have been designed to facilitate the investigation of these low frequency and rare variants, predominately within coding regions, in large sample sizes. Alongside a core content of rare coding SNPs, the exome array additionally includes common variation, including tags for previously identified GWAS hits, ancestry informative SNPs, a grid of markers for estimating identity by descent and a random selection of synonymous SNPs [20]. An earlier version of this article can be found on bioRxiv ( https://doi.org/10.1101/164426)

Results

We carried out a meta-analysis of exome array data and three lung function measures: forced expiratory volume in one second (FEV 1), forced vital capacity (FVC) and the ratio of FEV 1 to FVC (FEV 1/FVC). These analyses included 68,470 individuals from the SpiroMeta and CHARGE consortia in a discovery analysis, with follow-up in an independent sample of up to 111,556 individuals. All studies are listed with their study-specific sample characteristics in Table 1, with full study descriptions, including details of spirometry and other measurements described in the Supplementary Note. The genotype calling procedures implemented by each study ( Supplementary Table 1) and quality control of genotype data are described in the Supplementary Methods. We have undertaken both single variant analyses, and gene-based associations, which test for the joint effect of several rare variants in a gene (see Methods for details).
Table 1.

Sample characteristics of 11 SpiroMeta and 12 CHARGE studies contributing to the discovery analyses and three studies contributing to the replication analyses.

Discovery studies
SpiroMeta studiesTotal samplen (%) MaleEver smokers, n (%)Age, mean (SD)FEV 1, litres. mean (SD)FVC, litres. mean (SD)FEV 1/FVC, mean (SD)
1958 British Birth Cohort (B58C)52702961 (56·2%)2866 (53·3%)44·00 (0·00)3·35 (0·79)4·29 (1·03)0·788 (0·09)
Generation Scotland (GS:SFHS)81643413 (41·8%)3806 (46·6%)51·59 (13·33)2·78 (0·87)3·91 (1·01)0·710 (0·12)
Cooperative Health Research in the Region of Augsburg (KORA F4)1447701 (48·5%)900 (62·2%)54·82 (9·66)3·24 (0·85)4·20 (1·04)0·771 (0·07)
CROATIA-Korcula cohort (KORCULA)791296 (36·8%)418 (52·0%)55·56 (13·69)2·72 (0·83)3·29 (0·95)0·829 (0·10)
Lothian Birth Cohort 1936 (LBC1936)974501 (50·6%)554 (55·9%)69·55 (0·84)2·38 (0·67)3·04 (0·87)0·787 (0·10)
Study of Health in Pomerania (SHIP)1681831 (49·4%)955 (56·8%)52·25 (13·43)3·29 (0·88)3·88 (1·03)0·848 (0·07)
Northern Swedish Population Health Study (NSPHS)880407 (46·3%)122 (13·9%)49·13 (19·96)2·93 (0·90)3·53 (1·06)0·831 (0·09)
Prospective Investigation of the Vasculature in Uppsala Seniors (PIVUS)836413 (49·4%)426 (51·0%)70·20 (0·17)2·44 (0·68)3·20 (0·87)0·76 (0·10)
Swiss study on Air Pollution and Lung Disease in adults (SAPALDIA)27071379 (50·9%)1399 (51·7%)40·86 (10·92)3·65 (0·83)4·62 (1·04)0·794 (0·07)
The Cardiovascular Risk in Young Finns Study (YFS)434198 (47·3%)186 (44·4%)38·88 (5·07)3·73 (0·75)4·68 (0·99)0·800 (0·06)
Finnish Twin Cohort (FTC)2140 (0%)0 (0%)68·73 (3·31)2·18 (0·47)2·79 (0·58)0·786 (0·08)
Total 23,398
CHARGE studies (European Ancestry)Total samplen (%) MaleEver smokers, n (%)Age, mean (SD)FEV 1, litres. mean (SD)FVC, litres. mean (SD)FEV 1/FVC, mean (SD)
AGES-Reykjavik study (AGES)1566649 (41·4%)900 (57·5%)76·1 (5·62)2·13 (0·70)2·87 (0·86)0·744 (0·09)
Atherosclerosis Risk in Communities Study (ARIC)10,6805015 (47·0%)631 (59·1%)54·3 (5·70)2·94 (0·77)3·98 (0·98)0·738 (0·07)
Cardiovascular Health Study (CHS)39671737 (43·8%)2089 (52·7%)72·8 (5·55)2·11 (0·66)3·00 (0·86)0·702 (0·10)
NHLBI Family Heart Study (FAMHS)1651718 (43·5%)698 (42·3)53·5 (12·60)2·91 (0·853)3·89 (1·05)0·746 (0·08)
Framingham Heart Study (FHS)71133241 (45·5%)3780 (53·1)50·7 (14·12)3·10 (0·925)4·09 (1·12)0·755 (0·08)
Health Aging and Body Composition Study (HABC)1457786 (53·2%)831 (56·5%)73·7 (2·83)2·31 (0·66)3·11 (0·81)0·741 (0·08)
Health2006 Study27141217 (44·8%)1577 (58·1%)49·4 (13·04)3·13 (0·82)3·99 (0·99)0·784 (0·07)
Health2008 Study687297 (43·2%)384 (55·9%)46·7 (8·22)3·27 (0·79)4·13 (0·97)0·791 (0·06)
Inter99 Study (without pack-years)1115549 (49·2%)1115 (100%)47·2 (7·76)3·26 (0·71)4·12 (0·92)0·796 (0·07)
Inter99 Study (with pack-years)41792027 (48·5%)2307 (55·2%)45·8 (7·95)3·21 (0·76)4·10 (0·97)0·788 (0·08)
Multi-Ethnic Study of Atherosclerosis (MESA)1323654 (49·4%)751 (56·8%)66·0 (9·8)2·57 (0·76)3·51 (0·10)0·733 (0·08)
The Rotterdam Study (RS)546299 (54·8%)382 (70·0%)79·4 (5·00)2·27 (0·68)3·03 (0·86)0·750 (0·08)
Total 36,998
CHARGE studies (African Ancestry)Total Samplen (%) MaleEver smokers, n (%)Age, mean (SD)FEV 1, litres. mean (SD)FVC, litres. mean (SD)FEV 1/FVC, mean (SD)
Atherosclerosis Risk in Communities Study (ARIC)31801183 (37·2%)1680 (59·1%)53·6 (5·83)2·48 (0·65)3·25 (0·82)0·765 (0·08)
Cardiovascular Health Study (CHS)624232 (37·2%)340 (54·4%)73·2 (5·49)1·76 (0·58)2·48 (0·80)0·717 (0·11)
Health Aging and Body Composition Study (HABC)943433 (45·9%)543 (57·6%)73·4 (2·90)1·96 (0·57)2·61 (0·71)0·749 (0·09)
Jackson Heart Study (JHS)2143793 (36·8%)688 (31·9%)52·8 (12·6)2·43 (0·72)3·02 (0·86)0·807 (0·09)
Multi-Ethnic Study of Atherosclerosis (MESA)861404 (46·9%)467 (54·2%)65·6 (9·6)2·19 (0·66)2·92 (0·86)0·756 (0·09)
Total 7721
Replication studies
Study nameTotal Samplen (%) MaleEver smokers, n (%)Age, mean (SD)FEV 1, litres. mean (SD)FVC, litres. mean (SD)FEV 1/FVC, mean (SD)
UK Biobank 98,65745,166 (45·8%)56,404 (57·2%)56·7 (7·92)2·75 (0·80)3·67 (0·98)0·75 (0·07)
UK Household Longitudinal Study (UKHLS) 74433293 (44·2%)4509 (60·5%)53·10 (15·94)2·89 (0·90)3·83 (1·08)0·753 (0·09)
Netherlands Epidemiology of Obesity study (NEO) 54562672 (48·0%)3674 (66·0%)55·9 (5·9)3·26 (0·80)4·26 (1·02)0·77 (0·07)
Total 111,556

Meta-analyses of single variant associations

We first evaluated single variant associations between FEV 1, FVC and FEV 1/FVC and the 179,215 SNPs that passed study level quality control and were polymorphic in both consortia. These analyses identified 34 SNPs in regions not previously associated with lung function, showing association with at least one trait at overall P<10 -5, and showing association with consistent direction and P<0·05 in both consortia (full results in Supplementary Table 2, quantile-quantile and Manhattan plots shown in Supplementary Figure 1). We followed up these SNP associations in a replication analysis comprising 3 studies with 111,556 individuals. Combining the results from the discovery and replication stages in a meta-analysis identified six SNPs in total that were independent to known signals and met the pre-defined significance threshold (P<2·8×10 -7) overall in, or near to FGF10, LY86, SEC24C, RPAP1, CASC17 and UQCC1 ( Table 2, Supplementary Figure 2). A SNP near to the CASC17 signal (rs11654749, r 2=0·3 with rs1859962) has previously been associated with FEV 1 in a genome-wide analysis of gene-smoking interactions, although this association was not replicated at the time [21]; the present analysis provides the first evidence for independent replication of this signal. A seventh signal was also identified in LCT ( Table 2, Supplementary Figure 2); whilst this locus has not previously been implicated in lung function, this SNP is known to vary in frequency across European populations [22], and we cannot rule out that this association is not an artefact of population structure. Our discovery analysis furthermore identified associations (P<10 -5) in 25 regions previously associated with one or more of FEV 1, FVC and FEV 1/FVC ( Supplementary Table 3).
Table 2.

Novel loci associated with lung function traits.

Results are shown for variant in novel loci associated (P<2·7×10 -7) with lung function traits in a two stage meta-analysis consisting of up to 68,470 individuals from the SpiroMeta and CHARGE Consortia in the discovery analyses, with follow-up in up to 111,556 individuals from UK Biobank, UKHLS and NEO. For each SNP, the result for the trait-smoking-ancestry combination which resulted in the most statistically significant association is given. The results for these SNPs and all three traits are shown in Supplementary Table 12. Beta values from SpiroMeta (β Sp) reflect effect-size estimates on an inverse-normal transformed scale after adjustments for age, age 2, sex, height and ancestry principal components, and stratified by ever smoking status (Analysis of All individuals only). Beta values from CHARGE (β CH) reflect effect-size estimates on an untransformed scale (litres for FEV 1 and FVC; ratio for FEV 1/FVC). Samples sizes (N), Z-statistics (Z) and two-sided P-values (P) are given for the combined discovery analysis and the replication analysis. Two-sided P-values are also given for the full two-stage combined analyses (discovery + replication).

Consortium resultsCombined discovery meta-analysisReplicationTwo-stage combined
SNPChr:Pos(Nearest) gene(s)TraitSmokingAncestryEffect/other alleleEffect allele frequency (Discovery)β CH β Sp N disc Z disc P disc N rep Z rep P rep P meta
rs23226592:136555659LCT (nonsynonymous)FVCAll IndividualsEA OnlyT/C23·5%27·340·03255,5915·5972·18×10 -8 12,8992·2860·02231·70 ×10 -9
rs14480445:44296986 FGF10(dist=8111), NNT(dist=591,318)FVCEver SmokersEA+AAA/G35·6%18·630·05730,9664·8131·49 ×10 -6 64,4004·8051·55 ×10 -6 2·22 ×10 -11
rs12944216:6743149 LY86(dist=87,933), RREB1(dist=364,681)FEV 1/ FVCAll IndividualsEA+AAT/G36·8%-0·222-0·03868,099-5·4794·27 ×10 -8 111,556-8·1713·06 ×10 -16 9·74 ×10 -23
rs384996910: 75525999 SEC24C (intronic)FEV1All IndividualsEA+AAT/C29·4%13·100·03668,1164·7671·87 ×10 -6 111,5565·0424·60 ×10 -7 4·99 ×10 -12
rs120034515: 41819716 RPAP1 (nonsynonymous)FEV1/ FVCAll IndividualsEA onlyC/T48·8%-0·217-0·02560,381-4·5864·51 ×10 -6 111,556-5·7251·03 ×10 -8 2·33 ×10 -13
rs185996217: 69108753 CASC17 (intronic)FEV 1 All IndividualsEA onlyG/T48·2%15·390·02660,3954·8761·08 ×10 -6 111,5544·6123·99 ×10 -6 4·10 ×10 -11
rs608881320: 33975181 UQCC1 (intronic)FVCAll IndividualsEA+AAC/A36·7%-16·16-0·02368,115-4·6343·58×10 -6 111,556-7·6881·50 ×10 -14 4·90 ×10 -19

Novel loci associated with lung function traits.

Results are shown for variant in novel loci associated (P<2·7×10 -7) with lung function traits in a two stage meta-analysis consisting of up to 68,470 individuals from the SpiroMeta and CHARGE Consortia in the discovery analyses, with follow-up in up to 111,556 individuals from UK Biobank, UKHLS and NEO. For each SNP, the result for the trait-smoking-ancestry combination which resulted in the most statistically significant association is given. The results for these SNPs and all three traits are shown in Supplementary Table 12. Beta values from SpiroMeta (β Sp) reflect effect-size estimates on an inverse-normal transformed scale after adjustments for age, age 2, sex, height and ancestry principal components, and stratified by ever smoking status (Analysis of All individuals only). Beta values from CHARGE (β CH) reflect effect-size estimates on an untransformed scale (litres for FEV 1 and FVC; ratio for FEV 1/FVC). Samples sizes (N), Z-statistics (Z) and two-sided P-values (P) are given for the combined discovery analysis and the replication analysis. Two-sided P-values are also given for the full two-stage combined analyses (discovery + replication). Generally, the observed effect of the SNPs at the novel signals were similar in ever and never smokers; the exception was rs1448044 near FGF10, which showed a significant association with FVC only in ever smokers in our discovery analysis (ever smokers P=1·49×10 -6; never smokers P=0·695, Supplementary Table 4 and Supplementary Figure 3). In the replication analysis, however, this association was observed in both ever and never smokers (ever smokers P=3·14×10 -5; never smokers P=1·40×10 -4, Supplementary Table 5). For rs1200345 ( RPAP1) and rs1859962 ( CASC17), associations were most statistically significant in the analyses restricted to individuals of European Ancestry ( Supplementary Table 4 and Supplementary Figure 3), as was the association with rs2322659 ( LCT), giving further support that this association may be due to population stratification.

Meta-analyses of gene-based associations

We undertook Weighted Sum Tests (WST) [23] and Sequence Kernel Association tests (SKAT) [24] to assess the joint effects of multiple low frequency variants within genes on lung function traits. In our discovery analyses of all 68,470 individuals, we tested up to 14,380 genes that had at least two variants with MAF<5% and met the inclusion criteria (exonic or loss of function [LOF], see Methods for definitions) in both consortia. The SKAT analyses identified 16 genes associated (P<0·05 in both consortia and overall P<10 -4) with FEV 1, FVC or FEV 1/FVC ( Supplementary Table 6), whilst the WST analyses identified 12 genes ( Supplementary Table 7). There was one gene ( LY6G6D) that was identified in both analyses. These genes were followed up in UK Biobank, with two genes, GPR126 and LTBP4, showing evidence of replication in the exonic SKAT analysis (P<3·5×10 -6); however conditional analyses in UK Biobank showed that both these associations were driven by single SNPs, that were identified in the single variant association analyses and have been previously reported in GWAS of these traits ( Supplementary Table 6 and Supplementary Table 7).

Functional characterization of novel loci

In order to gain further insight into the six loci identified in our analyses of single variant associations (excluding LCT), we employed functional annotation and assessed whether identified SNPs in these regions were associated with gene expression levels. One of the identified novel SNPs was nonsynonymous, three intronic and two were intergenic. We found evidence that three of the SNPs may be involved in cis-acting regulation of the expression of several genes in multiple tissues ( Supplementary Table 8). SNP rs1200345 in RPAP1 is a nonysynomous variant, predicted to be deleterious by both SIFT (deleterious) and Polyphen (possibly damaging) ( Supplementary Table 9); RPAP1 is ubiquitously expressed, with high levels of protein detected in the lung ( Supplementary Table 10). SNP rs1200345 or proxies (r 2>0·8) were also found to be amongst the most strongly associated SNPs with expression levels of RPAP1 in several tissues, including lung, and with a further six genes in lung tissue ( Supplementary Table 8), including TYRO3, one of the TAM family of receptor tyrosine kinases. TYRO3 regulates several processes including cell survival, migration and differentiation and is highly expressed in lung macrophages ( Supplementary Table 10). Evidence of association with gene expression was found at two more of the novel signals (sentinel SNPs rs3849969 and rs6088813), implicating a further 16 genes. Of note, in blood expression quantitative trait loci (eQTL) databases, a proxy of a SNP in complete linkage disequilibrium (r 2=1) with rs3849969 (rs3812637) was an eQTL for plasminogen activator, urokinase ( PLAU).

Discussion

We undertook an analysis of 68,470 individuals from 23 studies with data from the exome array and three lung function traits, following up the most significant single SNP and gene-based associations in an independent sample of up to 111,556 individuals. There were six SNPs which reached P<10 -5 in the discovery stage meta-analysis of single variant associations, and subsequently met the Bonferroni corrected significance threshold for independent replication (P<1⋅47×10 -3, corrected for 34 SNPs being tested). In the combined analyses of our discovery and replication analyses, these six SNPs met the exome chip-wide significance threshold (P<2⋅8×10 -7). One of the SNPs is in a region that has previously been implicated in lung function (near KCJN2/SOX9) [21], whilst the remaining five SNPs, although all common, have not previously been identified in other GWAS of lung function. In a recent 1000 Genomes imputed analysis of lung function (which includes some of the studies contributing to the present discovery analysis), all of these SNPs showed at least suggestive association (2·97×10 -3>P>1·28×10 -5) with one or more lung function trait, but none reached the required threshold (P<5×10 -6) to be taken forward for replication in that analysis [12]. We further identified a seventh association with rs2322659 in LCT (MAF=23⋅5%; combined discovery + replication P=1⋅70×10 -9). Given SNPs in this region are known to vary in frequency across European populations, we cannot dismiss the possibility that this association may be confounded by population stratification; hence we do not report this signal as a novel lung function locus. For SNPs at 7 loci that have been shown to have differences in allele frequency between individuals from different regions of the UK [25], and subsequently European populations (including the LCT locus), we undertook a look-up of associations with lung function in our discovery analyses. and subsequently across European populations [26]. Aside from the association between the LCT locus and FVC, no significant associations were observed between SNPs at these loci and any lung function trait, in either the analyses restricted to European Ancestry (EA) individuals, or in the analysis of EA and African Ancestry (AA) individuals combined ( Supplementary Table 11); this suggests population structure was generally accounted for adequately in our analyses. One of the novel signals was with a nonsynonymous SNP, rs1200345 in RPAP1, (MAF=48⋅8%; P=2⋅33×10 -13), which is predicted to be deleterious. This SNP and proxies with r 2>0·8 were also associated with expression in lung tissue of seven genes, including RPAP1 and the TAM receptor TYRO3. TAM receptors play a role in the inhibition of Toll-like receptors (TLRs)-mediated innate immune response by initiating the transcription of cytokine signalling genes (SOCS-1 and 3), which limit cytokine overproduction and inflammation [27, 28]. It has been shown that influenza viruses H5N1 and H7N9 can cause downregulation of Tyro3, resulting in an increased inflammatory cytokine response [28]. Three further signals were with intronic SNPs in SEC24C (MAF=29⋅4%; P=4⋅99×10 -12), CASC17 (MAF=48⋅2%; P=4⋅10×10 -11), and UQCC1 (MAF=36⋅7%; P=4⋅90×10 -19). Two of these intronic SNPs have previously been implicated in GWAS of other traits: rs1859962 in CASC17 with prostate cancer [29] and rs6088813 in UQCC1 with height [30]. The CASC17 locus, near KCNJ2/SOX9 has also previously been implicated in lung function, showing significant association with FEV 1 in a genome-wide analysis of gene-smoking interactions; however, this association was not formally replicated [21]. Whilst the individuals utilised in the discovery stage of this analysis overlap with those included in this previous interaction analysis, the replication stage of the present study provides the first evidence of replication for this signal in independent cohorts. In the present analysis, there was no evidence that the results differed by smoking status. SNPs rs6088813 in UQCC1 and rs3849969 in SEC24C were identified as eQTLs for multiple genes. Whilst our eQTL analysis did not include formal tests of colocalisation, a SNP in complete linkage disequilibrium with rs3849969 (rs3812637, r 2=1) was associated with expression of PLAU in blood. The plasminogen activator, urokinase (PLAU) plays a role in fibrinolysis and immunity, and with its receptor (PLAUR) is involved in degradation of the extra cellular matrix, cell migration, cell adhesion and cell proliferation [31]. A study of preterm infants with respiratory distress syndrome, a condition characterised by intra-alveolar fibrin deposition, found PLAU and its inhibitor SERPINE1 to be expressed in the alveolar epithelium, and an increased ratio of SERPINE1 to PLAU was associated with severity of disease [32]. Studies in mice have also shown that increased expression of Plau may be protective against lung injury, by reducing fibrosis [33]. PLAU has also been found to be upregulated in lung epithelial cells subjected to cyclic strain [34] and in patients with COPD and lung cancer, PLAU was found to be expressed in alveolar macrophages and epithelial cells [31]. The final two signals were with common intergenic SNPs close to LY86 (MAF=36⋅8%; P=9⋅74×10 -23) and FGF10 (MAF=35⋅6%; P=2⋅22×10 -11). LY86 (lymphocyte antigen 86) interacts with the Toll-like receptor signalling pathway, to form a heterodimer, when bound with RP105 [35]. The sentinel SNP in the present analysis (rs1294421) has previously shown association with waist-hip ratio [36], whilst an intronic SNP within LY86 (rs7440529, r 2=0·005 with rs1294421) has been implicated in asthma in two studies of individuals of Han Chinese ancestry [37, 38]. FGF10 is a member of the fibroblast growth factor family of proteins, and is involved in a range of biological processes, including embryonic development and morphogenesis, cell growth and repair, tumor growth and invasion. Specifically, the FGF10 signalling pathway is thought to play an criticial role in the development of the lung and in lung epithelial renewal [39]. A deficiency in Fgf10 has been demonstrated to lead to a fatal disruption of branching morphogenesis during lung development in mice [40]. Our discovery analyses included individuals of both EA and AA. Two of the identified six novel signals showed inconsistent effects in the AA and EA individuals. For these SNPs, the associations in AA individuals were not statistically significant, and we report associations from the analysis restricted to EA individuals only. For the remaining four SNPs similar effects were observed in both the EA and AA individuals ( Supplementary Figure 3). We also examined the effects of the novel SNPs in ever smokers and never smokers separately and found these to be broadly similar, with the exception of rs1448044 in FGF10, which in the discovery analysis showed significant association with FVC in ever smokers, whilst showing no association in never smokers (P=0·695). However, in our replication stage analyses, similar effects were seen in both ever and never smokers for this SNP, and the combined analysis of discovery and replication stages for this SNP, including both ever and never smokers, met the exome chip-wide significance level overall (P=4·22×10 -9). We also considered whether this signal could be driven by smoking behaviour in our discovery stage as our primary analyses in SpiroMeta did not adjust for smoking quantity. We undertook a look-up of this SNP in the publicly available results of a GWAS of several smoking behaviour traits [41]; there was only weak evidence that this SNP was associated with ever versus never smoking (P=0·039), and no evidence for association with amount smoked (cigarettes per day, P=0·10). Through the use of the exome array, we aimed to identify associations with low frequency and rare functional variants, thereby potentially uncovering some of the missing heritability of lung function. However, whilst our discovery analyses identified single SNP associations with 23 low frequency variants ( Supplementary Table 2), we did not replicate any of these findings. Eleven of these 23 SNPs we were unable to follow-up in our replication studies, due to them either being not genotyped, or monomorphic. Overall, limited statistical power is likely to explain our lack of convincing single variant associations with rare variants, in particular if those variants exhibit only modest effects [42]. We additionally investigated the joint effects of low frequency and rare variants within genes, on lung function trait, by employing SKAT and WST gene-based tests. These analyses identified associations with a number of genes that could not attributed to the effect of a single SNP. Replication of these gene-based signals proved difficult however, as again a number of SNPs included in the discovery stage of these analyses were monomorphic, or had not been not genotyped in the replication studies. This lead to a disparity in the gene unit being tested in our discovery and replication samples; hence interpretation of these results was not clear-cut. In the end, we were able to replicate only findings with common SNPs. This finding is in line with several other studies of complex traits and exome array data that have been unable to report robust associations with low frequency variants [43– 45] and it is clear that future studies will require increasingly larger sample sizes in order to fully evaluate the effect of variants across the allele frequency spectrum. The identification of common SNPs remains important, however, as such findings have the potential to highlight drug targets [46], and these variants collectively could have utility in risk prediction. In our replication analyses using UK Biobank, we applied adjustment for covariates including ancestry principal components, before undertaking inverse-normal transformations of the lung function phenotypes. Association analyses were then performed using these transformed phenotypes. It has recently been shown that such transformation has the potential to introduce correlations between principal components and phenotypes [47]; we undertook sensitivity analyses for the six reported SNPs by repeating the association analyses with phenotypes that had been transformed without prior adjustment, with covariate adjustment made as part of the SNP-trait association test. We found there to be some difference in P-values for some SNP-trait combinations; however, the six novel SNP associations we report all met the replication P-value threshold (P<1·47×10 -3) in the sensitivity analyses ( Supplementary Figure 4). This issue may also be relevant to the gene-based tests; however no replicated novel gene-based associations were identified in this study. Future studies should avoid undertaking adjustment for principal components of ancestry prior to trait transformation, in order to avoid this potential bias. This study has identified six common SNPs, independent to signals previously implicated in lung function. Additional interrogation of these loci could lead to greater understanding of lung function and lung disease, and could provide novel targets for therapeutic interventions.

Methods

Study design, cohorts and genotyping

The SpiroMeta analysis included 23,751 individuals of EA from 11 studies, and the CHARGE analysis comprised 36,998 EA individuals and a further 7,721 individuals of AA from 12 studies. Follow-up analyses were conducted in an independent sample of up to 111,556 individuals from UK Biobank (2015 interim release), the UK Household Longitudinal Study (UKHLS) and the Netherlands Epidemiology of Obesity (NEO) Study ( Figure 1). All studies (excluding UK Biobank) were genotyped using either the Illumina Human Exome BeadChip v1 or the Illumina Infinium HumanCoreExome-12 v1·0 BeadChip. UK Biobank samples were genotyped using the Affymetrix Axiom UK BiLEVE or UK Biobank arrays.
Figure 1.

Study design.

Statistical analyses

Consortium level analyses: Within the SpiroMeta Consortium, each study contributing to the discovery analyses calculated single-variant score statistics, along with covariance matrices describing correlations between variants, using RAREMETALWORKER [48] or rvtests [49]. For each trait, these summary statistics were generated separately in ever and never smokers. Traits were adjusted for sex, age, age 2 and height, and inverse normally transformed prior to association testing. For studies with unrelated individuals, SNP-trait associations were tested using linear models, with adjustments made for the first 10 ancestry principal components, whilst studies with related individuals utilised linear mixed models to account for familial relationships and underlying population structure. Within the CHARGE Consortium, each study generated equivalent summary statistics using the R package SeqMeta [50]. For each trait, summary statistics were generated in ever and never smokers separately, and in all individuals combined. The untransformed traits were used for all analyses, adjusted for smoking status and pack-years, age, age 2, sex, height, height 2, centre/cohort. Models for FVC were additionally adjusted for weight. Linear regression models, with adjustment for principal components of ancestry were used for studies with unrelated individuals, and linear mixed models were used for family-based studies. Within each consortium we used the score statistics and variance-covariance matrices generated by each study to construct both single variant and gene-based tests using either RAREMETAL [48] (SpiroMeta) or SeqMeta [50] (CHARGE). For single variant associations, score statistics were combined in fixed effects meta-analyses. Two gene-based tests were constructed: first, the Weighted Sum Test (WST) using Madsen Browning weightings [23], and secondly, the Sequence Kernel Association Test (SKAT) [24]. We performed the SKAT and WST tests using two subsets of SNPs: 1) including all SNPs with an overall consortium-wide MAF<5% that were annotated as splicing, stopgain, stoploss, or frameshift (loss of function [LOF] analysis), and 2) including all SNPs meeting the LOF analysis criteria in addition to all other nonsynonymous variants with consortium wide MAF<5% (exonic analysis). Variants were annotated to genes using dbNSFP v2·6 [51] on the basis of the GRCh37/hg19 database. For both single variant and gene-based associations, consortium-level results were generated for ever smokers and never smokers separately, and in all individuals combined. Within the CHARGE Consortium, results were combined separately for the EA and AA studies and also in a trans-ethnic analysis of both ancestries. Combined meta-analysis: The single variant association results from the SpiroMeta and CHARGE consortia were combined as follows: The genomic inflation statistic (λ) was calculated for SNPs with consortium-wide MAF>1%; where λ had a value greater than one, genomic control adjustment was applied to the consortium level P-values. The consortium-level results were then combined using sample size weighted z-score meta-analysis. The λ was again calculated for the meta-analysis results and genomic control applied, as appropriate. λ values at the consortium and meta-analysis level are shown in Supplementary Table 13. Since we were interested in identifying low frequency and rare variants, we applied no MAF or minor allele count (MAC) filter. We identified SNPs of interest as those with an overall P<10 -5 and a consistent direction of effect and P<0·05 observed in both consortia. Rather than using a strict Bonferroni correction for defining the significance threshold, we adopted the more lenient P<10 -5 threshold in order to increase the power to detect variants with modest effect in our discovery analyses, whilst the requirement for consistency in results from the two consortia aimed to limit false positives. All SNPs meeting these thresholds were followed up in independent replication cohorts. Where we identified a SNP within 1Mb of a previously identified lung function SNP, we deemed the SNP to represent an independent signal if it had r 2<0·2 with the known SNP, and if it retained a P <10 -5, when conditional analyses were carried out with the known SNP, or a genotyped proxy, using data from the SpiroMeta Consortium, or UK Biobank. Our primary meta-analysis included all individuals; we additionally carried out analyses in smoking subgroups (ever and never smokers), and in the subgroup of individuals of European ancestry only. For genes which contained at least 2 polymorphic SNPs in both consortia, we combined the results of the consortium level gene based tests using either z-score meta-analysis (for the WST analysis) or Fisher’s Method for combining P-values (in the case of SKAT). We identified genes of interest as those with P<0·05 observed in both consortia and an overall P<10 -4, thresholds again chosen to limit both false positive and false negative findings. As in the analyses of single variant associations, our primary meta-analyses included all individuals, with secondary analyses undertaken in smoking and ancestry specific subgroups. Replication analyses: All SNP and gene-based associations were followed up for the trait with which they showed the most statistically significant association only. For associations identified through the smoking subgroup analyses, we followed up associations in the appropriate smoking strata; however, no ancestry stratified follow-up was undertaken as replication studies included only a sufficient number of individuals of European Ancestry. Single variant associations in UK Biobank were tested in ever smokers and never smokers separately, and stratified by genotyping array (UK BiLEVE array or UK Biobank array) using the score test as implemented in SNPTEST v2·5b4 [52]. Traits were adjusted for age, age 2, height, sex, ten principal components and pack-years (ever smokers only), and the adjusted traits were inverse normally transformed. Correlations between principal components and transformed phenotypes may be introduced where adjustment is made prior to transformation. In this analysis, we found any introduced correlations to have no impact on the conclusion of our replication analyses; however future studies should apply transformation of phenotypes prior to covariate adjustment, to avoid this issue. For UKHLS, analyses were undertaken analogously to the SpiroMeta discovery studies using RAREMETALWORKER, while for NEO, analyses were undertaken in the same way as was done in the CHARGE discovery studies using SeqMeta. The single variant results from all replication studies were combined using sample size weighted Z-score meta-analysis. Subsequently, we combined the results from the discovery and replication stage analyses and we report SNPs with overall exome-wide significance of P<2·8×10 -7 (Bonferroni corrected for the original 179,215 SNPs tested). We followed up genes of interest (P<10 -4) using data from UK Biobank only. Summary statistics for UK Biobank were generated using RAREMETALWORKER, with gene-based tests then constructed using RAREMETAL. Finally, we combined the results from the discovery analysis with the replication results in an overall combined meta-analysis using either z-score meta-analysis (WST) or Fisher’s Method (SKAT). We declared genes with overall P<3·5×10 -6 (Bonferroni corrected for 14,380 genes tested) in our combined meta-analysis to be statistically significant. For these statistically significant genes, we carried out additional analyses using the UK Biobank data in which we conditioned on the most significantly associated individual SNP within that gene, to determine whether this was a true gene-based signal, or whether the association could be ascribed to the single SNP (if the conditional P<0·01, then association was deemed to not be driven by the single SNP).

Characterization of findings

In order to gain further insight into the loci identified in our analyses of single variant associations, we assessed whether these regions were associated with gene expression levels in various tissues (FDR of 5%, or q-value<0·05), by querying a publically available blood eQTL database [53] and the GTEx project [54] for the sentinel SNPs, or any proxy (r 2>0·8). We further assessed SNPs of interest (and proxies) within a lung eQTL resource based on non-tumour lung tissues of 1,111 individuals [55– 57]. Descriptions of these resources and further details of the look-ups are provided in the Supplementary Methods. Moreover, all sentinel SNPs and proxies with r 2>0.8 were annotated using ENSEMBL’s Variant Effect Predictor (VEP) [58]; potentially deleterious coding variants were identified as those annotated as ‘deleterious’ by SIFT [59] or ‘probably damaging’ or ‘possibly damaging’ by PolyPhen-2 [60]. For all genes implicated through the expression data or functional annotation, we searched for evidence of protein expression in the respiratory system by querying the Human Protein Atlas [61].

Data availability

Summary level results for all analyses are available on OSF: https://doi.org/10.17605/OSF.IO/NSDPJ [62] Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0). This research has been conducted using the UK Biobank Resource. The genetic and phenotypic UK Biobank data are available upon application to the UK Biobank ( https://www.ukbiobank.ac.uk/) to all registered health researchers. These data are from Understanding Society: The UK Household Longitudinal Study (UKHLS), which is led by the Institute for Social and Economic Research at the University of Essex and funded by the Economic and Social Research Council. The data were collected by NatCen and the genome wide scan data were analysed by the Wellcome Trust Sanger Institute. Information on how to access the data can be found on the Understanding Society website https://www.understandingsociety.ac.uk/. Thank you to the authors for responding to and addressing our comments. I have one further comment on the replication analysis using UK Biobank data. The sensitivity analyses which the authors carried out showed that adjusting for covariates prior to inverse-normalization does affect the results. While this does not affect the main conclusions drawn, it may affect the results of the gene-based tests, and in addition, other investigators using the methods as a guide may draw inappropriate conclusions if adjusting for principal components prior to inverse-normalizing their phenotype. Ideally, the UK Biobank analysis should be redone with the appropriate phenotype transformation, and the methods and results sections updated accordingly. However, if the authors consider that such a revision would be too extensive, given that the conclusions do not change, it would at least be helpful to note the issue as a limitation in the discussion and make it clear in the methods that adjusting for covariates (such as principal components) should be done after inverse-normalising the phenotype – so it can be used appropriately by others. We have read this submission. We believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. Thank you for you approval of our article, and your additional comment. As suggested, we have added a further limitation to the discussion of the paper outlining the issue regarding the trait transformation. This has also been noted in the methods. We have also included the results of the sensitivity analyses in the supplement (Supplementary Figure 4). The authors have performed a large genome-wide association study in subjects of European (36,998 in the discovery set and 111,556 in the replication set) and African (7,721 in the discovery set) ancestries for various lung function measures: FEV1, FVC and FEV1/FVC ratio. Both common and rare variant analyses are performed, and the effect of smoking on the associations is also assessed. The discovery set consisted of CHARGE and SpiroMeta consortia meta analysis using the Human Exome array, while the replication set consisted of genotypes on the HumanCoreExome array and the UK Biobank’s custom arrays. A total of 7 novel regions were identified by the authors that met the overall (discovery+replication) Bonferroni-adjusted P-value of 2.8x10^-7 after adjustment for various covariates such as age, sex, height, and ancestry using principal components. All identified novel SNPs are of common frequency, and two of the SNPs are in high LD with missense variants predicted to be damaging. Some areas for improvement: Two rare variant tests were chosen and applied to the data as opposed to choosing a combined test (e.g. Derkach et al 2013 Genetic Epidemiology). A combined test would be more powerful. The authors should explain why there was an inverse normalization of the traits in SpiroMeta but not in CHARGE, and provide some sensitivity analysis. There appear to be very large differences in Effect Allele Frequencies between the discovery and replication samples. Do the authors have an explanation for this? This might point to local ancestry differences that could be relevant, and should be further investigated. The eQTL analysis could formally investigate colocalization as opposed to cross-referencing individual associated SNPs with public repositories, and there are  several different methods that achieve this goal: e.g. COLOC, eCAVIAR, Sherlock, RTC or EnLoc. In the replication analyses section, it is stated that “Traits were adjusted for age, age^2, height, sex, ten principal components and pack-years (ever smokers only), and  inverse normally transformed.” For clarity, the authors should be specific about whether the trait (FEV1, FVC, or FEV1/FVC) was inverse normalized first and age, age^2, sex, 10 PCs were then added as covariates in the genetic association model In the methods section for the rare variant testing Skat appears to be incorrectly referred to as a Fisher’s combined method. The authors should provide the justification for their various significance criteria used in each of the analyses. The authors should list the MAF alongside the p-values reported in the text for clarity for the single variant analysis results We have read this submission. We believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. The authors performed GWAS of FEV1, FVC and FEV1/FVC ratio at 179,215 SNPs from exome arrays. They identified 6 common frequency SNPs associated with at least one of these traits. They also identified 1 SNP in a region with known frequency differences across European populations suggesting that population structure may not have been fully accounted for in their analyses. Strengths of the study include the large sample size and comprehensive approach to assessing associations with low frequency and rare variants. We have the following concerns. Main concerns: The phenotypes seem to have been adjusted for covariates and ancestry specific principal components prior to being inverse normally transformed. This transformation has the potential to introduce correlations between principal components and the inverse normally transformed phenotype ( https://www.biorxiv.org/content/early/2017/05/15/137232). Since one of the SNPs identified as being associated with the phenotype is known to vary in frequency across European populations, and the authors note that they cannot rule out the effects of population structure on the identified associations this raises concerns that some of the other associations could also be artefacts driven by failure to properly account for population stratification. It should explicitly be mentioned in the methods whether adjustments were made for ancestry specific principal components prior to inverse normal transforming the phenotype in the SpiroMeta Consortium component of the meta analysis or was included as a covariate in the phenotype - SNP association analysis. Indeed, in the replication analysis in UK Biobank principal components were adjusted for prior to inverse normally transforming the data. Was genotyping chip adjusted for in this cohort (which should be done in the phenotype - SNP analysis)? The UKBiLEVE chip was enriched for smokers, which could affect association analyses unless chip is included as a covariate. In addition the interim data release (which seems to be what is used here - please clarify in the methods whether the data comes from the interim (2015) or full (2017) data release) featured some discrepancies between the two chips, which can introduce spurious associations especially if adjustment is not made for genotyping chip. Why was raw trait used in CHARGE but inverse normalised in SpiroMeta Consortium? This seems an odd choice Minor concerns: In the discussion, the authors mention that the 6 identified SNPs not attributed to population structure passed the Bonferroni significance threshold. They then mention that the SNPs ALSO pass Bonferroni corrected significance thresholds in the replication analysis. This could be misleading, since not all SNPs passed the Bonferroni threshold in the discovery only dataset. The authors mention that correction was made for genomic inflation statistic (λ), but we could not find the statistics relating to this. The figures should be given in the manuscript. We have read this submission. We believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however we have significant reservations, as outlined above.
  59 in total

1.  Familial aggregation and heritability of adult lung function: results from the Busselton Health Study.

Authors:  L J Palmer; M W Knuiman; M L Divitini; P R Burton; A L James; H C Bartholomew; G Ryan; A W Musk
Journal:  Eur Respir J       Date:  2001-04       Impact factor: 16.671

2.  Genome-wide meta-analyses identify multiple loci associated with smoking behavior.

Authors: 
Journal:  Nat Genet       Date:  2010-04-25       Impact factor: 38.330

3.  dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations.

Authors:  Xiaoming Liu; Xueqiu Jian; Eric Boerwinkle
Journal:  Hum Mutat       Date:  2013-07-10       Impact factor: 4.878

4.  Fgf10 is essential for limb and lung formation.

Authors:  K Sekine; H Ohuchi; M Fujiwara; M Yamasaki; T Yoshizawa; T Sato; N Yagishita; D Matsui; Y Koga; N Itoh; S Kato
Journal:  Nat Genet       Date:  1999-01       Impact factor: 38.330

5.  Genome-wide association analysis identifies six new loci associated with forced vital capacity.

Authors:  Daan W Loth; María Soler Artigas; Sina A Gharib; Louise V Wain; Nora Franceschini; Beate Koch; Tess D Pottinger; Albert Vernon Smith; Qing Duan; Chris Oldmeadow; Mi Kyeong Lee; David P Strachan; Alan L James; Jennifer E Huffman; Veronique Vitart; Adaikalavan Ramasamy; Nicholas J Wareham; Jaakko Kaprio; Xin-Qun Wang; Holly Trochet; Mika Kähönen; Claudia Flexeder; Eva Albrecht; Lorna M Lopez; Kim de Jong; Bharat Thyagarajan; Alexessander Couto Alves; Stefan Enroth; Ernst Omenaas; Peter K Joshi; Tove Fall; Ana Viñuela; Lenore J Launer; Laura R Loehr; Myriam Fornage; Guo Li; Jemma B Wilk; Wenbo Tang; Ani Manichaikul; Lies Lahousse; Tamara B Harris; Kari E North; Alicja R Rudnicka; Jennie Hui; Xiangjun Gu; Thomas Lumley; Alan F Wright; Nicholas D Hastie; Susan Campbell; Rajesh Kumar; Isabelle Pin; Robert A Scott; Kirsi H Pietiläinen; Ida Surakka; Yongmei Liu; Elizabeth G Holliday; Holger Schulz; Joachim Heinrich; Gail Davies; Judith M Vonk; Mary Wojczynski; Anneli Pouta; Asa Johansson; Sarah H Wild; Erik Ingelsson; Fernando Rivadeneira; Henry Völzke; Pirro G Hysi; Gudny Eiriksdottir; Alanna C Morrison; Jerome I Rotter; Wei Gao; Dirkje S Postma; Wendy B White; Stephen S Rich; Albert Hofman; Thor Aspelund; David Couper; Lewis J Smith; Bruce M Psaty; Kurt Lohman; Esteban G Burchard; André G Uitterlinden; Melissa Garcia; Bonnie R Joubert; Wendy L McArdle; A Bill Musk; Nadia Hansel; Susan R Heckbert; Lina Zgaga; Joyce B J van Meurs; Pau Navarro; Igor Rudan; Yeon-Mok Oh; Susan Redline; Deborah L Jarvis; Jing Hua Zhao; Taina Rantanen; George T O'Connor; Samuli Ripatti; Rodney J Scott; Stefan Karrasch; Harald Grallert; Nathan C Gaddis; John M Starr; Cisca Wijmenga; Ryan L Minster; David J Lederer; Juha Pekkanen; Ulf Gyllensten; Harry Campbell; Andrew P Morris; Sven Gläser; Christopher J Hammond; Kristin M Burkart; John Beilby; Stephen B Kritchevsky; Vilmundur Gudnason; Dana B Hancock; O Dale Williams; Ozren Polasek; Tatijana Zemunik; Ivana Kolcic; Marcy F Petrini; Matthias Wjst; Woo Jin Kim; David J Porteous; Generation Scotland; Blair H Smith; Anne Viljanen; Markku Heliövaara; John R Attia; Ian Sayers; Regina Hampel; Christian Gieger; Ian J Deary; H Marike Boezen; Anne Newman; Marjo-Riitta Jarvelin; James F Wilson; Lars Lind; Bruno H Stricker; Alexander Teumer; Timothy D Spector; Erik Melén; Marjolein J Peters; Leslie A Lange; R Graham Barr; Ken R Bracke; Fien M Verhamme; Joohon Sung; Pieter S Hiemstra; Patricia A Cassano; Akshay Sood; Caroline Hayward; Josée Dupuis; Ian P Hall; Guy G Brusselle; Martin D Tobin; Stephanie J London
Journal:  Nat Genet       Date:  2014-06-15       Impact factor: 38.330

6.  GSTCD and INTS12 regulation and expression in the human lung.

Authors:  Ma'en Obeidat; Suzanne Miller; Kelly Probert; Charlotte K Billington; Amanda P Henry; Emily Hodge; Carl P Nelson; Ceri E Stewart; Caroline Swan; Louise V Wain; María Soler Artigas; Erik Melén; Kevin Ushey; Ke Hao; Maxime Lamontagne; Yohan Bossé; Dirkje S Postma; Martin D Tobin; Ian Sayers; Ian P Hall
Journal:  PLoS One       Date:  2013-09-18       Impact factor: 3.240

7.  RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data.

Authors:  Xiaowei Zhan; Youna Hu; Bingshan Li; Goncalo R Abecasis; Dajiang J Liu
Journal:  Bioinformatics       Date:  2016-02-15       Impact factor: 6.937

8.  A groupwise association test for rare mutations using a weighted sum statistic.

Authors:  Bo Eskerod Madsen; Sharon R Browning
Journal:  PLoS Genet       Date:  2009-02-13       Impact factor: 5.917

9.  Meta-analysis of genome-wide scans for human adult stature identifies novel Loci and associations with measures of skeletal frame size.

Authors:  Nicole Soranzo; Fernando Rivadeneira; Usha Chinappen-Horsley; Ida Malkina; J Brent Richards; Naomi Hammond; Lisette Stolk; Alexandra Nica; Michael Inouye; Albert Hofman; Jonathan Stephens; Eleanor Wheeler; Pascal Arp; Rhian Gwilliam; P Mila Jhamai; Simon Potter; Amy Chaney; Mohammed J R Ghori; Radhi Ravindrarajah; Sergey Ermakov; Karol Estrada; Huibert A P Pols; Frances M Williams; Wendy L McArdle; Joyce B van Meurs; Ruth J F Loos; Emmanouil T Dermitzakis; Kourosh R Ahmadi; Deborah J Hart; Willem H Ouwehand; Nicholas J Wareham; Inês Barroso; Manjinder S Sandhu; David P Strachan; Gregory Livshits; Timothy D Spector; André G Uitterlinden; Panos Deloukas
Journal:  PLoS Genet       Date:  2009-04-03       Impact factor: 5.917

Review 10.  The role of airway macrophages in apoptotic cell clearance following acute and chronic lung inflammation.

Authors:  Aleksander M Grabiec; Tracy Hussell
Journal:  Semin Immunopathol       Date:  2016-03-08       Impact factor: 9.623

View more
  9 in total

1.  A systematic analysis of protein-altering exonic variants in chronic obstructive pulmonary disease.

Authors:  Matthew Moll; Victoria E Jackson; Bing Yu; Megan L Grove; Stephanie J London; Sina A Gharib; Traci M Bartz; Colleen M Sitlani; Josée Dupuis; George T O'Connor; Hanfei Xu; Patricia A Cassano; Bonnie Kaufmann Patchen; Woo Jin Kim; Jinkyeong Park; Kun Hee Kim; Buhm Han; R Graham Barr; Ani Manichaikul; Jennifer N Nguyen; Stephen S Rich; Lies Lahousse; Natalie Terzikhan; Guy Brusselle; Phuwanat Sakornsakolpat; Jiangyuan Liu; Christopher J Benway; Ian P Hall; Martin D Tobin; Louise V Wain; Edwin K Silverman; Michael H Cho; Brian D Hobbs
Journal:  Am J Physiol Lung Cell Mol Physiol       Date:  2021-04-28       Impact factor: 6.011

Review 2.  FGF10 and Human Lung Disease Across the Life Spectrum.

Authors:  Lawrence S Prince
Journal:  Front Genet       Date:  2018-10-31       Impact factor: 4.599

3.  Rare and low-frequency exonic variants and gene-by-smoking interactions in pulmonary function.

Authors:  Tianzhong Yang; Victoria E Jackson; Albert V Smith; Han Chen; Traci M Bartz; Colleen M Sitlani; Bruce M Psaty; Sina A Gharib; George T O'Connor; Josée Dupuis; Jiayi Xu; Kurt Lohman; Yongmei Liu; Stephen B Kritchevsky; Patricia A Cassano; Claudia Flexeder; Christian Gieger; Stefan Karrasch; Annette Peters; Holger Schulz; Sarah E Harris; John M Starr; Ian J Deary; Ani Manichaikul; Elizabeth C Oelsner; R G Barr; Kent D Taylor; Stephen S Rich; Tobias N Bonten; Dennis O Mook-Kanamori; Raymond Noordam; Ruifang Li-Gao; Marjo-Riitta Jarvelin; Matthias Wielscher; Natalie Terzikhan; Lies Lahousse; Guy Brusselle; Stefan Weiss; Ralf Ewert; Sven Gläser; Georg Homuth; Nick Shrine; Ian P Hall; Martin Tobin; Stephanie J London; Peng Wei; Alanna C Morrison
Journal:  Sci Rep       Date:  2021-09-29       Impact factor: 4.996

4.  Whole-Exome Sequencing of Bronchial Epithelial Cells Reveals a Genetic Print of Airway Remodelling in COPD.

Authors:  Adeline Germain; Jeanne-Marie Perotin; Gonzague Delepine; Myriam Polette; Gaëtan Deslée; Valérian Dormoy
Journal:  Biomedicines       Date:  2022-07-15

5.  Causal Effects of Body Mass Index on Airflow Obstruction and Forced Mid-Expiratory Flow: A Mendelian Randomization Study Taking Interactions and Age-Specific Instruments Into Consideration Toward a Life Course Perspective.

Authors:  Nicole Probst-Hensch; Ayoung Jeong; Daiana Stolz; Marco Pons; Paola M Soccal; Robert Bettschart; Deborah Jarvis; John W Holloway; Florian Kronenberg; Medea Imboden; Christian Schindler; Gianfranco F Lovison
Journal:  Front Public Health       Date:  2021-05-11

6.  Genetic landscape of chronic obstructive pulmonary disease identifies heterogeneous cell-type and phenotype associations.

Authors:  Phuwanat Sakornsakolpat; Dmitry Prokopenko; Brian D Hobbs; Michael H Cho; Maxime Lamontagne; Nicola F Reeve; Anna L Guyatt; Victoria E Jackson; Nick Shrine; Dandi Qiao; Traci M Bartz; Deog Kyeom Kim; Mi Kyeong Lee; Jeanne C Latourelle; Xingnan Li; Jarrett D Morrow; Ma'en Obeidat; Annah B Wyss; Per Bakke; R Graham Barr; Terri H Beaty; Steven A Belinsky; Guy G Brusselle; James D Crapo; Kim de Jong; Dawn L DeMeo; Tasha E Fingerlin; Sina A Gharib; Amund Gulsvik; Ian P Hall; John E Hokanson; Woo Jin Kim; David A Lomas; Stephanie J London; Deborah A Meyers; George T O'Connor; Stephen I Rennard; David A Schwartz; Pawel Sliwinski; David Sparrow; David P Strachan; Ruth Tal-Singer; Yohannes Tesfaigzi; Jørgen Vestbo; Judith M Vonk; Jae-Joon Yim; Xiaobo Zhou; Yohan Bossé; Ani Manichaikul; Lies Lahousse; Edwin K Silverman; H Marike Boezen; Louise V Wain; Martin D Tobin
Journal:  Nat Genet       Date:  2019-02-25       Impact factor: 38.330

7.  Lung Development Genes and Adult Lung Function.

Authors:  Laura Portas; Miguel Pereira; Seif O Shaheen; Annah B Wyss; Stephanie J London; Peter G J Burney; Matthew Hind; Charlotte H Dean; Cosetta Minelli
Journal:  Am J Respir Crit Care Med       Date:  2020-09-15       Impact factor: 21.405

8.  A genome wide association study for lung function in the Korean population using an exome array.

Authors:  Kyu-Sun Lee; Kun Hee Kim; Yeon-Mok Oh; Buhm Han; Woo Jin Kim
Journal:  Korean J Intern Med       Date:  2020-04-29       Impact factor: 2.884

9.  Whole genome sequence analysis of pulmonary function and COPD in 19,996 multi-ethnic participants.

Authors:  Xutong Zhao; Dandi Qiao; Chaojie Yang; Silva Kasela; Wonji Kim; Yanlin Ma; Nick Shrine; Chiara Batini; Tamar Sofer; Sarah A Gagliano Taliun; Phuwanat Sakornsakolpat; Pallavi P Balte; Dmitry Prokopenko; Bing Yu; Leslie A Lange; Josée Dupuis; Brian E Cade; Jiwon Lee; Sina A Gharib; Michelle Daya; Cecelia A Laurie; Ingo Ruczinski; L Adrienne Cupples; Laura R Loehr; Traci M Bartz; Alanna C Morrison; Bruce M Psaty; Ramachandran S Vasan; James G Wilson; Kent D Taylor; Peter Durda; W Craig Johnson; Elaine Cornell; Xiuqing Guo; Yongmei Liu; Russell P Tracy; Kristin G Ardlie; François Aguet; David J VanDenBerg; George J Papanicolaou; Jerome I Rotter; Kathleen C Barnes; Deepti Jain; Deborah A Nickerson; Donna M Muzny; Ginger A Metcalf; Harshavardhan Doddapaneni; Shannon Dugan-Perez; Namrata Gupta; Stacey Gabriel; Stephen S Rich; George T O'Connor; Susan Redline; Robert M Reed; Cathy C Laurie; Martha L Daviglus; Liana K Preudhomme; Kristin M Burkart; Robert C Kaplan; Louise V Wain; Martin D Tobin; Stephanie J London; Tuuli Lappalainen; Elizabeth C Oelsner; Goncalo R Abecasis; Edwin K Silverman; R Graham Barr; Michael H Cho; Ani Manichaikul
Journal:  Nat Commun       Date:  2020-10-14       Impact factor: 17.694

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.