Literature DB >> 17903307

Framingham Heart Study genome-wide association: results for pulmonary function measures.

Jemma B Wilk1, Robert E Walter, Jason M Laramie, Daniel J Gottlieb, George T O'Connor.   

Abstract

BACKGROUND: Pulmonary function measures obtained by spirometry are used to diagnose chronic obstructive pulmonary disease (COPD) and are highly heritable. We conducted genome-wide association (GWA) analyses (Affymetrix 100K SNP GeneChip) for measures of lung function in the Framingham Heart Study.
METHODS: Ten spirometry phenotypes including percent of predicted measures, mean spirometry measures over two examinations, and rates of change based on forced expiratory volume in one second (FEV1), forced vital capacity (FVC), forced expiratory flow from the 25th to 75th percentile (FEF25-75), the FEV1/FVC ratio, and the FEF25-75/FVC ratio were examined. Percent predicted phenotypes were created using each participant's latest exam with spirometry. Predicted lung function was estimated using models defined in the set of healthy never-smokers, and standardized residuals of percent predicted measures were created adjusting for smoking status, pack-years, and body mass index (BMI). All modeling was performed stratified by sex and cohort. Mean spirometry phenotypes were created using data from two examinations and adjusting for age, BMI, height, smoking and pack-years. Change in pulmonary function over time was studied using two to four examinations with spirometry to calculate slopes, which were then adjusted for age, height, smoking and pack-years.
RESULTS: Analyses were restricted to 70,987 autosomal SNPs with minor allele frequency > or = 10%, genotype call rate > or = 80%, and Hardy-Weinberg equilibrium p-value > or = 0.001. A SNP in the interleukin 6 receptor (IL6R) on chromosome 1 was among the best results for percent predicted FEF25-75. A non-synonymous coding SNP in glutathione S-transferase omega 2 (GSTO2) on chromosome 10 had top-ranked results studying the mean FEV1 and FVC measurements from two examinations. SNPs nearby the SOD3 and vitamin D binding protein genes, candidate genes for COPD, exhibited association to percent predicted phenotypes.
CONCLUSION: GSTO2 and IL6R are credible candidate genes for association to pulmonary function identified by GWA. These and other observed associations warrant replication studies. This resource of GWA results for pulmonary function measures is publicly available at http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?id=phs000007 webcite.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17903307      PMCID: PMC1995616          DOI: 10.1186/1471-2350-8-S1-S8

Source DB:  PubMed          Journal:  BMC Med Genet        ISSN: 1471-2350            Impact factor:   2.103


Background

Chronic obstructive pulmonary disease (COPD), which affects approximately six percent of the adult US population [1] and is the fourth most common cause of death in the US [2], has been defined as "airflow limitation that is not fully reversible" [3], a definition based on spirometry. Environmental factors, most notably tobacco smoking, are associated with accelerated longitudinal decline of pulmonary function and are important causes of COPD. Several lines of evidence indicate that genetic factors also contribute to the development of this condition. First, family studies have revealed increased risk of lung function impairment in smoking first degree relatives of COPD cases [4] and substantial heritability of spirometry measures in population based studies [5]. Second, severe alpha-1-antitrypsin deficiency due to homozygous mutations of the SERPINA1 (AAT) gene is a documented cause of COPD, although this condition explains only a small proportion of COPD in the population. Finally, family-based and case-control studies are beginning to reveal genetic variants other than those of the SERPINA1 gene that are associated with chronic airflow obstruction [6]. Despite this evidence of a genetic basis for susceptibility to COPD, the specific genetic risk factors underlying most cases of COPD remain uncertain. The Framingham Heart Study offers the opportunity to conduct family-based linkage and association studies seeking potential genetic factors that influence obstructive (COPD, asthma), restrictive, or developmentally related lung function impairment. Using spirometry measurements as quantitative phenotypes, we have previously reported results of genome-wide linkage analyses among these same families using microsatellite markers spaced approximately 10 centiMorgans apart [7] and fine-mapping of a promising region on chromosome 6q [8,9]. The availability of data on over 100,000 single nucleotide polymorphisms (SNPs) throughout the genome now permits the application of genome-wide association (GWA) testing to the search for genetic risk factors for chronic airflow limitation. The longitudinal pulmonary function data that have been obtained over the years of the Framingham Heart Study, in combination with the 100K SNP data, make this a unique resource for the discovery of novel genetic risk factors for chronic airflow obstruction.

Methods

Three types of spirometry phenotypes were evaluated for GWA: 1) measurements taken at a participant's most recent available examination and expressed as a percent of predicted; 2) the mean of measurements taken at two specified examinations; and 3) the annual rate of decline of spirometry measurements derived by calculating the slope of measurements across multiple examinations. All spirometry was performed without bronchodilator testing.

Measurements as percent of predicted at latest examination

The spirometry measurements from each participant's latest examination with acceptable pulmonary function data [10] were used; eligible examinations included Cohort exams 19, 17, and 13 and Offspring exams 7, 6, 5, and 3. Predicted values for each lung function measurement were calculated using cohort and gender-specific regression models predicting spirometry measurements on the basis of age, age squared, and height squared [11] among Framingham subjects who were lifetime nonsmokers and had no history of chronic bronchitis, pulmonary disease, COPD/emphysema, asthma, or wheezing. The percent of predicted value was calculated by dividing the observed by the predicted value. Standardized residuals were then created by regressing the percent predicted on current smoking (y/n), former smoking (y/n), pack-years, and body mass index (BMI: kg/m2), in cohort and gender-specific models. Forced expiratory volume in one second (FEV1), forced vital capacity (FVC), forced expiratory flow between the 25th and 75th percentile (FEF25–75), the FEV1/FVC ratio, and the FEF25–75/FVC ratio were examined as cross-sectional percent of predicted measures. These phenotypes are referenced in tables preceded by the letters "pp" (for percent predicted), see Table 1 for explanation of phenotype abbreviations.
Table 1

Characteristics of phenotypes studied*

Eligible exam cycles
PhenotypeNaDescriptionOffspringCohortAdjustmentbHeritability
ppfev11217Percent predicted FEV1 for latest exam7, 6, 5, 319, 17, 13Predicted defined by age, age2, height, % predicted adjusted for current or former smoking, pack-years, and BMI0.36
ppfvc1217Percent predicted FVC for latest exam7, 6, 5, 319, 17, 13Same as ppfev10.45
ppratio1217Percent predicted FEV1/FVC for latest exam7, 6, 5, 319, 17, 13Same as ppfev10.29
ppfef1212Percent predicted FEF25–75 for latest exam7, 6, 5, 319, 17, 13Same as ppfev10.40
ppfefrat1212Percent predicted FEF25–75/FVC for latest exam7, 6, 5, 319, 17, 13Same as ppfev10.41
meanfev11222Mean FEV1 from two exams3 and 55 or 6 and 13age, age2, BMI, height, current or former smoking, and pack-years0.35
meanfvc1222Mean FVC from two exams3 and 55 or 6 and 13Same as meanfev10.51
meanratio1222Mean FEV1/FVC from two exams3 and 55 or 6 and 13Same as meanfev10.25
fev1slope1097Longitudinal slope of FEV17, 6, 5, 319, 17, 13age, age2, height, height2, pack-years at first exam, interim pack-years, and sustained smoking0.10
fefslope1059Longitudinal slope of FEF25–757, 6, 5, 319, 17, 13Same as fev1slope0.20

*Results for additional longitudinal slope phenotypes, residual from predicted phenotypes, phenotypes created from spirometry at a single exam, and phenotypes limited to smokers reporting 10 or more pack-years are available at

a) sample size with phenotype out of 1345 with GWA genotyping

b) adjustment performed in four separate regression models by sex and Original or Offspring cohort

Characteristics of phenotypes studied* *Results for additional longitudinal slope phenotypes, residual from predicted phenotypes, phenotypes created from spirometry at a single exam, and phenotypes limited to smokers reporting 10 or more pack-years are available at a) sample size with phenotype out of 1345 with GWA genotyping b) adjustment performed in four separate regression models by sex and Original or Offspring cohort

Mean of measurements at two specified examinations

In previous analyses of the genetics of lung function in the Framingham families [7,8], we have used the mean of the values of each spirometry measure from two specified examinations. For Cohort participants, spirometry data from exam cycles 5 or 6 and cycle 13 were used to generate the mean value. In Offspring participants, spirometry data from cycle 3 and cycle 5 were used to generate the mean value. The mean FEV1, FVC, and FEV1/FVC ratio were adjusted for the effects of age, age2, BMI, height, dummy variables indicating never, former, or current smoking status, and, for former and current smokers, pack-years. Standardized residuals were generated separately by sex and within Cohort or Offspring samples. These phenotypes are referenced in tables as meanfev1, meanfvc, and meanratio.

Annual rate of decline of measurements

Rate of decline phenotypes were defined by fitting a slope to the spirometry data from different time points. The examinations incorporated were the same as those eligible for percent of predicted phenotypes described above. Slopes were calculated by ordinary least-squares using all available data. A minimum of two eligible exams were needed to calculate a slope. Slopes were adjusted for the covariates age, age squared, height, and height squared, using mean ages and heights from included exams. Slopes were also adjusted for pack-years at first exam, interim pack-years, and sustained smoking (y/n). FEV1 and FEF25–75, the slope phenotypes with the highest heritability, were studied in the GWA. These phenotypes are referenced in tables as fev1slope and fefslope.

Statistical analysis methods

All SNPs were studied using family-based association tests (FBAT) and generalized estimating equations (GEE) [12] (see 100K Overview). SNP results reported met the criteria of having a minor allele frequency ≥10%, a Hardy-Weinberg p-value ≥ 0.001, and a call rate ≥ 80%. All reported FBAT tests also required a minimum of ten informative families. Multipoint variance component linkage analysis was implemented with a subset of 10,588 SNPs and all 612 available microsatellites studied in previous linkage analyses [12]. Multipoint identity-by-descent estimates were generated using the software Merlin [13]. Heritability estimates, estimating the proportion of the total phenotypic variance due to genetic effects, and variance component linkage analysis were performed using the software SOLAR [14]. In addition to evaluating all SNP associations with each phenotype individually, we developed a method to identify SNPs in or near genes that exhibited the strongest associations (as assessed by p-value) to multiple spirometry phenotypes. For each phenotype, we identified the 200 lowest p-values that met the criteria above and were localized within 60,000 base pairs of the transcription start or stop of a gene. All gene annotations are derived from the UCSC genome browser May 2004 assembly, build 125 [15,16]. We evaluated the frequency that a SNP appeared among the 200 lowest p-values in gene regions for the ten phenotypes. This strategy was based on the hypothesis that SNPs identified to be associated with multiple spirometry measures are more likely to reflect a true association with lung function than SNPs identified to be associated with only a single measurement. SNPs in gene regions that appeared among the lowest 200 p-values in five or more of the phenotypes studied are reported.

Candidate genes

Genes previously reported in the literature to be associated with spirometry measures or pulmonary disease were examined to determine whether any available 100K SNPs in or near the genes were associated with spirometry phenotypes. Twelve COPD candidate genes studied in the Boston Early-Onset COPD cohort [17], and the SERPINE2 gene, a novel gene identified through linkage and association with COPD in the same cohort [6], were reviewed. The previously established COPD gene alpha-1-antitrypsin (SERPINA1) and the cystic fibrosis transmembrane conductance regulator (CFTR) as well as additional genes in the class of Glutathione S-transferases (O1, O2, M2, T1, T2) and surfactant proteins (SFTPA1, SFTPC) were reviewed. In addition, extracellular super oxide dismutase (SOD3) [18,19], interleukin-8 receptor alpha (IL8RA) [20], interleukin-10 (IL10) [21], beta-2 adrenergic receptor (ADRB2) [22], and transforming growth factor beta-1 (TGFB1) [23] were examined as COPD candidates. The GEE and FBAT SNP association results in or within 60 kilobase pairs (kb) of these 27 genes was reviewed. By specifying a 60 kb distance around the gene to screen results, we were able to identify SNPs near most candidate gene regions, but often the SNP reported does not lie strictly within the transcription start or stop of the gene of interest.

Results

Table 1 reports the heritability estimates for each of the ten phenotypes presented. The slope phenotypes have lower heritability than their corresponding cross sectional phenotypes. FVC has the highest heritability among phenotypes defined using the same method (percent predicted or mean). The percent predicted FEF25–75/FVC ratio had a higher heritability estimate than either FEV1/FVC ratio phenotype. Table 2 reports the SNPs with the lowest p-values for any of the 10 phenotypes evaluated in GWA analysis. Results were ranked by p-value and the top 25 SNPs are reported. In some cases, a SNP had top ranked results for multiple correlated phenotypes, and thus the p-value for all phenotypes is reported at the ranking position of the best p-value. Table 2a was ranked by GEE p-value, and Table 2b was ranked by FBAT p-value. SNP positions are reported according to NCBI Human Genome Build 35. SNPs in the Affymetrix set whose chromosome and position are unknown and SNPs on sex chromosomes are not reported for association. In total, 70,987 SNPs were considered in association analyses. None of these results achieved a conservative threshold for genome-wide significance. All results, regardless of allele frequency, call rate, or deviation from Hardy-Weinberg are publicly available at .
Table 2

GEE, FBAT, and linkage results

2a Top Association results based on GEE p-value
PhenotypeSNPChrPhysical positionGEE p-valueFBAT p-valueGene Region

fev1slope1. rs386749815226298801.36 × 10-060.07SNRPN
meanfvc2. rs4410517936986512.16 × 10-060.005COL1A2
meanratio3. rs283881521454540182.63 × 10-060.0002ADARB1
ppfvc4. rs145578215908519704.23 × 10-060.18FLJ32831
meanfvc5. rs1051654141084728264.32 × 10-067.19 × 10-05
meanfev11.7 × 10-050.0002
ppratio6. rs3105588515751445.14 × 10-060.14SNTG1
ppfev17. rs382092822275989715.33 × 10-060.0005RHBDD1
ppfef8. rs73053214515885615.89 × 10-060.15NID2
ppfefrat9. rs80822514574676697.38 × 10-060.0008
ppratio1.45 × 10-050.007
ppfef10. rs412926711512393377.39 × 10-060.07IL6R
ppfev111. rs2906966*17152722428.31 × 10-065.03 × 10-05CDRT4
ppfvc12. rs35739471375066428.77 × 10-060.14
fefslope13. rs199416912334367839.55 × 10-060.05SYT10
meanratio14. rs222543421454585749.68 × 10-060.0004ADARB1
meanfvc15. rs156697(a)101060291759.78 × 10-069.42 × 10-05 GSTO2
meanfev11.8 × 10-050.002
ppfefrat16. rs56442513467970961.03 × 10-050.02
ppfef17. rs1049844114516139741.06 × 10-050.28NID2
ppratio18. rs88071321288478441.11 × 10-050.26AK128224
ppfefrat19. rs81173214574567381.3 × 10-050.20
ppfvc20. rs65581328295268661.58 × 10-050.74
fev1slope21. rs6972823730091371.69 × 10-050.10
ppfef22. rs9285611(b)1818950761.7 × 10-050.07
meanfev123. rs105048368886697091.72 × 10-050.28
meanfev124. rs149152031932431531.77 × 10-050.12
ppfvc25. rs9300826131030011761.79 × 10-050.11

2b Top Association results based on FBAT p-value

PhenotypeSNPChrPhysical positionGEE p-valueFBAT p-valueGene Region

ppratio1. rs109225301891392730.0018.7 × 10-07CCBL2
ppfefrat0.0023.78 × 10-06
ppfef0.0052.36 × 10-05
ppfefrat2. rs37536831891395920.0041.87 × 10-06CCBL2
ppratio0.0043.59 × 10-06
ppfef0.011.42 × 10-05
ppratio3. rs21934914595251680.016.15 × 10-06LRRC9
ppfefrat0.044.91 × 10-05
ppratio4. rs21939114595546840.026.18 × 10-06LRRC9
ppratio5. rs21932614595129430.028.51 × 10-06LRRC9
ppfefrat0.063.74 × 10-05
ppfefrat6. rs14091491891411700.0059.12 × 10-06CCBL2
ppratio0.0051.08 × 10-05
ppfef0.015.14 × 10-05
ppfvc7. rs929919191109441000.0091.79 × 10-05
meanfvc8. rs10515289(c)5993158450.172.21 × 10-05
ppfvc9. rs1049813722229669430.052.23 × 10-05PAX3
ppfef10. rs385828210904243910.0012.92 × 10-05LIPF
ppratio11. rs9053674598963630.022.95 × 10-05
meanfvc12. rs104958722379152660.053.02 × 10-05
fev1slope13. rs134722212808966220.013.27 × 10-05
ppratio14. rs20094884598958950.033.43 × 10-05
meanfvc15. rs648125710586085580.023.52 × 10-05
ppratio16. rs46195114596089860.023.63 × 10-05LRRC9
meanfev117. rs49155261514031690.013.86 × 10-05MTHFD1L
meanfev118. rs19101374270873290.084.04 × 10-05
ppfvc19. rs2831605(d)21284670640.034.41 × 10-05
ppfev120. rs2906966*17152722428.31 × 10-065.03 × 10-05CDRT4
fefslope21. rs67409192675317060.00095.17 × 10-05ETAA16
meanfev122. rs104988186631341290.025.25 × 10-05
ppfefrat23. rs13935934599076890.065.44 × 10-05
ppfefrat24. rs93120804598914510.115.63 × 10-05
ppfefrat25. rs78513639207483060.0065.7 × 10-05KIAA1797

2c Linkage peaks with LOD score > 2.0

PhenotypeSNPChrPhysical position1.5-LOD interval start1.5-LOD interval endLOD score

meanfev1rs230008161680975261655130971707885502.89
fefslopers5909594X(e)1180571981139704341243116952.86
ppfev1AGC001b61707885501642681521707885502.65
ppratiors1049704221506758131482967931542032612.40
meanratiors105186691828117397499996588728520(b)2.37
ppfefrs2974490511343377595608129124687730(c)2.29
meanfvcrs1048890821219047041980536134461762(d)2.29
ppfev1rs1051803241703227641617684721815016902.28
ppfvcrs1048954212251692602136251962334784382.17
ppfefrs721411173690247028313357525196352.17
ppfev1rs753765175732378245345729596770872.17
meanfev1rs491876210114389096102080914122214789(a)2.12
ppfvcrs1051782541662510911565077191910913332.02

*) SNP occurring in both Table 2a and 2b

a) SNP associated to meanfvc and meanfev1 in Table 2a under linkage peak for meanfev1 on chromosome 10.

b) SNP associated to ppfef in Table 2a under linkage peak for meanratio on chromosome 1.

c) SNP associated to meanfvc in Table 2b under linkage peak for ppfef on chromosome 5.

d) SNP associated to ppfvc in Table 2b under linkage peak for meanfvc on chromosome 21.

e) X chromosome linkage results are not available online

GEE, FBAT, and linkage results *) SNP occurring in both Table 2a and 2b a) SNP associated to meanfvc and meanfev1 in Table 2a under linkage peak for meanfev1 on chromosome 10. b) SNP associated to ppfef in Table 2a under linkage peak for meanratio on chromosome 1. c) SNP associated to meanfvc in Table 2b under linkage peak for ppfef on chromosome 5. d) SNP associated to ppfvc in Table 2b under linkage peak for meanfvc on chromosome 21. e) X chromosome linkage results are not available online Only a single SNP among those reported in Tables 2a and 2b is a known coding SNP. Among the top ranked GEE p-values was a non-synonymous coding SNP (rs156697) in the Glutathione S-transferase omega 2 gene (GSTO2) on chromosome 10. The SNP was among the top ranked GEE p-values for association with the mean FEV1 and mean FVC phenotypes. Other SNPs reported in Table 2a were localized to intronic gene regions in COL1A2, ADARB1, SNTG1, RHBDD1, NID2, IL6R, and SYT10. The best GEE p-value localized to the untranslated region of SNRPN. SNPs identified by FBAT p-value, reported in Table 2b, located in introns were in CCBL2, LRRC9, PAX3, LIPF, MTHFD1L, and KIAA1797. Additional genes reported in Tables 2a and 2b were within 60 kb of the associated SNP. SNP rs2906966, among the top 25 p-values for both tests, is located near CDRT4. SNPs reported may be in linkage disequilibrium (LD). Among the top six FBAT results, strong LD was observed between the three chromosome 1 SNPs in CCBL2, and the three chromosome 14 SNPs in LRRC9, thus only 2 regions are being identified. Linkage to all autosomes and the X chromosome was performed. Table 2c reports all LOD scores above 2.0 with the 1.5-LOD support interval. The best LOD score observed is in a region of linkage on chromosome 6q that was reported previously using microsatellites in the Framingham families [7,8]. The original LOD score of 2.4 for mean FEV1 using genome-wide microsatellites [7] was increased to a LOD of 2.89 with the addition of SNP data, and a LOD of 2.65 was observed in the same region for the percent predicted FEV1 phenotype. The second highest LOD score observed was 2.86 for the longitudinal FEF25–75 phenotype, which was located on the X chromosome. The percent predicted FEV1 and FVC phenotypes both had LOD scores over 2.0 on chromosome 4, with overlapping LOD support intervals centered around 166–170 Mb. Table 3 reports SNPs that met the quality control criteria and were among the top 200 p-values for SNPs located within 60 kb of a gene for at least five of the phenotypes studied. A SNP in an intron of the interleukin 6 receptor (IL6R) and a SNP near the sodium channel, voltage gated, type I alpha (SCN1A) gene were among the top ranked result for six phenotypes. All others were among the top ranked results for five phenotypes. The genes syntrophin, gamma 1 (SNTG1) and the chromosome 20 open reading frame 133 (C20orf133) appear in both the FBAT and GEE lists of top results across 5 phenotypes. The two SNPs in SNTG1 are separated by 104 kb and have low LD (r2 = 0.05 and D' = 0.28 in the HapMap CEU sample). The two C20orf133 SNPs are in strong LD (r2 = 0.78 in HapMap CEU), though rs10485771 is located 3' of the gene.
Table 3

Top ranked SNPs within 60 kb of a gene and associated with five or more of the phenotypes studied

SNPChrbp positionGene region
3a GEE

rs41292671151239337IL6R
rs75870262166804257SCN1A
rs38209282227598971RHBDD1
rs445347553011487NDUFS4
rs310558851575144SNTG1
rs5814461810646848FAM38B
rs104857702013943214C20orf133

3b FBAT

rs104890304245211053' of SOD3
rs2438345590198041GPR98
rs77590336116404207FRK
rs2391996731487207C7orf16
rs10504106851471079SNTG1
rs104857712013999103C20orf133
Top ranked SNPs within 60 kb of a gene and associated with five or more of the phenotypes studied Table 4 reports the best p-value observed when examining SNP results specifically in the regions of the 27 candidate genes. SNPs reported are within 60 kb of the transcription start or stop for the gene listed, and sometimes lie within a different gene. Only 20 of the 27 genes of interest had SNPs in or within 60 kb of the gene. No SNPs were near enough to the genes GSTM1 and TNF, presented by Hersh et al. (2005), and GSTO1, GSTT1, SFTPA1, SFTPC and IL8RA to be considered. Of the 20 best FBAT results, 12 were p-values less than 0.05, but only 5 were p-values less than 0.01. Of the best GEE results, 14 were p-values less than 0.05, and 5 were p-values less than 0.01. Three gene regions had p-values less than 0.01 for both types of test, and these were CFTR, GSTO2, and SOD3. The vitamin D binding protein (GC) region produced an FBAT p-value of 0.0009, though GEE results were weaker.
Table 4

Candidate gene evaluation

Gene regionChrSNPPhenotypeFBAT p-valueGEE p-value
Alpha-1-antitrypsin (AAT/SERPINA1)14rs2402446rs10484042ppfvcfev1slope0.020.900.300.01
Beta-2 adrenergic receptor (ADRB2)5rs30329rs9325117fefslopemeanratio0.009-0.070.04
cystic fibrosis transmembrane regulator (CFTR)7rs213987rs10487367meanfvcppratio0.0060.25 0.380.002
Microsomal epoxide hydrolase (EPHX1)1rs3738051fefslopeppfefrat0.010.030.300.04
Vitamin D binding protein (GC)4rs423817rs842873ppfefratppfefrat0.00090.05 0.060.02
Glutathione S-transferase M2 (GSTM2)1rs542338meanratiofev1slope0.190.430.630.20
Glutathione S-transferase O2 (GSTO2)10rs156697meanfvc9.4 × 10-059.8 × 10-06
Glutathione S-transferase P1 (GSTP1)11rs688878ppfefratppfvc0.070.390.350.02
Glutathione S-transferase T2 (GSTT2)22rs140289ppfvcppfev10.170.410.130.10
Heme oxygenase (HMOX1)22rs2267331rs10483190fev1slopemeanratio0.070.820.800.008
Interleukin-10 (IL10)1rs10494879fefslopefev1slope0.060.460.730.02
Matrix metalloproteinase-1 (MMP1)11rs495366fefslopeppfef0.020.200.130.03
Matrix metalloproteinase-9 (MMP9)20rs2903908ppfefrat0.190.05
Alpha-1-antichymotrypsin (SERPINA3)14rs10484047ppfvcfev1slope0.020.100.030.01
Serine proteinase inhibitor E2 (SERPINE2)2rs717610meanfvcmeanratio0.040.190.0030.001
Surfactant protein B (SFTPB)2rs7577293meanratio0.020.06
Surfactant protein D (SFTPD)10rs726289fefslope0.020.03
extracellular super oxide dismutase (SOD3)4rs10489030ppfev1ppfvc0.00050.010.020.007
transforming growth factor beta-1 (TGFB1)19rs3745295fev1slopeppfev10.210.740.620.18
Tissue inhibitor of metalloproteases-2 (TIMP2)17rs2889529ppfefmeanratio0.070.180.110.09

Best p-value for FBAT and GEE tests reported with specific SNP*phenotype producing result. P-values < 0.01 bolded.

Candidate gene evaluation Best p-value for FBAT and GEE tests reported with specific SNP*phenotype producing result. P-values < 0.01 bolded.

Discussion

This is the first GWA of quantitative lung function measures to be reported, and it provides an opportunity for both hypothesis generation and hypothesis testing. We have identified a number of novel gene regions associated with pulmonary function. Associations with these SNPs and gene regions require replication in other study samples as well as functional studies before any statement about causality is warranted. Many of the best p-values are likely to reflect false positive results, and GEE results exhibited elevated Type I error [12] (see 100K Overview). Additional studies of this data will be useful, including smoking stratified analyses and more sophisticated approaches to creating multivariate phenotypes. However, several of the observed associations involve genes for which there are plausible biologic rationales for a relation to lung function phenotypes. The Glutathione S-Transferase (GST) superfamily genes are of interest because of their role in metabolism of xenobiotics, such as cigarette smoke. Recently, Hersh et al. studied GSTP1 and GSTM1 in two independent analyses of COPD and reported null findings. In contrast, a study of annual change in lung function measures in a population based cohort reported that the GSTT1 deletion alone or in combination with the GSTM1 deletion influenced decline in FEV1 in men [24]. Using the Affymetrix 100K SNP GeneChip, we have limited ability to directly confirm or refute the aforementioned findings because no SNPs were genotyped within 10 kb of the genes. Here, we show that a non-synonymous SNP in exon 5 of GSTO2 encoding an Asn142Asp amino acid change is among the most striking GWA results for mean FEV1 and FVC phenotypes. Both the non-synonymous SNP, rs156697, and a second SNP, rs156699, located in an intron exhibited strong association using both FBAT and GEE tests (r2 between SNPs = 0.8 in HapMap CEU). Linkage results also support the evidence for GSTO2, as the gene's position lies within the confidence interval around the LOD of 2.12 observed on chromosome 10 for mean FEV1 (Table 2c). GSTO2 is involved in the biotransformation of arsenic, which is a component of cigarette smoke, and may exhibit modest expression in bronchial epithelial cells. Gene expression studies in COS-1 cells demonstrated that the Asp142 variant exhibited 76% of the level of expressed protein occurring in the wild-type, and expression levels were further reduced to 15% when the Asp142 occurred in conjunction with an Ile158 variant [25]. The observation of strong association to a non-synonymous polymorphism with demonstrated effects on gene expression is compelling. Moreover, this finding in conjunction with a growing literature on GST gene association with pulmonary phenotypes suggests that a complete evaluation of functional variants in this gene family may be warranted. The IL6R SNP was not only among the top 25 p-values for percent predicted FEF25–75, but also among the top 200 p-values in gene regions for six of the ten phenotypes presented. IL6R is thought to be expressed in lung, and may play a role in the immune response. Recently, we have shown that IL6 levels in blood were associated with impaired lung function in the Framingham offspring cohort [26]. The IL6 pathway, as a mediator of the inflammatory process, is of interest as it relates to lung function phenotypes. The SNP identified in the SOD3 region lies within a hypothetical protein 3' of the SOD3 gene. The non-synonymous SNP in SOD3 that has been reported for association with COPD (rs1799895) was not included in the HapMap, so we could not determine the extent of LD between it and the SNPs genotyped in this study. Another exon 3 SNP (rs2536512) located 519 base pairs away from rs1799895 is present in the HapMap. The associated SNP identified in this study, rs10489030, is in very low LD with the SOD3 exon 3 HapMap SNP (D' = 0.32 and r2 = 0.005 in HapMap CEU). The low LD between the SOD3 exonic SNP and this 3' SNP, separated by 43.5 kb, does not suggest a clear replication of association with the SOD3 gene, but suggests that the genomic region is of continued interest. Also on chromosome 4, a SNP in the region of the vitamin D binding protein (GC) was associated with the percent predicted FEF25–75/FVC ratio with a p-value of 0.0009 using the FBAT test, but this SNP is in low LD with the Asp432Gly polymorphism, rs7041 (D' = 0.17 and r2 = 0.007 in HapMap CEU). The SNP identified in the region of SERPINE2 (rs717610) was not in LD with the six reported SNPs replicating significant associations to COPD [6] that were also available in HapMap, as the r2 values ranged from 0.002 to 0.008. Two of the top SNPs (rs3820928, GEE rank #7; rs10498137, FBAT rank #9) lie within the linkage region identified for FEV1/FVC in severe early-onset COPD cases [27] that subsequently led to the discovery of the SERPINE2 associated SNP. SNP rs3820928 was among those identified with association to five of the phenotypes studied and lies in the gene RHBDD1 (alias DKFZp547E052). Not much is known about the function of this gene, but the associated SNP is located in a region with LD extending to the adjacent COL4A4 gene. The rs3820928 exhibited an r2 of 0.81 with two non-synonymous coding SNPs in COL4A4 in the HapMap CEU data. Defects in Type IV collagen genes have been shown to influence Goodpasture's syndrome, an autoimmune disease affecting the lung [28], and both COL4A4 and COL4A3 lie in this region, sharing a common promoter. These results do not provide a strong replication of the original SERPINE2 SNP associations due to the low LD between SNPs reported in the literature and the SNPs from the 100K data with association. However, the results suggest that chromosome 2q is of continued interest and may harbor multiple genes influencing lung function. Studying lung function measures in a community-based sample may identify genetic variants associated with lung growth and development, susceptibility to obstructive ventilatory impairment related to asthma, emphysema and COPD or susceptibility to restrictive ventilatory impairment due to pulmonary fibrosis or other processes. The relevance to disease pathogenesis of associations between SNPs and lung function must be interpreted with caution, and some of the observed associations may reflect polymorphisms that protect against ventilatory impairment by leading to better lung function in early life or protection against the adverse effects of cigarette smoking. All of the GWA results are publicly available at . Replication of novel results identified by GWA will be the true test of the value of the GWA approach to gene discovery.

Conclusion

These publicly available results provide a resource for investigators to assess whether their findings of association to pulmonary function phenotypes replicate in the Framingham population. In addition, we have identified novel results that warrant replication studies in other populations.

Abbreviations

BMI = body mass index; COPD = chronic obstructive pulmonary disease; FEV1 = forced expiratory volume in one second; FBAT = family based association test; FVC = forced vital capacity; FEF25–75 = forced expiratory flow between the 25th and 75th percentile; GEE = generalized estimating equation; GWA = genome-wide association; kb = kilobase pairs (1000 base pairs); LD = linkage disequilibrium; SNPs = single nucleotide polymorphisms.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

JBW created the phenotypic residuals that were analyzed, reviewed GWA results, and drafted the manuscript text and tables. REW, DJG, and GTO assessed the quality of spirometry data, identified exclusion criteria, and defined the covariates and methods to develop phenotypes for analysis. JML wrote scripts to parse through the GWA results and facilitate the presentation of results. All authors read and approved the final manuscript.
  28 in total

1.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

Review 2.  Alport's syndrome, Goodpasture's syndrome, and type IV collagen.

Authors:  Billy G Hudson; Karl Tryggvason; Munirathinam Sundaramoorthy; Eric G Neilson
Journal:  N Engl J Med       Date:  2003-06-19       Impact factor: 91.245

3.  Linkage and association with pulmonary function measures on chromosome 6q27 in the Framingham Heart Study.

Authors:  Jemma B Wilk; Anita L DeStefano; Oscar Joost; Richard H Myers; L Adrienne Cupples; Karen Slater; Larry D Atwood; Nancy L Heard-Costa; Alan Herbert; George T O'Connor; Daniel J Gottlieb
Journal:  Hum Mol Genet       Date:  2003-09-09       Impact factor: 6.150

4.  Secreted modular calcium-binding protein 2 haplotypes are associated with pulmonary function.

Authors:  Jemma B Wilk; Alan Herbert; Christina M Shoemaker; Daniel J Gottlieb; Samer Karamohamed
Journal:  Am J Respir Crit Care Med       Date:  2007-01-04       Impact factor: 21.405

5.  The transforming growth factor-beta1 (TGFB1) gene is associated with chronic obstructive pulmonary disease (COPD).

Authors:  Juan C Celedón; Christoph Lange; Benjamin A Raby; Augusto A Litonjua; Lyle J Palmer; Dawn L DeMeo; John J Reilly; David J Kwiatkowski; Harold A Chapman; Nan Laird; Jody S Sylvia; Melvin Hernandez; Frank E Speizer; Scott T Weiss; Edwin K Silverman
Journal:  Hum Mol Genet       Date:  2004-06-02       Impact factor: 6.150

6.  Chronic obstructive pulmonary disease surveillance--United States, 1971-2000.

Authors:  David M Mannino; David M Homa; Lara J Akinbami; Earl S Ford; Stephen C Redd
Journal:  Respir Care       Date:  2002-10       Impact factor: 2.258

7.  Systemic inflammation and COPD: the Framingham Heart Study.

Authors:  Robert E Walter; Jemma B Wilk; Martin G Larson; Ramachandran S Vasan; John F Keaney; Izabella Lipinska; George T O'Connor; Emelia J Benjamin
Journal:  Chest       Date:  2007-10-01       Impact factor: 9.410

8.  Genomewide linkage analysis of quantitative spirometric phenotypes in severe early-onset chronic obstructive pulmonary disease.

Authors:  Edwin K Silverman; Lyle J Palmer; Jonathan D Mosley; Matthew Barth; Jody M Senter; Alison Brown; Jeffrey M Drazen; David J Kwiatkowski; Harold A Chapman; Edward J Campbell; Michael A Province; D C Rao; John J Reilly; Leo C Ginns; Frank E Speizer; Scott T Weiss
Journal:  Am J Hum Genet       Date:  2002-03-25       Impact factor: 11.025

9.  Glutathione S-transferase genotypes modify lung function decline in the general population: SAPALDIA cohort study.

Authors:  Medea Imboden; Sara H Downs; Oliver Senn; Gabor Matyas; Otto Brändli; Erich W Russi; Christian Schindler; Ursula Ackermann-Liebrich; Wolfgang Berger; Nicole M Probst-Hensch
Journal:  Respir Res       Date:  2007-01-11

10.  The Framingham Heart Study 100K SNP genome-wide association study resource: overview of 17 phenotype working group reports.

Authors:  L Adrienne Cupples; Heather T Arruda; Emelia J Benjamin; Ralph B D'Agostino; Serkalem Demissie; Anita L DeStefano; Josée Dupuis; Kathleen M Falls; Caroline S Fox; Daniel J Gottlieb; Diddahally R Govindaraju; Chao-Yu Guo; Nancy L Heard-Costa; Shih-Jen Hwang; Sekar Kathiresan; Douglas P Kiel; Jason M Laramie; Martin G Larson; Daniel Levy; Chun-Yu Liu; Kathryn L Lunetta; Matthew D Mailman; Alisa K Manning; James B Meigs; Joanne M Murabito; Christopher Newton-Cheh; George T O'Connor; Christopher J O'Donnell; Mona Pandey; Sudha Seshadri; Ramachandran S Vasan; Zhen Y Wang; Jemma B Wilk; Philip A Wolf; Qiong Yang; Larry D Atwood
Journal:  BMC Med Genet       Date:  2007       Impact factor: 2.103

View more
  47 in total

Review 1.  Genetic inheritance effects on endurance and muscle strength: an update.

Authors:  Aldo M Costa; Luiza Breitenfeld; António J Silva; Ana Pereira; Mikel Izquierdo; Mário C Marques
Journal:  Sports Med       Date:  2012-06-01       Impact factor: 11.136

2.  Lung function and airway diseases.

Authors:  Scott T Weiss
Journal:  Nat Genet       Date:  2010-01       Impact factor: 38.330

Review 3.  Genetics of complex respiratory diseases: implications for pathophysiology and pharmacology studies.

Authors:  Ma'en Obeidat; Ian P Hall
Journal:  Br J Pharmacol       Date:  2011-05       Impact factor: 8.739

Review 4.  The Effect of Ancestry and Genetic Variation on Lung Function Predictions: What Is "Normal" Lung Function in Diverse Human Populations?

Authors:  Victor E Ortega; Rajesh Kumar
Journal:  Curr Allergy Asthma Rep       Date:  2015-04       Impact factor: 4.806

5.  Pneumonia: host susceptibility and shared genetics with pulmonary function and other traits.

Authors:  M B Khadzhieva; A N Kuzovlev; L E Salnikova
Journal:  Clin Exp Immunol       Date:  2019-10-01       Impact factor: 4.330

6.  A role for glutathione transferase Omega 1 (GSTO1-1) in the glutathionylation cycle.

Authors:  Deepthi Menon; Philip G Board
Journal:  J Biol Chem       Date:  2013-07-25       Impact factor: 5.157

7.  Vitamin D binding protein gene polymorphisms and chronic obstructive pulmonary disease: a meta-analysis.

Authors:  Huan Chen; Lei Zhang; Zhiyi He; Xiaoning Zhong; Jianquan Zhang; Meihua Li; Jing Bai
Journal:  J Thorac Dis       Date:  2015-08       Impact factor: 2.895

8.  Superoxide dismutase 3 polymorphism associated with reduced lung function in two large populations.

Authors:  Morten Dahl; Russell P Bowler; Klaus Juul; James D Crapo; Samuel Levy; Børge G Nordestgaard
Journal:  Am J Respir Crit Care Med       Date:  2008-08-14       Impact factor: 21.405

9.  A pooling-based genome-wide analysis identifies new potential candidate genes for atopy in the European Community Respiratory Health Survey (ECRHS).

Authors:  Francesc Castro-Giner; Mariona Bustamante; Juan Ramon González; Manolis Kogevinas; Deborah Jarvis; Joachim Heinrich; Josep-Maria Antó; Matthias Wjst; Xavier Estivill; Rafael de Cid
Journal:  BMC Med Genet       Date:  2009-12-06       Impact factor: 2.103

10.  Chronic obstructive pulmonary disease: towards pharmacogenetics.

Authors:  Alice M Wood; See Ling Tan; Robert A Stockley
Journal:  Genome Med       Date:  2009-11-30       Impact factor: 11.117

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.