Literature DB >> 20698975

Asthma-susceptibility variants identified using probands in case-control and family-based analyses.

Blanca E Himes1, Jessica Lasky-Su, Ann C Wu, Jemma B Wilk, Gary M Hunninghake, Barbara Klanderman, Amy J Murphy, Ross Lazarus, Manuel E Soto-Quiros, Lydiana Avila, Juan C Celedón, Christoph Lange, George T O'Connor, Benjamin A Raby, Edwin K Silverman, Scott T Weiss.   

Abstract

BACKGROUND: Asthma is a chronic respiratory disease whose genetic basis has been explored for over two decades, most recently via genome-wide association studies. We sought to find asthma-susceptibility variants by using probands from a single population in both family-based and case-control association designs.
METHODS: We used probands from the Childhood Asthma Management Program (CAMP) in two primary genome-wide association study designs: (1) probands were combined with publicly available population controls in a case-control design, and (2) probands and their parents were used in a family-based design. We followed a two-stage replication process utilizing three independent populations to validate our primary findings.
RESULTS: We found that single nucleotide polymorphisms with similar case-control and family-based association results were more likely to replicate in the independent populations, than those with the smallest p-values in either the case-control or family-based design alone. The single nucleotide polymorphism that showed the strongest evidence for association to asthma was rs17572584, which replicated in 2/3 independent populations with an overall p-value among replication populations of 3.5E-05. This variant is near a gene that encodes an enzyme that has been implicated to act coordinately with modulators of Th2 cell differentiation and is expressed in human lung.
CONCLUSIONS: Our results suggest that using probands from family-based studies in case-control designs, and combining results of both family-based and case-control approaches, may be a way to augment our ability to find SNPs associated with asthma and other complex diseases.

Entities:  

Mesh:

Year:  2010        PMID: 20698975      PMCID: PMC2927535          DOI: 10.1186/1471-2350-11-122

Source DB:  PubMed          Journal:  BMC Med Genet        ISSN: 1471-2350            Impact factor:   2.103


Background

Asthma [MIM 600807] is a chronic respiratory disease that affects over 20 million Americans and 300 million people worldwide [1,2]. The genetic basis of asthma has been explored for over two decades in candidate gene association, where more than 40 genes have been associated with asthma and replicated in at least one independent population [3,4]. Recently, genome-wide association (GWA) studies of asthma have found that variants in or near several genes, including ORMDL3 [MIM 610075] [5-12], CHI3L1 [MIM 601525] [13], TLE4 [MIM 605132] [14], PDE4D [MIM 600129] [15], DENND1B [MIM 613292] [16], RAD50-IL13 [MIM 604040, 147683] [17], and the HLA-DR/DQ region on chromosome 6p21.3 [17], contribute to the risk of asthma. The primary findings of the PDE4D GWA study were obtained in a case-control design consisting of children from an asthma clinical trial and publicly available population controls. Although we used a case-control design, genetic studies of these asthmatic children were originally intended to be part of a family-based design, as DNA was collected from children and their parents. We chose the case-control design in an effort to increase the power to detect genetic associations. Consistent with the approach taken in most GWA studies, we selected the variants with the lowest p-values in our primary population for replication in independent populations. Here, we extensively compare the GWA results of the case-control and family-based designs using probands from the same population, and we attempt to replicate the initial association findings of many SNPs with nominally significant p-values in a two-stage process. Our results help to determine whether the case-control design, the family-based design, or a combination of results from both designs, is more powerful to identify asthma-susceptibility variants.

Methods

Subjects

Our primary population is composed of 422 non-Hispanic white subjects from the Childhood Asthma Management Program (CAMP), a clinical trial that followed 1,041 asthmatic children for four years and nearly 80% of the original participants for 12 years [18]. CAMP participants and their parents provided DNA for family-based genetic studies. Additionally, CAMP probands were used in a case-control design by matching them with 1,533 white population controls that are publicly available through the Illumina iControlDB resource (http://www.illumina.com/science/icontroldb.ilmn) [15].

Genotyping and Quality Control

Genome-wide SNP genotyping for 422 Caucasian CAMP subjects, their families, and iControlDB controls was performed on Illumina's HumanHap550 Genotyping BeadChip (Illumina, Inc., San Diego, CA). Details of the quality control (QC) criteria used to screen the genome-wide SNP data have been provided previously [15]. Briefly, of the 422 CAMP subjects who were genotyped, 403 had genotyping completion rates greater than 95% and were used in subsequent analyses. SNPs were excluded for having low clustering scores (n = 6,257), flanking sequences that did not map to a unique position on the HG17 reference genome (n = 1,329), having 5 or more Mendel errors (n = 2,445), or being monomorphic (n = 3790). For the family-based analysis, hereafter referred to as CAMP Trio study, 534,290 SNPs in 403 probands and their parents passed QC filters. For the case-control analysis, additional QC filters were used. Subjects were excluded for being siblings of other subjects (23 cases) or showing evidence of identity by descent (IBD) (57 controls) or sex discordance (3 controls). SNPs were excluded if they were missing in more than 5% of subjects (n = 3,837), had minor allele frequency (MAF) less than 1% (n = 17,088), had Hardy-Weinberg equilibrium p-values among controls < 0.001 (n = 2,046), or had a significantly different (p-value < 1E-05) missing rate in cases and controls (n = 6,642). After these QC filters, 518,230 SNPs remained. Genetic matching (GEM) [19] was used to control for population stratification, and as a result of implementing this procedure, 21 cases and 687 controls were dropped. The remaining 1,205 subjects (359 cases, 846 controls) had a genomic inflation factor of 1.03, demonstrating minimal population stratification. After re-enforcing a SNP MAF threshold of greater than 1% in the remaining subjects, 516,617 SNPs remained for the analysis of CAMP cases and Illumina controls, which will hereafter be referred to as the CAMP/Illumina study.

Statistical Analysis

Figure 1 is an overview of our study design. First, we performed CAMP/Illumina and CAMP Trio GWA analyses [Figure 1A]. SNPs for further consideration were selected by choosing nominal p-value cutoffs in both the case-control and family-based analyses. Case-control associations were measured in PLINK [20] using Cochran-Armitage trend tests. Family-based association statistics for 403 CAMP trios, assuming an additive model of inheritance, were calculated using PBAT version 3.6 [21]. Those SNPs with CAMP/Illumina Cochran-Armitage trend test p-values < 0.01 and FBAT additive model p-values < 0.05 were selected for replication in an initial independent population [Figure 1B]. Further replication of association results was attempted in two additional independent populations [Figure 1C]. This two-staged replication approach was taken so that a larger number of SNPs could be genotyped in the initial replication population, which was most similar to CAMP as it was also composed of children who were carefully ascertained for asthma studies. Subsequently, a smaller number of SNPs that successfully replicated in the initial replication population could be tested in two additional populations. Joint evidence for association across replication populations was measured by combining p-values using the Liptak method [22] [Figure 1D]. In combining p-values, all hypothesis tests in replication populations had one-sided alternatives (based on the direction of the association in the testing population) so that SNPs with association tests in opposite directions would not produce inappropriately small p-values. Effect estimates were calculated using allelic odds ratios (ORs) for case-control data. To evaluate directionality of effect in trios, transmitted to untransmitted ratios (T:U) were calculated in Haploview. Power calculations for "TDT with discrete traits" and "case-control with discrete traits" designs were performed using the Genetic Power Calculator by Purcell S, et al [23] with high risk and marker allele frequencies of 0.10/0.40, prevalence of 0.10, D-prime of 1, assuming use of unselected controls for the case-control statistics, and with default error rates (alpha = 0.05, power = 0.80).
Figure 1

Study Design. (A) The initial study population consisted of CAMP probands used in (1) a case-control design composed of 359 CAMP cases and 846 Illumina controls and (2) a family-based design composed of 403 trios. Genome-wide association of individual SNPs to asthma status was assessed in each of these designs. SNPs with a case-control Cochran-Armitage trend test p-value < 0.01 and a family-based PBAT additive model p-value < 0.05 were selected for replication analysis. (B) The first replication stage, carried out in CR, measured the association of 1378 SNPs with asthma. Those SNPs with either (1) a CR PBAT additive model p-value < 0.05 or (2) a CR PBAT additive model p-value < 0.10 and a p-value < 1E-05 from the designs in (A) were selected for the next replication stage. (C) The second replication stage was carried out in two additional independent populations, FHS and iCAP. (D) Joint association analysis for 85 SNPs with data in the three independent populations was performed.

Study Design. (A) The initial study population consisted of CAMP probands used in (1) a case-control design composed of 359 CAMP cases and 846 Illumina controls and (2) a family-based design composed of 403 trios. Genome-wide association of individual SNPs to asthma status was assessed in each of these designs. SNPs with a case-control Cochran-Armitage trend test p-value < 0.01 and a family-based PBAT additive model p-value < 0.05 were selected for replication analysis. (B) The first replication stage, carried out in CR, measured the association of 1378 SNPs with asthma. Those SNPs with either (1) a CR PBAT additive model p-value < 0.05 or (2) a CR PBAT additive model p-value < 0.10 and a p-value < 1E-05 from the designs in (A) were selected for the next replication stage. (C) The second replication stage was carried out in two additional independent populations, FHS and iCAP. (D) Joint association analysis for 85 SNPs with data in the three independent populations was performed.

Replication Studies

(1) CR. This cohort consists of 592 probands from the Genetics of Asthma in Costa Rica Study (CR), which is comprised of Costa Rican schoolchildren with asthma and their parents [24,25]. Children had a high probability of having at least six great-grandparents born in the Central Valley of Costa Rica and were defined as having asthma if they had a doctor's diagnosis of asthma and at least two respiratory symptoms or asthma attacks in the year prior to enrollment in the study. Most genotype data to replicate CAMP/Illumina findings was obtained with an Illumina 1536 GoldenGate assay. Of the 1536 SNPs attempted, 1375 met the following quality thresholds for analysis, after removal of 9 failed subjects: 1) completion rate > 95%, 2) three or fewer Mendelian inconsistencies, 3) zero discordance among replicate samples, and 4) MAF > 0. Data for 2 SNPs (rs261119, rs2777899) were obtained with the SEQUENOM MassARRAY system (Sequenom, Inc., San Diego, CA), which utilizes iPlex chemistry. Data for one SNP (rs11778371) was obtained with Taqman real-time PCR with an ABI Prism 7900 machine (Applied Biosystems, Foster City, CA). Standard PCR conditions, as recommended by the manufacturer, were used. Overall, 583 trios had good genotype data available for 1378 SNPs. Family-based association statistics for asthma affection status under an additive model were calculated using Golden Helix PBAT version 6.4.0 [21]. T:U of alleles were calculated in Haploview [26]. (2) FHS. The Framingham Heart Study (FHS) is a family-based study that conducted clinical examinations, including spirometry and collection of smoking history data, on three generations of white adults of European descent, and research participants provided DNA samples that have recently been genotyped for genome-wide association studies [27,28]. Asthma was classified based on self-report of physician diagnosis, and according to this definition, there were 961 cases and 6,516 controls. In FHS subjects, genotyping was performed using the Affymetrix GeneChip Human Mapping 500 K Array Set and an additional Affymetrix 50 K Array (HuGeneFocused50K). Because data from these assays did not include that of some associated SNPs that passed the replication stage in CR, those genotypes were inferred using imputation with the Markov Chain Haplotyping software (MaCH) [29]. The ratio of the empirically observed dosage variance to the expected (binomial) dosage variance for these imputed SNPs was greater than 0.9, indicating good quality of imputation. Association to asthma was measured using logistic regression models with robust variance estimated via generalized estimating equations with each pedigree as a cluster, while adjusting for age, former smoking, current smoking, pack-years, sex, BMI, and membership in one of the three recruited generations. The genomic inflation factor for the imputed genome-wide results was 1.048, indicating minimal population stratification. (3) iCAP. The i2b2 Crimson Asthma Project (iCAP) consists of Partners Healthcare System, Inc. (Boston, MA) patients who were selected based on extracted de-identified electronic medical record (EMR) data and whose DNA was obtained via discarded clinical samples. Specifically, tools developed by the National Center for Biomedical Computing entitled "Informatics for Integrating Biology to the Bedside" (i2b2, http://www.i2b2.org) have facilitated extraction of de-identified demographic and clinical information from EMRs of patients. Using i2b2 resources, a large set of asthmatic and non-asthmatic Partners Healthcare patients has been identified on the basis of International Classification of Diseases, Ninth Revision (ICD-9) codes for asthma (i.e. those beginning with 493) [30,31]. In order to conduct genomic studies of these patients, clinical samples that are routinely collected at healthcare visits were obtained via the Crimson Project (http://www.crimsonproject.org), which identifies discarded Partners Healthcare clinical samples that are ordered for routine clinical tests, and prospectively collects the samples that have been requested by an approved study. For this study, to further ensure that cases truly had asthma, medication history extracted from EMR records was utilized. Cases (n = 220) were defined as those patients whose EMRs contained an asthma ICD-9 code and whose medication history included usage of at least one beta-agonist or inhaled corticosteroid. Controls (n = 853) were selected as those patients who had been seen in the three years prior to blood collection in at least one of over 850 outpatient clinics but did not have any asthma ICD-9 codes. The gender composition of cases (19.5% male, 80.5% female) and controls (15.8% male, 84.2% female) was not statistically different (Fisher's exact p-value 0.19) although there is a high prevalence of female subjects overall due to a large portion of patients being recruited at Brigham and Women's Hospital, which has a high proportion of female patients. There was no significant difference in age between cases (mean = 28.8 years [SD 6.0, range = 4 to 35]) and controls (mean = 30.0 years [SD 4.1, range = 18 to 35]) at the time of DNA collection (Wilcoxon rank sum p-value = 0.082). Genotyping of the SNPs of interest was carried out with an Illumina GoldenGate assay. Markers were analyzed if they met the same quality standards as described for Costa Rica genotyping, with the exception of Mendelian checks, which were not applicable. Genotyped SNPs included those for replication of association to asthma as well as two panels of SNPs to measure population stratification: (1) a set of intergenic SNPs selected randomly throughout the genome [32], and (2) a set of ancestry informative markers (AIMs) [33]. The random panel of 187 SNPs had an association χ2 187 df = 168.7, corresponding to a p-value of 0.83. The set of 248 AIM SNPs was used to compute principal components describing variation in iCAP subject data using EIGENSTRAT [34]. The principal components were used to obtain EIGENSTRAT-corrected association statistics for replication SNPs. The r2 between corrected and uncorrected association statistics was 0.95. Thus, no significant evidence of population stratification was found by two methods.

Results

Probands from CAMP were used to measure association of SNPs to asthma using both a case-control and a family-based GWA study design. The top SNPs according to each design are shown in Table 1. In the case-control design (i.e. CAMP/Illumina), which utilized publicly available population controls, 516,617 SNPs in 1,205 subjects (359 cases, 846 controls) were evaluated for association using the Cochran-Armitage trend test, and 15 SNPs had p-values < 1E-05. In the family-based GWA (i.e. CAMP Trio study), 534,290 SNPs in 403 trios were ranked according to PBAT additive model p-values, and 13 SNPs had p-values less than a nominally significant level of 1E-05. There was no overlap among the top (i.e. those with p-value < 1E-05) CAMP/Illumina and CAMP Trio SNPs.
Table 1

Top-ranked SNPs in each Initial Study (CAMP/Illumina or CAMP Trio p-value < 1E-05)

RankDiscovery P-valuesStage 1Stage 2


SNPCHRBPCAMP/IlluminaCAMP TrioCAMP/IlluminaCAMP TrioReplication Attempted?CRFHSCAPLiptak P-value*
Top CAMP/Illumina

rs2548659559419643131222.07E-075.9E-03yes0.064NA0.150.035
rs1588265559405551250005.11E-079.4E-03reported in [15]----
rs9832805594808943102535.71E-070.019yes0.0945.7E-03NA3.3E-03
rs11778371827375822429477.92E-075.6E-03yes0.29---
rs15447915594748395108521.16E-060.020yes/reported in [15]0.15---
rs6849091135843495678493.12E-060.015yes0.99---
rs12725071110502248873064154.52E-060.57no----
rs172197734619268238113764.56E-060.021yes0.83---
rs12930287166449590792283384.80E-060.42no----
rs11751990613082946910198524.97E-060.037yes0.0180.810.430.21
rs2761647239515931411NA5.83E-06NAno----
rs1272412911188022612433546.83E-060.082no----
rs77653746870013181382697.34E-060.015yes0.18---
rs291083055950296014168907.79E-060.032yes0.0690.061NA0.016
rs9318942138292851515342488.72E-060.065no----

Top CAMP Trio

rs12885484186536979495518.6E-038.7E-07yes0.24---
rs26113754414120106821.8E-031.2E-06yes0.0530.260.270.049
rs127343381200736346NA3NA1.7E-06no----
rs10483294186536752163742.8E-033.1E-06yes0.29---
rs2611595439698721253.3E-043.4E-06yes0.0860.290.270.072
rs12247820105327219655869.0E-044.2E-06yes0.74---
rs127434011200743271NA7NA4.2E-06no----
rs26111954418468499588.7E-034.2E-06yes0.44---
rs10396035434561630994.8E-045.0E-06yes0.0630.32NA0.079
rs27778991755187173828101.4E-036.1E-06yes0.34---
rs261125544228873764116.5E-036.6E-06yes0.62---
rs132674378418347420383120.0379.5E-06no----
rs946342564862099479984130.159.7E-06no----

*In all available independent populations

Top-ranked SNPs in each Initial Study (CAMP/Illumina or CAMP Trio p-value < 1E-05) *In all available independent populations Under the assumption that consistency of results in CAMP/Illumina and CAMP Trio increased the likelihood of an association being true, we proceeded to replicate those SNPs with CAMP/Illumina p-value < 0.01 and CAMP Trio p-value < 0.05 [Figure 2A]. Such thresholds were selected to have enough SNPs to fill a genotyping assay for 1536 SNPs, and the lower p-value threshold for CAMP/Illumina was chosen because of the increased power that CAMP/Illumina had to detect associations relative to CAMP Trio. Of 5604 SNPs with CAMP/Illumina p-values < 0.01, 1726 SNPs had CAMP Trio p-values < 0.05. Among this set of 1726 SNPs were some of the top SNPs according to the individual primary analyses: 9 CAMP Trio SNPs with p-values < 1E-05, and 9 CAMP/Illumina SNPs with p-values < 1E-05 [Table 1]. One (rs1588265) of the top 10 CAMP/Illumina SNPs that had CAMP Trio p-value < 0.05 was not genotyped because it was replicated previously in independent populations, including CR and FHS, and is in very tight linkage disequilibrium with three of the 9/10 other replicated SNPs [15].
Figure 2

Association Results. (A) Plot of CAMP Trio vs. CAMP/Illumina GWA results. SNPs selected for replication analysis have family-based PBAT additive model p-values < 0.05 and case-control Cochran-Armitage trend test p-values < 0.01. The subset of these SNPs that was genotyped in CR is shown in blue. Points with a pink background are SNPs whose association in several populations was reported previously [15]. (B) Plot of CAMP Trio vs. CAMP/Illumina GWA results for the subset of SNPs that was successfully genotyped in CR. Shown in green are those SNPs with CR p-values < 0.05. Shown in red are those SNPs that have CR p-values < 0.05 and either a FHS or iCAP p-value < 0.05.

Association Results. (A) Plot of CAMP Trio vs. CAMP/Illumina GWA results. SNPs selected for replication analysis have family-based PBAT additive model p-values < 0.05 and case-control Cochran-Armitage trend test p-values < 0.01. The subset of these SNPs that was genotyped in CR is shown in blue. Points with a pink background are SNPs whose association in several populations was reported previously [15]. (B) Plot of CAMP Trio vs. CAMP/Illumina GWA results for the subset of SNPs that was successfully genotyped in CR. Shown in green are those SNPs with CR p-values < 0.05. Shown in red are those SNPs that have CR p-values < 0.05 and either a FHS or iCAP p-value < 0.05. For replication Stage 1, a subset of 1378 of the 1726 SNPs was successfully genotyped in CR, an independent population of children with asthma [Figure 2A]. Because of our selected genotyping platform, an Illumina 1536 GoldenGate assay, we had to constrain the initial set of 1726 SNPs to 1536. Most SNPs were excluded based on LD: we attempted to capture all regions of association by selecting at least one SNP within sets of SNPs that were in strong (pairwise r2 > 0.80) LD, but we excluded remaining SNPs within such sets. However, some SNPs were excluded because they had low GoldenGate assay design scores while others were excluded because they failed the GoldenGate assay. Three of the SNPs (rs261119, rs2777899, rs11778371) that failed the GoldenGate assay were genotyped by other methods in Stage 1 because they were among the top-ranked SNPs in the family-based and case-control designs, and we felt that their results were essential to compare the replication of SNPs selected based on the different study designs. Of the 1378 SNPs that were successfully genotyped in Stage 1, 78 had 1-sided p-values < 0.05 with CR effects in the same direction as CAMP [Figure 2B, Table 2, Additional file 1, Table S1]. Only one (rs11751990) of these 78 SNPs was among the top-ranked SNPs from the individual studies shown in Table 1.
Table 2

SNPs that replicated in 2 of 3 independent populations

Discovery P-valuesRankReplication P-values

Stage 1Stage 2

SNPCHRBPCAMP/IlluminaCAMP TrioCAMP/IlluminaCAMP TrioCRFHSiCAPLiptak P-valueNearest RefSeq Gene(s)Distance From Gene
rs1757258431443707334.3E-030.0432539225131.4E-030.0110.0523.5E-05CHST246233
rs1048934112079987448.6E-030.0474946251221.9E-030.0230.0569.2E-05TRAF3IIn gene
rs465363712236906392.9E-040.036189191340.0120.236.0E-037.6E-04LBRIn gene
rs53091411952718047.8E-030.0314525161968.9E-030.140.0351.2E-03MTMR2In gene
rs1081678991110323303.5E-030.0302082161210.0126.9E-030.291.2E-03EPB41LIn gene
rs24705216565247191.8E-040.023128122530.0363.3E-030.271.5E-03CNGB1In gene
rs153483714794527786.5E-048.8E-0341646170.0470.0590.0371.9E-03NRXN352265
rs71467911446102402.6E-037.7E-03154240530.0320.150.0282.9E-03CD8212351
rs119470344354486268.6E-030.018496398300.0130.430.0204.8E-03CENTD1295390
rs115249014558659873.4E-030.0312029162360.0450.0390.216.6E-03PELI228204
SNPs that replicated in 2 of 3 independent populations For replication Stage 2, 85 SNPs were evaluated in two additional independent populations, FHS and iCAP. These SNPs included the 78 SNPs with CR p-values < 0.05 and an additional 7 SNPs with CR p-values < 0.10 that were part of the top CAMP/Illumina and CAMP Trio SNPs. We found that 10 SNPs had a p-value < 0.05 in FHS or iCAP and had the same direction of effect across all populations [Tables 2, 3 and 4]. Overall Liptak p-values in the three independent populations revealed that one SNP passes a multiple comparisons correction threshold of 3.6E-05 (= 0.05/1378), corresponding to the number of SNPs genotyped in CR.
Table 3

Minor Allele Frequencies of SNPs that replicated in 2 of 3 independent populations

CAMP/IlluminaCAMP TrioCRFHSiCAP
SNPMinor AlleleAffectedUnaffectedAffectedUnaffectedAffectedUnaffectedAffectedUnaffectedAffectedUnaffected

rs17572584T0.0780.120.0850.100.0570.0740.0830.100.0730.10
rs10489341T0.0360.0180.0360.0270.0580.0460.0280.0220.0500.033
rs4653637G0.400.320.390.370.340.320.320.310.390.32
rs530914C0.540.480.520.500.540.470.510.490.510.46
rs10816789T0.190.140.190.180.140.130.180.160.150.14
rs247052T0.260.340.260.280.330.350.280.310.290.30
rs1534837A0.0420.0790.0460.0570.0110.0140.0650.0740.0550.080
rs714679G0.390.460.410.430.330.340.420.430.400.45
rs11947034C0.0330.0600.0410.0520.0540.0630.0610.0620.0500.079
rs1152490C0.180.240.180.210.200.210.210.230.210.23
Table 4

Association directions of SNPs that replicated in 2 of 3 independent populations

SNPCAMP/IlluminaCAMP TrioCRFHSiCAP
rs175725840.63 (0.47-0.87)0.710.610.82 (0.75-0.89)0.72 (0.49-1.07)
rs104893412.01 (1.19-3.42)1.871.831.33 (1.15-1.53)1.52 (0.92-2.52)
rs46536371.40 (1.17-1.67)1.251.241.04 (0.98-1.10)1.32 (1.06-1.64)
rs5309141.26 (1.06-1.51)1.241.231.06 (1.00-1.12)1.21 (0.98-1.50)
rs108167891.43 (1.13-1.80)1.361.261.20 (1.12-1.30)1.09 (0.81-1.46)
rs2470520.69 (0.57-0.84)0.770.850.84 (0.79-0.90)0.93 (0.74-1.17)
rs15348370.51 (0.34-0.76)0.570.530.85 (0.77-0.94)0.67 (0.43-1.04)
rs7146790.76 (0.63-0.91)0.750.860.94 (0.88-1.00)0.81 (0.66-1.01)
rs119470340.54 (0.35-0.86)0.550.670.98 (0.89-1.08)0.62 (0.39-0.98)
rs11524900.72 (0.57-0.90)0.750.850.89 (0.83-0.95)0.90 (0.70-1.16)

CAMP/Illumina, FHS, and iCAP columns contain ORs (95% confidence intervals). CAMP Trio and CR columns contain T:Us.

Minor Allele Frequencies of SNPs that replicated in 2 of 3 independent populations Association directions of SNPs that replicated in 2 of 3 independent populations CAMP/Illumina, FHS, and iCAP columns contain ORs (95% confidence intervals). CAMP Trio and CR columns contain T:Us. Three additional SNPs on chromosome 17q21 near ORMDL3 were evaluated in all populations using previously published results for CAMP/Illumina [15] and CAMP and CR trios [12], and novel results for CAMP Trio, FHS and iCAP [Table 5].
Table 5

Replication of 3 Chromosome 17 SNPs near ORMDL3 initially reported by Moffat et al. [5]

SNPBPCAMP/IlluminaCAMP TrioCAMP trios Reported in [12]CR Reported in [12]FHSiCAP
rs9303277352299950.0110.0414.0E-031.5E-058.6E-033.3E-03
rs8067378353048747.0E-030.107.0E-038.1E-060.0198.4E-03
rs7216389353234751.7E-030.150.139.3E-060.0230.040

Associations were in the same direction (data not shown). "CAMP Trio" refers to GWA results corresponding to the 403 trios reported in the current study. "CAMP trios Reported in [12]" refers to results from a previous candidate gene study corresponding to 444 trios.

Replication of 3 Chromosome 17 SNPs near ORMDL3 initially reported by Moffat et al. [5] Associations were in the same direction (data not shown). "CAMP Trio" refers to GWA results corresponding to the 403 trios reported in the current study. "CAMP trios Reported in [12]" refers to results from a previous candidate gene study corresponding to 444 trios.

Discussion

The CAMP population was designed for, and has often been used in, family-based candidate gene association studies [35-37]. More recently, GWA data have been acquired for a cohort of 403 Caucasian probands and their parents. This cohort has low statistical power to detect associations corresponding to the expected small (i.e. 1.1-1.3) effect sizes that have been observed to underlie many complex traits, including asthma [38]. We sought to increase statistical power by using the CAMP probands as cases in a case-control design, where controls were obtained from Illumina's iControlDB resource. Although we increased statistical power with the case-control design, which is composed of 359 cases and 846 controls, this design also remains underpowered to detect associations of small effect sizes. For example, for a SNP such as rs17572584, which has an MAF near 0.10 and an effect size of 1.6, the power to detect its association to asthma via a TDT in CAMP Trio is 0.37, while the power in CAMP/Illumina is 0.45. For a SNP such as rs4653637, which has an MAF near 0.40 and an effect size of 1.4, the power in CAMP Trio is 0.39, while the power in CAMP/Illumina is 0.47. Thus, despite a significant increase in power, neither design individually has a large enough number of subjects to make it adequate to detect most of the small effect sizes that are expected for asthma. In this work, we attempted to find out whether a combination of the results of the two designs would be helpful to find SNPs that replicated in independent populations, and hence, were likely to be truly associated with asthma. The top SNPs according to the results for individual designs did not overlap [Figure 2], and only 1/18 of the top SNPs that were genotyped for replication Stage 1 had a p-value < 0.05 [Table 1]. Because of the low p-values in either the CAMP/Illumina or CAMP Trio designs, we were more lenient with the replication threshold for these SNPs to be analyzed for Stage 2, and proceeded to consider the SNPs with p-value < 0.10 in CR. Based on the overall results in Table 1, only the three CAMP/Illumina SNPs on Chromosome 5 (Table 1, CAMP/Illumina Rank 1, 3, 14) show evidence of association to asthma. These three SNPs and the two others near them on Chromosome 5 (Table 1, CAMP/Illumina Rank 1-3, 5, 14) are near and within sequences of the PDE4D gene as we reported previously [15]. In this previous work, we attempted to replicate the top CAMP/Illumina findings in independent populations. We found that in 2/7 independent populations our results replicated with p < 0.05 and that in 5/7 the results did not replicate but had consistent effect directions with CAMP/Illumina, providing overall p-values in the independent populations that supported the association of PDE4D variants with asthma. These previous findings would have been more difficult to identify based on our current replication strategy because the CR population did not convincingly replicate the PDE4D associations. Thus, having the first stage be a filtering stage limits our ability to identify some of the SNPs that may be truly associated with asthma because they may not replicate in an initial independent population but would in other independent populations. This limitation may be overcome in future studies that pool GWA results across multiple populations. Most of the SNPs that passed replication Stage 1 were distributed closer to the nominal p-value thresholds than to the low p-value extremes of either CAMP/Illumina or CAMP Trio [Figure 2B in green]. The SNPs that passed replication Stage 2 were even closer to the nominal p-value thresholds than those of replication Stage 1 [Figure 2B in red]. Thus, SNPs with consistent results in the two initial study designs were more likely to replicate in independent populations than those with extreme p-values in either individual design. Tables 2, 3 and 4 detail the characteristics of the SNPs that passed replication Stage 2 (i.e. had p-value < 0.05 in CR and either FHS or iCAP, with consistent effect direction across all populations), and hence, are most likely to be truly associated with asthma. Most of these SNPs are in or near genes [Table 2], making them more likely to have biological relevance than SNPs that are often found to be associated with asthma and are in gene deserts. The top SNP (rs17572584) in Table 2 has an overall p-value across the independent populations that is significant after multiple comparisons corrections for the number of SNPs that were genotyped in Stage 1 (p-value = 0.048). If the results for this SNP in the independent populations are combined with those of the primary CAMP studies, then the overall p-value would be 9.7E-07 with CAMP/Illumina and 8.3E-06 with CAMP Trio. The rs17572584 SNP is downstream of the carbohydrate (N-acetylglucosamine-6-O) sulfotransferase 2 gene (CHST2 [MIM 603798]), which encodes a sulfotransferase for 6-sulfated glycan synthesis. Transcription of this gene has been found to be coordinated with that of NF-kappaB and GATA-3, both of which are involved in Th2 cell differentiation [39]. Because Th2 cells are prominent in asthma, especially allergic asthma, genes such as CHST2 that modulate Th2 cell differentiation are likely to play a role in asthma susceptibility. Further, the CHST2 protein product has been shown to be expressed in the human lung [40]. Even though CHST2 is a plausible candidate gene for asthma, rs17572584 is located downstream of this gene, and hence, further study is needed to find out whether the association we measured is stronger in SNPs located nearer or within the gene across multiple populations. However, identification of this association using either CAMP/Illumina or CAMP Trio alone would have been difficult since this SNP ranked 2,539 and 22,513 in these primary studies, respectively. The fact that our strongest associations were located in regions that were not at the lowest p-value extremes, but were at nominally significant levels, reflects the potential of small populations to contain useful genetic associations that can be found by increases in power. As stated previously, this power limitation may be overcome in future studies that pool GWA results across multiple populations. Currently, among regions reported to be associated with asthma, the one on chromosome 17q21 near ORMDL3 [MIM 610075] [5-12] has been the most consistently replicated. We previously reported [15] that nine CAMP/Illumina SNPs support the original association findings by Moffatt et al. in this region [5], by having p-values < 0.05. Of these nine SNPs, only one (rs9303277) has a CAMP Trio p-value < 0.05. However, this SNP was excluded from our Replication Stage 1 because its CAMP/Illumina p-value was equal to 0.011 (i.e. was greater than our 0.010 cutoff). In a separate study of association in the 17q21 region, rs9303277 was reported to have a p-value = 4.0E-03 in CAMP trios and = 1.5E-05 in CR trios [12]. The lower CAMP p-value reported in [12] was obtained using TaqMan SNP Genotyping Assays (Applied Biosystems) data corresponding to 444 white trios, and not the GWA data corresponding to 403 trios reported in the current study. The difference in CAMP p-values as a result of using a slightly increased number of trios and different genotyping platforms is another reflection of the low power of CAMP Trio, and suggests that additional SNPs with more modest p-values than those considered by our replication strategy are truly associated with asthma. Because the 17q21 region near ORMDL3 has been consistently replicated in so many studies, including CAMP and CR, we evaluated the association for three of the SNPs originally reported by Moffat, et al. in iCAP and FHS. As Table 5 shows, these three SNPs replicated with nominal p-values < 0.05 in these two populations, growing the total number of independent populations replicating this region's results to over 12. Additionally, the replication of the chromosome 17q21 results in FHS and iCAP helps validate the use of these populations, which were not ascertained to have asthma using as strict criteria as CAMP and CR, in asthma association studies. In previous work that developed a method to identify reproducible associations in small affected trio-based cohorts, two CAMP Trio SNPs were found to reach genome-wide significance according to the developed criteria [41]. These SNPs (rs10863712 and rs1294497) ranked first and second according to an initial power screen and had FBAT p-values that passed the corresponding genome-wide significance thresholds (i.e. p-value < 0.005). The CAMP/Illumina p-values (ranks) for these two SNPs were 0.28 (149891) and 0.049 (27336) for rs10863712 and rs1294497, respectively. The CAMP Trio p-values (ranks) for these two SNPs were 0.0032 (1684) and 0.0048 (2547) for rs10863712 and rs1294497, respectively. Thus, neither of these SNPs met the criteria for replication Stage 1 and neither was tested for replication in our independent cohorts. Future attempts to replicate the top SNPs according to the power-based screening methodology would be helpful to compare its utility with that of the current approach of combining case-control and family-based results. Our study has limitations similar to those of many other GWA studies. In addition to having low power to detect associations and some missing genotypic data for all of our populations, there were differences in the way our populations were ascertained to have asthma. CAMP and CR are the most similar populations, as they are composed of children with asthma who were carefully ascertained for asthma studies. However, the two populations are ethnically different and have markedly different environmental exposures. The FHS population was not initially gathered for asthma research, but asthma status has been assessed for its participants based on self-reported doctor's diagnosis. The iCAP cohort was selected based on data, including ICD-9 codes for asthma and medication history, extracted from de-identified EMRs. The diversity of our populations increases the certitude that the SNPs that are in Table 2 are general asthma variants, rather than variants that are associated to asthma sub-phenotypes that characterize each of the populations. Any errors in the assignment of cases and controls would decrease our ability to find associations, and thus, does not increase the likelihood that the SNPs in Table 2 are false positive associations.

Conclusions

Our results suggest that SNPs with similar case-control and family-based association results in designs that share probands are those that are most likely to replicate in independent populations, rather than the SNPs with the smallest p-values in either case-control or family-based designs alone. Thus, using probands from family-based studies in case-control designs, and combining results of both approaches, may be a way to augment our ability to find SNPs associated with asthma and other complex diseases.

Competing interests

EKS received an honorarium for a talk on COPD genetics in 2006, and grant support and consulting fees from GlaxoSmithKline for two studies of COPD genetics. EKS received an honorarium from Bayer for a symposium at the ERS Meeting in 2005. EKS received honoraria in 2007 and 2008 and consulting fees from AstraZeneca. The remaining authors declare that there are no conflicts of interest to disclose.

Authors' contributions

BEH participated in the design of the study, performed the statistical analysis and drafted the manuscript. ACW and BAR participated in the design of the study. BK performed genotyping experiments for CAMP and CR and helped draft the manuscript. JL, AJM, RL and CL performed the statistical analysis. JBW and GTO provided and analyzed data for the FHS population and helped draft the manuscript. GMH, MES, LA, and JCC provided data for the CR population and helped draft the manuscript. EKS, and STW participated in the design of the study and helped draft the manuscript. All authors read and approved the final manuscript.

Pre-publication history

The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2350/11/122/prepub

Additional file 1

Table S1. CAMP/Illumina and CAMP Trio SNPs that replicated in CR with p-value < 0.05. Click here for file
  38 in total

1.  PBAT: tools for family-based association studies.

Authors:  Christoph Lange; Dawn DeMeo; Edwin K Silverman; Scott T Weiss; Nan M Laird
Journal:  Am J Hum Genet       Date:  2004-02       Impact factor: 11.025

2.  Genetic Power Calculator: design of linkage and association genetic mapping studies of complex traits.

Authors:  S Purcell; S S Cherny; P C Sham
Journal:  Bioinformatics       Date:  2003-01       Impact factor: 6.937

3.  The IL12B gene is associated with asthma.

Authors:  Adrienne G Randolph; Christoph Lange; Edwin K Silverman; Ross Lazarus; Eric S Silverman; Benjamin Raby; Alison Brown; Al Ozonoff; Brent Richter; Scott T Weiss
Journal:  Am J Hum Genet       Date:  2004-08-20       Impact factor: 11.025

4.  Allele-specific chromatin remodeling in the ZPBP2/GSDMB/ORMDL3 locus associated with the risk of asthma and autoimmune disease.

Authors:  Dominique J Verlaan; Soizik Berlivet; Gary M Hunninghake; Anne-Marie Madore; Mathieu Larivière; Sanny Moussette; Elin Grundberg; Tony Kwan; Manon Ouimet; Bing Ge; Rose Hoberman; Marcin Swiatek; Joana Dias; Kevin C L Lam; Vonda Koka; Eef Harmsen; Manuel Soto-Quiros; Lydiana Avila; Juan C Celedón; Scott T Weiss; Ken Dewar; Daniel Sinnett; Catherine Laprise; Benjamin A Raby; Tomi Pastinen; Anna K Naumova
Journal:  Am J Hum Genet       Date:  2009-09       Impact factor: 11.025

5.  Univariate and multivariate family-based association analysis of the IL-13 ARG130GLN polymorphism in the Childhood Asthma Management Program.

Authors:  D L DeMeo; C Lange; E K Silverman; J M Senter; J M Drazen; M J Barth; N Laird; S T Weiss
Journal:  Genet Epidemiol       Date:  2002-11       Impact factor: 2.135

6.  Genome-wide association study of asthma identifies RAD50-IL13 and HLA-DR/DQ regions.

Authors:  Xingnan Li; Timothy D Howard; Siqun L Zheng; Tmirah Haselkorn; Stephen P Peters; Deborah A Meyers; Eugene R Bleecker
Journal:  J Allergy Clin Immunol       Date:  2010-02       Impact factor: 10.793

7.  Assessing the reproducibility of asthma candidate gene associations, using genome-wide data.

Authors:  Angela J Rogers; Benjamin A Raby; Jessica A Lasky-Su; Amy Murphy; Ross Lazarus; Barbara J Klanderman; Jody S Sylvia; John P Ziniti; Christoph Lange; Juan C Celedón; Edwin K Silverman; Scott T Weiss
Journal:  Am J Respir Crit Care Med       Date:  2009-03-05       Impact factor: 21.405

8.  Family-based association analysis of beta2-adrenergic receptor polymorphisms in the childhood asthma management program.

Authors:  Edwin K Silverman; David J Kwiatkowski; Jody S Sylvia; Ross Lazarus; Jeffrey M Drazen; Christoph Lange; Nan M Laird; Scott T Weiss
Journal:  J Allergy Clin Immunol       Date:  2003-11       Impact factor: 10.793

9.  Variants of DENND1B associated with asthma in children.

Authors:  Patrick M A Sleiman; James Flory; Marcin Imielinski; Jonathan P Bradfield; Kiran Annaiah; Saffron A G Willis-Owen; Kai Wang; Nicholas M Rafaels; Sven Michel; Klaus Bonnelykke; Haitao Zhang; Cecilia E Kim; Edward C Frackelton; Joseph T Glessner; Cuiping Hou; F George Otieno; Erin Santa; Kelly Thomas; Ryan M Smith; Wendy R Glaberson; Maria Garris; Rosetta M Chiavacci; Terri H Beaty; Ingo Ruczinski; Jordan S Orange; Jordan M Orange; Julian Allen; Jonathan M Spergel; Robert Grundmeier; Rasika A Mathias; Jason D Christie; Erika von Mutius; William O C Cookson; Michael Kabesch; Miriam F Moffatt; Michael M Grunstein; Kathleen C Barnes; Marcella Devoto; Mark Magnusson; Hongzhe Li; Struan F A Grant; Hans Bisgaard; Hakon Hakonarson
Journal:  N Engl J Med       Date:  2009-12-23       Impact factor: 91.245

10.  Genome-wide association study implicates chromosome 9q21.31 as a susceptibility locus for asthma in mexican children.

Authors:  Dana B Hancock; Isabelle Romieu; Min Shi; Juan-Jose Sienra-Monge; Hao Wu; Grace Y Chiu; Huiling Li; Blanca Estela del Rio-Navarro; Saffron A G Willis-Owen; Saffron A G Willis-Owens; Scott T Weiss; Benjamin A Raby; Hong Gao; Celeste Eng; Rocio Chapela; Esteban G Burchard; Hua Tang; Patrick F Sullivan; Stephanie J London
Journal:  PLoS Genet       Date:  2009-08-28       Impact factor: 5.917

View more
  9 in total

Review 1.  The state of genome-wide association studies in pulmonary disease: a new perspective.

Authors:  Jamie L Todd; David B Goldstein; Dongliang Ge; Jason Christie; Scott M Palmer
Journal:  Am J Respir Crit Care Med       Date:  2011-07-28       Impact factor: 21.405

2.  Reconstructing recent population history while mapping rare variants using haplotypes.

Authors:  Ural Yunusbaev; Albert Valeev; Milyausha Yunusbaeva; Hyung Wook Kwon; Reedik Mägi; Mait Metspalu; Bayazit Yunusbayev
Journal:  Sci Rep       Date:  2019-04-10       Impact factor: 4.379

3.  Association of SERPINE2 with asthma.

Authors:  Blanca E Himes; Barbara Klanderman; John Ziniti; Jody Senter-Sylvia; Manuel E Soto-Quiros; Lydiana Avila; Juan C Celedón; Christoph Lange; Thomas J Mariani; Jessica Lasky-Su; Craig P Hersh; Benjamin A Raby; Edwin K Silverman; Scott T Weiss; Dawn L DeMeo
Journal:  Chest       Date:  2011-03-24       Impact factor: 9.410

4.  A signal peptide missense mutation associated with nicotine dependence alters α2*-nicotinic acetylcholine receptor function.

Authors:  Bhagirathi Dash; Ronald J Lukas; Ming D Li
Journal:  Neuropharmacology       Date:  2014-01-24       Impact factor: 5.250

Review 5.  Genetics and Epigenetics in Asthma.

Authors:  Polyxeni Ntontsi; Andreas Photiades; Eleftherios Zervas; Georgina Xanthou; Konstantinos Samitas
Journal:  Int J Mol Sci       Date:  2021-02-27       Impact factor: 5.923

6.  Novel eosinophilic gene expression networks associated with IgE in two distinct asthma populations.

Authors:  Yamini V Virkud; Rachel S Kelly; Damien C Croteau-Chonka; Juan C Celedón; Amber Dahlin; Lydiana Avila; Benjamin A Raby; Scott T Weiss; Jessica A Lasky-Su
Journal:  Clin Exp Allergy       Date:  2018-11-21       Impact factor: 5.018

Review 7.  Lessons from ten years of genome-wide association studies of asthma.

Authors:  Cristina T Vicente; Joana A Revez; Manuel A R Ferreira
Journal:  Clin Transl Immunology       Date:  2017-12-15

Review 8.  Insights Into Nicotinic Receptor Signaling in Nicotine Addiction: Implications for Prevention and Treatment.

Authors:  Wuyi Liu; Ming D Li
Journal:  Curr Neuropharmacol       Date:  2018       Impact factor: 7.363

9.  Pharmacometabolomics of Bronchodilator Response in Asthma and the Role of Age-Metabolite Interactions.

Authors:  Rachel S Kelly; Joanne E Sordillo; Sharon M Lutz; Lydiana Avila; Manuel Soto-Quiros; Juan C Celedón; Michael J McGeachie; Amber Dahlin; Kelan Tantisira; Mengna Huang; Clary B Clish; Scott T Weiss; Jessica Lasky-Su; Ann Chen Wu
Journal:  Metabolites       Date:  2019-09-07
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.