Literature DB >> 30554485

Clinical Implications of Single Nucleotide Polymorphisms in Diagnosis of Asthma and its Subtypes.

Jong Sook Park¹, Ji Hye Son², Choon Sik Park¹, Hun Soo Chang³.

Abstract

For the past three decades, a large number of genetic studies have been performed to examine genetic variants associated with asthma and its subtypes in hopes of gaining better understanding of the mechanisms underlying disease pathology and to identify genetic biomarkers predictive of disease outcomes. Various methods have been used to achieve these objectives, including linkage analysis, candidate gene polymorphism analysis, and genome-wide association studies (GWAS); however, the degree to which genetic variants contribute to asthma pathogenesis has proven to be much less significant than originally expected. Subsequent application of GWAS to well-defined phenotypes, such as occupational asthma and non-steroidal anti-inflammatory drugexacerbated respiratory diseases, has overcome some of these limitations, although with only partial success. Recently, a combinatorial analysis of single nucleotide polymorphisms (SNPs) identified by GWAS has been used to develop sets of genetic markers able to more accurately stratify asthma subtypes. In this review, we discuss the implications of the identified SNPs in diagnosis of asthma and its subtypes and the progress being made in combinatorial analysis of genetic variants. © Copyright: Yonsei University College of Medicine 2019.

Entities: Chemical Disease Gene Species

Keywords: Asthma; aspirin; biomarkers; genetic techniques; non-steroidal anti-inflammatory agents; single nucleotide polymorphism

Mesh：

Substances：
Genetic Markers

Year: 2019 PMID： 30554485 PMCID： PMC6298887 DOI： 10.3349/ymj.2019.60.1.1

Source DB: PubMed Journal: Yonsei Med J ISSN： 0513-5796 Impact factor: 2.759

INTRODUCTION

Asthma and its associated traits exhibit a strong familial connection, indicating that a genetic component likely contributes to the disease pathology. Twin studies are commonly used to determine whether complex multifactorial diseases, such as asthma, have a measurable genetic component. In a study of 7000 monozygotic twins performed over 100 years ago, the concordance rate for asthma between monozygotic twin pairs was 19%, nearly four times higher than that of dizygotic twins (4.8%).1 The overall heritability of asthma has since been estimated at anywhere from 15–60%, and genetic variants, such as single nucleotide polymorphisms (SNPs), are presumed to play an important role in the etiology of the disease. To identify SNPs in multifactorial complex diseases, several approaches, including genome-wide linkage studies, biologically plausible candidate gene association studies, and genome-wide association studies (GWAS) have been performed, revealing more than 100 loci on autosomal and sex chromosomes to be linked to asthma. Linkage analysis has identified SNPs on several genes, including ADAM33,2 DPP10,3 PHF11,4 GPR,5 and PTGDR,6 as strong genetic variants for asthma. Over the past decade, several GWAS have been performed, examining 300K–500K SNPs and producing a nearly complete survey of all common genetic variability.7 In 2010, the GABRIEL consortium used GWAS to confirm the association of asthma with many of the previously identified SNPs associated with antigen presentation (HLA-DR/DQ), inflammation (ORMDL3-GSDMB: Gasdermin B), and TH1/TH2 immune responses (IL33, IL-1RL1-IL18R1, RAD50-IL13, and TSLP-WDR36).8 However, the odds ratios (ORs) for these markers ranged from 0.5–2.0, much lower than originally expected. Furthermore, combination of these seven SNPs for the classification of childhood asthma produced a sensitivity of only 35% and a specificity of 75%, indicating these markers as poor predictors of asthma overall. Despite the poor predictive power suggested by the GABRIEL consortium study,8 analysis of genetic variants associated with asthma continues to flourish. As of December 2017, 3437 publications were identified in the PubMed database using the searching terms “asthma” and “polymorphism.” Interestingly, the ORs for most of the associated SNPs ranged between 0.5 and 2.0, indicating a much smaller genetic component in the development of asthma than initially expected. Possible explanations for this poor predictive power included a lack of rare allele variants in the GWAS database, imprecise disease phenotypes, and incomplete consideration of environmental factors.9 Furthermore, a more extensive combinatorial analysis using the identified SNPs has not been thoroughly studied to date. The present review focuses on the genetic impact of rare allele variants on asthma and the clinical usefulness of combinational analysis using identified SNPs on nonsteroidal anti-inflammatory drug-exacerbated respiratory diseases (NERD).

GENETIC EFFECT OF RARE VARIANTS IN ASTHMA

Until recently, much of the speculation regarding the missing heritability not detected using standard GWAS has focused on the possible contribution of rare variants [minor allele frequency (MAF) <0.5%], as only common variants (MAF>5%) were available in previous GWAS datasets. For more information on these rare variants, the 1000 Genomes Project sequenced 1000 individual genomes, revealing more than 15 million new SNPs, 1 million short insertions and deletions, and 20000 structural variants10. More recently, Igartua, et al.11 investigated associations among rare (<1%) and low-frequency (1–5%) variants using a Human Exome Bead Chip array containing 246139 SNPs in ~5000 asthmatic patients and 5000 controls, representing a broad cross section of European, African American/African Caribbean, and Latino subjects. This analysis revealed a number of rare variants for asthma to be associated with ethnicity, including SNPs in GRASP (general receptor for phosphoinositides 1-associated scaffold protein), MTHFR (methylenetetrahydrofolate reductase), and GSDMB in association with asthma in Latino, African, and all ethnicities, respectively. These data suggest that associations with rare and low-frequency variants are specific for each ethnicity and not likely to explain a significant proportion of the genetic variants for asthma.

IMPROVEMENT OF GENETIC EFFECTS ACCORDING TO STRATIFICATION OF PHENOTYPES

Because asthma is a heterogeneous disease, use of well-defined sub-phenotypes may improve the genetic power of SNPs. Although asthma has been traditionally divided into two major pathways based on the mechanism of pathogenesis,12 further subgrouping is possible based on the factors that trigger asthma symptoms,13 including allergic asthma, exercise-induced asthma, NERD, occupational asthma, menstruation-associated asthma, and others. Furthermore, inflammatory patterns in sputum and peripheral blood analysis can be used to divide asthma into four groups: eosinophilic, neutrophilic, mixed, and paucigranulocytic types. Clinically and physiologically, asthmatics are also stratified into well controlled, exacerbation-prone, and remodeled asthma. Recently, biological classification has been introduced on the basis of molecular pathogenesis. Candidate endotypes include several biological mediators and functional immune subsets, including TH1, TH2, TH17, innate TH2, epithelial, and smooth muscle dysfunctions, as well as others (Table 1). Identification of genetic associations would therefore have to be studied using cluster analysis of well-defined sub-phenotypes, with patients stratified based on clinical manifestations and endotypes.1415 Furthermore, quantitative loads of environmental factors should also be considered, as asthma is known to be triggered or exacerbated by a wide range of compounds, including air pollutants,16 occupational materials,1718 nutritional status1920 and food allergens.21222324 Of these factors, occupation is one of the most easily accessible triggers. In a GWAS using the well-defined Korean toluene diisocyanate (TDI)-induced asthmatic cohort,25 SNPs in CTNNA3 (catenin alpha 3, alpha-T catenin) were significantly associated with the risk of TDI-induced asthma (OR=5.84 for rs10762058). The attributable fraction increased up to 24%, indicating that the missing heritability of asthma has been solved in part by the consideration of environmental variables. Very recently, homozygosity of the minor allele SNPs rs10762058 and rs7088181 was shown to increase the risk for di-isocyanate asthma in Caucasian workers, with an OR=9.05 [95% confidence interval (CI): 1.69–48.54],26 a level high enough to be considered useful as a genetic marker. Another example is that of drug-induced reactions in susceptible persons, such as those with NERD. Initial genetic association studies of NERD patients used a set of biologically plausible genes associated with the over- or under-production of arachidonic acid metabolites. Genes such as LTC4S, ALOX5, NAT2, CysLTR1, and CysLTR2, all members of the cysteinyl leukotriene pathway, exhibited several NERD-associated SNPs, with ORs ranging from 1.88 to 9.78.1827 Genes of prostaglandin and thromboxane pathways discovered using GWAS28 also showed good ORs, although not at levels high enough for use as a standalone diagnostic biomarker

Table 1

Classification of Asthma Phenotypes according to Etiologic Agent, Clinical and Inflammatory Patterns, and Molecular Mechanisms

1. Phenotypes related to disease etiology

Aspirin or non-steroidal anti-inflammatory drugs

Environmental allergens

Occupational allergens or irritants

Estrogens

Exercise

Viral infection

Obesity

2. Clinical or physiological phenotypes

Severity-defined: from mild to severe

Exacerbation-prone: brittle vs. non-brittle

Defined by chronic airflow restriction: remodeled

Treatment-resistant: resistance to steroids

Defined by age at onset: childhood, adulthood, old age

3. Inflammatory phenotypes

Eosinophilic

Neutrophilic

Pauci-granulocytic

4. Molecular phenotypes

T_H1 dominant

T_H2 dominant

T_H17 dominant

Innate T_H2 dominant

Epithelial dysfunction

Smooth muscle dysfunction

IMPROVEMENT IN DIAG NOSTIC VALUES BY COMBINATORIAL ANALYSIS OF MULTIPLE SNPS OBTAINED FROM GWAS

The method most often used for measuring predictive factors is the area under a receiver operator characteristic (ROC) curve (Fig. 1).29 The area under this curve (AUC) is frequently used to discriminate between those with and without disease. AUC values range from 0.5 (providing no discrimination between the two conditions) to 1.0 (perfect discrimination), with ideal classification exhibiting almost square curves, indicative of high true-positive and low false-positive rates. Disease risk prediction is frequently described using a combination of sensitivity, specificity, and accuracy. In order to obtain a sensitivity and specificity of 80% or more, which is typically required for clinical utility, the absolute OR should be greater than 10 (Fig. 1). AUC values predictive of a hypothetical condition carry modest (1.5), sizeable (10), and large (50) ORs, showing false-positive fractions at 80% sensitivity (dotted line; fractions are >75, >25, and <10%, respectively) are shown in Fig. 1.30

Fig. 1

Area under receiver operator characteristic (ROC) curve values predictive of a hypothetical condition carry modest (1.5), sizeable (10), and large (50) odds ratios (ORs), showing false-positive fractions at 80% sensitivity (dotted line; fractions are >75, >25, and <10%, respectively). The graph demonstrates that very large ORs are needed to provide acceptably low false-positive fractions. Reprinted from Jakobsdottir, et al. PLoS Genet 2009;5:e1000337, with permission of Jakobsdottir, et al.30

To obtain diagnostic marker sets of SNPs with an OR >10, combinatory analyses were performed on SNPs from GWAS. The first GWAS of 80 NERD subjects and 100 aspirin-tolerant asthma (ATA) subjects was performed using 100K Bead Chips, covering only a small portion of genetic variants. Despite this limitation, eleven genes were identified as having p values <7×10−5. One hundred fifty non-monomorphic tagging-SNPs in and around the 11 identified genes were genotyped on 163 NERD and 429 ATA as the second-stage mapping.31 Among the SNPs, rs7572857 G>A (Gly74Ser) of CEP68 was most significantly associated with the risk of NERD (p=6.06×10−5, 95% CI=1.64−4.21, OR=2.63). Because the ratio of ATA and NERD subjects varies among studies, the number of NERD and ATA patients should be normalized according to the prevalence of NERD in a given population. When we applied the incidence of positive response in oral aspirin challenge (16.03%) in Korean adult asthmatics,32 the diagnostic value of rs7572857 G>A on CEP68 was not high enough to justify clinical application due to the low accuracy (53%), sensitivity (18%), and specificity (88%) of this marker. Taken together, these data highlight the need for better methods capable of increasing the discriminating power of individual SNPs, as well as the evaluation of combinatorial effects of SNPs on disease resolution.33 Using the 10 SNPs with the lowest p values from the 1st GWAS of NERD patients,31 relative risk (RR) was calculated for each individual subject using a multiple logistic regression analysis of the 1023 possible combinations (210−1) of SNPs.34 The ROC curve analysis demonstrated that a combinatorial set of eight SNPs was the best model for maximizing the AUC (0.9) for distinguishing NERD from ATA (Fig. 2). This model exhibited 82% accuracy, 78% sensitivity, and 88% specificity with an OR of 20.74 (p=3.24×10−19).34 Each of the eight SNPs, including Parkin RBR E3 Ubiquitin Protein Ligase, DDB1 and CUL4 Associated Factor 4, Vaccinia Related Kinase 2, Carbohydrate Sulfotransferase 11, Long Intergenic Non-Protein Coding RNA 314, PDZ Domain Containing 2, Small G Protein Signaling Modulator 1, and Centromere Protein F exhibited ORs ranging from 0.32 to 2.91 (Table 2). These data suggest that the diagnostic accuracy of the SNPs as genetic markers is robustly increased by combinatorial analysis, compared to that of single SNPs alone.

Fig. 2

ROC curves used to devise the best combinatorial model using the 10 most statistically significant SNPs. Values were taken from the first genomewide association study, which used 100K Bead Chips, to examine 80 NERD and 100 ATA subjects. RR was calculated for each individual subject using a multiple logistic regression analysis examining all 1023 combinations (210−1) of the 10 SNPs described in reference 34. A model consisting of eight SNPs shows the highest area under the ROC curve of 0.9 with an accuracy of 82.01%. The sensitivity and specificity are 78% and 88%, respectively, with an odds ratio of 20.74 (p=3.24E^−19). The eight SNPs are listed in Table 2. Reprinted from Shin, et al. DNA Cell Biol 2012;31:1604–9, with permission of Mary Ann Liebert, Inc.34 ROC, receiver operator characteristic; SNP, single nucleotide polymorphism; NERD, nonsteroidal antiinflammatory drug-exacerbated respiratory diseases; ATA, aspirin-tolerant asthma; RR, relative risk.

Table 2

List of the Eight SNPs in Fig. 2 and their RRs for Diagnosis of Nonsteroidal Anti-Inflammatory Drug-Exacerbated Respiratory Diseases

SNP	Gene	Position	Alleles	p value	OR	RR
SNP	Gene	Position	Alleles	p value	OR	CC	CM	MM
rs6911768	PARK2	Intron	A>C	6.75E-05	2.901	0.43	1.248	3.62
rs3213729	WDR21	5′ UTR	C>G	0.000411	2.724	0.507	1.382	3.764
rs6714952	VRK2	Intron	C>T	0.000723	2.486	0.526	1.307	3.248
rs746035	CHST11	Intron	C>T	0.000532	2.292	0.492	1.127	2.583
rs4501026	LINC00314	Intergenic	A>C	4.76E-05	0.4198	1.44	0.604	0.254
rs4867084	PDZK3	Intron	G>A	0.00017	0.4177	1.269	0.53	0.221
rs139719	SGSM1	Intron	C>T	3.18E-05	0.4109	1.416	0.582	0.239
rs6659655	CENPF	Intergenic	A>G	3.49E-05	0.3181	1.159	0.369	0.117

SNP, single nucleotide polymorphism; RR, relative risk; OR, odds ratio; PARK2, Parkin RBR E3 Ubiquitin Protein Ligase; WDR21, DDB1 and CUL4 Associated Factor 4; VRK2, Vaccinia Related Kinase 2; CHST11, Carbohydrate Sulfotransferase 11; LINC00314, Long Intergenic Non-Protein Coding RNA 314; PDZK3, PDZ Domain Containing 2; SGSM1, Small G Protein Signaling Modulator 1; CENPF, Centromere Protein F.

Reprinted from Shin, et al. DNA Cell Biol 2012;31:1604–9, with permission of Mary Ann Liebert, Inc.34

In the second GWAS using the 660K chip, as well as a replication study on 141 subjects with NERD and 996 ATA subjects,28 rs1042151 (Met105Val) in exon 2 of HLA-DPB1 showed the highest susceptibility to NERD (p=5.11×10−7; OR=2.40). The rs1042151 acts as a potential cis regulator of the expression of HLA-DPB1 with an expression quantitative trait loci (eQTL) score of 36.83, as calculated using the eQTL browser (http://eqtl.uchicago.edu/cgi-bin/gbrowse/eqtl/). Further analysis of SNP function (https://snpinfo.niehs.nih.gov/snpinfo/snpfunc.html) indicated that the rs1042151 SNP is located in an exonic splicing enhancer region; however, despite the location of this SNP, its true effect remains questionable, as rs1042151 alone exhibits very low accuracy (54.7%) and sensitivity (17.7%) for predicting NERD, despite strong specificity (91.7%). Furthermore, because this GWAS was limited to common variants (MAF>5%) with minimal coverage of exonic SNPs,35 a second NERD-associated SNP study was extended to exonic SNPs using an Exome BeadChip assay containing 240K SNP.36 In this analysis, another exonic SNP (exm537513 in HLA-DPB1, rs1042136) showed the lowest p value (p=3.4×10−8, OR: 3.28) in association with the risk of NERD, although with low sensitivity (16.7%) and accuracy (55.5%) as before, in spite of relatively high specificity (94.3%). These data suggest that SNPs on HLA-DPB1 should contribute to the development of NERD. but are not clinically useful as genetic markers of NERD. In the exon-wide association study, 16 of the top 20 SNPs were located on chromosome 6 (Table 3). Among them, six SNPs were exonic and located on HLA-DPB1 [exm537513 (rs1042136, I94L), exm537522 (re1042151, M105V), and exm537523 (rs1042153, M105I)], HLA-DPA1 [exm-rs3097671 (silent) and exm537485 (rs1126504, L327V)], and HLA-DPB2 [exm-rs3129294 (silent)]. Furthermore, a strong linkage disequilibrium (LD) was noted between the six exonic SNPs in the HLA genes (Fig. 3). AUCs for the combinations of these SNPs (26−1) were calculated by multiple logistic regression analysis, with a combination of four SNPs (exm537513, exm537522, exm-rs3097671, and exmrs3129294) exhibiting the best AUC value of 0.629 with OR=3.677, 68.4% accuracy, 29.8% sensitivity, and 90.3% specificity. To further enhance the diagnostic value of the exonic SNPs, the other six exonic SNPs were selected according to their p values (marked with # in Table 3) and added to the four HLA SNPs. A multiple logistic regression analysis was undertaken for all combinations of these 10 SNPs under the additive mode, yielding a total of 1023 models (210−1) generated. From this analysis, a combination of seven SNPs (exm537513, exm83523, exm1884673, exm538564, exm2264237, exm396794, and exm791954) showed the best AUC of 0.75 (p=7.94×10−21), with OR=7.291, 40.8% sensitivity, 88.0% specificity, and 75% accuracy for the discrimination of NERD from ATA. To date, 2161 SNPs in HLA-DPB1 and 2602 SNPs in HLA-DPA1 have been identified (http://www.ncbi.nlm.nih.gov/snp/?term=HLA-DPB1 and http://www.ncbi.nlm.nih.gov/snp/?term=HLA-DPA1), of which 239 and 193 exonic SNPs are present in HLA-DPB1 and HLA-DPA1, respectively. This indicates that the number of SNPs within the 660K GWAS and 240K exon chips used in our and other studies is still not sufficient to discover all of the SNPs associated with NERD. Thus, further discovery of SNPs in HLA-DPB1 and HLA-DPA1 will be necessary for the development of diagnostic genetic markers for NERD.

Table 3

List of the Top 30 SNPs Associated with NERD

n	name	rs#	Allele	Chr	Position	Gene(s)	in EXON	Mutation(s)	MAF (NERD, n=165)	MAF (ATA, n=397)	OR (95% CI)	p value
1	exm537513	rs1042136	A>C	6	33048628	HLA-DPB1^,*	EXON	I94L	0.167	0.057	3.28 (2.13–5.04)	3.40E-08
2	exm537522	rs1042151	A>G	6	33048661	HLA-DPB1^*	EXON	M105V	0.194	0.087	(1.71–3.67)	2.38E-06
3	exm-rs3097671	rs3097671	G>C	6	33047612	HLA-DPB1^*	.	Silent	0.191	0.088	2.47 (1.69–3.62)	3.19E-06
4	exm-rs2179920	rs2179920	G>A	6	33058874	.	.	.	0.194	0.089	2.45 (1.67–3.58)	3.95E-06
5	exm-rs2281389	rs2281389	T>C	6	33059796	.	.	.	0.191	0.091	2.43 (1.66–3.55)	4.62E-06
6	exm537485	rs1126504	C>G	6	33048457	HLA-DPB1	EXON	Silent, L37V	0.285	0.166	2.17 (1.55–3.02)	5.32E-06
7	exm-rs2064478	rs2064478	G>A	6	33072266	.	.	.	0.194	0.092	2.40 (1.64–3.51)	5.88E-06
8	exm-rs3117230	rs3117230	T>C	6	33075635	.	.	.	0.194	0.092	2.40 (1.64–3.51)	5.88E-06
9	exm-rs2064474	rs2064474	C>T	6	33073463	.	.	.	0.197	0.094	2.38 (1.63–3.46)	5.92E-06
10	exm-rs3129270	rs3129270	G>A	6	33097423	.	.	.	0.088	0.026	4.09 (2.20–7.60)	6.00E-06
11	exm-rs3129294	rs3129294	T>G	6	33084671	HLA-DPB2^*	.	Silent	0.324	0.207	1.91 (1.41–2.58)	2.75E-05
12	exm83523	rs3818822	G>A	1	111857208	CHIA#,^**	EXON	Silent, G102R	0.133	0.064	2.55 (1.64–3.95)	3.00E-05
13	exm-rs3116999	rs3116999	C>G	6	33097166	.	.	.	0.127	0.058	2.74 (1.70–4.41)	3.58E-05
14	exm537523	rs1042153	G>A	6	33048663	HLA-DPB1	EXON	M105I	0.258	0.152	2.04 (1.45–2.87)	3.91E-05
15	exm1884673	rs118191745	A>G	17	38241000	THRA^#,**	EXON	I170V	0.033	0.003	13.61 (2.89–63.96)	6.20E-05
16	exm-rs3130267	rs3130267	T>G	6	33306794	.	.	.	0.418	0.301	1.78 (1.34–2.36)	6.80E-05
17	exm538564	rs3130100	T>C	6	33283766	ZBTB22^#,**	EXON	T310A	0.385	0.273	1.79 (1.33–2.39)	9.25E-05
18	exm-rs3106192	rs3106192	T>C	6	33326250	.	.	.	0.400	0.288	1.75 (1.32–2.32)	9.59E-05
19	exm2264237	rs1074373	A>C	7	30616761	LOC401320 ^#,**	.	Silent	0.330	0.438	0.57 (0.42–0.76)	0.0001
20	exm396794	rs1047626	G>A	4	42003671	SLC30A9 ^#,**	EXON	M50V	0.009	0.052	0.13 (0.03–0.53)	0.0001
21	exm-rs3117324	rs3117324	C>A	6	33324527	.	.	.	0.433	0.320	1.69 (1.28–2.23)	0.0002
22	exm-rs3128927	rs3128927	C>T	6	33074288	.	.	.	0.258	0.162	1.87 (1.35–2.61)	0.0002
23	exm791954	rs7862221	T>C	9	135782221	TSC1^#,**	EXON	E444E, E394E, E445E	0.130	0.063	2.38 (1.50–3.76)	0.0002
24	exm-rs455567	rs455567	C>T	6	33252115	WDR46	.	Silent	0.376	0.266	1.73 (1.29–2.33)	0.0003
25	exm1010999	rs10876838	C>T	12	56030938	OR10P1	EXON	P88L	0.224	0.316	0.55 (0.39–0.76)	0.0003
26	exm1011022	rs7970885	G>A	12	56031273	OR10P1	EXON	V200M	0.224	0.316	0.55 (0.39–0.76)	0.0003
27	exm-rs469064	rs469064	G>T	6	33250476	WDR46	.	Silent	0.376	0.266	1.72 (1.28–2.31)	0.0003
28	exm-rs2237892	rs2237892	C>T	11	2839751	KCNQ1	.	Silent	0.458	0.336	1.66 (1.26–2.20)	0.0003
29	exm1352136	rs35489971	G>A	17	72700943	RAB37	EXON	Silent, V19A	0.303	0.213	1.80 (1.31–2.49)	0.0003
30	exm2259454	rs7866939	C>T	9	85126163	.	.	.	0.524	0.422	1.67 (1.26–2.22)	0.0003

NERD, nonsteroidal antiinflammatory drug-exacerbated respiratory diseases; MAF, minor allele frequency; ATA, aspirin-tolerant asthma; SNP, single nucleotide polymorphism; AUC, area under the curve; OR, odds ratio. An exonic SNP (exm537513 in HLA-DPB1, rs1042136) exhibited the strongest association with NERD (p=3.4×10−8, OR=3.28). Among six exonic SNPs within the HLA genes, a combination of four SNPs (marked with *) had the best AUC value of 0.629, with an OR=3.677, 68.4% accuracy, 29.8% sensitivity, and 90.3% specificity. The other six SNPs (marked with #) were then added to the four HLA SNPs, and multiple logistic regression analysis was performed. Relative risk was calculated for each individual subject using a multiple logistic regression analysis examining all 1023 combinations (210−1) of the 10 SNPs. A combination model of seven SNPs marked with ** showed the best AUC of 0.75 (p=7.94×10−21), with OR=7.291, 40.8% sensitivity, 88.0% specificity, and 75% accuracy for discrimination between NERD from ATA. Reprinted from Shin, et al. PLoS One 2014;9:e111887, with permission of Shin, et al.36

Fig. 3

Linkage disequilibrium (LD) between the six exonic SNPs in the HLA genes. The number in the box indicates LDr2. Reprinted from Shin, et al. PLoS One 2014;9:e111887, with permission of Shin, et al.36

Our third combinatorial analysis of SNPs was performed with the 660W genome-wide association data examining 142 NERD and 996 ATA.28 Fourteen genes [HLA-DPB1, MEIS1, NAB1, NRP2, CLDN1, DCDC2, TMEM196, AK8(C9orf98), TSC1, ZFYVE1-RBM25, PCSK6, ILVBL, GP6, and RDH13] were selected on the basis of OR and p values (Table 4). In this study, summed genetic risk scores were calculated as a combinatorial analysis and used in place of OR.37 To exclude the effect of LD, one SNP on each of the 14 genes was chosen, and the genetic risk score of each of the subjects was calculated by assigning ORs to the risk alleles in comparison to reference alleles. This was followed by summing the ORs from multiple variants in different genetic loci into a single RR value using the following formula:

Table 4

List of the 14 Top SNPs Predictive of NERD from the 660K GWAS

SNPID	Chr.	Gene (nearest gene)	Location	Results of GWAS
SNPID	Chr.	Gene (nearest gene)	Location	Odd ratio	p value (95% CI)
rs12619205	2	MEIS1	Intron	1.72 (1.30–2.28)	1.14×10⁻⁴
rs10931468	2	NAB1	Intron	1.45 (1.09–1.93)	9.94×10⁻³
rs849530	2	NRP2	Intron	1.8 (1.35–2.41)	6.91×10⁻⁵
rs10513846	3	CLDN1	Intron	0.68 (0.51–0.91)	8.15×10⁻³
rs2281389	6	HLA-DPB1	3′	2.4 (1.61–3.58)	5.6×10⁻⁶
rs3789224	6	DCDC2	Intron	1.98 (1.41–2.77)	8.55×10⁻⁵
rs9886152	7	(TMEM196)	Intron	0.54 (0.39–0.76)	3.09×10⁻⁴
rs2771994	9	AK8 (C9orf98)	Intron	1.87 (1.39–2.52)	2.48×10⁻⁵
rs1050700	9	TSC1	3′UTR	1.82 (1.35–2.45)	5.82×10⁻⁵
rs12432987	14	ZFYVE1-RBM25	Intron	1.88 (1.38–2.57)	4.36×10⁻⁵
rs3825915	15	PCSK6	Intron	0.51 (0.37–0.70)	2.5×10⁻⁵
rs2240299	19	ILVBL	Intron	0.51 (0.37–0.72)	7.61×10⁻⁵
rs11669150	19	GP6	Intron	0.68 (0.47–0.98)	3.74×10⁻²
rs1671215	19	RDH13	3′	0.49 (0.32–0.74)	6.34×10⁻⁴

SNP, single nucleotide polymorphism; NERD, nonsteroidal antiinflammatory drug-exacerbated respiratory diseases; GWAS, genome-wide association studies; CI, confidence interval.

Using the 14 SNPs listed in the table, combinatorial analysis was performed, and the results are summarized in Table 5. Reprinted from Chang, et al. Pharmacogenomics J 2015;15:316–21, with permission of Springer Nature.37

The summed risk scores of the 14 SNPs ranged from 0.774 to 1.142, with an average of 0.973. These summed risk scores were significantly higher in NERD patients than in ATA controls (1.033±0.005 vs. 0.963±0.002, p=8.58×10−37). The AUC of the NERD score was 0.821. The best cutoff value with a score of 1.01328 was determined using the ROC curve and the Youden's Index. A higher proportion of NERD subjects showed summed risk scores above the cutoff value relative to ATA subjects [64.66% vs. 15.04%, p=1.38×10−32, OR=10.34 (6.69−15.96)]. The sensitivity and specificity of the scores were 64.7% and 85.0%, respectively, with an accuracy of 82.0% (42.1% positive and 93.4% negative predictive values) (Table 5).

Table 5

Diagnostic Value of the Summed Risk Scores Using the Top 14 SNPs to Predict NERD

RR score	Diagnosis, n (%)			Statistics	Diagnostic value
RR score	ATA	NERD	Total	Statistics	Sensitivity	Specificity	Accuracy
<1.01328	582 (72.7)	41 (5.1)	623 (77.8)	χ²=141.31	64.70%	85.00%	82.00%
>1.01328	103 (12.9)	75 (9.4)	178 (22.2)	p=1.38×10⁻³²		+PV 42.1%	LR+ 4.300
Total	685 (85.5)	116 (14.5)	801 (100)			−PV 93.4%	LR− 0.416

SNP, single nucleotide polymorphism; NERD, nonsteroidal antiinflammatory drug-exacerbated respiratory diseases; ATA, aspirin-tolerant asthma; +PV, positive predictive value; −PV, negative predictive value; LR+, positive likelihood ratio; LR−, negative likelihood ratio; OR, odds ratio; RR, relative risk.

To calculate the genetic risk score of each of the subjects, ORs were assigned to the risk alleles in contrast to the value of 1 assigned to reference alleles. ORs from multiple variants in different genetic loci were summed into a single RR value. The summed risk scores of the 14 SNPs in the study subjects ranged from 0.774 to 1.142. The summed risk scores were significantly higher in the NERD group than in the ATA group (1.033±0.005 vs. 0.963±0.002, p=8.58×10−37). A higher proportion of NERD subjects showed summed risk scores above the cutoff value [64.66% vs. 15.04%, p=1.38×10−32, OR: 10.34 (6.69–15.96)]. The sensitivity and specificity of the scores were 64.7% and 85.0%, respectively, with 42.1% positive and 93.4% negative predictive values. The overall accuracy of the test was 82.0%. Reprinted from Chang, et al. Pharmacogenomics J 2015;15:316–21, with permission of Springer Nature.37

COMPARISON OF THE GENETIC MARKERS IN DIAG NOSIS OF NERD

The three most significant SNPs identified in the GWAS and exome chip studies exhibited ORs of only 2.4−3.28, with accuracies of 53.0−55.5% (Table 6). Although the specificity of these markers was very high, nearing 90% in many cases, the sensitivity of these markers was consistently low, typically at less than 20%. In contrast, both the OR and accuracy were elevated using combinatorial analysis of the top 10 SNPs obtained from the GWAS. A combinatorial analysis of several SNPs from GWAS and exome-wide association studies provided a more comprehensive genetic set for the diagnosis of NERD, compared to that of single SNPs alone. In the data presented here, the combination of eight SNPs obtained from a 100K GWAS and the summed risk score of 14 SNPs obtained from 660K GWAS showed higher accuracy (82.0% and 82.01%), compared with that observed for the 240K exome chip (68.4% and 75.0%). The improved accuracy of GWAS in these studies was derived from the higher sensitivity of the method (64.7% and 78.0%), compared to that seen using exome-wide methods (29.8% and 40.8%), despite equal specificities of ~90%.

Table 6

Diagnostic Accuracy of SNPs Identified in GWAS and Exon-Wide Association Studies for NERD

Genetic makers	OR	Accuracy (%)	Sensitivity (%)	Specificity (%)	Reference
1) 7572857 G>A on CEP68	2.63	53	18	88	32
2) rs1042151(Met105Val) in exon 2 of HLA-DPB1 obtained from a 660K GWAS	2.40	54.7	17.7	91.7	29
3) rs1042136 (exm537513) in HLA-DPB1 obtained from a 240K exome chip	3.28	55.5	16.7	94.3	39
4) A combination model of the eight SNPs obtained from a 100K GWAS^*	20.74	82.01	78	88	34
5) Four exonic SNPs on HLA-DPB1 obtained from a 240K exome chip^†	3.67	68.4	29.8	90.3	36
6) Seven exonic SNPs obtained from a 240K exome chip^‡	7.29	75	40.8	88	36
7) A summed risk score of 14 SNPs obtained from 660K GWAS^§	10.34	82	64.7	85	37

SNP, single nucleotide polymorphism; GWAS, genome-wide association studies; NERD, nonsteroidal antiinflammatory drug-exacerbated respiratory diseases; OR, odds ratio.

*A combination set of eight SNPs listed in Table 2, †A combination of four exonic SNPs (exm537513, exm537522, exm-rs3097671, and exmrs3129294) listed in Table 3, ‡A combination model of seven SNPs (exm537513, exm83523, exm1884673, exm538564, exm2264237, exm396794, and exm791954) listed in Table 3, §14 SNPs on the genes [HLA-DPB1, MEIS1, NAB1, NRP2, CLDN1, DCDC2, TMEM196, AK8(C9orf98), TSC1, ZFYVE1-RBM25, PCSK6, ILVBL, GP6 and RDH13] listed in Table 4.

PERSPECTIVES FOR IMPROVED DIAG NOSTIC VALUES USING COMBINATORIAL ANALYSIS OF MULTIPLE SNPS

GWAS have been used to identify numerous loci associated with multifactorial complex diseases and quantitative traits. However, most GWAS have not studied the gene-gene interactions (epistasis) that could be important in complex trait genetics and in combinatorial analysis of the SNPs. Existing statistical techniques, such as logistic regression and multifactor dimensionality reduction, can be used to identify the combinatorial effects of multiple SNPs in disease susceptibility; however, methods for the analysis of these techniques remains computationally intensive. To overcome some of these computational bottlenecks, a series of methods have been developed, including the use of computers equipped with particular graphical processing units, such as the BiForce Toolbox (University of Turku, Turku, Finland). The BiForce Toolbox incorporates additional tests of interactions involving SNPs with significant marginal effects and identifies interesting interaction signals and pathways.38 Pathway-based approaches may also narrow the search space and enhance power, for example by seeking pathways39 or by identifying common pathways enriched in epistatic genes with modest interaction signals detected from multiple GWAS populations.40 In either case, rapid screening of pairwise interactions in individual GWAS populations appears to be critical in order to provide the information necessary to design and perform pathway-based analyses. Another limitation is the relatively small sizes used in these studies, thereby limiting the ability to replicate significant epistasis signals.41 A recently proposed statistical method, called Limitless Arity Multiple-testing Procedure (LAMP),42 provides the possibility of detecting statistically significant higher-order interactions. LAMP is a multiple testing procedure for identifying statistically significant combinatorial effects by introducing a theoretical upper bound for a familywise error rate tighter than a Bonferroni correction. Its application to GWAS may uncover synergistic effects of SNPs associated with diseases, while filtering out uninformative combinations. Another program, LAMPLINK (National Institute of Advanced Industrial Science and Technology, Tokyo, Japan), also has the ability to quickly identify combinatorial effects of SNPs. Outside of computational methodologies, an important factor complicating the diagnosis of NERD is the low overall penetration of genetic traits. If the genetically susceptible subjects do not take aspirin or NSAID in their lifetime, they will never develop NERD or aspirin-induced urticaria. Thus, a history of aspirin or NSAID use should be available at the time of diagnosis, although the exact dosage and duration of drug exposure remains difficult to estimate for both patients and physicians. In the future, better combinatorial data will be possible as more information related to the measurable exposome becomes available and incorporated into big data-type analyses, including factors such as lifestyle, prescription drug use, and exposure to air pollution. In the effort to identify SNP markers, GWAS has provided useful information for asthma. Understandably, however, identified SNP markers alone show poor discriminating power for diagnosis, mainly due to biological complexity and phenotypic heterogeneity of asthma. Research to identify rare variants in asthma not only seems to be unsuccessful at discovering genetic markers, but also fails to provide meaningful information on the missing heritability of the disease. In contrast, studies using sub-phenotypes and combination/integration of SNPs seem to be more successful in achieving clinically useful diagnostics. In the near future, it is expected that statistical methods providing improved sensitivity and specificity could be achieved by the combination and integration of common genetic variants with exposome data for individual patients.

3 in total

Review 1. Complementary Participation of Genetics and Epigenetics in Development of NSAID-exacerbated Respiratory Disease.

Authors: Jong Uk Lee; Jong Sook Park; Hun Soo Chang; Choon Sik Park
Journal: Allergy Asthma Immunol Res Date: 2019-11 Impact factor: 5.764

2. Structure analysis of deleterious nsSNPs in human PALB2 protein for functional inference.

Authors: Noshin Nawar; Anik Paul; Hamida Nooreen Mahmood; Md Ismail Faisal; Md Ismail Hosen; Hossain Uddin Shekhar
Journal: Bioinformation Date: 2021-03-31

3. Generalizable characteristics of false-positive bacterial variant calls.

Authors: Stephen J Bush
Journal: Microb Genom Date: 2021-08

3 in total