| Literature DB >> 26190474 |
Yufei Dai1, Ying Chen2, Hanlin Huang3, Wei Zhou4, Yong Niu1, Mingrong Zhang2, Ping Bin1, Haiyan Dong5, Qiang Jia6, Jianxun Huang3, Juan Yi4, Qijun Liao2, Haishan Li7, Yanxia Teng8, Dan Zang9, Qingfeng Zhai10, Huawei Duan1, Juan Shen2, Jiaxi He4, Tao Meng1, Yan Sha4, Meili Shen1, Meng Ye1, Xiaowei Jia1, Yingping Xiang4, Huiping Huang4, Qifeng Wu3, Mingming Shi2, Xianqing Huang4, Huanming Yang2, Longhai Luo2, Sai Li2, Lin Li2, Jinyang Zhao2, Laiyu Li3, Jun Wang2, Yuxin Zheng1.
Abstract
Trichloroethylene induced hypersensitivity syndrome is dose-independent and potentially life threatening disease, which has become one of the serious occupational health issues and requires intensive treatment. To discover the genetic risk factors and evaluate the performance of risk prediction model for the disease, we conducted genomewide association study and replication study with total of 174 cases and 1761 trichloroethylene-tolerant controls. Fifty seven SNPs that exceeded the threshold for genome-wide significance (P < 5 × 10(-8)) were screened to relate with the disease, among which two independent SNPs were identified, that is rs2857281 at MICA (odds ratio, 11.92; P meta = 1.33 × 10(-37)) and rs2523557 between HLA-B and MICA (odds ratio, 7.33; P meta = 8.79 × 10(-35)). The genetic risk score with these two SNPs explains at least 20.9% of the disease variance and up to 32.5-fold variation in inter-individual risk. Combining of two SNPs as predictors for the disease would have accuracy of 80.73%, the area under receiver operator characteristic curves (AUC) scores was 0.82 with sensitivity of 74% and specificity of 85%, which was considered to have excellent discrimination for the disease, and could be considered for translational application for screening employees before exposure.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26190474 PMCID: PMC4507183 DOI: 10.1038/srep12169
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Genomewide association results.
The genomewide P values of the principal component 1-adjusted logistic regression analysis from 1,392,644 SNPs in 98 cases of trichloroethylene hypersensitivity syndrome and 98 trichloroethylene-tolerant controls were presented. The chromosomal distribution of all the P values (−log10P) were shown. The red horizontal dash line represented the genomewide significance threshold of P = 5 × 10−8.
Figure 2Regional association plot in the MHC for the subjects.
SNPs were colored on the basis of their linkage disequilibrium (LD) with the rs2857281(highlighted with purple diamonds) which was one of the independent markers identified through conditional association study. Another identified independent marker is rs2523557 (bigger green circle). x axes, physical distance (Mb); left y axes, −log10P for association statistics. Right y axes, recombination rates, light blue line. Recombination rates were based on the 1000 Genomes Mar 2012 ASN population.
Association results for the independent SNPs in the MHC region identified by conditional association study.
| SNP | Chr | Location & Related Genes | Allele (A1/A2) | Study | Frequency in cases | Frequency in tolerant controls | Odds Ratioadda (95%CI) | P adda | Phetb |
|---|---|---|---|---|---|---|---|---|---|
| rs2857281 | 6 | intronic, MICA | C/A | Discovery (98cases, 98controls) | 0.41 | 0.056 | 24.21 (10.55,55.59) | 5.64 × 10−14 | 0.061 |
| Replication (71cases, 1661controls) | 0.44 | 0.082 | 9.90 (6.47,15.15) | 5.22 × 10−26 | |||||
| Meta-analysis (169cases, 1759controls) | 0.42 | 0.081 | 11.92 (8.16,17.42) | 1.33 × 10−37 | |||||
| rs2523557 | 6 | intergenic, HLA-B(dist = 6268), MICA (dist = 40114) | C/T | Discovery (98cases, 98controls) | 0.59 | 0.107 | 11.22 (5.871,21.44) | 2.53 × 10−13 | 0.140 |
| Replication (72cases, 1659controls) | 0.50 | 0.131 | 6.41 (4.45,9.23) | 1.56 × 10−23 | |||||
| Meta-analysis (170cases, 1757controls) | 0.55 | 0.130 | 7.33 (5.34,10.07) | 8.79 × 10−35 | |||||
| HLA-B*13:01 | 6 | HLA-B | +/− | Discovery (98cases, 98controls) | 0.72 | 0.102 | 36.97 (14.87,91.89) | 7.77 × 10−15 | 0.742 |
| Replication (69cases, 1589controls) | 0.83 | 0.096 | 44.58 (23.40,84.92) | 7.46 × 10−31 | |||||
| Meta-analysis (167cases, 1687controls) | 0.76 | 0.097 | 41.88 (24.75,70.87) | 5.02 × 10−44 |
aOdds Ratioadd and Padd values were computed for principal component 1-adjusted logistic regression models. bHeterogeneity across studies was examined using the Cochran’s Q statistics. The odds ratio and frequency is given for the A1 allele, where odds ratio >1 indicates an adverse effect. Meta-analysis was performed using the fixed-effects model. Chr., Chromosome; C, cytosine; G, guanine; T, thymine; A, adenine. CI, Confidence interval.
Cumulative effect of two independent loci stratified by the number of risk alleles.
| Number of risk alleles | Discovery (n = 196)* 98 cases / 98 controls | replication (n = 1729)* 70 cases / 1659 controls | ||||
|---|---|---|---|---|---|---|
| Number (cases/controls) | Genetic risk score (mean ± SD) | Odds Ratio (95%CI) | Number (cases/controls) | Genetic risk score (mean ± SD) | Odds Ratio (95%CI) | |
| 0 (lowest risk) | 18/78 | 0.000 | 1(reference) | 13/1242 | 0.00 | 1(reference) |
| 1 | 5/10 | 2.42 ± 0.00 | 2.17 (0.66,7.12) | 5/162 | 1.90 ± 0.13 | 2.95 (1.04,8.38) |
| 2+ (highest risk) | 75/10 | 6.89 ± 1.86 | 32.50 (14.09,74.94) | 52/255 | 4.52 ± 0.99 | 19.48 (10.45,36.31) |
| Odds Ratiochangea | 32.50 | 19.48 | ||||
| P value, Odds Ratio (95%CI)b | 9.11 × 10−15, 4.37 (3.01,6.34) | <2.0 × 10−16, 3.42 (2.72,4.29) | ||||
| Nagelkerkec | 52.9% | 20.9% | ||||
*Risk scores were calculated on the basis of the Odds Ratio and allele frequencies for each stage study. Only individuals with nonmissing genotypes for all two alleles were included in this analysis. aFold-change in Odds Ratio between highest and lowest risk group. bP value and Odds Ratio for the genetic risk score prediction model. cNagelkerke’s pseudo r2 indicates fraction of the variance in risk explained by the risk score model. SD, standard deviation; CI, Confidence interval.
Figure 3Receiver operator characteristic curves for prediction model of trichloroethylene induced hypersensitivity syndrome using two independent SNPs as predictor.
a, Receiver operator characteristic curve of 10-fold cross validation using discovery dataset. b, Receiver operator characteristic curve using discovery and replication data. Discovery data was used to train prediction model, replication data was for testing data.