| Literature DB >> 25372662 |
Katherine Shortt1, Suman Chaudhary2, Dmitry Grigoryev1, Daniel P Heruth2, Lakshmi Venkitachalam3, Li Q Zhang2, Shui Q Ye1.
Abstract
Acute respiratory distress syndrome (ARDS) is a lung condition characterized by impaired gas exchange with systemic release of inflammatory mediators, causing pulmonary inflammation, vascular leak and hypoxemia. Existing biomarkers have limited effectiveness as diagnostic and therapeutic targets. To identify disease-associating variants in ARDS patients, whole-exome sequencing was performed on 96 ARDS patients, detecting 1,382,399 SNPs. By comparing these exome data to those of the 1000 Genomes Project, we identified a number of single nucleotide polymorphisms (SNP) which are potentially associated with ARDS. 50,190SNPs were found in all case subgroups and controls, of which89 SNPs were associated with susceptibility. We validated three SNPs (rs78142040, rs9605146 and rs3848719) in additional ARDS patients to substantiate their associations with susceptibility, severity and outcome of ARDS. rs78142040 (C>T) occurs within a histone mark (intron 6) of the Arylsulfatase D gene. rs9605146 (G>A) causes a deleterious coding change (proline to leucine) in the XK, Kell blood group complex subunit-related family, member 3 gene. rs3848719 (G>A) is a synonymous SNP in the Zinc-Finger/Leucine-Zipper Co-Transducer NIF1 gene. rs78142040, rs9605146, and rs3848719 are associated significantly with susceptibility to ARDS. rs3848719 is associated with APACHE II score quartile. rs78142040 is associated with 60-day mortality in the overall ARDS patient population. Exome-seq is a powerful tool to identify potential new biomarkers for ARDS. We selectively validated three SNPs which have not been previously associated with ARDS and represent potential new genetic biomarkers for ARDS. Additional validation in larger patient populations and further exploration of underlying molecular mechanisms are warranted.Entities:
Mesh:
Year: 2014 PMID: 25372662 PMCID: PMC4221189 DOI: 10.1371/journal.pone.0111953
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Participant demographics and comorbidities for the ARDS cases.
| Whole-exome SequencingSample group | Participants | Gender(%male) | Age± SD | APACHEIIscore ±SD | Ventilator-free daysper 28 days ±SD | 60-day mortality(% dead at 60 days post-onset) |
| ARDS total | 96 | 46.9 | 50±14.6 | 97.2±30.4 | 12.6±10.2 | 29.4 |
| Caucasian ARDS | 70 | 44.3 | 50.7±13.8 | 94.5±30.2 | 13.6±10.0 | 23.9 |
| African American ARDS | 26 | 53.8 | 47.8±16.6 | 104.6±30.2 | 9.9±10.5 | 44 |
| ARDS with Sepsis total | 48 | 50 | 52.0±15.1 | 105.2±32.2 | 10.2±10.1 | 38.3 |
| Caucasian ARDS with Sepsis | 37 | 43.2 | 51.8±15.0 | 102.6±31.1 | 10.5±9.8 | 33.3 |
| African American ARDS with Sepsis | 11 | 72.7 | 52.9±16.1 | 113.6±35.8 | 9.3±11.7 | 54.5 |
| ARDS with Pneumonia total | 48 | 43.8 | 47.9±13.9 | 88.9±26.2 | 14.9±9.87 | 20 |
| Caucasian ARDS with Pneumonia | 33 | 45.5 | 49.5±12.4 | 85.0±26.6 | 17.0±9.2 | 12.9 |
| African American ARDS with Pneumonia | 15 | 40 | 44.1±16.6 | 97.6±24.0 | 10.3±10.0 | 35.7 |
|
|
|
|
|
|
|
|
| ARDS Total | 117 | 46.2 | 50.8±16.6 | 105.1±32.7 | 12.0±10.5 | 31 |
| Caucasian ARDs | 75 | 40 | 52.1±15.4 | 104.3±31.4 | 11.8±10.5 | 29.2 |
| African American ARDS | 17 | 64.7 | 51.6±22.8 | 113.3±33.8 | 12.2±10.6 | 29.4 |
| Other ancestry ARDS | 25 | 52 | 46.4±15.3 | 101.8±36.2 | 12.6±10.7 | 37.5 |
| ARDS with Sepsis total | 59 | 52.5 | 51.5±17.3 | 112.9±31.5 | 11.3±11.0 | 37.9 |
| Caucasian ARDS with Sepsis | 34 | 50 | 51.1±15.6 | 113.6±29.7 | 10.9±11.1 | 33.3 |
| African American ARDS with Sepsis | 9 | 66.7 | 56.6±27.1 | 121.8±33.0 | 12.9±11.5 | 33.3 |
| Other ancestry ARDS with Sepsis | 16 | 50 | 49.6±14.8 | 106.4±34.7 | 11.4±11.0 | 50 |
| ARDS with Pneumonia total | 58 | 39.7 | 50.1±16.0 | 96.9±32.3 | 12.7±9.9 | 23.6 |
| Caucasian ARDS with Pneumonia | 41 | 31.7 | 52.9±15.3 | 96.4±31.0 | 12.6±10.0 | 25.6 |
| African American ARDS with Pneumonia | 8 | 62.5 | 46±16.9 | 103.8±34.3 | 11.4±10.2 | 25 |
| Other ancestry ARDS with Pneumonia | 9 | 55.6 | 46.8±15.2 | 92.4±39.7 | 14.9±10.3 | 12.5 |
|
|
|
|
|
|
|
|
| All ARDS patients | 213 | 46.5 | 50.4±15.7 | 101.6±31.9 | 12.3±10.3 | 30.2 |
*8 samples, 4 exome sequenced and 4 TaqMan genotyped ARDS patients did not have these phenotypes available. An additional 2 exome sequenced ARDS patients did not have ventilator-free day’s data. In addition to the 8 patients missing severity and mortality phenotype data, 2 patients were excluded from the regression because their phenotypes were thought to be missing until after the regressions were completed.
Summary of the filtering applied to candidate SNPs.
| Criteria for filtering | Number of remaining variants |
| SNPs detected in ARDS SNPs | 1,382,399 |
| matched in 1000 Genomes Project | 169,376 |
| common across 4 ARDS race and etiology groups | 50,190 |
| ?2 p-value <0.01 in top 5 ARDS groups** | 99 |
| ?2 p-value <2.95×10−7 in at least 2/5 top 5 ARDS groups | 76(in 65 genes) |
| coding variants | 38(in 32 genes) |
| nonsynonymous | 20 |
| synonymous variants | 18 |
*African American with pneumonia, African American with sepsis, Caucasian with pneumonia, Caucasian with sepsis subgroups. Out of the 50,190 SNPs, 49,723 are bi-allelic. **, All ARDS, all pneumonia, all sepsis, all African American, all Caucasian groups.
Figure 1Pipeline of the exome-seq data analysis workflow.
After processing the data using the GATK pipeline, this filtering workflow was derived to identify SNPs which were associated with measures of susceptibility across the racial and etiology groups of cases. SNPs were filtered based on strength of association, coding effect, and functional prediction prior to testing for association with other ARDS phenotypes. *, The sample contains African American and Caucasian patients, so the EUR and ASW healthy controls from 1000 Genomes were used for comparison; **, In the 1000 Genomes Project exome sequence, the same 714,074 SNPs are present for all 440 EUR and ASW; §, HWE = Hardy Weinberg Equilibrium, p>0.0001; +, African American with pneumonia, African American with sepsis, Caucasian with pneumonia, Caucasian with sepsis; + +, χ2 test of ARDS vs. respective 1000 Genomes Project control groups; ‡, SNPs with P-value <0.01 in the overall comparison, Caucasian ARDS comparison, and African American comparison with 1000 Genomes were filtered further by p<0.01 in the sepsis comparison and pneumonia comparison; ‡ ‡, All ARDS cases, all pneumonia cases, all sepsis cases, all African American cases, all Caucasian cases.
Figure 2Manhattan plot of ARDS patients and 1000 genomes project controls.
A number of strong associations with susceptibility to ARDS were observed using a χ2 test. (A) A Manhattan plot of the whole exome sequence all ARDS cases vs. European Ancestry and ASW 1000 Genomes Project controls created using SVS v8.2.0. This is a graphic representation of the chromosome location (x axis) vs. the –log10 (χ2 p-value) of the allele frequencies. SNPs whose chi-square tests yield a smaller p-value fall higher on the log scale are more significant [42]. (B) The same Manhattan plot with a zoomed Y-axis.
Pathway analysis.
| Top Canonical Pathways | p-value* | Ratio** | Molecules |
| Graft-versus-Host Disease Signaling | 8.07×10−9 | 6/48 | KIR2DL1/KIR2DL3, HLA-DRB1, HLA-B, HLA-DQA1, HLA-DQB1, HLA-DRB5 |
| Autoimmune Thyroid Disease Signaling | 4.35×10−7 | 5/49 | HLA-DRB1, HLA-B, HLA-DQA1, HLA-DQB1, HLA-DRB5 |
| Nur77 Signaling in T Lymphocytes | 9.37×10−7 | 5/57 | HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DRB5, SIN3A |
| Calcium-induced T Lymphocyte Apoptosis | 1.68×10−6 | 5/64 | HLA-DRB1, HLA-DQA1, HLA-DQB1, ITPR1, HLA-DRB5 |
| B Cell Development | 3.75×10−6 | 4/34 | HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DRB5 |
Top canonical pathways as predicted from the 65genes containing the76 SNPs that were identified using χ2tests. Pathway predictions were done using the Core Analysis function of Ingenuity Pathway Analysis. *, P-Value of <0.05 indicates a non-random association between the genes and pathway; **, Ratio of the number of genes in the dataset involved in the pathway to the total number of genes in the pathway.
Profile of 3 SNPs.
| rs3848719 | rs9605146 | rs78142040 | |
| Location | 20∶44596545 | 22∶17265194 | X:2832771 |
| Gene ID | ZNF335 | XKR3 | ARSD |
| Call Rate (Cases+Controls) | 0.962 | 0.995 | 0.991 |
| Call Rate (Cases) | 0.883 | 0.986 | 0.972 |
| HWE P-value (Cases) | 2.86E-4 | 0.422 | 1.18E-3 |
| HWE P-value (Controls) | 0.887 | 3.72E-5 | 1 |
| HWE P-value (Cases+Controls) | 0.034 | 1.94E-12 | 0.471 |
| Reference: Alternate Allele | G>A | G>A | C>T |
| Alt. Allele Freq. (Cases) | 39.4% | 38.6% | 22.0% |
| Alt. Allele Freq. (Controls) | 38.5% | 4.0% | 0% |
| Number AA (%) (Cases) | 41 (21.8%) | 34 (16.2%) | 2 (1.0%) |
| Number AA (%) (Controls) | 66(15.0%) | 4(0.9%) | 0(0%) |
| Number Ar (%) (Cases) | 66 (35.1%) | 94 (44.8%) | 87 (42.0%) |
| Number Ar (%) (Controls) | 207 (47.0%) | 27 (6.1%) | 0 (0%) |
| Number rr (%) (Cases) | 81 (45.1%) | 82 (39.1%) | 118 (57.0%) |
| Number rr (%) (Controls) | 339 (77.0%) | 35 (8.0%) | 0 (0%) |
| SNP classification | Coding | Coding | Intronic |
| Coding classification | Synonymous | Nonsynonymous | Intronic |
| Reference amino acid | S | P | NK |
| Alternate amino acid | S | L | NK |
| Provean prediction (cutoff = −25)1 | Neutral | Deleterious | NK |
| Sift prediction (cutoff = 0.05)1 | Tolerated | Tolerated | NK |
| Pph2 prediction2 | NK | benign | NK |
NK: Not known, HWE, Hardy-Weinberg equilibrium; AA, alternate genotype or homozygous minor genotype; Ar, heterozygous genotype; rr, reference genotype or homozygous major genotype; 1, http://provean.jcvi.org/genome_submit.php; 2, http://genetics.bwh.harvard.edu/pph2/.
Overall association summary.
| rs3848719 | rs9605146 | rs78142040 | |
| Susceptibility (cases vs. controls) | |||
| Chi-squared p-value | 0.780 | 1.68E-59 | 3.64E-47 |
| Odds Ratio (95% CI)1 | 1.04(0.81–1.33) | 15.16(10.25–22.41) | 498.09 (30.83–8,047.51) |
| Severity (ventilator-free days/28 days) | |||
| p-value | NS | NS | NS |
| Odds Ratio (95% CI)2 | NS | NS | NS |
| Severity (APACHE II score) | |||
| p-value | 0.032 | NS | 0.061** |
| Odds Ratio (95% CI)2 | 0.55 (0.31–0.96) | NS | 2.60(0.93–7.26) |
| Outcome (60-day mortality) | |||
| p-value | NS* | NS | 0.017 |
| Odds Ratio (95% CI)2 | NS | NS | 2.04 (1.13–3.68) |
NS, Not significant; * significantly associated in genotyped Caucasian, pneumonia, and Caucasian pneumonia subgroups; **, for 117 genotyped samples only, not in total 213; 1, Odds ratio for alternate allele (allelic test); 2, additive genotypic model.
Figure 3Quantile-quantile plots of Caucasian ARDS and EUR 1000 genomes.
In the example of our Caucasian cases and EUR controls, we observe that correction for principal components improves the fit of our data with the expected distribution. (A) QQ plot of expected χ2values versus the actual χ2values for the genotypic trend test of case-control status. The data are filtered on HWE, LD, and SNP call rate but not PCA corrected. (B) QQ plot of expected χ2values versus the actual χ2values for the genotypic trend test of case-control status. The data have been filtered and corrected for 6 PCs. (C) QQ plot of expected χ2values versus the actual χ2values for the genotypic trend test of case-control status. The data have been filtered and corrected for 6 PCs and undergone sample outlier removal.