| Literature DB >> 29506519 |
Jiyoun Yeo1, Diego A Morales2, Tian Chen3, Erin L Crawford2, Xiaolu Zhang4, Thomas M Blomquist1, Albert M Levin5, Pierre P Massion6, Douglas A Arenberg7, David E Midthun8, Peter J Mazzone9, Steven D Nathan10, Ronald J Wainz11, Patrick Nana-Sinkam12,13, Paige F S Willey14, Taylor J Arend15, Karanbir Padda16, Shuhao Qiu17, Alexei Federov3,4, Dawn-Alita R Hernandez18, Jeffrey R Hammersley18, Youngsook Yoon18, Fadi Safi18, Sadik A Khuder18, James C Willey19.
Abstract
BACKGROUND: There is a need for more powerful methods to identify low-effect SNPs that contribute to hereditary COPD pathogenesis. We hypothesized that SNPs contributing to COPD risk through cis-regulatory effects are enriched in genes comprised by bronchial epithelial cell (BEC) expression patterns associated with COPD.Entities:
Keywords: Bronchial epithelial cells; CAT; CEBPG; COPD; ERCC5; GPX1; GWAS; KEAP1; TP73; XPA; cis-regulation; eQTL
Mesh:
Substances:
Year: 2018 PMID: 29506519 PMCID: PMC5838965 DOI: 10.1186/s12890-018-0603-y
Source DB: PubMed Journal: BMC Pulm Med ISSN: 1471-2466 Impact factor: 3.317
Fig. 1Schematic description of research design. 1RNAseq: RNA sequencing by next generation sequencing; 2BEC: bronchial epithelial cell; 3COPD, chronic obstructive pulmonary disease; 4GWAS, genome wide association study; 5ASE: allele-specific expression; 6LHS GWAS: Lung Health Study Genome Wide Association Study; 7DAE: differential allelic expression; 8COPDgene NHW: COPDgene Non-Hispanic White Cohort
Clinical characteristics of study population
| Non-COPD ( | COPD ( | ||
|---|---|---|---|
| Age, yr | 64.3 | 63.6 | 0.713 |
| Sex | 0.009 | ||
| Male | 11 | 22 | |
| Female | 19 | 8 | |
| Smoking status | 1.0 | ||
| Current | 10 | 9 | |
| Former | 20 | 21 | |
| Never | 0 | 0 | |
| Pack-years | 49 | 60 | 0.088 |
| FEV1/FVC | 0.81 | 0.53 | 5.81E-13 |
| Ethnicity | |||
| White | 28 | 26 | |
| AA | 2 | 4 |
†p-values were calculated using Student’s t-test for age and Pack-years, and Fisher exact test for sex and smoking history
Fig. 2Network of bivariate correlation among genes (transcript abundance values) for control and COPD cohorts. Each line represents Pearson r-value with p-value < 0.05. Left: Control, Right: COPD. (See Additional file 1: Table S3 for r- and p-value of each gene pair)
Fig. 3Inter-gene correlation differences in control vs COPD cohorts. a, b TP73–2 vs ERCC5
Fig. 4Receiver operating characteristic curve (ROC) (a) and summary of performance of classifier (b) in 30 control and 30 COPD subjects
COPD classifier gene features selected by SLDA1 and 10-fold cross-validation
| Feature | Gene Function | CAT2 score | Ranking | Missing value (%) |
|---|---|---|---|---|
| KEAP1 | AO3 | 3.35 | 1 | 8% |
| GPX1 | AO | 3.32 | 2 | 5% |
| CEBPG | TF4 | 2.98 | 3 | 18% |
| XPA | DNAR5 | 2.64 | 4 | 28% |
| CAT-2 | AO | 2.64 | 5 | 22% |
| TP73–2 | CCC6/DNAR | 2.30 | 6 | 28% |
1SLDA shrinkage linear discriminant analysis, 2CAT score correlation-adjusted t-scores, 3AO antioxidant, 4TF transcription factor, 5DNAR DNA repair, 6CCC cell cycle control
SLDA COPD classifier gene differential allelic expression (DAE) in bronchial epithelial cell (BEC) or in GTEx lung tissue database
| DAE in BEC | GTEx Lung Tissue ( | |||||
|---|---|---|---|---|---|---|
| 1SNP | MAF | Subjects Assessed ( | 2Heterozygote Subjects with DAE data ( | 3 | eQTL | 4 |
| KEAP1-rs1048287 | 0.1 | 159 | 30 |
| KEAP1 | 5N.R. |
| CEBPG-rs3745968 | 0.11 | 128 | 17 |
| CEBPG | N.R. |
| CAT-rs1049982 | 0.34 | 156 | 52 |
| CAT |
|
| TP73-rs1801174 | 0.09 | 158 | 27 |
| TP73 | N.R. |
Significant p-values indicated in italicized font
1SNP that served as marker for DAE. SNPs with highest minor allele frequency chosen
2n = number of subjects for whom each SNP allele was measurable in BEC after filtering to prevent stochastic sampling error. The fraction of gDNA samples with heterozygotes was comparable to that for cDNA samples and both approximated Hardy Weinberg Equilibrium expectations
3p-value for F-test comparing inter-individual variation in cDNA to inter-individual variation in gDNA samples
4p-value reported in GTEx database
5N.R not reported
Fig. 5Inter-individual variation allelic ratio for cDNA compared with gDNA. Each symbol represents results from a single heterozygous individual. a CAT-rs1049982, b CEBPG-rs3745968, c ERCC5-rs17655, d KEAP1-rs1048287, e TP73-rs1801174