| Literature DB >> 24498427 |
Dorothy M Ryan1, Thomas L Vincent1, Jacqueline Salit1, Matthew S Walters1, Francisco Agosto-Perez2, Renat Shaykhiev1, Yael Strulovici-Barel1, Robert J Downey3, Lauren J Buro-Auriemma1, Michelle R Staudt1, Neil R Hackett1, Jason G Mezey4, Ronald G Crystal1.
Abstract
Genome-wide association studies (GWAS) and candidate gene studies have identified a number of risk loci associated with the smoking-related disease COPD, a disorder that originates in the airway epithelium. Since airway basal cell (BC) stem/progenitor cells exhibit the earliest abnormalities associated with smoking (hyperplasia, squamous metaplasia), we hypothesized that smoker BC have a dysregulated transcriptome, enriched, in part, at known GWAS/candidate gene loci. Massive parallel RNA sequencing was used to compare the transcriptome of BC purified from the airway epithelium of healthy nonsmokers (n = 10) and healthy smokers (n = 7). The chromosomal location of the differentially expressed genes was compared to loci identified by GWAS to confer risk for COPD. Smoker BC have 676 genes differentially expressed compared to nonsmoker BC, dominated by smoking up-regulation. Strikingly, 166 (25%) of these genes are located on chromosome 19, with 13 localized to 19q13.2 (p<10⁻⁴ compared to chance), including 4 genes (NFKBIB, LTBP4, EGLN2 and TGFB1) associated with risk for COPD. These observations provide the first direct connection between known genetic risks for smoking-related lung disease and airway BC, the population of lung cells that undergo the earliest changes associated with smoking.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24498427 PMCID: PMC3912203 DOI: 10.1371/journal.pone.0088051
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Smoking-induced dysregulated transcripts in human airway basal cells.
A. Principal component analysis. Shown is gene expression of basal cells (BC) of nonsmokers (n = 10, green circles) and smokers (n = 7, orange circles) using all 13,385 expressed genes as an input dataset. B. Volcano plot, smoker vs nonsmoker airway BC. Ordinate – p value; abscissa – fold-change (log2). C. Hierarchical cluster analysis of smoker vs nonsmoker basal cells based on expression of 676 smoking-dysregulated genes [fold-change>1.5, p<0.05 with false discovery rate (FDR) correction]. Genes expressed above the average are represented in red, below average in blue and average in grey. The genes are represented vertically, and individual samples horizontally. D. Functional categories of the 676 unique genes significantly differentially expressed in smoker vs nonsmoker human airway BC (≥1.5 fold-change up- or down-regulated; p<0.05 with FDR correction). Shown are fold-changes of the smoking-responsive genes on a log2 scale.
Top 50 Smoking-dysregulated Genes in Human Airway Basal Cells1.
| Functional category | Gene symbol | Gene title | Nonsmoker mean expression level (FPKM) | Smoker mean expression level (FPKM) | Absolute Difference | Fold-change | p value |
| Adhesion | ITGB4 | integrin, beta 4 | 233.5 | 375.7 | 142.2 | 1.6 | 3.2×10−2 |
| COL7A1 | collagen, type VII, alpha 1 | 63.2 | 122.6 | 59.4 | 1.9 | 2.6×10−2 | |
| PKP3 | plakophilin 3 | 38.7 | 74.4 | 35.7 | 1.9 | 6.0×10−3 | |
| Apoptosis | TNFRSF12A | tumor necrosis factor receptor superfamily, member 12A | 152.8 | 267.1 | 114.3 | 1.7 | 1.2×10−2 |
| ARHGDIA | Rho GDP dissociation inhibitor (GDI) alpha | 103.4 | 185.7 | 82.3 | 1.8 | 1.5×10−2 | |
| PHLDA2 | pleckstrin homology-like domain, family A, member 2 | 34.1 | 104.3 | 70.2 | 3.1 | 1.0×10−3 | |
| FKBP8 | FK506 binding protein 8, 38 kDa | 70.1 | 106.8 | 36.7 | 1.5 | 3.5×10−2 | |
| MFSD10 | major facilitator superfamily domain containing 10 | 30.0 | 65.3 | 35.3 | 2.2 | 2.0×10−3 | |
| BAD | BCL2-associated agonist of cell death | 20.3 | 52.4 | 32.2 | 2.6 | 3.0×10−4 | |
| Development | PPDPF | pancreatic progenitor cell differentiation and proliferation factor homolog (zebrafish) | 159.8 | 385.9 | 226.1 | 2.4 | 7.0×10−3 |
| PLEC | plectin | 69.5 | 186.3 | 116.8 | 2.7 | 1.0×10−3 | |
| JUNB | jun B proto-oncogene | 44.6 | 134.2 | 89.5 | 3.0 | 4.0×10−3 | |
| FSCN1 | fascin homolog 1, actin-bundling protein (Strongylocentrotus purpuratus) | 107.4 | 186.1 | 78.7 | 1.7 | 4.2×10−2 | |
| ARPC1B | actin related protein 2/3 complex, subunit 1B, 41 kDa | 114.8 | 180.7 | 65.9 | 1.6 | 1.2×10−2 | |
| LAMA5 | laminin, alpha 5 | 19.7 | 54.6 | 34.8 | 2.8 | 2.0×10−3 | |
| BCAR1 | breast cancer anti-estrogen resistance 1 | 14.0 | 45.3 | 31.3 | 3.2 | 1.9×10−4 | |
| Immune response | PDLIM7 | PDZ and LIM domain 7 (enigma) | 35.9 | 73.6 | 37.7 | 2.1 | 2.0×10−3 |
| Ion transport | SLC16A3 | solute carrier family 16, member 3 (monocarboxylic acid transporter 4) | 82.7 | 142.5 | 59.8 | 1.7 | 1.9×10−2 |
| GLTSCR2 | glioma tumor suppressor candidate region gene 2 | 44.0 | 93.2 | 49.1 | 2.1 | 7.0×10−2 | |
| Metabolism | MIF | macrophage migration inhibitory factor (glycosylation-inhibiting factor) | 32.6 | 182.6 | 150.0 | 5.6 | 1.8×10−6 |
| MZT2B | mitotic spindle organizing protein 2B | 23.7 | 84.3 | 60.6 | 3.6 | 1.8×10−4 | |
| CHPF | chondroitin polymerizing factor | 25.6 | 67.3 | 41.7 | 2.6 | 2.7×10−4 | |
| NAPRT1 | nicotinate phosphoribosyltransferase domain containing 1 | 17.4 | 49.5 | 32.1 | 2.8 | 2.6×10−2 | |
| Oxidation reduction | GPX1 | glutathione peroxidase 1 | 243.0 | 464.3 | 221.4 | 1.9 | 1.4×10−2 |
| Signal transduction | GPC1 | glypican 1 | 42.4 | 86.0 | 43.6 | 2.0 | 2.1×10−2 |
| DVL1 | dishevelled, dsh homolog 1 (Drosophila) | 25.5 | 62.1 | 36.6 | 2.4 | 1.0×10−3 | |
| MAP7D1 | MAP7 domain containing 1 | 38.7 | 70.2 | 31.5 | 1.8 | 2.7×10−2 | |
| LRP10 | low density lipoprotein receptor-related protein 10 | 56.1 | 87.3 | 31.2 | 1.6 | 2.2×10−2 | |
| IER2 | immediate early response 2 | 38.0 | 68.6 | 30.6 | 1.8 | 8.0×10−3 | |
| Transcription | TSPO | translocator protein (18 kDa) | 92.7 | 220.8 | 128.1 | 2.4 | 1.0×10−3 |
| MGST1 | microsomal glutathione S-transferase 1 | 358.2 | 236.4 | 121.8 | −1.5 | 1.0×10−3 | |
| AGRN | agrin | 40.9 | 116.6 | 75.7 | 2.9 | 1.9×10−4 | |
| RNH1 | ribonuclease/angiogenin inhibitor 1 | 76.0 | 150.0 | 73.9 | 2.0 | 2.8×10−2 | |
| CEBPD | CCAAT/enhancer binding protein (C/EBP), delta | 25.4 | 71.0 | 45.6 | 2.8 | 4.6×10−2 | |
| GIPC1 | GIPC PDZ domain containing family, member 1 | 62.7 | 104.4 | 41.8 | 1.7 | 5.0×10−3 | |
| EPN1 | epsin 1 | 29.7 | 70.3 | 40.6 | 2.4 | 3.6×10−4 | |
| H1FX | H1 histone family, member X | 24.4 | 64.1 | 39.7 | 2.6 | 2.0×10−3 | |
| ADRM1 | adhesion regulating molecule 1 | 51.6 | 85.6 | 34.0 | 1.7 | 1.3×10−2 | |
| IRAK1 | interleukin-1 receptor-associated kinase 1 | 50.8 | 84.4 | 33.7 | 1.7 | 2.6×10−2 | |
| NDUFS7 | NADH dehydrogenase (ubiquinone) Fe-S protein 7, 20 kDa (NADH-coenzyme Q reductase) | 19.1 | 51.2 | 32.1 | 2.7 | 1.0×10−3 | |
| HOMER3 | homer homolog 3 (Drosophila) | 35.6 | 67.7 | 32.1 | 1.9 | 3.0×10−3 | |
| RBCK1 | RanBP-type and C3HC4-type zinc finger containing 1 | 26.5 | 55.5 | 29.0 | 2.1 | 3.5×10−2 | |
| CEBPB | CCAAT/enhancer binding protein (C/EBP), beta | 26.4 | 55.3 | 28.9 | 2.1 | 2.1×10−2 | |
| Translation | RPLP1 | ribosomal protein, large, P1 | 1632.0 | 2533.0 | 901.0 | 1.6 | 4.8×10−2 |
| MRPL41 | mitochondrial ribosomal protein L41 | 22.5 | 71.0 | 48.5 | 3.2 | 1.0×10−3 | |
| Other | ATP6V0C | ATPase, H+ transporting, lysosomal 16 kDa, V0 subunit c | 74.9 | 138.6 | 63.7 | 1.9 | 1.0×10−3 |
| Unknown | MSMO1 | methylsterol monooxygenase 1 | 207.6 | 119.7 | 87.9 | −1.7 | 3.0×10−2 |
| CCDC124 | coiled-coil domain containing 124 | 30.4 | 69.4 | 39.1 | 2.3 | 2.4×10−6 | |
| MZT2A | mitotic spindle organizing protein 2A | 25.6 | 56.8 | 31.2 | 2.2 | 2.4×10−2 | |
| ZNF598 | zinc finger protein 598 | 14.8 | 44.5 | 29.6 | 3.0 | 6.0×10−3 |
Top 50 genes identified by largest absolute difference in expression and sorted in descending order of absolute difference within each category.
FPKM = fragments per kilobase of exon per million fragments mapped.
Absolute difference = smoker mean - nonsmoker mean.
Fold-change = mean in smokers/mean in nonsmokers.
False discovery rate controlled to 0.05 using Partek ‘step-up’ (Benjamini-Hochberg) procedure.
Figure 2Comparison of chromosomal location of basal cell smoking-dysregulated genes to known COPD risk loci.
A. Chromosomal distribution of SNPs identified by GWAS (p<10−5) as risk loci for COPD and related phenotypes. B. Chromosomal location of the 676 significant smoking-dysregulated basal cell (BC) genes as compared to distribution expected by random chance. Red dots represent the number of BC smoking-dysregulated genes localized to each chromosome. Box and whisker plots represent 104 permutations of 676 randomly chosen genes. The red dots above chromosomes 16, 19 and 22 represent the number of BC smoking-dysregulated genes in each location, along with the % of total dysregulated genes on each chromosome. C. Enrichment of 676 BC smoking-dysregulated genes on known COPD risk loci. Red dots represent number of BC smoking-dysregulated genes at each known COPD risk loci; the loci are identified by chromosome number and chromosome subband. Box and whisker plots represent the distribution of 676 randomly chosen genes permutated 104 times for each GWAS chromosome subband location. The only statistically significant locus was 19q13.2 (p<10−4).
Figure 3Basal cell (BC) smoking-dysregulated genes localized to COPD-risk locus 19q13.2.
A. Genome distribution of 13 significant BC smoking-dysregulated genes on locus 19q13.2. Red bar – known COPD locus; red dots – known COPD candidate genes (see Table S1 in File S1). B. Expression of BC smoking-dysregulated genes on 19q13.2. Expression is in fragments per kilobase of exon per million fragments mapped (FPKM). Nonsmoker (n = 10, green bars), smoker (n = 7, yellow bars). All smoker to nonsmoker comparisons minimum p<0.05. The 4 COPD risk genes are in red.
Figure 4Hierarchical clustering of the correlation coefficients of mean gene expression of 13 smoking-dysregulated genes on chromosome locus 19q13.2 in nonsmoker and smoker BC.
A. Nonsmokers; B. Smokers. The correlation coefficients allow us to assess the relationship between pairs of genes, and range from −1 (blue) to 1 (red). Positive correlation coefficient is represented in red, consistent with co-expression in the same direction. Negative correlation coefficient is represented in blue, consistent with co-expression in opposite directions.