| Literature DB >> 28985737 |
Margaret M Parker1,2, Robert P Chase1, Andrew Lamb1, Alejandro Reyes3, Aabida Saferali1,2, Jeong H Yun1,4, Blanca E Himes5, Edwin K Silverman1,2,4, Craig P Hersh1,2,4, Peter J Castaldi6,7,8.
Abstract
BACKGROUND: Cigarette smoking is the leading modifiable risk factor for disease and death worldwide. Previous studies quantifying gene-level expression have documented the effect of smoking on mRNA levels. Using RNA sequencing, it is possible to analyze the impact of smoking on complex regulatory phenomena (e.g. alternative splicing, differential isoform usage) leading to a more detailed understanding of the biology underlying smoking-related disease.Entities:
Keywords: Cigarette smoking; Differential expression; Exon usage; Isoforms; RNA-seq
Mesh:
Substances:
Year: 2017 PMID: 28985737 PMCID: PMC6225866 DOI: 10.1186/s12920-017-0295-9
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Summary of analyzed COPDGene subjects by former (n = 286) and current (n = 229) smoking status. Values represent mean (SD)
| Former Smokers (n = 286) | Current Smokers (n = 229) |
| |
|---|---|---|---|
| Race (% NHW) | 86% | 57% | <0.01 |
| Gender (% female) | 45% | 49% | 0.4 |
| Age | 69 (8.1) | 61 (7.6) | <0.01 |
| Pack-Years Smoked | 44 (23) | 46 (21) | 0.5 |
| FEV1 percent predicted | 73 (29) | 79 (24) | 0.02 |
| COPD cases | 41% | 48% | <0.01 |
| Neutrophil percentage | 61 (11) | 58 (11) | <0.01 |
| Lymphocyte percentage | 28 (11) | 31 (9.9) | <0.01 |
| Eosinophil percentage | 2.6 (2.1) | 2.7 (3.0) | 0.9 |
| Monocyte percentage | 8.5(2.5) | 7.6(2.2) | <0.01 |
| Basophil percentage | 0.59(0.55) | 0.59(0.65) | 0.9 |
Fig. 1Mean-Average(MA)-plot of log2 average expression versus log2 fold change between current and former smokers. Log fold change values greater than zero indicate genes with higher expression in current smokers (n = 136 genes), Log fold change values less than zero indicate genes with higher expression in former smokers (n = 35 genes). Blue dots indicate genes that are significantly differentially expressed (adjusted p value <0.05)
Differentially expressed long non-coding RNAs (lncRNA) between current and former smokers (adjusted p-value <0.05). Of the 7 differentially expressed lncRNAs, 6 have higher expression in current smokers as compared to former smokers
| Ensembl Gene ID | Gene Symbol(s) | Chr | Log Fold Change | Average Expression | Moderated T Statistic | P Value | Adjusted P Value |
|---|---|---|---|---|---|---|---|
| ENSG00000253230 | LINC00599,RNCR3 | 8 | 1.777 | −2.655 | 11.439 | 5.87E-27 | 5.46E-23 |
| ENSG00000227240 | RP11-563D10.1 | 1 | 0.426 | −0.222 | 4.912 | 1.24E-06 | 7.52E-04 |
| ENSG00000230817 | LINC01362 | 1 | 0.553 | −0.903 | 4.319 | 1.91E-05 | 6.99E-03 |
| ENSG00000237011 | RP11-98G13.1 | 1 | 0.399 | −0.408 | 4.099 | 4.88E-05 | 1.49E-02 |
| ENSG00000267453 | AC004791.2 | 19 | 0.685 | −1.743 | 4.021 | 6.72E-05 | 1.94E-02 |
| ENSG00000254275 | LINC00824, LINC01263 | 8 | 0.407 | 1.028 | 4.013 | 6.96E-05 | 1.98E-02 |
| ENSG00000227508 | LINC01624,TCONS_00011425 | 6 | −0.225 | 1.356 | −3.948 | 9.07E-05 | 2.22E-02 |
Fig. 2Comparison of effect sizes between differential expression analyses of: 1) current versus former smoking (defined as smoking cigarettes within the past month) on the x-axis and 2) acute smoking (defined as smoking cigarettes within the past 2 h) on the y-axis. Each point represents a differentially expressed gene in the current smoking analysis (n = 171). Blue dots represent 6 genes significantly differentially expressed between those who have smoked within the past 2 h and those who have not (Bonferroni corrected for 171 genes). The red line is a fitted via lowess smoothing. The labeled gene (SIGLEC1) shows an opposite direction of effect between the 2 analyses (logFC current smoking = 0.51, logFC acute smoking = −0.36)
Differential exon usage (DEU) between current and former smokers. Significant DEU is defined as an adjusted p-value <0.05 from limma exon-based T statistic and DEXseq p-value <0.05.Gene-based P value is unadjusted from limma differential expression analysis
| Ensembl Exon ID | Gene Symbol | Limma Adjusted | DEXseq | Gene-based | Transcripts Containing Exon | Exon Number in Transcript |
|---|---|---|---|---|---|---|
| ENSE00001810132 | EPS15 | 4.00E-02 | 2.00E-02 | 8.16E-01 | ENST00000478657 | first |
| ENSE00002071373 | GALNT7 | 2.22E-03 | 1.66E-06 | 1.30E-01 | ENST00000502407 | first |
| ENSE00001444573 | SASH1a | 1.80E-09 | 2.39E-03 | 4.22E-21 | ENST00000367467 | last |
| ENSE00001400828 | AREL1 | 1.27E-02 | 1.18E-03 | 9.98E-01 | ENST00000356357 | last |
| ENSE00001444981 | UTRN | 5.72E-16 | 8.82E-02 | 1.75E-01 | ENST00000367545 | last |
| ENSE00001635177 | MAN1A2 | 4.00E-03 | 3.25E-02 | 9.48E-03 | ENST00000356554 | second |
| ENSE00001231507 | LRRN3a | 3.23E-10 | 4.69E-04 | 4.19E-36 | ENST00000308478 | first |
| ENSE00001175333 | ERAP1 | 7.87E-12 | 1.27E-03 | 4.27E-01 | ENST00000296754 | first |
| ENSE00001641703 | ERAP1 | 1.19E-06 | 3.63E-03 | 4.27E-01 | ENST00000443439 | first |
aDifferentially expressed gene
Fig. 3Exon usage in the GALNT7 gene by smoking status. The top plot shows exon usage for each analyzed exon by smoking status (red = former, blue = current). One exon showed significant differential usage between current and former smokers (ENSE00002071373). The bottom table maps tested exons to known isoforms (1 = exon present in that transcript, 0 = exon not present in that transcript)