| Literature DB >> 23537407 |
Luka A Clarke1, Lisete Sousa, Celeste Barreto, Margarida D Amaral.
Abstract
BACKGROUND: Microarray studies related to cystic fibrosis (CF) airway gene expression have gone some way in clarifying the complex molecular background of CF lung diseases, but have made little progress in defining a robust "molecular signature" associated with mutant CFTR expression. Disparate methodological and statistical analyses complicate comparisons between independent studies of the CF transcriptome, and although each study may be valid in isolation, the conclusions reached differ widely.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23537407 PMCID: PMC3637641 DOI: 10.1186/1465-9921-14-38
Source DB: PubMed Journal: Respir Res ISSN: 1465-9921
Identity of nasal epithelial cell samples used in microarray analysis
| | | ||||
| F | 9 | 5.1 | 1.70 L (88.7%) | 1.39 L (84.6%) | |
| M | 17 | 9.8 | 4.16 L (90.6%) | 3.36 L (88.5%) | |
| M | 16 | 6.5 | 2.65 L (60.7%) | 1.43 L (39.7%) | |
| F | 10 | 4.2 | 0.60 L (38.3%) | 0.60 L (43.8%) | |
| F | 18 | 4.3 | 3.00 L (78.5%) | 2.28 L (70.7%) | |
| - | | | |||
| | | ||||
| F | 16 | 1.6 | n/a | n/a | |
| M | 16 | 3.7 | n/a | n/a | |
| M | 14 | 4.4 | n/a | n/a | |
| M | 14 | 8.1 | n/a | n/a | |
| F | 14 | 5.7 | n/a | n/a | |
| - | |||||
Neither age (p = 0.69, student’s t test) nor percentage of inflammatory cells per sample (p = 0.42, student’s t test) showed any significant difference between groups. Data on lung function were obtained for CF patients only: percentage of expected function for relevant age group shown in brackets.
Primers used in qRT-PCR amplification
| ACTB (Ref. Gene) | Fwd: CTCTTCCAGCCTTCCTTCCT | N/A | 116 bp | 100% |
| Rev: AGCACTGTGTTGGCGTACAG | ||||
| ADM | Fwd: ATGAAGCTGGTTTCCGTCG | 4501945a1 | 112 bp | 100% |
| Rev: GCCCACTTATTCCACTTCTTTCG | ||||
| AQP9 | Fwd: CTGCAACCGTCTTTGGCATTT | 10280624a3 | 118 bp | 106% |
| Rev: AGATACGGAGCTGGGTATGTT | ||||
| AREG | Fwd: GTGGTGCTGTCGCTCTTGATA | 4502199a1 | 171 bp | 97% |
| Rev: ACTCACAGGGGAAATCTCACT | ||||
| GAPDH (Ref. Gene) | Fwd: CATGAGAAGTATGACAACAGCCT | 7669492a3 | 113 bp | 97% |
| Rev: AGTCCTTCCACGATACCAAAGT | ||||
| GJA1 (Connexin 43) | Fwd: GTGCCTGAACTTGCCTTTTC | N/A | 165 bp | 98% |
| Rev: CCCTCCAGCAGTTGAGTAGG | ||||
| IGFBP3 | Fwd: AGAGCACAGATACCCAGAACT | 4504617a2 | 105 bp | 100% |
| Rev: TGAGGAACTTCAGGTGATTCAGT | ||||
| NDRG1 | Fwd: TCGAGACTTTACATGGCTCTGT | 207028746b2 | 93 bp | 106% |
| Rev: TCATGCCGATGTCATGGTAGG | ||||
| SCGB1A1 (Uteroglobin) | Fwd: TTCAGCGTGTCATCGAAACCC | 4507809a1 | 189 bp | 100% |
| Rev: ACAGTGAGCTTTGGGCTATTTTT | ||||
| SPAG6 | Fwd: GTAAGGTGCTGCCGCATGATA | 6912678a1 | 152 bp | 100% |
| Rev: CCTCACTATTTCCTCGGGGTA | ||||
| TEKT1 | Fwd: CAGATTCGGATGAACCGCTCT | 16753231a3 | 140 bp | 103% |
| Rev: CTCACGGCGTTCTCAGAATATC | ||||
| TMEM45a | Fwd: GTTCACTTCCTGTGTCCTTAACC | 8922242b2 | 95 bp | 97% |
| Rev: CATTTCCCGGCCATGAGTGT |
Forward and reverse primers for amplification of the indicated genes were found at Harvard Primerbank (primer IDs shown; N/A indicates in-house primer design). Product sizes and primer efficiencies for Pfaffl calculations are also shown.
Summary of independent microarray experiments compared in the present study
| Clarke (this paper) | Native nasal epithelium (brushings) | 5 CF (F508del homoz.) | Affymetrix Custom HsAirwaya520108F |
| Virella-Lowell | Isogenic bronchial cells (IB3-1 and S9) | F508del/W1282X vs. WT-CFTR corrected: 3 technical replicates each | Affymetrix U95Av2 |
| Zabner | Primary tracheal and bronchial cell cultures | 10 CF (F508del homoz.) | Affymetrix HGU-133A |
| Wright | Native nasal epithelium (brushings) | 4 CF (F508del homoz.) vs. 12 controls | Affymetrix HGU-133A,B |
| Verhaeghe | Fetal tracheal cells (CFT-2 and NT-1) | F508del homoz. | Affymetrix HGU-133Plus2 |
| Ogilvie et al., 2011 [ | Native bronchial (and nasal) epithelium (brushings) | F508del homoz. (in most cases) | Illumina HumanRef-8 v1 Expression BeadChips |
The table highlights the diversity of experimental design, materials and microarray platforms used in the five independent studies used for comparison with our data. (1)We used the genelists resulting from Hampton & Stanton’s 2010 [21] reanalysis of these four data sets for our comparison. (2)For our preliminary comparison with this study we used the published genelist for bronchial samples only. For our secondary comparison we reanalysed the data from both bronchial and nasal samples. (3)The gene list used here represents the “mild” CF samples from this study only.
Figure 1Visualization of Microarray data. A) R-I plot, in which the log2 of the ratio of mean intensities in CF-vs.-non CF samples is plotted against the log of their product, demonstrating the absence of an intensity dependent fold-change bias. The regulated genes chosen for further analysis using the RP statistic (p < 0.0001 cut-off) are shown in red (up-regulated genes) and green (down-regulated genes). B) Dendrogram of clustered samples, in which CF and non CF samples cluster with respect to their phenotype. C) Volcano plot of microarray data, in which the Log2 of the RP statistic is plotted against Log2 of Fold Change. The p < 0.0001 cut-off (horizontal dotted line) illustrates the gene list chosen for further analysis. Genes outside the vertical dotted lines have more than 2-fold differential expression.
Functional enrichment from DAVID analysis
| GOTERM_BP_FAT (Biological process) | GO:0008285 | 14 | 5.1 | 6.77 E-04 | 3.1 | 1.12 |
| GO:0006953 | 5 | 1.8 | 1.58 E-03 | 9.8 | 2.60 | |
| GO:0008202 | 9 | 3.3 | 4.16 E-03 | 3.5 | 6.70 | |
| GO:0001960 | 3 | 1.1 | 4.23 E-03 | 29.5 | 6.81 | |
| GO:0042127 | 20 | 7.3 | 5.03 E-03 | 2.0 | 8.04 | |
| GO:0010038 | 7 | 2.5 | 5.92 E-03 | 4.3 | 9.40 | |
| GOTERM_ CC_FAT (Cell compartment) | GO:0005929 | 10 | 3.6 | 3.26 E-05 | 6.2 | 0.04 |
| GO:0030286 | 6 | 2.2 | 5.81 E-05 | 14.1 | 0.07 | |
| GO:0005615 | 22 | 8.0 | 1.16 E-04 | 2.6 | 0.15 | |
| GO:0005930 | 6 | 2.2 | 1.46 E-04 | 11.7 | 0.19 | |
| GO:0044421 | 26 | 9.5 | 3.26 E-04 | 2.2 | 0.42 | |
| GO:0042995 | 21 | 7.6 | 4.05 E-04 | 2.4 | 0.52 | |
| GO:0005856 | 31 | 11.3 | 1.63 E-03 | 1.8 | 2.08 | |
| GO:0005875 | 7 | 2.5 | 1.70 E-03 | 5.5 | 2.16 | |
| GO:0044430 | 23 | 8.4 | 3.48E-03 | 1.9 | 4.39 | |
| GO:0035085 | 4 | 1.5 | 3.55 E-03 | 12.8 | 4.48 | |
| GO:0005576 | 39 | 14.2 | 4.51 E-03 | 1.6 | 5.65 | |
| GOTERM_MF_FAT (Molecular Function) | GO:0003777 | 6 | 2.2 | 2.95 E-03 | 6.1 | 4.03 |
| GO:0005509 | 23 | 8.4 | 2.98 E-03 | 2.0 | 4.07 | |
| UP_TISSUE | 63 | 22.9 | 6.08 E-06 | 1.8 | 0.01 | |
| 80 | 29.1 | 2.59 E-03 | 1.3 | 3.06 | ||
| 13 | 4.7 | 7.16 E-03 | 2.4 | 8.26 | ||
| 11 | 4.0 | 7.45 E-03 | 2.7 | 8.58 |
The most enriched GO terms found following submission of our gene list to the DAVID functional annotation tool, showing categories and GO terms enriched, number and percentage of gene lists genes represented in each enriched category, significance level (P), fold enrichment and false discovery rate (FDR).
Figure 2ESR1 transcription targets regulated in CF. Network adapted from Genego Metacore. Grey lozenges indicate genes up-regulated in CF (including ESR1); white lozenges indicate genes down-regulated in CF. Arrows represent effect predicted by Metacore of ESR1 on gene, as shown on key. Gene names shown in bold are those whose direction of regulation in CF would be reinforced by ESR1, based on its predicted effect.
Genes for which differential expression was reanalysed by qRT-PCR in independent nasal cell samples
| | | | | | |
| 202912_at | Adrenomedullin | 2.9 | 1.7 | 0.08 | |
| 205568_at | Aquaporin 9 | 4.9 | 1.8 | 0.41 | |
| 205239_at | Amphiregulin | 3.6 | 1.3 | 0.21 | |
| 201667_at | Gap junction protein alpha 1 or Connexin 43 | 3.4 | 2.7 | 0.22 | |
| 210095_s_at | Insulin-like growth factor binding protein 3 | 2.2 | 1.2 | 0.49 | |
| 200632_s_at | N-myc downstream regulated gene 1 | 2.6 | 2.0 | 0.18 | |
| 219410_at | Transmembrane protein 45A or DERP7 | 3.0 | 1.0 | 0.5 | |
| | | | | | |
| 205725_at | Secretoglobin family 1A member 1 or Uteroglobin | −3.3 | −2.3 | 0.11 | |
| 210033_s_at | Sperm associated antigen 6 | −2.5 | −3.4 | 0.1 | |
| 239216_at | Tektin 1 | −3.3 | −3.0 | 0.12 | |
| | | | | | |
| Actin beta | / | ||||
| Glyceraldehyde 3-phosphate dehydrogenase | / |
Table shows gene symbols and description, Affymetrix probeset ID as represented on the HsAirway array, fold change in CF-vs.-non CF samples for both microarray and qRT-PCR, and p value for qRT-PCR data.
Figure 3Comparison of Microarray and qRT-PCR expression data for selected genes. Comparison of Log2 (fold change) of gene expression in CF vs. Control nasal cell samples as measured by microarray analysis (black) and qRT-PCR (grey) using independent samples (n = 6 CF and 5 Ctrl in each case). Microarray data are log2(fold change) of normalized intensities, and qRT-PCR data are log2(mean fold change). Direction of differential expression was in agreement for all genes except TMEM45a, whose up-regulation in CF was not supported.
Percentages of genes in common among differentially expressed genes from six microarray studies of CF related gene expression
| (N = 300) | (N = 300) | (N = 300) | (N = 300) | (N = 115) | (N = 117) | |
| 0.7 | 0.3 | 1.0 | 1.7 | 1.7 | ||
| 0.7 | 3.7 | 5.0 | 2.6 | 1.7 | ||
| 0.3 | 3.7 | 4.0 | 2.6 | 1.7 | ||
| 1.0 | 5.0 | 4.0 | 14.8 | 4.3 | ||
| 0.7 | 1.0 | 1.0 | 5.7 | 7.7 | ||
| 0.7 | 0.7 | 0.7 | 1.7 | 7.8 | ||
| (N = 300) | (N = 300) | (N = 300) | (N = 300) | (N = 110) | (N = 220) | |
| 6.7 | 5.3 | 3.3 | 2.7 | 0.5 | ||
| 6.7 | 7.0 | 2.7 | 1.8 | 1.4 | ||
| 5.3 | 7.0 | 10.3 | 2.7 | 2.3 | ||
| 3.3 | 2.7 | 10.3 | 0.9 | 3.2 | ||
| 1.0 | 0.7 | 1.0 | 0.3 | 1.4 | ||
| 0.3 | 1.0 | 1.7 | 2.3 | 2.7 | ||
| (N = 300) | (N = 300) | (N = 300) | (N = 300) | (N = 115) | (N = 117) | |
| 1.3 | 1 | 4.3 | 1.8 | 0.9 | ||
| 7 | 6 | 5.7 | 6.4 | 12.7 | ||
| 7.7 | 3.7 | 4.7 | 1.8 | 1.4 | ||
| 3.7 | 12.3 | 13 | 0 | 1.4 | ||
| 1 | 7 | 3.3 | 0 | 1.8 | ||
| 1.3 | 3.7 | 1.7 | 1.3 | 2.7 | ||
| (N = 300) | (N = 300) | (N = 300) | (N = 300) | (N = 110) | (N = 220) | |
| 7 | 7.7 | 3.7 | 2.6 | 3.4 | ||
| 1.3 | 3.7 | 12.3 | 18.3 | 9.4 | ||
| 1 | 6 | 13 | 8.7 | 4.3 | ||
| 4.3 | 5.7 | 4.7 | 0 | 3.4 | ||
| 0.7 | 2.3 | 0.7 | 0 | 2.6 | ||
| 0.7 | 9.3 | 1 | 1 | 3.5 | ||
Percentages are of number of regulated genes in study indicated in horizontal row regulated in the same direction or showing inverted expression in the other five studies as shown at left (Z: Zabner et al., 2005 [17]; W: Wright et al., 2006 [18]; V-L: Virella-Lowell et al., 2004 [16]; V: Verhaeghe et al., 2007 [20]; O: Ogilvie et al., 2011 [19]; C: current study). In each case, ALL refers to total percentage of genes shared or inverted between one study and all five other studies (summarized in Figure 4).
Figure 4Histogram showing percentages of genes in common among lists of differentially expressed genes from six microarray studies of CF related gene expression. Percentages are of number of regulated genes in one study regulated in the same direction (dark columns) or showing inverted expression (light columns) in the other five studies (Z: Zabner et al., 2005; W: Wright et al., 2006; V-L: Virella-Lowell et al., 2004; V: Verhaeghe et al., 2007; O: Ogilvie et al., 2011; C: current study). MEAN refers to mean percentage of all columns (data are summaries of percentages shown in Table 6).
Differentially expressed genes common to two or more of six comparable studies of CF related gene expression
| Zabner et al., 2005 | |
| Wright et al., 2006 | |
| Virella-Lowell et al., 2004 | |
| Verhaeghe et al., 2007 | |
| Ogilvie et al., 2011 | |
Genes shared between two other lists and the present study (ie, differentially expressed in the same direction with respect to CF-vs.-Control cells) are underlined. The lower panel shows genes shared between two or more studies when all six studies are combined. Genes shared with present study (see upper panel) are shown in italics, genes shared between three studies are underlined.
Functional enrichment analysis of genes shared between two or three studies
| GOTERM_BP_FAT | GO:0006952 | 25 | 13.4 | 6.00E-07 | 3.25 | 0.001 |
| GO:0042060 | 14 | 7.5 | 7.37E-07 | 5.87 | 0.001 | |
| GO:0042127 | 28 | 15.1 | 1.37E-06 | 2.85 | 0.002 | |
| GO:0006955 | 25 | 13.4 | 4.51E-06 | 2.90 | 0.008 | |
| GO:0006928 | 20 | 10.8 | 6.96E-06 | 3.37 | 0.012 | |
| GOTERM_CC_FAT | GO:0005576 | 49 | 26.3 | 1.88E-07 | 2.12 | 0.000 |
| GO:0044421 | 29 | 15.6 | 3.41E-06 | 2.63 | 0.004 | |
| GO:0005615 | 23 | 12.4 | 1.01E-05 | 2.92 | 0.013 | |
| GO:0000267 | 27 | 14.5 | 2.17E-04 | 2.17 | 0.275 | |
| GO:0005625 | 13 | 7.0 | 2.48E-04 | 3.61 | 0.314 | |
| GOTERM_MF_FAT | GO:0004857 | 13 | 7.0 | 7.39E-05 | 4.11 | 0.101 |
| GO:0032395 | 4 | 2.2 | 0.001303 | 17.98 | 1.768 | |
| GO:0004866 | 8 | 4.3 | 0.001519 | 4.71 | 2.058 | |
| GO:0030414 | 8 | 4.3 | 0.002074 | 4.47 | 2.800 | |
| GO:0005520 | 4 | 2.2 | 0.00294 | 13.67 | 3.947 | |
| KEGG_PATHWAY | hsa05332: | 8 | 4.3 | 2.41E-06 | 12.42 | 0.003 |
| hsa04940: | 8 | 4.3 | 4.07E-06 | 11.53 | 0.004 | |
| hsa05330: | 7 | 3.8 | 2.08E-05 | 11.77 | 0.023 | |
| hsa04610: | 8 | 4.3 | 1.14E-04 | 7.02 | 0.126 | |
| hsa05416: | 8 | 4.3 | 1.37E-04 | 6.82 | 0.151 | |
| UP_TISSUE | 52 | 28.0 | 5.15E-07 | 2.03 | 0.001 |
The most enriched GO terms found by the DAVID functional annotation tool among the 75 up-regulated and 114 down-regulated genes shared between two or three of the 6 compared studies (see Table 7), showing categories and GO terms enriched, number and percentage of gene lists genes represented in each enriched category, p value, fold enrichment and false discovery rate (FDR).
Figure 5Comparison of independent microarray studies and present study by GSEA. GSEA enrichment plots showing non-significant enrichment of up-regulated genes from A) Ogilvie et al. (2011), B) Verhaeghe et al. (2007), C) Virella-Lowell et al. (2004), and down-regulated genes from D) Wright et al. (2006) at the CF end of our dataset (red bar), and non-significant enrichment of down-regulated genes from E) Ogilvie et al. (2011), and up-regulated genes from F) Wright et al. (2006), at the control end of our dataset (green bar), demonstrating a partial inversion of CF-related gene expression between our study and Wright et al. (2006).
Figure 6Demonstration of pathway enrichment in present study by GSEA. GSEA enrichment plots showing significant enrichment (ES significant at p < 0.05 level) of genesets for A) positive and B) negative regulation of cell proliferation, C) ESR1 targets, D) cell proliferation and E) defence response for the CF phenotype, and F) motor activity for the non CF phenotype. The lower panel shows G) significant enrichment of the NF-kB pathway in the CF phenotype and non-significant skews of H) antigen presentation pathway genes in the non CF phenotype and I) protein ubiquitination genes in the CF phenotype.
Figure 7CF gene signature in native airway tissues. Representations of a putative gene signature in native airway tissues. A) and B) Venn diagrams showing numbers of up-regulated (A), and down-regulated (B) genes shared between the present study (C-N: Clarke nasal) and the reanalyzed study [19] (O-N: Ogilvie nasal; O-B: Ogilvie bronchial [19] in F508del-CFTR expressing vs. control airway epithelial cells. Numbers in brackets refer to total number of genes in each list, and numbers at centre of each diagram refer to the 30 genes shared between all three lists (see Table 9). C) Venn diagram distribution of shared genes among the most significant GO terms for biological process (BP), cell compartment (CC) and molecular function (MF) as revealed by DAVID analysis. D) GeneMania network showing relationships between 30 shared genes (black circles) and other connecting genes (grey circles). Relationships are divided into co-expression (purple lines), co-localization (blue lines), pathway (light blue lines), physical interactions (pink lines) and predicted (orange lines). The red arrow indicates IL1B, which is the most connected gene involved with the highest number of identifiable functions including inflammation, defence, response to bacteria, and cytokine production.
Small molecular signature for native CF airway epithelial cells
| arachidonate 5-lipoxygenase-activating protein; #241 | |
| BCL2-related protein A1; #597 | |
| chemokine (C-X-C motif) receptor 4; #7852 | |
| Fc fragment of IgG, low affinity IIIa, receptor (CD16a); #2214 | |
| FBJ murine osteosarcoma viral oncogene homolog; #2353 | |
| G0/G1switch 2; #50486 | |
| glucosaminyl (N-acetyl) transferase 3, mucin type; #9245 | |
| histone cluster 1, H1c; #3006 | |
| histone cluster 1, H2bk; #85236 | |
| interleukin 1, beta; #3553 | |
| interleukin 1 receptor, type II; #7850 | |
| lipopolysaccharide-induced TNF factor; #9516 | |
| oncostatin M; #5008 | |
| pleckstrin; #5341 | |
| prostaglandin-endoperoxide synthase 2; #5743 | |
| regulator of G-protein signaling 2, 24 kDa; #5997 | |
| S100 calcium binding protein A8; #6279 | |
| S100 calcium binding protein A9; #6280 | |
| serpin peptidase inhibitor, clade A, member 3; #12 | |
| transcobalamin I (vitamin B12 bp, R binder family); #6947 | |
| tropomyosin 4; #7171 | |
| calcyphosine; #828 | |
| carbohydrate (N-acet.gal.am. 4–0) sulfotransferase 9; #83539 | |
| dynein, axonemal, light intermediate chain 1; #7802 | |
| monoamine oxidase B; #4129 | |
| NEL-like 2 (chicken); #4753 | |
| protein phosphatase 1, regulatory subunit 16A; #84988 | |
| protein S (alpha); #5627 | |
| prostaglandin F receptor (FP); #5737 | |
| MOK protein kinase; #5891 | |
Regulated genes shared between our study and reanalysed lists from another recent study [19]. Gene symbols, descriptions and Entrez IDs (#) are shown.