| Literature DB >> 21699700 |
Franc Llorens1, Manuela Hummel, Xavier Pastor, Anna Ferrer, Raquel Pluvinet, Ana Vivancos, Ester Castillo, Susana Iraola, Ana M Mosquera, Eva González, Juanjo Lozano, Matthew Ingham, Juliane C Dohm, Marc Noguera, Robert Kofler, Jose Antonio del Río, Mònica Bayés, Heinz Himmelbauer, Lauro Sumoy.
Abstract
BACKGROUND: Epidermal Growth Factor (EGF) is a key regulatory growth factor activating many processes relevant to normal development and disease, affecting cell proliferation and survival. Here we use a combined approach to study the EGF dependent transcriptome of HeLa cells by using multiple long oligonucleotide based microarray platforms (from Agilent, Operon, and Illumina) in combination with digital gene expression profiling (DGE) with the Illumina Genome Analyzer.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21699700 PMCID: PMC3141672 DOI: 10.1186/1471-2164-12-326
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Microarray interplatform analysis. (A) Overlap of unique and named genes shared among the 3 microarray platforms used in this study. The pool of 17070 shared genes was used for further cross-platform analysis. The total numbers of genes for each platform and for all platforms combined are indicated. (B) Overlap of significantly regulated genes at 6 h after EGF treatment considering each of the 3 microarray platforms independently.
Figure 2GSEA analysis on significantly regulated gene sets across microarray platforms. Profile of the Running ES Score & Positions of Gene Set Members on the Rank Ordered List using 6 h EGF treatment data according to each of the three microarray platforms. In each panel, the vertical black lines indicate the position of each of the genes of the tested gene set in the reference data set (ranked by average of the three respective EGF versus control log2ratios of replicate experiments). The green curve plots the ES (enrichment score), which is the running sum of the weighted enrichment score obtained from GSEA software. Within each queried gene set, the farther the position of a gene to the left (red) implies a higher correlation with EGF up-regulated genes in the reference platform, and the farther to the right (blue) implies a higher correlation with genes down-regulated upon EGF treatment in the reference platform. Studied gene sets correspond to lists of up- or down-regulated genes in each platform at 6 h of EGF treatment. Significantly enriched data sets are defined according to GSEA default settings (p < 0.001 and a false discovery rate (FDR) < 0.25). R.L.M = ranked list metric.
Deep tag sequencing statistics
| total reads (6 runs) | 54072498 | 14042145 | 6265092 | 5244975 | 54072498 | 100.00% | 9012083.00 | |
| unambiguous (6 runs) | 24940641 | 3665152 | 779972 | 263786 | 29649551 | 54.83% | 100.00% | 4941591.83 |
| 0 mismatches | 18084026 | 286158 | 5470 | 95713 | 18471367 | 62.30% | 3078561.17 | |
| 1 mismatch | 3913853 | 1961500 | 102902 | 87180 | 6065435 | 20.46% | 1010905.83 | |
| 2 mismatches | 2942762 | 1417494 | 671600 | 80893 | 5112749 | 17.24% | 852124.83 | |
| ambiguous (6 runs) | 15083422 | 4111901 | 240145 | 410907 | 19846375 | 36.70% | 3307729.17 | |
| no matches (6 runs) | 14042145 | 6265092 | 5244975 | 4570282 | 4570282 | 8.45% | 761713.67 | |
Summary statistics of the mapping of reads generated by the DGE pipeline.
Figure 3Microarray versus DGE analysis. (A) Overlap of unique and named genes shared among the 3 microarray platforms and genes detected by DGE. The pool of 14645 shared genes was used for further cross-platform analysis. The total numbers of genes for each platform and for all platforms combined are indicated. (B) Overlap of significantly regulated genes considering the 3 microarray platforms at 6 h after EGF treatment and the genes found regulated after assessing significance by grouping microarray and DGE data in a RankProd analysis. Left panels show up-regulated genes and right panels show down-regulated genes.
Figure 4Correlation between microarrays and Illumina GA-I sequencing. (A) Comparison of estimated log2ratios from DGE (Y-axis) and the mean of all microarray platforms (X-axis). We consider only genes that were interrogated using all platforms and genes with a mean number of counts across lanes greater than 0. Genes with counts greater than 32 reads (colored red or green) or less than (black) 32 reads in at least one sample are shown. (Red dots) Genes called differentially expressed based on DGE data at an 10% FDR by RankProd. (Green dots) Genes not called as differentially expressed but above 32 counts. (Inset box) Correlation between technologies is higher when considering genes above the 32 count detection level (0.57) than when all genes are included (0.49). (B-C) Concordance at the top (CAT) plots of the different platforms with the 500 top genes from a reference platform, shown for Agilent in (B) and DGE in (C). See inset box for color codes identifying each platforms compared to the remaining platform used as reference. (D) Correlation plots with regression lines between log2ratios of the five high content platforms measurements (Y-axis) and quantitative real time PCR results using SYBR green assays (X-axis), based on measurements for 21 genes at the 6 h time point (see Additional file 2, Table S1).
Figure 5Top regulated genes derived from meta-analysis. RankProd analysis of the combination of microarray and Illumina GA-I ultrasequencing data sets. Heatmap of the top 50 up and down-regulated genes detected in all four platforms ordered by Median Fold Change (all have RankProd adjusted p-values < 0.0001). IL11, IL8, PLAUR, ANXA10 and FOS were validated by RT-qPCR showing concordant results (See Additional file 2, Table S1). The full RankProd matrix from these experiments is accessible in Additional file 6, Table S5. The list of all 1164 significantly regulated genes (median |FC| > 1.2 and RankProd q-value < 0.05) is given in Additional file 7, Table S6.
Figure 6Significant pathways and interactions among EGF-regulated geneset. (A) Core functional analysis of EGF-regulated genes derived from the RankProd test clustering around canonical pathways performed using the Ingenuity Pathway Analysis software. (B) Pathway analysis based on the Ingenuity Pathway Knowledge base. The two best ranked networks holding EGF-regulated genes derived from the RankProd test were merged showing a unique network. Up-regulated genes are indicated in red and down-regulated genes in green. The shape of the node denotes the main function of the protein encoded by the gene (see boxed inset). Continuous lines indicate interaction between the products of the genes; dashed lines indicate an indirect interaction; lines with an arrow indicate that the source gene "acts on" the target gene. Regulated genes are shown as grey boxes and non-regulated but associated with the regulation of some of these genes are shown as white nodes. Orange lines indicate new gene relationships appearing after merging different networks.
Functional analysis of differentially expressed EGF responsive genes.
| TOP NETWORKS | ||
|---|---|---|
| Associated Network Functions | Focus Molecules | Score |
| 1. Cell Death, Embryonic Development, Renal and Urological Disease | 28 | 45 |
| 2. Amino Acid Metabolism, Post-Translational Modification, Small Molecule Biochemistry | 26 | 40 |
| 3. Cell Cycle, Cancer, Cardiovascular System Development and Function | 24 | 35 |
| 4. Cellular Growth and Proliferation, Hematological System and Connective Tissue Development and Function | 24 | 35 |
| 5. Cellular Movement, Cellular Assembly and Organization, Cell-to-Cell Signaling and Interaction | 24 | 35 |
| 1. Cell Death | 7.41e-19-6.45e-04 | 145 |
| 2. Cell Growth and Proliferation | 1.77e-16-6.15e-04 | 160 |
| 3. Cellular Movement | 3.16e-12-6.50e-04 | 101 |
| 4. Cellular Development | 9.85e-11-6.28e-04 | 115 |
| 5. Cell Cycle | 1.23e-10-6.63e-04 | 75 |
| 1. Cancer | 1.82e-17-6.63e-04 | 193 |
| 2. Reproductive System Disease | 5.14e-15-6.57e-04 | 98 |
| 3. Immunological Disease | 1.09e-10-6.63e-04 | 70 |
| 4. Dermatological Disease and Conditions | 1.59e-08-3.33e-04 | 61 |
| 5. Inflammatory Disease | 1.59e-08-5.61e-04 | 58 |
List of Ingenuity Networks and Biological Functions generated by mapping the 1164 focus molecules that were differentially expressed during EGF treatment according to RankProd.
Figure 7Higher order network of interactions among EGF-regulated genes. Network of genomic interactions among EGF-regulated pathways (Holm-adjusted p-value < 0.01) as defined by GlobalAncova using KEGG database functional annotation. Nodes (pathways) that have at least two regulated genes (as defined by RankProd analysis) in common with other pathways are connected by continuous lines to these other pathways. The strength of each pathway interconnection (i.e. the number of shared regulated genes) is expressed by the width of the continuous lines connecting the two nodes. The node color indicates the interconnectivity of the nodes ranging from no connection to any other pathway (white) to many connections with other pathways (red). Numbers define KEGG categories as listed in Table 3.
Functional analysis of EGF responsive pathways.
| ID | name | genes | regulated genes | connections | |
|---|---|---|---|---|---|
| 1 | 05200 | Pathways in cancer | 265 | 31 | 139 |
| 2 | 04510 | Focal adhesion | 156 | 17 | 97 |
| 3 | 04660 | T cell receptor signaling pathway | 80 | 7 | 69 |
| 4 | 05215 | Prostate cancer | 74 | 7 | 65 |
| 5 | 04662 | B cell receptor signaling pathway | 59 | 6 | 63 |
| 6 | 04010 | MAPK signaling pathway | 203 | 25 | 62 |
| 7 | 05210 | Colorectal cancer | 55 | 6 | 61 |
| 8 | 05222 | Small cell lung cancer | 76 | 12 | 60 |
| 9 | 04012 | ErbB signaling pathway | 74 | 9 | 60 |
| 10 | 04722 | Neurotrophin signaling pathway | 104 | 9 | 60 |
| 11 | 05220 | Chronic myeloid leukemia | 64 | 7 | 59 |
| 12 | 04810 | Regulation of actin cytoskeleton | 163 | 14 | 46 |
| 13 | 05214 | Glioma | 55 | 5 | 44 |
| 14 | 05211 | Renal cell carcinoma | 58 | 9 | 44 |
| 15 | 05223 | Non-small cell lung cancer | 51 | 6 | 42 |
| 16 | 05142 | Chagas disease | 71 | 5 | 41 |
| 17 | 04310 | Wnt signaling pathway | 118 | 7 | 41 |
| 18 | 05219 | Bladder cancer | 35 | 5 | 37 |
| 19 | 05140 | Leishmaniasis | 42 | 4 | 36 |
| 20 | 04620 | Toll-like receptor signaling pathway | 59 | 4 | 36 |
| 21 | 04912 | GnRH signaling pathway | 70 | 6 | 35 |
| 22 | 04210 | Apoptosis | 74 | 8 | 32 |
| 23 | 04916 | Melanogenesis | 76 | 3 | 30 |
| 24 | 04630 | Jak-STAT signaling pathway | 94 | 12 | 27 |
| 25 | 05410 | Hypertrophic cardiomyopathy (HCM) | 60 | 7 | 27 |
| 26 | 05414 | Dilated cardiomyopathy | 65 | 7 | 27 |
| 27 | 04115 | p53 signaling pathway | 58 | 11 | 25 |
| 28 | 04621 | NOD-like receptor signaling pathway | 40 | 6 | 25 |
| 29 | 05412 | Arrythmogenic right ventricular cardiomyopathy (ARVC) | 52 | 6 | 25 |
| 30 | 04920 | Adipocytokine signaling pathway | 55 | 6 | 24 |
| 31 | 04120 | Ubiquitin mediated proteolysis | 120 | 15 | 22 |
| 32 | 04370 | VEGF signaling pathway | 57 | 3 | 21 |
| 33 | 04060 | Cytokine-cytokine receptor interaction | 132 | 14 | 18 |
| 34 | 04640 | Hematopoietic cell lineage | 43 | 7 | 16 |
| 35 | 04530 | Tight junction | 97 | 7 | 16 |
| 36 | 04340 | Hedgehog signaling pathway | 39 | 3 | 15 |
| 37 | 04622 | RIG-I-like receptor signaling pathway | 46 | 4 | 15 |
| 38 | 04360 | Axon guidance | 105 | 9 | 14 |
| 39 | 05217 | Basal cell carcinoma | 41 | 1 | 12 |
| 40 | 04144 | Endocytosis | 162 | 13 | 11 |
| 41 | 05014 | Amyotrophic lateral sclerosis (ALS) | 41 | 3 | 8 |
| 42 | 04350 | TGF-beta signaling pathway | 69 | 6 | 6 |
| 43 | 00561 | Glycerolipid metabolism | 38 | 4 | 5 |
| 44 | 00564 | Glycerophospholipid metabolism | 59 | 4 | 5 |
| 45 | 00600 | Sphingolipid metabolism | 27 | 4 | 4 |
| 46 | 04070 | Phosphatidylinositol signaling system | 66 | 4 | 4 |
| 47 | 00562 | Inositol phosphate metabolism | 50 | 3 | 3 |
| 48 | 00565 | Ether lipid metabolism | 20 | 1 | 3 |
| 49 | 05020 | Prion diseases | 22 | 3 | 3 |
| 50 | 04710 | Circadian rhythm - mammal | 20 | 4 | 2 |
| 51 | 00601 | Glycosphingolipid biosysnthesis - lacto and neolacto series | 21 | 2 | 0 |
| 52 | 00532 | Glycosaminoglycan biosynthesis - chondroitin sulphate | 17 | 2 | 0 |
| 53 | 00760 | Nicotinate and nicotinamide metabolism | 17 | 4 | 0 |
| 54 | 00750 | Vitamin B6 metabolism | 5 | 1 | 0 |
| 55 | 00790 | Folate biosynthesis | 9 | 0 | 0 |
List of GlobalAncova derived differentially expressed KEGG functions upon EGF treatment indicating the total number of genes, the number of regulated genes and the number of connections (shared regulated genes) to other pathways. The number identifying each KEGG category are the same used for the nodes in the graph on Figure 7.