| Literature DB >> 20822536 |
Kristian Ovaska1, Marko Laakso, Saija Haapa-Paananen, Riku Louhimo, Ping Chen, Viljami Aittomäki, Erkka Valo, Javier Núñez-Fontarnau, Ville Rantanen, Sirkku Karinen, Kari Nousiainen, Anna-Maria Lahesmaa-Korpinen, Minna Miettinen, Lilli Saarinen, Pekka Kohonen, Jianmin Wu, Jukka Westermarck, Sampsa Hautaniemi.
Abstract
BACKGROUND: Coordinated efforts to collect large-scale data sets provide a basis for systems level understanding of complex diseases. In order to translate these fragmented and heterogeneous data sets into knowledge and medical benefits, advanced computational methods for data analysis, integration and visualization are needed.Entities:
Year: 2010 PMID: 20822536 PMCID: PMC3092116 DOI: 10.1186/gm186
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Figure 1Schematic of the Anduril platform. Anduril is an extensible framework for analyzing large-scale data sets using workflows. Elementary analysis and reporting methods, as well as connections to external databases, are implemented as reusable Anduril components. Components can utilize libraries such as Bioconductor and Weka and are not limited to a particular programming language. Components are then wired into custom workflows, which implement complete analyses that take complex high-throughput data as input and automatically produce comprehensive final reports as result. Reports include generated web sites that show the most relevant features of genes at a glance, and detailed figures and tables produced by analysis methods such as Kaplan-Meier analysis, Gene Ontology enrichment, and so on. Analysis workflows and their parameters are also documented in reports.
Analyses performed and corresponding TCGA glioblastoma data sets
| Primary analysis | TCGA dataset(s) |
|---|---|
| Differentially expressed genes | Gene expression |
| Differentially expressed transcripts (DETs) | Exon expression |
| Differentially expressed miRNAs | miRNA expression |
| Survival affecting germline SNPs | Blood SNP arrays, clinical data |
| Survival affecting DETs | Exon expression, clinical data |
| Survival affecting differentially expressed miRNAs | miRNA expression, clinical data |
| Chromosomal aberrations (amplification and deletion) | aCGH |
| Integration of differential expression and chromosomal aberrations | aCGH, exon expression |
| DNA methylation | Methylation arrays |
Figure 2Example of Anduril-generated result website and links to external sources. Anduril generates a browsable website based on analysis results. (a) A screenshot of the gene level view of the data. The genes are sorted according to the survival P-value on the exon platform. The data are divided into 13 fields corresponding to analysis results and data sources. For example, the field 'GeneExpression' illustrates fold changes between GBM and control samples using data from gene expression microarrays. Exon array values are computed at the gene ('MedianExonExpression') and transcript levels ('TranscriptExpression'). For the transcript data the minimum and maximum transcript expression values show GBM-specific alternative splice variant candidates. The fields 'TranscriptExpression:Survival' and 'MedianExonExpression:Survival' show survival analysis P-values for the best transcript and gene in the exon arrays, whereas 'SNPSurvival' contains P-values for the survival associated SNPs. The green color for 'GeneExpression', 'FoldChange', 'Min', 'Max', 'Gain', 'Loss' and 'Methylation' denote downregulation and red denotes upregulation. The red color for P-values for the fields 'Survival', 'SNPSurvival' and 'ExonIntegration' denotes low P-values. (b) A web page that opens after clicking the gene MSN. This page contains detailed results and external links. (c, d) Clicking 'GeneName' opens a website in Genecards [28] (c), and 'GeneID' connects to Ensembl [29] (d). (e) Clicking 'Protein Interactions' opens a page listing known protein-protein interactions in PINA [27]. (f) Clicking an entry in 'KEGG pathway' allows accessing pathways at the KEGG [26] website. (g) Each splice variant is listed separately and if the survival P-value is < 0.01, the users can view the Kaplan-Meier curves. The groups '1', '-1' and '0' denote overexpression, underexpression (not shown for MSN) and stable expression, respectively ('-1' is not present in the figure). The dotted lines are 95% confidence intervals.
Functional siRNA screening data for 11 GBM survival-associated genes in the A172 glioblastoma cell line
| Symbol | Description | siRNA name | CTG | Caspase | Survival | Expression |
|---|---|---|---|---|---|---|
| AllStars Cell Death Vontrol siRNA (Cell death ctrl) | NA | NA | ||||
| AllStars Negative Control siRNA (siNEG) | 0.00 | 0.00 | NA | NA | ||
| Kinesin family member 11 | KIF11_7 | NA | NA | |||
| Polo-like kinase 1 | PLK1_7 | 0.31 | NA | NA | ||
| Filamin C, gamma | FLNC_2 | -0.89 | 0.000189 | 2.51 | ||
| FLNC_5 | -0.37 | 0.29 | ||||
| FLNC_6 | -0.57 | -0.94 | ||||
| FLNC_7 | 0.49 | |||||
| H19, imprinted | H19_1 | 0.38 | -0.14 | 0.000588 | 3.54 | |
| maternally expressed | H19_2 | 1.17 | ||||
| transcript (non-protein | H19_3 | |||||
| coding) | H19_4 | |||||
| Histone cluster 1, H4l | HIST1H4L_1 | 1.63 | -0.60 | 0.001560 | 5.01 | |
| HIST1H4L_2 | -2.00 | |||||
| HIST1H4L_5 | -0.36 | -1.19 | ||||
| HIST1H4L_7 | -0.97 | |||||
| KIAA0040 | KIAA0040_11 | -0.64 | 0.000867 | 2.63 | ||
| KIAA0040_12 | 0.63 | -0.83 | ||||
| LOC100129443_3 | 0.69 | -0.71 | ||||
| LOC100129443_4 | -0.31 | -0.30 | ||||
| Lactotransferrin | LTF_1 | -1.74 | 0.22 | 0.001570 | 3.95 | |
| LTF_2 | -0.03 | 0.07 | ||||
| LTF_5 | -1.04 | 0.44 | ||||
| LTF_6 | -1.18 | -0.79 | ||||
| Nicotinamide N- | NNMT_5 | -0.91 | -1.29 | 0.000074 | 7.41 | |
| methyltransferase | NNMT_6 | -0.48 | -1.87 | |||
| NNMT_7 | -1.26 | -0.32 | ||||
| NNMT_8 | -0.30 | -0.10 | ||||
| Periostin, osteoblast | POSTN_1 | 0.24 | -1.16 | 0.001950 | 14.9 | |
| specific factor | POSTN_2 | 0.28 | ||||
| POSTN_6 | 0.56 | |||||
| POSTN_7 | 0.52 | 0.12 | ||||
| Transgelin 2 | TAGLN2_10 | -1.74 | 0.001010 | 4.21 | ||
| TAGLN2_11 | 0.48 | |||||
| TAGLN2_8 | -0.43 | 0.45 | ||||
| TAGLN2_9 | -1.94 | |||||
| TIMP metallopeptidase | TIMP1_2 | -0.09 | -0.96 | 0.000109 | 3.21 | |
| inhibitor 1 | TIMP1_4 | 1.07 | 0.88 | |||
| TIMP1_5 | 0.56 | -1.87 | ||||
| TIMP1_6 | -0.08 | 0.41 | ||||
| Moesin | MSN_8 | -1.68 | 1.61 | 0.000028 | 3.42 | |
| MSN_9 | -1.05 | |||||
| MSN_5 | -0.36 | -1.57 | ||||
| MSN_1 | 0.80 | |||||
| Cyclin-dependent kinase inhibitor 2A | NA (gene deleted in A172) | |||||
Cell proliferation (CTG) and induction of caspase-3 and -7 activities (Caspase) were assayed after transfection of A172 cells with four siRNAs against each gene. Z-scores from the proliferation and caspase-3/7 assays are presented, centered on the scramble siRNA. Values in bold diverge by more than two standard deviation units from the median of scramble negative control siRNA and are considered significant. For each gene, the best survival P-value (Survival) and the corresponding fold change in the exon array (Expression) are given.
Figure 3Functional effects of knocking out . Four MSN targeting siRNAs at a final concentration of 13 nM were transfected with Silenfect (BioRad) transfection reagent to A172, LN405 and U87MG glioma cell lines and the SVGp12 control cell line. (a) Cell proliferation was assayed 72 h after transfection using CellTiter-Glo Cell Viability assay. (b) Induction of caspase-3 and -7 activities was detected 48 h after transfection with homogeneous Apo-ONE assay (Promega). Loess normalized signals from the proliferation and caspase-3/7 assays are presented as relative scores to the mean of lipid-containing wells. Significant P-values < 0.05*, < 0.01** and < 0.001*** calculated by t-test are shown. Error bars indicate standard error of the mean (SEM).