| Literature DB >> 18823535 |
Atsushi Niida1, Andrew D Smith, Seiya Imoto, Shuichi Tsutsumi, Hiroyuki Aburatani, Michael Q Zhang, Tetsu Akiyama.
Abstract
BACKGROUND: Microarray technology has unveiled transcriptomic differences among tumors of various phenotypes, and, especially, brought great progress in molecular understanding of phenotypic diversity of breast tumors. However, compared with the massive knowledge about the transcriptome, we have surprisingly little knowledge about regulatory mechanisms underling transcriptomic diversity.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18823535 PMCID: PMC2572072 DOI: 10.1186/1471-2105-9-404
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schema of our method. We first calculate correlations between phenotypes and expression values as meta-expression values, while preparing a sequence feature table by searching promoter sequences for cis-regulatory motifs. Cis-regulatory motif data are prepared from two different sources: already known motifs, which are downloaded from databases, and de novo identified motifs, which were discovered by an ab initio motif finder program, DME. Then, associations between sequence features and meta-expression values were inferred by structure learning of Bayesian networks.
Figure 2Sequence features associated with differential expression between G1 and G3 breast tumors.
Motif associated with histological grades or prognosis identified based on independent datasets
| JSP$NF_Y(20) | 20 | 3.1 × 10-10 | 0.000158 | |
| V$NRF1_Q6(10) | 15 | 3.09 × 10-14 | 6.02 × 10-7 | |
| motifs associated with histological grades based on the data by Sotiriou | V$ELK1_02(20) | 12 | 9.25 × 10-26 | 1.41 × 10-6 |
| DME$CTTCCGSYN(5) | 9 | 5.71 × 10-14 | 6.82 × 10-5 | |
| V$E2F1_Q4_01(5) | 7 | 5.71 × 10-15 | 0.002372 | |
| JSP$NF_Y(10) | 15 | 2.46 × 10-14 | 0.011049 | |
| DME$RMSYSSARGCGC(5) | 11 | 4.02 × 10-5 | 0.063412 | |
| V$ELK1_02(10) | 10 | 2.03 × 10-16 | 2.08 × 10-7 | |
| motifs associated with prognosis based on the data by Sotiriou | DME$YYYGSGCMYGCG(5) | 8 | 1.65 × 10-9 | 0.008054 |
| V$E2F1_Q4_01(10) | 8 | 1.05 × 10-17 | 2.37 × 10-5 | |
| V$IRF_Q6_01(10) | 7 | 2.06 × 10-8 | 0.000152 | |
| DME$NMSTTCYKSYR(5) | 6 | 0.000669 | 0.084446 | |
| V$NRF1_Q6(20) | 6 | 9.02 × 10-22 | 1.31 × 10-6 | |
| JSP$NF_Y(20) | 22 | 5.93 × 10-8 | 0.01116 | |
| motifs associated with histological grades based on the data by Pawitan | V$E2F1_Q4_01(5) | 10 | 6.56 × 10-7 | 0.049423 |
| DME$RCRKGCGCAVN(5) | 6 | 5.71 × 10-8 | 0.060899 | |
| V$E2F1_Q4_01(15) | 6 | 9.59 × 10-6 | 0.017285 | |
| V$ELK1_02(20) | 16 | 1.26 × 10-27 | 6.13 × 10-12 | |
| V$NRF1_Q6(15) | 11 | 9.2 × 10-23 | 3.89 × 10-7 | |
| motifs associated with prognosis based on the data by Pawitan | V$NRF1_Q6(20) | 11 | 4.31 × 10-22 | 2.49 × 10-7 |
| V$ELK1_02(15) | 9 | 1.63 × 10-25 | 6.41 × 10-11 | |
| DME$RCGCHKGCGY(5) | 6 | 3.23 × 10-20 | 4.8 × 10-6 | |
IDs starting from "V$", "JSP$", and "DME$" Motifs denote motifs from the TRANSFAC database, the JASPAR database, and our DME analysis, respectively, followed by values of the threshold parameter for motif searches in parentheses.
The number of appearances of sequence feature in 30 searches with bootstrap resampling.
P values calculated by Wilcoxon rank sum tests for training and test data, respectively.
Figure 3Dependency of differential expression between G1 and G3 breast tumors on sequence features. Genes are divided into five groups based on patterns of four sequence features, V$ELK1_02(20), V$E2F1_Q4_01(10), V$NRF1_Q6(10) and JSP$NF_Y(10) (the left red boxes indicates the presence of sequence features). The distributions of their differential expression values between G1 and G3 are displayed using box plots.
Figure 4Sequence features associated with the correlation value calculated for breast cancer prognosis.
Figure 5Dependency of the correlation value with breast cancer prognosis on sequence features. Genes are divided into five groups based on patterns of four sequence features, V$ELK1_02(5), V$E2F1_Q4_01(10), V$NRF1_Q6(15) and JSP$NF_Y(10) (the left red boxes indicates the presence of sequence features). The distributions of their correlation value with breast cancer prognosis are displayed using box plots.