| Literature DB >> 21426557 |
Ting Gong1, Jianhua Xuan, Li Chen, Rebecca B Riggins, Huai Li, Eric P Hoffman, Robert Clarke, Yue Wang.
Abstract
BACKGROUND: Genes work coordinately as gene modules or gene networks. Various computational approaches have been proposed to find gene modules based on gene expression data; for example, gene clustering is a popular method for grouping genes with similar gene expression patterns. However, traditional gene clustering often yields unsatisfactory results for regulatory module identification because the resulting gene clusters are co-expressed but not necessarily co-regulated.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21426557 PMCID: PMC3072956 DOI: 10.1186/1471-2105-12-82
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1A block diagram of the motif-guided sparse decomposition (mSD) approach. The mSD approach consists of the following two steps: (1) transcription factor activity estimation by motif-guided clustering and (2) regulation strength estimation by sparse decomposition.
AUCs of mSD, SD and FastNCA methods, respectively, under different cut-off p-values
| mSD | SD | Fast NCA | |
|---|---|---|---|
| cut-off | 0.6912 | 0.5707 | |
| cut-off | 0.6881 | 0.5891 | |
| cut-off | 0.6801 | 0.5547 |
Twenty six breast cancer and ER-related transcription factors
| V$AP1_Q2_01 | V$AP1_Q4_01 | V$CREBP1CJUN_01 | V$CEBP_Q3 | V$CEBPA_01 |
| V$CEBPGAMMA_Q6 | V$CREB_02 | V$CREB_Q3 | V$CREB_Q2 | V$NFKB_Q6_01 |
| V$SP1_Q6 | V$ER_Q6 | V$ETF_Q6 | V$MYCMAX_03 | V$STAT_Q6 |
| V$STAT_01 | V$EGR1_01 | V$FOXJ2_02 | V$FOXP1_01 | V$MYB_Q3 |
| V$P53_02 | V$PBX_Q3 | V$PBX1_03 | V$NFY_Q6_01 | V$NFY_01 |
| V$CEBPDELTA_Q6 |
Figure 2Transcription factor activity estimated by the mSD approach. (a) Estimated activities of the five transcription factors (AP1, ETF, ER, STAT and NFκB) in estrogen-induced condition. (b) Estimated activities of the five transcription factor bind sites in estrogen-deprived condition.
Target genes of ETF (V$ETF_Q6) in both E2-induced and ER-deprived conditions
| Probe Set ID | GENE_SYMBOL | Gene Name |
|---|---|---|
| 200646_S_AT | NUCB1 | NUCLEOBINDIN 1 |
| 200690_AT | HSPA9 | HEAT SHOCK 70 KDA PROTEIN 9B (MORTALIN-2) |
| 201373_AT | PLEC1 | PLECTIN 1, INTERMEDIATE FILAMENT BINDING PROTEIN 500 KDA |
| 201573_S_AT | ETF1 | EUKARYOTIC TRANSLATION TERMINATION FACTOR 1 |
| 201601_X_AT | IFITM1 | INTERFERON INDUCED TRANSMEMBRANE PROTEIN 1 (9-27) |
| 201753_S_AT | ADD3 | ADDUCIN 3 (GAMMA) |
| 201842_S_AT | EFEMP1 | EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR MATRIX PROTEIN 1 |
| 201910_AT | FARP1 | FERM, RHOGEF (ARHGEF) AND PLECKSTRIN DOMAIN PROTEIN 1 (CHONDROCYTE-DERIVED) |
| 201984_S_AT | EGFR | EPIDERMAL GROWTH FACTOR RECEPTOR (ERYTHROBLASTIC LEUKEMIA VIRAL (V-ERB-B) ONCOGENE HOMOLOG, AVIAN) |
| 202088_AT | SLC39A6 | SOLUTE CARRIER FAMILY 39 (ZINC TRANSPORTER), MEMBER 6 |
| 202235_AT | SLC16A1 | SOLUTE CARRIER FAMILY 16 (MONOCARBOXYLIC ACID TRANSPORTERS), MEMBER 1 |
| 202295_S_AT | CTSH | CATHEPSIN H |
| 202304_AT | FNDC3A | FIBRONECTIN TYPE III DOMAIN CONTAINING 3A |
| 202429_S_AT | PPP3CA | PROTEIN PHOSPHATASE 3 (FORMERLY 2B), CATALYTIC SUBUNIT, ALPHA ISOFORM (CALCINEURIN A ALPHA) |
| 202602_S_AT | HTATSF1 | HIV-1 TAT SPECIFIC FACTOR 1 |
| 202730_S_AT | PDCD4 | PROGRAMMED CELL DEATH 4 (NEOPLASTIC TRANSFORMATION INHIBITOR) |
| 202826_AT | SPINT1 | SERINE PEPTIDASE INHIBITOR, KUNITZ TYPE 1 |
| 202979_S_AT | CREBZF | HCF-BINDING TRANSCRIPTION FACTOR ZHANGFEI |
| 203079_S_AT | CUL2 | CULLIN 2 |
| 203278_S_AT | PHF21A | PHD FINGER PROTEIN 21A |
| 203358_S_AT | EZH2 | ENHANCER OF ZESTE HOMOLOG 2 (DROSOPHILA) |
| 203456_AT | PRAF2 | PRA1 DOMAIN FAMILY, MEMBER 2 |
| 203493_S_AT | CEP57 | CENTROSOMAL PROTEIN 57 KDA |
| 203607_AT | INPP5F | INOSITOL POLYPHOSPHATE-5-PHOSPHATASE F |
| 203855_AT | WDR47 | WD REPEAT DOMAIN 47 |
| 203869_AT | USP46 | UBIQUITIN SPECIFIC PEPTIDASE 46 |
| 204129_AT | BCL9 | B-CELL CLL/LYMPHOMA 9 |
| 204527_AT | MYO5A | MYOSIN VA (HEAVY POLYPEPTIDE 12, MYOXIN) |
| 204629_AT | PARVB | PARVIN, BETA |
| 204710_S_AT | WIPI2 | WD REPEAT DOMAIN, PHOSPHOINOSITIDE INTERACTING 2 |
| 204989_S_AT | ITGB4 | INTEGRIN, BETA 4 |
| 204995_AT | CDK5R1 | CYCLIN-DEPENDENT KINASE 5, REGULATORY SUBUNIT 1 (P35) |
| 205222_AT | EHHADH | ENOYL-COENZYME A, HYDRATASE/3-HYDROXYACYL COENZYME A DEHYDROGENASE |
| 205258_AT | INHBB | INHIBIN, BETA B (ACTIVIN AB BETA POLYPEPTIDE) |
| 206231_AT | KCNN1 | POTASSIUM INTERMEDIATE/SMALL CONDUCTANCE CALCIUM- ACTIVATED CHANNEL, SUBFAMILY N, MEMBER 1 |
| 206574_S_AT | PTP4A3 | PROTEIN TYROSINE PHOSPHATASE TYPE IVA, MEMBER 3 |
| 206604_AT | OVOL1 | OVO-LIKE 1(DROSOPHILA) |
| 207038_AT | SLC16A6 | SOLUTE CARRIER FAMILY 16 (MONOCARBOXYLIC ACID TRANSPORTERS), MEMBER 6 |
| 207844_AT | IL13 | INTERLEUKIN 13 |
| 208296_X_AT | TNFAIP8 | TUMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 8 |
| 208754_S_AT | NAP1L1 | NUCLEOSOME ASSEMBLY PROTEIN 1-LIKE 1 |
| 208876_S_AT | PAK2 | P21 (CDKN1A)-ACTIVATED KINASE 2 |
| 209135_AT | ASPH | ASPARTATE BETA-HYDROXYLASE |
| 209241_X_AT | MINK1 | MISSHAPEN-LIKE KINASE 1 (ZEBRAFISH) |
| 209288_S_AT | CDC42EP3 | CDC42 EFFECTOR PROTEIN (RHO GTPASE BINDING) 3 |
| 209354_AT | TNFRSF14 | TUMOR NECROSIS FACTOR RECEPTOR SUPERFAMILY, MEMBER 14 (HERPESVIRUS ENTRY MEDIATOR) |
| 209736_AT | SOX13 | SRY (SEX DETERMINING REGION Y)-BOX 13 |
| 209872_S_AT | PKP3 | PLAKOPHILIN 3 |
| 209900_S_AT | SLC16A1 | SOLUTE CARRIER FAMILY 16 (MONOCARBOXYLIC ACID TRANSPORTERS), MEMBER 1 |
| 209988_S_AT | ASCL1 | ACHAETE-SCUTE COMPLEX-LIKE 1 (DROSOPHILA) |
| 210184_AT | ITGAX | INTEGRIN, ALPHA X (COMPLEMENT COMPONENT 3 RECEPTOR 4 SUBUNIT) |
| 210513_S_AT | VEGFA | VASCULAR ENDOTHELIAL GROWTH FACTOR |
| 210854_X_AT | SLC6A8 | SOLUTE CARRIER FAMILY 6 (NEUROTRANSMITTER TRANSPORTER, CREATINE), MEMBER 8 |
| 211097_S_AT | PBX2 | PRE-B-CELL LEUKEMIA TRANSCRIPTION FACTOR 2 |
| 211527_X_AT | VEGFA | VASCULAR ENDOTHELIAL GROWTH FACTOR |
| 212375_AT | EP400 | TRINUCLEOTIDE REPEAT CONTAINING 12 |
| 212467_AT | DNAJC13 | DNAJ (HSP40) HOMOLOG, SUBFAMILY C, MEMBER 13 |
| 212594_AT | PDCD4 | PROGRAMMED CELL DEATH 4 (NEOPLASTIC TRANSFORMATION INHIBITOR) |
| 212739_S_AT | NME4 | NON-METASTATIC CELLS 4, PROTEIN EXPRESSED IN |
| 212837_AT | KIAA0157 | KIAA0157 |
| 212878_S_AT | KLC1 | KINESIN 2 |
| 213051_AT | ZC3HAV1 | ZINC FINGER CCCH-TYPE, ANTIVIRAL 1 |
| 213187_X_AT | FTLL1 | FERRITIN, LIGHT POLYPEPTIDE |
| 213271_S_AT | DOPEY1 | DOPEY FAMILY MEMBER 1 |
| 213451_X_AT | TNXB | TENASCIN XB |
| 213505_S_AT | SFRS14 | SPLICING FACTOR, ARGININE/SERINE-RICH 14 |
| 213756_S_AT | HSF1 | HEAT SHOCK TRANSCRIPTION FACTOR 1 |
| 213757_AT | EIF5A | EUKARYOTIC TRANSLATION INITIATION FACTOR 5A |
| 213856_AT | CD47 | CD47 ANTIGEN (RH-RELATED ANTIGEN, INTEGRIN-ASSOCIATED SIGNAL TRANSDUCER) |
| 214095_AT | SHMT2 | SERINE HYDROXYMETHYLTRANSFERASE 2 (MITOCHONDRIAL) |
| 214437_S_AT | SHMT2 | SERINE HYDROXYMETHYLTRANSFERASE 2 (MITOCHONDRIAL) |
| 214697_S_AT | ROD1 | ROD1 REGULATOR OF DIFFERENTIATION 1 (S. POMBE) |
| 215735_S_AT | TSC2 | TUBEROUS SCLEROSIS 2 |
| 216017_S_AT | NAB2 | NGFI-A BINDING PROTEIN 2 (EGR1 BINDING PROTEIN 2) |
| 216080_S_AT | FADS3 | FATTY ACID DESATURASE 3 |
| 216237_S_AT | MCM5 | MCM5 MINICHROMOSOME MAINTENANCE DEFICIENT 5, CELL DIVISION CYCLE 46 (S. CEREVISIAE) |
| 217693_X_AT | LOC388335 | SIMILAR TO RIKEN CDNA A730055C05 GENE |
| 217928_S_AT | SAPS3 | CHROMOSOME 11 OPEN READING FRAME 23 |
| 218807_AT | VAV3 | VAV 3 ONCOGENE |
| 218887_AT | MRPL2 | MITOCHONDRIAL RIBOSOMAL PROTEIN L2 |
| 218889_AT | NOC3L | NUCLEOLAR COMPLEX ASSOCIATED 3 HOMOLOG (S. CEREVISIAE) |
| 219829_AT | ITGB1BP2 | INTEGRIN BETA 1 BINDING PROTEIN (MELUSIN) 2 |
| 220116_AT | KCNN2 | POTASSIUM INTERMEDIATE/SMALL CONDUCTANCE CALCIUM- ACTIVATED CHANNEL, SUBFAMILY N, MEMBER 2 |
| 221014_S_AT | RAB33B | RAB33B, MEMBER RAS ONCOGENE FAMILY |
| 221926_S_AT | IL17RC | INTERLEUKIN 17 RECEPTOR C |
| 222071_S_AT | SLCO4C1 | HYPOTHETICAL PROTEIN PRO2176 |
| 46947_AT | GNL3L | GUANINE NUCLEOTIDE BINDING PROTEIN-LIKE 3 (NUCLEOLAR)-LIKE |
Figure 3Identified target genes of EGFR-specific transcription factor (ETF) in estrogen-induced and estrogen-deprived conditions. (a) Yellow diamond: target genes of ETF; purple circle: direct neighbors of the target genes from protein-protein interaction (PPI) data. (b) Gene expression pattern of EGFR and its direct PPI in estrogen-deprived condition.
Figure 4Expression pattern and $ER_Q6 binding distribution of ER target genes in estrogen-induced condition (MCF-7 cell line). (a) The identified ER target genes show a consistent gene expression pattern of either being early-induced (≤ 4 hours) or late-induced (4 - 24 hours) by estrogen. (b) 52 out of 68 ER target genes have at least one $ER_Q6 binding site.