| Literature DB >> 19292896 |
Li-Yu D Liu1, Chien-Yu Chen, Mei-Ju M Chen, Ming-Shian Tsai, Cho-Han S Lee, Tzu L Phang, Li-Yun Chang, Wen-Hung Kuo, Hsiao-Lin Hwa, Huang-Chun Lien, Shih-Ming Jung, Yi-Shing Lin, King-Jen Chang, Fon-Jou Hsieh.
Abstract
BACKGROUND: A variety of high-throughput techniques are now available for constructing comprehensive gene regulatory networks in systems biology. In this study, we report a new statistical approach for facilitating in silico inference of regulatory network structure. The new measure of association, coefficient of intrinsic dependence (CID), is model-free and can be applied to both continuous and categorical distributions. When given two variables X and Y, CID answers whether Y is dependent on X by examining the conditional distribution of Y given X. In this paper, we apply CID to analyze the regulatory relationships between transcription factors (TFs) (X) and their downstream genes (Y) based on clinical data. More specifically, we use estrogen receptor alpha (ERalpha) as the variable X, and the analyses are based on 48 clinical breast cancer gene expression arrays (48A).Entities:
Mesh:
Substances:
Year: 2009 PMID: 19292896 PMCID: PMC2679734 DOI: 10.1186/1471-2105-10-85
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Number of selected primary target genes of ER (. (a) results of using ESR1 on gene set II; (b) results of using ER+/- on gene set II; (c) results of using ESR1 on gene set III; (d) results of using ER+/- on gene set III.
Figure 2The established ERα mediated regulatory network is partially constructed by conducting GPCC and CID analyses in 48A.
Figure 3The brief demonstration of the selected situations when CID, GPCC, and STT evaluate the significance differently based on the same data distribution. (a)-(b) represent when variables are continuous and thus the CID analysis is compared with GPCC. (c)-(d) represent when one of the variables is discrete and thus the CID analysis is compared with STT.
Figure 4Statistical analyses for each gene in 48A. Two examples are demonstrated in this figure. One is BRCA1 (a)-(b), which has been significantly identified by CID as the ERα target gene. Another is CCNA2 (c)-(f), which was significantly recognized by CID/STT.
Summary for characteristics of the identified gene sets when constructing the two-layer network in Figure 2.
| Reported as significance by: | ER direct targets (18#) | E2F targets (11&) |
| GPCC-ESR1 (60*) | 13 | 1 |
| CID-ESR1 but not GPCC-ESR1 (22*) | 1 | 7 |
| Neither CID-ESR1 nor GPCC-ESR1 (101*) | 4 | 3 |
* The number of genes in 48A identified by the statistic of interest.
# The number of genes in 48A appeared in gene set II.
&The number of genes in 48A appeared in the gene list of E2F family direct target by others [38].