| Literature DB >> 21524308 |
Cristian A Gallo1, Jessica A Carballido, Ignacio Ponzoni.
Abstract
BACKGROUND: Gene regulatory networks have an essential role in every process of life. In this regard, the amount of genome-wide time series data is becoming increasingly available, providing the opportunity to discover the time-delayed gene regulatory networks that govern the majority of these molecular processes.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21524308 PMCID: PMC3111372 DOI: 10.1186/1471-2105-12-123
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Types of rules inferred by GRNCOP2
| Rule type | Time-lagged rule associated |
|---|---|
| -3 | + gene |
| -2 | - gene |
| -1 | +/- gene |
| 0 | gene |
| 1 | +/- gene |
| 2 | + gene |
| 3 | - gene |
Summary of the different types of rules inferred by GRNCOP2, where w denotes the time-delay in the regulation.
Figure 1General schema of the GRNCOP2 Algorithm.
Characteristics of the 190 possible gene pair-wise interactions
| Yeastnet | Co-citation | GO | number of possible associations | |||
|---|---|---|---|---|---|---|
| All gene pair-wise combinations | 51.58% | 1.53033843 | 43.68% | 1.3487118 | 45.26% | 190 |
Characteristics of the 190 possible gene pair-wise interactions for 20 genes in terms of precision and score metrics on the Yeastnet, Co-citation and GO reference sets.
Average values for the metrics achieved by GRNCOP2 and GRNCOP
| GRNCOP2 | GRNCOP | RANDOM | ||
|---|---|---|---|---|
| average | 76.69% | 51.58% | ||
| average | 16.25% | - | ||
| average | 82.43% | - | ||
| average | 2.49 | 1.53033843 | ||
| average | 74.86% | 43.68% | ||
| average | 19.02% | - | ||
| average | 82.76% | - | ||
| average | 2.50 | 1.3487118 | ||
| average | 52.25% | 45.26% | ||
| average | 13.93% | - | ||
| average | 76.60% | - | ||
| average number of associations | 20.84 | - | ||
Average precision, sensitivity, specificity and score values achieved by GRNCOP2 and GRNCOP over 56 runs. The precision and score of the random selection is also included. The bolded scores denote the best values.
Figure 2. Values of precision and score metrics achieved by GRNCOP2 and GRNCOP in each of the 56 runs w.r.t. the CP-CSS. Figure 2a: yeastnet precision. Figure 2b: yeastnet score. Figure 2c: co-citation precision. Figure 2d: co-citation score. Figure 2e: GO precision.
Figure 3. The sensitivity and specificity values achieved by GRNCOP2 and GRNCOP on the three benchmarking sets for the 56 runs. Figure 3a: sensitivity vs. specificity regarding the Yeastnet set. Figure 3b: sensitivity vs. specificity regarding the Co-citation set. Figure 3c: sensitivity vs. specificity regarding the GO set.
Values of the metrics achieved by GRNCOP2, Soinov et al. and Bulashevska and Eils
| GRNCOP2 | Soinov | Bulashevska and Eils | RANDOM | ||
|---|---|---|---|---|---|
| 50.00% | 88.89% | 51.58% | |||
| 3.06% | 8.16% | - | |||
| 96.74% | - | ||||
| 1.84 | 2.77 | 1.5303384 | |||
| 50.00% | 88.89% | 43.68% | |||
| 3.61% | 9.64% | - | |||
| 97.20% | - | ||||
| 1.85 | 2.84 | 1.3487118 | |||
| 50.00% | 55.56% | 45.26% | |||
| 3.49% | 5.81% | - | |||
| 96.15% | 96.15% | - | |||
| 6.00 | 9.00 | - | |||
Values achieved by GRNCOP2, Soinov et al. and Bulashevska and Eils for the proposed metrics, for simultaneous rules at an accuracy level of 0.75. The precision and score of the random selection is also included. The bolded scores denote the best values.
Values of the metrics achieved by GRNCOP2 and Li et al
| GRNCOP2 | Li | RANDOM | ||
|---|---|---|---|---|
| 51.58% | ||||
| 5.10% | - | |||
| - | ||||
| 2.89 | 1.5303384 | |||
| 43.68% | ||||
| 6.02% | - | |||
| - | ||||
| 2.99 | 1.3487118 | |||
| 80.00% | 45.26% | |||
| 4.65% | - | |||
| - | ||||
| number of associations | 5.00 | - | ||
Values achieved by GRNCOP2 and Li et al. for the proposed metrics, for rules with time-delays from 1 to 5 at an accuracy level of 0.75. The precision and score of the random selection is also included. The bolded scores denote the best values.
Rules inferred by GRNCOP2
| Rule found by | ||||||||
|---|---|---|---|---|---|---|---|---|
| Rules | GRNCOP | Bulashevska and Eils | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | -/+ | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | -/+ | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | -/+ | ||||||
| +/- | → | -/+ | ||||||
| +/- | → | +/- | ||||||
| +/- | → | -/+ | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | -/+ | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | -/+ | ||||||
| +/- | → | -/+ | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | -/+ | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
| +/- | → | +/- | ||||||
Rules inferred by GRNCOP2 using 20 cyclin genes of the datasets from Spellman et al. (1998). The last three columns indicate whether the rules were found by any of the other methods. The complete rules are represented by an *; + for positive regulatory relationship, and - for the negative regulatory relationship.
Figure 4Discretized values for . Real and discretized expression profiles of SWI4 and CLB5 genes with 0 (left) and 4 (right) units of time-delay for the cdc15 dataset.
List of genome-wide datasets
| Microarray time-series dataset | Reference | Sample count |
|---|---|---|
| GDS1752_d1 | Ronen and Botstein [ | 12 |
| GDS1752_d2 | 14 | |
| GDS2003_d1 | Lai | 15 |
| GDS2003_d2 | 15 | |
| GDS2347 | Pramila | 13 |
| GDS2350_d1 | Pramila | 25 |
| GDS2350_d2 | 25 | |
| GDS759 | Sapra | 24 |
| ELUTRIATION | Spellman | 14 |
| ALPHA FACTOR | 18 | |
| CDC15 | 24 | |
| CDC28 | 17 |
List of datasets employed in the genome-wide study. Some datasets were separated into two different sets of samples based on the experimental conditions described for each one.
Figure 5. Values of the precision and score metrics achieved by GRNCOP2 with the Accuracy and RCA parameters varying from 0.70 to 1 and from 0.60 to 1 respectively, with the SCP parameter fixed in 0.95 and with W = 4. The number of associations is also showed. Figure 5a: yeastnet precision. Figure 5b: yeastnet score. Figure 5c: co-citation precision. Figure 5d: co-citation score. Figure 5e: GO precision. Figure 5f: number of associations.
Figure 6Reconstructed GRN with .
Ontological analysis for the eight largest sub-networks
| Biological Process | Molecular Function | Celullar Component | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 5 | ||||||||||
| 9 | ||||||||||
| DNA binding | 35.71% | 0.06779 | nucleus | 71.43% | 0.04154 | 14 | ||||
| 35 | ||||||||||
| molecular function | 38.79% | 0.02142 | 214 | |||||||
| molecular function | 50.00% | 0.35995 | cytoplasm | 50.00% | 0.73076 | 4 | ||||
| transferase activity | 75.00% | 0.01018 | 4 | |||||||
| biological process | 60.00% | 0.18427 | cellular component | 60.00% | 0.04812 | 5 | ||||
| all | molecular function | 33.24% | 0.31227 | 352 | ||||||
The eight largest sub-networks, with their respective ontological enrichment, for the Biological Process, Molecular Function and Cellular Component categories. The annotation column denotes the most common annotation for the genes in the sub-network, whereas the percentage is the percentage of genes w.r.t. the number of genes in the sub-network that receives such annotation. The corrected p-value is the statistical significance of the annotation. Finally, the bolded categories and scores remark the cases where the annotation was statistically significant at an α level of 0.01.