| Literature DB >> 21047387 |
David Oviatt1, Mark Clement, Quinn Snell, Kenneth Sundberg, Chun Wan J Lai, Jared Allen, Randall Roper.
Abstract
BACKGROUND: Modern approaches to treating genetic disorders, cancers and even epidemics rely on a detailed understanding of the underlying gene signaling network. Previous work has used time series microarray data to infer gene signaling networks given a large number of accurate time series samples. Microarray data available for many biological experiments is limited to a small number of arrays with little or no time series guarantees. When several samples are averaged to examine differences in mean value between a diseased and normal state, information from individual samples that could indicate a gene relationship can be lost.Entities:
Mesh:
Year: 2010 PMID: 21047387 PMCID: PMC2975420 DOI: 10.1186/1471-2164-11-S2-S6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Sample AIRnet Network inferred using 40% threshold
Figure 2Sample AIRnet Network inferred using 80% threshold
Figure 3True in-silico network
AIRnet results for DREAM3 competition
| score | AUPR | AUROC | |
| empty network | 1.1816e+00 | 8.5675e-03 | 5.0578e-01 |
| trajectories | 1.6298e+00 | 2.1759e-03 | 2.5279e-01 |
| heterozygous | 2.2401e+00 | 3.6441e-04 | 9.0845e-02 |
| null-mutant | 2.8198e+00 | 4.1550e-05 | 5.5198e-02 |
| score | AUPR | AUROC | |
| empty network | 2.4438e+00 | 2.6065e-05 | 4.9687e-01 |
| trajectories | 2.6700e+00 | 6.1865e-06 | 7.3901e-01 |
| heterozygous | 2.6207e+00 | 7.7215e-06 | 7.4297e-01 |
| null-mutant | 1.4152e+01 | 5.2984e-26 | 9.3634e-04 |
| score | AUPR | AUROC | |
| empty network | 5.2312e+00 | 6.8572e-11 | 5.0297e-01 |
| trajectories | 3.8264e+00 | 4.7395e-08 | 4.6923e-01 |
| heterozygous | 5.2881e+00 | 6.3523e-11 | 4.1762e-01 |
| null-mutant | 3.7911e+01 | 1.0263e-71 | 1.4694e-05 |
AIRnet results for 3 network sizes from the DREAM3 competition. The empty network row shows values for graphs with 0 edges and provide a baseline to interpret the scores for other networks. The empty network scores were generated by submitting an empty file with no predictions to the DREAM score evaluator. The other rows correspond to the type of data used to infer the networks. In the score column, larger values are better. In the other two columns, smaller values are better. Scores reported are produced using an 80% threshold parameter for AIRnet.
AIRnet results on experimental data
| Intermediate Genes | Connections | Percentage |
|---|---|---|
| 1 | 148 | 0.41% |
| 2 | 3110 | 8.71% |
| 3 | 12253 | 34.34% |
| 4 | 8065 | 22.60% |
| 5 | 1349 | 3.78% |
AIRnet inferred connections found in public databases. The table shows the number of connections in the AIRnet inferred network that have the given number of intermediate connections. A Connection count of one indicates that the genes are directly connected in the graph. A count of two indicates that the genes have one intermediate gene between them in the pathway database. The percentage shows the percentage of connections in the inferred network that have this Connection count. AIRnet connections were validated almost 70% of the time, where less than 30% of the connections in random networks were validated.
Figure 4Differences between Euploid and Trisomic regulatory networks inferred by AIRnet. Numbers on edges are the absolute value of the difference between the edge weight in Euploid and Trisomic samples. Red edges were found only in Euploid and Green edges were found only in Trisomic samples. Genes with three copies in the trisomic Ts65DN mice are shown in yellow.
Data discretization
| wt | G1(-/-) | G2(-/-) | G3(-/-) | G4(-/-) | G5(-/-) | G6(-/-) | G7(-/-) | G8(-/-) | |
| G1 | 0.105 | 0.034 | 0.927 | 0.088 | 0.015 | 0.049 | 0.102 | 0.105 | 0.018 |
| G2 | 0.877 | 0.804 | 0.000 | 0.864 | 0.870 | 0.981 | 0.837 | 0.873 | 0.797 |
| G3 | 0.054 | 0.000 | 0.838 | 0.000 | 0.103 | 0.000 | 0.069 | 0.000 | 0.085 |
| G4 | 0.386 | 0.310 | 0.611 | 0.243 | 0.083 | 0.432 | 0.440 | 0.394 | 0.364 |
| G5 | 0.801 | 0.808 | 0.748 | 0.903 | 0.793 | 0.000 | 0.880 | 0.741 | 0.686 |
| wt | G1(-/-) | G2(-/-) | G3(-/-) | G4(-/-) | G5(-/-) | G6(-/-) | G7(-/-) | G8(-/-) | |
| G1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| G2 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 |
| G3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| G4 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 1 |
| G5 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 |
Discretization of data by k-means clustering for an in-silico network consisting of 5 genes [17] - genes are divided by row, samples are divided by column.
Activation state change examples
| wt | G1(-/-) | G2(-/-) | G3(-/-) | G4(-/-) | G5(-/-) | G6(-/-) | G7(-/-) | G8(-/-) | |
| G1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| G2 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 |
| G3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| G4 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 1 |
| G5 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 |
| G6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| wt | G1(-/-) | G2(-/-) | G3(-/-) | G4(-/-) | G5(-/-) | G6(-/-) | G7(-/-) | G8(-/-) | |
| G1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| G2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| G3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| G4 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 1 |
| G5 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 |
| G6 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| wt | G1(-/-) | G2(-/-) | G3(-/-) | G4(-/-) | G5(-/-) | G6(-/-) | G7(-/-) | G8(-/-) | |
| G1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| G2 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 |
| G3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | ||
| G4 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | ||
| G5 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 |
| G6 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
Activation State Change Examples - genes are divided by row, samples are divided by column