| Literature DB >> 25962835 |
Michele Fratello1,2, Angela Serra3, Vittorio Fortino4, Giancarlo Raiconi5, Roberto Tagliaferri6, Dario Greco7.
Abstract
BACKGROUND: OMICs technologies allow to assay the state of a large number of different features (e.g., mRNA expression, miRNA expression, copy number variation, DNA methylation, etc.) from the same samples. The objective of these experiments is usually to find a reduced set of significant features, which can be used to differentiate the conditions assayed. In terms of development of novel feature selection computational methods, this task is challenging for the lack of fully annotated biological datasets to be used for benchmarking. A possible way to tackle this problem is generating appropriate synthetic datasets, whose composition and behaviour are fully controlled and known a priori.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25962835 PMCID: PMC4448275 DOI: 10.1186/s12859-015-0577-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Motifs of interactions. Graphical representation of the interactions between genes and miRNAs. Arrows are for activation, blunt edges are for repression.
Figure 2Hill functions. Shapes of the Hill function for different values of the parameters. The solid red line is a Hill function with parameters θ=0.5 and μ=5. The shaded red area is the family of Hill functions obtained when θ∈[0.3,0.8] and μ is fixed. Similarly, the solid blue line is the Hill function of parameters θ=0.3 and μ=6. The shaded blue area is the family of Hill functions obtained when μ∈[2,10] and θ is fixed.
Figure 3Fitting of degree distribution. The degree distribution of 50 networks generated with the same size is fitted by a line in log-log space. The resulting estimated scale parameter is with R 2=0.9362.
Figure 4Scale invariance of clustering coefficient. Simulation of 100 networks of random size in [10,1000] shows that the estimated scaling parameter of the clustering coefficient is independent from the network size and approximates the value found in real networks.
Gene-only path length
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| 165 true | - | 166 | 1 | - | - | 1 | 165 | 165 |
| 414 true | - | 414 | - | - | - | 2 | 414 | 414 |
| 828 true | - | 829 | 7 | 1 | - | 4 | 828 | 828 |
| 1242 true | 1 | 1243 | 3 | - | 1 | 5 | 1242 | 1242 |
| 1656 true | - | 1656 | 2 | - | - | 3 | 1656 | 1656 |
Path length of significant interactions confirmed by PANDA on the gene-only regulatory network with different amounts of correct prior information.
Gene-only path length with false information
|
|
|
|
| |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| 1656 true + 165 false | 1656 | 1 | - | 8 | 1656 | 1656 |
| 1656 true + 414 false | 1656 | 2 | 1 | 7 | 1656 | 1656 |
| 1656 true + 828 false | 1656 | - | - | 4 | 1654 | 1656 |
| 1656 true + 1242 false | 1656 | 2 | 1 | 7 | 1653 | 1656 |
| 1656 true + 1656 false | 1656 | 2 | - | 4 | 1647 | 1656 |
Whole-network path length
|
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| 396 true | - | 396 | - | - | - | - | - | 396 | 396 |
| 992 true | 1 | 993 | - | - | 1 | 1 | - | 992 | 992 |
| 1984 true | 1 | 1985 | - | 2 | 2 | 1 | 1 | 1984 | 1984 |
| 2976 true | - | 2976 | 3 | - | 3 | 2 | 1 | 2976 | 2976 |
| 3969 true | - | 3969 | - | 1 | 1 | - | - | 3969 | 3969 |
Path length of significant interactions confirmed by PANDA on the whole regulatory network with different amounts of correct prior information.
Whole-network path length with false information
|
|
|
|
| |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| 3969 true + 396 false | 3969 | - | - | 1 | 3671 | 3930 |
| 3969true + 992 false | 3969 | - | - | 1 | 3653 | 3918 |
| 3969 true + 1984 false | 3969 | - | 1 | - | 3640 | 3899 |
| 3969 true + 2976 false | 3969 | - | 1 | 1 | 3655 | 3874 |
| 3969 true + 3969 false | 3968 | 1 | 1 | 1 | 3659 | 3859 |
Path length of significant interactions confirmed by PANDA on the whole regulatory network with the presence of different amounts of noisy prior information.
Whole-network path length with ARACNE
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
| Gene-only Interactions | 25 | 8 | 9 | 6 | 4 | 3 | - | - | - | 6493 |
| Whole-network Interactions | 43 | 150 | 681 | 1342 | 1367 | 449 | 130 | 12 | 2 | 3704 |
Path length of interactions inferred by ARACNE on the gene-only and full regulatory networks.
Figure 5Revealed interactions among clustered genes. Clustered genes and miRNAs together with interactions. The majority of nodes that are clustered together are actually connected in the network from which data has been simulated.