| Literature DB >> 26283601 |
Yi Wang1, Yi Li2, Hongbao Cao3, Momiao Xiong4, Yin Yao Shugart5,6, Li Jin7,8.
Abstract
BACKGROUND: Testing dependence/correlation of two variables is one of the fundamental tasks in statistics. In this work, we proposed a new way of testing nonlinear dependence between two continuous variables (X and Y).Entities:
Mesh:
Substances:
Year: 2015 PMID: 26283601 PMCID: PMC4539721 DOI: 10.1186/s12859-015-0697-7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Simulation power in nine simple functions
| N = 50, x ~ U(−1,1) | CANOVA2 | CANOVA4 | CANOVA8 | CANOVA12 | Pearson | Kendall | Spearman | Distance | Hoeffding | MIC |
|---|---|---|---|---|---|---|---|---|---|---|
| y = 0 + N(0,1) | 0.051 | 0.048 | 0.048 | 0.050 | 0.047 | 0.048 | 0.049 | 0.039 | 0.059 | 0.051 |
| y = x + N(0,1) | 0.564 | 0.798 | 0.889 | 0.902 |
| 0.962 | 0.961 | 0.950 | 0.953 | 0.591 |
| y = 0.5;*(x + 1)2 + N(0,1) | 0.606 | 0.836 | 0.904 | 0.918 |
| 0.953 | 0.962 | 0.964 | 0.953 | 0.633 |
| y = sin(Pi*x) + N(0,1) | 0.758 | 0.941 | 0.966 | 0.962 | 0.936 | 0.918 | 0.930 |
|
| 0.829 |
| y = sin(2*Pi*x) + N(0,1) | 0.713 |
| 0.812 | 0.294 | 0.318 | 0.328 | 0.320 | 0.341 | 0.405 | 0.579 |
| y = sin(3*Pi*x) + N(0,1) | 0.677 |
| 0.254 | 0.076 | 0.178 | 0.192 | 0.199 | 0.186 | 0.219 | 0.423 |
| y = cos(Pi*x) + N(0,1) | 0.784 | 0.940 |
| 0.942 | 0.067 | 0.076 | 0.083 | 0.660 | 0.710 | 0.660 |
| y = cos(2*Pi*x) + N(0,1) | 0.738 |
| 0.754 | 0.142 | 0.045 | 0.054 | 0.053 | 0.100 | 0.129 | 0.548 |
| y = cos(3*Pi*x) + N(0,1) | 0.673 |
| 0.160 | 0.031 | 0.053 | 0.054 | 0.057 | 0.074 | 0.090 | 0.371 |
The bold means the first place result of all methods compared
Power comparison in kidney cancer dataset (The significance level α = 0.05/20531)
| Kidney cancer dataset | CANOVA | Kendall | Pearson | Spearman | Hoeffding | Distance | MIC |
|---|---|---|---|---|---|---|---|
| Significant gene number | 5901 | 11569 | 8239 |
| 4953 | 10946 | 8081 |
| Computing time (seconds) |
| 65 | 32 | 32 | 44 | ~106 | 114 |
The bold means the first place result of all methods compared
“~” means about or approximately
Significant genes detected only by CANOVA and corresponding p-value of all methods in kidney cancer data (α = 0.05/20531)
| CANOVA_gene | CANOVA | Distance | Hoeffding | Kendall | Pearson | Spearman | MIC |
|---|---|---|---|---|---|---|---|
| ACY3|91703 | 0 | 4.00E-06 | 0.47918 | 0.286872598 | 0.002263414 | 0.287245869 | 0.189931316 |
| AMD1|262 | 0 | 4.40E-05 | 0.08116 | 0.005927801 | 0.733642545 | 0.005833208 | 0.212586042 |
| AMDHD1|144193 | 3.40E-07 | 8.00E-06 | 0.67325 | 0.030092326 | 0.000717698 | 0.029975253 | 0.170029851 |
| C17orf37|84299 | 5.80E-07 | 5.20E-05 | 0.04005 | 3.61E-05 | 0.417383349 | 3.24E-05 | 0.219216883 |
| C21orf57|54059 | 2.40E-07 | 4.00E-06 | 0.04784 | 6.30E-06 | 3.99E-05 | 5.38E-06 | 0.19141914 |
| CRAT|1384 | 5.80E-07 | 8.00E-06 | 0.32615 | 0.000160458 | 3.77E-06 | 0.000149343 | 0.196028813 |
| ETV5|2119 | 0 | 0.000172 | 0.42256 | 0.001755105 | 0.003401714 | 0.001702658 | 0.202086913 |
|
| 0 | 0.000933998 | 0.48933 | 0.153797268 | 0.457070256 | 0.153962124 | 0.212691814 |
| FAM105A|54491 | 0 | 2.00E-05 | 0.72088 | 0.005901803 | 7.68E-05 | 0.005807373 | 0.198623556 |
| FTL|2512 | 0 | 0.002467995 | 0.4743 | 0.048315704 | 0.23060211 | 0.048231442 | 0.212746271 |
| GDA|9615 | 1.60E-07 | 0.00025 | 0.48634 | 0.160122916 | 0.459724584 | 0.160300937 | 0.185681164 |
| HSD17B14|51171 | 0 | 8.20E-05 | 0.19284 | 0.001051631 | 0.006799576 | 0.001012728 | 0.208298029 |
| LOC100132111|100132111 | 1.00E-08 | 1.40E-05 | 0.08222 | 0.001103681 | 0.357830837 | 0.001063627 | 0.20892751 |
|
| 0 | 6.00E-06 | 0.50714 | 0.033769054 | 1.98E-05 | 0.033657199 | 0.197222887 |
| MSL3L2|151507 | 5.00E-08 | 4.00E-06 | 0.0658 | 0.00022671 | 0.000309573 | 0.000212513 | 0.197191821 |
| NPEPPS|9520 | 6.30E-07 | 1.80E-05 | 0.12107 | 0.006358039 | 0.294981611 | 0.006260864 | 0.193740442 |
| RASEF|158158 | 4.00E-08 | 2.00E-05 | 0.15806 | 0.038695575 | 0.339964949 | 0.038592039 | 0.221013132 |
| RASGRF1|5923 | 4.50E-07 | 0.000509999 | 0.29384 | 0.005697491 | 0.944454242 | 0.005604368 | 0.192676281 |
| SLC9A3R1|9368 | 0 | 6.00E-06 | 0.2351 | 0.001772375 | 0.000600274 | 0.001719639 | 0.211044758 |
| SRGAP2|23380 | 1.49E-06 | 1.60E-05 | 0.13479 | 0.00010228 | 0.00076986 | 9.43E-05 | 0.16085031 |
| SYTL2|54843 | 9.40E-07 | 0.000357999 | 0.49524 | 0.156725188 | 0.013347293 | 0.156896177 | 0.197737514 |
|
| 5.00E-08 | 1.60E-05 | 0.5995 | 0.278490278 | 1.33E-05 | 0.278854528 | 0.18041022 |
| ZNF280B|140883 | 6.80E-07 | 0.000431999 | 0.17067 | 0.073453284 | 0.259146202 | 0.073428429 | 0.203602346 |
| ZNF577|84765 | 0 | 4.60E-05 | 0.13197 | 0.063783754 | 0.410213566 | 0.063735193 | 0.208902832 |
As the p-value of mic is calculated by table lookup, so we just list the MIC value (If MIC >0.22378, then the p-value of MIC < 2.435342e-06)
The genes reported in pubmed was shown in bold italics
Fig. 1The Scatterplot and probability density distribution of three gene expressions (FAH, MCM3 and UGT1A9) between kidney-cancer and normal groups