Literature DB >> 34936492

Detecting Unbiased Associations in Large Data Sets.

Chuanlu Liu1, Shuliang Wang1,2, Hanning Yuan1, Xiaojia Liu1.   

Abstract

Maximal information coefficient (MIC) explores the associations between pairwise variables in complex relationships. It approaches the correlation by optimized partition on the axis. However, when the relationships meet special noise, MIC may overestimate the correlated value, which leads to the misidentification of the relationship without noiseless. In this article, a novel method of weighted information coefficient mean (WICM) is proposed to detect unbiased associations in large data sets. First, we mathematically analyze the cause of giving an abnormal correlation value to a noisy relationship. Then, the WICM is presented in two core steps. One is to detect the potential overestimation from the relationships with high value, and the other is to rectify the overestimation by calculating information coefficient mean instead of just selecting the maximum element in the characteristic matrix. Finally, experiments in functional relationships and real-world data relationships show that the overestimation can be solved by WICM with both feasibility and effectiveness.

Entities:  

Keywords:  characteristic matrix; large data set; maximal information coefficient (MIC); relationship overestimation; unbiased associations; weighted information coefficient mean (WICM)

Mesh:

Year:  2021        PMID: 34936492     DOI: 10.1089/big.2021.0193

Source DB:  PubMed          Journal:  Big Data        ISSN: 2167-6461            Impact factor:   4.426


  1 in total

1.  Detecting Trivariate Associations in High-Dimensional Datasets.

Authors:  Chuanlu Liu; Shuliang Wang; Hanning Yuan; Yingxu Dang; Xiaojia Liu
Journal:  Sensors (Basel)       Date:  2022-04-06       Impact factor: 3.576

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.