Literature DB >> 28866555

Visualizing Big Data Outliers through Distributed Aggregation.

Leland Wilkinson.   

Abstract

Visualizing outliers in massive datasets requires statistical pre-processing in order to reduce the scale of the problem to a size amenable to rendering systems like D3, Plotly or analytic systems like R or SAS. This paper presents a new algorithm, called hdoutliers, for detecting multidimensional outliers. It is unique for a) dealing with a mixture of categorical and continuous variables, b) dealing with big-p (many columns of data), c) dealing with big-n (many rows of data), d) dealing with outliers that mask other outliers, and e) dealing consistently with unidimensional and multidimensional datasets. Unlike ad hoc methods found in many machine learning papers, hdoutliers is based on a distributional model that allows outliers to be tagged with a probability. This critical feature reduces the likelihood of false discoveries.

Year:  2017        PMID: 28866555     DOI: 10.1109/TVCG.2017.2744685

Source DB:  PubMed          Journal:  IEEE Trans Vis Comput Graph        ISSN: 1077-2626            Impact factor:   4.579


  5 in total

Review 1.  Static Analysis of Information Systems for IoT Cyber Security: A Survey of Machine Learning Approaches.

Authors:  Igor Kotenko; Konstantin Izrailov; Mikhail Buinevich
Journal:  Sensors (Basel)       Date:  2022-02-10       Impact factor: 3.576

2.  Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering.

Authors:  Zeynel Cebeci; Cagatay Cebeci; Yalcin Tahtali; Lutfi Bayyurt
Journal:  PeerJ Comput Sci       Date:  2022-09-27

3.  Farm management practices, biosecurity and influenza a virus detection in swine farms: a comprehensive study in colombia.

Authors:  Ciuoderis-Aponte Karl; Diaz Andres; Muskus Carlos; Mario Peña; Hernández-Ortiz Juan; Osorio Jorge
Journal:  Porcine Health Manag       Date:  2022-10-05

4.  Changes in Tea Plant Secondary Metabolite Profiles as a Function of Leafhopper Density and Damage.

Authors:  Eric R Scott; Xin Li; Ji-Peng Wei; Nicole Kfoury; Joshua Morimoto; Ming-Ming Guo; Amma Agyei; Albert Robbat; Selena Ahmed; Sean B Cash; Timothy S Griffin; John R Stepp; Wen-Yan Han; Colin M Orians
Journal:  Front Plant Sci       Date:  2020-05-29       Impact factor: 5.753

5.  High-Throughput Imaging Assay for Drug Screening of 3D Prostate Cancer Organoids.

Authors:  Nicholas Choo; Susanne Ramm; Jennii Luu; Jean M Winter; Luke A Selth; Amy R Dwyer; Mark Frydenberg; Jeremy Grummet; Shahneen Sandhu; Theresa E Hickey; Wayne D Tilley; Renea A Taylor; Gail P Risbridger; Mitchell G Lawrence; Kaylene J Simpson
Journal:  SLAS Discov       Date:  2021-06-11       Impact factor: 3.341

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.