A Dimensionally Reduced Clustering Methodology for Heterogeneous Occupational Medicine Data Mining.

Foued Saâdaoui, Pierre R Bertrand, Gil Boudet, Karine Rouffiac, Frédéric Dutheil, Alain Chamoux.   

Abstract

Clustering is a set of statistical learning techniques aimed at finding structure in data by partitioning them into mutually heterogeneous groups of homogeneous observations, called clusters. Clustering has been applied successfully in many fields, including medicine, biology, finance, and economics. In this paper, we introduce the notion of clustering in multifactorial data analysis problems. A case study is conducted on an occupational medicine problem, with the purpose of analyzing patterns in a population of 813 individuals. To reduce the dimensionality of the data set, we base our approach on Principal Component Analysis (PCA), the statistical tool most commonly used in factorial analysis. However, real-world problems, especially in medicine, often involve heterogeneous measurements that mix qualitative and quantitative variables, whereas PCA processes only quantitative ones. Moreover, qualitative data are originally unobservable quantitative responses that are usually binary-coded. Hence, we propose a new set of strategies for handling quantitative and qualitative data simultaneously. The principle of this approach is to project the qualitative variables onto the subspaces spanned by the quantitative ones. Subsequently, an optimal model is allocated to the resulting PCA-regressed subspaces.
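The abstract does not give implementation details, so the following is only a minimal sketch of the general idea, under stated assumptions: PCA is computed on standardized quantitative variables, the binary-coded qualitative variables are projected onto the span of the leading components by ordinary least squares, and a plain k-means stands in for the clustering model. The function names (`pca_scores`, `project_qualitative`, `kmeans`), the synthetic data, the number of components, and the use of k-means are all hypothetical choices made for illustration, not the authors' method.

```python
import numpy as np

def pca_scores(X, n_components):
    """Principal component scores of the standardized columns of X."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    # SVD of the standardized data: rows of Vt are the principal axes.
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    return Z @ Vt[:n_components].T

def project_qualitative(scores, Q):
    """Least-squares projection of binary-coded columns Q onto span(1, scores)."""
    A = np.column_stack([np.ones(len(scores)), scores])  # intercept + components
    coef, *_ = np.linalg.lstsq(A, Q, rcond=None)
    return A @ coef

def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's algorithm (assumed stand-in for the clustering model)."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        dist = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = points[labels == j].mean(axis=0)
    return labels

# Synthetic data (assumption): 200 individuals, 5 quantitative variables,
# 2 binary-coded qualitative variables loosely tied to the first two.
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 5))
X[:100] += 3.0
Q = (X[:, :2] + rng.normal(scale=0.5, size=(200, 2)) > 1.5).astype(float)

scores = pca_scores(X, n_components=2)      # reduced quantitative subspace
Q_hat = project_qualitative(scores, Q)      # qualitative variables projected onto it
features = np.column_stack([scores, Q_hat])
labels = kmeans(features, k=2)
```

In this sketch the projected qualitative variables are affine functions of the component scores, so clustering on `features` amounts to clustering in the reduced quantitative subspace with a reweighting induced by the qualitative variables.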

Year: 2015; PMID: 26357403; DOI: 10.1109/TNB.2015.2477407

Source DB: PubMed; Journal: IEEE Trans Nanobioscience; ISSN: 1536-1241; Impact factor: 2.935


