Literature DB >> 27126063

Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth.

Zhaoyang Zhang1, Hua Fang2, Honggang Wang3.   

Abstract

Web-delivered trials are an important component in eHealth services. These trials, mostly behavior-based, generate big heterogeneous data that are longitudinal, high dimensional with missing values. Unsupervised learning methods have been widely applied in this area, however, validating the optimal number of clusters has been challenging. Built upon our multiple imputation (MI) based fuzzy clustering, MIfuzzy, we proposed a new multiple imputation based validation (MIV) framework and corresponding MIV algorithms for clustering big longitudinal eHealth data with missing values, more generally for fuzzy-logic based clustering methods. Specifically, we detect the optimal number of clusters by auto-searching and -synthesizing a suite of MI-based validation methods and indices, including conventional (bootstrap or cross-validation based) and emerging (modularity-based) validation indices for general clustering methods as well as the specific one (Xie and Beni) for fuzzy clustering. The MIV performance was demonstrated on a big longitudinal dataset from a real web-delivered trial and using simulation. The results indicate MI-based Xie and Beni index for fuzzy-clustering are more appropriate for detecting the optimal number of clusters for such complex data. The MIV concept and algorithms could be easily adapted to different types of clustering that could process big incomplete longitudinal trial data in eHealth services.

Entities:  

Keywords:  Big data; Fuzzy clustering; Longitudinal trial; Missing data; Multiple imputation; Validation

Mesh:

Year:  2016        PMID: 27126063      PMCID: PMC4881752          DOI: 10.1007/s10916-016-0499-0

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


  15 in total

1.  Model-based clustering and data transformations for gene expression data.

Authors:  K Y Yeung; C Fraley; A Murua; A E Raftery; W L Ruzzo
Journal:  Bioinformatics       Date:  2001-10       Impact factor: 6.937

2.  A stability based method for discovering structure in clustered data.

Authors:  Asa Ben-Hur; Andre Elisseeff; Isabelle Guyon
Journal:  Pac Symp Biocomput       Date:  2002

3.  Stability-based validation of clustering solutions.

Authors:  Tilman Lange; Volker Roth; Mikio L Braun; Joachim M Buhmann
Journal:  Neural Comput       Date:  2004-06       Impact factor: 2.026

4.  Pattern Recognition of Longitudinal Trial Data with Nonignorable Missingness: An Empirical Case Study.

Authors:  Hua Fang; Kimberly Andrews Espy; Maria L Rizzo; Christian Stopp; Sandra A Wiebe; Walter W Stroup
Journal:  Int J Inf Technol Decis Mak       Date:  2009-09-01

5.  Bayesian clustering using hidden Markov random fields in spatial population genetics.

Authors:  Olivier François; Sophie Ancelet; Gilles Guillot
Journal:  Genetics       Date:  2006-08-03       Impact factor: 4.562

6.  A new nonlinear classifier with a penalized signed fuzzy measure using effective genetic algorithm.

Authors:  Hua Fang; Maria L Rizzo; Honggang Wang; Kimberly Andrews Espy; Zhenyuan Wang
Journal:  Pattern Recognit       Date:  2010       Impact factor: 7.740

7.  Detecting graded exposure effects: a report on an East Boston pregnancy cohort.

Authors:  Hua Fang; Vanja Dukic; Kate E Pickett; Lauren Wakschlag; Kimberly Andrews Espy
Journal:  Nicotine Tob Res       Date:  2012-01-20       Impact factor: 4.244

8.  The QUIT-PRIMO provider-patient Internet-delivered smoking cessation referral intervention: a cluster-randomized comparative effectiveness trial: study protocol.

Authors:  Thomas K Houston; Rajani S Sadasivam; Daniel E Ford; Joshua Richman; Midge N Ray; Jeroan J Allison
Journal:  Implement Sci       Date:  2010-11-17       Impact factor: 7.327

9.  CONSORT-EHEALTH: improving and standardizing evaluation reports of Web-based and mobile health interventions.

Authors:  Gunther Eysenbach
Journal:  J Med Internet Res       Date:  2011-12-31       Impact factor: 5.428

10.  Evaluating the QUIT-PRIMO clinical practice ePortal to increase smoker engagement with online cessation interventions: a national hybrid type 2 implementation study.

Authors:  Thomas K Houston; Rajani S Sadasivam; Jeroan J Allison; Arlene S Ash; Midge N Ray; Thomas M English; Timothy P Hogan; Daniel E Ford
Journal:  Implement Sci       Date:  2015-11-02       Impact factor: 7.327

View more
  7 in total

1.  An Enhanced Visualization Method to Aid Behavioral Trajectory Pattern Recognition Infrastructure for Big Longitudinal Data.

Authors:  Hua Fang; Zhaoyang Zhang
Journal:  IEEE Trans Big Data       Date:  2017-01-16

2.  A New MI-Based Visualization Aided Validation Index for Mining Big Longitudinal Web Trial Data.

Authors:  Zhaoyang Zhang; Hua Fang; Honggang Wang
Journal:  IEEE Access       Date:  2016-05-16       Impact factor: 3.367

3.  Multiple- vs Non- or Single-Imputation based Fuzzy Clustering for Incomplete Longitudinal Behavioral Intervention Data.

Authors:  Zhaoyang Zhang; Hua Fang
Journal:  IEEE Int Conf Connect Health Appl Syst Eng Technol       Date:  2016-08-18

4.  MIFuzzy Clustering for Incomplete Longitudinal Data in Smart Health.

Authors:  Hua Fang
Journal:  Smart Health (Amst)       Date:  2017-04-27

5.  Acculturation, Depression, and Smoking Cessation: a trajectory pattern recognition approach.

Authors:  Sun S Kim; Hua Fang; Kunsook Bernstein; Zhaoyang Zhang; Joseph DiFranza; Douglas Ziedonis; Jeroan Allison
Journal:  Tob Induc Dis       Date:  2017-07-24       Impact factor: 2.600

6.  Observational study protocol for evaluating control of hypertension and the effects of social determinants.

Authors:  Heather Angier; Nathalie Huguet; Miguel Marino; Beverly Green; Heather Holderness; Rachel Gold; Megan Hoopes; Jennifer DeVoe
Journal:  BMJ Open       Date:  2019-03-15       Impact factor: 2.692

Review 7.  Transforming big data into computational models for personalized medicine and health care.

Authors:  S M Reza Soroushmehr; Kayvan Najarian
Journal:  Dialogues Clin Neurosci       Date:  2016-09       Impact factor: 5.986

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.