Literature DB >> 27885310

Concurrent generation of multivariate mixed data with variables of dissimilar types.

Anup Amatya1, Hakan Demirtas2.   

Abstract

Data sets originating from wide range of research studies are composed of multiple variables that are correlated and of dissimilar types, primarily of count, binary/ordinal and continuous attributes. The present paper builds on the previous works on multivariate data generation and develops a framework for generating multivariate mixed data with a pre-specified correlation matrix. The generated data consist of components that are marginally count, binary, ordinal and continuous, where the count and continuous variables follow the generalized Poisson and normal distributions, respectively. The use of the generalized Poisson distribution provides a flexible mechanism which allows under- and over-dispersed count variables generally encountered in practice. A step-by-step algorithm is provided and its performance is evaluated using simulated and real-data scenarios.

Entities:  

Keywords:  Generalized Poisson; Mutivariate ordinal; discretization

Year:  2016        PMID: 27885310      PMCID: PMC5117654          DOI: 10.1080/00949655.2016.1177530

Source DB:  PubMed          Journal:  J Stat Comput Simul        ISSN: 0094-9655            Impact factor:   1.424


  9 in total

1.  Simultaneous generation of binary and normal data with specified marginal and association structures.

Authors:  Hakan Demirtas; Beyza Doganay
Journal:  J Biopharm Stat       Date:  2012       Impact factor: 1.051

2.  Simulating Multivariate Nonnormal Data Using an Iterative Algorithm.

Authors:  John Ruscio; Walter Kaczetow
Journal:  Multivariate Behav Res       Date:  2008 Jul-Sep       Impact factor: 5.923

3.  Simulating Ordinal Data.

Authors:  Pier Alda Ferrari; Alessandro Barbiero
Journal:  Multivariate Behav Res       Date:  2012-07       Impact factor: 5.923

4.  Simulation of massive public health data by power polynomials.

Authors:  Hakan Demirtas; Donald Hedeker; Robin J Mermelstein
Journal:  Stat Med       Date:  2012-04-25       Impact factor: 2.373

5.  Joint regression analysis of correlated data using Gaussian copulas.

Authors:  Peter X-K Song; Mingyao Li; Ying Yuan
Journal:  Biometrics       Date:  2008-05-28       Impact factor: 2.571

6.  Concurrent Generation of Ordinal and Normal Data.

Authors:  Hakan Demirtas; Yasemin Yavuz
Journal:  J Biopharm Stat       Date:  2015       Impact factor: 1.051

Review 7.  The Interstitial Cystitis Data Base Study: concepts and preliminary baseline descriptive statistics.

Authors:  L J Simon; J R Landis; D R Erickson; L M Nyberg
Journal:  Urology       Date:  1997-05       Impact factor: 2.649

8.  Some covariance models for longitudinal count data with overdispersion.

Authors:  P F Thall; S C Vail
Journal:  Biometrics       Date:  1990-09       Impact factor: 2.571

9.  A prospective study of interstitial cystitis: results of longitudinal followup of the interstitial cystitis data base cohort. The Interstitial Cystitis Data Base Study Group.

Authors:  K J Propert; A J Schaeffer; C M Brensinger; J W Kusek; L M Nyberg; J R Landis
Journal:  J Urol       Date:  2000-05       Impact factor: 7.450

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.