| Literature DB >> 28989981 |
Demetris Avraam1, Rebecca C Wilson1, Paul Burton1.
Abstract
Three synthetic datasets - of observation size 15,000, 155,000 and 1,555,000 participants, respectively - were created by simulating eleven cardiac and anthropometric variables from nine collection ages of the ALSAPC birth cohort study. The synthetic datasets retain similar data properties to the ALSPAC study data they are simulated from (co-variance matrices, as well as the mean and variance values of the variables) without including the original data itself or disclosing participant information. In this instance, the three synthetic datasets have been utilised in an academia-industry collaboration to build a prototype virtual reality data analysis software, but they could have a broader use in method and software development projects where sensitive data cannot be freely shared.Entities:
Keywords: ALSPAC; Simulated data; data visualisation; synthetic data; virtual reality; visual analytics
Year: 2017 PMID: 28989981 PMCID: PMC5605951 DOI: 10.12688/wellcomeopenres.12441.1
Source DB: PubMed Journal: Wellcome Open Res ISSN: 2398-502X
A description of the ALSPAC variables used to generate the simulated datasets.
| ALSPAC
| Description | Simulated
|
|---|---|---|
| kz021 | Sex | sex |
| f7ms010 | Height (cm): F@7 | height.7 |
| f7ms012 | Sitting height (cm): F@7 | height.sit.7 |
| f7ms018 | Waist circumference (cm): F@7 | waist.7 |
| f7ms020 | Hip circumference (cm): F@7 | hip.7 |
| f7ms026 | Weight (kg): F@7 | weight.7 |
| f7ms026a | BMI: F@7 | BMI |
| f7sa021 | Mean BP systolic: samples F@7 | sbp.7 |
| f7sa022 | Mean BP diastolic: samples F@7 | dbp.7 |
| f7sa023 | Mean Pulse: samples F@7 | pulse.7 |
| f7003c | Age (months) at Focus @ 7 visit | age.7 |
| f8lf020 | Child height (cm): LF, F@8 | height.8 |
| f8lf021 | Child weight (kg): LF, F@8 | weight.8 |
| f8003c | Age (months) at Focus @ 8 visit | age.8 |
| f9ms010 | Height (cm): F@9 | height.9 |
| f9ms012 | Sitting height (cm): F@9 | height.sit.9 |
| f9ms018 | Waist circumference (cm): F@9 | waist.9 |
| f9ms020 | Hip circumference (cm): F@9 | hip.9 |
| f9ms026 | Weight (kg): F@9 | weight.9 |
| f9ms026a | BMI: F@9 | BMI.9 |
| f9sa021 | Mean BP systolic: samples F@9 | sbp.9 |
| f9sa022 | Mean BP diastolic: samples F@9 | dbp.9 |
| f9sa023 | Mean Pulse: samples F@9 | pulse.9 |
| f9003c | Age (months) at Focus @ 9 visit | age.9 |
| fdms010 | Height (cm): F10+ | height.10 |
| fdms012 | Sitting height (cm): F10+ | height.sit.10 |
| fdms018 | Waist circumference (cm): F10+ | waist.10 |
| fdms026 | Weight (kg): F10+ | weight.10 |
| fdms026a | BMI: F10+ | BMI.10 |
| SBP | Systolic blood pressure_AS | sbp.10 |
| DBP | Diastolic blood pressure_AS | dbp.10 |
| fd003c | Age (months) at F10+ visit | age.10 |
| fems010 | Height (cm): F11+ | height.11 |
| fems012 | Sitting height (cm): F11+ | height.sit.11 |
| fems018 | Waist circumference (cm): F11+ | waist.11 |
| fems020 | Hip circumference (cm): F11+ | hip.11 |
| fems026 | Weight (kg): F11+ | weight.11 |
| fems026a | BMI: F11+ | BMI.11 |
| fesa021 | Mean BP systolic: samples F11+ | sbp.11 |
| fesa022 | Mean BP diastolic: samples F11+ | dbp.11 |
| fesa023 | Mean Pulse: samples F11+ | pulse.11 |
| fe003c | Age (months) at F11+ visit | age.11 |
| ff2000 | M5: Height (cms) | height.12 |
| ff2005 | M7: Sitting height (cms) | height.sit.12 |
| ff2020 | M11: Waist circumference (cms) | waist.12 |
| ff2620 | B8: BP result 1 - systolic | sbp.12 |
| ff2621 | B9: BP result 1 - diastolic | dbp.12 |
| ff2622 | B10: BP result 1 - pulse | pulse.12 |
| ff0011a | DV: Age of study child at attendance
| age.12 |
| fg3100 | M5: Height (cms) : TF2 | height.13 |
| fg3120 | M11: Waist circumference (cms) :
| waist.13 |
| fg3130 | M15: Weight (Kgs) : TF2 | weight.13 |
| fg6120 | B15: BP result 1 - systolic : TF2 | sbp.13 |
| fg6121 | B16: BP result 1 - diastolic : TF2 | dbp.13 |
| fg6122 | B17: BP result 1 - pulse : TF2 | pulse.13 |
| fg0011a | DV: Age of study child at attendance
| age.13 |
| fh3000 | M5: Height (cms) : TF3 | height.15 |
| fh3010 | M15: Weight (Kgs) : TF3 | weight.15 |
| fh4020 | M11: Waist circumference (cms) :
| waist.15 |
| fh4030 | V6: Sitting height (cms) : TF3 | height.sit.15 |
| fh2030 | AC18: BP result 1 - systolic : TF3 | sbp.15 |
| fh2031 | AC19: BP result 1 - diastolic : TF3 | dbp.15 |
| fh2032 | AC20: BP result 1 - pulse : TF3 | pulse.15 |
| fh0011a | DV: Age of study child at attendance
| age.15 |
| FJMR020 | M5: Height (cms) [F17] | height.17 |
| FJMR022 | M15: Weight (kgs) [F17] | weight.17 |
| FJAR020a | dv: Right arm BP mean: systolic | sbp.17 |
| FJAR020b | dv: Right arm BP mean: diastolic | dbp.17 |
| FJAR020c | dv: Right arm BP mean: pulse | pulse.17 |
| FJMR022a | dv: BMI [F17] | bmi.17 |
| FJ003a | Age in months at clinic visit [F17] | age.17 |
A summary of data capture in clinics for the respective ALSPAC variables.
| Variable (units) | F@7 | F@8 | F@9 | F@10 | F@11 | TF1 | TF2 | TF3 | TF4 |
|---|---|---|---|---|---|---|---|---|---|
| Gender (1 male, 2 female) | yes | yes | yes | yes | yes | yes | yes | yes | yes |
| Exact Age (months) | yes | yes | yes | yes | yes | yes | yes | yes | yes |
| Height (cm) | yes | yes | yes | yes | yes | yes | yes | yes | yes |
| Sitting Height (cm) | yes | NA | yes | yes | yes | yes | NA | yes | NA |
| Waist Circumference (cm) | yes | NA | yes | yes | yes | yes | yes | yes | NA |
| Hip Circumference (cm) | yes | NA | yes | NA | yes | NA | NA | NA | NA |
| Weight (kg) | yes | yes | yes | yes | yes | yes | yes | yes | yes |
| Systolic Blood Pressure (mmHg) | yes | NA | yes | yes | yes | yes | yes | yes | yes |
| Diastolic Blood Pressure (mmHg) | yes | NA | yes | yes | yes | yes | yes | yes | yes |
| Pulse (Beats per minute) | yes | NA | yes | NA | yes | yes | yes | yes | yes |
| BMI (kg/m2) | yes | NA | yes | yes | yes | NA | NA | NA | yes |