| Literature DB >> 28812323 |
Therese Sheppard1, Robyn Tamblyn2,3,4, Michal Abrahamowicz2, Mark Lunt1, Matthew Sperrin5,6, William G Dixon1,6.
Abstract
PURPOSE: To compare the more complex technique, functional principal component analysis (FPCA), to simpler methods of estimating values of sparse and irregularly spaced continuous variables at given time points in longitudinal data using a diabetic patient cohort from UK primary care.Entities:
Keywords: continuous variable; functional principal component analysis; linear interpolation; mean prediction error; pharmacoepidemiology; predictive accuracy; sparse longitudinal data
Mesh:
Substances:
Year: 2017 PMID: 28812323 PMCID: PMC5724699 DOI: 10.1002/pds.4273
Source DB: PubMed Journal: Pharmacoepidemiol Drug Saf ISSN: 1053-8569 Impact factor: 2.890
Figure 1Establishment of cohort of new users of oral hypoglycaemics during July 2007 – January 2009, with index date of 1 January 2009 and 30‐month follow‐up period from January 2009 – June 2011
Figure 2Schematic of a single patient's HbA1c measurements since first ever using oral hypoglycaemic medication, where triangles and squares represent true measured values
Definition of linear interpolation methods to impute missing observations
|
| |
| Last occurrence carried forward (LOCF) |
Missing value imputed from the previous nonmissing value (Figure |
| Global |
Missing value imputed by prolonging a line joining the first and last nonmissing values (Figure |
| Local |
Missing value imputed locally by prolonging a line joining the penultimate and last nonmissing values (Figure |
| Bisector |
Missing value imputed from an intermediate line, the bisector, between the global and local lines (Figure |
|
| |
| Last occurrence carried forward (LOCF) |
Missing value imputed from the previous nonmissing value (Figure |
| Next occurrence carried backward (NOCB) |
Missing value imputed from the next nonmissing value (Figure |
| Global |
Missing value imputed by drawing a line between first and last nonmissing values (Figure |
| Local |
Missing value imputed by drawing a line between the nonmissing values immediately surrounding the missing one (Figure |
Figure 3Various definitions of estimation methods. (a) LOCF indicates last occurrence carried forward linear interpolation. (b) Global linear interpolation. (c) Local linear interpolation. (d) Bisector linear interpolation. (e) Arithmetic mean method. (f) Simple linear regression. (g) FPCA indicates functional principal component analysis. When imputing values missing in the middle: (h) NOCB, next occurrence carried backward; (i) global linear interpolation, (j) local linear interpolation [Colour figure can be viewed at wileyonlinelibrary.com]
Prediction error (absolute difference between predicted and actual values of last observation), described as MAPE and root MSAPE of 4009 subjects and proportion of these subjects with predictions within clinical acceptability and measurement error. Reporting pooled results of 6 randomly sampled data sets from whole new user cohort using mean, SD, and CV
| LOCF | Global | Local | Bisector | AM | SLR | RE‐no covs | RE‐withcovs | FPCA | |
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| Mean | 0.60 | 1.63 | 1.63 | 1.59 | 0.66 | 0.96 | 0.65 | 0.65 | 0.59 |
| SD | 0.009 | 0.010 | 0.033 | 0.023 | 0.009 | 0.016 | 0.005 | 0.008 | 0.008 |
| CV | 1.5% | 0.6% | 2% | 1.45% | 1.36% | 1.67% | 0.77 | 1.23% | 1.36% |
|
| |||||||||
| Mean | 60% | 37% | 37% | 34% | 55% | 49% | 57% | 57% | 61% |
| SD | 0.516 | 0.816 | 0.984 | 0.816 | 0.753 | 0.753 | 0.837 | 0.516 | 1.033 |
| CV | 0.86% | 2.21% | 2.66% | 2.4% | 1.37% | 1.54% | 1.47% | 0.91% | 1.69% |
|
| |||||||||
| Mean | 52% | 32% | 31% | 29% | 48% | 42% | 49% | 49% | 54% |
| SD | 0.753 | 0.753 | 0.753 | 0.753 | 0.894 | 0.516 | 0.548 | 0.408 | 0.548 |
| CV | 1.45% | 2.35% | 2.43% | 2.6% | 1.86% | 1.23% | 1.12% | 0.83% | 1.01% |
|
| |||||||||
| Mean | 0.95 | 2.93 | 3.40 | 2.84 | 0.99 | 1.85 | 0.97 | 0.97 | 0.91 |
| SD | 0.018 | 0.093 | 0.445 | 0.203 | 0.012 | 0.164 | 0.011 | 0.012 | 0.010 |
| CV | 1.84% | 3.15% | 13.09% | 7.15% | 1.25% | 8.88% | 1.11% | 1.26% | 1.13% |
Clinically important difference.
Measurement error.
Age and gender.
LOCF indicates last occurrence carried forward; Global, Local and Bisector, linear interpolation methods for value missing at the end; AM, arithmetic mean method; SLR, simple linear regression method; RE‐no covs: random effects modelling method with no covariates included; RE‐with covs: random effects modelling method with covariates included; FPCA, functional principal component analysis method.
Prediction error (absolute difference between predicted and actual values of middle observation), described as MAPE and root MSAPE of 4009 subjects and proportion of these subjects with predictions within clinical acceptability and measurement error. Reporting pooled results of 6 randomly sampled data sets from whole new user cohort using mean, SD, and CV
| LOCF | NOCB | Global | Local | AM | SLR | RE‐no covs | RE‐with covs | FPCA | |
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| Mean | 0.62 | 0.62 | 0.93 | 0.67 | 0.55 | 0.53 | 0.56 | 0.56 | 0.52 |
| SD | 0.010 | 0.009 | 0.010 | 0.008 | 0.005 | 0.006 | 0.005 | 0.005 | 0.005 |
| CV | 1.61% | 1.45% | 1.08% | 1.19% | 0.91% | 1.13% | 0.89% | 0.89% | 0.96% |
|
| |||||||||
| Mean | 58% | 58% | 51% | 61% | 63% | 65% | 63% | 63% | 66% |
| SD | 0.548 | 1.095 | 0.816 | 0.837 | 0.516 | 0.408 | 0.632 | 0.632 | 0.753 |
| CV | 0.94% | 1.89% | 1.6% | 1.37% | 0.82% | 0.63% | 1% | 1% | 1.14% |
|
| |||||||||
| Mean | 51% | 51% | 44% | 54% | 56% | 58% | 54% | 54% | 57% |
| SD | 0.408 | 0.632 | 0.00 | 0.548 | 0.408 | 0.408 | 0.516 | 1.472 | 0.753 |
| CV | 0.8% | 1.24% | 0% | 1.01% | 0.73% | 0.70% | 0.96% | 2.73% | 1.32% |
|
| |||||||||
| Mean | 1.02 | 1.00 | 1.53 | 1.13 | 0.85 | 0.83 | 0.84 | 0.84 | 0.80 |
| SD | 0.024 | 0.014 | 0.026 | 0.029 | 0.009 | 0.017 | 0.010 | 0.010 | 0.010 |
| CV | 2.36% | 1.39% | 1.69% | 2.57% | 1.08% | 2.04% | 1.19% | 1.19% | 1.29% |
Clinically important difference.
Measurement error.
Age and gender.
LOCF indicates last occurrence carried forward; NOCB, next occurrence carried backward; Global and Local, linear interpolation methods for value missing in the middle; AM, arithmetic mean method; SLR, simple linear regression method; .RE‐no covs, random effects modelling method with no covariates included; RE‐with covs, random effects modelling method with covariates included; FPCA, functional principal component analysis method.