| Literature DB >> 35730052 |
Salvatore D Tomarchio1, Antonio Punzo1, Antonello Maruotti2,3.
Abstract
Hidden Markov models (HMMs) have been extensively used in the univariate and multivariate literature. However, there has been an increased interest in the analysis of matrix-variate data over the recent years. In this manuscript we introduce HMMs for matrix-variate balanced longitudinal data, by assuming a matrix normal distribution in each hidden state. Such data are arranged in a four-way array. To address for possible overparameterization issues, we consider the eigen decomposition of the covariance matrices, leading to a total of 98 HMMs. An expectation-conditional maximization algorithm is discussed for parameter estimation. The proposed models are firstly investigated on simulated data, in terms of parameter recovery, computational times and model selection. Then, they are fitted to a four-way real data set concerning the unemployment rates of the Italian provinces, evaluated by gender and age classes, over the last 16 years.Entities:
Keywords: Clustering; Hidden Markov models; Matrix-variate; Parsimonious models
Year: 2022 PMID: 35730052 PMCID: PMC9198417 DOI: 10.1007/s11222-022-10107-0
Source DB: PubMed Journal: Stat Comput ISSN: 0960-3174 Impact factor: 2.324
Nomenclature, covariance matrix structure, and number of free parameters in for the parsimonious models obtained via the eigen decomposition of the state covariance matrices. is the identity matrix
| Family | Model | Type | Volume | Shape | Orientation | # of free parameters in |
|---|---|---|---|---|---|---|
| Spherical | EII | Equal | Spherical | – | 1 | |
| Spherical | VII | Variable | Spherical | – | ||
| Diagonal | EEI | Equal | Equal | Axis-Aligned | ||
| Diagonal | VEI | Variable | Equal | Axis-Aligned | ||
| Diagonal | EVI | Equal | Variable | Axis-Aligned | ||
| Diagonal | VVI | Variable | Variable | Axis-Aligned | ||
| General | EEE | Equal | Equal | Equal | ||
| General | VEE | Variable | Equal | Equal | ||
| General | EVE | Equal | Variable | Equal | ||
| General | VVE | Variable | Variable | Equal | ||
| General | EEV | Equal | Equal | Variable | ||
| General | VEV | Variable | Equal | Variable | ||
| General | EVV | Equal | Variable | Variable | ||
| General | VVV | Variable | Variable | Variable |
Average MSEs of the parameter estimates for the EII-II MV-HMM. The average is computed among the MSEs of the elements of each estimated parameter, over the K states and 50 data sets in each scenario
| Dimension | Parameter | |||||
|---|---|---|---|---|---|---|
| 2 | 0.0083 | 0.0040 | 0.0063 | 0.0033 | ||
| 0.0020 | 0.0016 | 0.0016 | 0.0012 | |||
| 0.0026 | 0.0029 | 0.0024 | 0.0031 | |||
| 0.0013 | 0.0004 | 0.0010 | 0.0005 | |||
| 4 | 0.0164 | 0.0084 | 0.0135 | 0.0069 | ||
| 0.0029 | 0.0010 | 0.0024 | 0.0013 | |||
| 0.0022 | 0.0017 | 0.0017 | 0.0021 | |||
| 0.0009 | 0.0006 | 0.0009 | 0.0004 | |||
| 2 | 0.0083 | 0.0042 | 0.0064 | 0.0031 | ||
| 0.0003 | 0.0002 | 0.0003 | 0.0001 | |||
| 0.0045 | 0.0042 | 0.0029 | 0.0029 | |||
| 0.0018 | 0.0011 | 0.0012 | 0.0005 | |||
| 4 | 0.0131 | 0.0071 | 0.0130 | 0.0067 | ||
| 0.0004 | 0.0003 | 0.0004 | 0.0001 | |||
| 0.0017 | 0.0023 | 0.0019 | 0.0021 | |||
| 0.0009 | 0.0005 | 0.0009 | 0.0004 | |||
Average MSEs of the parameter estimates for the VVE-EV MV-HMM. The average is computed among the MSEs of the elements of each estimated parameter, over the K states and 50 data sets in each scenario
| Dimension | Parameter | |||||
|---|---|---|---|---|---|---|
| 2 | 0.0095 | 0.0042 | 0.0069 | 0.0033 | ||
| 0.0083 | 0.0032 | 0.0076 | 0.0039 | |||
| 0.0025 | 0.0014 | 0.0022 | 0.0011 | |||
| 0.0032 | 0.0026 | 0.0025 | 0.0018 | |||
| 0.0015 | 0.0001 | 0.0007 | 0.0005 | |||
| 4 | 0.0135 | 0.0073 | 0.0113 | 0.0054 | ||
| 0.0101 | 0.0055 | 0.0098 | 0.0050 | |||
| 0.0038 | 0.0018 | 0.0034 | 0.0015 | |||
| 0.0020 | 0.0018 | 0.0022 | 0.0018 | |||
| 0.0008 | 0.0004 | 0.0008 | 0.0004 | |||
| 2 | 0.0072 | 0.0037 | 0.0067 | 0.0035 | ||
| 0.0006 | 0.0004 | 0.0007 | 0.0003 | |||
| 0.0032 | 0.0016 | 0.0031 | 0.0016 | |||
| 0.0034 | 0.0027 | 0.0034 | 0.0022 | |||
| 0.0010 | 0.0006 | 0.0010 | 0.0004 | |||
| 4 | 0.0720 | 0.0225 | 0.0169 | 0.0095 | ||
| 0.0142 | 0.0061 | 0.0018 | 0.0008 | |||
| 0.0342 | 0.0171 | 0.0062 | 0.0033 | |||
| 0.0068 | 0.0031 | 0.0021 | 0.0019 | |||
| 0.0011 | 0.0007 | 0.0007 | 0.0004 | |||
Fig. 1Heat maps of the average computational time for fitting the 98 MV-HMMs, computed over 50 data sets generated by a EII-II MV-HMM with and (a), and (b), and (c), and (d)
Fig. 2Heat maps of the average computational time for fitting the 98 MV-HMMs, computed over 50 data sets generated by a VVE-EV MV-HMM with and (a), and (b), and (c), and (d)
Average computational times (in seconds) for fitting all the 98 HHMs with K states over the 50 data sets generated by the EII-II MV-HMM and VVE-EV MV-HMM on each scenario
| Dimension | MV-HMM | Type |
| ||||
|---|---|---|---|---|---|---|---|
| EII-II | Sequential | 2 | 82.38 | 159.96 | 41.37 | 78.03 | |
| 4 | 346.51 | 753.71 | 89.32 | 176.54 | |||
| Parallel | 2 | 9.65 | 15.53 | 6.64 | 9.42 | ||
| 4 | 29.87 | 60.01 | 10.39 | 17.34 | |||
| VVE-EV | Sequential | 2 | 112.23 | 243.09 | 39.62 | 87.13 | |
| 4 | 373.85 | 730.61 | 93.08 | 193.70 | |||
| Parallel | 2 | 10.55 | 20.23 | 5.21 | 8.84 | ||
| 4 | 31.12 | 59.62 | 9.43 | 17.27 | |||
| EII-II | Sequential | 2 | 289.68 | 548.37 | 75.23 | 146.05 | |
| 4 | 285.00 | 531.69 | 202.20 | 374.94 | |||
| Parallel | 2 | 24.85 | 44.17 | 8.37 | 14.87 | ||
| 4 | 25.14 | 46.56 | 19.28 | 33.6 | |||
| VVE-EV | Sequential | 2 | 268.88 | 529.20 | 85.49 | 149.50 | |
| 4 | 981.04 | 2007.42 | 206.9 | 378.85 | |||
| Parallel | 2 | 22.99 | 43.48 | 9.10 | 13.70 | ||
| 4 | 78.09 | 167.51 | 20.68 | 39.08 | |||
Number of times, over the 50 data sets generated by the two MV-HMMs on each scenario, for which the true parsimonious structure is selected by the BIC when all the 98 MV-HMMs are fitted for
| Dimension | MV-HMM |
| ||||
|---|---|---|---|---|---|---|
| EII-II | 2 | 47 | 48 | 49 | 49 | |
| 4 | 46 | 50 | 50 | 50 | ||
| VVE-EV | 2 | 48 | 50 | 50 | 48 | |
| 4 | 50 | 50 | 50 | 50 | ||
| EII-II | 2 | 48 | 49 | 49 | 50 | |
| 4 | 50 | 50 | 50 | 50 | ||
| VVE-EV | 2 | 50 | 50 | 50 | 50 | |
| 4 | 45 | 49 | 50 | 50 | ||
Parsimonious structure (Pars), number of states (K) and value of the information criterion (BIC) for the best among each competing model according to the BIC
| Model | Pars | BIC | |
|---|---|---|---|
| M-HMMs | VEE | 8 | 13942.92 |
| MVN-Ms | VEE-VE | 6 | 17451.99 |
Fig. 3Italian provinces map colored according to the estimated state memberships