| Literature DB >> 33805175 |
Rocío Hernández-Sanjaime1, Martín González1, Antonio Peñalver1, Jose J López-Espín1.
Abstract
The presence of unaccounted heterogeneity in simultaneous equation models (SEMs) is frequently problematic in many real-life applications. Under the usual assumption of homogeneity, the model can be seriously misspecified, and it can potentially induce an important bias in the parameter estimates. This paper focuses on SEMs in which data are heterogeneous and tend to form clustering structures in the endogenous-variable dataset. Because the identification of different clusters is not straightforward, a two-step strategy that first forms groups among the endogenous observations and then uses the standard simultaneous equation scheme is provided. Methodologically, the proposed approach is based on a variational Bayes learning algorithm and does not need to be executed for varying numbers of groups in order to identify the one that adequately fits the data. We describe the statistical theory, evaluate the performance of the suggested algorithm by using simulated data, and apply the two-step method to a macroeconomic problem.Entities:
Keywords: Leonenko estimator; Shannon entropy; clustering; computational econometrics; simultaneous equation model; variational algorithms
Year: 2021 PMID: 33805175 PMCID: PMC8064307 DOI: 10.3390/e23040384
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Akaike information criterion (AIC) mean value for different estimated models over simulation runs and clustering classification error rate committed by the algorithm regarding known group membership model. Note: GM, group membership; PM, percentage model; AGG, aggregate model; CA, proposed clustering algorithm.
| Size | GM | PM | AGG | CA | CA Clustering Error | ||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
| Aggregate | % | |
| 2 | 76,003.15 | 76,069.52 | 76,200.78 | 76,601.07 | 82,421.45 | 75,964.97 | 1.65 |
| 4 | 144,038.25 | 148,422.05 | 150,889.29 | 152,474.46 | 153,744.05 | 144,130.05 | 0.77 |
| 6 | 238,407.47 | 244,234.13 | 247,418.12 | 248,074.44 | 249,785.10 | 238,794.03 | 0.67 |
| 8 | 332,163.59 | 338,568.06 | 343,189.90 | 344,504.72 | 349,888.61 | 333,404.96 | 0.39 |
Evolution of AIC mean value for different estimated models.
| Size | Number of Clusters | |||
|---|---|---|---|---|
|
| 1 | 2 | 3 | 4 |
| 2 | 83,795.08 | 78,139.16 | 76,307.94 | 75,408.18 |
| 4 | 136,575.21 | 136,703.96 | 135,996.17 | 132,493.36 |
| 6 | 255,276.91 | 256,088.06 | 251,367.12 | 249,979.87 |
| 8 | 348,972.97 | 351,896.28 | 341,170.68 | 331,409.30 |
Statistical Criteria for Model Selection.
| Number of Clusters |
|
|
|---|---|---|
| 1 | 2004.415 | 2004.415 |
| 2 | 1856.893 | 1855.913 |
| 3 | 1811.696 | 1638.756 |