| Literature DB >> 24001000 |
Joanne K Daggy1, Huiping Xu, Siu L Hui, Roland E Gamache, Shaun J Grannis.
Abstract
BACKGROUND: Methods for linking real-world healthcare data often use a latent class model, where the latent, or unknown, class is the true match status of candidate record-pairs. This commonly used model assumes that agreement patterns among multiple fields within a latent class are independent. When this assumption is violated, various approaches, including the most commonly proposed loglinear models, have been suggested to account for conditional dependence.Entities:
Mesh:
Year: 2013 PMID: 24001000 PMCID: PMC3766252 DOI: 10.1186/1472-6947-13-97
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Loglinear model results for last name/first name block
| | | ||||||
|---|---|---|---|---|---|---|---|
| | | | | ||||
| Year of birth | 0.581 | 0.0047 | 0.615 | 0.0048 | 0.615 | 0.0048 | |
| SSN | 0.025 | 0.0011 | 0.027 | 0.0011 | 0.026 | 0.0011 | |
| Day of birth | 0.572 | 0.0046 | 0.608 | 0.0048 | 0.608 | 0.0048 | |
| Telephone | 0.173 | 0.0029 | 0.141 | 0.0026 | 0.140 | 0.0026 | |
| Zip code | 0.409 | 0.0041 | 0.363 | 0.0040 | 0.362 | 0.0040 | |
| Sex | 0.710 | 0.0037 | 0.695 | 0.0038 | 0.694 | 0.0038 | |
| Month of birth | 0.716 | 0.0044 | 0.768 | 0.0044 | 0.769 | 0.0045 | |
| Year of birth | 0.026 | 0.0002 | 0.026 | 0.0002 | 0.026 | 0.0002 | |
| SSN | 6E-06 | 8E-06 | 1E-05 | 9.00E-06 | 5E-05 | 1E-05 | |
| Day of birth | 0.032 | 0.0003 | 0.031 | 0.0003 | 0.031 | 0.0003 | |
| Telephone | 5E-04 | 0.0001 | 0.002 | 0.0001 | 0.002 | 0.0001 | |
| Zip code | 0.037 | 0.0003 | 0.039 | 0.0003 | 0.039 | 0.0003 | |
| Sex | 0.661 | 0.0006 | 0.661 | 0.0006 | 0.661 | 0.0006 | |
| Month of birth | 0.082 | 0.0004 | 0.081 | 0.0004 | 0.081 | 0.0004 | |
| G2 | 8852.9 | 2974.26 | 2881.45 | ||||
Loglinear model results for MCHD data blocked on last name and first name (Number of record pairs = 618,213). All parameters are statistically significant (p < .001) for all three models, except for u2 which is not significant for conditional independence model (p = .468) or Loglinear Model I (p = .143).
Figure 1Correlation residual plots for last name/first name block. Last name/first name block: pairwise correlation residuals for Model 0 (Panel A), Model I (Panel B), and Model II (Panel C).