| Literature DB >> 24376874 |
Katie Harron1, Harvey Goldstein2, Angie Wade1, Berit Muller-Pebody3, Roger Parslow4, Ruth Gilbert1.
Abstract
BACKGROUND: Linkage of risk-factor data for blood-stream infection (BSI) in paediatric intensive care (PICU) with bacteraemia surveillance data to monitor risk-adjusted infection rates in PICU is complicated by a lack of unique identifiers and under-ascertainment in the national surveillance system. We linked, evaluated and performed preliminary analyses on these data to provide a practical guide on the steps required to handle linkage of such complex data sources.Entities:
Mesh:
Year: 2013 PMID: 24376874 PMCID: PMC3869925 DOI: 10.1371/journal.pone.0085278
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Steps involved in linkage of PICANet and LabBase2 for enhanced BSI surveillance in PICU.
Figure 2Completeness of identifiers in LabBase2 by PICU.
Figure 3Match weight calculation process.
Figure 4Prior-informed imputation for ‘incomplete’ linkage between PICANet and LabBase2.
Predictor variables: Length of stay, age, admission type, admission source, renal status, quarter-year at admission.
Figure 5Total number of reports (all ages) submitted to LabBase2 for laboratories serving PICUs between 2003-2010.
Initial weight estimates based on first training dataset (records agreeing on NHS number or hospital number).
|
| |||
|---|---|---|---|
|
|
|
| |
|
| 12.58 | -7.94 | -0.17 |
|
| 12.80 | -2.23 | 0.10 |
|
| 6.20 | -3.88 | 0.26 |
|
| 5.26 | -3.78 | -0.46 |
|
| 5.19 | -3.22 | 0.25 |
|
| 1.28 | -6.08 | 1.66 |
|
| 1.18 | -6.81 | 1.66 |
|
| 0.91 | -6.68 | 1.66 |
|
| 0.92 | -5.63 | -0.39 |
Identification of links through deterministic linkage.
|
|
|
|
|
|---|---|---|---|
| NHS number or hospital number | 4595 | 4586 | 9 |
| First name, surname and date of birth | 832 | 832 | 0 |
| Postcode prefix and postcode suffix | 538 | 416 | 122 |
| Postcode prefix or postcode suffix and date of birth | 94 | 52 | 42 |
| At least 2 elements of date of birth and either first name or surname | 1559 | 115 | 1444 |
| Total reviewed | 7618 | 6001 | 1617 |
Final probabilistic match weights.
|
| ||
|---|---|---|
|
|
| |
|
| 5.18 | -4.05 |
|
| 4.66 | -6.89 |
|
| 0.91 | -4.70 |
|
| 5.53 | -1.06 |
Figure 6Four iterations for match weight calculation.
Lines=thresholds.
Match probabilities under independence and dependence assumptions.
|
|
|
| |||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| 0 | 0 | 0 | 0 | 1.000 | 0.000 | 1.000 | 0.000 |
| 0 | 0 | 0 | 1 | 1.000 | 0.000 | 1.000 | 0.000 |
| 0 | 0 | 1 | 0 | 1.000 | 0.000 | 1.000 | 0.000 |
| 0 | 0 | 1 | 1 | 1.000 | 0.000 | 0.998 | 0.002 |
| 0 | 1 | 0 | 0 | 1.000 | 0.000 | 1.000 | 0.000 |
| 0 | 1 | 0 | 1 | 0.993 | 0.007 | 0.999 | 0.001 |
| 0 | 1 | 1 | 0 | 0.883 | 0.117 | 0.957 | 0.043 |
| 0 | 1 | 1 | 1 | 0.600 | 0.400 | 0.530 | 0.470 |
| 1 | 0 | 0 | 0 | 1.000 | 0.000 | 0.999 | 0.001 |
| 1 | 0 | 0 | 1 | 1.000 | 0.000 | 1.000 | 0.000 |
| 1 | 0 | 1 | 0 | 0.998 | 0.002 | 0.999 | 0.001 |
| 1 | 0 | 1 | 1 | 0.949 | 0.051 | 0.883 | 0.117 |
| 1 | 1 | 0 | 0 | 1.000 | 0.000 | 1.000 | 0.000 |
| 1 | 1 | 0 | 1 | 0.985 | 0.015 | 0.980 | 0.020 |
| 1 | 1 | 1 | 0 | 0.175 | 0.825 | <0 | >1 |
| 1 | 1 | 1 | 1 | 0.009 | 0.991 | <0 | >1 |
Estimated bias based on gold-standard data (BCH and OUH).
|
|
|
|
|
|---|---|---|---|
|
|
|
| |
| Deterministic | 125 | 1.14% | -70.5% |
| Highest-weighted: Relaxed threshold | 492 | 4.47% | 15.5 |
| Highest weighted: Conservative threshold | 418 | 3.80% | -1.9 |
| Prior-informed imputation | 424 | 3.85% | -0.5 |