Abstract
OBJECTIVES: To quality assure a Trusted Third Party linked data set to prepare it for analysis.
Keywords: administrative data; births and maternities; data linkage; quality assurance
Year: 2018 PMID: 29500200 PMCID: PMC5855305 DOI: 10.1136/bmjopen-2017-017898
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
NHS Digital hierarchical stepwise linkage algorithm used to link ONS birth records to HES delivery records
| Step/match rank | NHS number | DOB | Sex | Postcode | |
| 1 | Exact | Exact | Exact | Exact | |
| 2 | Exact | Exact | Exact | ||
| 3 | Exact | Partial | Exact | Exact | |
| 4 | Exact | Partial | Exact | ||
| 5 | Exact | Exact | |||
| 6 | Exact | Exact | Exact | Where NHSNO does not contradict the match and DOB is not 1 January and the postcode is not in the 'ignore' list | |
| 7 | Exact | Exact | Exact | Where NHSNO does not contradict the match and DOB is not 1 January | |
| 8 | Exact |
DOB, date of birth; HES, Hospital Episode Statistics; NHSNO, NHS number; ONS, Office for National Statistics.
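The stepwise logic in the table above can be sketched as a list of rules tried in rank order, returning the first step whose fields all agree. This is a minimal illustration, not NHS Digital's implementation. The field names, the `match_rank` function and the definition of a "partial" DOB match (at least two of day, month and year agreeing) are assumptions. Only steps 1 to 4 are encoded, because the column alignment of the later steps cannot be recovered from the table as shown here.

```python
from datetime import date


def dob_partial(a: date, b: date) -> bool:
    """ASSUMED definition of a partial DOB match: two of day/month/year agree."""
    agree = (a.day == b.day) + (a.month == b.month) + (a.year == b.year)
    return agree >= 2


# Comparators treat a missing value (None) as a non-match.
COMPARATORS = {
    "exact": lambda x, y: x is not None and x == y,
    "partial": lambda x, y: x is not None and y is not None and dob_partial(x, y),
}

# Steps 1-4 as read from the table; field names are hypothetical.
# Steps 5-8 are deliberately omitted (ambiguous column alignment).
STEPS = [
    (1, {"nhs_number": "exact", "dob": "exact", "sex": "exact", "postcode": "exact"}),
    (2, {"nhs_number": "exact", "dob": "exact", "sex": "exact"}),
    (3, {"nhs_number": "exact", "dob": "partial", "sex": "exact", "postcode": "exact"}),
    (4, {"nhs_number": "exact", "dob": "partial", "sex": "exact"}),
]


def match_rank(rec_a: dict, rec_b: dict, steps=STEPS):
    """Return the rank of the first step satisfied by the pair, else None."""
    for rank, rule in steps:
        if all(COMPARATORS[kind](rec_a.get(field), rec_b.get(field))
               for field, kind in rule.items()):
            return rank
    return None
```

Because the rules are tried in order, a pair that satisfies several steps is always assigned the lowest (strictest) rank, which is what makes the match rank usable as a rough indicator of link quality.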
Summary of linkage rates for singleton and multiple births by calendar year
| (A) Singleton births by calendar year | |||||
| Year | Total singleton ONS birth records | Total singleton ONS births linked to any Maternity HES by NHS Digital | Total singleton ONS births not linked to any Maternity HES by NHS Digital | Singleton ONS birth records linked to HESID but no delivery record | % Total singleton ONS births linked to any Maternity HES by NHS Digital |
| 2005 | 599 237 | 565 559 | 33 678 | 16 349 | 94.4 |
| 2006 | 620 730 | 589 127 | 31 603 | 15 802 | 94.9 |
| 2007 | 638 995 | 612 782 | 26 213 | 12 889 | 95.9 |
| 2008 | 656 196 | 635 411 | 20 785 | 9908 | 96.8 |
| 2009 | 653 322 | 636 284 | 17 038 | 9099 | 97.4 |
| 2010 | 665 599 | 652 533 | 13 066 | 7009 | 98.0 |
| 2011 | 666 582 | 653 552 | 13 030 | 8329 | 98.0 |
| 2012 | 676 399 | 661 511 | 14 888 | 11 444 | 97.8 |
| 2013 | 647 666 | 633 222 | 14 444 | 11 913 | 97.8 |
| 2014 | 643 860 | 628 032 | 15 828 | 13 514 | 97.5 |
| Total | 6 468 586 | 6 268 013 | 200 573 | 116 256 | 96.9 |
HES, Hospital Episode Statistics; ONS, Office for National Statistics.
Comparison of percentages of missing values in key baby data items on ONS birth records and HES normalised delivery records for singleton and multiple births 2005–2014
| Variable | ONS singleton birth records | HES singleton delivery records | ONS multiple birth records | HES multiple delivery records |
| Total | 6 468 586 | 7 040 590 | 208 326 | 230 019 |
| % with missing values for: | ||||
| Baby’s date of birth | 0.00 | 19.4 | 0.00 | 16.3 |
| Sex of baby | 0.00 | 23.1 | 0.00 | 16.1 |
| Gestational age | 0.71 | 37.2 | 0.80 | 27.8 |
| Birth weight | 0.56 | 22.7 | 1.70 | 15.3 |
HES, Hospital Episode Statistics; ONS, Office for National Statistics.
Categories of linked ONS birth records and HES delivery records
| Category | Summary | Details | Action |
| SMSB | ONS birth record linking to one relevant HES delivery record | One-to-one linkage as expected | Keep link |
| SMSB | ONS birth record linking to many relevant HES delivery records, no clear episode order sequence | Due to duplicate HES records, or predelivery and postdelivery HES records | Keep link to one of the duplicates or the most appropriate delivery record |
| SMSB | ONS birth record linking to many relevant HES records, with a clear episode order sequence | Due to multiple episodes as part of a hospital spell | Keep link to one episode record and maximise delivery information |
| SMDB | ONS birth record linking to one or more incorrect HES delivery records (same mother) | Another birth to the same mother within the study period | Move to unlinked file |
| WL | ONS birth record linking to one or more incorrect HES delivery records | The maternal linkage is incorrect or poor data quality | Move to unlinked file |
HES, Hospital Episode Statistics; ONS, Office for National Statistics; SMDB, same mother different baby; SMSB, same mother same baby; WL, wrong link.
The four combinations of baby data items that confirm that the link between the ONS birth record and HES relates to the same baby
| Combination | ONS birth record and HES delivery location of birth matches | ONS birth record and HES delivery baby date of birth matches | ONS birth record and HES delivery birth weight matches | ONS birth record and HES delivery gestational age matches | ONS birth record and HES delivery sex of baby matches | ||||
| 1 | Exact | and | Exact | and | Exact | and | Exact | and | Exact |
| 2 | Exact | and | Exact | and | (Exact | or | Exact | or | Exact) |
| 3 | Exact | and | Exact | ||||||
| 4 | Exact | and | Differs by up to 4 days* | and | Missing or not completely different | and | Missing or not completely different | and | Missing or not completely different |
*By up to 10 days for multiple births.
HES, Hospital Episode Statistics; ONS, Office for National Statistics.
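The four combinations above can be read as a decision procedure that returns the strictest combination a linked pair satisfies. The sketch below is one possible reading under stated assumptions, not the authors' code. The function name, the record keys, and the tolerances used to stand in for "not completely different" (`weight_tol_g`, `gest_tol_weeks`) are hypothetical. Row 3 is read as requiring only location and date of birth to match.

```python
from datetime import date


def _eq(a, b):
    """Exact match: both values present and equal."""
    return a is not None and b is not None and a == b


def _compatible(a, b, tol):
    """'Missing or not completely different', ASSUMED to mean within tol."""
    return a is None or b is None or abs(a - b) <= tol


def confirming_combination(ons, hes, multiple=False,
                           weight_tol_g=500, gest_tol_weeks=2):
    """Return 1-4 for the first combination satisfied, or None."""
    # All four combinations require the location of birth to match exactly.
    if not _eq(ons["location"], hes["location"]):
        return None
    dob_o, dob_h = ons["dob"], hes["dob"]
    if dob_o is None or dob_h is None:
        return None

    weight = _eq(ons["weight_g"], hes["weight_g"])
    gest = _eq(ons["gest_weeks"], hes["gest_weeks"])
    sex = _eq(ons["sex"], hes["sex"])

    if dob_o == dob_h:
        if weight and gest and sex:
            return 1
        if weight or gest or sex:
            return 2
        return 3

    # Combination 4: DOB within 4 days (10 days for multiple births).
    max_days = 10 if multiple else 4
    if abs((dob_o - dob_h).days) <= max_days:
        sex_ok = ons["sex"] is None or hes["sex"] is None or ons["sex"] == hes["sex"]
        if (_compatible(ons["weight_g"], hes["weight_g"], weight_tol_g)
                and _compatible(ons["gest_weeks"], hes["gest_weeks"], gest_tol_weeks)
                and sex_ok):
            return 4
    return None
```

Returning the combination number rather than a plain boolean keeps a record of how strong the evidence for each retained link was.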
Summary of quality assurance results
| (A) Singleton births by calendar year | |||||||||
| Year | Total singleton ONS birth records | Total singleton ONS births linked to any HES delivery records by NHS Digital | % Total singleton ONS births linked to any HES delivery records by NHS Digital | Of singleton ONS births linked to any HES delivery records by NHS Digital, number left with link to one HES delivery record after QA | Of singleton ONS births linked to any HES delivery records by NHS Digital, % left with link to one HES delivery record after QA | Of singleton ONS births linked to any HES delivery records by NHS Digital, number left with no links to HES delivery records after QA | Of singleton ONS births linked to any HES delivery records by NHS Digital, % left with no links to HES delivery records after QA | % of total singleton ONS birth records left with link to one HES delivery record after QA | % of total singleton ONS birth records with no link to a HES delivery record after QA |
| 2005 | 599 237 | 565 559 | 94.4 | 554 566 | 98.1 | 10 993 | 1.9 | 92.5 | 7.5 |
| 2006 | 620 730 | 589 127 | 94.9 | 573 770 | 97.4 | 15 357 | 2.6 | 92.4 | 7.6 |
| 2007 | 638 995 | 612 782 | 95.9 | 595 585 | 97.2 | 17 197 | 2.8 | 93.2 | 6.8 |
| 2008 | 656 196 | 635 411 | 96.8 | 621 006 | 97.7 | 14 405 | 2.3 | 94.6 | 5.4 |
| 2009 | 653 322 | 636 284 | 97.4 | 621 423 | 97.7 | 14 861 | 2.3 | 95.1 | 4.9 |
| 2010 | 665 599 | 652 533 | 98.0 | 641 167 | 98.3 | 11 366 | 1.7 | 96.3 | 3.7 |
| 2011 | 666 582 | 653 552 | 98.0 | 642 263 | 98.3 | 11 289 | 1.7 | 96.4 | 3.6 |
| 2012 | 676 399 | 661 511 | 97.8 | 648 501 | 98.0 | 13 010 | 2.0 | 95.9 | 4.1 |
| 2013 | 647 666 | 633 222 | 97.8 | 622 943 | 98.4 | 10 279 | 1.6 | 96.2 | 3.8 |
| 2014 | 643 860 | 628 032 | 97.5 | 617 263 | 98.3 | 10 769 | 1.7 | 95.9 | 4.1 |
| Total | 6 468 586 | 6 268 013 | 96.9 | 6 138 487 | 97.9 | 129 526 | 2.1 | 94.9 | 5.1 |
HES, Hospital Episode Statistics; ONS, Office for National Statistics; QA, quality assurance.
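The percentage columns in the table above use two different denominators, which is easy to miss. The short sketch below recomputes them from the counts in the 2005 and Total rows. The function and key names are illustrative only. The last two values reproduce the final two columns of the table when all singleton ONS birth records are taken as the denominator, which is an inference from the arithmetic rather than something stated in the column headers.

```python
def qa_percentages(total_ons: int, linked: int, one_link_after_qa: int) -> dict:
    """Recompute the percentage columns from three counts in a table row."""
    no_link_after_qa = linked - one_link_after_qa
    return {
        # Denominator: all ONS birth records
        "pct_linked": round(100 * linked / total_ons, 1),
        # Denominator: ONS births linked by NHS Digital
        "pct_linked_kept": round(100 * one_link_after_qa / linked, 1),
        "pct_linked_dropped": round(100 * no_link_after_qa / linked, 1),
        # Denominator: all ONS birth records again (inferred)
        "pct_all_kept": round(100 * one_link_after_qa / total_ons, 1),
        "pct_all_without_link": round(
            100 * (total_ons - one_link_after_qa) / total_ons, 1),
    }


row_2005 = qa_percentages(599_237, 565_559, 554_566)
row_total = qa_percentages(6_468_586, 6_268_013, 6_138_487)
```

For 2005 this gives 94.4, 98.1, 1.9, 92.5 and 7.5, matching the tabulated row.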
Breakdown of reasons for discarding HES delivery records linked to an ONS birth excluding duplicate copies of HES delivery episodes and invalid deliveries for England singleton births by year
| Year | All | A | B | C | D | E | F |
| | Total links broken (excluding exact duplicates and invalid records) | Same mother different baby | Multiple episodes: pre/postdelivery admission | Multiple episodes: part of spell | Multiple episodes: exact duplicates with different epikeys | Wrong link | Potential multiple birth |
| 2005 | 46 164 | 24 457 | 14 941 | 5805 | 104 | 817 | 40 |
| 2006 | 49 706 | 25 082 | 16 565 | 6984 | 204 | 782 | 89 |
| 2007 | 48 999 | 26 304 | 13 187 | 8301 | 323 | 799 | 85 |
| 2008 | 47 748 | 26 032 | 9466 | 11 473 | 89 | 614 | 74 |
| 2009 | 40 519 | 25 688 | 6137 | 7680 | 283 | 673 | 58 |
| 2010 | 37 007 | 24 993 | 5176 | 5819 | 366 | 599 | 54 |
| 2011 | 37 013 | 24 679 | 6033 | 5367 | 741 | 145 | 48 |
| 2012 | 36 176 | 24 966 | 5899 | 4298 | 838 | 146 | 29 |
| 2013 | 34 285 | 23 071 | 5709 | 5018 | 281 | 169 | 37 |
| 2014 | 31 690 | 21 980 | 3662 | 3957 | 1863 | 110 | 118 |
| All years | 409 307 | 247 252 | 86 775 | 64 702 | 5092 | 4854 | 632 |
| Percentage | 100 | 60.41 | 21.20 | 15.81 | 1.24 | 1.19 | 0.15 |
HES, Hospital Episode Statistics; ONS, Office for National Statistics.
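As a consistency check on the table above, the "All years" counts for reasons A to F can be summed and converted to shares of all broken links. This is a small worked calculation on the published counts. The variable names are illustrative.

```python
# "All years" counts for each reason, copied from the table above.
reasons = {
    "A": 247_252,  # same mother different baby
    "B": 86_775,   # multiple episodes: pre/postdelivery admission
    "C": 64_702,   # multiple episodes: part of spell
    "D": 5_092,    # multiple episodes: exact duplicates with different epikeys
    "E": 4_854,    # wrong link
    "F": 632,      # potential multiple birth
}

total_broken = sum(reasons.values())
shares = {code: round(100 * n / total_broken, 2) for code, n in reasons.items()}
```

The six reasons sum to the tabulated total of 409 307, and the shares agree with the Percentage row to two decimal places.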
Comparison of differences in the percentage distributions of values for key variables for ONS birth records that remained correctly linked to one HES delivery record after the quality assurance procedure, and the ONS birth records with all links to HES discarded after the quality assurance procedure, for singleton and multiple births
| Variable | Singleton births | Multiple births | ||
| Value | Difference in percentages* | Value | Difference in percentages* | |
| Match rank of linkage algorithm | 6 | 5.4 | ||
| Region of birth | Elsewhere | 2.3 | East of England | 1.9 |
| Home | 38.3 | London | 5.8 | |
| Home | 2.8 | |||
| Year of birth | 2006 | 2.5 | 2012 | 7.4 |
| 2007 | 3.6 | 2013 | 6.9 | |
| 2009 | 1.4 | 2014 | 7.2 | |
| Month of birth | March | 2.0 | March | 1.7 |
| April | 1.5 | |||
| Day of birth (8) | Saturday | 1.2 | ||
| Day of birth (11) | Saturday | 1.2 | ||
| Hour of birth | 3.00–5.59 | 1.7 | 6.00–8.59 | 1.4 |
| 6.00–8.59 | 1.8 | 9.00–11.59 | 1.5 | |
| Sex of baby | Male | 1.1 | ||
| Age of mother | 30–34 | 1.7 | 30–34 | 1.2 |
| 35–39 | 1.3 | 35–39 | 2.4 | |
| 40–44 | 3.9 | |||
| Ethnicity of baby | Not known | 3.4 | Black African | 1.9 |
| Not known | 1.4 | |||
| Gestational age | Preterm | 1.1 | Preterm | 1.1 |
| Stillbirth | Yes | 2.3 | Yes | 2.5 |
| Gestational age missing on HES | Yes | 20.1 | Yes | 10.9 |
| Birth weight missing on HES | Yes | 12.3 | Yes | 7.0 |
| Trust | PPL (Portland) | 1.7 | PPL (Portland) | 2.5 |
| RGQ (Ipswich) | 2.0 | RGQ (Ipswich) | 1.5 | |
| RKE (Whittington) | 4.0 | RJ6 (Croydon) | 1.3 | |
| RVL (Barnet and Chase Farm) | 1.4 | RKE (Whittington) | 3.9 | |
| RQ8 (Mid Essex) | 1.6 | |||
| RTH (Oxford University Hospitals) | 1.6 | |||
| RVL (Barnet and Chase Farm) | 1.4 | |||
| RVV (East Kent) | 2.0 | |||
| RW3 (Central Manchester) | 1.1 | |||
| RWH (East and North Hertfordshire) | 1.1 | |||
*Over 1%.
HES, Hospital Episode Statistics; ONS, Office for National Statistics.