| Literature DB >> 31662345 |
Yifan Zhang1, Erin E Holsinger1, Lea Prince1, Jonathan A Rodden2, Sonja A Swanson3, Matthew M Miller4, Garen J Wintemute5, David M Studdert6,7.
Abstract
BACKGROUND: Virtually all existing evidence linking access to firearms to elevated risks of mortality and morbidity comes from ecological and case-control studies. To improve understanding of the health risks and benefits of firearm ownership, we launched a cohort study: the Longitudinal Study of Handgun Ownership and Transfer (LongSHOT).Entities:
Keywords: cohort study; firearm; mortality; violence
Mesh:
Year: 2019 PMID: 31662345 PMCID: PMC7146924 DOI: 10.1136/injuryprev-2019-043385
Source DB: PubMed Journal: Inj Prev ISSN: 1353-8047 Impact factor: 2.399
Summary of data linkage steps used to assemble the LongSHOT cohort
| Step | Blocking key | Focus of linkage algorithms and manual reviews | Step rationale | Percentage of all matches identified | |
| Purchasers-voter file | Deaths-voter file | ||||
| A | Same date of birth+same residential address | Name fields | Identifies highest probability matches | 80.12% | 79.74% |
| B | Same date of birth+similar first and last names | Name fields, address fields | Removes address as a blocking criterion to capture relocators between SVRD extract date and purchase/death date. | 17.92% | 15.64% |
| C | Same date of birth+both persons female+similar first and middle names | Name fields, address fields | Same as step B, except also allows changes of last names among women. | 0.73% | 0.06% |
| D | Same address | Name fields, date of birth field | Removes date of birth from blocking key to allow for errors in this field. | 1.24% | 4.56% |
LongSHOT, Longitudinal Study of Handgun Ownership and Transfer; SVRD, Statewide Voter Registration Database.
Inter-rater and intrarater reliability of manual review
| DROS–SVRD linkage | Mortality–SVRD linkage | |||
| Inter-rater | Intrarater | Inter-rater | Intrarater | |
| Expected agreement by chance alone (%) | 50.51 | 50.60 | 51.79 | 52.10 |
| Observed agreement (95% CI) | 94.20% (91.78% to 96.08%) | 93.40% (90.86% to 95.41%) | 91.80% (89.04% to 94.05%) | 92.60% (89.94% to 94.74%) |
| Kappa (95% CI) | 0.88 (0.83 to 0.91) | 0.87 (0.82 to 0.90) | 0.83 (0.77 to 0.87) | 0.85 (0.79 to 0.89) |
DROS, Dealer Record of Sale database; SVRD, Statewide Voter Registration Database.
Techniques for identifying matched records with discrepant first, middle and/or last names
| Extent of discrepancy | Source of name discrepancy | Retrieval technique | Place of application* |
| Slight or moderate | Misspelling, entry errors, and so on | Jaro-Winkler distance | Throughout all steps |
| Phonetically similar but different spelling | Encoded all names using the Soundex function in R to allow matching of phonetically similar names | Throughout all steps | |
| Shortened or expanded/hyphenated versions of same names | Allowed for substring matches between name fields | Step A: name bins 3, 4, 5 | |
| Extreme | Use of nicknames and contractions (eg, Elizabeth—Betty, Tommy Joe—TJ) | Allowed for matches to common nicknames (see section VII of | Step A: name bins 1, 2, 6, 7, 9 |
| Change or concatenation of last names among females | Relaxed last name matching criteria | Step C | |
| Allowed for matches between current last names (in purchaser and mortality records) and former last names (in voter records) | Throughout all steps | ||
| Switches in name order | Allowed for reverse matching of first-middle and first-last | Step A: name bins 1, 2, 3, 5, 6, 9 |
*Refers to locations in the charts of linkage algorithms provided in section V of the online supplementary appendix.
Additional variables used to inform match determinations in hard cases
| Consideration | Intuition | Place of application* |
| Rarity of name in the population | Two records with same name but minor discrepancies on another link variable are more likely to be the same if first, middle or last name is uncommon. | Step B: substep 2, substep 3(2)(d) |
| Geodistance between discrepant addresses | Persons who move addresses are more likely to relocate near (eg, same city or county) than far (eg, distant city or county). | Step A: all name bins |
| Geodistance+rurality | All else equal, two records that match on all variables except address are more likely to be true matches if both are in the same sparsely populated area than if both are in the same densely populated area. | Manual review only |
| Time interval between discrepant dates of birth | When errors in (or intended alternate uses of) birth dates occur, the conflicting dates are more likely to be proximate than distant. | Step D: blocking key; auto rule-in bin D |
*Refers to locations in the charts of linkage algorithms provided in section V of the online supplementary appendix.
Characteristics of sharp and fuzzy matches*
| DROS–SVRD linkage | Mortality–SVRD linkage | |||
| Matches identified in step A | Matches identified in step B, C or D | Matches identified in step A | Matches identified in step B, C or D | |
| Male (%) | 86.05 | 85.45 | 51.96 | 48.81 |
| Age† | ||||
| Mean (years) | 45 | 38 | 73 | 73 |
| Median (years) | 45 | 35 | 76 | 77 |
| Race/ethnicity | ||||
| White (%) | 74.52 | 69.33 | 70.73 | 66.33 |
| Hispanic (%) | 13.18 | 15.76 | 13.30 | 14.41 |
| Black (%) | 3.78 | 5.79 | 8.34 | 11.82 |
| Asian (%) | 5.73 | 5.80 | 7.00 | 6.55 |
| Other (%) | 2.79 | 3.32 | 0.63 | 0.90 |
| Residential location‡ | ||||
| Urban (%) | 81.96 | 81.16 | 87.80 | 87.41 |
| Suburban (%) | 12.22 | 10.95 | 6.94 | 6.10 |
| Large rural town (%) | 3.11 | 3.46 | 3.11 | 2.64 |
| Small rural town (%) | 2.28 | 2.48 | 1.61 | 2.37 |
*In both linkages, all variables in the step A match are significantly different (p<0.01) from step B/C/D matches.
†Refers to cohort members’ age at the midpoint of their observation period.
‡Categories are based on the US Census Bureau’s urban-rural classification system and refer to cohort members’ residential location at time they entered the cohort.
DROS, Dealer Record of Sale database; SVRD, Statewide Voter Registration Database.
Key challenges to address in future analyses of the LongSHOT cohort exploring the relationship between handgun ownership and mortality
| Challenge | Description |
| Mismeasurement of exposure | Non-handgun owners may in fact be owners due to, for example, failure to match their purchases in probabilistic linkage, unlawful handgun acquisition and purchases made prior to 1985. Non-handgun owners may own unobserved long guns. |
| Unobserved confounding | Relevant differences may exist between handgun owners and non-owners that are not measured in the linked data (eg, incidence of mental illness, risk-taking propensity). |
| Restriction of cohort to voter file registrants in California | Generalisations to non-registrants in California and to residents of other states may be impaired by relevant unobserved heterogeneity. |
LongSHOT, Longitudinal Study of Handgun Ownership and Transfer.