| Literature DB >> 27227157 |
Adam Allston1, Reshma Bhattacharjee2, Sahithi Boggavarapu3, Sharon Carter3, Amanda D Castel4, Jeff Collmann5, Colin Flynn2, Auntré Hamp6, Diana Jordan3, Seble Kassaye7, Michael Kharfen6, Garret Lum6, Raghu Pemmaraju8, Anne Rhodes3, Jeff Stover3, Mary A Young7, Joanne Michelle F Ocampo5, J C Smart9.
Abstract
BACKGROUND: The National HIV/AIDS Strategy calls for active surveillance programs for human immunodeficiency virus (HIV) to more accurately measure access to and retention in care across the HIV care continuum for persons living with HIV within their jurisdictions and to identify persons who may need public health services. However, traditional public health surveillance methods face substantial technological and privacy-related barriers to data sharing.Entities:
Keywords: HIV; data sharing; public health; surveillance; technology
Year: 2016 PMID: 27227157 PMCID: PMC4869245 DOI: 10.2196/publichealth.5317
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Overview of categories and definitions used in the study’s person-matching algorithm.
| Matching categories | Variable definitionsa |
| Exact | if m.last_name and m.first_name and m.dob and m.ssn and m.sex and m.race then m.score := exact; |
| Very high | elsif (m.last_name and m.first_name and m.dob and m.sex) or m.ssn then m.score := very_high; |
| High | elsif m.last_name and m.first_name and m.dob and (m.sex or m.race) then m.score := high; |
| Medium high | elsif m.last_name and m.first_soundex and m.dob and m.sex then m.score := medium_high; |
| Medium (1st definition) | elsif m.last_name and m.dob and m.sex and m.race then m.score:= medium; |
| Medium (2nd definition) | elsif m.last_soundex and m.first_soundex and m.dob and (m.sex or m.race) then m.score := medium; |
| Medium low | elsif m.last_soundex and m.first_soundex and m.partial_dob and m.partial_ssn and (m.sex or m.race) then m.score := medium_low; |
| Low | elsif m.last_soundex and (m.partial_dob and m.partial_ssn) and (m.sex or m.race) then m.score := low; |
| Very low | elsif m.last_soundex and (m.partial_dob or m.partial_ssn) then m.score := very_low; |
aLast name=Last name of PLWH in eHARS person file; First name=First name of PLWH in eHARS person file; DOB=Date of birth of PLWH in eHARS person file; SSN=Social Security Number of PLWH in eHARS person; Race=hierarchical race/ethnicity assignment for PLWH in eHARS person-view; Soundex=Soundex is a phonetic, alphanumeric code created by converting a name into an index letter and a 3-digit code. The index letter is the first letter of the name. The 3-digit code is calculated from the remaining letters of the name, based on rules found in the eHARS Technical Guidance. There is a Soundex variable for first name and a Soundex for last Name.
Overview of person matches in eHARS databases across DC, MD, and VA from 1981 to 2015.
| Person matches across jurisdictions | Exact | Very high | High | Medium high | Medium | Very low | Total |
| DC-MDa | 4013 | 5907 | 53 | 268 | 645 | 482 | 11,368 |
| MD-VAb | 856 | 2343 | 11 | 117 | 377 | 865 | 4569 |
| VA-DCc | 1064 | 3340 | 15 | 149 | 438 | 529 | 5535 |
| Total | 5933 | 11,590 | 79 | 534 | 1460 | 1876 | 21,472 |
aDC-reported MD matches were equal to MD-reported DC matches.
bMD-reported VA matches were equal to VA-reported MD matches.
cVA-reported DC matches were equal to DC-reported VA matches.
DC validation results.
|
| Nonmatch | Match | Total | ||||
|
| N | % | N | % | N | % | |
|
| |||||||
|
| Exact | 0 | 0.0 | 4009 | 100.0 | 4009 | 100.0 |
|
| Very High | 264 | 4.5 | 5560 | 95.5 | 5824 | 100.0 |
|
| High | 0 | 0.0 | 52 | 100.0 | 52 | 100.0 |
|
| Medium High | 3 | 1.1 | 264 | 98.9 | 267 | 100.0 |
|
| Medium | 178 | 28.0 | 457 | 72.0 | 635 | 100.0 |
|
| Very Low | 329 | 69.9 | 142 | 30.2 | 471 | 100.0 |
|
| Total | 774 | 6.9 | 10484 | 93.1 | 11,258 | 100.0 |
|
| |||||||
|
| Exact | 0 | 0.0 | 1067 | 100.0 | 1067 | 100.0 |
|
| Very High | 33 | 1.0 | 3286 | 99.0 | 3319 | 100.0 |
|
| High | 0 | 0.0 | 13 | 100.0 | 13 | 100.0 |
|
| Medium High | 5 | 3.4 | 144 | 96.6 | 149 | 100.0 |
|
| Medium | 91 | 20.9 | 344 | 79.1 | 435 | 100.0 |
|
| Very Low | 401 | 79.1 | 106 | 20.9 | 507 | 100.0 |
|
| Total | 530 | 9.7 | 4960 | 90.4 | 5490 | 100.0 |
VA validation results.
|
| Nonmatch | Match | Total | ||||
|
| N | % | N | % | N | % | |
|
| |||||||
|
| Exact | 0 | 0.0 | 214 | 100.0 | 214 | 100.0 |
|
| Very High | 21 | 3.6 | 562 | 96.4 | 583 | 100.0 |
|
| High | 0 | 0.0 | 4 | 100.0 | 4 | 100.0 |
|
| Medium High | 3 | 6.4 | 44 | 93.6 | 47 | 100.0 |
|
| Medium | 98 | 53.3 | 86 | 46.7 | 184 | 100.0 |
|
| Very Low | 400 | 98.0 | 8 | 2.0 | 408 | 100.0 |
|
| Total | 522 | 36.3 | 918 | 63.8 | 1440 | 100.0 |
|
| |||||||
|
| Exact | 0 | 0.0 | 264 | 100.0 | 264 | 100.0 |
|
| Very High | 14 | 1.7 | 802 | 98.3 | 816 | 100.0 |
|
| High | 0 | 0.0 | 4 | 100.0 | 4 | 100.0 |
|
| Medium High | 4 | 6.9 | 54 | 93.1 | 58 | 100.0 |
|
| Medium | 69 | 33.3 | 138 | 66.7 | 207 | 100.0 |
|
| Very Low | 199 | 87.7 | 28 | 12.3 | 227 | 100.0 |
|
| Total | 286 | 18.1 | 1290 | 81.9 | 1576 | 100.0 |
MD validation resultsd.
|
| Nonmatch | Match | Total | ||||
|
| N | % | N | % | N | % | |
|
| |||||||
|
| Exact | 0 | 0.0 | 4030 | 100.0 | 4030 | 100.0 |
|
| Very High | 24 | 0.4 | 5846 | 99.2 | 5870 | 100.0 |
|
| High | 0 | 0.0 | 52 | 100.0 | 52 | 100.0 |
|
| Medium High | 0 | 0.0 | 272 | 100.0 | 272 | 100.0 |
|
| Medium | 431 | 67.5 | N/A | 73.9 | 638 | 100.0 |
|
| Very Low | 441 | 94.4 | N/A | 28.6 | 467 | 100.0 |
|
| Total |
|
|
| 98.6 | 11,329 | 100.0 |
|
| |||||||
|
| Exact | 0 | 0.0 | 855 | 100.0 | 855 | 100.0 |
|
| Very High | 10 | 0.4 | 2336 | 99.7 | 2344 | 100.0 |
|
| High | 0 | 0.0 | 11 | 100.0 | 11 | 100.0 |
|
| Medium High | 0 | 0.0 | 118 | 100.0 | 118 | 100.0 |
|
| Medium | 292 | 77.5 | N/A | 90.3 | 377 | 100.0 |
|
| Very Low | 827 | 96.3 | N/A | 15.3 | 858 | 100.0 |
|
| Total |
|
|
| 97.7 | 4563 | 100.0 |
dSince a 5% random sample was used to manually review Medium & Very Low categories, exact numbers (N) of matches could not be shown in this table.