Griffin M Weber1. 1. Information Technology, Harvard Medical School, Boston, Massachusetts 02115, USA. weber@hms.harvard.edu
Abstract
BACKGROUND AND OBJECTIVE: In 2008 we developed a shared health research information network (SHRINE), which for the first time enabled research queries across the full patient populations of four Boston hospitals. It uses a federated architecture, where each hospital returns only the aggregate count of the number of patients who match a query. This allows hospitals to retain control over their local databases and comply with federal and state privacy laws. However, because patients may receive care from multiple hospitals, the result of a federated query might differ from what the result would be if the query were run against a single central repository. This paper describes the situations when this happens and presents a technique for correcting these errors. METHODS: We use a one-time process of identifying which patients have data in multiple repositories by comparing one-way hash values of patient demographics. This enables us to partition the local databases such that all patients within a given partition have data at the same subset of hospitals. Federated queries are then run separately on each partition independently, and the combined results are presented to the user. RESULTS: Using theoretical bounds and simulated hospital networks, we demonstrate that once the partitions are made, SHRINE can produce more precise estimates of the number of patients matching a query. CONCLUSIONS: Uncertainty in the overlap of patient populations across hospitals limits the effectiveness of SHRINE and other federated query tools. Our technique reduces this uncertainty while retaining an aggregate federated architecture.
BACKGROUND AND OBJECTIVE: In 2008 we developed a shared health research information network (SHRINE), which for the first time enabled research queries across the full patient populations of four Boston hospitals. It uses a federated architecture, where each hospital returns only the aggregate count of the number of patients who match a query. This allows hospitals to retain control over their local databases and comply with federal and state privacy laws. However, because patients may receive care from multiple hospitals, the result of a federated query might differ from what the result would be if the query were run against a single central repository. This paper describes the situations when this happens and presents a technique for correcting these errors. METHODS: We use a one-time process of identifying which patients have data in multiple repositories by comparing one-way hash values of patient demographics. This enables us to partition the local databases such that all patients within a given partition have data at the same subset of hospitals. Federated queries are then run separately on each partition independently, and the combined results are presented to the user. RESULTS: Using theoretical bounds and simulated hospital networks, we demonstrate that once the partitions are made, SHRINE can produce more precise estimates of the number of patients matching a query. CONCLUSIONS: Uncertainty in the overlap of patient populations across hospitals limits the effectiveness of SHRINE and other federated query tools. Our technique reduces this uncertainty while retaining an aggregate federated architecture.
Entities:
Keywords:
Algorithms; Hospital Shared Services; Medical Record Linkage; Medical Records Systems, Computerized; Search Engine
Authors: Scott Oster; Stephen Langella; Shannon Hastings; David Ervin; Ravi Madduri; Joshua Phillips; Tahsin Kurc; Frank Siebenlist; Peter Covitz; Krishnakant Shanbhag; Ian Foster; Joel Saltz Journal: J Am Med Inform Assoc Date: 2007-12-20 Impact factor: 4.497
Authors: David B Keator; J S Grethe; D Marcus; B Ozyurt; S Gadde; Sean Murphy; S Pieper; D Greve; R Notestine; H J Bockholt; P Papadopoulos Journal: IEEE Trans Inf Technol Biomed Date: 2008-03
Authors: Andrew J McMurry; Clint A Gilbert; Ben Y Reis; Henry C Chueh; Isaac S Kohane; Kenneth D Mandl Journal: J Am Med Inform Assoc Date: 2007-04-25 Impact factor: 4.497
Authors: Griffin M Weber; Shawn N Murphy; Andrew J McMurry; Douglas Macfadden; Daniel J Nigrin; Susanne Churchill; Isaac S Kohane Journal: J Am Med Inform Assoc Date: 2009-06-30 Impact factor: 4.497
Authors: Shawn N Murphy; Griffin Weber; Michael Mendis; Vivian Gainer; Henry C Chueh; Susanne Churchill; Isaac Kohane Journal: J Am Med Inform Assoc Date: 2010 Mar-Apr Impact factor: 4.497
Authors: Thomas A Drake; Jonathan Braun; Alberto Marchevsky; Isaac S Kohane; Christopher Fletcher; Henry Chueh; Bruce Beckwith; David Berkowicz; Frank Kuo; Qing T Zeng; Ulysses Balis; Ana Holzbach; Andrew McMurry; Connie E Gee; Clement J McDonald; Gunther Schadow; Mary Davis; Eyas M Hattab; Lonnie Blevins; John Hook; Michael Becich; Rebecca S Crowley; Sheila E Taube; Jules Berman Journal: Hum Pathol Date: 2007-05-08 Impact factor: 3.466
Authors: Rainu Kaushal; George Hripcsak; Deborah D Ascheim; Toby Bloom; Thomas R Campion; Arthur L Caplan; Brian P Currie; Thomas Check; Emme Levin Deland; Marc N Gourevitch; Raffaella Hart; Carol R Horowitz; Isaac Kastenbaum; Arthur Aaron Levin; Alexander F H Low; Paul Meissner; Parsa Mirhaji; Harold A Pincus; Charles Scaglione; Donna Shelley; Jonathan N Tobin Journal: J Am Med Inform Assoc Date: 2014-05-12 Impact factor: 4.497
Authors: Jeffrey G Klann; Michael D Buck; Jeffrey Brown; Marc Hadley; Richard Elmore; Griffin M Weber; Shawn N Murphy Journal: J Am Med Inform Assoc Date: 2014-04-03 Impact factor: 4.497
Authors: V M Castro; S W Kong; C C Clements; R Brady; A J Kaimal; A E Doyle; E B Robinson; S E Churchill; I S Kohane; R H Perlis Journal: Transl Psychiatry Date: 2016-01-05 Impact factor: 6.222