Federated queries of clinical data repositories: the sum of the parts does not equal the whole.

Griffin M Weber

Abstract

BACKGROUND AND OBJECTIVE: In 2008 we developed the Shared Health Research Information Network (SHRINE), which for the first time enabled research queries across the full patient populations of four Boston hospitals. It uses a federated architecture, in which each hospital returns only an aggregate count of the patients who match a query. This allows hospitals to retain control over their local databases and comply with federal and state privacy laws. However, because patients may receive care from multiple hospitals, the result of a federated query might differ from the result of the same query run against a single central repository. This paper describes the situations in which this happens and presents a technique for correcting these errors.
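To make the discrepancy concrete, here is a minimal Python sketch using made-up patient IDs. The hospital labels, IDs, and function names are all hypothetical and are not taken from SHRINE. It assumes a patient would match in a central repository exactly when they match at one or more hospitals; under that assumption the central count lies between the largest single-hospital count and the sum of all counts.

```python
# Hypothetical toy data: IDs of patients matching a query at each hospital.
# Patients 3 and 4 receive care at two hospitals each.
matches = {
    "A": {1, 2, 3},
    "B": {3, 4},
    "C": {4, 5, 6},
}

def federated_counts(matches):
    """What a federated query sees: one aggregate count per hospital."""
    return {hospital: len(patients) for hospital, patients in matches.items()}

def central_count(matches):
    """What a central repository would report: distinct patients overall."""
    return len(set().union(*matches.values()))

def naive_bounds(counts):
    """Bounds on the distinct count when patient overlap is unknown."""
    return max(counts.values()), sum(counts.values())

counts = federated_counts(matches)
print(counts)                  # per-hospital aggregate counts
print(sum(counts.values()))    # naive federated total, double-counts overlap
print(central_count(matches))  # distinct patients
print(naive_bounds(counts))    # (lower, upper) without overlap information
```

In this toy case the summed federated counts overstate the number of distinct patients, because the two shared patients are counted twice.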
METHODS: We use a one-time process to identify which patients have data in multiple repositories by comparing one-way hash values of patient demographics. This enables us to partition the local databases such that all patients within a given partition have data at the same subset of hospitals. Federated queries are then run on each partition independently, and the combined results are presented to the user.
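The matching and partitioning steps might be sketched as follows. This is an illustration under stated assumptions, not the implementation described in the paper: the choice of SHA-256, the shared salt, the demographic fields used, and the function names are all hypothetical.

```python
import hashlib
from collections import defaultdict

def demographic_hash(first, last, dob, salt="shared-secret"):
    """One-way hash of normalized demographics.

    Assumption: SHA-256 with a salt shared by all hospitals; the fields
    and normalization here are illustrative only.
    """
    key = "|".join(s.strip().lower() for s in (first, last, dob))
    return hashlib.sha256((salt + "|" + key).encode("utf-8")).hexdigest()

def partition_by_hospital_subset(hashes_by_hospital):
    """Group patient hashes by the exact set of hospitals where they appear.

    Returns a dict mapping frozenset(hospitals) -> set of patient hashes.
    """
    seen_at = defaultdict(set)
    for hospital, hashes in hashes_by_hospital.items():
        for h in hashes:
            seen_at[h].add(hospital)
    partitions = defaultdict(set)
    for h, hospitals in seen_at.items():
        partitions[frozenset(hospitals)].add(h)
    return dict(partitions)

# Hypothetical usage: "bob" has data at both hospitals.
alice = demographic_hash("Alice", "Smith", "1980-01-01")
bob = demographic_hash("Bob", "Jones", "1975-06-15")
carol = demographic_hash("Carol", "Lee", "1990-12-31")
partitions = partition_by_hospital_subset({"A": {alice, bob}, "B": {bob, carol}})
for subset, members in partitions.items():
    print(sorted(subset), len(members))
```

Only hash values are compared, so the hospitals do not need to exchange raw demographics to learn which partition each patient belongs to.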
RESULTS: Using theoretical bounds and simulated hospital networks, we demonstrate that once the partitions are made, SHRINE can produce more precise estimates of the number of patients matching a query.
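As a rough illustration of why partitioning can tighten the estimate, the sketch below computes simple bounds per partition and adds them up. It is not the paper's derivation: it assumes a patient matches centrally exactly when they match at one or more hospitals, that each partition's total size is known from the hash comparison, and all names and numbers are hypothetical.

```python
def partition_bounds(counts_by_partition, partition_sizes=None):
    """Sum per-partition (lower, upper) bounds on distinct matching patients.

    counts_by_partition: frozenset(hospitals) -> {hospital: matching count}
    partition_sizes: optional frozenset(hospitals) -> patients in partition
    """
    lower = upper = 0
    for subset, counts in counts_by_partition.items():
        p_lower = max(counts.values())   # at least the largest single count
        p_upper = sum(counts.values())   # at most the sum (no overlap)
        if partition_sizes is not None:
            # Cannot exceed the number of patients in the partition.
            p_upper = min(p_upper, partition_sizes[subset])
        lower += p_lower
        upper += p_upper
    return lower, upper

# Hypothetical per-partition counts for a three-hospital network.
A, B, C = "A", "B", "C"
counts_by_partition = {
    frozenset({A}): {A: 2},           # patients seen only at A: count is exact
    frozenset({A, B}): {A: 1, B: 1},  # patients seen at both A and B
    frozenset({B, C}): {B: 1, C: 1},
    frozenset({C}): {C: 2},
}
sizes = {
    frozenset({A}): 10,
    frozenset({A, B}): 1,
    frozenset({B, C}): 1,
    frozenset({C}): 12,
}
print(partition_bounds(counts_by_partition))         # without partition sizes
print(partition_bounds(counts_by_partition, sizes))  # with partition sizes
```

Single-hospital partitions contribute exact counts, so the remaining uncertainty comes only from partitions of patients shared between hospitals.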
CONCLUSIONS: Uncertainty in the overlap of patient populations across hospitals limits the effectiveness of SHRINE and other federated query tools. Our technique reduces this uncertainty while retaining an aggregate federated architecture.

Keywords:  Algorithms; Hospital Shared Services; Medical Record Linkage; Medical Records Systems, Computerized; Search Engine

Year:  2013        PMID: 23349080      PMCID: PMC3715334          DOI: 10.1136/amiajnl-2012-001299

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


