DataLink record linkage software applied to the cancer registry of Murcia, Spain.

M Márquez Cid, M D Chirlaque, C Navarro.

Abstract

OBJECTIVES: Record linkage between data sets is relatively simple when unique, universal, permanent, and common variables exist in each data set. Because this situation is rare, probabilistic methods are needed to identify corresponding records. DataLink was tested to determine whether clustering techniques improve performance with a minimal loss of accuracy.
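The probabilistic approach mentioned above can be illustrated with a minimal sketch of a Fellegi-Sunter style match weight. This is not DataLink's implementation, which the abstract does not describe. The field names and the m- and u-probabilities (the chance that a field agrees for a true match and for a non-match, respectively) are hypothetical values chosen only for illustration.

```python
import math

# Hypothetical (m, u) probabilities per field; illustrative values only,
# not taken from the DataLink study.
FIELDS = {
    "surname": (0.95, 0.01),
    "birth_date": (0.97, 0.003),
    "sex": (0.99, 0.5),
}

def match_weight(rec_a, rec_b, fields=FIELDS):
    """Sum log2 likelihood ratios over fields (Fellegi-Sunter style)."""
    total = 0.0
    for name, (m, u) in fields.items():
        if rec_a.get(name) == rec_b.get(name):
            # Agreement is evidence for a match: add log2(m / u).
            total += math.log2(m / u)
        else:
            # Disagreement is evidence against: add log2((1 - m) / (1 - u)).
            total += math.log2((1 - m) / (1 - u))
    return total

a = {"surname": "GARCIA", "birth_date": "1950-03-12", "sex": "M"}
b = {"surname": "GARCIA", "birth_date": "1950-03-12", "sex": "M"}
c = {"surname": "LOPEZ", "birth_date": "1962-07-01", "sex": "F"}

print(match_weight(a, b))  # positive: all fields agree
print(match_weight(a, c))  # negative: all fields disagree
```

A pair would then be classified as a match, a non-match, or a candidate for manual review by comparing its weight with thresholds chosen for the data at hand.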
METHODS: The study uses cancer registry data that include hospital discharge and pathology reports from two hospitals in the Murcia Region for the years 2002-2003. These data are standardized before DataLink is run. The original version of DataLink compares all of the records one by one, whereas two later versions of the software apply clustering, which filters on one or more variables. Computing time and the proportion of detected matches were measured for each version.
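The difference between comparing all records one by one and first filtering on one or more variables can be sketched as follows. This assumes that the clustering described above behaves like conventional blocking, which the abstract does not state explicitly. The function names, field names, and records are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

def all_pairs(records):
    """Every pair of record indices: n * (n - 1) / 2 comparisons."""
    return list(combinations(range(len(records)), 2))

def blocked_pairs(records, key_fields):
    """Only pairs of records that share the same values on key_fields."""
    blocks = defaultdict(list)
    for idx, rec in enumerate(records):
        key = tuple(rec[f] for f in key_fields)
        blocks[key].append(idx)
    pairs = []
    for members in blocks.values():
        pairs.extend(combinations(members, 2))
    return pairs

# Hypothetical, already standardized records.
records = [
    {"surname": "GARCIA", "birth_year": 1950},
    {"surname": "GARCIA", "birth_year": 1950},
    {"surname": "LOPEZ", "birth_year": 1962},
    {"surname": "GARCIA", "birth_year": 1971},
]

print(len(all_pairs(records)))                                 # 6
print(len(blocked_pairs(records, ["surname"])))                # 3
print(len(blocked_pairs(records, ["surname", "birth_year"])))  # 1
```

Filtering on more variables reduces the number of comparisons further, but any true duplicate whose records disagree on a filter variable is never compared, which is one way a clustering version can miss duplicates.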
RESULTS: The two clustering versions achieve 96.1% and 96.2% accuracy, respectively. Computing time improves by 97.3% and 98.6% for the two clustering versions compared with the original. The clustering versions lose 0.36% and 1.07% of true duplicates, respectively.
CONCLUSIONS: DataLink implements deterministic and probabilistic record linkage to eliminate duplicates and to merge new information with existing cases. The standardization of variables to a common format has been adapted to the characteristics of Spanish language data. Clustering techniques minimize computational time and maximize accuracy in the detection of corresponding records.
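The standardization of Spanish-language variables mentioned above might look like the sketch below. The abstract does not list DataLink's actual rules, so every choice here (stripping diacritics, including folding Ñ to N, uppercasing, and replacing punctuation with spaces) is an assumption made for illustration, and `standardize_name` is a hypothetical helper.

```python
import re
import unicodedata

def standardize_name(raw):
    """Normalize a name to uppercase ASCII letters separated by single spaces."""
    # Decompose accented characters into base letter plus combining mark.
    decomposed = unicodedata.normalize("NFD", raw)
    # Drop the combining marks (assumption: accents and the tilde are removed).
    no_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Replace hyphens, periods, and other non-letters with spaces.
    letters_only = re.sub(r"[^A-Za-z ]+", " ", no_marks)
    # Uppercase and collapse repeated whitespace.
    return " ".join(letters_only.upper().split())

print(standardize_name("  Márquez   Cid "))  # MARQUEZ CID
print(standardize_name("Martínez-López"))    # MARTINEZ LOPEZ
```

Applying the same function to every source before linkage means that spelling variants such as accented and unaccented forms compare as equal.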

Year: 2008  PMID: 18852919  DOI: 10.3414/me0529

Source DB: PubMed  Journal: Methods Inf Med  ISSN: 0026-1270  Impact factor: 2.176


  1 in total

1.  Is hospital discharge administrative data an appropriate source of information for cancer registries purposes? Some insights from four Spanish registries.

Authors:  Enrique E Bernal-Delgado; Carmen Martos; Natalia Martínez; María Dolores Chirlaque; Mirari Márquez; Carmen Navarro; Lauro Hernando; Joaquín Palomar; Isabel Izarzugaza; Nerea Larrañaga; Olatz Mokoroa; M Cres Tobalina; Joseba Bidaurrazaga; María José Sánchez; Carmen Martínez; Miguel Rodríguez; Esther Pérez; Yoe Ling Chang
Journal:  BMC Health Serv Res       Date:  2010-01-08       Impact factor: 2.655
