Literature DB >> 25388307

Name segmentation using hidden Markov models and its application in record linkage.

Rita de Cassia Braga Gonçalves, Sergio Miranda Freire.   

Abstract

This study aimed to evaluate the use of hidden Markov models (HMM) for the segmentation of person names and its influence on record linkage. A HMM was applied to the segmentation of patient's and mother's names in the databases of the Mortality Information System (SIM), Information Subsystem for High Complexity Procedures (APAC), and Hospital Information System (AIH). A sample of 200 patients from each database was segmented via HMM, and the results were compared to those from segmentation by the authors. The APAC-SIM and APAC-AIH databases were linked using three different segmentation strategies, one of which used HMM. Conformity of segmentation via HMM varied from 90.5% to 92.5%. The different segmentation strategies yielded similar results in the record linkage process. This study suggests that segmentation of Brazilian names via HMM is no more effective than traditional segmentation approaches in the linkage process.

Entities:  

Mesh:

Year:  2014        PMID: 25388307     DOI: 10.1590/0102-311x00191313

Source DB:  PubMed          Journal:  Cad Saude Publica        ISSN: 0102-311X            Impact factor:   1.632


  1 in total

1.  Evaluation of Prevalence of the Sarcopenia Level Using Machine Learning Techniques: Case Study in Tijuana Baja California, Mexico.

Authors:  Cristián Castillo-Olea; Begonya Garcia-Zapirain Soto; Clemente Zuñiga
Journal:  Int J Environ Res Public Health       Date:  2020-03-15       Impact factor: 3.390

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.