
Hash function performance on different biological databases.

E J Breen, K L Williams.

Abstract

Open hashing is used to demonstrate how effectively several hash functions distribute biological records uniformly. Three types of database were tested: (1) genetic nomenclature, mutation sites and strain names, (2) surnames extracted from literature files and (3) a set of 1000 numeric ASCII strings. Three hash functions (hashpjw, hashcrc and hashquad) showed considerable versatility on all data sets examined, while two others, hashsum and hashsmc, performed poorly on the same databases.
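
The kind of comparison described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes that hashpjw refers to the widely known P. J. Weinberger string hash, uses a simple additive hash (`hash_additive`, a hypothetical stand-in that is not necessarily the paper's hashsum), picks a table size of 211 arbitrarily, and uses the decimal strings "0" to "999" as an assumed example of 1000 numeric ASCII strings. With open hashing, each bucket holds a chain of records, so the chain lengths show how evenly a function spreads the keys.

```python
def hashpjw(key: str, table_size: int) -> int:
    """P. J. Weinberger's hash, emulating 32-bit unsigned arithmetic."""
    h = 0
    for ch in key:
        h = ((h << 4) + ord(ch)) & 0xFFFFFFFF
        g = h & 0xF0000000          # top four bits
        if g:
            h ^= g >> 24            # fold the top bits back into the low bits
            h ^= g                  # then clear them
    return h % table_size


def hash_additive(key: str, table_size: int) -> int:
    """Hypothetical sum-of-character-codes hash (assumed, for contrast)."""
    return sum(ord(ch) for ch in key) % table_size


def chain_lengths(keys, hash_fn, table_size):
    """Return the number of keys that land in each bucket (chain length)."""
    counts = [0] * table_size
    for key in keys:
        counts[hash_fn(key, table_size)] += 1
    return counts


if __name__ == "__main__":
    TABLE_SIZE = 211                              # assumed prime table size
    keys = [str(i) for i in range(1000)]          # assumed numeric test set
    for name, fn in (("hashpjw", hashpjw), ("additive", hash_additive)):
        counts = chain_lengths(keys, fn, TABLE_SIZE)
        used = sum(1 for c in counts if c)
        print(f"{name}: buckets used = {used}, longest chain = {max(counts)}")
```

On short numeric strings, an additive hash can only produce as many distinct values as there are distinct digit sums, so many keys collide in a few buckets, whereas a shift-based hash such as hashpjw also depends on digit position and spreads the same keys over more of the table.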


Year: 1989    PMID: 2924560    DOI: 10.1016/0169-2607(89)90164-8

Source DB: PubMed    Journal: Comput Methods Programs Biomed    ISSN: 0169-2607    Impact factor: 5.428


  1 in total

1.  Development of a database of health insurance claims: standardization of disease classifications and anonymous record linkage.

Authors:  Shinya Kimura; Toshihiko Sato; Shunya Ikeda; Mitsuhiko Noda; Takeo Nakayama
Journal:  J Epidemiol       Date:  2010-08-07       Impact factor: 3.211

