Literature DB >> 10298621

A Zipfian model of an automatic bibliographic system: an application to MEDLINE.

J Fedorowicz.   

Abstract

A Zipfian model of an automatic bibliographic system is developed using parameters describing the contents of it database and its inverted file. The underlying structure of the Zipf distribution is derived, with particular emphasis on its application to work frequencies, especially with regard to the inverted flies of an automatic bibliographic system. Andrew Booth developed a form of Zipf's law which estimates the number of words of a particular frequency for a given author and text. His formulation has been adopted as the basis of a model of term dispersion in an inverted file system. The model is also distinctive in its consideration of the proliferation of spelling errors in free text, and the inclusion of all searchable elements from the system's inverted file. This model is applied to the National Library of Medicine's MEDLINE. The model carries implications for the determination of database storage requirements, search response time, and search exhaustiveness.

Mesh:

Year:  1982        PMID: 10298621     DOI: 10.1002/asi.4630330406

Source DB:  PubMed          Journal:  J Am Soc Inf Sci        ISSN: 0002-8231


  1 in total

1.  RTX-KG2: a system for building a semantically standardized knowledge graph for translational biomedicine.

Authors:  E C Wood; Amy K Glen; Lindsey G Kvarfordt; Finn Womack; Liliana Acevedo; Timothy S Yoon; Chunyu Ma; Veronica Flores; Meghamala Sinha; Yodsawalai Chodpathumwan; Arash Termehchy; Jared C Roach; Luis Mendoza; Andrew S Hoffman; Eric W Deutsch; David Koslicki; Stephen A Ramsey
Journal:  BMC Bioinformatics       Date:  2022-09-29       Impact factor: 3.307

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.