| Literature DB >> 14728266 |
Abstract
Indexing of documents is an important strategy intended to make the literature more readily available to the user. Here we describe several dimensions of indexing that are important if indexing is to be optimal. These dimensions are coverage, predictability, and transparency. MeSH terms and text words are compared in MEDLINE in regard to these dimensions. Part of our analysis consists in applying AdaBoost with decisions trees as the weak learners to estimate how reliably index terms are being assigned and how complex the criteria are by which they are being assigned. Our conclusions are that MeSH terms are more predictable and more transparent than text words.Mesh:
Year: 2003 PMID: 14728266 PMCID: PMC1480214
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076