| Literature DB >> 8947688 |
Y Yang1.
Abstract
Whether or not high accuracy classification methods can be scaled to large applications is crucial for the ultimate usefulness of such methods in text categorization. This paper applies two statistical learning algorithms, the Linear Least Squares Fit (LLSF) mapping and a Nearest Neighbor classifier named ExpNet, to a large collection of MEDLINE documents. With the use of suitable dimensionality reduction techniques and efficient algorithms, both LLSF and ExpNet successfully scaled to this very large problem with a result significantly outperforming word-matching and other automatic learning methods applied to the same corpus.Entities:
Mesh:
Year: 1996 PMID: 8947688 PMCID: PMC2233015
Source DB: PubMed Journal: Proc AMIA Annu Fall Symp ISSN: 1091-8280