Literature DB >> 20351793

Learning accurate and concise naïve Bayes classifiers from attribute value taxonomies and data.

J Zhang1, D-K Kang, A Silvescu, V Honavar.   

Abstract

In many application domains, there is a need for learning algorithms that can effectively exploit attribute value taxonomies (AVT)-hierarchical groupings of attribute values-to learn compact, comprehensible and accurate classifiers from data-including data that are partially specified. This paper describes AVT-NBL, a natural generalization of the naïve Bayes learner (NBL), for learning classifiers from AVT and data. Our experimental results show that AVT-NBL is able to generate classifiers that are substantially more compact and more accurate than those produced by NBL on a broad range of data sets with different percentages of partially specified values. We also show that AVT-NBL is more efficient in its use of training data: AVT-NBL produces classifiers that outperform those produced by NBL using substantially fewer training examples.

Entities:  

Year:  2006        PMID: 20351793      PMCID: PMC2846370          DOI: 10.1007/s10115-005-0211-z

Source DB:  PubMed          Journal:  Knowl Inf Syst        ISSN: 0219-3116            Impact factor:   2.822


  2 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  A Framework for Learning from Distributed Data Using Sufficient Statistics and its Application to Learning Decision Trees.

Authors:  Doina Caragea; Adrian Silvescu; Vasant Honavar
Journal:  Int J Hybrid Intell Syst       Date:  2004-04-01
  2 in total
  3 in total

1.  Abstraction Augmented Markov Models.

Authors:  Cornelia Caragea; Adrian Silvescu; Doina Caragea; Vasant Honavar
Journal:  Proc IEEE Int Conf Data Min       Date:  2010-12-13

2.  Semi-supervised prediction of protein subcellular localization using abstraction augmented Markov models.

Authors:  Cornelia Caragea; Doina Caragea; Adrian Silvescu; Vasant Honavar
Journal:  BMC Bioinformatics       Date:  2010-10-26       Impact factor: 3.169

3.  The Relative Power of Structural Genomic Variation versus SNPs in Explaining the Quantitative Trait Growth in the Marine Teleost Chrysophrys auratus.

Authors:  Mike Ruigrok; Bing Xue; Andrew Catanach; Mengjie Zhang; Linley Jesson; Marcus Davy; Maren Wellenreuther
Journal:  Genes (Basel)       Date:  2022-06-23       Impact factor: 4.141

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.