| Literature DB >> 20351798 |
Doina Caragea1, Adrian Silvescu, Vasant Honavar.
Abstract
This paper motivates and precisely formulates the problem of learning from distributed data; describes a general strategy for transforming traditional machine learning algorithms into algorithms for learning from distributed data; demonstrates the application of this strategy to devise algorithms for decision tree induction from distributed data; and identifies the conditions under which the algorithms in the distributed setting are superior to their centralized counterparts in terms of time and communication complexity; The resulting algorithms are provably exact in that the decision tree constructed from distributed data is identical to that obtained in the centralized setting. Some natural extensions leading to algorithms for learning from heterogeneous distributed data and learning under privacy constraints are outlined.Entities:
Year: 2004 PMID: 20351798 PMCID: PMC2846376 DOI: 10.3233/his-2004-11-210
Source DB: PubMed Journal: Int J Hybrid Intell Syst ISSN: 1448-5869