Home

RainForest - a Framework for Fast Decision Tree Construction of Large Datasets


Author(s) : Venkatesh Ganti Raghu Ramakrishnan Johannes Gehrke, 
Publisher : N/A
Publication Date : 1998
ISSN : N/A
Abstract : Classification of large datasets is an important data mining problem. Many classification algorithms have been proposed in the literature, but studies have shown that so far no algorithm uniformly outperforms all other algorithms in terms of quality. In this paper, we present a unifying framework for decision tree classifiers that separates the scalability aspects of algorithms for constructing a decision tree from the central features that determine the quality of the tree. This generic algorithm is easy to instantiate with specific algorithms from the literature (including C4.5, CART,,