Home

Data Mining and Knowledge Discovery: A Review of Issues and a Multistrategy Approach


Author(s) : Ryszard S. Michalski John Wiley M. Kubat I. Bratko Edited R. S. Michalski Kenneth A. Kaufman, 
Publisher : N/A
Publication Date : 1998
ISSN : N/A
Abstract : An enormous proliferation of databases in almost every area of human endeavor has created a great demand for new, powerful tools for turning data into useful, task-oriented knowledge. In efforts to satisfy this need, researchers have been exploring ideas and methods developed in machine learning, pattern recognition, statistical data analysis, data visualization, neural nets, etc. These efforts have led to the emergence of a new research area, frequently called data mining and knowledge discovery. The first part of this chapter is a compendium of ideas on the applicability of symbolic machine learning methods to this area. The second part describes a multistrategy methodology for conceptual data exploration, by which we mean the derivation of high-level concepts and descriptions from data through symbolic reasoning involving both data and background knowledge. The methodology, which has been implemented in the INLEN system, combines machine learning, database and knowledge-based technologies. To illustrate the system's capabilities, we present results from its application to a problem of discovery of economic and demographic patterns in a database containing facts and statistics about the countries of the world. The presented results demonstrate a high potential utility of the methodology for assisting in solving practical data mining and knowledge discovery tasks. 2.1,