|
Abstract : |
Query routing (also known as server ranking and collection fusion in some specific context) refers to the general problem of selecting from a large set of accessible information sources the ones relevant to a given query, and evaluating the query on the selected sources. It consists of three steps, namely database selection, query evaluation, and result merging. As the number of information sources on the Internet increases dramatically, query routing is becoming increasingly important. Much of the previous work in query routing focused on information sources that are document collections. In this paper, we address the query routing problem for databases with multiple text attributes. In particular, we have proposed a number of different database selection methods each requiring different types of knowledge about the databases ' content, e.g. past queries, past query results, and statistical information collected from the database records. By conducting a series of experiments on a set of bibliographic databases, we evaluate and compare the performance of these proposed methods., |