Pharos: A scalable distributed architecture for locating heterogeneous information sourcesScalable collection summarization and selection