Home

Video similarity detection with video signature clustering


Author(s) : Avideh Zakhor Sen-ching S. Cheung, 
Publisher : N/A
Publication Date : 2001
ISSN : N/A
Abstract : The proliferation of video content on the web makes similarity detection an indispensable tool in web data management, searching, and navigation. We have previously proposed a compact representation of video clips, called video signature, for retrieving similar video clips in large databases. In this paper, we propose a new signature clustering algorithm to further improve retrieval performance. threshold graph, where the threshold is determined based on local data statistics. Similar clusters are identified as highly connected regions in the graph. This algorithm outperforms simple thresholding and hierarchical clustering techniques in identifying a set of manually-determined similar clusters from a dataset of 46,356 web video clips. At 95 % precision, our algorithm attains 85 % recall while simple thresholding and complete-link hierarchical scheme attain 67 % and 75 % recall respectively. Applying our algorithm to the entire dataset, 6,900 similar clusters are identified, with an average cluster size of 2.81 video clips. The distribution of cluster sizes follows a power-law distribution, which has been shown to describe many web phenomena. 1.,