Clustering Presentation of Web Image Retrieval Results using Textual Information and Image Features

Y. Uematsu, R. Kataoka, and H. Takeno (Japan)


Web Image Retrieval, Web Page Classification, Document Clustering, Search Result Organization


The increasing prevalence of broadband Internet access is making it easier to obtain rich contents like images, and more people are attempting image retrieval. We focus on how to present web image retrieval re sults to users. Most retrieval results contain multiple top ics. To offset this complexity, many papers have discussed text retrieval result clustering [11][14]. In result clustering, we cluster the documents according to their topics by using the distance of text similarity. To group web image retrieval results, we have to consider the differences between image retrieval and text retrieval. First, most images on the Web do not have any textual information, so we have to automat ically extract textual information if we are to index web im ages by semantic information. Second, text retrieval shows users text snippets as results which may not contain the in formation that user wants; however, thumbnail images are direct reduced-size versions of the originals, so the user can clearly figure out if the original image is desired or not. So, we think that how to present retrieval results is an important task in web image retrieval. In this paper, we describe how to semantically clas sify image retrieval results for making web image retrieval more effective. Text classification based on machine learn ing is used to generate basic semantic information, and im age features and textual features are used for cluster presen tation. We propose methods for presenting the results of image retrieval through the application of clustering. Ex periments show that our procedure is effective.

Important Links:

Go Back