Approximate Image Matching using Strings of Bag-of-Visual Words RepresentationReport as inadecuate

Approximate Image Matching using Strings of Bag-of-Visual Words Representation - Download this document for free, or read online. Document in PDF available to download.

1 LHC - Laboratoire Hubert Curien Saint Etienne

Abstract : The Spatial Pyramid Matching approach has become very popular to model images as sets of local bag-of words. The image comparison is then done region-by-region with an intersection kernel. Despite its success, this model presents some limitations: the grid partitioning is predefined and identical for all images and the matching is sensitive to intra- and inter-class variations. In this paper, we propose a novel approach based on approximate string matching to overcome these limitations and improve the results. First, we introduce a new image representation as strings of ordered bag-of-words. Second, we present a new edit distance specifically adapted to strings of histograms in the context of image comparison. This distance identifies local alignments between subregions and allows to remove sequences of similar subregions to better match two images. Experiments on 15 Scenes and Caltech 101 show that the proposed approach outperforms the classical spatial pyramid representation and most existing concurrent methods for classification presented in recent years.

Keywords : Image Classification Edit Distance String of Histograms Bag-of-Visual Words Image Classification.

Author: Hong-Thinh Nguyen - Cécile Barat - Christophe Ducottet -



Related documents