Encoding Spatial Context for Large-Scale Partial-Duplicate Web Image Retrieval

Bibliographic Details
Published in: Journal of Computer Science and Technology, Vol. 29, No. 5, pp. 837-848
Main Authors: Wen-Gang Zhou, Hou-Qiang Li, Yijuan Lu, Qi Tian
Format: Journal Article
Language: English
Published: Boston: Springer US (Springer Nature B.V.), 01.09.2014
ISSN: 1000-9000, 1860-4749
DOI: 10.1007/s11390-014-1472-3

More Information
Summary: Many recent state-of-the-art image retrieval approaches are based on the Bag-of-Visual-Words model and represent an image with a set of visual words by quantizing local SIFT (scale-invariant feature transform) features. Feature quantization reduces the discriminative power of local features and unavoidably causes many false local matches between images, which degrades retrieval accuracy. To filter out those false matches, geometric context among visual words has been widely explored for the verification of geometric consistency. However, existing approaches based on global or local geometric verification are either computationally expensive or achieve only limited accuracy. To address this issue, in this paper we focus on partial-duplicate Web image retrieval and propose a scheme that encodes the spatial context for visual matching verification. An efficient affine enhancement scheme is further proposed to refine the verification results. Experiments on partial-duplicate Web image search, using a database of one million images, demonstrate the effectiveness and efficiency of the proposed approach. Evaluation on a 10-million-image database further reveals the scalability of our approach.
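The pipeline described in the summary (quantize SIFT features to visual words, collect tentative matches, verify them with spatial context, refine with an affine fit) can be sketched compactly. The Python code below is a minimal illustration under stated assumptions, not the paper's exact encoding: it pairs features by quantized word ID, iteratively discards matches that disagree on relative left/right and above/below ordering between the two images (in the spirit of spatial coding), and fits a least-squares affine transform to the survivors. All function names are illustrative, and the sketch ignores rotation and scale changes, which the full method accounts for.

import numpy as np

def tentative_matches(words_q, words_d):
    # Pair features quantized to the same visual word (greedy one-to-one).
    # words_q, words_d: integer arrays of visual word IDs, one per keypoint.
    index = {}
    for j, w in enumerate(words_d):
        index.setdefault(int(w), []).append(j)
    matches, used = [], set()
    for i, w in enumerate(words_q):
        for j in index.get(int(w), []):
            if j not in used:
                matches.append((i, j))
                used.add(j)
                break
    return matches

def relative_position_maps(pts):
    # Binary spatial-context maps: X[i, j] is True iff point i lies to the
    # left of point j; Y[i, j] is True iff point i lies below point j.
    x, y = pts[:, 0], pts[:, 1]
    return x[:, None] < x[None, :], y[:, None] < y[None, :]

def spatial_verify(pts_q, pts_d):
    # pts_q[k] and pts_d[k] are the (x, y) coordinates of the k-th tentative
    # match in the query and database image (float arrays of shape (n, 2)).
    # Iteratively drop the match whose relative-position code disagrees most
    # across the two images until the remaining codes are consistent.
    keep = np.arange(len(pts_q))
    while len(keep) > 2:
        xq, yq = relative_position_maps(pts_q[keep])
        xd, yd = relative_position_maps(pts_d[keep])
        inconsistency = (xq ^ xd).sum(axis=1) + (yq ^ yd).sum(axis=1)
        worst = int(inconsistency.argmax())
        if inconsistency[worst] == 0:
            break  # remaining matches are spatially consistent
        keep = np.delete(keep, worst)
    return keep  # indices of verified matches

def affine_refine(pts_q, pts_d):
    # Least-squares 2x3 affine transform A with pts_d ~ [pts_q | 1] @ A.T,
    # fitted on verified matches only.
    A = np.hstack([pts_q, np.ones((len(pts_q), 1))])
    sol, *_ = np.linalg.lstsq(A, pts_d, rcond=None)
    return sol.T

The number of verified matches returned by spatial_verify can serve as the similarity score for ranking candidate images; the fitted affine transform, loosely mirroring the summary's affine enhancement step, can then be used to re-check discarded matches and recover true correspondences the ordering test rejected.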
Bibliography: 11-2296/TP
Wen-Gang Zhou, Hou-Qiang Li, Yijuan Lu, Qi Tian (1. Chinese Academy of Sciences Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, University of Science and Technology of China, Hefei 230027, China; 2. Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei 230027, China; 3. Department of Computer Science, Texas State University, San Marcos, TX 78666, U.S.A.; 4. Department of Computer Science, University of Texas at San Antonio, San Antonio, TX 78249, U.S.A.)
Keywords: large-scale image retrieval; spatial context coding; spatial verification; affine estimation