Extremely Low Bit-Rate Nearest Neighbor Search Using a Set Compression Tree

The goal of this work is a data structure to support approximate nearest neighbor search on very large scale sets of vector descriptors. The criteria we wish to optimize are: (i) that the memory footprint of the representation should be very small (so that it fits into main memory); and (ii) that th...


Bibliographic details
Published in:	IEEE transactions on pattern analysis and machine intelligence. - 1979. - 36(2014), 12, 14 Dec., pages 2396-406
First author:	Arandjelović, Relja (author)
Other authors:	Zisserman, Andrew
Format:	Online article
Language:	English
Published:	2014
Access to parent work:	IEEE transactions on pattern analysis and machine intelligence
Subjects:	Journal Article; Research Support, Non-U.S. Gov't


LEADER	01000naa a22002652 4500
001	NLM252590252
003	DE-627
005	20231224164431.0
007	cr uuu---uuuuu
008	231224s2014 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1109/TPAMI.2014.2339821 \|2 doi
028	5	2	\|a pubmed24n0842.xml
035			\|a (DE-627)NLM252590252
035			\|a (NLM)26353147
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Arandjelović, Relja \|e verfasserin \|4 aut
245	1	0	\|a Extremely Low Bit-Rate Nearest Neighbor Search Using a Set Compression Tree
264		1	\|c 2014
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 25.11.2015
500			\|a Date Revised 10.09.2015
500			\|a published: Print
500			\|a Citation Status PubMed-not-MEDLINE
520			\|a The goal of this work is a data structure to support approximate nearest neighbor search on very large scale sets of vector descriptors. The criteria we wish to optimize are: (i) that the memory footprint of the representation should be very small (so that it fits into main memory); and (ii) that the approximation of the original vectors should be accurate. We introduce a novel encoding method, named a Set Compression Tree (SCT), that satisfies these criteria. It is able to accurately compress 1 million descriptors using only a few bits per descriptor. The large compression rate is achieved by not compressing on a per-descriptor basis, but instead by compressing the set of descriptors jointly. We describe the encoding, decoding and use for nearest neighbor search, all of which are quite straightforward to implement. The method, tested on standard benchmarks (SIFT1M and 80 Million Tiny Images), achieves superior performance to a number of state-of-the-art approaches, including Product Quantization, Locality Sensitive Hashing, Spectral Hashing, and Iterative Quantization. For example, SCT has a lower error using 5 bits than any of the other approaches, even when they use 16 or more bits per descriptor. We also include a comparison of all the above methods on the standard benchmarks.
650		4	\|a Journal Article
650		4	\|a Research Support, Non-U.S. Gov't
700	1		\|a Zisserman, Andrew \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t IEEE transactions on pattern analysis and machine intelligence \|d 1979 \|g 36(2014), 12 vom: 14. Dez., Seite 2396-406 \|w (DE-627)NLM098212257 \|x 1939-3539 \|7 nnns
773	1	8	\|g volume:36 \|g year:2014 \|g number:12 \|g day:14 \|g month:12 \|g pages:2396-406
856	4	0	\|u http://dx.doi.org/10.1109/TPAMI.2014.2339821 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_NLM
912			\|a GBV_ILN_350
951			\|a AR
952			\|d 36 \|j 2014 \|e 12 \|b 14 \|c 12 \|h 2396-406