Aggregating Local Deep Features for Image Retrieval

A. Babenko; Lempitsky V.

АБВ
АБВ
АБВ

Обычная версия сайта

Priority areas

by year

Subject

News

March 4, 2026

Next-Generation Cardiology: AI, Genetics, and Personalised Medicine

More than 400 specialists from Russia and other countries participated in the 'Genetics and the Heart' Congress hosted by HSE University. Experts discussed the latest advances in clinical and molecular cardiology, new approaches to managing rare diseases, challenges in genome editing, and the role of artificial intelligence in interpreting medical and genetic data. A central theme of the congress was the practical integration of genetic knowledge into routine clinical practice.

March 3, 2026

HSE University Scholars Uncover E-Learning Preferences of Top Students

HSE University experts have analysed students’ digital footprints and shown for the first time that final grades depend on one’s personal approach to an online course. Balanced students have proven to be more successful than those who follow a more traditional and practical approach. The findings from this study will help create a more adaptive and personalised educational system. This research has been published in the journal The Internet and Higher Education.

March 2, 2026

Third ‘International Academic Cooperation of HSE University Open Competition Launched

Research (scientific) structural units of HSE University planning to conduct joint research with foreign universities and research centres are invited to take part in the competition.

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications

?

Aggregating Local Deep Features for Image Retrieval

P. 1269–1277.

Babenko A., Lempitsky V.

Several recent works have shown that image descriptors produced by deep convolutional neural networks provide state-of-the-art performance for image classification and retrieval problems. It has also been shown that the activations from the convolutional layers can be interpreted as local features describing particular image regions. These local features can be aggregated using aggregation approaches developed for local features (e.g. Fisher vectors), thus providing new powerful global descriptors. In this paper we investigate possible ways to aggregate local deep features to produce compact global descriptors for image retrieval. First, we show that deep features and traditional hand-engineered features have quite different distributions of pairwise similarities, hence existing aggregation methods have to be carefully re-evaluated. Such re-evaluation reveals that in contrast to shallow features, the simple aggregation method based on sum pooling provides arguably the best performance for deep convolutional features. This method is efficient, has few parameters, and bears little risk of overfitting when e.g. learning the PCA matrix. Overall, the new compact global descriptor improves the state-of-the-art on four common benchmarks considerably.

Language: English

Full text

Keywords: Aggregation methods

In book

Proceedings of the IEEE International Conference on Computer Vision (ICCV 2015)

Santiago de Chile: IEEE, 2015.