Spatially Adaptive Computation Time for Residual Networks

M. Figurnov; Collins M.; Zhu Y.; Zhang L.; Huang J.; D. Vetrov; Salakhutdinov R.

doi:10.1109/CVPR.2017.194

Publications

?

Spatially Adaptive Computation Time for Residual Networks

P. 1790–1799.

Figurnov M., Collins M., Zhu Y., Zhang L., Huang J., Vetrov D., Salakhutdinov R.

This paper proposes a deep learning architecture based on Residual Network that dynamically adjusts the number of executed layers for the regions of the image. This architecture is end-to-end trainable, deterministic and problem-agnostic. It is therefore applicable without any modifications to a wide range of computer vision problems such as image classification, object detection and image segmentation. We present experimental results showing that this model improves the computational efficiency of Residual Networks on the challenging ImageNet classification and COCO object detection datasets. Additionally, we evaluate the computation time maps on the visual saliency dataset cat2000 and find that they correlate surprisingly well with human eye fixation positions.

Keywords: computer vision deep learning

Publication based on the results of:

Разработка комбинированных нейробайесовских методов машинного обучения (2017)

In book

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017)

Curran Associates, Inc., 2017.

Analysis of Images, Social Networks and Texts. 10th International Conference, AIST 2021, Tbilisi, Georgia, December 16–18, 2021, Revised Selected Papers

Cham: Springer, 2022.

This book constitutes revised selected papers from the 9th International Conference on Analysis of Images, Social Networks and Texts, AIST 2020, held during December 16-18, 2021. The world of Data Science changes every year. At AIST, we exchange our understanding of the Science state-of-the-art, as well as how it applies to life and business. AIST ...

Added: January 4, 2022

Proceedings of International Joint Conference on Neural Networks 2020 (IJCNN 2020)

Piscataway: IEEE, 2020.

2020 International Joint Conference on Neural Networks (IJCNN) held virtually, as part of the IEEE World Congress on Computational Intelligence (IEEE WCCI) 2020. IJCNN 2020 is jointly organized by the IEEE Computational Intelligence Society (CIS) and the International Neural Network Society (INNS). For IJCNN 2020 (and when WCCI is organized in even-numbered years) IEEE CIS ...

Added: October 15, 2020

10th International Conference, PReMI 2023, Kolkata, India, December 12–15, 2023, Proceedings. Pattern Recognition and Machine Intelligence. LNCS, volume 14301

Cham: Springer, 2023.

Added: November 29, 2023

Spatially Adaptive Computation Time for Residual Networks

Figurnov M., Collins M., Zhu Y. et al., / Series arXiv "arXiv:1612.02297". 2016.

Added: December 12, 2016

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

IEEE, 2018.

In this paper, we present a LinkNet-based architecture with SE-ResNeXt-50 encoder and a novel training strategy that strongly relies on image preprocessing and incorporating distorted network outputs. The architecture combines a pre-trained convolutional encoder and a symmetric expanding path that enables precise localization. We show that such a network can be trained on plain RGB ...

Added: February 20, 2021

Building detection from satellite imagery using a composite loss function

Golovanov S., Rauf Kurbanov, Artamonov A. et al., , in: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).: IEEE, 2018. P. 219–222.

Added: February 20, 2021

Использование сверточных нейронных сетей для реидентификации людей в городских условиях

Сучков Е. П., Алексеенко Г. О., Налчаджи К. В., Интеллектуальные системы. Теория и приложения 2022 Т. 26 № 1 С. 250–254

Currently, video surveillance systems are becoming more widespread. One of the main goals of such systems is to control and track a person’s movement. The solution of this problem allows us to solve such applied problems as tracking the occupancy of various premises (whether shopping facilities or educational and cultural institutions), creating a motion heatmap or organizing control of access to ...

Added: January 31, 2023

Weight Averaging Improves Knowledge Distillation under Domain Shift

Berezovskiy V., Morozov N., , in: The 2nd Workshop and Challenges for Out-of-Distribution Generalization in Computer Vision. ICCV 2023.: [б.и.], 2023.

Knowledge distillation (KD) is a powerful model compression technique broadly used in practical deep learning applications. It is focused on training a small student network to mimic a larger teacher network. While it is widely known that KD can offer an improvement to student generalization in i.i.d setting, its performance under domain shift, i.e. the ...

Added: November 20, 2023

Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. European Conference, ECML PKDD 2024, Vilnius, Lithuania, September 9–13, 2024, Proceedings, Part X. LNCS, volume 14950

Cham: Springer, 2024.

This multi-volume set, LNAI 14941 to LNAI 14950, constitutes the refereed proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2024, held in Vilnius, Lithuania, in September 2024. ...

Added: November 22, 2024

Foundations of Intelligent Systems. 25th International Symposium on Methodologies for Intelligent Systems: ISMIS 2020

Springer, 2020.

This book constitutes the proceedings of the 25th International Symposium on Foundations of Intelligent Systems, ISMIS 2020, held in Graz, Austria, in October 2020. The conference was held virtually due to the COVID-19 pandemic. The 35 full and 8 short papers presented in this volume were carefully reviewed and selected from 79 submissions. Included is also ...

Added: October 4, 2020

The 2nd Workshop and Challenges for Out-of-Distribution Generalization in Computer Vision. ICCV 2023

[б.и.], 2023.

Deep learning models are usually developed and tested under the implicit assumption that the training and test data are drawn independently and identically distributed (IID) from the same distribution. Overlooking out-of-distribution (OOD) images can result in poor performance in unseen or adverse viewing conditions, which is common in real-world scenarios. In this workshop, we are ...

Added: November 20, 2023

Intelligent Data Processing 11th International Conference, IDP 2016, Barcelona, Spain, October 10–14, 2016, Revised Selected Papers

Switzerland: Springer, 2019.

This book constitutes the refereed proceedings of the 11th International Conference on Intelligent Data Processing, IDP 2016, held in Barcelona, Spain, in October 2016. The 11 revised full papers were carefully reviewed and selected from 52 submissions. The papers of this volume are organized in topical sections on machine learning theory with applications; intelligent data processing in life ...

Added: February 8, 2020

Data-Driven Short-Term Daily Operational Sea Ice Regional Forecasting

Grigoryev T., Verezemskaya P., Krinitskiy M. et al., Remote Sensing 2022 Vol. 14 No. 22 Article 5837

Global warming has made the Arctic increasingly available for marine operations and created a demand for reliable operational sea ice forecasts to increase safety. Because ocean-ice numerical models are highly computationally intensive, relatively lightweight ML-based methods may be more efficient for sea ice forecasting. Many studies have exploited different deep learning models alongside classical approaches ...

Added: June 19, 2023

Proceedings of the 23rd International ACM Conference on 3D Web Technology

NY: Association for Computing Machinery (ACM), 2018.

Welcome to the 23rd International ACM Conference on 3D Web Technology - Web3D 2018, organized in cooperation with the Web3D Consortium at the Poznań University of Economics and Business in Poznań, Poland on June 20-22, 2018. This year's theme "3D Everywhere" emphasizes the global scope and impact of current and future 3D technology. Web3D fosters and ...

Added: September 3, 2018

Traffic4cast at NeurIPS 2021 - Temporal and Spatial Few-Shot Transfer Learning in Gridded Geo-Spatial Processes

Eichenberger C., Neun M., Martin H. et al., , in: Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track.: PMLR, 2022. P. 97–112.

Added: October 11, 2022

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2010)

San Francisco: IEEE, 2010.

Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on ...

Added: October 18, 2017

Z-flipon variants reveal the many roles of Z-DNA and Z-RNA in health and disease

Umerenkov D., Herbert A., Konovalov Dmitrii et al., Life Science Alliance 2023 Vol. 6 No. 7 Article e202301962

Identifying roles for Z-DNA remains challenging given their dynamic nature. Here, we perform genome-wide interrogation with the DNABERT transformer algorithm trained on experimentally identified Z-DNA forming sequences (Z-flipons). The algorithm yields large performance enhancements (F1 = 0.83) over existing approaches and implements computational mutagenesis to assess the effects of base substitution on Z-DNA formation. We ...

Added: June 9, 2023

Shape Perception

Sawada T., Li Y., Pizlo Z., , in: The Oxford Handbook of Computational and Mathematical Psychology.: Oxford University Press, 2015. P. 255–276.

This chapter provides a review of topics and concepts that are necessary to study and understand 3D shape perception. This includes group theory and their invariants; model-based invariants; Euclidean, affine, and projective geometry; symmetry; inverse problems; simplicity principle; Fechnerian psychophysics; regularization theory; Bayesian inference; shape constancy and shape veridicality; shape recovery; perspective and orthographic projections; ...

Added: March 10, 2015

Method of Critical Set construction for Successive Cancellation List Decoder of Polar Codes Based on Deep Learning of Neural Networks

Kotov F., Ivanov F., Timokhin I., , in: 2023 XVIII International Symposium Problems of Redundancy in Information and Control Systems (REDUNDANCY).: IEEE, 2023. P. 64–69.

The Successive Cancellation List (SCL) algorithm is a widely used decoding technique in communication systems. However, constructing the critical set for SCL decoding is a challenging task, as it requires a large number of computations and can lead to significant decoding delays. In this paper, a new approach to critical set construction for SCL decoding ...

Added: December 9, 2023

The Russian Drug Reaction Corpus and Neural Models for Drug Reactions and Effectiveness Detection in User Reviews

Tutubalina E., Алимова И. С., Мифтахутдинов З. et al., Bioinformatics 2021 Vol. 37 No. 2 P. 243–249

Drugs and diseases play a central role in many areas of biomedical research and healthcare. Aggregating knowledge about these entities across a broader range of domains and languages is critical for information extraction (IE) applications. To facilitate text mining methods for analysis and comparison of patient’s health conditions and adverse drug reactions reported on the ...

Added: January 13, 2021

Моделирование урожайности зерновых культур сельскохозяйственных регионов с использованием технологий компьютерного зрения

Arkhipova M., Экономика региона 2022 Т. 18 № 2 С. 581–594

The article examines new methodologies for modelling crop yield in agricultural regions of Russia based on the use of remote capabilities to get information on the field state. The proposed approach can be applied to develop indicator systems and create methodological platforms and models necessary to obtain more accurate estimates. In comparison with the traditional ...

Added: January 12, 2023

Bayesian Sparsification of Recurrent Neural Networks

Lobacheva E., Chirkova N., Vetrov D., / Series 1 "Workshop on Learning to Generate Natural Language". 2017.

Recurrent neural networks show state-of-the-art results in many text analysis tasks but often require a lot of memory to store their weights. Recently proposed Sparse Variational Dropout (Molchanov et al., 2017) eliminates the majority of the weights in a feed-forward neural network without significant loss of quality. We apply this technique to sparsify recurrent neural ...

Added: October 19, 2017

Face anti-spoofing with joint spoofing medium detection and eye blinking analysis

Nikitin M. Y., Konushin V. S., Konushin A., Computer Optics 2019 Vol. 43 No. 4 P. 618–626

Modern biometric systems based on face recognition demonstrate high recognition quality, but they are vulnerable to face presentation attacks, such as photo or replay attack. Existing face anti-spoofing methods are mostly based on texture analysis and due to lack of training data either use hand-crafted features or ﬁne-tuned pretrained deep models. In this paper we ...

Added: October 31, 2019

Gestalt-like constraints produce veridical (Euclidean) percepts of 3D indoor scenes

Kwon T., Li Y., Sawada T. et al., Vision Research 2016 Vol. 126 P. 264–277

This study, which was influenced a lot by Gestalt ideas, extends our prior work on the role of a priori constraints in the veridical perception of 3D shapes to the perception of 3D scenes. Our experiments tested how human sub-jects perceive the layout of a naturally-illuminated indoor scene that contains common symmetrical 3D objects standing ...

Added: September 15, 2015