Deep Part-Based Generative Shape Model with Latent Variables

Kirillov A.; Gavrikov M.; E. Lobacheva; A. Osokin; D. Vetrov

doi:10.5244/C.30.88

Publications

?

Deep Part-Based Generative Shape Model with Latent Variables

P. 1-12.

Kirillov A., Gavrikov M., Lobacheva E., Osokin A., Vetrov D.

The Shape Boltzmann Machine (SBM) and its multilabel version MSBM have been recently introduced as deep generative models that capture the variations of an object shape. While being more flexible MSBM requires datasets with labeled parts of the objects for training. In the paper we present an algorithm for training MSBM using binary masks of objects and the seeds which approximately correspond to the locations of objects parts. The latter can be obtained from part-based detectors in an unsupervised manner. We derive a latent variable model and an EM-like training procedure for adjusting the weights of MSBM using a deep learning framework. We show that the model trained by our method outperforms SBM in the tasks related to binary shapes and is very close to the original MSBM in terms of quality of multilabel shapes.

Keywords: компьютерное зрение EM Algorithm computer vision сегментация изображений probabilistic graphical models вероятностные графические модели image segmentation EM алгоритм Shape Boltzmann machine модель формы Больцмана

In book

Proceedings of the 27th British Machine Vision Conference

-, 2016

Многоклассовая модель формы со скрытыми переменными

Кириллов А. Н., Гавриков М. И., Lobacheva E. et al., Интеллектуальные системы. Теория и приложения 2015 Т. 19 № 2 С. 75-95

In this paper we consider the Shape Boltzmann Machine(SBM) and its multi-label version MSBM. We present an algorithm for training MSBM using only binary masks of objects and the seeds which approximately correspond to the locations of objects parts. ...

Added: September 30, 2015

Self-supervised recurrent depth estimation with attention mechanisms

Makarov I., Bakhanova M., Nikolenko S. et al., PeerJ Computer Science 2022 Vol. 8 Article e865

Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced ...

Added: February 1, 2022

Limits of Kalman Filter application in heavy tailed problems

Konakov V., Mozgunov P., / Cornell University. Серия math "arxiv.org". 2015. № 1505.07981.

In this paper we consider the behavior of Kalman Filter state estimates in the case of distribution with heavy tails .The simulated linear state space models with Gaussian measurement noises were used. Gaussian noises in state equation are replaced by components with alpha-stable distribution with di erent parameters alpha and beta. We consider the case ...

Added: June 1, 2015

A New Click Model for Relevance Prediction in Web Search

Fishkov A., Nikolenko S. I., , in : EEML 2012 – Experimental Economics in Machine Learning. : Leuven : Katholieke Universiteit Leuven, 2012. P. 45-54.

We present a new click model for processing click logs and predicting relevance and appeal for query–document pairs in search results. Our model is a simplified version of the task-centric click model but outperforms it in an experimental comparison. ...

Added: February 13, 2013

Multispectral Remote Information in Forest Research

Пузаченко Ю. Г., Sandlerskiy R., Krenke A. et al., Russian Journal of Forest Science 2014 Vol. 7 No. 7 P. 838-854

The article proposes approaches to the use of multispectral remote information in basic research on the spatiotemporal organization of biogeocenotic cover with and without the use of ground field measurements. It is postulated that remote measurements reflect the biophysical condition of biogeocenotic cover defined by the absorption and conversion of solar energy and can be ...

Added: September 3, 2023

Proceedings of the 2015 IEEE International Conference on Computer Vision

Los Alamitos, Washington, Tokyo : IEEE Computer Society, 2015

Proceedings of the 2015 IEEE International Conference on Computer Vision ...

Added: October 1, 2015

Ортопедия, травматология и восстановительная хирургия детского возраста

Баиндурашвили, А. Г., [б.и.], 2020

Background. A large number of studies have focused on automating the process of measuring the Cobb angle. Although there is no practical tool to assist doctors with estimating the severity of the curvature of the spine and determine the best suitable treatment type.Aim. We aimed to examine the algorithms used for distinguishing vertebral column, vertebrae, ...

Added: November 18, 2022

Proceedings of IEEE International Russian Automation Conference (RusAutoCon 2020)

IEEE, 2020

Added: October 3, 2020

Proceedings of International Joint Conference on Neural Networks 2020 (IJCNN 2020)

Piscataway : IEEE, 2020

2020 International Joint Conference on Neural Networks (IJCNN) held virtually, as part of the IEEE World Congress on Computational Intelligence (IEEE WCCI) 2020. IJCNN 2020 is jointly organized by the IEEE Computational Intelligence Society (CIS) and the International Neural Network Society (INNS). For IJCNN 2020 (and when WCCI is organized in even-numbered years) IEEE CIS ...

Added: October 15, 2020

Instance Segmentation of Characters Recognized in Palmyrene Aramaic Inscriptions

Hamplová A., Lyavdansky A., Novák T. et al., CMES - Computer Modeling in Engineering and Sciences 2024 Vol. 140 No. 3 P. 2869-2889

This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions, employing two state-of-the-art deep learning algorithms, namely YOLOv8 and Roboflow 3.0. The goal is to contribute to the preservation and understanding of historical texts, showcasing the potential of modern deep learning methods in archaeological research. Our research culminates in several ...

Added: July 17, 2024

Intelligent Data Processing 11th International Conference, IDP 2016, Barcelona, Spain, October 10–14, 2016, Revised Selected Papers

Switzerland : Springer, 2019

This book constitutes the refereed proceedings of the 11th International Conference on Intelligent Data Processing, IDP 2016, held in Barcelona, Spain, in October 2016. The 11 revised full papers were carefully reviewed and selected from 52 submissions. The papers of this volume are organized in topical sections on machine learning theory with applications; intelligent data processing in life ...

Added: February 8, 2020

Сочетание контрастного обучения и обучения с учителем для обнаружения видео со сверхвысоким разрешением

Meshchaninov V., Ватолин Д. С., Молодецких И. А. et al., Препринты ИПМ им. М.В. Келдыша 2022 № 80 С. 14-27

Upscaled video detection is a helpful tool in multimedia forensics, but it’s a challenging task that involves various upscaling and compression algorithms. There are many resolution-enhancement methods, including interpolation and deep-learning-based super-resolution, and they leave unique traces. In this work, we propose a new upscaled-resolutiondetection method based on learning of visual representations using contrastive and ...

Added: May 23, 2023

Распознавание объектов по составляющим их примитивам и отношениям между ними

Сливницин П. А., Mylnikov L., Информатика и автоматизация (Труды СПИИРАН) 2023 Т. 22 № 3 С. 511-540

The paper’s goal is to develop a methodology and algorithm for the recognition of objects in the environment, keeping the quality with an increasing number of objects. For this purpose, the following problems were solved: recognition of the shape features, estimation of relations between features, and matching between the found features and relations and the ...

Added: May 24, 2023

Computer Vision – ECCV 2020; 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XV

NY : Springer, 2020

The 30-volume set, comprising the LNCS books 12346 until 12375, constitutes the refereed proceedings of the 16th European Conference on Computer Vision, ECCV 2020, which was planned to be held in Glasgow, UK, during August 23-28, 2020. The conference was held virtually due to the COVID-19 pandemic. The 1360 revised papers presented in these proceedings were ...

Added: October 28, 2020

2019 International Russian Automation Conference (RusAutoCon)

IEEE, 2019

Added: October 21, 2019

Theoretical and Methodological Substantiation of Boundaries and Integrity in Landscape Cover and Its Components

A. N. Krenke, R. B. Sandlersky, A. S. Baybar et al., Известия РАН. Серия биологическая. 2023 Vol. 50 No. 1 P. S85-S99

Four main models of the appearance of boundaries (in a particular case, integrity), arising from the theory of nonlinear dynamic systems, are considered briefly. On the basis of Kotelnikov’s fundamental sampling theorem and, accordingly, general information theory, the character of a distinguished boundary as a function of the sampling frequency in a spatial series with a ...

Added: December 2, 2022

Об одном подходе к решению задачи ректификации стереоизображений по сцене без калибровки камер

Protasov S., Крыловецкий А. А., Кургалин С. Д., Известия ЮФУ. Технические науки 2012 № 6 С. 144-148

This article is devoted to the method of initial image processing to use in stereovision systems. It is based on a modification of video stabilization approach [1]. The method considers image rectification process as a sequence of transformations. Each transformation is found as a solution of optimization problem. The article describes mathematical model that fits ...

Added: October 16, 2017

Автоматизация анализа рентгенограмм позвоночника для объективизации оценки степени тяжести сколиотической деформации при идиопатическом сколиозе (предварительное сообщение)

Statsenko M., Ортопедия, травматология и восстановительная хирургия детского возраста 2020 Т. 8 № 3 С. 317-326

Added: November 18, 2022

Моделирование урожайности зерновых культур сельскохозяйственных регионов с использованием технологий компьютерного зрения

Arkhipova M., Экономика региона 2022 Т. 18 № 2 С. 581-594

The article examines new methodologies for modelling crop yield in agricultural regions of Russia based on the use of remote capabilities to get information on the field state. The proposed approach can be applied to develop indicator systems and create methodological platforms and models necessary to obtain more accurate estimates. In comparison with the traditional ...

Added: January 12, 2023

Analysis of the Applicability of a Computer Vision System for Assessment of the Quality of Quail Eggs

Zlatev Z. D., Nedeva V. I., , in : Математика программных систем: межвузовский сборник научных трудов. Вып. 10.: Пермь : Пермский государственный национальный исследовательский университет, 2013. P. 133-139.

The report presents methodology for studying the qualitative and quantitative parameters of quail eggs. The method and system of computer vision are developed. The probability estimations of the health risk from consumption of poor quality food can be determined with developed computer system. ...

Added: May 23, 2015

К вопросу создания бесконтактных средств управления большими массивами данных в эргатических системах

Morozova T., Авиакосмическое приборостроение 2013 № 5 С. 46-56

The article is devoted to the history and problems of creating interfaces. Shows the complexity and importance of effective interfaces, noted that this problem is a system of multilevel interdisciplinary. The new systems should be given serious attention to issues of human efficiency level. Man is still the leading element in determining the efficiency of ...

Added: December 6, 2013

Computer Vision – ECCV 2018. 15th European Conference, Munich, Germany, September 8–14, 2018, Proceedings, Part XII

Cham : Springer, 2018

The sixteen-volume set comprising the LNCS volumes 11205-11220 constitutes the refereed proceedings of the 15th European Conference on Computer Vision, ECCV 2018, held in Munich, Germany, in September 2018. The 776 revised papers presented were carefully reviewed and selected from 2439 submissions. The papers are organized in topical sections on learning for vision; computational photography; ...

Added: October 31, 2018

15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings

Springer, 2018

Added: October 30, 2018

A Principled Deep Random Field Model for Image Segmentation

Kohli P., Osokin A., Jegelka S., , in : Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2013). : Portland : IEEE, 2013. P. 1971-1978.

We discuss a model for image segmentation that is able to overcome the short-boundary bias observed in standard pairwise random field based approaches. To wit, we show that a random field with multi-layered hidden units can encode boundary preserving higher order potentials such as the ones used in the cooperative cuts model of [11] while ...

Added: October 19, 2017