Alpha-Flow for Video Matting

Sindeev M.; Konushin Anton; Rother C.

?

Alpha-Flow for Video Matting

P. 438-452.

Sindeev M., Konushin Anton, Rother C.

This work addresses the problem of video matting, that is extracting the opacity-layer of a foreground object from a video sequence. We introduce the notion of alpha-flow which corresponds to the flow in the opacity layer. The idea is derived from the process of rotoscoping, where a user-supplied object mask is smoothly interpolated between keyframes while preserving its correspondence with the underlying image. Our key contribution is an algorithm which infers both the opacity masks and the alpha-flow in an efficient and unified manner. We embed our algorithm in an interactive video matting system where the first and last frame of a sequence are given as keyframes, and additional user strokes may be provided in intermediate frames. We show high quality results on various challenging sequences, and give a detailed comparison to competing techniques.

Language: English

Text on another site

Keywords: computer vision video processing

In book

Lecture Notes in Computer Science

* III. Vol. 7726: Computer Vision – ACCV 2012. , Berlin : Springer, 2013

Fast automatic single-view 3-d reconstruction of urban scenes

Barinova O., Konushin V., Yakubenko A. et al., , in : Lecture Notes in Computer Science. Vol. 5303: Computer Vision – ECCV 2008.: Berlin : Springer, 2008. P. 100-113.

We consider the problem of estimating 3-d structure from a single still image of an outdoor urban scene. Our goal is to efficiently create 3-d models which are visually pleasant. We chose an appropriate 3-d model structure and formulate the task of 3-d reconstruction as model fitting problem. Our 3-d models are composed of a ...

Added: July 10, 2014

Real-Time System for Automatic Cold Strip Surface Defect Detection

Kostenetskiy P., Alkapov R., Chulkevich R. et al., FME Transactions 2019 Vol. 47 No. 4 P. 765-774

Detection and classification of surface defects of the rolled metal is one of the main tasks for correctly assessing product quality. Historically, these tasks were performed by human. But due to a multitude of production factors, such as high rolling rate and temperature of the metal, the results of such human work are rather low. ...

Added: November 22, 2019

Lecture Notes in Computer Science

Berlin : Springer, 2008

Added: July 10, 2014

15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings

Springer, 2018

The sixteen-volume set comprising the LNCS volumes 11205-11220 constitutes the refereed proceedings of the 15th European Conference on Computer Vision, ECCV 2018, held in Munich, Germany, in September 2018. The 776 revised papers presented were carefully reviewed and selected from 2439 submissions. The papers are organized in topical sections on learning for vision; computational photography; human analysis; ...

Added: October 30, 2018

Face anti-spoofing with joint spoofing medium detection and eye blinking analysis

Nikitin M. Y., Konushin V. S., Konushin A., Computer Optics 2019 Vol. 43 No. 4 P. 618-626

Modern biometric systems based on face recognition demonstrate high recognition quality, but they are vulnerable to face presentation attacks, such as photo or replay attack. Existing face anti-spoofing methods are mostly based on texture analysis and due to lack of training data either use hand-crafted features or ﬁne-tuned pretrained deep models. In this paper we ...

Added: October 31, 2019

Noise Resistant Morphological Algorithm of Moving Forklift Truck Detection on Noisy Image Data

Savchenko A., Chernousov V.O., International Journal of Conceptual Structures and Smart Applications (IJCSSA) 2014 Vol. 2 No. 2 P. 36-54

We investigate the specific problem of machine vision, namely, video-based detection of the moving forklift truck. It is shown that the detection quality of the state-of-the-art local descriptors (SURF, SIFT, etc.) is not satisfactory if the resolution is low and the illumination is changed dramatically. In this paper, we propose to use a simple mathematical ...

Added: September 10, 2015

Practical People Counting Algorithm

Mamedov T., Kuplyakov D., Konushin A., , in : Proceedings of the 31st International Conference on Computer Graphics and Vision (GraphiCon 2021). Nizhny Novgorod, Russia, September 27-30, 2021. Vol. 3027.: CEUR Workshop Proceedings, 2021. P. 453-463.

In this paper, we consider the problem of people counting in video surveillance. This is an important task in video analysis, because this data can be used for predictive analytics and improvement of customer services, traffic control, etc. The proposed methods are based on object tracking and are able to work on sparse frames, which ...

Added: November 25, 2022

Estimation of 4-DoF manipulator optimal configuration for autonomous camera calibration of a mobile robot using on-board templates

Tsoy T., Safin R., Magid E. et al., , in : 2022 International Siberian Conference on Control and Communications (SIBCON). : IEEE, 2022. Ch. 9438925.

Camera calibration is one of the important tasks in the field of robotics and computer vision. It enables to increase the accuracy of metric measurements in photogrammetry applications and provides higher performance in computer vision algorithms such as stereo matching and motion estimation. It is known that regardless of the calibration method used variation of ...

Added: October 11, 2021

The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016

Curran Associates, Inc., 2016

CVRP is the premiere annual Computer Vision event comprising the main CVRP conference and 27 co-located workshops and short courses. ...

Added: December 26, 2017

Proceedings of the IEEE International Conference on Computer Vision (ICCV 2015)

Santiago de Chile : IEEE, 2015

Computer Vision (ICCV), 2015 IEEE International Conference on ...

Added: October 19, 2017

Methods of gait recognition in video

Соколова А. И., Konushin A., Programming and Computer Software 2019 Vol. 45 No. 4 P. 213-220

Human gait is an important biometric index that allows to identify a person at a great distance without direct contact. Due to these qualities, which other popular identifiers such as fingerprints or iris do not have, the recognition of a person by the manner of walking has become very common in various areas where video ...

Added: October 31, 2019

Weight Averaging Improves Knowledge Distillation under Domain Shift

Berezovskiy V., Morozov N., , in : The 2nd Workshop and Challenges for Out-of-Distribution Generalization in Computer Vision. ICCV 2023. : [б.и.], 2023.

Knowledge distillation (KD) is a powerful model compression technique broadly used in practical deep learning applications. It is focused on training a small student network to mimic a larger teacher network. While it is widely known that KD can offer an improvement to student generalization in i.i.d setting, its performance under domain shift, i.e. the ...

Added: November 20, 2023

Shape Perception

Sawada T., Li Y., Pizlo Z., , in : The Oxford Handbook of Computational and Mathematical Psychology. : Oxford University Press, 2015. P. 255-276.

This chapter provides a review of topics and concepts that are necessary to study and understand 3D shape perception. This includes group theory and their invariants; model-based invariants; Euclidean, affine, and projective geometry; symmetry; inverse problems; simplicity principle; Fechnerian psychophysics; regularization theory; Bayesian inference; shape constancy and shape veridicality; shape recovery; perspective and orthographic projections; ...

Added: March 10, 2015

Human face detection in excessive dark image by using contrast stretching, histogram equalization and adaptive equalization

Ahmed Munna M. T., International Journal of Engineering and Technology 2018 Vol. 7 No. 4 P. 3990-3994

Darkness is the inverse state of the brightness, is obtained as an absence of noticeable light and illumination. Generally, face detection applications cannot detect any human face in a dark image, where the image has captured from the dark environment or dark night. In this manuscript, we demonstrate our experiment, where we use Contrast Stretching, ...

Added: October 29, 2019

HCI International 2023 Posters

Springer, 2023

Added: October 21, 2023

Advances in Computer Graphics. CGI 2020

Springer, 2020

onference, CGI 2020, held in Geneva, Switzerland, in October 2020. The conference was held virtually. The 43 full papers presented together with 3 short papers were carefully reviewed and selected from 189 submissions. The papers address topics such as: virtual reality; rendering and textures; augmented and mixed reality; video processing; image processing; fluid simulation and control; ...

Added: November 2, 2020

Об одном подходе к решению задачи ректификации стереоизображений по сцене без калибровки камер

Protasov S., Крыловецкий А. А., Кургалин С. Д., Известия ЮФУ. Технические науки 2012 № 6 С. 144-148

This article is devoted to the method of initial image processing to use in stereovision systems. It is based on a modification of video stabilization approach [1]. The method considers image rectification process as a sequence of transformations. Each transformation is found as a solution of optimization problem. The article describes mathematical model that fits ...

Added: October 16, 2017

People Tracking Algorithm for Human Height Mounted Cameras

Kononov V., Konushin A., Konushin V., , in : Lecture Notes in Computer Science. Vol. 6835: Pattern Recognition.: Berlin : Springer, 2011. P. 163-172.

We present a new people tracking method for human height mounted camera, e.g. the one attached near information or advertising stand. We use state-of-the-art particle filter approach and improve it by explicitly modeling of object visibility which makes the method able to cope with difficult object overlapping. We employ our own method based on online-boosting ...

Added: July 10, 2014

NTIRE 2022 Challenge on Night Photography Rendering

Ershov E., , in : 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). : IEEE, 2022. P. 1287-1300.

This paper reviews the NTIRE 2022 challenge on night photography rendering. The challenge solicited solutions that processed RAW camera images captured in night scenes to produce a photo-finished output image encoded in the standard RGB (sRGB) space. Given the subjective nature of this task, the proposed solutions were evaluated based on the mean opinions of ...

Added: September 8, 2023

Digital Fingerprinting of Microstructures

White M., Tarakanov A., Race C. et al., / arXiv. Series 2203.13718 "cs". 2022.

Finding efficient means of fingerprinting microstructural information is a critical step towards harnessing data-centric machine learning approaches. A statistical framework is systematically developed for compressed characterisation of a population of images, which includes some classical computer vision methods as special cases. The focus is on materials microstructure. The ultimate purpose is to rapidly fingerprint sample ...

Added: October 28, 2022

Моделирование урожайности зерновых культур сельскохозяйственных регионов с использованием технологий компьютерного зрения

Arkhipova M., Экономика региона 2022 Т. 18 № 2 С. 581-594

The article examines new methodologies for modelling crop yield in agricultural regions of Russia based on the use of remote capabilities to get information on the field state. The proposed approach can be applied to develop indicator systems and create methodological platforms and models necessary to obtain more accurate estimates. In comparison with the traditional ...

Added: January 12, 2023

Gestalt-like constraints produce veridical (Euclidean) percepts of 3D indoor scenes

Kwon T., Li Y., Sawada T. et al., Vision Research 2016 Vol. 126 P. 264-277

This study, which was influenced a lot by Gestalt ideas, extends our prior work on the role of a priori constraints in the veridical perception of 3D shapes to the perception of 3D scenes. Our experiments tested how human sub-jects perceive the layout of a naturally-illuminated indoor scene that contains common symmetrical 3D objects standing ...

Added: September 15, 2015

Deep Part-Based Generative Shape Model with Latent Variables

Kirillov A., Gavrikov M., Lobacheva E. et al., , in : Proceedings of the 27th British Machine Vision Conference. : -, 2016. P. 1-12.

The Shape Boltzmann Machine (SBM) and its multilabel version MSBM have been recently introduced as deep generative models that capture the variations of an object shape. While being more flexible MSBM requires datasets with labeled parts of the objects for training. In the paper we present an algorithm for training MSBM using binary masks of ...

Added: February 24, 2017

Data Analytics and Management in Data Intensive Domains. 23rd International Conference, DAMDID/RCDL 2021, Moscow, Russia, October 26–29, 2021, Revised Selected Papers

Springer, 2022

“Data Analytics and Management in Data Intensive Domains” conference (DAMDID) is planned as a multidisciplinary forum of researchers and practitioners from various domains of science and research promoting cooperation and exchange of ideas in the area of data analysis and management in data intensive domains. Approaches to data analysis and management being developed in specific data intensive domains of X-informatics (such as X = astro, bio, chemo, geo, medicine, neuro, physics, ...

Added: August 30, 2021