Journal ArticleDOI

Unsupervised Moving Object Detection in Complex Scenes Using Adversarial Regularizations

TLDR
A Generative Adversarial Network (GAN)-based moving object detection algorithm, called MOD_GAN, is proposed, enabling the algorithm to learn to generate background sequences from uniformly distributed random noise samples.
Abstract
Moving object detection (MOD) is a fundamental step in many high-level vision-based applications, such as human activity analysis, visual object tracking, autonomous vehicles, surveillance, and security. Most existing MOD algorithms degrade in complex scenes captured by static cameras that contain camouflaged objects, shadows, dynamic backgrounds, and varying illumination conditions. To handle these challenges, we propose a Generative Adversarial Network (GAN)-based moving object detection algorithm, called MOD_GAN. In the proposed algorithm, scene-specific GANs are trained in an unsupervised MOD setting, enabling the algorithm to learn to generate background sequences from uniformly distributed random noise samples. In addition to the adversarial loss, norm-based losses in the image space and the discriminator feature space are minimized between the generated images and the training data during training. These additional losses enable the generator to learn subtle background details, resulting in more realistic complex-scene generation. During testing, a novel backpropagation-based algorithm generates images with statistics similar to the test images: more appropriate random noise samples are found by directly minimizing the loss between the test and generated images in both the image and discriminator feature spaces. The network is not updated in this step; only the input noise samples are iteratively modified to minimize the loss. Moreover, motion information ensures that this loss is computed only on small-motion pixels. A novel dataset containing outdoor time-lapse images from dawn to dusk with a full illumination-variation cycle is also proposed to better compare MOD algorithms in outdoor scenes. Extensive experiments on five benchmark datasets and comparisons with 30 existing methods demonstrate the strength of the proposed algorithm.
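The test-time noise-search step is the paper's key inference mechanism and lends itself to a short sketch. The code below is a minimal illustration under stated assumptions, not the authors' implementation: G (a pretrained generator mapping noise to a background image), D_feat (the discriminator truncated at a feature layer), test_img, motion_mask, and all hyperparameter values are hypothetical stand-ins.

```python
import torch

# Assumptions (hypothetical names, not from the paper's code):
# G           - pretrained generator, frozen: maps noise z -> background image
# D_feat      - discriminator truncated to a feature layer, frozen
# test_img    - test frame tensor, shape (1, 3, H, W)
# motion_mask - tensor with 1 where motion is small, shape (1, 1, H, W);
#               the loss is computed only on those pixels

def search_noise(G, D_feat, test_img, motion_mask, z_dim=100,
                 steps=200, lr=0.01, lam=0.1):
    z = torch.rand(1, z_dim, requires_grad=True)  # uniform noise sample
    opt = torch.optim.Adam([z], lr=lr)            # optimizes z only, not the nets
    for _ in range(steps):
        opt.zero_grad()
        gen = G(z)                                # generated background
        # image-space loss restricted to small-motion pixels
        img_loss = ((gen - test_img).abs() * motion_mask).mean()
        # discriminator feature-space loss
        feat_loss = (D_feat(gen) - D_feat(test_img)).abs().mean()
        loss = img_loss + lam * feat_loss
        loss.backward()                           # gradients flow through G to z
        opt.step()
    return G(z).detach()                          # background for subtraction
```

Because the optimizer holds only z, the generator and discriminator weights stay fixed while gradients flow through them to refine the noise sample, matching the paper's description of this step.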


Citations
Journal ArticleDOI

U2-ONet: A Two-Level Nested Octave U-Structure Network with a Multi-Scale Attention Mechanism for Moving Object Segmentation

TL;DR: U2-ONet is a two-level nested octave U-structure network with a multi-scale attention mechanism; it takes two RGB frames, the optical flow between them, and the instance segmentation of the frames as inputs.
Journal ArticleDOI

Moving Human Target Detection and Tracking in Video Frames

TL;DR: The proposed method outperforms state-of-the-art algorithms on precision, accuracy, recall, and F1-score under three different lighting conditions, while also reducing time complexity.
Journal ArticleDOI

Deep Learning-based Moving Object Segmentation: Recent Progress and Research Prospects

TL;DR: In this article, the authors present a more up-to-date categorization based on model characteristics, then compare and discuss each category from the perspectives of feature learning and of model training and evaluation.
Journal ArticleDOI

Unsupervised moving object segmentation using background subtraction and optimal adversarial noise sample search

TL;DR: In this article, the authors propose a MOS algorithm exploiting multiple adversarial regularizations, including conventional as well as least-squares losses, which force the generator to synthesize dynamic backgrounds similar to the test sequences; subtracting these backgrounds from the test frames yields the moving object segmentation.
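The subtraction step this TL;DR mentions is simple enough to sketch: threshold the residual between a test frame and its generated background to obtain a binary mask. A minimal sketch with a hypothetical threshold, not the paper's exact criterion:

```python
import numpy as np

def segment_moving_objects(frame, background, thresh=0.1):
    """Subtract a generated background from a test frame and threshold
    the per-pixel residual to obtain a binary motion mask.
    `thresh` (fraction of full intensity) is an illustrative value only."""
    residual = np.abs(frame.astype(np.float32) - background.astype(np.float32))
    return (residual.mean(axis=-1) > thresh * 255.0).astype(np.uint8)
```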
References
Journal ArticleDOI

Generative Adversarial Nets

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously trained: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than from G.
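The adversarial process summarized above corresponds to the well-known two-player minimax game over a value function; in the notation of the original paper,

$$\min_G \max_D V(D, G) = \mathbb{E}_{x \sim p_{\text{data}}(x)}[\log D(x)] + \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z)))].$$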
Proceedings ArticleDOI

Image-to-Image Translation with Conditional Adversarial Networks

TL;DR: Conditional adversarial networks are investigated as a general-purpose solution to image-to-image translation problems and it is demonstrated that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.
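For reference, the pix2pix objective pairs the conditional adversarial loss with an L1 reconstruction term weighted by $\lambda$, so the generator is pushed toward the ground truth while the discriminator enforces realism:

$$G^* = \arg\min_G \max_D \mathcal{L}_{cGAN}(G, D) + \lambda\, \mathcal{L}_{L1}(G), \qquad \mathcal{L}_{L1}(G) = \mathbb{E}_{x,y,z}\big[\|y - G(x, z)\|_1\big].$$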
Proceedings ArticleDOI

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

TL;DR: CycleGAN, as discussed by the authors, learns a mapping G : X → Y such that the distribution of images G(X) is indistinguishable from the distribution of Y under an adversarial loss.
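Because the mapping G alone is under-constrained, the paper couples it with an inverse mapping F : Y → X and a cycle-consistency loss:

$$\mathcal{L}_{\text{cyc}}(G, F) = \mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\|F(G(x)) - x\|_1\big] + \mathbb{E}_{y \sim p_{\text{data}}(y)}\big[\|G(F(y)) - y\|_1\big].$$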
Proceedings ArticleDOI

Adaptive background mixture models for real-time tracking

TL;DR: This paper discusses modeling each pixel as a mixture of Gaussians and using an on-line approximation to update the model, resulting in a stable, real-time outdoor tracker which reliably deals with lighting changes, repetitive motions from clutter, and long-term scene changes.
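This per-pixel Gaussian-mixture model is widely available off the shelf; for instance, OpenCV's MOG2 background subtractor implements a closely related adaptive variant. A minimal usage sketch (parameter and file values are illustrative):

```python
import cv2

# MOG2 implements an adaptive per-pixel Gaussian mixture background model,
# a descendant of the approach described above.
subtractor = cv2.createBackgroundSubtractorMOG2(
    history=500,         # number of frames the per-pixel mixtures adapt over
    varThreshold=16,     # squared-distance threshold for foreground decisions
    detectShadows=True)  # mark shadow pixels with a separate mask value

cap = cv2.VideoCapture("input.mp4")  # illustrative input file
while True:
    ok, frame = cap.read()
    if not ok:
        break
    fg_mask = subtractor.apply(frame)  # online model update + foreground mask
cap.release()
```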