scispace - formally typeset
Proceedings ArticleDOI

Intelligent monitoring of indoor surveillance video based on deep learning

TLDR
Deep learning methods are used, using the state-of-the-art framework for instance segmentation, called Mask R-CNN, to train the fine-tuning network on datasets, which can efficiently detect objects in a video image while simultaneously generating a high-quality segmentation mask for each instance.
Abstract
With the rapid development of information technology, video surveillance system has become a key part in the security and protection system of modern cities. Especially in prisons, surveillance cameras could be found almost everywhere. However, with the continuous expansion of the surveillance network, surveillance cameras not only bring convenience, but also produce a massive amount of monitoring data, which poses huge challenges to storage, analytics and retrieval. The smart monitoring system equipped with intelligent video analytics technology can monitor as well as pre-alarm abnormal events or behaviours, which is a hot research direction in the field of surveillance. This paper combines deep learning methods, using the state-of-the-art framework for instance segmentation, called Mask R-CNN, to train the fine-tuning network on our datasets, which can efficiently detect objects in a video image while simultaneously generating a high-quality segmentation mask for each instance. The experiment show that our network is simple to train and easy to generalize to other datasets, and the mask average precision is nearly up to 98.5% on our own datasets.

read more

Citations
More filters
Journal ArticleDOI

Hawk-Eye: An AI-Powered Threat Detector for Intelligent Surveillance Cameras

TL;DR: In this paper, an AI-powered threat detector for smart surveillance cameras, called Hawk-Eye, is presented, which can be deployed on centralized servers hosted in the cloud and locally on the surveillance cameras at the network edge.
Book ChapterDOI

Surveillance System for Intruder Detection Using Facial Recognition

TL;DR: In this article, the authors used the Nvidia Jetson Nano board to compute convolutional neural network algorithm for the facial recognition process, which can detect the intruder and inform the security within seconds.
Proceedings ArticleDOI

A survey of video human behaviour recognition Methodologies in the Perspective of Spatial-Temporal

TL;DR: In this paper , a comparison of existing frameworks and datasets which are related to video-type datasets only is presented, and the pros and cons of current existing works and further research directions based on existing works are provided.
Proceedings ArticleDOI

A survey of video human behaviour recognition Methodologies in the Perspective of Spatial-Temporal

TL;DR: In this article , a comparison of existing frameworks and datasets which are related to video-type datasets only is presented, and the pros and cons of current existing works and further research directions based on existing works are provided.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Book ChapterDOI

Microsoft COCO: Common Objects in Context

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
Proceedings ArticleDOI

Fully convolutional networks for semantic segmentation

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.
Related Papers (5)