scispace - formally typeset
Open AccessProceedings ArticleDOI

Multi-scale convolutional neural networks for crowd counting

Reads0
Chats0
TLDR
A novel multi-scale convolutional neural network (MSCNN) for single image crowd counting is proposed, able to generate scale-relevant features for higher crowd counting performances in a single-column architecture, which is both accuracy and cost effective for practical applications.
Abstract
Crowd counting on static images is a challenging problem due to scale variations. Recently deep neural networks have been shown to be effective in this task. However, existing neural-networks-based methods often use the multi-column or multi-network model to extract the scale-relevant features, which is more complicated for optimization and computation wasting. To this end, we propose a novel multi-scale convolutional neural network (MSCNN) for single image crowd counting. Based on the multi-scale blobs, the network is able to generate scale-relevant features for higher crowd counting performances in a single-column architecture, which is both accuracy and cost effective for practical applications. Complemental results show that our method outperforms the state-of-the-art methods on both accuracy and robustness with far less number of parameters.

read more

Citations
More filters
Proceedings ArticleDOI

Learning From Synthetic Data for Crowd Counting in the Wild

TL;DR: A data collector and labeler is developed which can generate the synthetic crowd scenes and simultaneously annotate them without any manpower, and a crowd counting method via domain adaptation is proposed, which can free humans from heavy data annotations.
Proceedings ArticleDOI

Crowd Counting with Deep Negative Correlation Learning

TL;DR: This work proposes a new learning strategy to produce generalizable features by way of deep negative correlation learning (NCL), which deeply learn a pool of decorrelated regressors with sound generalization capabilities through managing their intrinsic diversities.
Posted Content

Learning from Synthetic Data for Crowd Counting in the Wild

TL;DR: Wang et al. as discussed by the authors developed a data collector and labeler to generate the synthetic crowd scenes and simultaneously annotate them without any manpower, which can boost the performance of crowd counting in the wild.
Proceedings ArticleDOI

Crowd Counting With Deep Structured Scale Integration Network

TL;DR: Zhang et al. as discussed by the authors proposed a novel Deep Structured Scale Integration Network (DSSINet) for crowd counting, which addresses the scale variation of people by using structured feature representation learning and hierarchically structured loss function optimization.
Proceedings ArticleDOI

Recurrent Attentive Zooming for Joint Crowd Counting and Precise Localization

TL;DR: This work proposes Recurrent Attentive Zooming Network, which recurrently detects ambiguous image region and zooms it into high resolution for re-inspection and proposes an adaptive fusion scheme that effectively elevates the performance.
References
More filters
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.
Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

TL;DR: Caffe as discussed by the authors is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.
Proceedings ArticleDOI

Caffe: Convolutional Architecture for Fast Feature Embedding

TL;DR: Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.
Related Papers (5)