Multi-scale convolutional neural networks for crowd counting

doi:10.1109/ICIP.2017.8296324

Open Access, Proceedings Article

Lingke Zeng, +4 more

- pp 465-469

TLDR

A novel multi-scale convolutional neural network (MSCNN) for single-image crowd counting is proposed. It generates scale-relevant features in a single-column architecture, which improves counting performance while remaining both accurate and cost-effective for practical applications.

Abstract:

Crowd counting on static images is a challenging problem due to scale variations. Recently, deep neural networks have been shown to be effective in this task. However, existing neural-network-based methods often use multi-column or multi-network models to extract scale-relevant features, which makes them harder to optimize and computationally wasteful. To this end, we propose a novel multi-scale convolutional neural network (MSCNN) for single-image crowd counting. Built on multi-scale blobs, the network generates scale-relevant features for higher crowd counting performance in a single-column architecture, which is both accurate and cost-effective for practical applications. The results show that our method outperforms state-of-the-art methods in both accuracy and robustness with far fewer parameters.
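The abstract does not spell out what a multi-scale blob contains. As a rough illustration only, the sketch below assumes an Inception-style block: several filters of different sizes are applied to the same input, and their outputs are stacked as channels within a single column. The kernel sizes (3, 5, 7), the random weights, and the function names are all assumptions made for this example; they are not taken from the paper.

```python
import random

def conv2d_same(image, kernel):
    """Cross-correlate one 2-D channel with a square, odd-sized kernel.

    Zero padding keeps the output the same height and width as the input.
    """
    h, w = len(image), len(image[0])
    k = len(kernel)
    pad = k // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for dy in range(k):
                for dx in range(k):
                    yy, xx = y + dy - pad, x + dx - pad
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += image[yy][xx] * kernel[dy][dx]
            out[y][x] = acc
    return out

def relu(channel):
    """Clamp negative responses to zero."""
    return [[max(v, 0.0) for v in row] for row in channel]

def multi_scale_blob(image, kernels):
    """Apply every kernel to the same input and stack the results as channels."""
    return [relu(conv2d_same(image, kernel)) for kernel in kernels]

def random_kernel(size, rng):
    """Hypothetical stand-in for learned weights."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(size)] for _ in range(size)]

rng = random.Random(0)
image = [[rng.random() for _ in range(8)] for _ in range(6)]  # 6 x 8 toy input

kernel_sizes = (3, 5, 7)  # assumed sizes, chosen only for illustration
kernels = [random_kernel(size, rng) for size in kernel_sizes]

features = multi_scale_blob(image, kernels)
num_channels = len(features)
out_shape = (len(features[0]), len(features[0][0]))
num_weights = sum(size * size for size in kernel_sizes)

print(num_channels, out_shape, num_weights)
```

Each output channel responds to structure at a different spatial extent, yet all branches share one input and one forward pass, which is the sense in which a single column can still produce scale-relevant features.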

Citations

Proceedings Article

Learning From Synthetic Data for Crowd Counting in the Wild

Qi Wang, +3 more

TL;DR: A data collector and labeler is developed which can generate the synthetic crowd scenes and simultaneously annotate them without any manpower, and a crowd counting method via domain adaptation is proposed, which can free humans from heavy data annotations.

Proceedings Article

Crowd Counting with Deep Negative Correlation Learning

Zenglin Shi, +6 more

TL;DR: This work proposes a new learning strategy that produces generalizable features through deep negative correlation learning (NCL), which learns a pool of decorrelated regressors with sound generalization capabilities by managing their intrinsic diversities.

Posted Content

Learning from Synthetic Data for Crowd Counting in the Wild

Qi Wang, +3 more

- 08 Mar 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Wang et al. developed a data collector and labeler that generates synthetic crowd scenes and annotates them simultaneously without any manpower, which can boost the performance of crowd counting in the wild.

Proceedings Article

Crowd Counting With Deep Structured Scale Integration Network

Lingbo Liu, +5 more

TL;DR: This work proposes a novel Deep Structured Scale Integration Network (DSSINet) for crowd counting, which addresses the scale variation of people by using structured feature representation learning and hierarchically structured loss function optimization.

Proceedings Article

Recurrent Attentive Zooming for Joint Crowd Counting and Precise Localization

Chenchen Liu, +2 more

TL;DR: This work proposes the Recurrent Attentive Zooming Network, which recurrently detects ambiguous image regions and zooms into them at high resolution for re-inspection, together with an adaptive fusion scheme that effectively elevates performance.

References

Proceedings Article

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception is a deep convolutional neural network architecture that set a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

Proceedings Article

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair, +1 more

TL;DR: Restricted Boltzmann machines were originally developed with binary stochastic hidden units; replacing these with rectified linear units yields features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.

Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 20 Jun 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Caffe is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

Proceedings Article

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

TL;DR: Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

Related Papers (5)

Single-Image Crowd Counting via Multi-Column Convolutional Neural Network

Cross-scene crowd counting via deep convolutional neural networks

Multi-source Multi-scale Counting in Extremely Dense Crowd Images

Switching Convolutional Neural Network for Crowd Counting

CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes