Home
/
Authors
/
Bo Ning

Author

Bo Ning

University of Science and Technology of China

Bio: Bo Ning is an academic researcher from University of Science and Technology of China. The author has contributed to research in topics: Codebook & Bag-of-words model. The author has an hindex of 2, co-authored 3 publications receiving 54 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Selecting Key Poses on Manifold for Pairwise Action Recognition

[...]

Xianbin Cao¹, Bo Ning¹, Pingkun Yan, Xuelong Li•Institutions (1)

University of Science and Technology of China¹

01 Feb 2012-IEEE Transactions on Industrial Informatics

TL;DR: A novel approach for key poses selection is proposed, which models the descriptor space utilizing a manifold learning technique to recover the geometric structure of the descriptors on a lower dimensional manifold and develops a PageRank-based centrality measure.

...read moreread less

Abstract: In action recognition, bag of visual words based approaches have been shown to be successful, for which the quality of codebook is critical. In a large vocabulary of poses (visual words), some key poses play a more decisive role than others in the codebook. This paper proposes a novel approach for key poses selection, which models the descriptor space utilizing a manifold learning technique to recover the geometric structure of the descriptors on a lower dimensional manifold. A PageRank-based centrality measure is developed to select key poses according to the recovered geometric structure. In each step, a key pose is selected from the manifold and the remaining model is modified to maximize the discriminative power of selected codebook. With the obtained codebook, each action can be represented with a histogram of the key poses. To solve the ambiguity between some action classes, a pairwise subdivision is executed to select discriminative codebooks for further recognition. Experiments on benchmark datasets showed that our method is able to obtain better performance compared with other state-of-the-art methods.

...read moreread less

52 citations

Journal Article•DOI•

Pedestrian detection in unseen scenes by dynamically updating visual words

[...]

Xianbin Cao¹, Li Wang¹, Bo Ning², Yuan Yuan³, Pingkun Yan³ - Show less +1 more•Institutions (3)

Beihang University¹, University of Science and Technology of China², Chinese Academy of Sciences³

01 Nov 2013-Neurocomputing

TL;DR: A novel bag of visual words based method is proposed to detect pedestrians in unseen scenes by dynamically updating the key words by using three strategies covering key word selection, detector invariance, and codebook update.

...read moreread less

6 citations

Proceedings Article•DOI•

Putting poses on manifold for action recognition

[...]

Xianbin Cao¹, Bo Ning¹, Pingkun Yan², Xuelong Li²•Institutions (2)

University of Science and Technology of China¹, Chinese Academy of Sciences²

01 Nov 2011

TL;DR: A novel approach to select key poses for the codebook is proposed, which models the descriptor space utilizing manifold learning to recover the geometric structure of the descriptors on a lower dimensional manifold space.

...read moreread less

Abstract: In action recognition, bag of words based approaches have been shown to be successful, for which the quality of codebook is critical. This paper proposes a novel approach to select key poses for the codebook, which models the descriptor space utilizing manifold learning to recover the geometric structure of the descriptors on a lower dimensional manifold space. A PageRank based centrality measure is developed to select key poses on the manifold. In each step, a key pose is selected and the remaining model is modified to maximize the discriminative power of selected codebook. In classification, the ambiguity of each action couple is evaluated through cross validation. An additional subdivision will be executed for ambiguous pairs. Experiments on ut-tower dataset showed that our method is able to obtain better performance than the state-of-the-art methods.

...read moreread less

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Image-Based Three-Dimensional Human Pose Recovery by Multiview Locality-Sensitive Sparse Retrieval

[...]

Chaoqun Hong¹, Jun Yu², Dacheng Tao³, Meng Wang⁴•Institutions (4)

Xiamen University of Technology¹, Hangzhou Dianzi University², University of Technology, Sydney³, Hefei University of Technology⁴

01 Jun 2015-IEEE Transactions on Industrial Electronics

TL;DR: This approach improves traditional methods by adopting multiview locality-sensitive sparse coding in the retrieving process, and incorporates a local similarity preserving term into the objective of sparse coding, which groups similar silhouettes to alleviate the instability of sparse codes.

...read moreread less

Abstract: Image-based 3-D human pose recovery is usually conducted by retrieving relevant poses with image features. However, it suffers from the high dimensionality of image features and the low efficiency of the retrieving process. Particularly for multiview data, the integration of different types of features is difficult. In this paper, a novel approach is proposed to recover 3-D human poses from silhouettes. This approach improves traditional methods by adopting multiview locality-sensitive sparse coding in the retrieving process. First, it incorporates a local similarity preserving term into the objective of sparse coding, which groups similar silhouettes to alleviate the instability of sparse codes. Second, the objective function of sparse coding is improved by integrating multiview data. The experimental results show that the retrieval error has been reduced by 20% to 50%, which demonstrate the effectiveness of the proposed method.

...read moreread less

242 citations

Journal Article•DOI•

Boosted key-frame selection and correlated pyramidal motion-feature representation for human action recognition

[...]

Li Liu¹, Ling Shao¹, Peter I. Rockett¹•Institutions (1)

University of Sheffield¹

01 Jul 2013-Pattern Recognition

TL;DR: A novel method for human action recognition based on boosted key-frame selection and correlated pyramidal motion feature representations and the correlogram, which focuses not only on probabilistic distributions within one frame but also on the temporal relationships of the action sequence is proposed.

...read moreread less

127 citations

Journal Article•DOI•

$p$ -Laplacian Regularized Sparse Coding for Human Activity Recognition

[...]

Weifeng Liu¹, Zheng-Jun Zha², Yanjiang Wang¹, Ke Lu³, Dacheng Tao⁴ - Show less +1 more•Institutions (4)

China University of Petroleum¹, University of Science and Technology of China², Chinese Academy of Sciences³, University of Technology, Sydney⁴

07 Apr 2016-IEEE Transactions on Industrial Electronics

TL;DR: The experimental results demonstrate that the proposed pLSC algorithm outperforms the manifold regularized sparse coding algorithms including the standard Laplacian regularization sparse coding algorithm with a proper p.

...read moreread less

Abstract: Human activity analysis in videos has increasingly attracted attention in computer vision research with the massive number of videos now accessible online. Although many recognition algorithms have been reported recently, activity representation is challenging. Recently, manifold regularized sparse coding has obtained promising performance in action recognition, because it simultaneously learns the sparse representation and preserves the manifold structure. In this paper, we propose a generalized version of Laplacian regularized sparse coding for human activity recognition called $p$ -Laplacian regularized sparse coding (pLSC). The proposed method exploits $p$ -Laplacian regularization to preserve the local geometry. The $p$ -Laplacian is a nonlinear generalization of standard graph Laplacian and has tighter isoperimetric inequality. As a result, pLSC provides superior theoretical evidence than standard Laplacian regularized sparse coding with a proper $p$ . We also provide a fast iterative shrinkage-thresholding algorithm for the optimization of pLSC. Finally, we input the sparse codes learned by the pLSC algorithm into support vector machines and conduct extensive experiments on the unstructured social activity attribute dataset and human motion database (HMDB51) for human activity recognition. The experimental results demonstrate that the proposed pLSC algorithm outperforms the manifold regularized sparse coding algorithms including the standard Laplacian regularized sparse coding algorithm with a proper $p$ .

...read moreread less

112 citations

Journal Article•DOI•

Silhouette Analysis-Based Action Recognition Via Exploiting Human Poses

[...]

Di Wu¹, Ling Shao¹•Institutions (1)

University of Sheffield¹

01 Feb 2013-IEEE Transactions on Circuits and Systems for Video Technology

TL;DR: The proposed scheme takes advantages of local and global features and, therefore, provides a discriminative representation for human actions and outperforms the state-of-the-art methods on the IXMAS action recognition dataset.

...read moreread less

Abstract: In this paper, we propose a novel scheme for human action recognition that combines the advantages of both local and global representations. We explore human silhouettes for human action representation by taking into account the correlation between sequential poses in an action. A modified bag-of-words model, named bag of correlated poses, is introduced to encode temporally local features of actions. To utilize the property of visual word ambiguity, we adopt the soft assignment strategy to reduce the dimensionality of our model and circumvent the penalty of computational complexity and quantization error. To compensate for the loss of structural information, we propose an extended motion template, i.e., extensions of the motion history image, to capture the holistic structural features. The proposed scheme takes advantages of local and global features and, therefore, provides a discriminative representation for human actions. Experimental results prove the viability of the complimentary properties of two descriptors and the proposed approach outperforms the state-of-the-art methods on the IXMAS action recognition dataset.

...read moreread less

97 citations

Journal Article•DOI•

Learning Discriminative Key Poses for Action Recognition

[...]

Li Liu¹, Ling Shao¹, Xiantong Zhen¹, Xuelong Li•Institutions (1)

University of Sheffield¹

01 Dec 2013-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: A new classifier named weighted local naive Bayes nearest neighbor is proposed for the final action classification, which is demonstrated to be more accurate and robust than other classifiers, e.g., support vector machine (SVM) and naive Baye nearest neighbor.

...read moreread less

Abstract: In this paper, we present a new approach for human action recognition based on key-pose selection and representation. Poses in video frames are described by the proposed extensive pyramidal features (EPFs), which include the Gabor, Gaussian, and wavelet pyramids. These features are able to encode the orientation, intensity, and contour information and therefore provide an informative representation of human poses. Due to the fact that not all poses in a sequence are discriminative and representative, we further utilize the AdaBoost algorithm to learn a subset of discriminative poses. Given the boosted poses for each video sequence, a new classifier named weighted local naive Bayes nearest neighbor is proposed for the final action classification, which is demonstrated to be more accurate and robust than other classifiers, e.g., support vector machine (SVM) and naive Bayes nearest neighbor. The proposed method is systematically evaluated on the KTH data set, the Weizmann data set, the multiview IXMAS data set, and the challenging HMDB51 data set. Experimental results manifest that our method outperforms the state-of-the-art techniques in terms of recognition rate.

...read moreread less

96 citations

1
2
3
4
…
5
6
7
8
9
10
11
12

Collapse