scispace - formally typeset
Author

Zhengming Li

Bio: Zhengming Li is an academic researcher at Harbin Institute of Technology. The author has contributed to research on topics including computer science and K-SVD, has an h-index of 5, and has co-authored 7 publications receiving 438 citations.

Papers
Journal ArticleDOI
TL;DR: A discriminative dictionary learning algorithm, called the locality-constrained and label embedding dictionary learning (LCLE-DL) algorithm, was proposed for image classification, which can achieve better performance than some state-of-the-art algorithms.
Abstract: Locality and label information of training samples play an important role in image classification. However, previous dictionary learning algorithms do not take the locality and label information of atoms into account together in the learning process, and thus their performance is limited. In this paper, a discriminative dictionary learning algorithm, called the locality-constrained and label embedding dictionary learning (LCLE-DL) algorithm, was proposed for image classification. First, the locality information was preserved using the graph Laplacian matrix of the learned dictionary instead of the conventional one derived from the training samples. Then, the label embedding term was constructed using the label information of atoms instead of the classification error term, which contained discriminating information of the learned dictionary. The optimal coding coefficients derived by the locality-based and label-based reconstruction were effective for image classification. Experimental results demonstrated that the LCLE-DL algorithm can achieve better performance than some state-of-the-art algorithms.

163 citations
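The locality idea in LCLE-DL can be illustrated with a toy sketch. The paper's exact objective is not reproduced here; the snippet below assumes a Gaussian-kernel affinity graph over the dictionary atoms and a simple coding objective ||X − DS||² + α·tr(SᵀLS) + λ||S||², which has a closed-form solution. The function names `atom_laplacian` and `locality_code` are illustrative, not from the paper:

```python
import numpy as np

def atom_laplacian(D, sigma=1.0):
    # Gaussian-kernel affinity between dictionary atoms (columns of D),
    # then the unnormalized graph Laplacian L = diag(W 1) - W
    sq = np.sum(D**2, axis=0)
    dist2 = np.maximum(sq[:, None] + sq[None, :] - 2 * D.T @ D, 0)
    W = np.exp(-dist2 / (2 * sigma**2))
    np.fill_diagonal(W, 0)
    return np.diag(W.sum(axis=1)) - W

def locality_code(X, D, alpha=0.1, lam=1e-3):
    # Closed-form minimizer of ||X - D S||^2 + alpha tr(S^T L S) + lam ||S||^2
    L = atom_laplacian(D)
    K = D.shape[1]
    return np.linalg.solve(D.T @ D + alpha * L + lam * np.eye(K), D.T @ X)

rng = np.random.default_rng(0)
D = rng.standard_normal((20, 8))   # 8 atoms in 20-D
X = rng.standard_normal((20, 5))   # 5 samples
S = locality_code(X, D)
print(S.shape)  # (8, 5)
```

The Laplacian term tr(SᵀLS) penalizes codes that differ strongly across similar atoms, which is one way to read the "locality preserved via the graph Laplacian of the learned dictionary" claim.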

Journal ArticleDOI
TL;DR: This paper proposes to exploit the symmetry of the face to generate new samples and devises a representation-based method to perform face recognition, which outperforms state-of-the-art face recognition methods including sparse representation classification (SRC), linear regression classification (LRC), collaborative representation (CR), and two-phase test sample sparse representation (TPTSSR).

160 citations

Journal ArticleDOI
TL;DR: A survey of dictionary learning algorithms for face recognition is provided to understand the profiles of this subject and to grasp the theoretical rationales and potentials as well as their applicability to different cases of face recognition.
Abstract: During the past several years, as one of the most successful applications of sparse coding and dictionary learning, dictionary-based face recognition has received significant attention. Although some surveys of sparse coding and dictionary learning have been reported, there is no specialized survey concerning dictionary learning algorithms for face recognition. This paper provides a survey of dictionary learning algorithms for face recognition. To provide a comprehensive overview, we not only categorize existing dictionary learning algorithms for face recognition but also present details of each category. Since the number of atoms has an important impact on classification performance, we also review the algorithms for selecting the number of atoms. Specifically, we select six typical dictionary learning algorithms with different numbers of atoms to perform experiments on face databases. In summary, this paper provides a broad view of dictionary learning algorithms for face recognition and advances study in this field. It is very useful for readers to understand the profiles of this subject and to grasp the theoretical rationales and potentials as well as their applicability to different cases of face recognition.

118 citations

Journal ArticleDOI
TL;DR: Experimental results demonstrate that the proposed algorithm framework outperforms some previous state-of-the-art dictionary learning and sparse coding algorithms in face recognition and can be applied to other pattern classification tasks.

66 citations

Journal ArticleDOI
Ruijun Ma, Bob Zhang, Yicong Zhou, Zhengming Li, Fangyuan Lei
TL;DR: This paper proposed a novel network, namely, the PID controller guide attention neural network (PAN-Net), which takes advantage of both the proportional-integral-derivative (PID) controller and attention neural networks for real photograph denoising.
Abstract: Real photograph denoising is extremely challenging in low-level computer vision since the noise is sophisticated and cannot be fully modeled by explicit distributions. Although deep-learning techniques have been actively explored for this issue and achieved convincing results, most of the networks may cause vanishing or exploding gradients, and usually entail more time and memory to obtain a remarkable performance. This article overcomes these challenges and presents a novel network, namely, PID controller guide attention neural network (PAN-Net), taking advantage of both the proportional-integral-derivative (PID) controller and attention neural network for real photograph denoising. First, a PID-attention network (PID-AN) is built to learn and exploit discriminative image features. Meanwhile, we devise a dynamic learning scheme by linking the neural network and control action, which significantly improves the robustness and adaptability of PID-AN. Second, we explore both the residual structure and share-source skip connections to stack the PID-ANs. Such a framework provides a flexible way to feature residual learning, enabling us to facilitate the network training and boost the denoising performance. Extensive experiments show that our PAN-Net achieves superior denoising results against the state-of-the-art in terms of image quality and efficiency.

19 citations
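The PID component PAN-Net borrows from control theory is easy to state on its own. Below is a generic discrete PID controller driving a scalar toward a setpoint; this is not the paper's network, just the standard control rule the paper builds on (the gains here are arbitrary):

```python
class PID:
    """Discrete PID: u_t = Kp*e_t + Ki*sum(e_0..e_t) + Kd*(e_t - e_{t-1})."""
    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0   # running sum of errors (I term)
        self.prev = 0.0       # previous error (for the D term)

    def step(self, error):
        self.integral += error
        u = (self.kp * error
             + self.ki * self.integral
             + self.kd * (error - self.prev))
        self.prev = error
        return u

# Drive a scalar state toward a setpoint of 1.0
pid = PID(kp=0.6, ki=0.1, kd=0.05)
x, target = 0.0, 1.0
for _ in range(50):
    x += pid.step(target - x)
print(round(x, 3))  # ≈ 1.0
```

In the paper's framing, an analogous feedback signal guides the attention network's updates; the exact coupling to the network is specific to PAN-Net and not sketched here.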


Cited by
Journal ArticleDOI
TL;DR: A novel discriminative sparse representation method is proposed, and its noticeable performance in image classification is demonstrated by experimental results; the proposed method outperforms existing state-of-the-art sparse representation methods.
Abstract: Sparse representation has shown an attractive performance in a number of applications. However, the available sparse representation methods still suffer from some problems, and it is necessary to design more efficient methods. In particular, designing a computationally inexpensive, easily solvable, and robust sparse representation method is a significant task. In this paper, we explore the design of simple, robust, and highly efficient sparse representation methods for image classification. The contributions of this paper are as follows. First, a novel discriminative sparse representation method is proposed and its noticeable performance in image classification is demonstrated by the experimental results. More importantly, the proposed method outperforms the existing state-of-the-art sparse representation methods. Second, the proposed method is not only very computationally efficient but also has an intuitive and easily understandable idea. It exploits a simple algorithm to obtain a closed-form solution and discriminative representation of the test sample. Third, the feasibility, computational efficiency, and remarkable classification accuracy of the proposed $l_{2}$ regularization-based representation are comprehensively shown by extensive experiments and analysis. The code of the proposed method is available at http://www.yongxu.org/lunwen.html.

171 citations

Journal ArticleDOI
TL;DR: A softmax regression-based deep sparse autoencoder network (SRDSAN) is proposed to recognize facial emotion in human-robot interaction; it uses softmax regression to handle the large output of deep learning and to overcome local-extrema and gradient-diffusion problems in the training process.

158 citations

Journal ArticleDOI
Rejeesh M R
TL;DR: The performance of the proposed ANFIS-ABC technique is evaluated on the ORL database (400 images of 40 individuals), the YALE-B database (165 images of 15 individuals), and real-time video; the detection and false-alarm rates of the proposed and existing methods are compared to demonstrate the system's efficiency.
Abstract: In this paper, an efficient face recognition method using AGA and ANFIS-ABC is proposed. In the first stage, the face images gathered from the database are preprocessed. In the second stage, interest points are determined to improve the detection rate, with the parameters used in interest-point determination optimized by an adaptive genetic algorithm (AGA). Finally, face images are classified with ANFIS using the extracted features. During the training process, the parameters of ANFIS are optimized using the artificial bee colony (ABC) algorithm in order to improve accuracy. The performance of the proposed ANFIS-ABC technique is evaluated on the ORL database (400 images of 40 individuals), the YALE-B database (165 images of 15 individuals), and real-time video, where the detection and false-alarm rates of the proposed and existing methods are compared to demonstrate the system's efficiency.

151 citations

Journal ArticleDOI
TL;DR: The classification approach of the ADDL model is very efficient, because it avoids the extra time-consuming sparse reconstruction that most existing DL algorithms perform with the trained dictionary for each new test sample.
Abstract: In this paper, we propose an analysis-mechanism-based structured analysis discriminative dictionary learning (ADDL) framework. ADDL seamlessly integrates discriminative dictionary learning, analysis representation, and analysis classifier training into a unified model. The applied analysis mechanism ensures that the learned dictionaries, representations, and linear classifiers over different classes are as independent and discriminating as possible. The dictionary is obtained by minimizing a reconstruction error and an analytical incoherence-promoting term that encourages the subdictionaries associated with different classes to be independent. To obtain the representation coefficients, ADDL imposes a sparse $l_{2,1}$-norm constraint on the coding coefficients instead of using the $l_{0}$ or $l_{1}$ norm, since the $l_{0}$- or $l_{1}$-norm constraint applied in most existing DL criteria makes the training phase time consuming. The code-extraction projection that bridges data with the sparse codes by extracting special features from the given samples is calculated via minimizing a sparse-code approximation term. Then we compute a linear classifier based on the approximated sparse codes by an analysis mechanism to simultaneously consider the classification and representation powers. Thus, the classification approach of our model is very efficient, because it avoids the extra time-consuming sparse reconstruction that most existing DL algorithms perform with the trained dictionary for each new test sample. Simulations on real image databases demonstrate that our ADDL model obtains superior performance over other state-of-the-art methods.

140 citations
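The $l_{2,1}$ norm ADDL relies on can be made concrete: it sums the $l_{2}$ norms of the rows of the coefficient matrix, which pushes entire rows to zero, and its proximal operator is a simple row-wise shrinkage. A small self-contained illustration, independent of the ADDL optimizer itself:

```python
import numpy as np

def l21_norm(S):
    # Sum of the l2 norms of the rows: encourages whole rows of S to vanish
    return float(np.sum(np.linalg.norm(S, axis=1)))

def l21_prox(S, tau):
    # Proximal operator of tau * ||.||_{2,1}: shrink each row toward zero,
    # zeroing rows whose l2 norm is below tau
    norms = np.linalg.norm(S, axis=1, keepdims=True)
    scale = np.maximum(1.0 - tau / np.maximum(norms, 1e-12), 0.0)
    return S * scale

S = np.array([[3.0, 4.0],    # row norm 5
              [0.1, 0.1],    # row norm ~0.141 (killed by tau=0.5)
              [0.0, 2.0]])   # row norm 2
print(l21_norm(S))           # 5 + sqrt(0.02) + 2 ≈ 7.141
print(l21_prox(S, 0.5))      # second row shrinks exactly to zero
```

This row-sparsity is why the abstract contrasts $l_{2,1}$ with $l_{0}$/$l_{1}$: the penalty is convex and its proximal step is cheap and closed-form, so training avoids expensive elementwise sparse coding.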

Journal ArticleDOI
TL;DR: The four Vs of multi-output learning are characterized, i.e., volume, velocity, variety, and veracity, and the ways in which the four Vs both benefit and bring challenges to multi- output learning by taking inspiration from big data are examined.
Abstract: The aim of multi-output learning is to simultaneously predict multiple outputs given an input. It is an important learning problem for decision-making since making decisions in the real world often involves multiple complex factors and criteria. In recent times, an increasing number of research studies have focused on ways to predict multiple outputs at once. Such efforts have transpired in different forms according to the particular multi-output learning problem under study. Classic cases of multi-output learning include multi-label learning, multi-dimensional learning, multi-target regression, and others. From our survey of the topic, we were struck by a lack in studies that generalize the different forms of multi-output learning into a common framework. This article fills that gap with a comprehensive review and analysis of the multi-output learning paradigm. In particular, we characterize the four Vs of multi-output learning, i.e., volume, velocity, variety, and veracity, and the ways in which the four Vs both benefit and bring challenges to multi-output learning by taking inspiration from big data. We analyze the life cycle of output labeling, present the main mathematical definitions of multi-output learning, and examine the field’s key challenges and corresponding solutions as found in the literature. Several model evaluation metrics and popular data repositories are also discussed. Last but not least, we highlight some emerging challenges with multi-output learning from the perspective of the four Vs as potential research directions worthy of further studies.

124 citations