Home
/
Authors
/
Hao Liu

Author

Hao Liu

Other affiliations: Tencent

Bio: Hao Liu is an academic researcher from Hefei University of Technology. The author has contributed to research in topics: Context (language use) & Metric (mathematics). The author has an hindex of 11, co-authored 23 publications receiving 1076 citations. Previous affiliations of Hao Liu include Tencent.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

End-to-End Comparative Attention Networks for Person Re-Identification

[...]

Hao Liu¹, Jiashi Feng², Meibin Qi¹, Jianguo Jiang¹, Shuicheng Yan - Show less +1 more•Institutions (2)

Hefei University of Technology¹, National University of Singapore²

03 May 2017-IEEE Transactions on Image Processing

TL;DR: This paper proposes a new soft attention-based model, i.e., the end-to-end comparative attention network (CAN), specifically tailored for the task of person re-identification that outperforms well established baselines significantly and offers the new state-of-the-art performance.

...read moreread less

Abstract: Person re-identification across disjoint camera views has been widely applied in video surveillance yet it is still a challenging problem. One of the major challenges lies in the lack of spatial and temporal cues, which makes it difficult to deal with large variations of lighting conditions, viewing angles, body poses, and occlusions. Recently, several deep-learning-based person re-identification approaches have been proposed and achieved remarkable performance. However, most of those approaches extract discriminative features from the whole frame at one glimpse without differentiating various parts of the persons to identify. It is essentially important to examine multiple highly discriminative local regions of the person images in details through multiple glimpses for dealing with the large appearance variance. In this paper, we propose a new soft attention-based model, i.e. , the end-to-end comparative attention network (CAN), specifically tailored for the task of person re-identification. The end-to-end CAN learns to selectively focus on parts of pairs of person images after taking a few glimpses of them and adaptively comparing their appearance. The CAN model is able to learn which parts of images are relevant for discerning persons and automatically integrates information from different parts to determine whether a pair of images belongs to the same person. In other words, our proposed CAN model simulates the human perception process to verify whether two images are from the same person. Extensive experiments on four benchmark person re-identification data sets, including CUHK01, CHUHK03, Market-1501, and VIPeR, clearly demonstrate that our proposed end-to-end CAN for person re-identification outperforms well established baselines significantly and offer the new state-of-the-art performance.

...read moreread less

610 citations

Proceedings Article•DOI•

Neural Person Search Machines

[...]

Hao Liu¹, Jiashi Feng², Zequn Jie², Karlekar Jayashree³, Bo Zhao⁴, Meibin Qi¹, Jianguo Jiang¹, Shuicheng Yan² - Show less +4 more•Institutions (4)

Hefei University of Technology¹, National University of Singapore², Panasonic³, University of Texas at Austin⁴

01 Oct 2017

TL;DR: Zhang et al. as mentioned in this paper proposed to recursively shrink the search area from the whole image till achieving precise localization of the target person, by fully exploiting information from the query and contextual cues in every recursive search step.

...read moreread less

Abstract: We investigate the problem of person search in the wild in this work. Instead of comparing the query against all candidate regions generated in a query-blind manner, we propose to recursively shrink the search area from the whole image till achieving precise localization of the target person, by fully exploiting information from the query and contextual cues in every recursive search step. We develop the Neural Person Search Machines (NPSM) to implement such recursive localization for person search. Benefiting from its neural search mechanism, NPSM is able to selectively shrink its focus from a loose region to a tighter one containing the target automatically. In this process, NPSM employs an internal primitive memory component to memorize the query representation which modulates the attention and augments its robustness to other distracting regions. Evaluations on two benchmark datasets, CUHK-SYSU Person Search dataset and PRW dataset, have demonstrated that our method can outperform current state-of-the-arts in both mAP and top-1 evaluation protocols.

...read moreread less

144 citations

Journal Article•DOI•

Video-Based Person Re-Identification With Accumulative Motion Context

[...]

Hao Liu¹, Zequn Jie², Karlekar Jayashree³, Meibin Qi¹, Jianguo Jiang¹, Shuicheng Yan⁴, Jiashi Feng⁴ - Show less +3 more•Institutions (4)

Hefei University of Technology¹, Tencent², Panasonic³, National University of Singapore⁴

01 Oct 2018-IEEE Transactions on Circuits and Systems for Video Technology

TL;DR: Wang et al. as discussed by the authors proposed an accumulative motion context (AMOC) network for video-based person re-identification, which jointly learns appearance representation and motion context from a collection of adjacent frames using a two-stream convolutional architecture.

...read moreread less

Abstract: Video-based person re-identification plays a central role in realistic security and video surveillance. In this paper, we propose a novel accumulative motion context (AMOC) network for addressing this important problem, which effectively exploits the long-range motion context for robustly identifying the same person under challenging conditions. Given a video sequence of the same or different persons, the proposed AMOC network jointly learns appearance representation and motion context from a collection of adjacent frames using a two-stream convolutional architecture. Then, AMOC accumulates clues from motion context by recurrent aggregation, allowing effective information flow among adjacent frames and capturing dynamic gist of the persons. The architecture of AMOC is end-to-end trainable, and thus, motion context can be adapted to complement appearance clues under unfavorable conditions ( e.g. , occlusions). Extensive experiments are conduced on three public benchmark data sets, i.e. , the iLIDS-VID, PRID-2011, and MARS data sets, to investigate the performance of AMOC. The experimental results demonstrate that the proposed AMOC network outperforms state-of-the-arts for video-based re-identification significantly and confirm the advantage of exploiting long-range motion context for video-based person re-identification, validating our motivation evidently.

...read moreread less

141 citations

Proceedings Article•DOI•

Multi-View Image Generation from a Single-View

[...]

Bo Zhao¹, Xiao Wu², Zhi-Qi Cheng², Hao Liu³, Zequn Jie³, Jiashi Feng¹ - Show less +2 more•Institutions (3)

National University of Singapore¹, Southwest Jiaotong University², Tencent³

15 Oct 2018

TL;DR: This paper proposes a novel image generation model termed VariGANs, which combines the merits of the variational inference and the Generative Adversarial Networks (GANs), and generates the target image in a coarse-to-fine manner instead of a single pass which suffers from severe artifacts.

...read moreread less

Abstract: How to generate multi-view images with realistic-looking appearance from only a single view input is a challenging problem. In this paper, we attack this problem by proposing a novel image generation model termed VariGANs, which combines the merits of the variational inference and the Generative Adversarial Networks (GANs). It generates the target image in a coarse-to-fine manner instead of a single pass which suffers from severe artifacts. It first performs variational inference to model global appearance of the object (e.g., shape and color) and produces coarse images of different views. Conditioned on the generated coarse images, it then performs adversarial learning to fill details consistent with the input and generate the fine images. Extensive experiments conducted on two clothing datasets, MVC and DeepFashion, have demonstrated that the generated images with the proposed VariGANs are more plausible than those generated by existing approaches, which provide more consistent global appearance as well as richer and sharper details.

...read moreread less

115 citations

Posted Content•

Neural Person Search Machines

[...]

Hao Liu¹, Jiashi Feng², Zequn Jie², Karlekar Jayashree³, Bo Zhao⁴, Meibin Qi¹, Jianguo Jiang¹, Shuicheng Yan² - Show less +4 more•Institutions (4)

Hefei University of Technology¹, National University of Singapore², Panasonic³, University of Texas at Austin⁴

21 Jul 2017-arXiv: Computer Vision and Pattern Recognition

TL;DR: Evaluations on two benchmark datasets have demonstrated that the Neural Person Search Machines (NPSM) developed can outperform current state-of-the-arts in both mAP and top-1 evaluation protocols.

...read moreread less

85 citations

1
2
3
4
…
5
6

Collapse

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in Vitro

[...]

Zhedong Zheng¹, Liang Zheng¹, Yi Yang¹•Institutions (1)

University of Technology, Sydney¹

26 Jan 2017

TL;DR: A simple semisupervised pipeline that only uses the original training set without collecting extra data, which effectively improves the discriminative ability of learned CNN embeddings and proposes the label smoothing regularization for outliers (LSRO).

...read moreread less

Abstract: The main contribution of this paper is a simple semisupervised pipeline that only uses the original training set without collecting extra data. It is challenging in 1) how to obtain more training data only from the training set and 2) how to use the newly generated data. In this work, the generative adversarial network (GAN) is used to generate unlabeled samples. We propose the label smoothing regularization for outliers (LSRO). This method assigns a uniform label distribution to the unlabeled images, which regularizes the supervised model and improves the baseline. We verify the proposed method on a practical problem: person re-identification (re-ID). This task aims to retrieve a query person from other cameras. We adopt the deep convolutional generative adversarial network (DCGAN) for sample generation, and a baseline convolutional neural network (CNN) for representation learning. Experiments show that adding the GAN-generated data effectively improves the discriminative ability of learned CNN embeddings. On three large-scale datasets, Market- 1501, CUHK03 and DukeMTMC-reID, we obtain +4.37%, +1.6% and +2.46% improvement in rank-1 precision over the baseline CNN, respectively. We additionally apply the proposed method to fine-grained bird recognition and achieve a +0.6% improvement over a strong baseline. The code is available at https://github.com/layumi/ Person-reID_GAN.

...read moreread less

1,789 citations

Proceedings Article•DOI•

Re-ranking Person Re-identification with k-Reciprocal Encoding

[...]

Zhun Zhong¹, Liang Zheng², Donglin Cao¹, Shaozi Li¹•Institutions (2)

Xiamen University¹, University of Technology, Sydney²

21 Jul 2017

TL;DR: This paper proposes a k-reciprocal encoding method to re-rank the re-ID results, and hypothesis is that if a gallery image is similar to the probe in the k- Reciprocal nearest neighbors, it is more likely to be a true match.

...read moreread less

Abstract: When considering person re-identification (re-ID) as a retrieval process, re-ranking is a critical step to improve its accuracy. Yet in the re-ID community, limited effort has been devoted to re-ranking, especially those fully automatic, unsupervised solutions. In this paper, we propose a k-reciprocal encoding method to re-rank the re-ID results. Our hypothesis is that if a gallery image is similar to the probe in the k-reciprocal nearest neighbors, it is more likely to be a true match. Specifically, given an image, a k-reciprocal feature is calculated by encoding its k-reciprocal nearest neighbors into a single vector, which is used for re-ranking under the Jaccard distance. The final distance is computed as the combination of the original distance and the Jaccard distance. Our re-ranking method does not require any human interaction or any labeled data, so it is applicable to large-scale datasets. Experiments on the large-scale Market-1501, CUHK03, MARS, and PRW datasets confirm the effectiveness of our method.

...read moreread less

1,306 citations

Proceedings Article•DOI•

Learning Discriminative Features with Multiple Granularities for Person Re-Identification

[...]

Guanshuo Wang¹, Yufeng Yuan, Xiong Chen, Jiwei Li, Xi Zhou¹ - Show less +1 more•Institutions (1)

Shanghai Jiao Tong University¹

15 Oct 2018

TL;DR: Comprehensive experiments implemented on the mainstream evaluation datasets including Market-1501, DukeMTMC-reid and CUHK03 indicate that the proposed end-to-end feature learning strategy robustly achieves state-of-the-art performances and outperforms any existing approaches by a large margin.

...read moreread less

Abstract: The combination of global and partial features has been an essential solution to improve discriminative performances in person re-identification (Re-ID) tasks. Previous part-based methods mainly focus on locating regions with specific pre-defined semantics to learn local representations, which increases learning difficulty but not efficient or robust to scenarios with large variances. In this paper, we propose an end-to-end feature learning strategy integrating discriminative information with various granularities. We carefully design the Multiple Granularity Network (MGN), a multi-branch deep network architecture consisting of one branch for global feature representations and two branches for local feature representations. Instead of learning on semantic regions, we uniformly partition the images into several stripes, and vary the number of parts in different local branches to obtain local feature representations with multiple granularities. Comprehensive experiments implemented on the mainstream evaluation datasets including Market-1501, DukeMTMC-reid and CUHK03 indicate that our method robustly achieves state-of-the-art performances and outperforms any existing approaches by a large margin. For example, on Market-1501 dataset in single query mode, we obtain a top result of Rank-1/mAP=96.6%/94.2% with this method after re-ranking.

...read moreread less

1,050 citations

Proceedings Article•DOI•

Harmonious Attention Network for Person Re-identification

[...]

Wei Li¹, Xiatian Zhu, Shaogang Gong¹•Institutions (1)

Queen Mary University of London¹

22 Feb 2018

TL;DR: A novel Harmonious Attention CNN (HA-CNN) model is formulated for joint learning of soft pixel attention and hard regional attention along with simultaneous optimisation of feature representations, dedicated to optimise person re-id in uncontrolled (misaligned) images.

...read moreread less

Abstract: Existing person re-identification (re-id) methods either assume the availability of well-aligned person bounding box images as model input or rely on constrained attention selection mechanisms to calibrate misaligned images. They are therefore sub-optimal for re-id matching in arbitrarily aligned person images potentially with large human pose variations and unconstrained auto-detection errors. In this work, we show the advantages of jointly learning attention selection and feature representation in a Convolutional Neural Network (CNN) by maximising the complementary information of different levels of visual attention subject to re-id discriminative learning constraints. Specifically, we formulate a novel Harmonious Attention CNN (HA-CNN) model for joint learning of soft pixel attention and hard regional attention along with simultaneous optimisation of feature representations, dedicated to optimise person re-id in uncontrolled (misaligned) images. Extensive comparative evaluations validate the superiority of this new HA-CNN model for person re-id over a wide variety of state-of-the-art methods on three large-scale benchmarks including CUHK03, Market-1501, and DukeMTMC-ReID.

...read moreread less

1,020 citations

Posted Content•

Person Re-identification: Past, Present and Future

[...]

Liang Zheng, Yi Yang, Alexander G. Hauptmann

10 Oct 2016-arXiv: Computer Vision and Pattern Recognition

TL;DR: The history of person re-identification and its relationship with image classification and instance retrieval is introduced and two new re-ID tasks which are much closer to real-world applications are described and discussed.

...read moreread less

Abstract: Person re-identification (re-ID) has become increasingly popular in the community due to its application and research significance. It aims at spotting a person of interest in other cameras. In the early days, hand-crafted algorithms and small-scale evaluation were predominantly reported. Recent years have witnessed the emergence of large-scale datasets and deep learning systems which make use of large data volumes. Considering different tasks, we classify most current re-ID methods into two classes, i.e., image-based and video-based; in both tasks, hand-crafted and deep learning systems will be reviewed. Moreover, two new re-ID tasks which are much closer to real-world applications are described and discussed, i.e., end-to-end re-ID and fast re-ID in very large galleries. This paper: 1) introduces the history of person re-ID and its relationship with image classification and instance retrieval; 2) surveys a broad selection of the hand-crafted systems and the large-scale methods in both image- and video-based re-ID; 3) describes critical future directions in end-to-end re-ID and fast retrieval in large galleries; and 4) finally briefs some important yet under-developed issues.

...read moreread less

984 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse