Home
/
Authors
/
Houqiang Li

Author

Houqiang Li

University of Science and Technology of China

Other affiliations: China University of Science and Technology, Nanjing Medical University, Capital Medical University ...read more

Bio: Houqiang Li is an academic researcher from University of Science and Technology of China. The author has contributed to research in topics: Computer science & Motion compensation. The author has an hindex of 57, co-authored 520 publications receiving 12325 citations. Previous affiliations of Houqiang Li include China University of Science and Technology & Nanjing Medical University.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
1993

Papers

PDF

Open Access

More filters

Book Chapter•DOI•

The sixth visual object tracking VOT2018 challenge results

[...]

Matej Kristan¹, Ales Leonardis², Jiří Matas³, Michael Felsberg⁴ +155 more•Institutions (47)

23 Jan 2019

TL;DR: The Visual Object Tracking challenge VOT2018 is the sixth annual tracker benchmarking activity organized by the VOT initiative; results of over eighty trackers are presented; many are state-of-the-art trackers published at major computer vision conferences or in journals in the recent years.

...read moreread less

Abstract: The Visual Object Tracking challenge VOT2018 is the sixth annual tracker benchmarking activity organized by the VOT initiative. Results of over eighty trackers are presented; many are state-of-the-art trackers published at major computer vision conferences or in journals in the recent years. The evaluation included the standard VOT and other popular methodologies for short-term tracking analysis and a “real-time” experiment simulating a situation where a tracker processes images as if provided by a continuously running sensor. A long-term tracking subchallenge has been introduced to the set of standard VOT sub-challenges. The new subchallenge focuses on long-term tracking properties, namely coping with target disappearance and reappearance. A new dataset has been compiled and a performance evaluation methodology that focuses on long-term tracking capabilities has been adopted. The VOT toolkit has been updated to support both standard short-term and the new long-term tracking subchallenges. Performance of the tested trackers typically by far exceeds standard baselines. The source code for most of the trackers is publicly available from the VOT page. The dataset, the evaluation kit and the results are publicly available at the challenge website (http://votchallenge.net).

...read moreread less

639 citations

Proceedings Article•DOI•

Jointly Modeling Embedding and Translation to Bridge Video and Language

[...]

Yingwei Pan¹, Tao Mei², Ting Yao², Houqiang Li¹, Yong Rui² - Show less +1 more•Institutions (2)

University of Science and Technology of China¹, Microsoft²

27 Jun 2016

TL;DR: Liu et al. as discussed by the authors presented a unified framework, named Long Short-Term Memory with visual-semantic Embedding (LSTM-E), which can simultaneously explore the learning of LSTM and visualsemantic embedding.

...read moreread less

Abstract: Automatically describing video content with natural language is a fundamental challenge of computer vision. Re-current Neural Networks (RNNs), which models sequence dynamics, has attracted increasing attention on visual interpretation. However, most existing approaches generate a word locally with the given previous words and the visual content, while the relationship between sentence semantics and visual content is not holistically exploited. As a result, the generated sentences may be contextually correct but the semantics (e.g., subjects, verbs or objects) are not true. This paper presents a novel unified framework, named Long Short-Term Memory with visual-semantic Embedding (LSTM-E), which can simultaneously explore the learning of LSTM and visual-semantic embedding. The former aims to locally maximize the probability of generating the next word given previous words and visual content, while the latter is to create a visual-semantic embedding space for enforcing the relationship between the semantics of the entire sentence and visual content. The experiments on YouTube2Text dataset show that our proposed LSTM-E achieves to-date the best published performance in generating natural sentences: 45.3% and 31.0% in terms of BLEU@4 and METEOR, respectively. Superior performances are also reported on two movie description datasets (M-VAD and MPII-MD). In addition, we demonstrate that LSTM-E outperforms several state-of-the-art techniques in predicting Subject-Verb-Object (SVO) triplets.

...read moreread less

563 citations

Proceedings Article•DOI•

The Visual Object Tracking VOT2017 Challenge Results

[...]

Matej Kristan¹, Ales Leonardis², Jiri Matas³, Michael Felsberg⁴, Roman Pflugfelder⁵, Luka Čehovin Zajc¹, Tomas Vojir³, Gustav Häger⁴, Alan Lukezic¹, Abdelrahman Eldesokey⁴, Gustavo Fernandez⁵, Alvaro Garcia-Martin⁶, Andrej Muhič¹, Alfredo Petrosino⁷, Alireza Memarmoghadam⁸, Andrea Vedaldi⁹, Antoine Manzanera¹⁰, Antoine Tran¹⁰, A. Aydin Alatan¹¹, Bogdan Mocanu, Boyu Chen¹², Chang Huang, Changsheng Xu¹³, Chong Sun¹², Dalong Du, David Zhang, Dawei Du¹³, Deepak Mishra, Erhan Gundogdu¹⁴, Erhan Gundogdu¹¹, Erik Velasco-Salido, Fahad Shahbaz Khan⁴, Francesco Battistone, Gorthi R. K. Sai Subrahmanyam, Goutam Bhat⁴, Guan Huang, Guilherme Sousa Bastos, Guna Seetharaman¹⁵, Hongliang Zhang¹⁶, Houqiang Li¹⁷, Huchuan Lu¹², Isabela Drummond, Jack Valmadre⁹, Jae-chan Jeong¹⁸, Jaeil Cho¹⁸, Jae-Yeong Lee¹⁸, Jana Noskova, Jianke Zhu¹⁹, Jin Gao¹³, Jingyu Liu¹³, Ji-Wan Kim¹⁸, João F. Henriques⁹, José M. Martínez, Junfei Zhuang²⁰, Junliang Xing¹³, Junyu Gao¹³, Kai Chen²¹, Kannappan Palaniappan²², Karel Lebeda, Ke Gao²², Kris M. Kitani²³, Lei Zhang, Lijun Wang¹², Lingxiao Yang, Longyin Wen²⁴, Luca Bertinetto⁹, Mahdieh Poostchi²², Martin Danelljan⁴, Matthias Mueller²⁵, Mengdan Zhang¹³, Ming-Hsuan Yang²⁶, Nianhao Xie¹⁶, Ning Wang¹⁷, Ondrej Miksik⁹, Payman Moallem⁸, Pallavi Venugopal M, Pedro Senna, Philip H. S. Torr⁹, Qiang Wang¹³, Qifeng Yu¹⁶, Qingming Huang¹³, Rafael Martin-Nieto, Richard Bowden²⁷, Risheng Liu¹², Ruxandra Tapu, Simon Hadfield²⁷, Siwei Lyu²⁸, Stuart Golodetz⁹, Sunglok Choi¹⁸, Tianzhu Zhang¹³, Titus Zaharia, Vincenzo Santopietro, Wei Zou¹³, Weiming Hu¹³, Wenbing Tao²¹, Wenbo Li²⁸, Wengang Zhou¹⁷, Xianguo Yu¹⁶, Xiao Bian²⁴, Yang Li¹⁹, Yifan Xing²³, Yingruo Fan²⁰, Zheng Zhu¹³, Zhipeng Zhang¹³, Zhiqun He²⁰ - Show less +101 more•Institutions (28)

University of Ljubljana¹, University of Birmingham², Czech Technical University in Prague³, Linköping University⁴, Austrian Institute of Technology⁵, Autonomous University of Madrid⁶, Parthenope University of Naples⁷, University of Isfahan⁸, University of Oxford⁹, Superior National School of Advanced Techniques¹⁰, Middle East Technical University¹¹, Dalian University of Technology¹², Chinese Academy of Sciences¹³, ASELSAN¹⁴, United States Naval Research Laboratory¹⁵, National University of Defense Technology¹⁶, University of Science and Technology of China¹⁷, Electronics and Telecommunications Research Institute¹⁸, Zhejiang University¹⁹, Beijing University of Posts and Telecommunications²⁰, Huazhong University of Science and Technology²¹, University of Missouri²², Carnegie Mellon University²³, General Electric²⁴, King Abdullah University of Science and Technology²⁵, University of California, Merced²⁶, University of Surrey²⁷, University at Albany, SUNY²⁸

01 Jul 2017

TL;DR: The Visual Object Tracking challenge VOT2017 is the fifth annual tracker benchmarking activity organized by the VOT initiative; results of 51 trackers are presented; many are state-of-the-art published at major computer vision conferences or journals in recent years.

...read moreread less

Abstract: The Visual Object Tracking challenge VOT2017 is the fifth annual tracker benchmarking activity organized by the VOT initiative. Results of 51 trackers are presented; many are state-of-the-art published at major computer vision conferences or journals in recent years. The evaluation included the standard VOT and other popular methodologies and a new "real-time" experiment simulating a situation where a tracker processes images as if provided by a continuously running sensor. Performance of the tested trackers typically by far exceeds standard baselines. The source code for most of the trackers is publicly available from the VOT page. The VOT2017 goes beyond its predecessors by (i) improving the VOT public dataset and introducing a separate VOT2017 sequestered dataset, (ii) introducing a realtime tracking experiment and (iii) releasing a redesigned toolkit that supports complex experiments. The dataset, the evaluation kit and the results are publicly available at the challenge website1.

...read moreread less

485 citations

Posted Content•

Jointly Modeling Embedding and Translation to Bridge Video and Language

[...]

Yingwei Pan¹, Tao Mei², Ting Yao², Houqiang Li¹, Yong Rui² - Show less +1 more•Institutions (2)

University of Science and Technology of China¹, Microsoft²

07 May 2015-arXiv: Computer Vision and Pattern Recognition

TL;DR: A novel unified framework, named Long Short-Term Memory with visual-semantic Embedding (LSTM-E), which can simultaneously explore the learning of LSTM and visual- semantic embedding and outperforms several state-of-the-art techniques in predicting Subject-Verb-Object (SVO) triplets.

...read moreread less

Abstract: Automatically describing video content with natural language is a fundamental challenge of multimedia. Recurrent Neural Networks (RNN), which models sequence dynamics, has attracted increasing attention on visual interpretation. However, most existing approaches generate a word locally with given previous words and the visual content, while the relationship between sentence semantics and visual content is not holistically exploited. As a result, the generated sentences may be contextually correct but the semantics (e.g., subjects, verbs or objects) are not true. This paper presents a novel unified framework, named Long Short-Term Memory with visual-semantic Embedding (LSTM-E), which can simultaneously explore the learning of LSTM and visual-semantic embedding. The former aims to locally maximize the probability of generating the next word given previous words and visual content, while the latter is to create a visual-semantic embedding space for enforcing the relationship between the semantics of the entire sentence and visual content. Our proposed LSTM-E consists of three components: a 2-D and/or 3-D deep convolutional neural networks for learning powerful video representation, a deep RNN for generating sentences, and a joint embedding model for exploring the relationships between visual content and sentence semantics. The experiments on YouTube2Text dataset show that our proposed LSTM-E achieves to-date the best reported performance in generating natural sentences: 45.3% and 31.0% in terms of BLEU@4 and METEOR, respectively. We also demonstrate that LSTM-E is superior in predicting Subject-Verb-Object (SVO) triplets to several state-of-the-art techniques.

...read moreread less

419 citations

Journal Article•DOI•

Nuclei Segmentation Using Marker-Controlled Watershed, Tracking Using Mean-Shift, and Kalman Filter in Time-Lapse Microscopy

[...]

Xiaodong Yang¹, Houqiang Li¹, Xiaobo Zhou²•Institutions (2)

University of Science and Technology of China¹, Brigham and Women's Hospital²

13 Nov 2006-IEEE Transactions on Circuits and Systems

TL;DR: A novel marker-controlled watershed based on mathematical morphology is proposed, which can effectively segment clustered cells with less oversegmentation and design a tracking method based on modified mean shift algorithm, in which several kernels with adaptive scale, shape, and direction are designed.

...read moreread less

Abstract: It is important to observe and study cancer cells' cycle progression in order to better understand drug effects on cancer cells. Time-lapse microscopy imaging serves as an important method to measure the cycle progression of individual cells in a large population. Since manual analysis is unreasonably time consuming for the large volumes of time-lapse image data, automated image analysis is proposed. Existing approaches dealing with time-lapse image data are rather limited and often give inaccurate analysis results, especially in segmenting and tracking individual cells in a cell population. In this paper, we present a new approach to segment and track cell nuclei in time-lapse fluorescence image sequence. First, we propose a novel marker-controlled watershed based on mathematical morphology, which can effectively segment clustered cells with less oversegmentation. To further segment undersegmented cells or to merge oversegmented cells, context information among neighboring frames is employed, which is proved to be an effective strategy. Then, we design a tracking method based on modified mean shift algorithm, in which several kernels with adaptive scale, shape, and direction are designed. Finally, we combine mean-shift and Kalman filter to achieve a more robust cell nuclei tracking method than existing ones. Experimental results show that our method can obtain 98.8% segmentation accuracy, 97.4% cell division tracking accuracy, and 97.6% cell tracking accuracy

...read moreread less

391 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•

“Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告

[...]

杉山拓海

12 Sep 2017-Computers & Graphics

3,940 citations

Proceedings Article•DOI•

Scalable Person Re-identification: A Benchmark

[...]

Liang Zheng¹, Liang Zheng², Liyue Shen², Lu Tian², Shengjin Wang², Jingdong Wang³, Qi Tian² - Show less +3 more•Institutions (3)

University of Texas at San Antonio¹, Tsinghua University², Microsoft³

07 Dec 2015

TL;DR: A minor contribution, inspired by recent advances in large-scale image search, an unsupervised Bag-of-Words descriptor is proposed that yields competitive accuracy on VIPeR, CUHK03, and Market-1501 datasets, and is scalable on the large- scale 500k dataset.

...read moreread less

Abstract: This paper contributes a new high quality dataset for person re-identification, named "Market-1501". Generally, current datasets: 1) are limited in scale, 2) consist of hand-drawn bboxes, which are unavailable under realistic settings, 3) have only one ground truth and one query image for each identity (close environment). To tackle these problems, the proposed Market-1501 dataset is featured in three aspects. First, it contains over 32,000 annotated bboxes, plus a distractor set of over 500K images, making it the largest person re-id dataset to date. Second, images in Market-1501 dataset are produced using the Deformable Part Model (DPM) as pedestrian detector. Third, our dataset is collected in an open system, where each identity has multiple images under each camera. As a minor contribution, inspired by recent advances in large-scale image search, this paper proposes an unsupervised Bag-of-Words descriptor. We view person re-identification as a special task of image search. In experiment, we show that the proposed descriptor yields competitive accuracy on VIPeR, CUHK03, and Market-1501 datasets, and is scalable on the large-scale 500k dataset.

...read moreread less

3,564 citations

Book Chapter•DOI•

Convergence of probability measures

[...]

Richard F. Bass

01 Jan 2011

TL;DR: Weakconvergence methods in metric spaces were studied in this article, with applications sufficient to show their power and utility, and the results of the first three chapters are used in Chapter 4 to derive a variety of limit theorems for dependent sequences of random variables.

...read moreread less

Abstract: The author's preface gives an outline: "This book is about weakconvergence methods in metric spaces, with applications sufficient to show their power and utility. The Introduction motivates the definitions and indicates how the theory will yield solutions to problems arising outside it. Chapter 1 sets out the basic general theorems, which are then specialized in Chapter 2 to the space C[0, l ] of continuous functions on the unit interval and in Chapter 3 to the space D [0, 1 ] of functions with discontinuities of the first kind. The results of the first three chapters are used in Chapter 4 to derive a variety of limit theorems for dependent sequences of random variables. " The book develops and expands on Donsker's 1951 and 1952 papers on the invariance principle and empirical distributions. The basic random variables remain real-valued although, of course, measures on C[0, l ] and D[0, l ] are vitally used. Within this framework, there are various possibilities for a different and apparently better treatment of the material. More of the general theory of weak convergence of probabilities on separable metric spaces would be useful. Metrizability of the convergence is not brought up until late in the Appendix. The close relation of the Prokhorov metric and a metric for convergence in probability is (hence) not mentioned (see V. Strassen, Ann. Math. Statist. 36 (1965), 423-439; the reviewer, ibid. 39 (1968), 1563-1572). This relation would illuminate and organize such results as Theorems 4.1, 4.2 and 4.4 which give isolated, ad hoc connections between weak convergence of measures and nearness in probability. In the middle of p. 16, it should be noted that C*(S) consists of signed measures which need only be finitely additive if 5 is not compact. On p. 239, where the author twice speaks of separable subsets having nonmeasurable cardinal, he means "discrete" rather than "separable." Theorem 1.4 is Ulam's theorem that a Borel probability on a complete separable metric space is tight. Theorem 1 of Appendix 3 weakens completeness to topological completeness. After mentioning that probabilities on the rationals are tight, the author says it is an

...read moreread less

3,554 citations

Journal Article•DOI•

Recent advances in convolutional neural networks

[...]

Jiuxiang Gu¹, Zhenhua Wang¹, Jason Kuen¹, Lianyang Ma¹, Amir Shahroudy¹, Bing Shuai¹, Ting Liu¹, Xingxing Wang¹, Gang Wang¹, Jianfei Cai¹, Tsuhan Chen¹ - Show less +7 more•Institutions (1)

Nanyang Technological University¹

01 May 2018-Pattern Recognition

TL;DR: A broad survey of the recent advances in convolutional neural networks can be found in this article, where the authors discuss the improvements of CNN on different aspects, namely, layer design, activation function, loss function, regularization, optimization and fast computation.

...read moreread less

3,125 citations

The PASCAL Visual Object Classes Challenge

[...]

Jianguo Zhang

01 Jan 2006

3,012 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse