Home
/
Authors
/
Liang Li

Author

Liang Li

Other affiliations: Chinese Academy of Sciences, Beijing Normal University

Bio: Liang Li is an academic researcher from Tsinghua University. The author has contributed to research in topics: Feature (computer vision) & Image retrieval. The author has an hindex of 23, co-authored 125 publications receiving 1864 citations. Previous affiliations of Liang Li include Chinese Academy of Sciences & Beijing Normal University.

Papers published on a yearly basis

2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2008
2007
2006
2005
2004

Papers

PDF

Open Access

More filters

Journal Article•DOI•

A Highly Parallel Framework for HEVC Coding Unit Partitioning Tree Decision on Many-core Processors

[...]

Chenggang Yan¹, Yongdong Zhang¹, Xu Jizheng², Feng Dai¹, Liang Li¹, Qionghai Dai, Feng Wu² - Show less +3 more•Institutions (2)

Tsinghua University¹, Microsoft²

11 Mar 2014-IEEE Signal Processing Letters

TL;DR: This paper proposes a parallel framework to decide coding unit trees through in-depth understanding of the dependency among different coding units, and achieves averagely more than 11 and 16 times speedup for 1920x1080 and 2560x1600 video sequences, respectively, without any coding efficiency degradation.

...read moreread less

Abstract: High Efficiency Video Coding (HEVC) uses a very flexible tree structure to organize coding units, which leads to a superior coding efficiency compared with previous video coding standards. However, such a flexible coding unit tree structure also places a great challenge for encoders. In order to fully exploit the coding efficiency brought by this structure, huge amount of computational complexity is needed for an encoder to decide the optimal coding unit tree for each image block. One way to achieve this is to use parallel computing enabled by many-core processors. In this paper, we analyze the challenge to use many-core processors to make coding unit tree decision. Through in-depth understanding of the dependency among different coding units, we propose a parallel framework to decide coding unit trees. Experimental results show that, on the Tile64 platform, our proposed method achieves averagely more than 11 and 16 times speedup for 1920x1080 and 2560x1600 video sequences, respectively, without any coding efficiency degradation.

...read moreread less

342 citations

Proceedings Article•DOI•

Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations

[...]

Shuhao Cui¹, Shuhui Wang¹, Junbao Zhuo¹, Liang Li¹, Qingming Huang¹, Qi Tian² - Show less +2 more•Institutions (2)

Chinese Academy of Sciences¹, Huawei²

14 Jun 2020

TL;DR: To improve both discriminability and diversity, the proposed Batch Nuclear-norm Maximization (BNM) on the output matrix could boost the learning under typical label insufficient learning scenarios, such as semi-supervised learning, domain adaptation and open domain recognition.

...read moreread less

Abstract: The learning of the deep networks largely relies on the data with human-annotated labels. In some label insufficient situations, the performance degrades on the decision boundary with high data density. A common solution is to directly minimize the Shannon Entropy, but the side effect caused by entropy minimization, \it i.e., reduction of the prediction diversity, is mostly ignored. To address this issue, we reinvestigate the structure of classification output matrix of a randomly selected data batch. We find by theoretical analysis that the prediction discriminability and diversity could be separately measured by the Frobenius-norm and rank of the batch output matrix. Besides, the nuclear-norm is an upperbound of the Frobenius-norm, and a convex approximation of the matrix rank. Accordingly, to improve both discriminability and diversity, we propose Batch Nuclear-norm Maximization (BNM) on the output matrix. BNM could boost the learning under typical label insufficient learning scenarios, such as semi-supervised learning, domain adaptation and open domain recognition. On these tasks, extensive experimental results show that BNM outperforms competitors and works well with existing well-known methods. The code is available at https://github.com/cuishuhao/BNM

...read moreread less

205 citations

Journal Article•DOI•

Cross-Modality Bridging and Knowledge Transferring for Image Understanding

[...]

Chenggang Yan¹, Liang Li², Chunjie Zhang², Bingtao Liu¹, Yongdong Zhang², Qionghai Dai³ - Show less +2 more•Institutions (3)

Hangzhou Dianzi University¹, Chinese Academy of Sciences², Tsinghua University³

07 Mar 2019-IEEE Transactions on Multimedia

TL;DR: This work learns a cross-modality bridging dictionary for the deep and complete understanding of a vast quantity of web images and proposes a knowledge-based concept transferring algorithm to discover the underlying relations of different categories.

...read moreread less

Abstract: The understanding of web images has been a hot research topic in both artificial intelligence and multimedia content analysis domains. The web images are composed of various complex foregrounds and backgrounds, which makes the design of an accurate and robust learning algorithm a challenging task. To solve the above significant problem, first, we learn a cross-modality bridging dictionary for the deep and complete understanding of a vast quantity of web images. The proposed algorithm leverages the visual features into the semantic concept probability distribution, which can construct a global semantic description for images while preserving the local geometric structure. To discover and model the occurrence patterns between intra- and inter-categories, multi-task learning is introduced for formulating the objective formulation with Capped- $\ell _{1}$ penalty, which can obtain the optimal solution with a higher probability and outperform the traditional convex function-based methods. Second, we propose a knowledge-based concept transferring algorithm to discover the underlying relations of different categories. This distribution probability transferring among categories can bring the more robust global feature representation, and enable the image semantic representation to generalize better as the scenario becomes larger. Experimental comparisons and performance discussion with classical methods on the ImageNet, Caltech-256, SUN397, and Scene15 datasets show the effectiveness of our proposed method at three traditional image understanding tasks.

...read moreread less

169 citations

Journal Article•DOI•

Task-Adaptive Attention for Image Captioning

[...]

Chenggang Yan¹, Yiming Hao², Liang Li³, Jian Yin², An-An Liu⁴, Zhendong Mao⁵, Zhenyu Chen⁶, Xingyu Gao³ - Show less +4 more•Institutions (6)

Hangzhou Dianzi University¹, Shandong University², Chinese Academy of Sciences³, Tianjin University⁴, University of Science and Technology of China⁵, State Grid Corporation of China⁶

19 Mar 2021-IEEE Transactions on Circuits and Systems for Video Technology

TL;DR: Extensive experiments on the MSCOCO captioning dataset demonstrate that by plugging the Task-Adaptive Attention module into a vanilla Transformer-based image captioning model, performance improvement can be achieved.

...read moreread less

Abstract: Attention mechanisms are now widely used in image captioning models. However, most attention models only focus on visual features. When generating syntax related words, little visual information is needed. In this case, these attention models could mislead the word generation. In this paper, we propose Task-Adaptive Attention module for image captioning, which can alleviate this misleading problem and learn implicit non-visual clues which can be helpful for the generation of non-visual words. We further introduce a diversity regularization to enhance the expression ability of the Task-Adaptive Attention module. Extensive experiments on the MSCOCO captioning dataset demonstrate that by plugging our Task-Adaptive Attention module into a vanilla Transformer-based image captioning model, performance improvement can be achieved.

...read moreread less

118 citations

Journal Article•DOI•

Parallel deblocking filter for HEVC on many-core processor

[...]

Chenggang Yan, Yongdong Zhang, Feng Dai, Xi Wang, Liang Li, Qionghai Dai¹ - Show less +2 more•Institutions (1)

Tsinghua University¹

01 Feb 2014-Electronics Letters

TL;DR: Experiments show that the proposed three-step parallel framework for HEVC dramatically accelerates more than the state-of-the-art parallel method.

...read moreread less

Abstract: High-efficiency video coding (HEVC) is the next generation standard of video coding. The deblocking filter (DF) constitutes a significant part of the HEVC decoder complexity. A three-step parallel framework (TPF) is proposed for the H.264/AVC DF, which is also suitable for HEVC except the third step. The third step of the TPF is replaced with a directed acyclic graph-based order. Experiments show that the proposed method dramatically accelerates more than the state-of-the-art parallel method.

...read moreread less

118 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26

Collapse

Cited by

PDF

Open Access

More filters

Convex Analysisの二,三の進展について

[...]

徹丸山

01 Feb 1977

5,933 citations

Journal Article•

“Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告

[...]

杉山拓海

12 Sep 2017-Computers & Graphics

3,940 citations

Journal Article•DOI•

Salient Object Detection: A Benchmark

[...]

Ali Borji¹, Ming-Ming Cheng², Huaizu Jiang³, Jia Li⁴•Institutions (4)

University of Wisconsin–Milwaukee¹, University of Oxford², University of Massachusetts Amherst³, Beihang University⁴

07 Oct 2015-IEEE Transactions on Image Processing

TL;DR: It is found that the models designed specifically for salient object detection generally work better than models in closely related areas, which provides a precise definition and suggests an appropriate treatment of this problem that distinguishes it from other problems.

...read moreread less

Abstract: We extensively compare, qualitatively and quantitatively, 41 state-of-the-art models (29 salient object detection, 10 fixation prediction, 1 objectness, and 1 baseline) over seven challenging data sets for the purpose of benchmarking salient object detection and segmentation methods. From the results obtained so far, our evaluation shows a consistent rapid progress over the last few years in terms of both accuracy and running time. The top contenders in this benchmark significantly outperform the models identified as the best in the previous benchmark conducted three years ago. We find that the models designed specifically for salient object detection generally work better than models in closely related areas, which in turn provides a precise definition and suggests an appropriate treatment of this problem that distinguishes it from other problems. In particular, we analyze the influences of center bias and scene complexity in model performance, which, along with the hard cases for the state-of-the-art models, provide useful hints toward constructing more challenging large-scale data sets and better saliency models. Finally, we propose probable solutions for tackling several open problems, such as evaluation scores and data set bias, which also suggest future research directions in the rapidly growing field of salient object detection.

...read moreread less

1,372 citations

Book•

Convex Analysis: (pms-28)

[...]

Ralph Tyrell Rockafellar

21 Feb 1970

986 citations

Journal Article•DOI•

Deep learning for sentiment analysis: A survey

[...]

Lei Zhang¹, Shuai Wang², Bing Liu²•Institutions (2)

LinkedIn¹, University of Illinois at Urbana–Champaign²

01 Jul 2018-Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery

TL;DR: Deep learning has emerged as a powerful machine learning technique that learns multiple layers of representations or features of the data and produces state-of-the-art prediction results as mentioned in this paper, which is also popularly used in sentiment analysis in recent years.

...read moreread less

Abstract: Deep learning has emerged as a powerful machine learning technique that learns multiple layers of representations or features of the data and produces state-of-the-art prediction results. Along with the success of deep learning in many other application domains, deep learning is also popularly used in sentiment analysis in recent years. This paper first gives an overview of deep learning and then provides a comprehensive survey of its current applications in sentiment analysis.

...read moreread less

917 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse