Author

Yanming Guo

National University of Defense Technology

Other affiliations: Leiden University, Zhejiang University

Bio: Yanming Guo is an academic researcher at the National University of Defense Technology. The author has contributed to research on the topics Feature (computer vision) and Convolutional neural network. The author has an h-index of 13 and has co-authored 37 publications receiving 1867 citations. Previous affiliations of Yanming Guo include Leiden University and Zhejiang University.

Papers

Journal Article

Deep learning for visual understanding

Yanming Guo¹, Yu Liu², Ard Oerlemans, Songyang Lao¹, Song Wu², Michael S. Lew²

National University of Defense Technology¹, Leiden University²

26 Apr 2016-Neurocomputing

TL;DR: The state-of-the-art in deep learning algorithms in computer vision is reviewed by highlighting the contributions and challenges from over 210 recent research papers, and the future trends and challenges in designing and training deep neural networks are summarized.

1,733 citations

Journal Article

A review of semantic segmentation using deep neural networks

Yanming Guo¹, Yu Liu¹, Theodoros Georgiou¹, Michael S. Lew¹

Leiden University¹

01 Jun 2018-International Journal of Multimedia Information Retrieval

TL;DR: The field of semantic segmentation as pertaining to deep convolutional neural networks is reviewed, comprehensive coverage of the top approaches is provided, and the strengths, weaknesses and major challenges are summarized.

Abstract: During the long history of computer vision, one of the grand challenges has been semantic segmentation, which is the ability to segment an unknown image into different parts and objects (e.g., beach, ocean, sun, dog, swimmer). Furthermore, segmentation is even deeper than object recognition because recognition is not necessary for segmentation. Specifically, humans can perform image segmentation without even knowing what the objects are (for example, in satellite imagery or medical X-ray scans, there may be several objects which are unknown, but they can still be segmented within the image, typically for further investigation). Performing segmentation without knowing the exact identity of all objects in the scene is an important part of our visual understanding process, which can give us a powerful model to understand the world and can also be used to improve or augment existing computer vision techniques. In this work, we review the field of semantic segmentation as pertaining to deep convolutional neural networks. We provide comprehensive coverage of the top approaches and summarize the strengths, weaknesses and major challenges.

451 citations

Proceedings Article

Learning a Recurrent Residual Fusion Network for Multimodal Matching

Yu Liu¹, Yanming Guo², Erwin M. Bakker¹, Michael S. Lew¹

Leiden University¹, National University of Defense Technology²

01 Oct 2017

TL;DR: This work introduces a novel bridge between the modality-specific representations by creating a co-embedding space based on a recurrent residual fusion (RRF) block that adapts the recurrent mechanism to residual learning, so that it can recursively improve feature embeddings while retaining the shared parameters.

Abstract: A major challenge in matching between vision and language is that they typically have completely different features and representations. In this work, we introduce a novel bridge between the modality-specific representations by creating a co-embedding space based on a recurrent residual fusion (RRF) block. Specifically, RRF adapts the recurrent mechanism to residual learning, so that it can recursively improve feature embeddings while retaining the shared parameters. Then, a fusion module is used to integrate the intermediate recurrent outputs and generate a more powerful representation. In the matching network, RRF acts as a feature enhancement component to gather visual and textual representations into a more discriminative embedding space, which narrows the cross-modal gap between vision and language. Moreover, we employ a bi-rank loss function to enforce separability of the two modalities in the embedding space. In the experiments, we evaluate the proposed RRF-Net using two multi-modal datasets, where it achieves state-of-the-art results.

148 citations

Journal Article

CNN-RNN: a large-scale hierarchical image classification framework

Yanming Guo, Yu Liu, Erwin M. Bakker, Yuanhao Guo, Michael S. Lew

01 Apr 2018-Multimedia Tools and Applications

TL;DR: A high-performance network based on the CNN-RNN paradigm is built which outperforms the original CNN and also the current state-of-the-art; the framework can be built on top of any CNN architecture that is primarily designed for leaf-level classification.

Abstract: Objects are often organized in a semantic hierarchy of categories, where fine-level categories are grouped into coarse-level categories according to their semantic relations. While previous works usually only classify objects into the leaf categories, we argue that generating hierarchical labels can actually describe how the leaf categories evolved from higher-level coarse-grained categories, and thus can provide a better understanding of the objects. In this paper, we propose to utilize the CNN-RNN framework to address the hierarchical image classification task. CNN allows us to obtain discriminative features for the input images, and RNN enables us to jointly optimize the classification of coarse and fine labels. This framework can not only generate hierarchical labels for images, but also improve the traditional leaf-level classification performance by incorporating the hierarchical information. Moreover, this framework can be built on top of any CNN architecture which is primarily designed for leaf-level classification. Accordingly, we build a high-performance network based on the CNN-RNN paradigm which outperforms the original CNN (wider-ResNet) and also the current state-of-the-art. In addition, we investigate how to utilize the CNN-RNN framework to improve the fine category classification when a fraction of the training data is only annotated with coarse labels. Experimental results demonstrate that CNN-RNN can use the coarse-labeled training data to improve the classification of fine categories, and in some cases it even surpasses the performance achieved by fully annotated training data. This reveals that CNN-RNN can alleviate the challenge of specialized and expensive annotation of fine labels.

102 citations

Journal Article

Colloidal chemically fabricated ZnO : Cu-based photodetector with extended UV-visible detection waveband

Liang Hu¹, Liping Zhu¹, Haiping He¹, Yanming Guo¹, Guoyao Pan¹, Jie Jiang¹, Yizheng Jin¹, Luwei Sun¹, Zhizhen Ye¹

Zhejiang University¹

27 Sep 2013-Nanoscale

TL;DR: Polycrystalline ZnO : Cu-based film photodetectors with extended detection waveband (UV and visible light) were fabricated using facile colloidal chemistry and a post-annealing process, and a space charge limited (SCL) transport mechanism is proposed to explain their complex photoconduction behaviour.

Abstract: Polycrystalline ZnO : Cu-based film photodetectors with extended detection waveband (UV and visible light) were fabricated using facile colloidal chemistry and a post-annealing process. The obtained detectors are highly sensitive to visible light and can realize the response switch between UV and visible light. A native and extrinsic trap cooperatively controlled space charge limited (SCL) transport mechanism is proposed to understand this complex photoconduction behaviour.

51 citations

Cited by

Journal Article

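Early results of coronary artery bypass grafting using only bilateral internal thoracic arteries with a Y-anastomosis in patients with multivessel coronary artery disease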

성기익, 이영탁, 박계현, 전태국, 박표원, 한일용, 장윤희

01 Mar 2003-The Korean Journal of Thoracic and Cardiovascular Surgery

28,685 citations

[Constellation of New Books x] Us/Art, and the ‘Museum of Sorrow’

이화영

01 Jan 2015

12,972 citations

Journal Article

Study report on “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”

杉山拓海

12 Sep 2017-Computers & Graphics

3,940 citations

Journal Article

Deep convolutional neural networks for image classification: A comprehensive review

Waseem Rawat¹, Zenghui Wang¹

University of South Africa¹

01 Sep 2017-Neural Computation

TL;DR: This review, which focuses on the application of CNNs to image classification tasks, covers their development, from their predecessors up to recent state-of-the-art deep learning systems.

Abstract: Convolutional neural networks (CNNs) have been applied to visual tasks since the late 1980s. However, despite a few scattered applications, they were dormant until the mid-2000s when developments in computing power and the advent of large amounts of labeled data, supplemented by improved algorithms, contributed to their advancement and brought them to the forefront of a neural network renaissance that has seen rapid progression since 2012. In this review, which focuses on the application of CNNs to image classification tasks, we cover their development, from their predecessors up to recent state-of-the-art deep learning systems. Along the way, we analyze (1) their early successes, (2) their role in the deep learning renaissance, (3) selected symbolic works that have contributed to their recent popularity, and (4) several improvement attempts by reviewing contributions and challenges of over 300 publications. We also introduce some of their current trends and remaining challenges.

2,366 citations

Proceedings Article

Understanding of a convolutional neural network

Saad Albawi¹, Tareq Abed Mohammed¹, Saad Al-Zawi

Istanbul Kemerburgaz University¹

01 Aug 2017

TL;DR: All the elements and important issues related to CNN, and how these elements work, are explained and defined, and the parameters that affect CNN efficiency are stated.

Abstract: The term Deep Learning or Deep Neural Network refers to Artificial Neural Networks (ANN) with multiple layers. Over the last few decades, it has been considered to be one of the most powerful tools, and has become very popular in the literature as it is able to handle a huge amount of data. Networks with deeper hidden layers have recently begun to surpass the performance of classical methods in different fields, especially in pattern recognition. One of the most popular deep neural networks is the Convolutional Neural Network (CNN). It takes its name from a mathematical linear operation between matrices called convolution. CNNs have multiple layers, including the convolutional layer, non-linearity layer, pooling layer and fully-connected layer. The convolutional and fully-connected layers have parameters, but the pooling and non-linearity layers do not. The CNN has an excellent performance in machine learning problems, especially in applications that deal with image data, such as the largest image classification data set (ImageNet), computer vision, and natural language processing (NLP), where the results achieved were remarkable. In this paper we will explain and define all the elements and important issues related to CNN, and how these elements work. In addition, we will also state the parameters that affect CNN efficiency. This paper assumes that the readers have adequate knowledge about both machine learning and artificial neural networks.

2,338 citations
