Author

Andrey Ignatov

Bio: Andrey Ignatov is an academic researcher from ETH Zurich. He has contributed to research on topics including mobile devices and computer science, has an h-index of 21, and has co-authored 47 publications receiving 1,935 citations. His previous affiliations include the Moscow Institute of Physics and Technology and École Polytechnique Fédérale de Lausanne.

Papers published on a yearly basis

Papers
Journal ArticleDOI
01 Jan 2018
TL;DR: A user-independent deep learning-based approach for online human activity classification is presented, using Convolutional Neural Networks for local feature extraction together with simple statistical features that preserve information about the global form of the time series.
Abstract: With the proliferation of sensors embedded in mobile devices, the analysis of human daily activities has become more common and straightforward. This task now arises in a range of applications such as healthcare monitoring, fitness tracking or user-adaptive systems, where a general model capable of instantaneous activity recognition of an arbitrary user is needed. In this paper, we present a user-independent deep learning-based approach for online human activity classification. We propose using Convolutional Neural Networks for local feature extraction together with simple statistical features that preserve information about the global form of the time series. Furthermore, we investigate the impact of time series length on the recognition accuracy and limit it to 1 s, which makes continuous real-time activity classification possible. The accuracy of the proposed approach is evaluated on the two commonly used WISDM and UCI datasets, which contain labeled accelerometer data from 36 and 30 users respectively, as well as in a cross-dataset experiment. The results show that the proposed model demonstrates state-of-the-art performance while requiring low computational cost and no manual feature engineering.

555 citations
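
A minimal sketch of the approach described above, assuming PyTorch; the window length, sampling rate, layer sizes and the particular statistics are illustrative assumptions, not the paper's exact configuration. A 1-D CNN extracts local features from a short accelerometer window, and simple per-channel statistics are concatenated before the classifier to preserve the global form of the series.

import torch
import torch.nn as nn

class CNNWithStats(nn.Module):
    def __init__(self, n_channels=3, n_classes=6):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(n_channels, 64, kernel_size=5, padding=2), nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(64, 64, kernel_size=5, padding=2), nn.ReLU(),
            nn.AdaptiveMaxPool1d(1),          # collapse the time axis
        )
        # four statistics (mean, std, min, max) per input channel
        self.fc = nn.Linear(64 + 4 * n_channels, n_classes)

    def forward(self, x):                     # x: (batch, channels, time)
        local = self.conv(x).squeeze(-1)      # learned local features
        stats = torch.cat([x.mean(-1), x.std(-1),
                           x.amin(-1), x.amax(-1)], dim=1)  # global shape info
        return self.fc(torch.cat([local, stats], dim=1))

logits = CNNWithStats()(torch.randn(8, 3, 50))  # eight ~1 s tri-axial windows at an assumed 50 Hz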

Proceedings ArticleDOI
Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc Van Gool
01 Oct 2017
TL;DR: An end-to-end deep learning approach that bridges the gap between smartphone and DSLR image quality by translating ordinary photos into DSLR-quality images, learning the translation function with a residual convolutional neural network that improves both color rendition and image sharpness.
Abstract: Despite a rapid rise in the quality of built-in smartphone cameras, their physical limitations (small sensor size, compact lenses and the lack of specific hardware) prevent them from achieving the quality results of DSLR cameras. In this work we present an end-to-end deep learning approach that bridges this gap by translating ordinary photos into DSLR-quality images. We propose learning the translation function using a residual convolutional neural network that improves both color rendition and image sharpness. Since the standard mean squared loss is not well suited for measuring perceptual image quality, we introduce a composite perceptual error function that combines content, color and texture losses. The first two losses are defined analytically, while the texture loss is learned in an adversarial fashion. We also present DPED, a large-scale dataset that consists of real photos captured with three different phones and one high-end reflex camera. Our quantitative and qualitative assessments reveal that the enhanced image quality is comparable to that of DSLR-taken photos, while the methodology generalizes to any type of digital camera.

423 citations
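
The composite perceptual error function can be sketched as follows. This is a hedged illustration, not the released DPED code: the weights are placeholders, and vgg_features, discriminator and gaussian_blur stand for user-supplied callables (in the paper the content term uses fixed VGG features and the texture term is a discriminator applied to grayscale images).

import torch
import torch.nn.functional as F

def composite_loss(enhanced, target, vgg_features, discriminator, gaussian_blur,
                   w_content=1.0, w_color=0.1, w_texture=0.4):
    # Content: distance between deep features of a fixed network.
    content = F.mse_loss(vgg_features(enhanced), vgg_features(target))
    # Color: compare blurred images so only color/brightness differences count.
    color = F.mse_loss(gaussian_blur(enhanced), gaussian_blur(target))
    # Texture: adversarial term; the generator tries to make the discriminator
    # label grayscale enhanced images as real DSLR photos.
    pred = discriminator(enhanced.mean(dim=1, keepdim=True))
    texture = F.binary_cross_entropy_with_logits(pred, torch.ones_like(pred))
    return w_content * content + w_color * color + w_texture * texture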

Book ChapterDOI
08 Sep 2018
TL;DR: A study of the current state of deep learning in the Android ecosystem that describes the available frameworks, programming models and limitations of running AI on smartphones, together with an overview of the hardware acceleration resources available on four main mobile chipset platforms.
Abstract: Over the last years, the computational power of mobile devices such as smartphones and tablets has grown dramatically, reaching the level of desktop computers available not long ago. While standard smartphone apps are no longer a problem for them, there is still a group of tasks that can easily challenge even high-end devices, namely running artificial intelligence algorithms. In this paper, we present a study of the current state of deep learning in the Android ecosystem and describe available frameworks, programming models and the limitations of running AI on smartphones. We give an overview of the hardware acceleration resources available on four main mobile chipset platforms: Qualcomm, HiSilicon, MediaTek and Samsung. Additionally, we present the real-world performance results of different mobile SoCs collected with AI Benchmark (http://ai-benchmark.com) that are covering all main existing hardware configurations.

313 citations
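
While the AI Benchmark app itself runs on-device through Android APIs, the per-network measurement it reports essentially comes down to timing repeated inference of a TensorFlow Lite model. A desktop-side sketch of such a measurement loop (the model path is a placeholder, and a float32 input is assumed):

import time
import numpy as np
import tensorflow as tf

interpreter = tf.lite.Interpreter(model_path="model.tflite")  # placeholder path
interpreter.allocate_tensors()
inp = interpreter.get_input_details()[0]

x = np.random.rand(*inp["shape"]).astype(np.float32)  # dummy input
interpreter.set_tensor(inp["index"], x)
interpreter.invoke()                                  # warm-up run

runs = 20
start = time.perf_counter()
for _ in range(runs):
    interpreter.invoke()
print(f"avg inference: {(time.perf_counter() - start) / runs * 1e3:.1f} ms")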

Proceedings ArticleDOI
Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc Van Gool
18 Jun 2018
TL;DR: A weakly supervised photo enhancer (WESPE), a novel image-to-image GAN-based architecture, is proposed to automatically translate photos taken by low-end and compact mobile cameras with limited capabilities into DSLR-quality photos.
Abstract: Low-end and compact mobile cameras demonstrate limited photo quality mainly due to space, hardware and budget constraints. In this work, we propose a deep learning solution that automatically translates photos taken by cameras with limited capabilities into DSLR-quality photos. We tackle this problem by introducing a weakly supervised photo enhancer (WESPE), a novel image-to-image Generative Adversarial Network-based architecture. The proposed model is trained under weak supervision: unlike previous works, there is no need for strong supervision in the form of a large annotated dataset of aligned original/enhanced photo pairs. The sole requirement is two distinct datasets: one from the source camera, and one composed of arbitrary high-quality images that can generally be crawled from the Internet; the visual content they exhibit may be unrelated. In this work, we emphasize extensive evaluation of the obtained results. Besides standard objective metrics and a subjective user study, we train a virtual rater in the form of a separate CNN that mimics human raters on Flickr data and use this network to obtain reference scores for both original and enhanced photos. Our experiments on the DPED, KITTI and Cityscapes datasets, as well as pictures from several generations of smartphones, demonstrate that WESPE produces qualitative results comparable to or better than those of state-of-the-art strongly supervised methods.

173 citations
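
A compressed sketch of a WESPE-style generator objective under weak supervision. This is an illustration under assumptions, not the authors' implementation: G is the enhancer, F_back a backward generator mapping enhanced images back to the source domain, d_color and d_texture are discriminators trained separately against the unpaired high-quality set, and the weights are placeholders.

import torch
import torch.nn.functional as F

def wespe_generator_loss(x_src, G, F_back, vgg_features, d_color, d_texture,
                         gaussian_blur, w=(1.0, 0.005, 0.005, 10.0)):
    enhanced = G(x_src)
    # Content: reconstructing the input through the backward generator keeps
    # the enhanced image faithful to the original scene (no aligned pairs needed).
    content = F.mse_loss(vgg_features(F_back(enhanced)), vgg_features(x_src))
    # Color: adversarial term on blurred images.
    pc = d_color(gaussian_blur(enhanced))
    color = F.binary_cross_entropy_with_logits(pc, torch.ones_like(pc))
    # Texture: adversarial term on grayscale images.
    pt = d_texture(enhanced.mean(1, keepdim=True))
    texture = F.binary_cross_entropy_with_logits(pt, torch.ones_like(pt))
    # Total variation encourages smooth outputs.
    tv = ((enhanced[..., :, 1:] - enhanced[..., :, :-1]).abs().mean()
          + (enhanced[..., 1:, :] - enhanced[..., :-1, :]).abs().mean())
    w_content, w_color, w_texture, w_tv = w
    return (w_content * content + w_color * color
            + w_texture * texture + w_tv * tv)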

Posted Content
Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc Van Gool
TL;DR: In this article, a residual convolutional neural network was proposed to translate ordinary photos into DSLR-quality images by combining content, color, and texture losses; the first two losses are defined analytically, while the texture loss is learned in an adversarial fashion.
Abstract: Despite a rapid rise in the quality of built-in smartphone cameras, their physical limitations (small sensor size, compact lenses and the lack of specific hardware) prevent them from achieving the quality results of DSLR cameras. In this work we present an end-to-end deep learning approach that bridges this gap by translating ordinary photos into DSLR-quality images. We propose learning the translation function using a residual convolutional neural network that improves both color rendition and image sharpness. Since the standard mean squared loss is not well suited for measuring perceptual image quality, we introduce a composite perceptual error function that combines content, color and texture losses. The first two losses are defined analytically, while the texture loss is learned in an adversarial fashion. We also present DPED, a large-scale dataset that consists of real photos captured with three different phones and one high-end reflex camera. Our quantitative and qualitative assessments reveal that the enhanced image quality is comparable to that of DSLR-taken photos, while the methodology generalizes to any type of digital camera.

159 citations


Cited by

Journal ArticleDOI
TL;DR: This article proposes the most exhaustive study of DNNs for TSC by training 8730 deep learning models on 97 time series datasets and provides an open source deep learning framework to the TSC community.
Abstract: Time Series Classification (TSC) is an important and challenging problem in data mining. With the increasing availability of time series data, hundreds of TSC algorithms have been proposed. Among these methods, only a few have considered Deep Neural Networks (DNNs) to perform this task. This is surprising, as deep learning has seen very successful applications in recent years. DNNs have indeed revolutionized the field of computer vision, especially with the advent of novel deeper architectures such as Residual and Convolutional Neural Networks. Apart from images, sequential data such as text and audio can also be processed with DNNs to reach state-of-the-art performance for document classification and speech recognition. In this article, we study the current state-of-the-art performance of deep learning algorithms for TSC by presenting an empirical study of the most recent DNN architectures for TSC. We give an overview of the most successful deep learning applications in various time series domains under a unified taxonomy of DNNs for TSC. We also provide an open-source deep learning framework to the TSC community, in which we implemented each of the compared approaches and evaluated them on a univariate TSC benchmark (the UCR/UEA archive) and 12 multivariate time series datasets. By training 8730 deep learning models on 97 time series datasets, we propose the most exhaustive study of DNNs for TSC to date.

1,833 citations
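
One of the strong baselines compared in this study, the Fully Convolutional Network, is compact enough to sketch. The authors' released framework is Keras-based, so the PyTorch version below is a re-sketch under the commonly cited FCN configuration (three conv blocks of 128/256/128 filters with kernel sizes 8/5/3, then global average pooling); treat the exact settings as assumptions.

import torch
import torch.nn as nn

class FCN(nn.Module):
    def __init__(self, n_classes, n_channels=1):
        super().__init__()
        def block(cin, cout, k):
            return nn.Sequential(nn.Conv1d(cin, cout, k, padding=k // 2),
                                 nn.BatchNorm1d(cout), nn.ReLU())
        self.features = nn.Sequential(block(n_channels, 128, 8),
                                      block(128, 256, 5),
                                      block(256, 128, 3),
                                      nn.AdaptiveAvgPool1d(1))  # global average pooling
        self.head = nn.Linear(128, n_classes)

    def forward(self, x):                     # x: (batch, channels, length)
        return self.head(self.features(x).squeeze(-1))

logits = FCN(n_classes=5)(torch.randn(16, 1, 96))  # a batch of univariate series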

Journal ArticleDOI
TL;DR: Recent advances in deep-learning-based sensor-based activity recognition are surveyed from three aspects: sensor modality, deep model, and application. Detailed insights into existing work are presented, and grand challenges for future research are proposed.

1,334 citations

Posted Content
TL;DR: The superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, is shown, suggesting that the HRNet is a stronger backbone for computer vision problems.
Abstract: High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image as a low-resolution representation through a subnetwork that is formed by connecting high-to-low resolution convolutions in series (e.g., ResNet, VGGNet), and then recover the high-resolution representation from the encoded low-resolution representation. Instead, our proposed network, named High-Resolution Network (HRNet), maintains high-resolution representations through the whole process. There are two key characteristics: (i) the high-to-low resolution convolution streams are connected in parallel; (ii) information is repeatedly exchanged across resolutions. The benefit is that the resulting representation is semantically richer and spatially more precise. We show the superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, suggesting that the HRNet is a stronger backbone for computer vision problems. All the code is available at this https URL.

1,278 citations
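
The two key characteristics, parallel multi-resolution streams with repeated information exchange, can be illustrated with a toy two-branch block. This is a sketch, not the HRNet code; channel counts and fusion details are simplified assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ExchangeBlock(nn.Module):
    def __init__(self, ch_hi=32, ch_lo=64):
        super().__init__()
        self.hi = nn.Conv2d(ch_hi, ch_hi, 3, padding=1)
        self.lo = nn.Conv2d(ch_lo, ch_lo, 3, padding=1)
        self.hi_to_lo = nn.Conv2d(ch_hi, ch_lo, 3, stride=2, padding=1)  # downsample
        self.lo_to_hi = nn.Conv2d(ch_lo, ch_hi, 1)                       # 1x1, then upsample

    def forward(self, x_hi, x_lo):
        h, l = F.relu(self.hi(x_hi)), F.relu(self.lo(x_lo))
        # Fusion: each stream receives the other one, resampled to its resolution.
        new_hi = h + F.interpolate(self.lo_to_hi(l), size=h.shape[-2:],
                                   mode="bilinear", align_corners=False)
        new_lo = l + self.hi_to_lo(h)
        return new_hi, new_lo

hi, lo = torch.randn(1, 32, 64, 64), torch.randn(1, 64, 32, 32)
hi, lo = ExchangeBlock()(hi, lo)  # both streams keep their resolution throughout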