Home
/
Authors
/
Tomoki Yoshida

Author

Tomoki Yoshida

Bio: Tomoki Yoshida is an academic researcher from ETH Zurich. The author has contributed to research in topics: Real image & Image resolution. The author has an hindex of 2, co-authored 2 publications receiving 93 citations. Previous affiliations of Tomoki Yoshida include York University.

Topics: Real image, Image resolution, sRGB, Color space, Image quality ...read more

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

NTIRE 2019 Challenge on Real Image Denoising: Methods and Results

[...]

Abdelrahman Abdelhamed¹, Radu Timofte¹, Michael S. Brown¹, Songhyun Yu¹, Bumjun Park¹, Jechang Jeong¹, Seung-Won Jung¹, Dong-Wook Kim¹, Jae-Ryun Chung¹, Jiaming Liu¹, Yuzhi Wang¹, Chi-Hao Wu¹, Qin Xu¹, Chuan Wang¹, Shaofan Cai¹, Yifan Ding¹, Haoqiang Fan¹, Jue Wang¹, Kai Zhang¹, Wangmeng Zuo¹, Magauiya Zhussip¹, Dongwon Park¹, Shakarim Soltanayev¹, Se Young Chun¹, Zhiwei Xiong¹, Chang Chen¹, Muhammad Haris¹, Kazutoshi Akita¹, Tomoki Yoshida¹, Greg Shakhnarovich¹, Norimichi Ukita¹, Syed Waqas Zamir¹, Aditya Arora¹, Salman Khan¹, Fahad Shahbaz Khan¹, Ling Shao¹, Sung-Jea Ko¹, Dong-Pan Lim¹, Seung-Wook Kim¹, Seo-Won Ji¹, Sang-Won Lee¹, Wenyi Tang¹, Yuchen Fan¹, Yuqian Zhou¹, Ding Liu¹, Thomas S. Huang¹, Deyu Meng¹, Lei Zhang¹, Hongwei Yong¹, Yiyun Zhao¹, Pengliang Tang¹, Yue Lu¹, Raimondo Schettini¹, Simone Bianco¹, Simone Zini¹, Chi Li¹, Yang Wang¹, Zhiguo Cao¹ - Show less +54 more•Institutions (1)

York University¹

16 Jun 2019

TL;DR: The proposed methods by the 15 teams represent the current state-of-the-art performance in image denoising targeting real noisy images.

...read moreread less

Abstract: This paper reviews the NTIRE 2019 challenge on real image denoising with focus on the proposed methods and their results. The challenge has two tracks for quantitatively evaluating image denoising performance in (1) the Bayer-pattern raw-RGB and (2) the standard RGB (sRGB) color spaces. The tracks had 216 and 220 registered participants, respectively. A total of 15 teams, proposing 17 methods, competed in the final phase of the challenge. The proposed methods by the 15 teams represent the current state-of-the-art performance in image denoising targeting real noisy images.

...read moreread less

99 citations

Proceedings Article•DOI•

NTIRE 2019 Challenge on Image Enhancement: Methods and Results

[...]

Andrey Ignatov¹, Radu Timofte¹, Xiaochao Qu¹, Xingguang Zhou¹, Ting Liu¹, Pengfei Wan¹, Syed Waqas Zamir¹, Aditya Arora¹, Salman Khan¹, Fahad Shahbaz Khan¹, Ling Shao¹, Dongwon Park¹, Se Young Chun¹, Pablo Navarrete Michelini¹, Hanwen Liu¹, Dan Zhu¹, Zhiwei Zhong¹, Xianming Liu¹, Junjun Jiang¹, Debin Zhao¹, Muhammad Haris¹, Kazutoshi Akita¹, Tomoki Yoshida¹, Greg Shakhnarovich¹, Norimichi Ukita¹, Jie Liu¹, Cheolkon Jung¹, Raimondo Schettini¹, Simone Bianco¹, Claudio Cusano¹, Flavio Piccoli¹, Pengju Liu¹, Kai Zhang¹, Jingdong Liu¹, Jiye Liu¹, Hongzhi Zhang¹, Wangmeng Zuo¹, Nelson Chong Ngee Bow¹, Lai-Kuan Wong¹, John See¹, Jinghui Qin¹, Lishan Huang¹, Yukai Shi¹, Pengxu Wei¹, Wushao Wen¹, Liang Lin¹, Zheng Hui¹, Xiumei Wang¹, Xinbo Gao¹, Kanti Kumari¹, Vikas Kumar Anand¹, Mahendra Khened¹, Ganapathy Krishnamurthi¹ - Show less +49 more•Institutions (1)

ETH Zurich¹

16 Jun 2019

TL;DR: The first NTIRE challenge on perceptual image enhancement as discussed by the authors focused on proposed solutions and results of real-world photo enhancement problem, where the goal was to map low-quality photos from the iPhone 3GS device to the same photos captured with Canon 70D DSLR camera.

...read moreread less

Abstract: This paper reviews the first NTIRE challenge on perceptual image enhancement with the focus on proposed solutions and results. The participating teams were solving a real-world photo enhancement problem, where the goal was to map low-quality photos from the iPhone 3GS device to the same photos captured with Canon 70D DSLR camera. The considered problem embraced a number of computer vision subtasks, such as image denoising, image resolution and sharpness enhancement, image color/contrast/exposure adjustment, etc. The target metric used in this challenge combined PSNR and SSIM scores with solutions' perceptual results measured in the user study. The proposed solutions significantly improved baseline results, defining the state-of-the-art for practical image enhancement.

...read moreread less

45 citations

Cited by

PDF

Open Access

More filters

Posted Content•

Deep High-Resolution Representation Learning for Visual Recognition

[...]

Jingdong Wang¹, Ke Sun², Tianheng Cheng³, Borui Jiang⁴, Chaorui Deng⁵, Yang Zhao⁶, Dong Liu², Yadong Mu⁴, Mingkui Tan⁵, Xinggang Wang³, Wenyu Liu³, Bin Xiao¹ - Show less +8 more•Institutions (6)

Microsoft¹, University of Science and Technology of China², Huazhong University of Science and Technology³, Peking University⁴, South China University of Technology⁵, Griffith University⁶

20 Aug 2019-arXiv: Computer Vision and Pattern Recognition

TL;DR: The superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, is shown, suggesting that the HRNet is a stronger backbone for computer vision problems.

...read moreread less

Abstract: High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image as a low-resolution representation through a subnetwork that is formed by connecting high-to-low resolution convolutions \emph{in series} (e.g., ResNet, VGGNet), and then recover the high-resolution representation from the encoded low-resolution representation. Instead, our proposed network, named as High-Resolution Network (HRNet), maintains high-resolution representations through the whole process. There are two key characteristics: (i) Connect the high-to-low resolution convolution streams \emph{in parallel}; (ii) Repeatedly exchange the information across resolutions. The benefit is that the resulting representation is semantically richer and spatially more precise. We show the superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, suggesting that the HRNet is a stronger backbone for computer vision problems. All the codes are available at~{\url{this https URL}}.

...read moreread less

1,278 citations

Journal Article•DOI•

Deep High-Resolution Representation Learning for Visual Recognition

[...]

Microsoft¹, University of Science and Technology of China², Huazhong University of Science and Technology³, Peking University⁴, South China University of Technology⁵, Griffith University⁶

01 Oct 2021-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The High-Resolution Network (HRNet) as mentioned in this paper maintains high-resolution representations through the whole process by connecting the high-to-low resolution convolution streams in parallel and repeatedly exchanging the information across resolutions.

...read moreread less

Abstract: High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image as a low-resolution representation through a subnetwork that is formed by connecting high-to-low resolution convolutions in series (e.g., ResNet, VGGNet), and then recover the high-resolution representation from the encoded low-resolution representation. Instead, our proposed network, named as High-Resolution Network (HRNet), maintains high-resolution representations through the whole process. There are two key characteristics: (i) Connect the high-to-low resolution convolution streams in parallel and (ii) repeatedly exchange the information across resolutions. The benefit is that the resulting representation is semantically richer and spatially more precise. We show the superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, suggesting that the HRNet is a stronger backbone for computer vision problems. All the codes are available at https://github.com/HRNet .

...read moreread less

1,162 citations

Proceedings Article•DOI•

Multi-Stage Progressive Image Restoration

[...]

Syed Waqas Zamir, Aditya Arora, Salman Khan¹, Munawar Hayat², Fahad Shahbaz Khan¹, Ming-Hsuan Yang³, Ling Shao - Show less +3 more•Institutions (3)

Zayed University¹, Monash University², University of California, Merced³

04 Feb 2021

TL;DR: MPRNet as discussed by the authors proposes a multi-stage architecture that progressively learns restoration functions for the degraded inputs, thereby breaking down the overall recovery process into more manageable steps, and introduces a novel per-pixel adaptive design that leverages in-situ supervised attention to reweight the local features.

...read moreread less

Abstract: Image restoration tasks demand a complex balance between spatial details and high-level contextualized information while recovering images. In this paper, we propose a novel synergistic design that can optimally balance these competing goals. Our main proposal is a multi-stage architecture, that progressively learns restoration functions for the degraded inputs, thereby breaking down the overall recovery process into more manageable steps. Specifically, our model first learns the contextualized features using encoder-decoder architectures and later combines them with a high-resolution branch that retains local information. At each stage, we introduce a novel per-pixel adaptive design that leverages in-situ supervised attention to reweight the local features. A key ingredient in such a multi-stage architecture is the information exchange between different stages. To this end, we propose a two-faceted approach where the information is not only exchanged sequentially from early to late stages, but lateral connections between feature processing blocks also exist to avoid any loss of information. The resulting tightly interlinked multi-stage architecture, named as MPRNet, delivers strong performance gains on ten datasets across a range of tasks including image deraining, deblurring, and denoising. The source code and pre-trained models are available at https://github.com/swz30/MPRNet.

...read moreread less

716 citations

Proceedings Article•DOI•

Image Super-Resolution with Non-Local Sparse Attention

[...]

Yiqun Mei¹, Yuchen Fan¹, Yuqian Zhou¹•Institutions (1)

University of Illinois at Urbana–Champaign¹

01 Jun 2021

TL;DR: Non-local sparse attention (NLSA) as mentioned in this paper is designed to retain long-range modeling capability from non-local operation while enjoying robustness and high-efficiency of sparse representation, which partitions the input space into hash buckets of related features.

...read moreread less

Abstract: Both Non-Local (NL) operation and sparse representation are crucial for Single Image Super-Resolution (SISR). In this paper, we investigate their combinations and propose a novel Non-Local Sparse Attention (NLSA) with dynamic sparse attention pattern. NLSA is designed to retain long-range modeling capability from NL operation while enjoying robustness and high-efficiency of sparse representation. Specifically, NLSA rectifies non-local attention with spherical locality sensitive hashing (LSH) that partitions the input space into hash buckets of related features. For every query signal, NLSA assigns a bucket to it and only computes attention within the bucket. The resulting sparse attention prevents the model from attending to locations that are noisy and less-informative, while reducing the computational cost from quadratic to asymptotic linear with respect to the spatial size. Extensive experiments validate the effectiveness and efficiency of NLSA. With a few non-local sparse attention modules, our architecture, called non-local sparse network (NLSN), reaches state-of-the-art performance for SISR quantitatively and qualitatively.

...read moreread less

216 citations

Proceedings Article•DOI•

CycleISP: Real Image Restoration via Improved Data Synthesis

[...]

Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang¹, Ling Shao - Show less +3 more•Institutions (1)

University of California, Merced¹

14 Jun 2020

TL;DR: CycleISP as discussed by the authors is a framework that models camera imaging pipeline in forward and reverse directions, which allows to produce any number of realistic image pairs for denoising both in RAW and sRGB spaces.

...read moreread less

Abstract: The availability of large-scale datasets has helped unleash the true potential of deep convolutional neural networks (CNNs). However, for the single-image denoising problem, capturing a real dataset is an unacceptably expensive and cumbersome procedure. Consequently, image denoising algorithms are mostly developed and evaluated on synthetic data that is usually generated with a widespread assumption of additive white Gaussian noise (AWGN). While the CNNs achieve impressive results on these synthetic datasets, they do not perform well when applied on real camera images, as reported in recent benchmark datasets. This is mainly because the AWGN is not adequate for modeling the real camera noise which is signal-dependent and heavily transformed by the camera imaging pipeline. In this paper, we present a framework that models camera imaging pipeline in forward and reverse directions. It allows us to produce any number of realistic image pairs for denoising both in RAW and sRGB spaces. By training a new image denoising network on realistic synthetic data, we achieve the state-of-the-art performance on real camera benchmark datasets. The parameters in our models are ~5 times lesser than the previous best method for RAW denoising. Furthermore, we demonstrate that the proposed framework generalizes beyond image denoising problem e.g., for color matching in stereoscopic cinema. The source code and pre-trained models are available at https://github.com/swz30/CycleISP.

...read moreread less

150 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25

Collapse