Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021 Challenge: Report

Open AccessPosted Content

Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021 Challenge: Report

- 17 May 2021 -

TLDR

In this article, an end-to-end deep learning-based image signal processing (ISP) pipeline that can replace classical hand-crafted ISPs and achieve nearly real-time performance on smartphone NPUs was developed.

Abstract:

As the quality of mobile cameras starts to play a crucial role in modern smartphones, more and more attention is now being paid to ISP algorithms used to improve various perceptual aspects of mobile photos. In this Mobile AI challenge, the target was to develop an end-to-end deep learning-based image signal processing (ISP) pipeline that can replace classical hand-crafted ISPs and achieve nearly real-time performance on smartphone NPUs. For this, the participants were provided with a novel learned ISP dataset consisting of RAW-RGB image pairs captured with the Sony IMX586 Quad Bayer mobile sensor and a professional 102-megapixel medium format camera. The runtime of all models was evaluated on the MediaTek Dimensity 1000+ platform with a dedicated AI processing unit capable of accelerating both floating-point and quantized neural networks. The proposed solutions are fully compatible with the above NPU and are capable of processing Full HD photos under 60-100 milliseconds while achieving high fidelity results. A detailed description of all models developed in this challenge is provided in this paper.

References

PDF

Open Access

More filters

Book ChapterDOI

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, +2 more

TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

...read moreread less

Proceedings ArticleDOI

Searching for MobileNetV3

Andrew Howard, +11 more

TL;DR: MobileNetV3 as mentioned in this paper is the next generation of MobileNets based on a combination of complementary search techniques as well as a novel architecture design and achieves state-of-the-art results for mobile classification, detection and segmentation.

...read moreread less

Book ChapterDOI

ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks

Xintao Wang, +7 more

TL;DR: ESRGAN as mentioned in this paper improves the perceptual loss by using the features before activation, which could provide stronger supervision for brightness consistency and texture recovery, and won the first place in the PIRM2018-SR Challenge (region 3).

...read moreread less

Book ChapterDOI

Accelerating the Super-Resolution Convolutional Neural Network

Chao Dong, +2 more

TL;DR: Zhang et al. as mentioned in this paper proposed a compact hourglass-shape CNN structure for faster and better image super-resolution, which can achieve real-time performance on a generic CPU while still maintaining good performance.

...read moreread less

Proceedings ArticleDOI

MnasNet: Platform-Aware Neural Architecture Search for Mobile

Mingxing Tan, +6 more

TL;DR: In this article, the authors propose an automated mobile neural architecture search (MNAS) approach, which explicitly incorporates model latency into the main objective so that the search can identify a model that achieves a good trade-off between accuracy and latency.

...read moreread less

Collapse

IEEE Access

M-TEEVE: real-time 3D video interaction and broadcasting framework for mobile devices

Shu Shi, +3 more

Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021 Challenge: Report

References

U-Net: Convolutional Networks for Biomedical Image Segmentation

Searching for MobileNetV3

ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks

Accelerating the Super-Resolution Convolutional Neural Network

MnasNet: Platform-Aware Neural Architecture Search for Mobile

Related Papers (5)

Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021 Challenge: Report

Energy-efficient mobile video management using smartphones

Demo Abstract: On-Demand Information Retrieval from Videos Using Deep Learning in Wireless Networks

mVideo: Edge Computing Based Mobile Video Processing Systems

M-TEEVE: real-time 3D video interaction and broadcasting framework for mobile devices