Journal ArticleDOI

A Highly Parallel Framework for HEVC Coding Unit Partitioning Tree Decision on Many-core Processors

11 Mar 2014-IEEE Signal Processing Letters (IEEE)-Vol. 21, Iss: 5, pp 573-576
TL;DR: This paper proposes a parallel framework for deciding coding unit trees, built on an in-depth analysis of the dependencies among different coding units, and achieves average speedups of more than 11 and 16 times for 1920x1080 and 2560x1600 video sequences, respectively, without any coding efficiency degradation.
Abstract: High Efficiency Video Coding (HEVC) uses a very flexible tree structure to organize coding units, which leads to superior coding efficiency compared with previous video coding standards. However, such a flexible coding unit tree structure also poses a great challenge for encoders. To fully exploit the coding efficiency brought by this structure, an encoder must spend an enormous amount of computation deciding the optimal coding unit tree for each image block. One way to handle this complexity is to use parallel computing enabled by many-core processors. In this paper, we analyze the challenges of making coding unit tree decisions on many-core processors. Through an in-depth analysis of the dependencies among different coding units, we propose a parallel framework for deciding coding unit trees. Experimental results show that, on the Tile64 platform, our proposed method achieves average speedups of more than 11 and 16 times for 1920x1080 and 2560x1600 video sequences, respectively, without any coding efficiency degradation.
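The dependency argument in the abstract lends itself to a bottom-up formulation: the four sub-CUs of any CU have no data dependency on one another, so their partitioning decisions can be evaluated concurrently and merged against the cost of not splitting. Below is a minimal, runnable sketch of that idea; the variance-based rd_cost, the split penalty, and the thread pool are toy assumptions, not the paper's Tile64 implementation.

```python
import numpy as np
from concurrent.futures import ThreadPoolExecutor

MIN_CU = 8           # smallest CU size in HEVC
SPLIT_PENALTY = 1.0  # toy rate cost of signalling a split

def rd_cost(block):
    # Toy stand-in: pixel variance as the cost of coding the block
    # as a single CU (a real encoder measures rate and distortion).
    return float(np.var(block)) + 1.0

def decide_cu_tree(block, pool=None):
    """Return (best_cost, tree) for one CU; sibling sub-CUs are
    independent, so the top level can evaluate them concurrently."""
    size = block.shape[0]
    whole = rd_cost(block)
    if size <= MIN_CU:
        return whole, "leaf"
    h = size // 2
    quads = [block[:h, :h], block[:h, h:], block[h:, :h], block[h:, h:]]
    if pool is not None:
        # Parallelize only the first level; the recursion below each
        # sub-CU runs serially on its worker thread.
        results = list(pool.map(decide_cu_tree, quads))
    else:
        results = [decide_cu_tree(q) for q in quads]
    split = sum(c for c, _ in results) + SPLIT_PENALTY
    if split < whole:
        return split, ("split", [t for _, t in results])
    return whole, "leaf"

ctu = np.random.default_rng(0).integers(0, 256, (64, 64)).astype(float)
with ThreadPoolExecutor(max_workers=4) as pool:
    cost, tree = decide_cu_tree(ctu, pool)
print(round(cost, 1))
```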
Citations
Journal ArticleDOI
TL;DR: This paper analyzes the ME structure in HEVC and proposes a parallel framework to decouple ME for different partitions on many-core processors and achieves more than 30 and 40 times speedup for 1920 × 1080 and 2560 × 1600 video sequences, respectively.
Abstract: High Efficiency Video Coding (HEVC) provides superior coding efficiency compared with previous video coding standards, at the cost of increased encoding complexity. The complexity increase of the motion estimation (ME) procedure is particularly significant, especially considering the complicated partitioning structure of HEVC. Fully exploiting the coding efficiency brought by HEVC requires a huge amount of computation. In this paper, we analyze the ME structure in HEVC and propose a parallel framework to decouple ME for different partitions on many-core processors. Based on the local parallel method (LPM), we first use a directed acyclic graph (DAG)-based order to parallelize coding tree units (CTUs) and adopt an improved LPM (ILPM) within each CTU (DAGILPM), which exploits CTU-level and prediction unit (PU)-level parallelism. Then, we find that there exist completely independent PUs (CIPUs) and partially independent PUs (PIPUs). When the degree of parallelism (DP) is smaller than the maximum DP of DAGILPM, we process the CIPUs and PIPUs, which further increases the DP. The data dependencies and coding efficiency stay the same as LPM. Experiments show that on a 64-core system, compared with serial execution, our proposed scheme achieves more than 30 and 40 times speedup for 1920 × 1080 and 2560 × 1600 video sequences, respectively.
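The DAG-based CTU ordering mentioned here can be illustrated with a small scheduler: each CTU is a node whose predecessors must finish first, and every CTU whose in-degree drops to zero joins the next parallel "wave". The left/top-right dependency pattern below is the common HEVC wavefront assumption, used purely for illustration.

```python
from collections import deque

def ctu_waves(cols, rows):
    """Yield waves of CTU coordinates whose dependencies are satisfied,
    i.e. a topological schedule of the CTU dependency DAG."""
    indeg = {(x, y): 0 for y in range(rows) for x in range(cols)}
    succ = {k: [] for k in indeg}
    for (x, y) in indeg:
        for p in [(x - 1, y), (x + 1, y - 1)]:  # left, top-right neighbours
            if p in succ:
                indeg[(x, y)] += 1
                succ[p].append((x, y))
    ready = deque(k for k, d in indeg.items() if d == 0)
    while ready:
        wave = list(ready)   # all currently independent CTUs
        ready.clear()
        yield wave
        for node in wave:
            for s in succ[node]:
                indeg[s] -= 1
                if indeg[s] == 0:
                    ready.append(s)

for i, wave in enumerate(ctu_waves(cols=4, rows=3)):
    print(f"wave {i}: {wave}")
```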

366 citations


Cites background from "A Highly Parallel Framework for HEV..."

  • ...intra-mode decision [31], [32], and CU partitioning tree decision [33]....

    [...]

Journal ArticleDOI
TL;DR: A deep HSI sharpening method is presented for the fusion of an LR-HSI with an HR-MSI, which directly learns the image priors via deep convolutional neural network-based residual learning.
Abstract: Hyperspectral image (HSI) sharpening, which aims at fusing an observable low spatial resolution (LR) HSI (LR-HSI) with a high spatial resolution (HR) multispectral image (HR-MSI) of the same scene to acquire an HR-HSI, has recently attracted much attention. Most recent HSI sharpening approaches are based on image prior modeling, which is usually sensitive to parameter selection and time-consuming. This paper presents a deep HSI sharpening method (named DHSIS) for the fusion of an LR-HSI with an HR-MSI, which directly learns the image priors via deep convolutional neural network-based residual learning. The DHSIS method incorporates the learned deep priors into the LR-HSI and HR-MSI fusion framework. Specifically, we first initialize the HR-HSI from the fusion framework by solving a Sylvester equation. Then, we map the initialized HR-HSI to the reference HR-HSI via deep residual learning to learn the image priors. Finally, the learned image priors are returned to the fusion framework to reconstruct the final HR-HSI. Experimental results demonstrate the superiority of the DHSIS approach over existing state-of-the-art HSI sharpening approaches in terms of reconstruction accuracy and running time.
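The initialization step the abstract mentions, solving a Sylvester equation AX + XB = Q for the fused image, has a direct closed-form route in SciPy. The matrices below are random placeholders; in DHSIS they would be derived from the spectral and spatial degradation operators of the LR-HSI/HR-MSI observation model.

```python
import numpy as np
from scipy.linalg import solve_sylvester

rng = np.random.default_rng(0)
A = rng.standard_normal((6, 6))   # placeholder spectral-domain operator
B = rng.standard_normal((8, 8))   # placeholder spatial-domain operator
Q = rng.standard_normal((6, 8))   # right-hand side built from the inputs

X = solve_sylvester(A, B, Q)      # closed-form solve of A X + X B = Q
print(np.allclose(A @ X + X @ B, Q))  # True: X satisfies the equation
```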

302 citations


Cites background from "A Highly Parallel Framework for HEV..."

  • ...selections, which may need parallel computing [15], [16] to speed up....

    [...]

Journal ArticleDOI
TL;DR: The proposed spatial-temporal attention mechanism (STAT) within an encoder-decoder neural network for video captioning takes into account both the spatial and temporal structures in a video, enabling the decoder to automatically select the significant regions in the most relevant temporal segments for word prediction.
Abstract: Video captioning refers to automatically generating natural language sentences that summarize the video content. Inspired by the visual attention mechanism of human beings, the temporal attention mechanism has been widely used in video description to selectively focus on important frames. However, most existing methods based on the temporal attention mechanism suffer from recognition errors and missing details, because temporal attention alone cannot capture significant regions within frames. To address these problems, we propose a novel spatial-temporal attention mechanism (STAT) within an encoder-decoder neural network for video captioning. The proposed STAT takes into account both the spatial and temporal structures in a video, enabling the decoder to automatically select the significant regions in the most relevant temporal segments for word prediction. We evaluate STAT on two well-known benchmarks: MSVD and MSR-VTT-10K. Experimental results show that our proposed STAT achieves state-of-the-art performance on several popular evaluation metrics: BLEU-4, METEOR, and CIDEr.
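A two-level soft attention of this kind can be sketched in a few lines: spatial attention weights regions within each frame, temporal attention weights the resulting frame summaries, and the pooled context feeds word prediction. The shapes and the dot-product scoring below are illustrative assumptions, not the exact STAT formulation.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(0)
T, R, D = 10, 6, 32                     # frames, regions per frame, feature dim
feats = rng.standard_normal((T, R, D))  # region features for each frame
h = rng.standard_normal(D)              # decoder hidden state at this step

spatial = softmax(feats @ h, axis=1)             # (T, R) weights over regions
frame_vec = (spatial[..., None] * feats).sum(1)  # (T, D) per-frame summary
temporal = softmax(frame_vec @ h)                # (T,)  weights over frames
context = temporal @ frame_vec                   # (D,)  input to word prediction
print(context.shape)
```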

251 citations

Journal ArticleDOI
TL;DR: A snapshot of the fast-growing deep learning field for microscopy image analysis, which explains the architectures and principles of convolutional neural networks, fully convolutional networks, recurrent neural networks, stacked autoencoders, and deep belief networks, and their formulations or modelings for specific tasks on various microscopy images.
Abstract: Computerized microscopy image analysis plays an important role in computer aided diagnosis and prognosis. Machine learning techniques have powered many aspects of medical investigation and clinical practice. Recently, deep learning has emerged as a leading machine learning tool in computer vision and has attracted considerable attention in biomedical image analysis. In this paper, we provide a snapshot of this fast-growing field, specifically for microscopy image analysis. We briefly introduce the popular deep neural networks and summarize current deep learning achievements in various tasks, such as detection, segmentation, and classification in microscopy image analysis. In particular, we explain the architectures and the principles of convolutional neural networks, fully convolutional networks, recurrent neural networks, stacked autoencoders, and deep belief networks, and interpret their formulations or modelings for specific tasks on various microscopy images. In addition, we discuss the open challenges and the potential trends of future research in microscopy image analysis using deep learning.

235 citations

Journal ArticleDOI
TL;DR: A novel model, the stacked autoencoder Levenberg-Marquardt model, a deep neural network architecture aimed at improving forecasting accuracy, is presented, along with an optimized structure for a traffic flow forecasting model based on a deep learning approach.
Abstract: Forecasting accuracy is an important issue for successful intelligent traffic management, especially in the domain of traffic efficiency and congestion reduction. The dawning of the big data era brings opportunities to greatly improve prediction accuracy. In this paper, we propose a novel model, stacked autoencoder Levenberg-Marquardt model, which is a type of deep architecture of neural network approach aiming to improve forecasting accuracy. The proposed model is designed using the Taguchi method to develop an optimized structure and to learn traffic flow features through layer-by-layer feature granulation with a greedy layerwise unsupervised learning algorithm. It is applied to real-world data collected from the M6 freeway in the U.K. and is compared with three existing traffic predictors. To the best of our knowledge, this is the first time that an optimized structure of the traffic flow forecasting model with a deep learning approach is presented. The evaluation results demonstrate that the proposed model with an optimized structure has superior performance in traffic flow forecasting.
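The greedy layer-wise pretraining the abstract describes can be sketched compactly: each autoencoder layer is trained to reconstruct its own input, and its hidden code becomes the input of the next layer. The tied-weight tanh autoencoder and plain gradient loop below are toy assumptions; the paper's Levenberg-Marquardt fine-tuning and Taguchi-based structure search are omitted.

```python
import numpy as np

def train_autoencoder(X, hidden, steps=200, lr=0.05, seed=0):
    """Train one tied-weight tanh autoencoder to reconstruct X."""
    rng = np.random.default_rng(seed)
    W = 0.1 * rng.standard_normal((X.shape[1], hidden))
    for _ in range(steps):
        H = np.tanh(X @ W)               # encode
        E = (H @ W.T - X) / len(X)       # mean reconstruction error (decode)
        dH = E @ W                       # gradient through the decoder
        W -= lr * (X.T @ (dH * (1 - H**2)) + E.T @ H)  # tied-weight gradient
    return W

X = np.random.default_rng(1).standard_normal((256, 20))  # toy traffic features
layers, inp = [], X
for hidden in (16, 8):                   # greedy, layer-by-layer stacking
    W = train_autoencoder(inp, hidden)
    layers.append(W)
    inp = np.tanh(inp @ W)               # hidden code feeds the next layer
print([W.shape for W in layers])
```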

216 citations

References
Journal ArticleDOI
TL;DR: An overview of the technical features of H.264/AVC is provided, profiles and applications for the standard are described, and the history of the standardization process is outlined.
Abstract: H.264/AVC is the newest video coding standard of the ITU-T Video Coding Experts Group and the ISO/IEC Moving Picture Experts Group. The main goals of the H.264/AVC standardization effort have been enhanced compression performance and provision of a "network-friendly" video representation addressing "conversational" (video telephony) and "nonconversational" (storage, broadcast, or streaming) applications. H.264/AVC has achieved a significant improvement in rate-distortion efficiency relative to existing standards. This article provides an overview of the technical features of H.264/AVC, describes profiles and applications for the standard, and outlines the history of the standardization process.

8,646 citations

Journal ArticleDOI
TL;DR: The main goal of the HEVC standardization effort is to enable significantly improved compression performance relative to existing standards-in the range of 50% bit-rate reduction for equal perceptual video quality.
Abstract: High Efficiency Video Coding (HEVC) is currently being prepared as the newest video coding standard of the ITU-T Video Coding Experts Group and the ISO/IEC Moving Picture Experts Group. The main goal of the HEVC standardization effort is to enable significantly improved compression performance relative to existing standards-in the range of 50% bit-rate reduction for equal perceptual video quality. This paper provides an overview of the technical features and characteristics of the HEVC standard.

7,383 citations


"A Highly Parallel Framework for HEV..." refers background or methods in this paper

  • ...To enhance the coding efficiency of HEVC, HEVC provides as many as 35 prediction modes [1]....

    [...]

  • ...HIGH EFFICIENCY VIDEO CODING (HEVC) is the state-of-the-art video coding standard [1]–[4]....

    [...]

Journal ArticleDOI
TL;DR: A unified approach to the coder control of video coding standards such as MPEG-2, H.263, MPEG-4, and the draft video coding standard H.264/AVC (advanced video coding) is presented.
Abstract: A unified approach to the coder control of video coding standards such as MPEG-2, H.263, MPEG-4, and the draft video coding standard H.264/AVC (advanced video coding) is presented. The performance of the various standards is compared by means of PSNR and subjective testing results. The results indicate that H.264/AVC compliant encoders typically achieve essentially the same reproduction quality as encoders that are compliant with the previous standards while typically requiring 60% or less of the bit rate.
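The unified coder control in this reference is built around Lagrangian rate-distortion optimization, choosing the mode that minimizes J = D + λR; a hedged sketch of that decision rule, with made-up candidate modes:

```python
def best_mode(candidates, lam):
    """candidates: iterable of (mode, distortion, rate_bits) tuples."""
    return min(candidates, key=lambda c: c[1] + lam * c[2])

# Hypothetical per-block measurements: low-rate modes cost distortion,
# high-rate modes buy quality; lambda steers the trade-off.
modes = [("SKIP", 120.0, 2), ("INTER_16x16", 40.0, 55), ("INTRA_4x4", 25.0, 140)]
print(best_mode(modes, lam=0.85))
```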

3,312 citations

Journal ArticleDOI
TL;DR: The results of subjective tests for WVGA and HD sequences indicate that HEVC encoders can achieve subjective reproduction quality equivalent to encoders that conform to H.264/MPEG-4 AVC while using approximately 50% less bit rate on average.
Abstract: The compression capability of several generations of video coding standards is compared by means of peak signal-to-noise ratio (PSNR) and subjective testing results. A unified approach is applied to the analysis of designs, including H.262/MPEG-2 Video, H.263, MPEG-4 Visual, H.264/MPEG-4 Advanced Video Coding (AVC), and High Efficiency Video Coding (HEVC). The results of subjective tests for WVGA and HD sequences indicate that HEVC encoders can achieve equivalent subjective reproduction quality as encoders that conform to H.264/MPEG-4 AVC when using approximately 50% less bit rate on average. The HEVC design is shown to be especially effective for low bit rates, high-resolution video content, and low-delay communication applications. The measured subjective improvement somewhat exceeds the improvement measured by the PSNR metric.
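PSNR, the objective metric used alongside the subjective tests in this reference, has a standard definition; a minimal sketch for 8-bit content (peak value 255):

```python
import numpy as np

def psnr(ref, rec, peak=255.0):
    mse = np.mean((ref.astype(float) - rec.astype(float)) ** 2)
    return float("inf") if mse == 0 else 10 * np.log10(peak ** 2 / mse)

rng = np.random.default_rng(0)
ref = rng.integers(0, 256, (16, 16))
rec = np.clip(ref + rng.integers(-3, 4, ref.shape), 0, 255)  # mild "coding noise"
print(round(psnr(ref, rec), 2))
```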

1,279 citations


"A Highly Parallel Framework for HEV..." refers methods in this paper

  • ...264/AVC, HEVC provides a similar reconstructed quality with about half of bitrate [5], which largely benefits from a highly flexible hierarchy of HEVC coding unit (CU) partitioning [6]....

    [...]