Author

Cheng Chen

Other affiliations: University of Iowa
Bio: Cheng Chen is an academic researcher from Google. The author has contributed to research in the topics Codec and Data compression, has an h-index of 11, and has co-authored 33 publications receiving 467 citations. Previous affiliations of Cheng Chen include the University of Iowa.

Papers
Proceedings ArticleDOI
24 Jun 2018
TL;DR: A brief technical overview of key coding techniques in AV1 is provided, along with a preliminary compression performance comparison against VP9 and HEVC.
Abstract: AV1 is an emerging open-source and royalty-free video compression format, jointly developed and finalized in early 2018 by the Alliance for Open Media (AOMedia) industry consortium. The main goal of AV1 development is to achieve substantial compression gain over state-of-the-art codecs while maintaining practical decoding complexity and hardware feasibility. This paper provides a brief technical overview of key coding techniques in AV1, along with a preliminary compression performance comparison against VP9 and HEVC.

260 citations
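As a concrete illustration of the kind of compression comparison described above, the sketch below encodes one clip with the AV1 and VP9 encoders through ffmpeg and compares output sizes. The input file name, the CRF value, and the assumption that ffmpeg was built with libaom and libvpx are illustrative choices, not details from the paper.

```python
# Hypothetical comparison: encode the same clip with libaom-AV1 and libvpx-VP9
# via ffmpeg and compare the resulting file sizes. Assumes an ffmpeg build that
# includes both encoders and a local test clip "input.y4m" (both are assumptions,
# not taken from the paper).
import os
import subprocess

CLIP = "input.y4m"   # placeholder test sequence
CRF = 30             # illustrative constant-quality setting

jobs = {
    "av1.mkv": ["-c:v", "libaom-av1", "-crf", str(CRF), "-b:v", "0"],
    "vp9.webm": ["-c:v", "libvpx-vp9", "-crf", str(CRF), "-b:v", "0"],
}

for out_name, codec_args in jobs.items():
    # -y overwrites existing outputs; encoder speed settings are left at defaults
    subprocess.run(["ffmpeg", "-y", "-i", CLIP, *codec_args, out_name], check=True)
    print(f"{out_name}: {os.path.getsize(out_name)} bytes")
```

Note that a fair codec comparison matches decoded quality (for example equal PSNR, or a BD-rate average) rather than equal CRF, since CRF scales are not comparable across encoders.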

Journal ArticleDOI
26 Feb 2021
TL;DR: A technical overview of the AV1 codec design that enables the compression performance gains with considerations for hardware feasibility is provided.
Abstract: The AV1 video compression format is developed by the Alliance for Open Media consortium. It achieves more than a 30% reduction in bit rate compared to its predecessor VP9 for the same decoded video quality. This article provides a technical overview of the AV1 codec design that enables the compression performance gains with considerations for hardware feasibility.

95 citations
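Bitrate savings of this kind are conventionally reported as a Bjøntegaard-delta rate (BD-rate), which averages the bitrate difference between two codecs over a range of quality levels. Below is a minimal sketch of the standard BD-rate calculation; the rate/PSNR points are invented for illustration and are not from the article.

```python
# Bjontegaard-delta rate (BD-rate): average bitrate difference (%) between two
# codecs over overlapping quality levels, computed from per-codec (bitrate, PSNR)
# points. The sample numbers below are invented purely for illustration.
import numpy as np

def bd_rate(ref_rates, ref_psnr, test_rates, test_psnr):
    """Percent bitrate change of 'test' vs 'ref' at equal PSNR (negative = savings)."""
    log_ref, log_test = np.log(ref_rates), np.log(test_rates)
    # Fit log-rate as a cubic polynomial of PSNR for each codec
    p_ref = np.polyfit(ref_psnr, log_ref, 3)
    p_test = np.polyfit(test_psnr, log_test, 3)
    # Integrate both fits over the overlapping PSNR interval
    lo = max(min(ref_psnr), min(test_psnr))
    hi = min(max(ref_psnr), max(test_psnr))
    int_ref = np.polyval(np.polyint(p_ref), [lo, hi])
    int_test = np.polyval(np.polyint(p_test), [lo, hi])
    mean_ref = (int_ref[1] - int_ref[0]) / (hi - lo)
    mean_test = (int_test[1] - int_test[0]) / (hi - lo)
    return (np.exp(mean_test - mean_ref) - 1.0) * 100.0

# Illustrative rate-distortion points (kbps, dB) for a reference and a test codec
vp9_rates, vp9_psnr = [800, 1500, 3000, 6000], [34.0, 36.5, 39.0, 41.5]
av1_rates, av1_psnr = [550, 1050, 2100, 4300], [34.0, 36.5, 39.0, 41.5]
print(f"BD-rate: {bd_rate(vp9_rates, vp9_psnr, av1_rates, av1_psnr):.1f}%")  # about -30%
```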

Journal ArticleDOI
Dakai Jin, Krishna S. Iyer, Cheng Chen, Eric A. Hoffman, Punam K. Saha
TL;DR: A new robust and efficient curve skeletonization algorithm for three-dimensional (3-D) elongated fuzzy objects is presented, using a minimum cost path approach that avoids spurious branches without requiring post-pruning.

57 citations
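The abstract's key building block is a minimum cost path search through the fuzzy object. The toy 2-D sketch below shows only that building block (Dijkstra's algorithm on a grid whose step cost falls where the fuzzy membership is high, so the cheapest path hugs the object's interior); it is not the authors' 3-D skeletonization algorithm, and all names and parameters are illustrative.

```python
# Toy illustration of the minimum-cost-path building block: Dijkstra's algorithm
# on a 2-D grid where moving through high-membership (deep interior) pixels is
# cheap, so the cheapest path tends to run along the object's centerline.
import heapq
import numpy as np

def min_cost_path(membership, start, goal):
    """Return the minimum-cost 4-connected path between two pixels."""
    rows, cols = membership.shape
    cost = 1.0 / (membership + 1e-6)          # cheap where membership is high
    dist = np.full(membership.shape, np.inf)
    prev = {}
    dist[start] = 0.0
    heap = [(0.0, start)]
    while heap:
        d, (r, c) = heapq.heappop(heap)
        if (r, c) == goal:
            break
        if d > dist[r, c]:
            continue                          # stale heap entry
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols:
                nd = d + cost[nr, nc]
                if nd < dist[nr, nc]:
                    dist[nr, nc] = nd
                    prev[(nr, nc)] = (r, c)
                    heapq.heappush(heap, (nd, (nr, nc)))
    path, node = [], goal
    while node != start:                      # walk predecessors back to the start
        path.append(node)
        node = prev[node]
    return [start] + path[::-1]

# Example: a bright horizontal bar in a dark image; the path follows the bar
img = np.zeros((5, 9)); img[2, :] = 1.0
print(min_cost_path(img, (2, 0), (2, 8)))
```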

Journal ArticleDOI
TL;DR: Machine learning classifiers are compared for predicting osteoporotic bone fractures, highlighting the imaging features and anatomical regions that contribute most to prediction performance.
Abstract: BACKGROUND A current challenge in osteoporosis is identifying patients at risk of bone fracture. PURPOSE To identify the machine learning classifiers that best predict osteoporotic bone fractures and, from the data, to highlight the imaging features and the anatomical regions that contribute most to prediction performance. STUDY TYPE Prospective (cross-sectional) case-control study. POPULATION Thirty-two women with prior fragility bone fractures, of mean age = 61.6 and body mass index (BMI) = 22.7 kg/m2, and 60 women without fractures, of mean age = 62.3 and BMI = 21.4 kg/m2. FIELD STRENGTH/SEQUENCE 3D FLASH at 3T. ASSESSMENT Quantitative MRI outcomes by software algorithms. Mechanical and topological microstructural parameters of the trabecular bone were calculated for five femoral regions and added to the vector of features together with bone mineral density measurement, fracture risk assessment tool (FRAX) score, and personal characteristics such as age, weight, and height. We fitted 15 classifiers using 200 randomized cross-validation datasets. STATISTICAL TESTS Data: Kolmogorov-Smirnov test for normality. Model performance: sensitivity, specificity, precision, accuracy, F1-score, receiver operating characteristic (ROC) curve. Two-sided t-test, with P < 0.05 for statistical significance. RESULTS The top three performing classifiers were RUS-boosted trees (performing best with head data, F1 = 0.64 ± 0.03), logistic regression, and linear discriminant analysis (both best with trochanteric datasets, F1 = 0.65 ± 0.03 and F1 = 0.67 ± 0.03, respectively). A permutation of these classifiers comprised the best three performers for four out of five anatomical datasets. After averaging across all the anatomical datasets, the score for the best performer, the boosted trees, was F1 = 0.63 ± 0.03 for the All-features dataset, F1 = 0.52 ± 0.05 for the no-MRI dataset, and F1 = 0.48 ± 0.06 for the no-FRAX dataset. DATA CONCLUSION Of the many classifiers, RUS-boosted trees, logistic regression, and linear discriminant analysis were best at predicting osteoporotic fracture. Both MRI and FRAX independently add value in identifying osteoporotic fractures. The femoral head, greater trochanter, and inter-trochanter anatomical regions within the proximal femur yielded the better F1-scores for the best three classifiers. LEVEL OF EVIDENCE 2 TECHNICAL EFFICACY Stage 2. J. Magn. Reson. Imaging 2019;49:1029-1038.

53 citations
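The evaluation protocol in the abstract, fitting several classifiers and comparing cross-validated F1 scores, can be sketched with scikit-learn as below. The data are synthetic stand-ins for the MRI/FRAX feature vectors, and a plain gradient-boosting model stands in for RUS-boosted trees (a RUSBoost implementation is available separately in the imbalanced-learn package).

```python
# Sketch of the evaluation protocol: fit several classifiers on a feature vector
# and compare cross-validated F1 scores. The data are synthetic; the real study
# used MRI-derived trabecular-bone features, BMD, FRAX, and personal data.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import RepeatedStratifiedKFold, cross_val_score

# 92 subjects (roughly 32 fracture cases, 60 controls), illustrative feature count
X, y = make_classification(n_samples=92, n_features=20, weights=[0.65, 0.35],
                           random_state=0)

classifiers = {
    "boosted trees (stand-in for RUS-boost)": GradientBoostingClassifier(random_state=0),
    "logistic regression": LogisticRegression(max_iter=1000),
    "linear discriminant": LinearDiscriminantAnalysis(),
}

cv = RepeatedStratifiedKFold(n_splits=5, n_repeats=40, random_state=0)  # ~200 splits
for name, clf in classifiers.items():
    scores = cross_val_score(clf, X, y, scoring="f1", cv=cv)
    print(f"{name}: F1 = {scores.mean():.2f} +/- {scores.std():.2f}")
```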

Journal ArticleDOI
23 Feb 2020
TL;DR: A technical overview of key coding techniques in AV1 is provided and the coding performance gains are validated by video compression tests performed with the libaom AV1 encoder against the libvpx VP9 encoder.
Abstract: In 2018, the Alliance for Open Media (AOMedia) finalized its first video compression format, AV1, which was jointly developed by an industry consortium of leading video technology companies. The main goal of AV1 is to provide an open-source and royalty-free video coding format that substantially outperforms the state-of-the-art codecs available on the market in compression efficiency while maintaining practical decoding complexity, and that is optimized for hardware feasibility and scalability on modern devices. To give detailed insights into how the targeted performance and feasibility are realized, this paper provides a technical overview of key coding techniques in AV1. In addition, the coding performance gains are validated by video compression tests performed with the libaom AV1 encoder against the libvpx VP9 encoder. A preliminary comparison with two leading HEVC encoders, x265 and HM, and with the VVC reference software is also conducted on AOM's common test set and an open 4K set.

44 citations
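Encoder tests like the ones described above report quality against bitrate, most commonly via PSNR. A minimal frame-level PSNR sketch for 8-bit frames follows; the frames are random placeholders rather than decoded video.

```python
# Frame-level PSNR for 8-bit frames, the most common objective quality metric in
# such encoder comparisons. The frames below are random stand-ins, not real video.
import numpy as np

def psnr(reference: np.ndarray, distorted: np.ndarray, peak: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB between two same-sized frames."""
    mse = np.mean((reference.astype(np.float64) - distorted.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(peak ** 2 / mse)

ref = np.random.randint(0, 256, (1080, 1920), dtype=np.uint8)        # placeholder frame
dec = np.clip(ref.astype(int) + np.random.randint(-3, 4, ref.shape), 0, 255)
print(f"PSNR: {psnr(ref, dec):.2f} dB")
```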


Cited by
Journal ArticleDOI
TL;DR: Wang et al. proposed a semi-supervised deep learning approach to recover high-resolution (HR) CT images from low-resolution (LR) counterparts by enforcing cycle-consistency in terms of the Wasserstein distance.
Abstract: In this paper, we present a semi-supervised deep learning approach to accurately recover high-resolution (HR) CT images from low-resolution (LR) counterparts. Specifically, with the generative adversarial network (GAN) as the building block, we enforce the cycle-consistency in terms of the Wasserstein distance to establish a nonlinear end-to-end mapping from noisy LR input images to denoised and deblurred HR outputs. We also include the joint constraints in the loss function to facilitate structural preservation. In this process, we incorporate deep convolutional neural network (CNN), residual learning, and network in network techniques for feature extraction and restoration. In contrast to the current trend of increasing network depth and complexity to boost the imaging performance, we apply a parallel $1\times1$ CNN to compress the output of the hidden layer and optimize the number of layers and the number of filters for each convolutional layer. The quantitative and qualitative evaluation results demonstrate that our proposed model is accurate, efficient and robust for super-resolution (SR) image restoration from noisy LR input images. In particular, we validate our composite SR networks on three large-scale CT datasets, and obtain promising results as compared to the other state-of-the-art methods.

257 citations
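A distinctive design choice in the abstract is the parallel 1x1 CNN used to compress the hidden-layer output instead of simply deepening the network. The sketch below shows that channel-compression idea in PyTorch; the channel counts are illustrative and not taken from the paper.

```python
# Minimal PyTorch sketch of the 1x1-convolution "network in network" idea used to
# compress a hidden layer's channel dimension; channel counts are illustrative.
import torch
import torch.nn as nn

class ChannelCompressor(nn.Module):
    def __init__(self, in_channels: int = 64, out_channels: int = 16):
        super().__init__()
        # A 1x1 convolution mixes channels at each pixel without touching spatial size
        self.compress = nn.Conv2d(in_channels, out_channels, kernel_size=1)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.act(self.compress(x))

features = torch.randn(1, 64, 128, 128)   # hidden-layer feature maps (placeholder)
compressed = ChannelCompressor()(features)
print(compressed.shape)                    # torch.Size([1, 16, 128, 128])
```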

Journal ArticleDOI
TL;DR: In this article, a semi-supervised deep learning approach was proposed to recover high-resolution (HR) CT images from low-resolution (LR) counterparts by enforcing the cycle-consistency in terms of the Wasserstein distance to establish a nonlinear end-to-end mapping from noisy LR input images to denoised and deblurred HR outputs.
Abstract: Computed tomography (CT) is widely used in screening, diagnosis, and image-guided therapy for both clinical and research purposes. Since CT involves ionizing radiation, an overarching thrust of related technical research is development of novel methods enabling ultrahigh quality imaging with fine structural details while reducing the X-ray radiation. In this paper, we present a semi-supervised deep learning approach to accurately recover high-resolution (HR) CT images from low-resolution (LR) counterparts. Specifically, with the generative adversarial network (GAN) as the building block, we enforce the cycle-consistency in terms of the Wasserstein distance to establish a nonlinear end-to-end mapping from noisy LR input images to denoised and deblurred HR outputs. We also include the joint constraints in the loss function to facilitate structural preservation. In this deep imaging process, we incorporate deep convolutional neural network (CNN), residual learning, and network in network techniques for feature extraction and restoration. In contrast to the current trend of increasing network depth and complexity to boost the CT imaging performance, which limit its real-world applications by imposing considerable computational and memory overheads, we apply a parallel $1\times1$ CNN to compress the output of the hidden layer and optimize the number of layers and the number of filters for each convolutional layer. Quantitative and qualitative evaluations demonstrate that our proposed model is accurate, efficient and robust for super-resolution (SR) image restoration from noisy LR input images. In particular, we validate our composite SR networks on three large-scale CT datasets, and obtain promising results as compared to the other state-of-the-art methods.

242 citations

A. Jain
01 Sep 1976
TL;DR: The Karhunen-Loeve transform for a class of signals is proven to be a set of periodic sine functions, and this Karhunen-Loeve series expansion can be obtained via an FFT algorithm, which could be useful in data compression and other mean-square signal processing applications.
Abstract: The Karhunen-Loeve transform for a class of signals is proven to be a set of periodic sine functions, and this Karhunen-Loeve series expansion can be obtained via an FFT algorithm. The resulting fast algorithm could be useful in data compression and other mean-square signal processing applications.

211 citations
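The linear-algebra fact behind this result can be checked numerically. Roughly, Jain's class of signals corresponds to a boundary-adjusted first-order Markov model whose relevant matrix is tridiagonal Toeplitz, and any symmetric tridiagonal Toeplitz matrix is diagonalized exactly by a sampled sine basis, which is what makes a fast (FFT-based) KLT possible. The values of N and rho below are illustrative choices, not values from the paper.

```python
# Numerical check: a symmetric tridiagonal Toeplitz matrix (standing in for the
# boundary-adjusted first-order Markov model) is diagonalized exactly by a sine
# basis, so its KLT reduces to a fast sine transform. N and rho are illustrative.
import numpy as np

N, rho = 32, 0.95
n = np.arange(N)

# Tridiagonal Toeplitz matrix: (1 + rho^2) on the diagonal, -rho off the diagonal
Q = (1 + rho**2) * np.eye(N) - rho * (np.eye(N, k=1) + np.eye(N, k=-1))

# Its eigenvectors, i.e. the KLT basis for this class of signals
_, eigvecs = np.linalg.eigh(Q)

# Discrete sine basis: s_k[n] = sin(pi * (k+1) * (n+1) / (N+1)), columns normalized
sine_basis = np.sin(np.pi * np.outer(n + 1, n + 1) / (N + 1))
sine_basis /= np.linalg.norm(sine_basis, axis=0)

# Each eigenvector should coincide (up to sign) with one sine basis vector
match = np.abs(eigvecs.T @ sine_basis).max(axis=1)
print(f"worst |inner product| with sine basis: {match.min():.6f}")  # ~1.000000
```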

Posted Content
TL;DR: CompressAI is presented, a platform that provides custom operations, layers, models, and tools to research, develop, and evaluate end-to-end image and video compression codecs; it currently targets still-picture compression and is intended to be extended to the video compression domain soon.
Abstract: This paper presents CompressAI, a platform that provides custom operations, layers, models, and tools to research, develop, and evaluate end-to-end image and video compression codecs. In particular, CompressAI includes pre-trained models and evaluation tools to compare learned methods with traditional codecs. Multiple state-of-the-art models for learned end-to-end compression have thus been reimplemented in PyTorch and trained from scratch. We also report objective comparison results using PSNR and MS-SSIM metrics versus bit rate, using the Kodak image dataset as the test set. Although this framework currently implements models for still-picture compression, it is intended to be extended to the video compression domain soon.

175 citations
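A hedged usage sketch of the platform is below, following CompressAI's documented model-zoo interface; the function and dictionary keys are taken from the project's documentation and may differ across versions, and the image path is a placeholder.

```python
# Hedged sketch: evaluate a pretrained learned image codec from CompressAI's model
# zoo on one image, reporting bits per pixel and PSNR. API names follow the
# project's documentation; the image path is a placeholder.
import math
import torch
from PIL import Image
from torchvision import transforms
from compressai.zoo import bmshj2018_factorized   # pretrained factorized-prior model

net = bmshj2018_factorized(quality=3, pretrained=True).eval()

img = Image.open("kodim01.png").convert("RGB")     # e.g. a Kodak test image
x = transforms.ToTensor()(img).unsqueeze(0)        # shape (1, 3, H, W), values in [0, 1]

with torch.no_grad():
    out = net(x)                                   # returns x_hat and latent likelihoods

num_pixels = x.shape[2] * x.shape[3]
bpp = sum(torch.log(l).sum() for l in out["likelihoods"].values()) / (-math.log(2) * num_pixels)
mse = torch.mean((x - out["x_hat"].clamp(0, 1)) ** 2)
psnr = 10 * math.log10(1.0 / mse.item())
print(f"bpp = {bpp.item():.3f}, PSNR = {psnr:.2f} dB")
```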

Journal ArticleDOI
TL;DR: A skeletonization algorithm and a convolutional neural network (CNN) reduce the impact of shooting angle and environment on recognition performance and improve the accuracy of gesture recognition in complex environments.
Abstract: In the field of human-computer interaction, vision-based gesture recognition methods are widely studied. However, their recognition performance depends to a large extent on the recognition algorithm. A skeletonization algorithm and a convolutional neural network (CNN) are used to reduce the impact of shooting angle and environment on the recognition results and to improve the accuracy of gesture recognition in complex environments. To handle the influence of shooting angle on recognition of the same gesture, the skeletonization algorithm is optimized based on a layer-by-layer stripping concept, so that the key node information in the hand skeleton diagram is extracted. The gesture direction is determined by the spatial coordinate axes of the hand. Based on this, gesture segmentation is implemented to overcome the influence of the environment on the recognition results. To further improve the accuracy of gesture recognition, the ASK gesture database is used to train the convolutional neural network model. The experimental results show that, compared with the SVM method, dictionary learning + sparse representation, the plain CNN method, and other methods, the proposed approach achieves a recognition rate of 96.01%.

136 citations
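As a toy stand-in for the CNN classification stage described above, the PyTorch sketch below classifies small single-channel images such as binarized hand-skeleton masks. The architecture, input size, and number of gesture classes are illustrative placeholders, not the network from the paper.

```python
# Toy CNN classifier for small single-channel gesture images; architecture and
# class count are illustrative placeholders, not the paper's network.
import torch
import torch.nn as nn

class GestureCNN(nn.Module):
    def __init__(self, num_classes: int = 24):       # placeholder number of gestures
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.classifier = nn.Linear(32 * 16 * 16, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.features(x)                          # (B, 32, 16, 16) for 64x64 input
        return self.classifier(x.flatten(1))

# A batch of 64x64 single-channel images, e.g. binarized hand-skeleton masks
batch = torch.randn(8, 1, 64, 64)
logits = GestureCNN()(batch)
print(logits.shape)                                   # torch.Size([8, 24])
```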