scispace - formally typeset
Open AccessJournal ArticleDOI

On the Randomness of Compressed Data

Shmuel T. Klein, +1 more
- 07 Apr 2020 - 
- Vol. 11, Iss: 4, pp 196
Reads0
Chats0
TLDR
Evidence is presented here that arithmetic coding may produce an output that is identical to that of Huffman coding, and it is found that there is much variability in the randomness of the output of these techniques.
Abstract
It seems reasonable to expect from a good compression method that its output should not be further compressible, because it should behave essentially like random data. We investigate this premise for a variety of known lossless compression techniques, and find that, surprisingly, there is much variability in the randomness, depending on the chosen method. Arithmetic coding seems to produce perfectly random output, whereas that of Huffman or Ziv-Lempel coding still contains many dependencies. In particular, the output of Huffman coding has already been proven to be random under certain conditions, and we present evidence here that arithmetic coding may produce an output that is identical to that of Huffman.

read more

Citations
More filters

TestU01: A C Library for Empirical Testing of Random Number Generators

TL;DR: TestU01, a software library implemented in the ANSI C language, and offering a collection of utilities for the empirical statistical testing of uniform random number generators (RNGs), is introduced.
Posted Content

Weighted Adaptive Coding

TL;DR: This paper introduces a new generic coding method, extending the known static and dynamic variants of Huffman coding and including them as special cases, and presents empirical results that show improvements overstatic and dynamic Huffman and arithmetic coding achieved by the proposed method.
Journal ArticleDOI

Weighted forward looking adaptive coding

TL;DR: In this article , a new generic coding method, called positional coding, was proposed, which is adaptive in the sense that it uses changing statistics depending on the current position within the processed file, yet it behaves like static coding, as it assumes the knowledge of the distribution in the entire file.
Journal ArticleDOI

Second compression for pixelated images under edge-based compression algorithms: JPEG-LS as an example

TL;DR: This work utilized the JPEG-LS as an example of an edge-based compression algorithm for compressing pixelated images, and the output was subjected to another round of compression using a more robust but slower compressor (PAQ8f).
Proceedings ArticleDOI

Weakly Secure Coded Distributed Computing with Group-based Function Assignment

TL;DR: In this article , the tradeoff between computation and communication and avoid exchanged information leakage to eavesdroppers is studied in a distributed computing system where nodes are grouped such that nodes in the same group compute the same function.
References
More filters
Journal ArticleDOI

A universal algorithm for sequential data compression

TL;DR: The compression ratio achieved by the proposed universal code uniformly approaches the lower bounds on the compression ratios attainable by block-to-variable codes and variable- to-block codes designed to match a completely specified source.
Journal ArticleDOI

Compression of individual sequences via variable-rate coding

TL;DR: The proposed concept of compressibility is shown to play a role analogous to that of entropy in classical information theory where one deals with probabilistic ensembles of sequences rather than with individual sequences.
Journal ArticleDOI

Arithmetic coding for data compression

TL;DR: The state of the art in data compression is arithmetic coding, not the better-known Huffman method, which gives greater compression, is faster for adaptive models, and clearly separates the model from the channel encoding.
Journal ArticleDOI

A Technique for High-Performance Data Compression

Welch
- 01 Jun 1984 - 
TL;DR: A new compression algorithm is introduced that is based on principles not found in existing commercial methods in that it dynamically adapts to the redundancy characteristics of the data being compressed, and serves to illustrate system problems inherent in using any compression scheme.
Related Papers (5)