scispace - formally typeset
Journal ArticleDOI

Compression of individual sequences via variable-rate coding

TLDR
The proposed concept of compressibility is shown to play a role analogous to that of entropy in classical information theory where one deals with probabilistic ensembles of sequences rather than with individual sequences.
Abstract
Compressibility of individual sequences by the class of generalized finite-state information-lossless encoders is investigated. These encoders can operate in a variable-rate mode as well as a fixed-rate one, and they allow for any finite-state scheme of variable-length-to-variable-length coding. For every individual infinite sequence x a quantity \rho(x) is defined, called the compressibility of x , which is shown to be the asymptotically attainable lower bound on the compression ratio that can be achieved for x by any finite-state encoder. This is demonstrated by means of a constructive coding theorem and its converse that, apart from their asymptotic significance, also provide useful performance criteria for finite and practical data-compression tasks. The proposed concept of compressibility is also shown to play a role analogous to that of entropy in classical information theory where one deals with probabilistic ensembles of sequences rather than with individual sequences. While the definition of \rho(x) allows a different machine for each different sequence to be compressed, the constructive coding theorem leads to a universal algorithm that is asymptotically optimal for all sequences.

read more

Citations
More filters
Posted Content

Compression-Based Compressed Sensing

TL;DR: In this article, the performance of the compressible signal pursuit (CSP) algorithm is studied in this stochastic setting, and it is proved that in the very low distortion regime, as the blocklength grows to infinity, the CSP algorithm reliably and robustly recovers $n$ instances of a stationary process from random linear projections as long as their count is slightly more than the rate-distortion dimension (RDD) of the source.

Context Prediction in Pervasive Computing Systems: Achievements and Challenges.

TL;DR: In this paper, the authors develop and justify the principles of analysis and comparison of context-prediction methods, analyze the development in the area, compare different context prediction techniques to identify their benefits and shortcomings, and, finally, identify current challenges in the field and propose the solutions.
Patent

Compression/decompression techniques based on tokens and Huffman coding

TL;DR: In this article, the Huffman data is modified to account for the selection of the at least one bit string, and then compression is performed on the uncompressed data based on token replacements for the at- least one string.
Journal ArticleDOI

A Review of Methods for Estimating Algorithmic Complexity: Options, Challenges, and New Directions

TL;DR: Some established and also novel techniques in the field of applications of algorithmic (Kolmogorov) complexity currently co-exist for the first time and are here reviewed, ranging from dominant ones such as statistical lossless compression to newer approaches that advance, complement and also pose new challenges and may exhibit their own limitations.
Book ChapterDOI

Optimal partitions of strings: a new class of Burrows-Wheeler compression algorithms

TL;DR: A new class of Burrows-Wheeler algorithms that use optimal partitions of strings, rather than symbol ranking, for the additional step are provided, which is a quite surprising specialization to strings of partitioning techniques devised by Buchsbaum et al.
References
More filters
Book

Information Theory and Reliable Communication

TL;DR: This chapter discusses Coding for Discrete Sources, Techniques for Coding and Decoding, and Source Coding with a Fidelity Criterion.
Journal ArticleDOI

A universal algorithm for sequential data compression

TL;DR: The compression ratio achieved by the proposed universal code uniformly approaches the lower bounds on the compression ratios attainable by block-to-variable codes and variable- to-block codes designed to match a completely specified source.
Journal ArticleDOI

On the Complexity of Finite Sequences

TL;DR: A new approach to the problem of evaluating the complexity ("randomness") of finite sequences is presented, related to the number of steps in a self-delimiting production process by which a given sequence is presumed to be generated.
Journal ArticleDOI

Coding theorems for individual sequences

TL;DR: The finite-state complexity of a sequence plays a role similar to that of entropy in classical information theory (which deals with probabilistic ensembles of sequences rather than an individual sequence).
Journal ArticleDOI

On Information Lossless Automata of Finite Order

TL;DR: The application of the tests to finite deterministic automata is discussed and a method of constructing a decoder for a given finite automaton that is information lossless of finite order, is described.