scispace - formally typeset
Patent

Approximate string matching system and process for lossless data compression

Reads0
Chats0
TLDR
In this paper, an approximate string matching scheme was proposed for lossless data compression employing an entropy-based compression technique, where the residual data represents the difference between each value of an earlier occurring block of source data, whose location and length is identified by a pointer, and an equal-sized block of the source data associated with the pointer.
Abstract
A system and process for lossless data compression employing a unique approximate string matching scheme. The encoder of the system characterizes source data as a set of pointers and associated blocks of residual data. Each pointer identifies a location earlier in the source data, as well as the number of source data values associated with the identified location. The residual data represents the difference between each value of an earlier occurring block of source data, whose location and length is identified by a pointer, and an equal-sized block of source data associated with the pointer. The choice of a block of earlier occurring source data for use in forming a residual data block is based on a cost analysis which is designed to minimize the entropy of the differences between the previous block and the new block of source data to a desired degree. The encoded data, which will exhibit a significantly lower entropy, can be compressed effectively using an entropy-based compression technique. The decoder portion of the system operates by initially decompressing the encoded data. Next, the first data value is decoded by adding the first residual to a predetermined constant. Once the first data value has been decoded, subsequent data values are decoded by first finding the block in the previously decoded data indicated by a pointer, and then adding each data value in the block to its corresponding data element in the residual data block associated with the pointer. The process is repeated until all the data is decoded.

read more

Citations
More filters
Patent

Method and apparatus for efficient hardware based deflate

TL;DR: In this article, a method and apparatus provide for data compression with deflate block overhead reduction through the use of pseudo-dynamic Huffman codes, with inflation detection and mitigation capabilities.
Patent

Method of compression of binary data with a random number generator

TL;DR: A dictionary based method of lossless compression and encryption of small redundancy data was proposed in this paper, where a sequence of integer samples equivalent to a group of bits of input data was compared with samples of several dictionaries produced by random number generators.
Patent

PPM-based data compression

TL;DR: In this article, the authors proposed techniques that reduce the computational complexity of PPM-based data compression through use of certain simplifying assumptions that permit faster search for a close-to-optimal PPM model than conventional techniques.
Patent

Method and system for image compression and decompression using span of interest of an imaging sequence

TL;DR: In this paper, a system and method for image compression and decompression is described, which includes selecting a portion of image in a span of interest obtained from an acquired imaging sequence, applying lossless compression to the portion of the image and obtaining a compressed image sequence; and applying decompression to the compressed image and obtain therefrom an analytically relevant image sequence.
Patent

Optimization of decoder memory usage for VLC tables

TL;DR: In this paper, a variable length decoding system is described in which decode tables are analyzed and elements of the tables sorted by probability of occurrence storage of elements can be determined by probability-of-occurrence and a single large table is provided that cannot fit into decoder fast-memory.
References
More filters
Journal ArticleDOI

A universal algorithm for sequential data compression

TL;DR: The compression ratio achieved by the proposed universal code uniformly approaches the lower bounds on the compression ratios attainable by block-to-variable codes and variable- to-block codes designed to match a completely specified source.
Journal ArticleDOI

On the Complexity of Finite Sequences

TL;DR: A new approach to the problem of evaluating the complexity ("randomness") of finite sequences is presented, related to the number of steps in a self-delimiting production process by which a given sequence is presumed to be generated.
Journal ArticleDOI

A Technique for High-Performance Data Compression

Welch
- 01 Jun 1984 - 
TL;DR: A new compression algorithm is introduced that is based on principles not found in existing commercial methods in that it dynamically adapts to the redundancy characteristics of the data being compressed, and serves to illustrate system problems inherent in using any compression scheme.
Journal ArticleDOI

Data Compression Using Adaptive Coding and Partial String Matching

TL;DR: This paper describes how the conflict can be resolved with partial string matching, and reports experimental results which show that mixed-case English text can be coded in as little as 2.2 bits/ character with no prior knowledge of the source.
Journal ArticleDOI

Implementing the PPM data compression scheme

TL;DR: It is shown that the estimates made by Cleary and Witten of the resources required to implement the PPM scheme can be revised to allow for a tractable and useful implementation.
Related Papers (5)