scispace - formally typeset
Open AccessJournal ArticleDOI

Analysis of the average depth in a suffix tree under a Markov model

Reads0
Chats0
TLDR
In this article, it was shown that under a Markovian model of order one, the average depth of suffix trees of index n is asymptotically similar to the average depths of tries (a.k.a. digital trees) built on n independent strings.
Abstract
In this report, we prove that under a Markovian model of order one, the average depth of suffix trees of index n is asymptotically similar to the average depth of tries (a.k.a. digital trees) built on n independent strings. This leads to an asymptotic behavior of $(\log{n})/h + C$ for the average of the depth of the suffix tree, where $h$ is the entropy of the Markov model and $C$ is constant. Our proof compares the generating functions for the average depth in tries and in suffix trees; the difference between these generating functions is shown to be asymptotically small. We conclude by using the asymptotic behavior of the average depth in a trie under the Markov model found by Jacquet and Szpankowski ([JaSz91]).

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Efficient Periodicity Mining in Time Series Databases Using Suffix Trees

TL;DR: This paper presents an algorithm which can detect symbol, sequence (partial), and segment (full cycle) periodicity in time series and is noise resilient; it is generally more time-efficient and noise-resilient than existing algorithms.
Journal ArticleDOI

A Framework for Periodic Outlier Pattern Detection in Time-Series Sequences

TL;DR: This paper presents a robust and time efficient suffix tree-based algorithm capable of detecting the periodicity of outlier patterns in a time series by giving more significance to less frequent yet periodic patterns.
Journal ArticleDOI

Profiles of Tries

TL;DR: A detailed study of the distribution of the profiles in a trie built over random strings generated by a memoryless source finds typical behaviors of the height, shortest path, fill-up level, and depth.
Book

Analytic Pattern Matching

TL;DR: This book for researchers and graduate students demonstrates the probabilistic approach to pattern matching, which predicts the performance of pattern matching algorithms with very high precision using analytic combinatorics and analytic information theory.
Book ChapterDOI

Profile of tries

TL;DR: This work presents a detailed study of the distribution of the profiles in a trie built over strings generated by a memoryless source (extension to Markov sources is possible).
References
More filters
Journal ArticleDOI

A universal algorithm for sequential data compression

TL;DR: The compression ratio achieved by the proposed universal code uniformly approaches the lower bounds on the compression ratios attainable by block-to-variable codes and variable- to-block codes designed to match a completely specified source.
Journal ArticleDOI

Mellin transforms and asymptotics: harmonic sums

TL;DR: This survey presents a unified and essentially self-contained approach to the asymptotic analysis of a large class of sums that arise in combinatorial mathematics, discrete probabilistic models, and the average-case analysis of algorithms using the Mellin transform, a close relative of the integral transforms of Laplace and Fourier.
Journal ArticleDOI

On Pattern Frequency Occurrences in a Markovian Sequence

TL;DR: exact and asymptotic formulae for moments, probability of r pattern occurrences for three different regions of r, namely: (i) r=O(1) , (ii) central limit regime, and (iii) large deviations regime are presented.
Journal ArticleDOI

Analysis of digital tries with Markovian dependency

TL;DR: A complete characterization of a digital tree is presented from the depth viewpoint in a Markovian framework, that is, under the assumption that symbols in a key are Markov-dependent, and shows that D/sub n/ tends to the normal distribution in all cases except the symmetric independent model.
Journal ArticleDOI

Dynamical sources in information theory: Fundamental intervals and word prefixes

TL;DR: A quite general model of source that comes from dynamical systems theory is introduced, and the main tool is the generalized Ruelle operator, which can be viewed as a “generating” operator for fundamental intervals (associated to information sharing common prefixes).