scispace - formally typeset
M

Mikkel Thorup

Researcher at University of Copenhagen

Publications -  306
Citations -  17294

Mikkel Thorup is an academic researcher from University of Copenhagen. The author has contributed to research in topics: Time complexity & Hash function. The author has an hindex of 63, co-authored 297 publications receiving 16344 citations. Previous affiliations of Mikkel Thorup include Max Planck Society & University of Copenhagen Faculty of Science.

Papers
More filters
Proceedings ArticleDOI

On AC0 implementations of fusion trees and atomic heaps

TL;DR: The answer is "no" unless you have room for a multiplication table, and both fusion trees and Fredman and Willard's later atomic heaps can be implemented using AC0 operations on emerging multimedia processors such as the Pentium 4.
Proceedings ArticleDOI

Fast Similarity Sketching

TL;DR: In this article, the authors present a new sketch which obtains essentially the best of both worlds: a fast O(t log t + |A|) expected running time while getting the same strong concentration bounds as MinHash, and demonstrate the power of their new sketch by considering popular applications in large-scale classification with linear SVM as introduced by Li et al.
Proceedings ArticleDOI

Twisted tabulation hashing

TL;DR: A new tabulation-based hashing scheme called "twisted tabulation" is introduced, essentially as simple and fast as simple tabulation, but has some powerful distributional properties illustrating its promise.
Proceedings Article

Practical Hash Functions for Similarity Estimation and Dimensionality Reduction

TL;DR: An experimental comparison of different hashing schemes when used inside FH, OPH, and LSH finds that mixed tabulation hashing is almost as fast as the multiply-mod-prime scheme ax+b mod p, and Mutiply- mod-prime is guaranteed to work well on sufficiently random data, but it can lead to bias and poor concentration on both real-world and synthetic data.
Proceedings ArticleDOI

Static dictionaries on AC/sup 0/ RAMs: query time /spl theta/(/spl radic/log n/log log n) is necessary and sufficient

TL;DR: A tight upper and lower bound is obtained on the time for answering membership queries in a set of size n when reasonable space is used for the data structure storing the set; the upper bound can be obtained using O(n) space, and the lower bound holds even if the authors allow space 2/sup polylog n/.