Showing papers by "Kunihiko Sadakane" published in 2020

Journal Article•DOI•

Efficient query autocompletion with edit distance-based error tolerance

Jianbin Qin¹, Chuan Xiao²,³, Sheng Hu²,⁴, Jie Zhang, Wei Wang⁵, Yoshiharu Ishikawa², Koji Tsuda⁶, Kunihiko Sadakane⁶•Institutions (6)

Shenzhen University¹, Nagoya University², Osaka University³, Kyoto University⁴, University of New South Wales⁵, University of Tokyo⁶

01 Jul 2020

TL;DR: This paper proposes a neighborhood generation-based method for query autocompletion that tolerates errors in users' input under edit distance constraints; it maintains only a small set of active nodes, saving both space and time when processing a query.

Abstract: Query autocompletion is an important feature saving users many keystrokes from typing the entire query. In this paper, we study the problem of query autocompletion that tolerates errors in users' input using edit distance constraints. Previous approaches index data strings in a trie, and continuously maintain all the prefixes of data strings whose edit distances from the query string are within the given threshold. The major inherent drawback of these approaches is that the number of such prefixes is huge for the first few characters of the query string and is exponential in the alphabet size. This results in slow query response even if the entire query approximately matches only a few prefixes. We propose a novel neighborhood generation-based method to process error-tolerant query autocompletion. Our proposed method only maintains a small set of active nodes, thus saving both space and time to process the query. We also study efficient duplicate removal, a core problem in fetching query answers, and extend our method to support top-k queries. Optimization techniques are proposed to reduce the index size. The efficiency of our method is demonstrated through extensive experiments on real datasets.
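
To make the problem statement concrete, the following is a minimal brute-force sketch in Python of error-tolerant autocompletion: it returns the data strings that have at least one prefix within edit distance `tau` of the query. It is neither the neighborhood generation-based method proposed in the paper nor the trie-based approach of earlier work, only a reference baseline. The names `prefix_edit_distance`, `autocomplete`, and `tau` are hypothetical and chosen for illustration.

```python
def prefix_edit_distance(query: str, s: str) -> int:
    """Smallest edit distance between `query` and any prefix of `s`."""
    # prev[j] holds the edit distance between query[:i] and s[:j].
    prev = list(range(len(s) + 1))
    for i, qc in enumerate(query, start=1):
        cur = [i]
        for j, sc in enumerate(s, start=1):
            cur.append(min(
                prev[j] + 1,               # delete qc from the query
                cur[j - 1] + 1,            # insert sc into the query
                prev[j - 1] + (qc != sc),  # match or substitute
            ))
        prev = cur
    # The last row covers the full query against every prefix of s.
    return min(prev)


def autocomplete(query: str, data: list[str], tau: int) -> list[str]:
    """Brute-force baseline: scan every data string (assumed small input)."""
    return [s for s in data if prefix_edit_distance(query, s) <= tau]


if __name__ == "__main__":
    print(autocomplete("unv", ["university", "unix", "banana"], tau=1))
```

This scan costs time proportional to the total length of all data strings for every keystroke, which is the kind of cost that index-based methods such as the one in the paper aim to avoid.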

5 citations

Journal Article•DOI•

A linear-space data structure for range-LCP queries in poly-logarithmic time

Paniz Abedin¹, Arnab Ganguly², Wing-Kai Hon³, Kotaro Matsuda⁴, Yakov Nekrich⁵, Kunihiko Sadakane⁴, Rahul Shah⁶, Sharma V. Thankachan¹•Institutions (6)

University of Central Florida¹, University of Wisconsin–Whitewater², National Tsing Hua University³, University of Tokyo⁴, University of Waterloo⁵, Louisiana State University⁶

21 Apr 2020-Theoretical Computer Science

TL;DR: An $O(n)$-space data structure with query time $O(\log^{1+\epsilon} n)$ and construction time $O(n \log n)$ is presented, addressing the question of whether rlcp(·, ·) queries can be answered in poly-logarithmic time using a linear-space data structure.
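
The listing does not define the query itself. As a rough illustration, the sketch below assumes the usual definition from the range-LCP literature: rlcp(α, β) is the longest common prefix length over all pairs of distinct suffixes starting at positions in [α, β]. The quadratic scan and the function names `lcp` and `rlcp` are illustrative assumptions and bear no relation to the linear-space structure the paper describes.

```python
def lcp(text: str, i: int, j: int) -> int:
    """Length of the longest common prefix of text[i:] and text[j:]."""
    k = 0
    while i + k < len(text) and j + k < len(text) and text[i + k] == text[j + k]:
        k += 1
    return k


def rlcp(text: str, alpha: int, beta: int) -> int:
    """Naive range LCP over 0-based inclusive positions [alpha, beta].

    Assumed definition: max LCP over pairs of distinct suffix start
    positions in the range; returns 0 when the range has fewer than two.
    """
    return max(
        (lcp(text, i, j)
         for i in range(alpha, beta + 1)
         for j in range(i + 1, beta + 1)),
        default=0,
    )


if __name__ == "__main__":
    print(rlcp("abababc", 0, 6))
```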

5 citations

Book Chapter•DOI•

Optimal In-place Algorithms for Basic Graph Problems

Sankardeep Chakraborty¹, Kunihiko Sadakane², Srinivasa Rao Satti³•Institutions (3)

National Institute of Informatics¹, University of Tokyo², Seoul National University³

08 Jun 2020

TL;DR: The authors present linear-time in-place algorithms for several fundamental graph problems, including the well-known graph search methods (like depth-first search, breadth-first search, maximum cardinality search), connectivity problems (like biconnectivity, 2-edge connectivity), and decomposition problems (like chain decomposition), among various others, improving on the running time of the recent results of Chakraborty et al. [ESA, 2018].

Abstract: We present linear time in-place algorithms for several fundamental graph problems including the well-known graph search methods (like depth-first search, breadth-first search, maximum cardinality search), connectivity problems (like biconnectivity, 2-edge connectivity), decomposition problems (like chain decomposition) among various others, improving the running time (by a polynomial multiplicative factor) of the recent results of Chakraborty et al. [ESA, 2018], who designed $O(n^3 \lg n)$ time in-place algorithms for some of the above-mentioned problems. The running times of all our algorithms are essentially optimal as they run in linear time. One of the main ideas behind obtaining these algorithms is the detection and careful exploitation of sortedness present in the input representation for any graph without loss of generality. This observation alone is powerful enough to design some basic linear time in-place algorithms, but more non-trivial graph problems require extra techniques which, we believe, may find other applications while designing in-place algorithms for different graph problems in the future.

5 citations

Proceedings Article•

Compressed Orthogonal Search on Suffix Arrays with Applications to Range LCP.

Kotaro Matsuda¹, Kunihiko Sadakane¹, Tatiana Starikovskaya², Masakazu Tateshita•Institutions (2)

University of Tokyo¹, PSL Research University²

01 Jan 2020

TL;DR: A space-efficient data structure for orthogonal range search on suffix arrays which uses $O(\frac{1}{\epsilon} n (H_0 + 1))$ bits, where $H_0$ is the order-0 entropy of the string, and answers a counting query in $O(n^{\epsilon})$ time for any constant $\epsilon > 0$.

Abstract: We propose a space-efficient data structure for orthogonal range search on suffix arrays. For the general two-dimensional orthogonal range search problem on a set of n points, there exists an $n \log n (1 + o(1))$-bit data structure supporting $O(\log n)$-time counting queries [Mäkinen, Navarro 2007]. The space matches the information-theoretic lower bound. However, if we focus on a point set representing a suffix array, there is a chance to obtain a more space-efficient data structure. We answer this question affirmatively. Namely, we propose a data structure for orthogonal range search on suffix arrays which uses $O(\frac{1}{\epsilon} n (H_0 + 1))$ bits, where $H_0$ is the order-0 entropy of the string, and answers a counting query in $O(n^{\epsilon})$ time for any constant $\epsilon > 0$. As an application, we give an $O(\frac{1}{\epsilon} n (H_0 + 1))$-bit data structure for the range LCP problem.

2012 ACM Subject Classification: Theory of computation → Models of computation

5 citations

Posted Content•

Succinct Navigational Oracles for Families of Intersection Graphs on a Circle.

Hüseyin Acan¹, Sankardeep Chakraborty, Seungbum Jo², Kei Nakashima³, Kunihiko Sadakane³, Srinivasa Rao Satti⁴•Institutions (4)

Drexel University¹, Chungbuk National University², University of Tokyo³, Seoul National University⁴

09 Oct 2020-arXiv: Data Structures and Algorithms

TL;DR: A space lower bound is proved for representing trapezoid graphs, and a succinct navigational oracle is given for this class of graphs.

Abstract: We consider the problem of designing succinct navigational oracles, i.e., succinct data structures supporting basic navigational queries such as degree, adjacency, and neighborhood efficiently for intersection graphs on a circle, which include graph classes such as circle graphs, $k$-polygon-circle graphs, circle-trapezoid graphs, and trapezoid graphs. The degree query reports the number of incident edges to a given vertex, the adjacency query asks if there is an edge between two given vertices, and the neighborhood query enumerates all the neighbors of a given vertex. We first prove a general lower bound for these intersection graph classes and then present a uniform approach that lets us obtain matching lower and upper bounds for representing each of these graph classes. More specifically, our lower bound proofs use a unified technique to produce tight bounds for all these classes, and this is followed by our data structures which are also obtained from a unified representation method to achieve succinctness for each class. In addition, we prove a lower bound of space for representing trapezoid graphs and give a succinct navigational oracle for this class of graphs.
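
The three navigational queries can be illustrated on the simplest class in the list. The sketch below assumes a circle graph is given as a list of chords, each a pair of distinct endpoint positions around the circle, with two vertices adjacent exactly when their chords cross. It stores the endpoints plainly and scans all chords, so it is a reference baseline only and not the succinct oracle of the paper; the names `adjacent`, `neighborhood`, and `degree` are hypothetical.

```python
Chord = tuple[int, int]


def adjacent(chords: list[Chord], u: int, v: int) -> bool:
    """True if the chords of vertices u and v cross (assumed distinct endpoints)."""
    a, b = sorted(chords[u])
    c, d = sorted(chords[v])
    # Two chords cross exactly when their endpoints interleave on the circle.
    return a < c < b < d or c < a < d < b


def neighborhood(chords: list[Chord], u: int) -> list[int]:
    """Enumerate all neighbors of u by checking every other chord."""
    return [v for v in range(len(chords)) if v != u and adjacent(chords, u, v)]


def degree(chords: list[Chord], u: int) -> int:
    """Number of edges incident to u."""
    return len(neighborhood(chords, u))


if __name__ == "__main__":
    chords = [(0, 3), (1, 4), (2, 5), (6, 7)]
    print(neighborhood(chords, 0), degree(chords, 3))
```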

2 citations

Posted Content•

Storing Set Families More Compactly with Top ZDDs

Kotaro Matsuda¹, Shuhei Denzumi¹, Kunihiko Sadakane¹•Institutions (1)

University of Tokyo¹

09 Apr 2020-arXiv: Data Structures and Algorithms

TL;DR: The top ZDD extends the top tree, which compresses trees, to compress directed acyclic graphs by sharing identical subgraphs; it is proved that navigational operations on ZDDs can be done in time poly-logarithmic in ZDD size, and that there exist set families for which the top ZDD is exponentially smaller than the ZDD.

Abstract: Zero-suppressed Binary Decision Diagrams (ZDDs) are data structures for representing set families in a compressed form. With ZDDs, many valuable operations on set families can be done in time polynomial in ZDD size. In some cases, however, the ZDDs representing large set families become too large to store in main memory. This paper proposes the top ZDD, a novel representation of ZDDs which uses less space than existing ones. The top ZDD is an extension of the top tree, which compresses trees, to compress directed acyclic graphs by sharing identical subgraphs. We prove that navigational operations on ZDDs can be done in time poly-logarithmic in ZDD size, and show that there exist set families for which the size of the top ZDD is exponentially smaller than that of the ZDD. We also show experimentally that our top ZDDs have smaller size than ZDDs for real data.

2 citations

Proceedings Article•DOI•

Enumerating Range Modes

Kentaro Sumigawa¹, Sankardeep Chakraborty², Kunihiko Sadakane¹, Srinivasa Rao Satti³•Institutions (3)

University of Tokyo¹, National Institute of Informatics², Seoul National University³

01 Jan 2020

TL;DR: This paper considers the problem of indexing a sequence to support range mode queries (given a query range, find the element with maximum frequency in the range): given a sequence, it constructs a data structure that can be used later to process arbitrary queries.

Abstract: Given a sequence of elements, we consider the problem of indexing the sequence to support range mode queries: given a query range, find the element with maximum frequency in the range. We give indexing data structures for this problem; given a sequence, we construct a data structure that can be used later to process arbitrary queries. Our algorithms are efficient for small maximum frequency cases. We also consider a natural generalization of the problem: the range mode enumeration problem, for which there have been no known efficient algorithms. Our algorithms have query time complexities which are linear in the output size plus small terms.
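
As a small illustration of the two query types, the sketch below answers a range mode enumeration query by counting every element in the range; the ordinary range mode query is the special case of reporting any one of the returned elements. It builds no index and takes time linear in the range length per query, so it is only a naive baseline and not the indexing structure of the paper. The function name `range_modes` and the 0-based inclusive bounds are assumptions made for this example.

```python
from collections import Counter


def range_modes(seq: list[int], lo: int, hi: int) -> tuple[list[int], int]:
    """Return (all modes of seq[lo..hi], their frequency), bounds inclusive."""
    counts = Counter(seq[lo:hi + 1])
    if not counts:
        return [], 0  # empty range: no mode
    best = max(counts.values())
    # Every element reaching the maximum frequency is a mode of the range.
    modes = sorted(x for x, c in counts.items() if c == best)
    return modes, best


if __name__ == "__main__":
    seq = [3, 1, 3, 2, 2, 1]
    print(range_modes(seq, 0, 5))
    print(range_modes(seq, 0, 2))
```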

1 citation

Journal Article•DOI•

Compact and succinct data structures for multidimensional orthogonal range searching

Kazuki Ishiyama¹, Kunihiko Sadakane¹•Institutions (1)

University of Tokyo¹

01 Aug 2020-Information & Computation

TL;DR: Compact and succinct representations of a d-dimensional point set for any constant d ≥ 3 supporting orthogonal range searching are introduced and the algorithm runs fast in practical database search.

Abstract: We introduce compact and succinct representations of a d-dimensional point set for any constant d ≥ 3 supporting orthogonal range searching. Our first data structure uses $dn \lg n + o(n \lg n)$ bits, where n denotes the number of points in the point set P, and supports reporting queries in $O((n^{(d-2)/d} + \mathrm{occ}) \lg n / \lg \lg n)$ time and counting queries in $O(n^{(d-2)/d} \lg n / \lg \lg n)$ time, where occ denotes the number of points to report, which is faster than known algorithms. Our second data structure uses $dn \lg U - n \lg n + o(n \lg n)$ bits, where U is the size of the universe, which asymptotically matches the information-theoretic lower bound. The query time complexity is worse than the first one, but the algorithm runs fast in practical database search.
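
For readers unfamiliar with the two query types, the sketch below shows orthogonal range reporting and counting on a d-dimensional point set by a plain linear scan. It uses no compact or succinct encoding and takes time proportional to n·d per query, so it serves only as a reference for what the queries return, not as the data structures of the paper. The names `report` and `count` and the inclusive box bounds `lower` and `upper` are assumptions made for this example.

```python
Point = tuple[int, ...]


def report(points: list[Point], lower: Point, upper: Point) -> list[Point]:
    """Reporting query: all points p with lower[k] <= p[k] <= upper[k] for every k."""
    return [
        p for p in points
        if all(lo <= x <= hi for lo, x, hi in zip(lower, p, upper))
    ]


def count(points: list[Point], lower: Point, upper: Point) -> int:
    """Counting query: number of points inside the axis-aligned box."""
    return len(report(points, lower, upper))


if __name__ == "__main__":
    pts = [(1, 2, 3), (4, 5, 6), (2, 2, 2), (7, 1, 3)]
    print(report(pts, (1, 1, 1), (4, 5, 3)))
    print(count(pts, (1, 1, 1), (4, 5, 3)))
```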
