Showing papers by "Nikos Mamoulis published in 2019"

PDF

Open Access

Journal Article•DOI•

HHMF: hidden hierarchical matrix factorization for recommender systems

[...]

Hui Li¹, Yu Liu², Yuqiu Qian², Nikos Mamoulis³, Wenting Tu⁴, David W. Cheung² - Show less +2 more•Institutions (4)

Xiamen University¹, University of Hong Kong², University of Ioannina³, Shanghai University of Finance and Economics⁴

01 Nov 2019-Data Mining and Knowledge Discovery

TL;DR: A Hidden Hierarchical Matrix Factorization technique, which learns the hidden hierarchical structure from the user-item rating records, and outperforms existing methods, demonstrating that the discovery of latent hierarchical structures indeed improves the quality of recommendation.

...read moreread less

Abstract: Matrix factorization (MF) is one of the most powerful techniques used in recommender systems. MF models the (user, item) interactions behind historical explicit or implicit ratings. Standard MF does not capture the hierarchical structural correlations, such as publisher and advertiser in advertisement recommender systems, or the taxonomy (e.g., tracks, albums, artists, genres) in music recommender systems. There are a few hierarchical MF approaches, but they require the hierarchical structures to be known beforehand. In this paper, we propose a Hidden Hierarchical Matrix Factorization (HHMF) technique, which learns the hidden hierarchical structure from the user-item rating records. HHMF does not require the prior knowledge of hierarchical structure; hence, as opposed to existing hierarchical MF methods, HHMF can be applied when this information is either explicit or implicit. According to our extensive experiments, HHMF outperforms existing methods, demonstrating that the discovery of latent hierarchical structures indeed improves the quality of recommendation.

...read moreread less

15 citations

Proceedings Article•DOI•

Parallel In-Memory Evaluation of Spatial Joins

[...]

Dimitrios Tsitsigkos, Panagiotis Bouros¹, Nikos Mamoulis², Manolis Terrovitis•Institutions (2)

University of Mainz¹, University of Ioannina²

05 Nov 2019

TL;DR: In this article, the authors study the in-memory and parallel evaluation of spatial joins, by tuning a classic partitioning based algorithm, and show that compared to a straightforward implementation of the algorithm, performance can be improved significantly.

...read moreread less

Abstract: We study the in-memory and parallel evaluation of spatial joins, by tuning a classic partitioning based algorithm. Our study shows that, compared to a straightforward implementation of the algorithm, performance can be improved significantly. We also show how to select appropriate partitioning parameters based on data statistics, in order to tune the algorithm for the given join inputs. Our parallel implementation scales gracefully with the number of threads reducing the cost of the join to at most one second even for join inputs with tens of millions of rectangles.

...read moreread less

14 citations

Journal Article•DOI•

Spatial joins: what's next?

[...]

Panagiotis Bouros¹, Nikos Mamoulis²•Institutions (2)

University of Mainz¹, University of Ioannina²

05 Aug 2019-Sigspatial Special

TL;DR: The complexity of different data types, the consideration of different join predicates, the use of modern commodity hardware, and support for parallel processing open the road to a number of interesting directions for future research, some of which are outlined in the paper.

...read moreread less

Abstract: The spatial join is a popular operation in spatial database systems and its evaluation is a well-studied problem. This paper reviews research and recent trends on spatial join evaluation. The complexity of different data types, the consideration of different join predicates, the use of modern commodity hardware, and support for parallel processing open the road to a number of interesting directions for future research, some of which we outline in the paper.

...read moreread less

11 citations

Journal Article•DOI•

Evaluating pattern matching queries for spatial databases

[...]

Yixiang Fang¹, Yun Li², Reynold Cheng¹, Nikos Mamoulis³, Gao Cong⁴ - Show less +1 more•Institutions (4)

University of Hong Kong¹, Nanjing University², University of Ioannina³, Nanyang Technological University⁴

01 Oct 2019

TL;DR: It is proved that answering spatial pattern matching queries is computationally intractable and proposed algorithms to address two related problems of the SPM are highly effective and efficient.

...read moreread less

Abstract: In this paper, we study the spatial pattern matching (SPM) query. Given a set D of spatial objects (e.g., houses and shops), each with a textual description, we aim at finding all combinations of objects from D that match a user-defined spatial patternP. A pattern P is a graph whose vertices represent spatial objects, and edges denote distance relationships between them. The SPM query returns the instances that satisfy P. An example of P can be “a house within 10-min walk from a school, which is at least 2 km away from a hospital.” The SPM query can benefit users such as house buyers, urban planners, and archeologists. We prove that answering such queries is computationally intractable and propose two efficient algorithms for their evaluation. Moreover, we study efficient solutions to address two related problems of the SPM: (1) find top-k matches that are close to a query location and (2) return partial matches for a query pattern. Experiments and case studies on real datasets show that our proposed solutions are highly effective and efficient.

...read moreread less

9 citations

Journal Article•DOI•

SRX: efficient management of spatial RDF data

[...]

Konstantinos Theocharidis¹, John Liagouris², Nikos Mamoulis³, Panagiotis Bouros⁴, Manolis Terrovitis - Show less +1 more•Institutions (4)

University of Peloponnese¹, ETH Zurich², University of Ioannina³, University of Mainz⁴

01 Oct 2019

TL;DR: Compared to RDF-3X, SRX improves its performance for queries with spatial predicates while incurring little overhead during updates, and the results show SRX ’s superior performance over the competitors.

...read moreread less

Abstract: We present a general encoding scheme for the efficient management of spatial RDF data. The scheme approximates the geometries of the RDF entities inside their (integer) IDs and can be used, along with several operators and optimizations we introduce, to accelerate queries with spatial predicates and to re-encode entities dynamically in case of updates. We implement our ideas in SRX, a system built on top of the popular RDF-3X system. SRX extends RDF-3X with support for three types of spatial queries: range selections (e.g., find entities within a given polygon), spatial joins (e.g., find pairs of entities whose locations are close to each other), and spatial k-nearest neighbors (e.g., find the three closest entities from a given location). We evaluate SRX on spatial queries and updates with real RDF data, and we also compare its performance with the latest versions of three popular RDF stores. The results show SRX ’s superior performance over the competitors; compared to RDF-3X, SRX improves its performance for queries with spatial predicates while incurring little overhead during updates.

...read moreread less

8 citations

Posted Content•

An End-to-End Deep RL Framework for Task Arrangement in Crowdsourcing Platforms

[...]

Caihua Shan¹, Nikos Mamoulis², Reynold Cheng¹, Guoliang Li³, Xiang Li¹, Yuqiu Qian⁴ - Show less +2 more•Institutions (4)

University of Hong Kong¹, University of Ioannina², Tsinghua University³, Tencent⁴

04 Nov 2019-arXiv: Learning

TL;DR: Deep Q-Network (DQN) is proposed, an RL-based method combined with a neural network to estimate the expected long-term return of recommending a task and two DQNs are designed that capture the benefit of both workers and requesters and maximize the profit of the platform.

...read moreread less

Abstract: In this paper, we propose a Deep Reinforcement Learning (RL) framework for task arrangement, which is a critical problem for the success of crowdsourcing platforms. Previous works conduct the personalized recommendation of tasks to workers via supervised learning methods. However, the majority of them only consider the benefit of either workers or requesters independently. In addition, they cannot handle the dynamic environment and may produce sub-optimal results. To address these issues, we utilize Deep Q-Network (DQN), an RL-based method combined with a neural network to estimate the expected long-term return of recommending a task. DQN inherently considers the immediate and future reward simultaneously and can be updated in real-time to deal with evolving data and dynamic changes. Furthermore, we design two DQNs that capture the benefit of both workers and requesters and maximize the profit of the platform. To learn value functions in DQN effectively, we also propose novel state representations, carefully design the computation of Q values, and predict transition probabilities and future states. Experiments on synthetic and real datasets demonstrate the superior performance of our framework.

...read moreread less

5 citations

Proceedings Article•DOI•

Parallel In-Memory Evaluation of Spatial Joins

[...]

Dimitrios Tsitsigkos, Panagiotis Bouros¹, Nikos Mamoulis², Manolis Terrovitis•Institutions (2)

University of Mainz¹, University of Ioannina²

30 Aug 2019-arXiv: Databases

TL;DR: In this article, the authors study the in-memory and parallel evaluation of spatial joins, by re-designing a classic partitioning-based algorithm to consider alternative approaches for space partitioning.

...read moreread less

Abstract: The spatial join is a popular operation in spatial database systems and its evaluation is a well-studied problem. As main memories become bigger and faster and commodity hardware supports parallel processing, there is a need to revamp classic join algorithms which have been designed for I/O-bound processing. In view of this, we study the in-memory and parallel evaluation of spatial joins, by re-designing a classic partitioning-based algorithm to consider alternative approaches for space partitioning. Our study shows that, compared to a straightforward implementation of the algorithm, our tuning can improve performance significantly. We also show how to select appropriate partitioning parameters based on data statistics, in order to tune the algorithm for the given join inputs. Our parallel implementation scales gracefully with the number of threads reducing the cost of the join to at most one second even for join inputs with tens of millions of rectangles.

...read moreread less

3 citations

Posted Content•

A General Early-Stopping Module for Crowdsourced Ranking

[...]

Caihua Shan¹, Leong Hou U², Nikos Mamoulis³, Reynold Cheng¹, Xiang Li¹ - Show less +1 more•Institutions (3)

University of Hong Kong¹, University of Macau², University of Ioannina³

04 Nov 2019-arXiv: Databases

TL;DR: This work studies how a ranking process can be terminated early, and yet achieve a high-quality ranking and great savings in the budget, and proposes an early-stopping module that is better than other existing general stopping criteria.

...read moreread less

Abstract: Crowdsourcing can be used to determine a total order for an object set (e.g., the top-10 NBA players) based on crowd opinions. This ranking problem is often decomposed into a set of microtasks (e.g., pairwise comparisons). These microtasks are passed to a large number of workers and their answers are aggregated to infer the ranking. The number of microtasks depends on the budget allocated for the problem. Intuitively, the higher the number of microtask answers, the more accurate the ranking becomes. However, it is often hard to decide the budget required for an accurate ranking. We study how a ranking process can be terminated early, and yet achieve a high-quality ranking and great savings in the budget. We use statistical tools to estimate the quality of the ranking result at any stage of the crowdsourcing process and terminate the process as soon as the desired quality is achieved. Our proposed early-stopping module can be seamlessly integrated with most existing inference algorithms and task assignment methods. We conduct extensive experiments and show that our early-stopping module is better than other existing general stopping criteria. We also implement a prototype system to demonstrate the usability and effectiveness of our approach in practice.

...read moreread less

1 citations

Query Processing: Joins.

[...]

Nikos Mamoulis

01 Jan 2019