Luca Gagliardelli

Journal ArticleDOI

Three-dimensional Entity Resolution with JedAI

- 01 Nov 2020 -

TL;DR: JedAI is an open-source system that puts together a series of state-of-the-art ER techniques that have been proposed and examined independently, targeting parts of the ER end-to-end pipeline, a unique approach as no other ER tool brings together so many established techniques.

...read moreread less

Journal ArticleDOI

Reproducible experiments on Three-Dimensional Entity Resolution with JedAI

Georgios M. Mandilaras, +10 more

- 01 Dec 2021 -

Information Systems

TL;DR: JedAI as mentioned in this paper is an open-source Entity Resolution (ER) system that allows for building a large variety of end-to-end ER pipelines through a thorough experimental evaluation.

...read moreread less

Journal ArticleDOI

Scaling entity resolution: A loosely schema-aware approach

Giovanni Simonini, +3 more

- 01 Jul 2019 -

Information Systems

TL;DR: It is demonstrated how “loose” schema information can be exploited to enhance the quality of the blocks in a holistic loosely schema-aware (meta-)blocking approach that can be used to speed up your favorite Entity Resolution algorithm.

...read moreread less

Proceedings ArticleDOI

SparkER: Scaling Entity Resolution in Spark

Luca Gagliardelli, +3 more

TL;DR: The new version of SparkER, an ER tool that can scale practitioners’ favorite ER algorithms, and a supervised mode has been added, which can be assisted in supervising the entire process and in injecting his knowledge in order to achieve the best result.

...read moreread less

Journal ArticleDOI

BigBench Workload Executed by using Apache Flink

Sonia Bergamaschi, +3 more

- 01 Jan 2017 -

Procedia Manufacturing

TL;DR: This paper compares two of the most employed and promising frameworks to manage big data: Apache Flink and Apache Hive, which are general purpose distributed platforms under the umbrella of the Apache Software Foundation.

...read moreread less

Papers

Three-dimensional Entity Resolution with JedAI

Reproducible experiments on Three-Dimensional Entity Resolution with JedAI

Scaling entity resolution: A loosely schema-aware approach

SparkER: Scaling Entity Resolution in Spark

BigBench Workload Executed by using Apache Flink