
Showing papers presented at "Cross-Language Evaluation Forum in 2020"


Book ChapterDOI
22 Sep 2020
TL;DR: An overview of the third edition of the CheckThat! Lab at CLEF 2020, which featured five claim-verification tasks in English and Arabic.
Abstract: We present an overview of the third edition of the CheckThat! Lab at CLEF 2020. The lab featured five tasks in two different languages: English and Arabic. The first four tasks compose the full pipeline of claim verification in social media: Task 1 on check-worthiness estimation, Task 2 on retrieving previously fact-checked claims, Task 3 on evidence retrieval, and Task 4 on claim verification. The lab is completed with Task 5 on check-worthiness estimation in political debates and speeches. A total of 67 teams registered to participate in the lab (up from 47 at CLEF 2019), and 23 of them actually submitted runs (compared to 14 at CLEF 2019). Most teams used deep neural networks based on BERT, LSTMs, or CNNs, and achieved sizable improvements over the baselines on all tasks. Here we describe the setup of the tasks, the evaluation results, and a summary of the approaches used by the participants, and we discuss some lessons learned. Last but not least, we release to the research community all datasets from the lab as well as the evaluation scripts, which should enable further research in the important tasks of check-worthiness estimation and automatic claim verification.

55 citations


Book ChapterDOI
22 Sep 2020
TL;DR: An overview of the eighth annual edition of the Conference and Labs of the Evaluation Forum (CLEF) eHealth evaluation lab is provided, describing the resources created for its two tasks and the evaluation methodology adopted.
Abstract: In this paper, we provide an overview of the eighth annual edition of the Conference and Labs of the Evaluation Forum (CLEF) eHealth evaluation lab. CLEF eHealth 2020 continues our development of evaluation tasks and resources since 2012 to address laypeople's difficulties in retrieving and digesting valid and relevant information in their preferred language to make health-centred decisions. This year's lab advertised two tasks. Task 1 on Information Extraction (IE) was new and focused on automatic clinical coding of diagnosis and procedure codes from the tenth revision of the International Statistical Classification of Diseases and Related Health Problems (ICD-10), as well as finding the corresponding evidence text snippets, for clinical case documents in Spanish. Task 2 on Information Retrieval (IR) was a novel extension of the most popular and established task in CLEF eHealth, Consumer Health Search (CHS). In total, 55 submissions were made to these tasks. Herein, we describe the resources created for the two tasks and the evaluation methodology adopted. We also summarize lab submissions and results. As in previous years, the organizers have made data and tools associated with the lab tasks available for future research and development. The ongoing substantial community interest in the tasks and their resources has led to CLEF eHealth maturing as a primary venue for all interdisciplinary actors of the ecosystem for producing, processing, and consuming electronic health information.

39 citations


Book ChapterDOI
22 Sep 2020
TL;DR: The ARQMath Lab at CLEF considers finding answers to new mathematical questions among posted answers on a community question answering site (Math Stack Exchange), which includes a formula retrieval sub-task.
Abstract: The ARQMath Lab at CLEF considers finding answers to new mathematical questions among posted answers on a community question answering site (Math Stack Exchange). Queries are question posts held out from the searched collection, each containing both text and at least one formula. This is a challenging task, as both math and text may be needed to find relevant answer posts. ARQMath also includes a formula retrieval sub-task: individual formulas from question posts are used to locate formulae in earlier question and answer posts, with relevance determined considering the context of the post from which a query formula is taken, and the posts in which retrieved formulae appear.

33 citations


Book ChapterDOI
22 Sep 2020
TL;DR: A condensed report on Touché, the first shared task on argument retrieval, held at CLEF 2020 with two tasks: supporting individuals in finding arguments on socially important topics and supporting individuals with arguments on everyday personal decisions.
Abstract: This paper is a condensed report on Touché: the first shared task on argument retrieval that was held at CLEF 2020. With the goal of creating a collaborative platform for research in argument retrieval, we ran two tasks: (1) supporting individuals in finding arguments on socially important topics and (2) supporting individuals with arguments on everyday personal decisions.

33 citations


Book ChapterDOI
22 Sep 2020
TL;DR: The 2020 edition of the LifeCLEF campaign proposes four data-oriented challenges related to the identification and prediction of biodiversity: cross-domain plant identification based on herbarium sheets, bird species recognition in audio soundscapes, location-based prediction of species based on environmental and occurrence data, and SnakeCLEF.
Abstract: Building accurate knowledge of the identity, the geographic distribution and the evolution of species is essential for the sustainable development of humanity, as well as for biodiversity conservation. However, the difficulty of identifying plants and animals in the field is hindering the aggregation of new data and knowledge. Identifying and naming living plants or animals is almost impossible for the general public and is often difficult even for professionals and naturalists. Bridging this gap is a key step towards enabling effective biodiversity monitoring systems. The LifeCLEF campaign, presented in this paper, has been promoting and evaluating advances in this domain since 2011. The 2020 edition proposes four data-oriented challenges related to the identification and prediction of biodiversity: (i) PlantCLEF: cross-domain plant identification based on herbarium sheets, (ii) BirdCLEF: bird species recognition in audio soundscapes, (iii) GeoLifeCLEF: location-based prediction of species based on environmental and occurrence data, and (iv) SnakeCLEF: snake identification based on image and geographic location.

32 citations


Book ChapterDOI
22 Sep 2020
TL;DR: The eRisk 2020 edition, as discussed by the authors, featured two tasks: the early detection of signs of self-harm, and automatically filling in a depression questionnaire based on user interactions in social media.
Abstract: This paper provides an overview of eRisk 2020, the fourth edition of this lab under the CLEF conference. The main purpose of eRisk is to explore issues of evaluation methodology, effectiveness metrics and other processes related to early risk detection. Early detection technologies can be employed in different areas, particularly those related to health and safety. This edition of eRisk had two tasks. The first task focused on the early detection of signs of self-harm. The second task challenged the participants to automatically fill in a depression questionnaire based on user interactions in social media.

31 citations


Book ChapterDOI
22 Sep 2020
TL;DR: SberQuAD, as discussed by the authors, is a large Russian reading comprehension (RC) dataset created similarly to English SQuAD; it contains about 50K question-paragraph-answer triples and is seven times larger than its closest competitor.
Abstract: The paper presents SberQuAD, a large Russian reading comprehension (RC) dataset created similarly to English SQuAD. SberQuAD contains about 50K question-paragraph-answer triples and is seven times larger than its closest competitor. We provide its description, thorough analysis, and baseline experimental results. We scrutinized various aspects of the dataset that can have an impact on task performance: question/paragraph similarity, misspellings in questions, answer structure, and question types. We applied five popular RC models to SberQuAD and analyzed their performance. We believe our work makes an important contribution to research in multilingual question answering.

30 citations


Book ChapterDOI
01 Jan 2020
TL;DR: An overview of the first edition of HIPE (Identifying Historical People, Places and other Entities), a pioneering shared task dedicated to the evaluation of named entity processing on historical newspapers in French, German and English. Its objective is threefold: strengthening the robustness of existing approaches on non-standard inputs, enabling performance comparison of NE processing on historical texts, and fostering efficient semantic indexing of historical documents.
Abstract: This paper presents an overview of the first edition of HIPE (Identifying Historical People, Places and other Entities), a pioneering shared task dedicated to the evaluation of named entity processing on historical newspapers in French, German and English. Since its introduction some twenty years ago, named entity (NE) processing has become an essential component of virtually any text mining application and has undergone major changes. Recently, two main trends characterise its developments: the adoption of deep learning architectures and the consideration of textual material originating from historical and cultural heritage collections. While the former opens up new opportunities, the latter introduces new challenges with heterogeneous, historical and noisy inputs. In this context, the objective of HIPE, run as part of the CLEF 2020 conference, is threefold: strengthening the robustness of existing approaches on non-standard inputs, enabling performance comparison of NE processing on historical texts, and, in the long run, fostering efficient semantic indexing of historical documents. Tasks, corpora, and results of 13 participating teams are presented.

30 citations


Proceedings ArticleDOI
25 Sep 2020
TL;DR: This paper presents an extended overview of the first edition of HIPE (Identifying Historical People, Places and other Entities), a pioneering shared task dedicated to the evaluation of named entity processing on historical newspapers in French, German and English.
Abstract: This paper presents an extended overview of the first edition of HIPE (Identifying Historical People, Places and other Entities), a pioneering shared task dedicated to the evaluation of named entity processing on historical newspapers in French, German and English. Since its introduction some twenty years ago, named entity (NE) processing has become an essential component of virtually any text mining application and has undergone major changes. Recently, two main trends characterise its developments: the adoption of deep learning architectures and the consideration of textual material originating from historical and cultural heritage collections. While the former opens up new opportunities, the latter introduces new challenges with heterogeneous, historical and noisy inputs. In this context, the objective of HIPE, run as part of the CLEF 2020 conference, is threefold: strengthening the robustness of existing approaches on non-standard inputs, enabling performance comparison of NE processing on historical texts, and, in the long run, fostering efficient semantic indexing of historical documents. Tasks, corpora, and results of 13 participating teams are presented. Compared to the condensed overview [31], this paper includes further details about data generation and statistics, additional information on participating systems, and the presentation of complementary results.

29 citations


Book ChapterDOI
22 Sep 2020
TL;DR: The four shared tasks organized as part of the PAN 2020 evaluation lab on digital text forensics and authorship analysis attracted 230 registrations, yielding 83 successful submissions, marking a good start to the second decade of PAN evaluation labs.
Abstract: We briefly report on the four shared tasks organized as part of the PAN 2020 evaluation lab on digital text forensics and authorship analysis. Each task is introduced and motivated, and the results obtained are presented. Altogether, the four tasks attracted 230 registrations, yielding 83 successful submissions. This, and the fact that we continue to invite the submission of software rather than its run output using the TIRA experimentation platform, marks a good start to the second decade of PAN evaluation labs.

27 citations


Book ChapterDOI
22 Sep 2020
TL;DR: A set of feature extraction and transformation methods used in conjunction with ensemble classifiers for the PAN 2019 Author Profiling task: the bot identification subtask uses user behaviour fingerprints and statistical diversity measures, while the gender identification subtask uses text statistics and raw words.
Abstract: Social bots are automated programs that generate a significant amount of social media content. This content can be harmful, as it may target a certain audience to influence opinions, often politically motivated, or to promote individuals to appear more popular than they really are. We proposed a set of feature extraction and transformation methods in conjunction with ensemble classifiers for the PAN 2019 Author Profiling task. For the bot identification subtask we used user behaviour fingerprints and statistical diversity measures, while for the gender identification subtask we used a set of text statistics, as well as syntactic information and raw words.
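As a rough illustration of the kind of statistical diversity measures mentioned above (the paper's exact feature set is not specified here, so the function name and the two measures chosen are assumptions), a type-token ratio and a Shannon-entropy feature over a user's token stream could be computed as follows:

```python
from collections import Counter
from math import log2

def diversity_features(tokens):
    """Two illustrative diversity measures over a user's tokens:
    type-token ratio (lexical richness) and Shannon entropy of the
    empirical token distribution (repetitive bots tend to score low)."""
    counts = Counter(tokens)
    n = len(tokens)
    ttr = len(counts) / n
    entropy = -sum((c / n) * log2(c / n) for c in counts.values())
    return {"type_token_ratio": ttr, "entropy": entropy}

feats = diversity_features(["rt", "rt", "rt", "great", "deal"])
```

A highly repetitive feed yields a low type-token ratio and low entropy, which is the sort of signal an ensemble classifier could combine with other features.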

Proceedings Article
22 Sep 2020
TL;DR: This paper describes the methodology of the conducted evaluation as well as the synthesis of the main results and lessons learned in the development of reliable detection systems for avian vocalizations in continuous soundscape data.
Abstract: Passive acoustic monitoring is a cornerstone of the assessment of ecosystem health and the improvement of automated assessment systems has the potential to have a transformative impact on global biodiversity monitoring, at a scale and level of detail that is impossible with manual annotation or other more traditional methods. The BirdCLEF challenge, as part of the 2020 LifeCLEF Lab [12], focuses on the development of reliable detection systems for avian vocalizations in continuous soundscape data. The goal of the task is to localize and identify all audible birds within the provided soundscape test set. This paper describes the methodology of the conducted evaluation as well as the synthesis of the main results and lessons learned.

Book ChapterDOI
22 Sep 2020
TL;DR: An overview of the ImageCLEF 2020 lab, organized as part of the Conference and Labs of the Evaluation Forum (CLEF) Labs 2020, whose four main tasks include a new Internet task addressing the problem of identifying hand-drawn user interface components.
Abstract: This paper presents an overview of the ImageCLEF 2020 lab that was organized as part of the Conference and Labs of the Evaluation Forum - CLEF Labs 2020. ImageCLEF is an ongoing evaluation initiative (first run in 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval of visual data with the aim of providing information access to large collections of images in various usage scenarios and domains. In 2020, the 18th edition of ImageCLEF runs four main tasks: (i) a medical task that groups three previous tasks, i.e., caption analysis, tuberculosis prediction, and medical visual question answering and question generation, (ii) a lifelog task (videos, images and other sources) about daily activity understanding, retrieval and summarization, (iii) a coral task about segmenting and labeling collections of coral reef images, and (iv) a new Internet task addressing the problems of identifying hand-drawn user interface components. Despite the current pandemic situation, the benchmark campaign received a strong participation with over 40 groups submitting more than 295 runs.

Proceedings ArticleDOI
17 Jul 2020
TL;DR: The participation of the L3i laboratory of the University of La Rochelle in the Identifying Historical People, Places, and other Entities (HIPE) evaluation campaign of CLEF 2020 relies on two neural models, one for named entity recognition and classification (NERC) and another one for entity linking (EL).
Abstract: This paper summarizes the participation of the L3i laboratory of the University of La Rochelle in the Identifying Historical People, Places, and other Entities (HIPE) evaluation campaign of CLEF 2020. Our participation relies on two neural models, one for named entity recognition and classification (NERC) and another one for entity linking (EL). We carefully pre-processed inputs to mitigate their flaws, notably in terms of segmentation. Our submitted runs cover all languages (English, French, and German) and sub-tasks proposed in the lab: NERC, end-to-end EL, and EL-only. Our submissions obtained top performance in 50 out of the 52 scoreboards proposed by the lab organizers. In further detail, out of 70 runs submitted by 13 participants, our approaches obtained the best score for all metrics in all three languages both for NERC and for end-to-end EL. It also obtained the best score for all metrics in French and German for EL-only.

Book ChapterDOI
22 Sep 2020
TL;DR: The Cheminformatics Elsevier Melbourne University (ChEMU) evaluation lab 2020 as discussed by the authors focused on information extraction over chemical reactions from patent texts using the ChEMU corpus of 1500 “snippets” (text segments) sampled from 170 patent documents and annotated by chemical experts.
Abstract: In this paper, we provide an overview of the Cheminformatics Elsevier Melbourne University (ChEMU) evaluation lab 2020, part of the Conference and Labs of the Evaluation Forum 2020 (CLEF2020). The ChEMU evaluation lab focuses on information extraction over chemical reactions from patent texts. Using the ChEMU corpus of 1500 “snippets” (text segments) sampled from 170 patent documents and annotated by chemical experts, we defined two key information extraction tasks. Task 1 addresses chemical named entity recognition, the identification of chemical compounds and their specific roles in chemical reactions. Task 2 focuses on event extraction, the identification of reaction steps, relating the chemical compounds involved in a chemical reaction. Herein, we describe the resources created for these tasks and the evaluation methodology adopted. We also provide a brief summary of the participants of this lab and the results obtained across 46 runs from 11 teams, finding that several submissions achieve substantially better results than our baseline methods.


Proceedings Article
01 Jan 2020
TL;DR: The LifeCLEF 2020 Plant Identification challenge was designed to evaluate to what extent automated identification on the flora of data deficient regions can be improved by the use of herbarium collections.
Abstract: Automated identification of plants has improved considerably thanks to the recent progress in deep learning and the availability of training data with more and more photos in the field. However, this profusion of data only concerns a few tens of thousands of species, mostly located in North America and Western Europe, much less in the richest regions in terms of biodiversity such as tropical countries. On the other hand, for several centuries, botanists have collected, catalogued and systematically stored plant specimens in herbaria, particularly in tropical regions, and the recent efforts by the biodiversity informatics community made it possible to put millions of digitized sheets online. The LifeCLEF 2020 Plant Identification challenge (or "PlantCLEF 2020") was designed to evaluate to what extent automated identification on the flora of data-deficient regions can be improved by the use of herbarium collections. It is based on a dataset of about 1,000 species mainly focused on South America's Guiana Shield, an area known to have one of the greatest plant diversities in the world. The challenge was evaluated as a cross-domain classification task where the training set consists of several hundred thousand herbarium sheets and a few thousand photos to enable learning a mapping between the two domains. The test set was exclusively composed of photos in the field. This paper presents the resources and assessments of the conducted evaluation, summarizes the approaches and systems employed by the participating research groups, and provides an analysis of the main outcomes.

Book ChapterDOI
22 Sep 2020
TL;DR: Describes the methods implemented and submitted to the Concept Detection 2019 task, where the best performance was achieved with a deep learning method called ConceptCXN, and shows that retrieval-based methods can perform very well in this task when combined with deep learning image encoders.
Abstract: Radiologists and other qualified physicians need to examine and interpret large numbers of medical images daily. Systems that would help them spot and report abnormalities in medical images could speed up diagnostic workflows. Systems that would help exploit past diagnoses made by highly skilled physicians could also benefit their more junior colleagues. A task that systems can perform towards this end is medical image classification, which assigns medical concepts to images. This task, called Concept Detection, was part of the ImageCLEF 2019 competition. We describe the methods we implemented and submitted to the Concept Detection 2019 task, where we achieved the best performance with a deep learning method we call ConceptCXN. We also show that retrieval-based methods can perform very well in this task, when combined with deep learning image encoders. Finally, we report additional post-competition experiments we performed to shed more light on the performance of our best systems. Our systems can be installed through PyPI as part of the BioCaption package.
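As a hedged sketch of the retrieval-based approach mentioned above (not the BioCaption implementation; the helper names, the top-k value, and the union-of-concepts rule are assumptions), nearest training images under an encoder embedding can simply donate their concept labels to a query image:

```python
def cosine(a, b):
    # cosine similarity between two embedding vectors
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(y * y for y in b) ** 0.5
    return dot / (na * nb)

def retrieve_concepts(query_vec, train_vecs, train_concepts, k=2):
    """Rank training images by embedding similarity to the query image
    and return the union of the concepts of the top-k neighbours."""
    ranked = sorted(range(len(train_vecs)),
                    key=lambda i: cosine(query_vec, train_vecs[i]),
                    reverse=True)
    concepts = set()
    for i in ranked[:k]:
        concepts |= train_concepts[i]
    return concepts
```

The quality of such label transfer depends almost entirely on how well the image encoder clusters medically similar images, which is why the combination with deep learning encoders matters.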

Proceedings Article
01 Jan 2020
TL;DR: This work makes use of the Few-Shot Adversarial Domain Adaptation method proposed by Motiian et al. (9) to tackle the classification of plant images in the field, based on a dataset composed mainly of herbaria.
Abstract: This paper describes a submission to the PlantCLEF 2020 challenge, whose topic was the classification of plant images in the field, based on a dataset composed mainly of herbaria. This work proposes using domain adaptation techniques to tackle the problem. In particular, it makes use of the Few-Shot Adversarial Domain Adaptation method proposed by Motiian et al. (9). Additionally, a modification of this architecture is proposed to take advantage of upper-taxa relations between species in the dataset. The experiments performed show that domain adaptation can provide very significant increases in accuracy when compared with traditional CNN-based approaches.

Proceedings Article
05 Nov 2020
TL;DR: This paper describes the methods implemented in the context of the GeoLifeCLEF 2020 machine learning challenge, which aims to advance the state-of-the-art in location-based species recommendation on a very large dataset of 1.9 million species observations paired with high-resolution remote sensing imagery, land cover data, and altitude.
Abstract: This paper describes the methods that we have implemented in the context of the GeoLifeCLEF 2020 machine learning challenge. The goal of this challenge is to advance the state-of-the-art in location-based species recommendation on a very large dataset of 1.9 million species observations, paired with high-resolution remote sensing imagery, land cover data, and altitude. We provide a detailed description of the algorithms and methodology, developed by the LIRMM / Inria team, in order to facilitate the understanding and reproducibility of the obtained results.

Proceedings Article
22 Sep 2020
TL;DR: An overview of the GeoLifeCLEF 2020 competition is presented, which highlights the ability of remote sensing imagery and convolutional neural networks to improve predictive performance, complementary to traditional approaches.
Abstract: Understanding the geographic distribution of species is a key concern in conservation. By pairing species occurrences with environmental features, researchers can model the relationship between an environment and the species which may be found there. To advance the state-of-the-art in this area, a large-scale machine learning competition called GeoLifeCLEF 2020 was organized. It relied on a dataset of 1.9 million species observations paired with high-resolution remote sensing imagery, land cover data, and altitude, in addition to traditional low-resolution climate and soil variables. This paper presents an overview of the competition, synthesizes the approaches used by the participating groups, and analyzes the main results. In particular, we highlight the ability of remote sensing imagery and convolutional neural networks to improve predictive performance, complementary to traditional approaches.

Proceedings Article
01 Jan 2020
TL;DR: This paper describes the submission to the CLEF HIPE 2020 shared task on identifying named entities in multi-lingual historical newspapers in French, German and English, and uses an ensemble of fine-tuned BERT models for named entity recognition and entity linking.
Abstract: This paper describes our submission to the CLEF HIPE 2020 shared task on identifying named entities in multi-lingual historical newspapers in French, German and English. The subtasks we addressed in our submission include coarse-grained named entity recognition, entity mention detection and entity linking. For the task of named entity recognition we used an ensemble of fine-tuned BERT models; entity linking was approached by three different methods: (1) a simple method relying on ElasticSearch retrieval scores, (2) an approach based on contextualised text embeddings, and (3) REL, a modular entity linking system based on several state-of-the-art components.

Proceedings Article
01 Jan 2020
TL;DR: This paper describes the proposed solution for the Profiling Fake News Spreaders on Twitter shared task at PAN 2020, based on modeling both types of users according to four main types of characteristics, i.e. stylometry, personality, emotions and feed embeddings.
Abstract: This paper describes our proposed solution for the Profiling Fake News Spreaders on Twitter shared task at PAN 2020 [23]. The task consists in determining whether the author of a given set of Twitter posts is a fake news spreader or not, for both the English and Spanish languages. The proposed approach is based on modeling both types of users according to four main types of characteristics, i.e. stylometry, personality, emotions and feed embeddings. Our system achieved an accuracy of 60% on the English dataset and 72% on the Spanish one.

Book ChapterDOI
22 Sep 2020
TL;DR: LiLAS, as discussed by the authors, opens up two academic search platforms so that participating researchers can evaluate their systems in a Docker-based research environment under real-world conditions.
Abstract: Academic Search is a timeless challenge that the field of Information Retrieval has been dealing with for many years. Even today, the search for academic material is a broad field of research that recently started working on problems like the COVID-19 pandemic. However, test collections and specialized data sets like CORD-19 only allow for system-oriented experiments, while the evaluation of algorithms in real-world environments is only available to researchers from industry. In LiLAS, we open up two academic search platforms to allow participating researchers to evaluate their systems in a Docker-based research environment. This overview paper describes the motivation, the infrastructure, and the two systems, LIVIVO and GESIS Search, that are part of this CLEF lab.

Book ChapterDOI
22 Sep 2020
TL;DR: This paper promotes 2AIRTC, a monolingual Amharic IR test collection built for the IR community, which consists of 12,583 documents, 240 topics and the corresponding relevance judgments.
Abstract: Evaluation is highly important for designing, developing, and maintaining information retrieval (IR) systems. The IR community has developed shared tasks where evaluation frameworks, evaluation measures and test collections have been developed for different languages. Although Amharic is the official language of Ethiopia, which currently has an estimated population of over 110 million, it is one of the under-resourced languages, and there is no Amharic ad hoc IR test collection to date. In this paper, we promote the monolingual Amharic IR test collection that we built for the IR community. Following the framework of the Cranfield project and TREC, the collection, which we named 2AIRTC, consists of 12,583 documents, 240 topics and the corresponding relevance judgments.

Book ChapterDOI
22 Sep 2020
TL;DR: Presents a recurrent neural network model that learns a sentence encoding from which a check-worthiness score is predicted, trained by jointly optimizing a binary cross entropy loss and a ranking-based pairwise hinge loss.
Abstract: Check-worthiness detection aims at predicting which sentences should be prioritized for fact-checking. A typical use is to rank sentences in political debates and speeches according to their degree of check-worthiness. We present the first direct optimization of sentence ranking for check-worthiness; in contrast, all previous work has solely used standard classification-based loss functions. We present a recurrent neural network model that learns a sentence encoding, from which a check-worthiness score is predicted. The model is trained by jointly optimizing a binary cross entropy loss and a ranking-based pairwise hinge loss. We obtain sentence pairs for training through contrastive sampling, where for each sentence we find the most semantically similar sentences with the opposite label. Through a comparison to existing state-of-the-art check-worthiness methods, we find that our approach improves the MAP score by 11%.
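The joint objective described in this abstract can be sketched roughly as follows (an illustrative reconstruction, not the authors' code; the margin value and the use of raw scores before the sigmoid are assumptions): binary cross entropy on each sentence of a contrastively sampled pair, plus a pairwise hinge on the score difference.

```python
from math import exp, log

def joint_loss(score_pos, score_neg, margin=1.0):
    """Illustrative joint loss for a (check-worthy, non-check-worthy)
    sentence pair: binary cross entropy on each sentence's sigmoid
    probability, plus a pairwise hinge pushing the check-worthy
    sentence's score at least `margin` above its partner's."""
    sigmoid = lambda z: 1.0 / (1.0 + exp(-z))
    bce = -log(sigmoid(score_pos)) - log(1.0 - sigmoid(score_neg))
    hinge = max(0.0, margin - (score_pos - score_neg))
    return bce + hinge
```

The hinge term is what turns a plain classifier into a ranker: it is zero only once the check-worthy sentence outscores its sampled partner by the margin.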

Book ChapterDOI
22 Sep 2020
TL;DR: Presents a comparison, in the CLEF eHealth Technology Assisted Review (TAR) task, of a Continuous Active Learning approach that uses either a fixed amount or a variable amount of resources according to the size of the pool.
Abstract: Systematic reviews are scientific investigations that use strategies to include a comprehensive search of all potentially relevant articles and the use of explicit, reproducible criteria in the selection of articles for review. As time and resources are limited for compiling a systematic review, limits to the search are needed. In this paper, we describe the stopping strategy that we have designed and refined over three years of participation in the CLEF eHealth Technology Assisted Review task. In particular, we present a comparison of a Continuous Active Learning approach that uses either a fixed amount or a variable amount of resources according to the size of the pool. The results show that our approach performs on average much better than any other participant in the CLEF 2019 eHealth TAR task. Nevertheless, a failure analysis allows us to understand the weak points of this approach and possible future directions.
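A minimal sketch of a Continuous Active Learning loop with a review budget (purely illustrative; the batch size, the 30% rate, and the omission of retraining between rounds are simplifications, not the authors' system):

```python
def continuous_active_learning(pool, score, budget, batch_size=2):
    """Repeatedly rank the remaining pool with the current scoring
    function and 'review' the top batch until the budget is spent.
    In a real CAL system the labels collected for each batch would
    be used to retrain `score` before the next round; here it is fixed."""
    reviewed = []
    remaining = list(pool)
    while remaining and len(reviewed) < budget:
        remaining.sort(key=score, reverse=True)
        batch = remaining[:batch_size]
        reviewed.extend(batch)
        remaining = remaining[batch_size:]
    return reviewed

def review_budget(pool_size, fixed=1000, rate=0.3, variable=True):
    """The fixed-vs-variable resource choice compared in the paper,
    sketched with hypothetical numbers: either a constant budget or
    a fraction of the pool size, capped at the pool itself."""
    return min(int(pool_size * rate) if variable else fixed, pool_size)
```

The variable budget scales screening effort to the topic: small pools stop early, large pools get proportionally more reviews.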

Book ChapterDOI
22 Sep 2020
TL;DR: Ground-truth creation is one of the most demanding activities in terms of time, effort, and resources needed for creating an experimental collection and crowdsourcing has emerged as a viable option to reduce the costs and time invested in it.
Abstract: Ground-truth creation is one of the most demanding activities in terms of time, effort, and resources needed for creating an experimental collection. For this reason, crowdsourcing has emerged as a viable option to reduce the costs and time invested in it.

Proceedings Article
23 Sep 2020
TL;DR: It is shown that combining several word representations enhances the quality of the results for all NE types and that segmentation into sentences has an important impact on the results.
Abstract: In this article we present the approaches developed by the Sorbonne-INRIA for NER (SinNer) team for the CLEF-HIPE 2020 challenge on Named Entity Processing on old newspapers. The challenge proposed various tasks for three languages; among them, we focused on Named Entity Recognition in French and German texts. The best system we proposed ranked third for these two languages; it uses FastText embeddings and ELMo language models (FrELMo and German ELMo). We show that combining several word representations enhances the quality of the results for all NE types and that segmentation into sentences has an important impact on the results.
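Combining several word representations, as the abstract reports, often comes down to concatenating the per-token vectors before the tagger; a minimal sketch (the function name and the concatenation strategy are assumptions, not the team's code):

```python
def combine_representations(*vectors):
    """Concatenate several per-token embedding vectors (e.g. a static
    FastText vector and a contextual ELMo vector) into one feature
    vector, so the downstream NER tagger sees both views of the token."""
    combined = []
    for v in vectors:
        combined.extend(v)
    return combined
```

With standard dimensionalities (300 for FastText, 1024 for an ELMo layer), each token would then be represented by a 1324-dimensional vector.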

Book ChapterDOI
22 Sep 2020
TL;DR: Two approaches for identifying protest events in news in English are presented and it is shown that developing dedicated architectures and models for each task outperforms simpler solutions based on the propagation of labels from lexical items to documents.
Abstract: 2019 has been characterized by worldwide waves of protests. Each country's protests are different, but there appear to be common factors. In this paper we present two approaches for identifying protest events in news in English. Our goal is to provide political science and discourse analysis scholars with tools that may facilitate the understanding of this ongoing phenomenon. We test our approaches against the ProtestNews Lab 2019 benchmark that challenges systems to perform unsupervised domain adaptation on protest events on three sub-tasks: document classification, sentence classification, and event extraction. Results indicate that developing dedicated architectures and models for each task outperforms simpler solutions based on the propagation of labels from lexical items to documents. Furthermore, we complete the description of our systems with a detailed data analysis to shed light on the limits of the methods.