Open Access Book
Actes de la conférence Traitement Automatique de la Langue Naturelle, TALN 2018
Anne-Laure Ligozat,Peggy Cellier,Anne-Lyse Minard,Vincent Claveau,Cyril Grouin,Patrick Paroubek +5 more
TLDR
This article presents an information extraction method that collects additional information from the web to enrich existing information and then populate a knowledge base, using lexical and syntactic patterns.
Abstract:
Relation pattern extraction and information extraction from the web. This article presents an information extraction method that collects additional information from the web to enrich existing information and then populate a knowledge base. Our method is based on lexical and syntactic patterns, used both as search queries and as extraction patterns, which allows the analysis of unstructured documents. To do so, we first defined relevant criteria drawn from the analysis phase to ease the discovery of new values. KEYWORDS: Pattern construction, information extraction, named entity extraction, dependency syntax, extraction pattern learning, web as corpus.
Citations
Synthesis Lectures on Human Language Technologies
Ido Dagan,Dan Roth,Mark Sammons,Fabio Massimo Zanzotto,Web Corpus Construction,Roland Schäfer,Felix Bildhauer +6 more
TL;DR: This book gives a comprehensive view of state-of-the-art techniques that are used to build spoken dialogue systems and presents dialogue modelling and system development issues relevant in both academic and industrial environments and also discusses requirements and challenges for advanced interaction management and future research.
Journal Article
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Journal ArticleDOI
Sémiologie Graphique: les diagrammes, les réseaux, les cartes
David Bickmore,Jacques Bertin +1 more
Journal ArticleDOI
Automatic Text Simplification. Horacio Saggion (Universitat Pompeu Fabra). Morgan & Claypool (Synthesis Lectures on Human Language Technologies, edited by Graeme Hirst, volume 37), 2017, xvi+121 pp; paperback, ISBN 978-1-62705-868-1; ebook, ISBN 978-1-62705-869-8; doi:10.2200/S00700ED1V01Y201602HLT032
References
Proceedings ArticleDOI
Simplifying Lexical Simplification: Do We Need Simplified Corpora?
Goran Glavaš,Sanja Štajner +1 more
TL;DR: This work presents an unsupervised approach to lexical simplification that makes use of the most recent word vector representations and requires only regular corpora, and is as effective as systems that rely on simplified corpora.
Proceedings ArticleDOI
It Depends: Dependency Parser Comparison Using A Web-based Evaluation Tool
TL;DR: A comparative analysis of ten leading statistical dependency parsers on a multi-genre corpus of English is presented, and a new web-based tool is developed that gives a convenient way of comparing dependency parser outputs.
Journal ArticleDOI
A study of the effects of preprocessing strategies on sentiment analysis for Arabic text
Rehab Duwairi,Mahmoud El-Orfali +1 more
TL;DR: Results show that the selection of preprocessing strategies on the reviews increases the performance of the classifiers, and the effects of the characteristics of the dataset on sentiment analysis were analysed.
Journal ArticleDOI
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
Bryan A. Plummer,Liwei Wang,Christopher M. Cervantes,Juan C. Caicedo,Julia Hockenmaier,Svetlana Lazebnik +5 more
TL;DR: This paper presents the Flickr30k Entities dataset, which augments the 158k captions from Flickr30k with 244k coreference chains, linking mentions of the same entities across different captions for the same image and associating them with 276k manually annotated bounding boxes.
Proceedings ArticleDOI
Illinois-LH: A Denotational and Distributional Approach to Semantics
Alice Lai,Julia Hockenmaier +1 more
TL;DR: This paper describes and analyzes the SemEval 2014 Task 1 system, whose features are based on distributional and denotational similarities; word alignment; negation; and hypernym/hyponym, synonym, and antonym relations.