Aitor Soroa
Researcher at University of the Basque Country
Publications - 109
Citations - 4376
Aitor Soroa is an academic researcher at the University of the Basque Country. The author has contributed to research on topics including WordNet and computer science, has an h-index of 24, and has co-authored 96 publications receiving 3551 citations. Previous affiliations of Aitor Soroa include the National University of Distance Education and the Polytechnic University of Catalonia.
Papers
Proceedings Article
BasqueGLUE: A Natural Language Understanding Benchmark for Basque
TL;DR: This paper presents BasqueGLUE, the first NLU benchmark for Basque, a less-resourced language. The benchmark was built from previously existing datasets, following criteria similar to those used for the construction of GLUE and SuperGLUE.
Proceedings Article
Principled Paraphrase Generation with Parallel Corpora
TL;DR: This paper formalizes the implicit similarity function induced by round-trip machine translation and shows that it is susceptible to non-paraphrase pairs that share a single ambiguous translation. It then designs an alternative similarity metric that mitigates this issue by requiring the entire translation distribution to match, and implements a relaxation of that metric through the Information Bottleneck method.
Proceedings Article
Information seeking in digital cultural heritage with PATHS
Mark M. Hall, Paul Clough, Samuel Fernando, Paula Goodale, Mark Stevenson, Eneko Agirre, Arantxa Otegi, Aitor Soroa, Kate Fernie, Jillian R. Griffiths, Runar Bergheim +10 more
TL;DR: This demonstration presents the second PATHS system, which provides exploration, analysis, and sense-making features to support the full information-seeking process.
Journal Article
MLDS: A translator-oriented MultiLingual dictionary system
TL;DR: This paper describes the model adopted for representing multilingual dictionary knowledge, which allows richer exploitation of the lexical-semantic relations extracted from dictionaries.
Journal Article
Elhisa: An architecture for the integration of heterogeneous lexical information
Xabier Artola, Aitor Soroa +1 more
TL;DR: This paper presents the ELHISA system, a software architecture for the integration of heterogeneous lexical information. Five resources covering a broad scope have been integrated into it so far, showing the suitability of the approach taken.