Globalex Workshop on Linked Lexicography (GWLL 2020, @LREC 2020)
Full-day workshop at LREC2020 | Marseille, France | May 12, 2020
Submission deadline: February 23, 2020 (see also Important dates in Call for Papers)
Workshop website: https://globalex.link/events/workshops/globalex-workshop-2020/
Workshop Proceedings
Workshop Description
The GLOBALEX 2020 Workshop @ LREC will follow up on the successful GLOBALEX workshops at LREC 2016 (https://globalex2016.globalex.link/) and LREC 2018 (https://globalex2018.globalex.link/). It is organized by Globalex – Global Alliance for Lexicography (https://globalex.link/), with support from:
- ELEXIS (EU’s H2020-funded project European Lexicography Infrastructure, https://elex.is/)
- TIAD (Translation Inference Across Dictionaries shared tasks and workshops, https://tiad2020.unizar.es/, https://tiad2019.unizar.es/, https://tiad2017.wordpress.com/)
This third iteration of GLOBALEX workshops at LREC will focus on linking data from lexicographic resources and will highlight aspects related to the automated linking of content among different dictionaries and other lexical sources, in the aim of enhancing linguistic data generation, enrichment and reinforcement.
Linking lexicographic data sets to each other and to other lexical resources, and in particular the interoperability of lexicography with Linked Data (LD) methodologies, have been gaining substantial attention in recent years, becoming a subject of various projects for research by and collaboration between academia and industry, including support of the public sector. Most notably, the W3C community group on Ontology-Lexica [1] was established following the release of the lemon model, which constituted the first de-facto standard for representing ontology-lexica, with the mission to “develop models for the representation of lexica (and machine readable dictionaries) relative to ontologies” [2]. The ensuing OntoLex-lemon model [3], [4] has served since 2016 as the leading option for conversion of lexicographic data into LD, and has recently been updated with the lexicog module [5] released on 17 September 2019 [6]. This trend has been complemented since 2015 by relevant literature (e.g. [7], [8], [9]), conference papers (e.g. [10], [11], [12], [13]) and EU-funded projects ([14] and [15], [16]).
Besides a section including general research papers, the workshop will include two shared task tracks – one on linking monolingual data and the other on linking bilingual and multilingual data, as follows:
- Monolingual Word Sense Alignment – in conjunction with a shared task conducted by ELEXIS.
Task 1 will be evaluated on novel dictionary linking data developed by the ELEXIS project [15], which will cover linking for the following languages: Danish, Dutch, English, Estonian, German, Hungarian, Irish, Italian, Serbian, Slovene and Russian.
More details can be found here: https://competitions.codalab.org/competitions/22163 - Linking Bilingual and Multilingual Lexicographic Resources – in conjunction with the 3rd TIAD shared task
Task 2 will host the 3rd edition of the Translation Inference Across Dictionaries (TIAD) shared task, of which previous editions were co-located at Language, Data and Knowledge conferences [17], [18]. The aim is to explore methods and techniques for automatically generating new bilingual (and multilingual) dictionaries from existing ones in the context of a coherent experiment framework that enables reliable validation of results and solid comparison of the processes used. In particular, the participating systems will be asked to generate new translations automatically among three languages, English, French, Portuguese, based on known translations contained in the Apertium RDF graph [19]. The inclusion of other language pairs will also be possible for this edition.
Organizers
ORGANIZERS
- Ilan Kernerman, K Dictionaries
- Simon Krek, Globalex, Jožef Stefan Institute
TRACK 1 ORGANIZERS
- John McCrae, National University of Ireland – Galway
- Sina Ahmadi, National University of Ireland – Galway
TRACK 2 ORGANIZERS
- Jorge Gracia, University of Zaragoza
- Besim Kabashi, Friedrich-Alexander University of Erlangen-Nuremberg and Ludwig-Maximilian University of Munich
Program Committee
Anna Braasch. University of Copenhagen, Denmark
Sara Carvalho. University of Aveiro, Portugal
Philip Cimiano. University of Bielefeld, Germany
Rute Costa, Universidade Nova de Lisboa, Portugal
Thierry Fontenelle. European Investment Bank, Luxembourg
Radovan Garabik. Ľ. Štúr Institute of Linguistics, Slovakia
Jorge Gracia, University of Zaragoza, Spain
Dagmar Gromann. University of Vienna, Austria
Ales Horak. Masaryk University, Czech Republic
Besim Kabashi. Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
Ilan Kernerman. K Dictionaries, Israel
Iztok Kosem. Jožef Stefan Institute, Slovenia
Simon Krek. Jožef Stefan Institute, Slovenia
Nikola Ljubešić. Jožef Stefan Institute, Slovenia
Dorielle Lonke. K Dictionaries, Israel
Patricia Martín Chozas. Madrid Polytechnic University, Spain
John Philip McCrae. National University of Ireland Galway, Ireland
Krzysztof Nowak. Institute of Polish Language, Poland
Maciej Piaceski. Wroclaw University of Science and Technology, Poland
Carole Tiberius. Instituut voor Nederlandse Lexicologie, Netherlands
Lars Trap-Jensen. Society for Danish Language and Literature, Denmark
Marieke van Erp. KNAW Humanities Cluster, Netherlands