Cracking a Walnut with a Sledgehammer: XLM-RoBERTa for German Verbal Idiom Disambiguation Tasks

2021 | conference paper. A publication with affiliation to the University of Göttingen.


Cite this publication

Cracking a Walnut with a Sledgehammer: XLM-RoBERTa for German Verbal Idiom Disambiguation Tasks
Pannach, F. & Dönicke, T. (2021)
Proceedings of the Shared Task on the Disambiguation of German Verbal Idioms at KONVENS 2021. Shared Task on the Disambiguation of German Verbal Idioms @ KONVENS 2021, Düsseldorf, Germany.
Zenodo. DOI: https://doi.org/10.5281/ZENODO.5769286

Documents & Media

License

GRO License

Details

Authors
Pannach, Franziska; Dönicke, Tillmann 
Abstract
This paper describes our submission to the Shared Task on the Disambiguation of German Verbal Idioms at KONVENS 2021, including the team's efforts to extend the training data semi-automatically. The disambiguation task was solved using XLM-RoBERTa, which delivered the best results, with an F1 score of 0.76 on all tested non-idiomatic instances in the test set. The baseline model, a linear SVM, achieves an F1 score of 0.55. Furthermore, additional data was collected to enhance the training data set with respect to the literal use of idiomatic expressions. While the baseline model improves slightly with the additional training data, the XLM-RoBERTa model performs better when only the core training data is provided.
Issue Date
2021
Publisher
Zenodo
Conference
Shared Task on the Disambiguation of German Verbal Idioms @ KONVENS 2021
Conference Place
Düsseldorf, Germany
Event start
2021-09-06
Event end
2021-09-09
Language
English
