Cracking a Walnut with a Sledgehammer: XLM-RoBERTa for German Verbal Idiom Disambiguation Tasks
2021 | conference paper. A publication with affiliation to the University of Göttingen.
Jump to: Cite & Linked | Documents & Media | Details | Version history
Cite this publication
Cracking a Walnut with a Sledgehammer: XLM-RoBERTa for German Verbal Idiom Disambiguation Tasks
Pannach, F. & Dönicke, T. (2021)
Proceedings of the Shared Task on the Disambiguation of German Verbal Idioms at KONVENS 2021. Shared Task on the Disambiguation of German Verbal Idioms @ KONVENS 2021, Düsseldorf, Germany.
Zenodo. DOI: https://doi.org/10.5281/ZENODO.5769286
Documents & Media
Details
- Authors
- Pannach, Franziska; Dönicke, Tillmann
- Abstract
- This paper describes the efforts in solving the Shared Task on the Disambiguation of German Verbal Idioms at KONVENS 2021. It presents the team's efforts to extend the training data semi-automatically. The disambigua- tion task was solved using XLM-RoBERTa, which delivered the best results with 0.76 f1- Score on all tested non-idiomatic instances in the test set. The baseline model, a linear SVM, achieves 0.55 f1-Score. Furthermore, additional data was collected to enhance the training data set with respect to literal use of idiomatic expressions. While the baseline model improves slightly with additional training data, the XLM-RoBERTa model performs better when only the core training data is provided.
- Issue Date
- 2021
- Publisher
- Zenodo
- Conference
- Shared Task on the Disambiguation of German Verbal Idioms @ KONVENS 2021
- Conference Place
- Düsseldorf, Germany
- Event start
- 2021-09-06
- Event end
- 2021-09-09
- Language
- English