Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen: http://dx.doi.org/10.18419/opus-14135
Langanzeige der Metadaten
DC ElementWertSprache
dc.contributor.authorHülsing, Anna-
dc.date.accessioned2024-03-27T11:20:09Z-
dc.date.available2024-03-27T11:20:09Z-
dc.date.issued2023de
dc.identifier.other1884955630-
dc.identifier.urihttp://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-141548de
dc.identifier.urihttp://elib.uni-stuttgart.de/handle/11682/14154-
dc.identifier.urihttp://dx.doi.org/10.18419/opus-14135-
dc.description.abstractState-of-the-art metaphor detection (MD) models achieve human-like performance for English data, while studies on MD for low-resource languages are currently missing. This thesis explores cross-lingual approaches that harness data from English, a high-resource language, in order to classify data in the target languages of Russian, German and Latin, either without using training data from the target languages or with as little as 20 instances. These instances were taken from the test data, but could also be created manually due to the small amount of annotating effort. The experiments indicate that the neural cross-lingual models mBERT (zero- and few-shot classification) and mBERT-based MAD-X perform well for German and Russian, while for languages where little data was used to pretrain mBERT, non-neural cross-lingual models with vector space model and conceptual features (abstractness, supersenses) outperform the mBERT-based models, if default hyperparameters are used. No validation data in the target languages was available for performing hyperparameter-tuning. Therefore, as a byproduct it was discovered that, while using a source language dataset for validation leads to overfitting, using a dataset from another language rather than the source language leads to decent results. This is especially true for the MAD-X model, which - with the help of successful hyperparameter-tuning - outperforms the non-neural classifier for the low-resource language Latin.en
dc.language.isoende
dc.rightsinfo:eu-repo/semantics/openAccessde
dc.subject.ddc004de
dc.subject.ddc400de
dc.titleCross-lingual metaphor detection for low-resource languagesen
dc.typemasterThesisde
ubs.fakultaetInformatik, Elektrotechnik und Informationstechnikde
ubs.institutInstitut für Maschinelle Sprachverarbeitungde
ubs.publikation.seiten101de
ubs.publikation.typAbschlussarbeit (Master)de
Enthalten in den Sammlungen:05 Fakultät Informatik, Elektrotechnik und Informationstechnik

Dateien zu dieser Ressource:
Datei Beschreibung GrößeFormat 
thesis_huelsing.pdf1,15 MBAdobe PDFÖffnen/Anzeigen


Alle Ressourcen in diesem Repositorium sind urheberrechtlich geschützt.