Optimierung von Clustering von Wortverwendungsgraphen

Tunc, Benjamin

dc.contributor.author	Tunc, Benjamin
dc.date.accessioned	2022-01-20T13:51:50Z
dc.date.available	2022-01-20T13:51:50Z
dc.date.issued	2021	de
dc.description.abstract	Algorithms for clustering Word Usage Graphs are not optimal in terms of efficiency and often fail to find the optimal clustering loss on larger graphs. Our aim in this paper is to find efficient ways to approximate the global minimum of a clustering loss function on three Word Usage Graph data sets using correlation clustering and simulated annealing. To this end, we define 321 models with different initialization modifications, parameter combinations and stopping criteria, and evaluate them in terms of loss, similarity to word sense description annotation, robustness and runtime. We evaluate different approaches and define efficient models with a dynamic stopping criterion to find the lowest loss, and show that these yield robust clustering solutions. We find that lowering the loss leads to better and more robust clustering solutions.	en
dc.description.abstract	Algorithmen für das Clustering von Wortverwendungsgraphen sind im Hinblick auf ihre Effizienz nicht optimal und finden bei größeren Graphen oft nicht den optimalen Clustering-Loss. Unser Ziel in dieser Arbeit ist es, effiziente Wege zu finden, um das globale Minimum einer Clustering-Lossfunktion auf drei Wortverwendungsgraphen-Datensätzen mit Hilfe von Korrelationsclustering und Simulated Annealing zu approximieren. Zu diesem Zweck definieren wir 321 Modelle mit unterschiedlichen Initialisierungsmodifikationen, Parameterkombinationen und Abbruchkriterien und evaluieren sie in Bezug auf Loss, Ähnlichkeit mit Word Sense Description, Robustheit und Laufzeit. Wir evaluieren verschiedene Ansätze und definieren effiziente Modelle mit dynamischem Abbruchkriterium, um den geringsten Loss zu finden, und zeigen, dass diese zu robusten Clusterlösungen führen. Wir stellen fest, dass eine Verringerung des Loss zu besseren und robusteren Clusterlösungen führt.	de
dc.identifier.other	1786619938
dc.identifier.uri	http://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-119232	de
dc.identifier.uri	http://elib.uni-stuttgart.de/handle/11682/11923
dc.identifier.uri	http://dx.doi.org/10.18419/opus-11906
dc.language.iso	en	de
dc.rights	info:eu-repo/semantics/openAccess	de
dc.subject.ddc	004	de
dc.title	Optimierung von Clustering von Wortverwendungsgraphen	de
dc.type	bachelorThesis	de
ubs.fakultaet	Informatik, Elektrotechnik und Informationstechnik	de
ubs.institut	Institut für Maschinelle Sprachverarbeitung	de
ubs.publikation.seiten	22	de
ubs.publikation.typ	Abschlussarbeit (Bachelor)	de

Files

Original bundle

Name: Bachelorarbeit_SWT_Tunc.pdf
Size: 5.15 MB
Format: Adobe Portable Document Format

Collections

05 Fakultät Informatik, Elektrotechnik und Informationstechnik