Please use this identifier to cite or link to this resource: http://dx.doi.org/10.18419/opus-10171
Author(s): Zendler, Ulrich
Title: How word-embedding methods improve information extraction and can be used for multilingual approaches
Date of publication: 2018
Document type: Master's thesis
Pages: 108
URI: http://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-101881
http://elib.uni-stuttgart.de/handle/11682/10188
http://dx.doi.org/10.18419/opus-10171
Abstract: Expanding entity sets and extracting relations are key tasks in natural language processing (NLP), and both have been approached in various ways. Recent successful attempts all use word embeddings such as those presented by Mikolov et al. While most work concentrates on improving these tasks in general without considering a specific domain, it is of interest how to achieve even higher precision by focusing on a specific domain and optimizing the methods towards a single purpose. This thesis therefore suggests methods and adjustments that optimize existing proposals for entity set expansion for the domain of drugs. While this is the main purpose of the thesis, it also presents a novel idea for improving precision in relation extraction by using word embeddings, which could be combined with existing successful relation extraction methods. Finally, a key concern of many international companies is addressed by presenting a solution for a multilingual information extraction system (IES) that is capable of preprocessing text in multiple languages, expanding entity sets independently of the language used, and extracting relations from the texts.
Appears in collections: 05 Fakultät Informatik, Elektrotechnik und Informationstechnik

Files in this resource:
File: MasterThesisUlrichZendler.pdf
Size: 1.39 MB
Format: Adobe PDF


All resources in this repository are protected by copyright.