Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen: http://dx.doi.org/10.18419/opus-10471
Autor(en): Sabbatino, Valentino
Titel: Automatic recognition of structures in obituaries
Erscheinungsdatum: 2019
Dokumentart: Abschlussarbeit (Bachelor)
Seiten: 39
URI: http://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-104889
http://elib.uni-stuttgart.de/handle/11682/10488
http://dx.doi.org/10.18419/opus-10471
Zusammenfassung: Obituaries are a less common text type in research that contains a lot of information about people, events in history and culture. The information that can be obtained by zoning such obituaries enables new research, e.g., in social studies. Our work focuses on the question if the structuring of obituaries is possible and viable. Therefore we created a corpus for this work containing 20058 obituaries of which 1008 were annotated manually by us. We implemented four models, a CNN text classifier and three variations of a Bi-LSTM sequence labeler, to see if the zoning procedure is possible and which among the models performs best for this task. The CNN text classifier showed the most promising results together with the variant of the Bi-LSTM model using a Bag-of-Word model.
Enthalten in den Sammlungen:05 Fakultät Informatik, Elektrotechnik und Informationstechnik

Dateien zu dieser Ressource:
Datei Beschreibung GrößeFormat 
thesis_valentino_sabbatino.pdf681,72 kBAdobe PDFÖffnen/Anzeigen


Alle Ressourcen in diesem Repositorium sind urheberrechtlich geschützt.