Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen:
http://dx.doi.org/10.18419/opus-10471
Autor(en): | Sabbatino, Valentino |
Titel: | Automatic recognition of structures in obituaries |
Erscheinungsdatum: | 2019 |
Dokumentart: | Abschlussarbeit (Bachelor) |
Seiten: | 39 |
URI: | http://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-104889 http://elib.uni-stuttgart.de/handle/11682/10488 http://dx.doi.org/10.18419/opus-10471 |
Zusammenfassung: | Obituaries are a less common text type in research that contains a lot of information about people, events in history and culture. The information that can be obtained by zoning such obituaries enables new research, e.g., in social studies. Our work focuses on the question if the structuring of obituaries is possible and viable. Therefore we created a corpus for this work containing 20058 obituaries of which 1008 were annotated manually by us. We implemented four models, a CNN text classifier and three variations of a Bi-LSTM sequence labeler, to see if the zoning procedure is possible and which among the models performs best for this task. The CNN text classifier showed the most promising results together with the variant of the Bi-LSTM model using a Bag-of-Word model. |
Enthalten in den Sammlungen: | 05 Fakultät Informatik, Elektrotechnik und Informationstechnik |
Dateien zu dieser Ressource:
Datei | Beschreibung | Größe | Format | |
---|---|---|---|---|
thesis_valentino_sabbatino.pdf | 681,72 kB | Adobe PDF | Öffnen/Anzeigen |
Alle Ressourcen in diesem Repositorium sind urheberrechtlich geschützt.