Automatic recognition of structures in obituaries

dc.contributor.authorSabbatino, Valentino
dc.date.accessioned2019-07-22T15:26:40Z
dc.date.available2019-07-22T15:26:40Z
dc.date.issued2019de
dc.description.abstractObituaries are a less common text type in research that contains a lot of information about people, events in history and culture. The information that can be obtained by zoning such obituaries enables new research, e.g., in social studies. Our work focuses on the question if the structuring of obituaries is possible and viable. Therefore we created a corpus for this work containing 20058 obituaries of which 1008 were annotated manually by us. We implemented four models, a CNN text classifier and three variations of a Bi-LSTM sequence labeler, to see if the zoning procedure is possible and which among the models performs best for this task. The CNN text classifier showed the most promising results together with the variant of the Bi-LSTM model using a Bag-of-Word model.en
dc.identifier.other1671057112
dc.identifier.urihttp://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-104889de
dc.identifier.urihttp://elib.uni-stuttgart.de/handle/11682/10488
dc.identifier.urihttp://dx.doi.org/10.18419/opus-10471
dc.language.isoende
dc.rightsinfo:eu-repo/semantics/openAccessde
dc.subject.ddc004de
dc.titleAutomatic recognition of structures in obituariesen
dc.typebachelorThesisde
ubs.fakultaetInformatik, Elektrotechnik und Informationstechnikde
ubs.institutInstitut für Maschinelle Sprachverarbeitungde
ubs.publikation.seiten39de
ubs.publikation.typAbschlussarbeit (Bachelor)de

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
thesis_valentino_sabbatino.pdf
Size:
681.72 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.39 KB
Format:
Item-specific license agreed upon to submission
Description: