Towards robust cross-domain domain adaptation for part-of-speech tagging

dc.contributor.author: Schnabel, Tobias [de]
dc.date.accessioned: 2013-06-07 [de]
dc.date.accessioned: 2016-03-31T08:00:25Z
dc.date.available: 2013-06-07 [de]
dc.date.available: 2016-03-31T08:00:25Z
dc.date.issued: 2013 [de]
dc.description.abstract: Most systems in natural language processing experience a substantial loss in performance when the data that the system is tested with differs significantly from the data that the system has been trained on. Systems for part-of-speech (POS) tagging, for example, are typically trained on newspaper texts but are often applied to texts of other domains such as medical texts. Domain adaptation (DA) techniques seek to improve such systems so that they are able to achieve consistently good performance - independent of the domains at hand. We investigate the robustness of domain adaptation representations and methods across target domains using part-of-speech tagging as a case study. We find that there is no single representation and method that works equally well for all target domains. In particular, there are large differences between target domains that are more similar to the source domain and those that are less similar. [en]
dc.identifier.other: 383435633 [de]
dc.identifier.uri: http://nbn-resolving.de/urn:nbn:de:bsz:93-opus-84507 [de]
dc.identifier.uri: http://elib.uni-stuttgart.de/handle/11682/3081
dc.identifier.uri: http://dx.doi.org/10.18419/opus-3064
dc.language.iso: en [de]
dc.rights: info:eu-repo/semantics/openAccess [de]
dc.subject.ddc: 004 [de]
dc.title: Towards robust cross-domain domain adaptation for part-of-speech tagging [en]
dc.type: masterThesis [de]
ubs.fakultaet: Fakultät Informatik, Elektrotechnik und Informationstechnik [de]
ubs.institut: Institut für Maschinelle Sprachverarbeitung [de]
ubs.opusid: 8450 [de]
ubs.publikation.typ: Abschlussarbeit (Diplom) [de]

Files

Original bundle

Name: DIP_3399.pdf
Size: 808.03 KB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 935 B
Format: Plain Text