Towards learners that plan: Integrating trainable planning modules for data-efficient learning

dc.contributor.authorSchneider, Tim
dc.date.accessioned2022-01-28T09:40:22Z
dc.date.available2022-01-28T09:40:22Z
dc.date.issued2018de
dc.description.abstractLearning is one of the most important abilities of intelligent adaptive agents. The generalization capability and training efficiency of learning algorithms depend heavily on the abstract representations acquired. Planning, on the other hand, allows agents to anticipate the future consequences of their actions so as to act optimally at the now. The action-contingent predictive features generated by planning modules thereby provide a good abstract representation constituting the current state of the agent. From this insight, this thesis aims to integrate trainable planning modules for data-efficient learning in sequential decision making and manipulation problems, ranging from Go game to real-world robotic AI. Specifically, this thesis will investigate the effectiveness of such approach by trying to solve the key questions of (1) how to integrate planning modules into deep learning frameworks so as to train the whole system from data, and (2) how to exploit predictive, but possibly inaccurate, abstract features from planning modules to guide the learning process. The main contributions of this thesis are to answer these questions within a broad literature survey and incorporate the ideas in an algorithm that can be applied to learn to plan in visual navigation tasks in a completely unsupervised manner.en
dc.identifier.other1787781925
dc.identifier.urihttp://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-119376de
dc.identifier.urihttp://elib.uni-stuttgart.de/handle/11682/11937
dc.identifier.urihttp://dx.doi.org/10.18419/opus-11920
dc.language.isoende
dc.rightsinfo:eu-repo/semantics/openAccessde
dc.subject.ddc004de
dc.titleTowards learners that plan: Integrating trainable planning modules for data-efficient learningen
dc.typebachelorThesisde
ubs.fakultaetInformatik, Elektrotechnik und Informationstechnikde
ubs.institutInstitut für Parallele und Verteilte Systemede
ubs.publikation.seiten57de
ubs.publikation.typAbschlussarbeit (Bachelor)de

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
thesis.pdf
Size:
1.49 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.39 KB
Format:
Item-specific license agreed upon to submission
Description: