Align-RUDDER - Learning From Few Demonstrations by Reward Redistribution.

Vihang P. Patil, Markus Hofmarcher, Marius-Constantin Dinu, Matthias Dorfer, Patrick M. Blies, Johannes Brandstetter, Jose A. Arjona-Medina, Sepp Hochreiter

Research output: Other contribution, peer-review

Original language: English
Volume: abs/2009.14108
Publication status: Published - 2020
Externally published: Yes
