Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for Robotics. Roder, Frank; Eppe, Manfred; Wermter, Stefan. 2022. Forschungsinformationssystem der UHH