AUTHOR=Petsanis Thanos , Keroglou Christoforos , Ch. Kapoutsis Athanasios , Kosmatopoulos Elias B. , Sirakoulis Georgios Ch. 

TITLE=Decomposing user-defined tasks in a reinforcement learning setup using TextWorld

JOURNAL=Frontiers in Robotics and AI

VOLUME=Volume 10 - 2023

YEAR=2023

URL=https://www.frontiersin.org/journals/robotics-and-ai/articles/10.3389/frobt.2023.1280578

DOI=10.3389/frobt.2023.1280578

ISSN=2296-9144

ABSTRACT=The current paper proposes a Hierarchical Reinforcement Learning (HRL) method to decompose a complex task into simpler sub-tasks and leverage those to improve training of an autonomous agent in a simulated environment. For practical reasons (i.e., illustrating purposes, easy implementation, user friendly interface, useful functionalities), we employ two python frameworks called TextWorld and MiniGrid. MiniGrid functions as a 2D simulated representation of the real environment while TextWorld functions as a high-level abstraction of this simulated environment.Training on this abstraction disentangles manipulation from navigation actions and allows us to design a dense reward function instead of a sparse reward function for the lower-level environment which, as we show, improves performance of training. Formal methods are utilized throughout the paper to establish that our algorithm is not prevented to derive to solutions.