AUTHOR=Liu Feng, Li Dongqi, Gao Jian

TITLE=Hybrid knowledge transfer for MARL based on action advising and experience sharing

JOURNAL=Frontiers in Neurorobotics

VOLUME=Volume 18 - 2024

YEAR=2024

URL=https://www.frontiersin.org/journals/neurorobotics/articles/10.3389/fnbot.2024.1364587

DOI=10.3389/fnbot.2024.1364587

ISSN=1662-5218

ABSTRACT=Multiagent Reinforcement Learning (MARL) has been widely adopted owing to its ability to solve multiagent decision-making problems. To further improve learning efficiency, knowledge transfer algorithms have been developed, among which experience-sharing-based and action-advising-based transfer strategies are the two mainstream approaches. Although both strategies have many successful applications, neither is flawless. The long-developed action-advising-based methods (KT-AA, short for knowledge transfer based on action advising) suffer from unsatisfactory data efficiency and scalability. The more recently proposed experience-sharing-based knowledge transfer methods (KT-ES) partially overcome these shortcomings of KT-AA, but they cannot correct specific bad decisions in the later learning stage. To combine the strengths of KT-AA and KT-ES, this paper proposes KT-Hybrid, a hybrid knowledge transfer approach. In the early learning phase, KT-ES methods are employed, with the expectation that their better data efficiency will raise the policy to a basic level as quickly as possible. In the later phase, KT-AA methods are used to correct specific errors made by this basic policy and thereby further improve performance. Simulations demonstrate that the proposed KT-Hybrid outperforms well-received action-advising-based and experience-sharing-based methods.