Skip to main content

ORIGINAL RESEARCH article

Front. Robot. AI
Sec. Robot Learning and Evolution
Volume 11 - 2024 | doi: 10.3389/frobt.2024.1455375

Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models

Provisionally accepted
Leonard Bärmann Leonard Bärmann 1*Rainer Kartmann Rainer Kartmann 1Fabian Peller-Konrad Fabian Peller-Konrad 1Jan Niehues Jan Niehues 1Alex Waibel Alex Waibel 1,2Tamim Asfour Tamim Asfour 1
  • 1 Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
  • 2 Carnegie Mellon University, Pittsburgh, Pennsylvania, United States

The final, formatted version of the article will be published soon.

    Natural-language dialog is key for intuitive human-robot interaction. It can be used not only to express humans' intents, but also to communicate instructions for improvement if a robot does not understand a command correctly. Of great importance is to let robots learn from such interaction experience in an incremental way to allow them to improve their behaviors or avoid mistakes in the future. In this paper, we propose a system to achieve such incremental learning of complex high-level behavior from natural interaction, and demonstrate its implementation on a humanoid robot. Our system deploys Large Language Models (LLMs) for high-level orchestration of the robot's behavior, based on the idea of enabling the LLM to generate Python statements in an interactive console to invoke both robot perception and action. Human instructions, environment observations, and execution results are fed back to the LLM, thus informing the generation of the next statement. Since an LLM can misunderstand (potentially ambiguous) user instructions, we introduce incremental learning from interaction, which enables the system to learn from its mistakes. For that purpose, the LLM can call another LLM responsible for code-level improvements of the current interaction based on human feedback. Subsequently, we store the improved interaction in the robot's memory so that it can later be retrieved on semantically similar requests. We integrate the system in the robot cognitive architecture of the humanoid robot ARMAR-6 and evaluate our methods both quantitatively (in simulation) and qualitatively (in simulation and real-world) by demonstrating generalized incrementally-learned knowledge.

    Keywords: Incremental Learning, human-robot interaction, Cognitive Modeling, Knowledge Representation for Robots, humanoid robots

    Received: 26 Jun 2024; Accepted: 06 Sep 2024.

    Copyright: © 2024 Bärmann, Kartmann, Peller-Konrad, Niehues, Waibel and Asfour. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Leonard Bärmann, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.