RMIT University
Browse

Multi-Agent Intention Progression with Reward Machines

conference contribution
posted on 2024-11-03, 14:33 authored by Michael Dann, Yuan Yao, Natasha Alechina, Brian Logan, John ThangarajahJohn Thangarajah
Recent work in multi-agent intention scheduling has shown that enabling agents to predict the actions of other agents when choosing their own actions can be beneficial. However existing approaches to 'intention-aware' scheduling assume that the programs of other agents are known, or are "similar" to that of the agent making the prediction. While this assumption is reasonable in some circumstances, it is less plausible when the agents are not co-designed. In this paper, we present a new approach to multi-agent intention scheduling in which agents predict the actions of other agents based on a high-level specification of the tasks performed by an agent in the form of a reward machine (RM) rather than on its (assumed) program. We show how a reward machine can be used to generate tree and rollout policies for an MCTS-based scheduler. We evaluate our approach in a range of multi-agent environments, and show that RM-based scheduling out-performs previous intention-aware scheduling approaches in settings where agents are not co-designed.

History

Related Materials

  1. 1.
    ISBN - Is published in 9781956792003 (urn:isbn:9781956792003)
  2. 2.

Start page

215

End page

222

Total pages

8

Outlet

Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI 2022)

Name of conference

The 31st International Joint Conference on Artificial Intelligence

Publisher

International Joint Conferences on Artificial Intelligence

Place published

United States

Start date

2022-07-23

End date

2022-07-29

Language

English

Former Identifier

2006116350

Esploro creation date

2023-03-22

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC