
Learning cooperative strategies in StarCraft through role-based monotonic value function factorization

  • Faculty of Computing, Harbin Institute of Technology
  • School of Medicine and Health, Harbin Institute of Technology
  • School of Management, Harbin Institute of Technology

Research output: Contribution to journal (article, peer-reviewed)

Abstract

StarCraft is a popular real-time strategy game that has been widely used as a research platform for artificial intelligence. Micromanagement refers to the fine-grained control of individual units for common benefit: each unit must perform appropriate actions separately, depending on the current state of the multi-agent system comprising all of the units. Cooperation between different units is therefore crucial to improving the joint strategy. We use multi-agent deep reinforcement learning to tackle the problem of micromanagement. In this paper, we propose a method for learning cooperative strategies in StarCraft based on role-based monotonic value function factorization (RoMIX). RoMIX learns roles based on the potential impact of each agent on the multi-agent task; it then represents the action value of a role in a mixed way based on monotonic value function factorization. The final value is calculated by accumulating the action values of all roles. The role-based learning improves the cooperation between agents on the team, allowing them to learn the joint strategy more quickly and efficiently. In addition, RoMIX can reduce storage resources to a certain extent. Experiments show that RoMIX can not only solve easy tasks but also learn better cooperation strategies for more complex and difficult tasks.
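
The two-stage aggregation described in the abstract (a monotonic mix within each role, then a sum over roles) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the role assignment is fixed by hand here rather than learned, the mixer is a single linear layer with hand-picked weights rather than a state-conditioned network, and the names `monotonic_mix` and `role_based_total` are hypothetical. The only property it is meant to show is that forcing the mixing weights to be non-negative keeps the total value non-decreasing in every agent's individual value.

```python
import numpy as np


def monotonic_mix(agent_qs, w, b):
    """Mix the Q-values of one role's agents into a single role value.

    Taking abs() of the weights keeps them non-negative, so the mixed
    value can never decrease when an individual agent's Q-value rises.
    (Assumption: a single linear layer stands in for the mixing network.)
    """
    return float(np.abs(w) @ agent_qs + b)


def role_based_total(agent_qs, roles, weights, biases):
    """Accumulate the mixed value of every role into one total value."""
    total = 0.0
    for role in np.unique(roles):
        members = roles == role  # boolean mask of agents in this role
        key = int(role)
        total += monotonic_mix(agent_qs[members], weights[key], biases[key])
    return total


# Hypothetical example: four agents, two roles assigned by hand.
agent_qs = np.array([1.0, 2.0, 0.5, 3.0])
roles = np.array([0, 0, 1, 1])
weights = {0: np.array([0.5, -1.0]), 1: np.array([2.0, 1.0])}
biases = {0: 0.1, 1: -0.2}

q_tot = role_based_total(agent_qs, roles, weights, biases)

# Raising one agent's Q-value must not lower the total.
bumped = agent_qs.copy()
bumped[1] += 1.0
q_tot_bumped = role_based_total(bumped, roles, weights, biases)
```

Because each agent's value enters the total only through a non-negative weight, the action that maximizes an agent's own value is also consistent with maximizing the total, which is the usual motivation for monotonic factorization.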

Original language: English
Pages (from-to): 779-798
Number of pages: 20
Journal: Electronic Research Archive
Volume: 32
Issue number: 2
State: Published - 2024
Externally published: Yes

Keywords

  • Q-learning
  • StarCraft multi-agent challenge
  • artificial intelligence
  • machine learning
  • multi-agent reinforcement learning
