Skip to main navigation Skip to search Skip to main content

D2CFR: Minimize Counterfactual Regret with Deep Dueling Neural Network

  • Northwestern Polytechnical University Xian
  • School of Computer Science and Technology, Harbin Institute of Technology
  • Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies
  • Peng Cheng Laboratory

Research output: Contribution to journalArticlepeer-review

Abstract

Counterfactual regret minimization (CFR) is a popular method for finding approximate Nash equilibrium in two-player zero-sum games with imperfect information. Solving large-scale games with CFR needs a combination of abstraction techniques and certain expert knowledge, which constrains its scalability. Recent neural-based CFR methods mitigate the need for abstraction and expert knowledge by training an efficient network to directly obtain counterfactual regret without abstraction. However, these methods only consider estimating regret values for individual actions, neglecting the evaluation of state values, which are significant for decision-making. In this article, we introduce deep dueling CFR (D2CFR), which emphasizes the state value estimation by employing a novel value network with a dueling structure. Moreover, a rectification module based on a time-shifted Monte Carlo simulation is designed to rectify the inaccurate state value estimation. Extensive experimental results are conducted to show that D2CFR converges faster and outperforms comparison methods on test games.

Original languageEnglish
Pages (from-to)18343-18356
Number of pages14
JournalIEEE Transactions on Neural Networks and Learning Systems
Volume35
Issue number12
DOIs
StatePublished - 2024
Externally publishedYes

Keywords

  • Counterfactual regret minimization (CFR)
  • Nash equilibrium
  • imperfect information games (IIGs)
  • neural network

Fingerprint

Dive into the research topics of 'D2CFR: Minimize Counterfactual Regret with Deep Dueling Neural Network'. Together they form a unique fingerprint.

Cite this