Abstract
Sequential resource allocation with situational constraints presents a significant challenge in real-world applications, where resource demands and priorities are context-dependent. This paper introduces a novel framework, SCRL, to address this problem. We formalize situational constraints as logic implications and develop a new algorithm that dynamically penalizes constraint violations. To handle situational constraints effectively, we propose a probabilistic selection mechanism to overcome limitations of traditional constrained reinforcement learning (CRL) approaches. We evaluate SCRL across two scenarios: medical resource allocation during a pandemic and pesticide distribution in agriculture. Experiments demonstrate that SCRL outperforms existing baselines in satisfying constraints while maintaining high resource efficiency, showcasing its potential for real-world, context-sensitive decision-making tasks.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 34th International Joint Conference on Artificial Intelligence, IJCAI 2025 |
| Editors | James Kwok |
| Publisher | International Joint Conferences on Artificial Intelligence |
| Pages | 9121-9129 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781956792065 |
| State | Published - 2025 |
| Externally published | Yes |
| Event | 34th International Joint Conference on Artificial Intelligence, IJCAI 2025 - Montreal, Canada. Duration: 16 Aug 2025 → 22 Aug 2025 |
Publication series
| Name | IJCAI International Joint Conference on Artificial Intelligence |
|---|---|
| ISSN (Print) | 1045-0823 |
Conference
| Conference | 34th International Joint Conference on Artificial Intelligence, IJCAI 2025 |
|---|---|
| Country/Territory | Canada |
| City | Montreal |
| Period | 16/08/25 → 22/08/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs):
- SDG 8 Decent Work and Economic Growth
- SDG 12 Responsible Consumption and Production
Fingerprint
Dive into the research topics of 'Situational-Constrained Sequential Resources Allocation via Reinforcement Learning'. Together they form a unique fingerprint.