Lab member paper accepted at AAMAS 2023
- Post by: Bahareh Arghavani
- February 16, 2023
Congratulations on the acceptance of your paper titled “A Theory of Mind Approach as Test-Time Mitigation Against Emergent Adversarial Communication.” This is a significant achievement that reflects your hard work, dedication, and expertise in the field of machine learning and artificial intelligence.
The paper is available on arXiv.
Your research has great potential to make a real impact on the development of more robust and reliable AI systems, which is an essential task in today’s rapidly evolving technological landscape. Your approach to using the Theory of Mind framework to mitigate adversarial communication at test time is innovative and promising, and we look forward to seeing the results of further research and development in this area.
Abstract:
Multi-Agent Systems (MAS) is the study of multi-agent interactions in a shared environment. Communication for cooperation is a fundamental construct for sharing information in partially observable environments. Cooperative Multi-Agent Reinforcement Learning (CoMARL) is a learning framework where we learn agent policies either with cooperative mechanisms or policies that exhibit cooperative behavior. Explicitly, there are works on learning to communicate messages from CoMARL agents; however, non-cooperative agents, when capable of accessing a cooperative team’s communication channel, have been shown to learn adversarial communication messages, sabotaging the cooperative team’s performance, particularly when objectives depend on finite resources. To address this issue, we propose a technique which leverages local formulations of Theory-of-Mind (ToM) to distinguish exhibited cooperative behavior from non-cooperative behavior before accepting messages from any agent. We demonstrate the efficacy and feasibility of the proposed technique in empirical evaluations in a centralized training, decentralized execution (CTDE) CoMARL benchmark. Furthermore, while we propose our explicit ToM defense for test time, we emphasize that ToM is a construct for designing a cognitive defense rather than being the objective of the defense.
Author(s):
Piazza, Nancirose and Behzadan, Vahid
arXiv identifier:
arXiv:2302.07176