Adversarial Robustness of Reinforcement Learning

Over View

Since the emergence of Deep Reinforcement Learning (DRL) algorithms, there has been a surge of interest from both research and industry in the promising potential of this paradigm. The current and envisioned applications of deep RL range from autonomous navigation and robotics to control applications in critical infrastructure, air traffic control, defense technologies, and cybersecurity. Despite the extensive opportunities and benefits of deep RL algorithms, the security risks and challenges associated with them remain largely unexplored. Recent studies have highlighted the vulnerability of DRL algorithms to small perturbations in their state observations, which can be exploited by adversaries to manipulate the behavior and performance of DRL agents. This project aims to advance the current state of the art in three distinct but interconnected areas:

Developing techniques and metrics to evaluate the resilience and robustness of DRL agents against adversarial perturbations of state, reward, and actuation.
Developing tools and techniques for efficient and guaranteed mitigation of adversarial attacks against RL agents.
Addressing the challenges of policy extraction and inversion to enable the protection of models and intellectual property rights.

Current Team Members:

Venu Korada
PI: Vahid Behzadan

Affiliate Research Groups:

Tools and Datasets:

RLAttack: Framework for experimental analysis of adversarial example attacks on policy learning in Deep RL.

Publications: