Adversarial Reinforcement Learning for Cooperative AI — Dissertation
Built a three-stage co-evolutionary RL pipeline (40M timesteps) that trains cooperative agents to stay effective alongside imperfect or adversarial partners in Overcooked-AI. Evaluated it across three random seeds with a compute-matched ablation study, measuring a 2–3× improvement over the self-play baseline and isolating which stages drove the gains.