Featured Projects

[contact me for code or reports where not provided]

Multi-Agent Car Parking Thesis

Multi-Agent Car Parking

Undergraduate thesis applying reinforcement learning to simulate and control groups of autonomous vehicles using PPO and Unity ML-Agents.
  • Implemented variants of Q-Learning and modifications to PPO using Tensorflow/Python.
  • Designed a flexible environment modelled as an MDP with independent agents and dynamic goals. Implemented in Unity using C#.
  • Achieved over 98% parking success with 7 cars using PPO, outperforming single-agent baselines.
  • Explored collaborative and competitive behaviours with variable communication and density.
  • Revealed novel group dynamics such as 'leaky' collaboration and competition-induced cooperation.
  • Scaled training on the university's HPC using Slurm.
  • Awarded 85% for the thesis (top 2% / 250 in cohort).