1v1 Self-Play Football AI

This project utilizes the Unity ML-Agents toolkit to create a competitive 1v1 football environment. Agents are trained using the Proximal Policy Optimization (PPO) algorithm combined with a Self-Play mechanism, allowing them to evolve by competing against increasingly difficult versions of themselves. The system features a sophisticated reward shaping logic, 360-degree ray-cast vision and real-time ELO rating tracking to measure agent progression.

Key Features

Tech Stack

Unity EngineC# (Scripting)Python (ML-Agents Toolkit)PyTorch (Neural Networks)YAML (Hyperparameter Configuration)TensorBoard (Data Visualization)

Screenshots

Screenshot 0
Screenshot 1
Screenshot 2
Screenshot 3
Screenshot 4
Screenshot 5
Screenshot 6
Screenshot 7

Video