TrueSkate-AI

Training an AI agent to play the mobile game True Skate at an expert level.

Approach

The project started as behavioral cloning (supervised learning from gameplay recordings) but pivoted to reinforcement learning — the agent generates its own touch gestures via Appium on a physical iPhone, so there's no labeling bottleneck.

The current baseline method uses CMA-ES (Covariance Matrix Adaptation Evolution Strategy) to optimize a 17-dimensional continuous parameter vector that encodes two curved multi-touch gestures. The agent attempts tricks, reads the result via OCR, and receives a shaped reward.

An experimental path now also exists for trick-conditioned PPO with a neural policy that outputs a 42-parameter action plan (4 gated gesture slots, inter-slot delays, and spin timing controls).

Results So Far

The agent has landed pop shove-its, varial flips, 360 flips, nightmare flips, and others — confirming the pipeline works end-to-end. CMA-ES tends to converge on reliably landable medium-reward tricks rather than volatile high-reward ones, which is an active area of work.

How It Works

Gesture parameterization — Two gesture slots (3 normalised waypoints + duration + easing each) plus inter-slot delay → 17 params total. See GESTURES.md for the full coordinate and schema reference.
Touch execution — Gestures fire as sequential calls to a custom WDA endpoint on a physical iPhone; push uses Appium ActionChains
Trick detection — Screenshot → Apple Vision OCR → fuzzy match against known tricks
Reward — Tiered scoring based on detected trick name
Optimization — CMA-ES samples gesture parameter vectors, evaluates them on-device, updates the search distribution

Key Constraint

True Skate runs at 1× real-time — no way to speed up the simulator. A GPU can't accelerate the live interaction loop. This makes data throughput (~15K steps/hour) the binding constraint, not compute.

Project Structure

src/trueskate_ai/ ├── nn/ # Trick-conditioned neural policy ├── rl/ # Action parameterization, reward, collectors, PPO and CMA-ES ├── sim/ # Device control (touch_actions, trick_info_reader, known_tricks) ├── labeling/ # Legacy CV pipeline (pre-RL pivot) ├── vision/ # Legacy PyTorch datasets └── utils/ # Trajectory splines, data loading scripts/ # Entry points (train_cmaes, train_ppo, launch_services, build_trick_library, etc.) experiments/ # Experiment journals and standalone experiments

Requirements

Python 3.11+ with venv
Appium + WebDriverAgent (Xcode)
Physical iPhone with True Skate installed

Setup

python -m venv .venv && source .venv/bin/activate
pip install numpy opencv-python torch torchvision scipy pillow appium-python-client matplotlib requests cma python-dotenv pyobjc

Copy .env.example to .env and set your device UDID.

Training Entrypoints

# CMA-ES baseline
python scripts/train/train_cmaes.py

# Trick-conditioned PPO experiment
python scripts/train/train_ppo.py --updates 100 --steps-per-update 24

# Spin button coordinate calibration helper
python scripts/inspect/calibrate_spin_button.py --x 25 --y 362 --repeat 2

# Optional global spin override during PPO runs
python scripts/train/train_ppo.py --spin-x 25 --spin-y 362

# Disable hindsight relabeling (enabled by default)
python scripts/train/train_ppo.py --no-hindsight-relabel

# Resume a new run from a prior checkpoint
python scripts/train/train_ppo.py --resume-from /absolute/path/to/policy_update_0010.pt

Name		Name	Last commit message	Last commit date
Latest commit History 225 Commits
.github		.github
configs/overnight		configs/overnight
curricula		curricula
deploy		deploy
experiments		experiments
logs/runs		logs/runs
notebooks		notebooks
scripts		scripts
src		src
tools		tools
trick_libraries		trick_libraries
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
DEPLOYMENT.md		DEPLOYMENT.md
GESTURES.md		GESTURES.md
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TrueSkate-AI

Approach

Results So Far

How It Works

Key Constraint

Project Structure

Requirements

Setup

Training Entrypoints

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TrueSkate-AI

Approach

Results So Far

How It Works

Key Constraint

Project Structure

Requirements

Setup

Training Entrypoints

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages