| layout | default |
|---|---|
| title | Quick Start |
| nav_order | 2 |
| description | Download Off Grid and run your first local AI model in under 5 minutes - no account, no API key, no cloud. |
Run your first local AI model in under 5 minutes. No account. No API key. No internet after setup.
iOS: Download on the App Store - requires iPhone 12 or newer (4GB RAM+)
Android: Get it on Google Play - requires Android 10+, 4GB RAM+
Or grab the latest APK directly from GitHub Releases.
When you open the app, you'll see the model picker. If you're unsure, start here:
| You want | Start with | Size |
|---|---|---|
| Fast chat, 3–4GB RAM | Qwen 3.5 0.8B | ~0.8GB |
| Best for most phones | Qwen 3.5 2B | ~1.7GB |
| Best quality (8GB RAM) | Qwen 3.5 9B | ~5.5GB |
| Vision + reasoning | Gemma 4 E2B | ~1.5GB |
| Image generation | SD 1.5 Palettized (iOS) / Absolute Reality (Android) | ~1GB |
Not sure? Pick Qwen 3.5 2B. It fits comfortably in 4GB RAM, supports 262K context, and is the best starting point for most phones.
Tap a model → Download. This is the only time you need internet. The download goes to your device storage.
Once downloaded, tap Load - the model loads into RAM. On first load this takes 5–15 seconds depending on model size.
Type your first message. You're now running AI locally.
Put your phone in airplane mode. Everything still works.
- [Which model should I use?]({{ '/guides/which-model' | relative_url }}) - full comparison table by device and use case
- [Connect your home Ollama server]({{ '/guides/ollama-android' | relative_url }}) - use bigger models from your desktop via LAN
- [Run Stable Diffusion on Android]({{ '/guides/stable-diffusion-android' | relative_url }}) - generate images completely on-device
Stuck, or want to share what you're building? Join the Slack community.
The app is open source - view it on GitHub.