Roadmap
DeVenLucaz edited this page Jun 19, 2026 · 4 revisions
llamdrop develops in focused phases. Each version ships working features, not promises.
- One-command install, no compilation
- Prebuilt llama.cpp binary for Android
- Auto device detection — RAM, CPU, OS, storage
- Verified model catalog with tier system
- Live HuggingFace search
- Resilient downloader with resume and retry
- Smart quantization selection
- Session save and resume
- Background catalog update checker
- Phone-wide GGUF scanner
- Vulkan GPU acceleration
- Live RAM monitor and context trimming
- `llamdrop update` and `llamdrop doctor`
- Model benchmarking
- Config file
- Battery monitoring
- Chip-aware thread count
- Ollama backend auto-detect (Linux)
- Conditional mmap
- Full `DeviceProfile` dataclass rollout
- Device-aware browser
- Cancelled downloads cleanup
- Incremental prompt buffer
- Arabic UI language added
- Multi-language UI Support (English, Hindi, Spanish, Portuguese, Arabic)
- Optional File Context Injection (chat with PDFs, TXTs, CSVs)
- Unified RAM reading
- Fixed Inference UI Blocking
- Formal Testing Suite
- Dynamic Backend Probing: Auto micro-benchmark for best backend selection.
- Independent Engine Updates: Update `llama.cpp` separately.
- Auto-healing Doctor: `[F]` key auto-fixes broken configs/binaries.
- Power User Overrides: `allow_thermal_melt`, backend forcing, `[U]` unsupported models toggle.
- Dynamic Catalog Overhaul: Background catalog fetching and direct HF URL pasting.
- Dropped Windows/macOS: Focus exclusively on Android (Termux), Linux, and SBCs.
- Dropped Desktop Tier: Focused on accessible hardware from Micro to High (up to 16GB).
- Web-based model catalog on GitHub Pages — browse models before installing
- Community device profile submissions
- llamdrop server mode — run model on phone, access from browser on same WiFi
- Streaming tokens via Ollama backend
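
The Dynamic Backend Probing item above describes an automatic micro-benchmark that picks the best backend. The following is a minimal sketch of that idea, not llamdrop's actual code: the function name `pick_fastest_backend`, the `probes` mapping, and the best-of-N timing rule are all assumptions made for illustration. Each probe stands in for a short fixed workload run on one backend, and a probe that raises is treated as an unavailable backend.

```python
import time
from typing import Callable, Dict


def pick_fastest_backend(probes: Dict[str, Callable[[], None]], runs: int = 3) -> str:
    """Return the name of the backend whose probe has the lowest best-of-N time.

    `probes` maps a backend name to a zero-argument callable that runs a
    short, fixed workload on that backend (hypothetical, for illustration).
    """
    timings: Dict[str, float] = {}
    for name, probe in probes.items():
        try:
            best = float("inf")
            for _ in range(runs):
                start = time.perf_counter()
                probe()
                best = min(best, time.perf_counter() - start)
            timings[name] = best
        except Exception:
            # A backend that fails its probe is skipped as unavailable.
            continue
    if not timings:
        raise RuntimeError("no backend completed the probe")
    return min(timings, key=timings.get)


if __name__ == "__main__":
    def broken_probe() -> None:
        raise OSError("driver not found")

    # Stand-in workloads; a real probe would generate a few tokens instead.
    demo_probes = {
        "cpu": lambda: time.sleep(0.02),
        "vulkan": lambda: time.sleep(0.001),
        "missing": broken_probe,
    }
    print(pick_fastest_backend(demo_probes))
```

Taking the best of several runs rather than the mean reduces the effect of one-off stalls such as a background task or cold cache, which matters on phones where the timing is noisy.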
LLAMdrop v0.10.0 • Built by @DeVenLucaz • Free & Open Source
Empowering low-spec devices with local AI.