ML & AI Engineer — Fine-Tuning · Agentic Systems · Edge Deployment · Production LLM Ops
I build production LLM systems from the metal up — from quantized models running on Jetson edge hardware to multi-agent cloud deployments with tool-use, permission gating, and audit trails. Currently focused on MoE fine-tuning (ZAYA1-8B), Blackwell-native FP4 quantization (NVFP4), and SOTA agentic coding benchmarks.
Dallas-Fort Worth, TX · ttimmsinternational@gmail.com
| Project | What | Why It Matters |
|---|---|---|
| zaya1-godspeed | Fine-tuning ZAYA1-8B MoE for agentic tool calling | 760M active params matching 14B models — closing a deliberate gap Zyphra left in the tech report |
| llama.cpp NVFP4 | Blackwell-native FP4 quantization with MSE-optimal scales | First consumer NVFP4 tooling on RTX 5070 Ti — PR #22897 awaiting upstream review |
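The NVFP4 work picks per-block scales by minimizing quantization error rather than using plain max-abs scaling. A minimal sketch of that search, assuming the standard FP4 (E2M1) value grid and a brute-force candidate sweep (helper names are illustrative, not llama.cpp's API):

```python
import numpy as np

# FP4 (E2M1) representable magnitudes; sign is handled separately.
FP4_GRID = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])

def quantize_fp4(block: np.ndarray, scale: float) -> np.ndarray:
    """Round each value to the nearest representable FP4 value at a given scale."""
    scaled = block / scale
    signs = np.sign(scaled)
    mags = np.abs(scaled)
    idx = np.abs(mags[:, None] - FP4_GRID[None, :]).argmin(axis=1)  # snap to grid
    return signs * FP4_GRID[idx] * scale

def mse_optimal_scale(block: np.ndarray, n_candidates: int = 64) -> float:
    """Sweep candidate scales around max-abs and keep the one with lowest MSE."""
    base = np.abs(block).max() / FP4_GRID[-1]  # max-abs scale as the starting point
    if base == 0.0:
        return 1.0
    best_scale, best_err = base, np.inf
    for frac in np.linspace(0.5, 1.2, n_candidates):
        scale = base * frac
        err = np.mean((block - quantize_fp4(block, scale)) ** 2)
        if err < best_err:
            best_scale, best_err = scale, err
    return best_scale
```

Because NVFP4 blocks are small, sweeping a few dozen candidate scales per block adds little to overall quantization time.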
Security-first open-source coding agent. Hand-rolled async ReAct loop with a 4-tier deny-first permission engine, SHA-256 hash-chained audit trail (sketched after the feature list below), and 200+ LLM providers via LiteLLM. 854 tests.
- 30+ built-in tools with JSON Schema validation, MCP server + client
- Parallel + speculative tool dispatch, cost budget enforcement
- Self-evolution via LLM-guided mutations, multi-language verify gate with retry
- Training data export (openai/chatml/sharegpt), per-step reward annotations for GRPO
- SWE-bench Lite: 34.8% single-shot · 52.2% oracle best-of-5
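The hash-chained audit trail ties each log entry to the SHA-256 digest of the previous one, so any edit, deletion, or reordering breaks verification. A minimal sketch of the pattern, with illustrative class and field names rather than the agent's actual API:

```python
import hashlib
import json
import time

class AuditChain:
    """Append-only log where each entry commits to the previous entry's hash."""

    GENESIS = "0" * 64

    def __init__(self):
        self.entries = []

    def append(self, event: dict) -> str:
        prev_hash = self.entries[-1]["hash"] if self.entries else self.GENESIS
        record = {"ts": time.time(), "event": event, "prev": prev_hash}
        payload = json.dumps(record, sort_keys=True).encode()
        record["hash"] = hashlib.sha256(payload).hexdigest()
        self.entries.append(record)
        return record["hash"]

    def verify(self) -> bool:
        """Recompute every hash; any tampered or reordered entry fails."""
        prev = self.GENESIS
        for entry in self.entries:
            body = {k: v for k, v in entry.items() if k != "hash"}
            if body["prev"] != prev:
                return False
            digest = hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()
            if digest != entry["hash"]:
                return False
            prev = entry["hash"]
        return True
```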
Autonomous multi-agent personal intelligence system on NVIDIA Jetson Orin Nano. 5 LangGraph expert agents, LiteLLM gateway (4 providers + Ollama), 3-tier ONNX intent router. 393 tests. Fully on-device — zero cloud dependencies.
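A tiered intent router keeps most requests off the LLM entirely: cheap rules first, then a small on-device classifier, and a model call only as a last resort. A rough sketch of that shape (the actual tier composition, patterns, and thresholds in the project may differ):

```python
# Illustrative 3-tier routing: regex rules, then a local ONNX intent classifier,
# then an LLM fallback for low-confidence queries.
import re

RULES = {
    r"\b(weather|forecast)\b": "weather_agent",
    r"\b(remind|reminder|schedule)\b": "calendar_agent",
}

def route(query: str, onnx_classifier, llm_fallback, threshold: float = 0.8) -> str:
    # Tier 1: keyword rules cost microseconds and need no model load.
    for pattern, agent in RULES.items():
        if re.search(pattern, query, re.IGNORECASE):
            return agent
    # Tier 2: small on-device ONNX classifier returns (agent, confidence).
    agent, confidence = onnx_classifier(query)
    if confidence >= threshold:
        return agent
    # Tier 3: only ambiguous queries pay for an LLM call.
    return llm_fallback(query)
```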
Multi-agent algorithmic trading pipeline with DeepSeek R1 reasoning at every stage. 4-agent pipeline (TA → Chief → Risk → Execution), Kelly Criterion position sizing, Monte Carlo risk simulation, real-time WebSocket market data.
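Kelly sizing converts an estimated win probability and payoff ratio into a bankroll fraction; real systems usually run fractional Kelly and cap the result to absorb estimation error. A minimal sketch (the pipeline's actual sizing logic may add further constraints):

```python
def kelly_fraction(win_prob: float, win_loss_ratio: float,
                   cap: float = 0.25, fraction: float = 0.5) -> float:
    """Classic Kelly: f* = p - (1 - p) / b, scaled by a safety fraction and capped.

    win_prob        p: estimated probability the trade wins
    win_loss_ratio  b: average win size divided by average loss size
    """
    if win_loss_ratio <= 0:
        return 0.0
    f_star = win_prob - (1.0 - win_prob) / win_loss_ratio
    f_star = max(0.0, f_star) * fraction   # half-Kelly is a common hedge
    return min(f_star, cap)                # never risk more than the cap

# Example: 55% win rate at 1.5:1 payoff -> risk 12.5% of capital at half-Kelly.
print(kelly_fraction(0.55, 1.5))
```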
Qwen3.5-4B fine-tuned with ORPO for biblical Q&A. Hybrid RAG (ChromaDB + BM25 + cross-encoder reranking), constitutional AI guardrails, voice pipeline (Whisper + Kokoro TTS), Gradio UI. 183 tests, 34 W&B runs, 5,925 training steps.
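Hybrid retrieval in this style typically fuses lexical and dense candidate lists before a cross-encoder reranks the survivors. A sketch using Reciprocal Rank Fusion as the fusion step (the project's exact weighting and reranker are assumptions here):

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """rankings: lists of doc IDs, best first, one list per retriever."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking):
            scores[doc_id] += 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)

def retrieve(query, bm25_search, vector_search, cross_encoder, top_k=5):
    bm25_ids = bm25_search(query, n=20)      # lexical candidates (BM25)
    dense_ids = vector_search(query, n=20)   # semantic candidates (vector store)
    fused = reciprocal_rank_fusion([bm25_ids, dense_ids])[:20]
    # Cross-encoder scores (query, passage) pairs jointly for the final order.
    reranked = sorted(fused, key=lambda d: cross_encoder(query, d), reverse=True)
    return reranked[:top_k]
```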
Comprehensive GPU fleet validation modeled on NVIDIA DCGM. 16 diagnostic modules, Prometheus + Grafana, fault injection, JUnit XML for CI. 188 tests.
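Each diagnostic module follows the same shape: poll a hardware metric, export it for Prometheus, and flag limit violations. An illustrative check using NVML temperature polling (assumes the pynvml and prometheus_client packages; the real suite's checks and thresholds may differ):

```python
import time

import pynvml
from prometheus_client import Gauge, start_http_server

GPU_TEMP = Gauge("gpu_temperature_celsius", "GPU core temperature", ["gpu"])

def poll_temperatures(interval_s: float = 5.0, temp_limit: int = 85):
    pynvml.nvmlInit()
    count = pynvml.nvmlDeviceGetCount()
    start_http_server(9400)  # scrape target for Prometheus
    while True:
        for i in range(count):
            handle = pynvml.nvmlDeviceGetHandleByIndex(i)
            temp = pynvml.nvmlDeviceGetTemperature(handle, pynvml.NVML_TEMPERATURE_GPU)
            GPU_TEMP.labels(gpu=str(i)).set(temp)
            if temp > temp_limit:
                print(f"GPU {i}: {temp} C exceeds {temp_limit} C limit")
        time.sleep(interval_s)
```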
ML research control plane — experiment lifecycle management, model registry, cloud training launcher. Orchestrates gpu-server-test-suite (preflight checks) and llm-wiki (knowledge persistence). 28 tests, v0.1.0.
Git-backed knowledge base — Karpathy's LLM Wiki pattern. LangGraph ingest/query pipelines, instructor + Pydantic structured output, BM25 search, Groq → Gemini → Ollama fallback via LiteLLM. 117 tests, 40 wiki pages.
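The provider fallback is worth sketching: try each backend in order and return the first successful completion. The model identifiers below are placeholders, not the project's configuration; the litellm.completion call itself is the real API:

```python
from litellm import completion

FALLBACK_CHAIN = [
    "groq/llama-3.1-8b-instant",   # fast hosted tier
    "gemini/gemini-1.5-flash",     # second hosted tier
    "ollama/llama3.1",             # local last resort
]

def ask(messages: list[dict]) -> str:
    last_error = None
    for model in FALLBACK_CHAIN:
        try:
            resp = completion(model=model, messages=messages)
            return resp.choices[0].message.content
        except Exception as err:  # provider down, rate-limited, etc.
            last_error = err
    raise RuntimeError(f"all providers failed: {last_error}")
```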
SQL + Python ETL pipeline for semiconductor quality analysis — supplier performance scoring, defect Pareto distributions, yield trend analysis.
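The Pareto step ranks defect categories by count and reports each category's cumulative share, so the "vital few" driving most defects stand out. A minimal pandas sketch with assumed column names:

```python
import pandas as pd

def defect_pareto(df: pd.DataFrame) -> pd.DataFrame:
    counts = (
        df.groupby("defect_category")["defect_count"].sum()
          .sort_values(ascending=False)
          .to_frame("count")
    )
    counts["cum_pct"] = counts["count"].cumsum() / counts["count"].sum() * 100
    # The "vital few": categories that together explain 80% of defects.
    counts["vital_few"] = counts["cum_pct"] <= 80
    return counts
```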
Multi-model ML pipeline for Tesla tire wear prediction. Random Forest, XGBoost, Neural Network ensemble with Claude AI integration.
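The ensemble step is a straightforward weighted average of the individual regressors' predictions. An illustrative sketch with placeholder hyperparameters and weights:

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from xgboost import XGBRegressor

def fit_ensemble(X_train, y_train):
    rf = RandomForestRegressor(n_estimators=300, random_state=0).fit(X_train, y_train)
    xgb = XGBRegressor(n_estimators=300, learning_rate=0.05, random_state=0).fit(X_train, y_train)
    return rf, xgb

def predict_wear(models, X, weights=(0.5, 0.5)):
    preds = np.column_stack([m.predict(X) for m in models])
    return preds @ np.array(weights)  # weighted average of model outputs
```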
- llama.cpp #22897 — NVFP4 default type mapping + per-tensor scale tensors + MSE-optimal correction
- llama.cpp #22858 — Missing `LLAMA_FTYPE_MOSTLY_NVFP4` case fix (closed, replaced by #22897)
- Zyphra/ZAYA1-8B — Agentic fine-tuning to complete the model's post-training (SFT + GRPO)
| Area | Technologies |
|---|---|
| LLMs & Agents | LiteLLM, 200+ providers, Ollama, llama.cpp, multi-agent orchestration, ReAct loops |
| Fine-Tuning | Unsloth, TRL (SFT/DPO/GRPO/ORPO), QLoRA, PEFT, MoE architectures, RLHF/RLAIF |
| Inference | vLLM (custom forks), speculative decoding (750 tok/s), TensorRT-LLM, EXL2 |
| Quantization | NVFP4 (Blackwell-native), GGUF, EXL2, FP8, NF4, GPTQ, AWQ |
| ML Infrastructure | PyTorch, CUDA 12.8, torch.compile, DeepSpeed, lm-eval, W&B, MLflow |
| Systems | Python, Rust, TypeScript, Docker, GitHub Actions CI/CD, systemd |
| Edge / Hardware | NVIDIA Jetson Orin Nano, RTX 5070 Ti (Blackwell sm_120), 16 GB VRAM optimization |
| Data | PostgreSQL, SQL, pandas, SQLAlchemy, ChromaDB, LanceDB, BM25 |



