Local SLM (small language model) testing. No wrappers; direct hardware utilization via compiled binaries.
Hardware: Intel i7-1165G7 (Tiger Lake, AVX-512 capable), 12 GB RAM. See hardware-profile.json.
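To confirm the CPU actually exposes AVX-512 before building (Tiger Lake supports it, but some firmware disables it), a quick check like this works on Linux. The flag names below are whatever the kernel reports, not values taken from hardware-profile.json:

```sh
# List AVX-512 feature flags as reported by the kernel (Linux).
# Tiger Lake typically exposes avx512f, avx512bw, avx512vl, among others.
grep -o 'avx512[a-z0-9_]*' /proc/cpuinfo | sort -u
```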
llama.cpp compiled from source with AVX-512 flags enabled. See inference-config.json.
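A minimal sketch of the kind of build invocation implied here, assuming a current llama.cpp checkout with its CMake build; the flags actually used for these tests are recorded in inference-config.json and may differ:

```sh
# Hypothetical build sketch; see inference-config.json for the exact flags used.
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
# Enable AVX-512 explicitly instead of relying on -march=native autodetection.
cmake -B build -DGGML_AVX512=ON -DGGML_NATIVE=OFF
cmake --build build --config Release -j
```

Turning off GGML_NATIVE pins the instruction set explicitly, which keeps builds reproducible across kernels and compiler versions on the same machine.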
Tests are in no particular order; see the per-model .md files for test results.