Skip to content

steveseguin/Unofficial-Intel-XPU-Community

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Unofficial Intel XPU Community

Intel GPU notes for people trying to run local AI without losing a weekend to driver/runtime weirdness.

This is an unofficial community project. It is not owned by Intel.

Start Here

What This Community Is For

Intel Arc and XPU local AI is promising, but setup is still scattered across drivers, oneAPI, OpenVINO, PyTorch XPU, Level Zero, SYCL, vLLM, llama.cpp, Docker, and random forum posts.

This repo is meant to be the stable hub:

  • setup guides for Linux and Windows
  • Docker notes people can actually run
  • benchmark templates and comparable results
  • patch notes for vLLM, llama.cpp, OpenVINO, oneAPI, and SYCL
  • troubleshooting for drivers, PCIe topology, XPU visibility, and runtime mismatches
  • discussion categories for setup help, benchmarks, guides, patches, build photos, and research leads

Quick Links

I want to... Go here
Ask a setup question Discussions
Read community notes Wiki
Get started Getting Started
Check drivers/runtimes Drivers and Runtimes
Diagnose XPU visibility Diagnostics
Share benchmark results Benchmarks and Results
Try containers Docker and Containers
Understand 2x B70 use cases Two B70 Use Cases
See research status Current Research Snapshot
Deploy MiniMax on 4x B70 Ubuntu 24 recipe

Good First Post

If you need help, open a Discussion and include:

  • GPU model and count
  • Windows or Linux version
  • what you are trying to run
  • what guide or command you followed
  • what failed
  • logs or screenshots if you have them

If you are on Linux, attach output from:

bash scripts/collect_xpu_snapshot.sh

If you are on Windows PowerShell, attach output from:

.\scripts\collect_xpu_snapshot.ps1

What Counts As A Useful Benchmark

A useful result says more than "I got 80 tok/s."

Include:

  • model and quantization
  • GPU model/count
  • engine: vLLM, llama.cpp, OpenVINO, PyTorch, etc.
  • prompt length, output length, context length
  • batch size or concurrency
  • output tok/s and total tok/s if available
  • whether quality was checked
  • exact command or linked recipe

Use the benchmark template.

Current Focus

Near-term community priorities:

  • make B70/B-series setup reproducible
  • document Windows and Linux paths clearly
  • get Docker/container recipes that work outside one private machine
  • collect comparable vLLM, llama.cpp, OpenVINO, and PyTorch XPU results
  • track driver/runtime mismatches and fixes
  • promote stable recipes from the fast-moving B70 optimization lab

Current Working Recipe

The most complete current community reference is MiniMax M2.7 INT4 AutoRound on Ubuntu 24 with 4x Intel Arc Pro B70s:

Treat this as a working starting point, not the final speed ceiling.

Repo Map

Important Caveat

This is community research. Some notes will be experimental, incomplete, or wrong for your exact machine. Prefer posts with commands, versions, logs, and reproducible steps.

Do not post API keys, private model files, proprietary data, or copied vendor docs. Link to sources instead.

About

Unofficial community hub for Intel Arc Pro B70/B-series, Intel XPU, oneAPI, OpenVINO, PyTorch XPU, vLLM, llama.cpp, setup guides, benchmarks, and patches.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors