One-command vLLM installation for NVIDIA DGX Spark with Blackwell GB10 GPUs (sm_121 architecture)
Updated Oct 28, 2025 · Shell
Serve the home! Inference stack for your NVIDIA DGX Spark, aka the Grace Blackwell AI supercomputer on your desk. Mostly vLLM-based for now, single-Spark only. For the not-so-rich among us.
Local diagnostic CLI for NVIDIA DGX Spark (GB10). Detects power caps, unified memory pressure, thermal risk, Docker/runtime issues, and validates vLLM/Ollama/llama.cpp/SGLang recipes.
Headless remote desktop to your DGX Spark in crystal-clear 4K
Single-file web UI for NVIDIA DGX Spark — pull Ollama models, browse and download from HuggingFace, manage LiteLLM routing, and control SGLang, vLLM, llama.cpp, LocalAI, and ComfyUI. All from one browser tab.
Turn any NVIDIA GPU into a local AI platform. Inference + fine-tuning in your browser. One command to start, automatic clustering.
Operator-grade GPU monitor for NVIDIA GPUs with native GB10 / DGX Spark coherent UMA support — PSI pressure, clock detection, ConnectX-7 network layer
(Experimental) A high-throughput and memory-efficient inference and serving engine for LLMs on DGX Spark / GB10
GPU/CUDA-accelerated voice control stack for Home Assistant. Runs on x86/x64 and ARM64 (including the NVIDIA DGX Spark). 100% Local - No Cloud, No Subscriptions.
SGLang optimizations for NVIDIA Spark (GB10) — SM121 Grace Blackwell
llama.cpp fork optimized for NVIDIA DGX Spark / GB10 (Blackwell, SM 12.1) — TurboQuant weights + KV, NVFP4, DFlash MTP
Enhanced GPU throttle diagnostic for DGX Spark (GB10): NVML direct telemetry, throttle cause decoder, PCIe link monitoring, baseline drift detection, timeline capture.
DGX Spark (GB10/SM121) platform support for Meta's KernelAgent — auto-detect, hardware constraints, safe Triton configs
Pre-built PyTorch wheels and build scripts for NVIDIA DGX Spark (GB10, sm_121, Blackwell, CUDA 13.0, ARM64)
Run GPT-OSS 120B on NVIDIA DGX Spark with vLLM, build an API server, and create a local AI coding assistant
An ARM64 port of Unsloth for fine-tuning on DGX Spark-class hardware.
Solo-built agentic AI ecosystem from Switzerland on a 100W NVIDIA GB10 Blackwell desktop supercomputer. Cognitive robotics (Unitree Go2 + Isaac Sim 5.1 + RL PPO + GR00T N1.7), local-first BI (DuckDB + LLM NL→SQL), and LLM-reasoning EDR cybersecurity. Showcase: articles, technical docs, demo videos.