megakernel

Here are 5 public repositories matching this topic...

Luce-Org / lucebox-hub

Fast LLM speculative inference server for consumer hardware.

kernel cuda cuda-kernels nvidia-cuda luce rtx3090 llama-cpp local-ai qwen speculative-decoding dflash megakernel speculative-prefill pflash lucebox

Updated Jun 12, 2026
C++

RightNow-AI / AutoMegaKernel

Sponsor

Star

An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.

machine-learning gpu cuda gpu-programming kernel-fusion mlsys llm-inference agent-harness megakernel

Updated Jun 8, 2026
Python

SunayHegde2006 / Air.rs

Star

Air.rs 70B+ inference on consumer GPU, LLM inference in Rust

open-source kernel inference lora instruction-set nvidia-cuda open-models apple-silicon llama-cpp ggml qlora local-ai megakernel

Updated May 30, 2026
Rust

BoundlessWindMoon / minivllm

Star

A light, transparent, and modular inference & quantization engine for studying LLMs.

framework inference awq multi-backends quantum-kernel cuda-graph megakernel

Updated Jun 4, 2026
Cuda

bhupinders / qwen3_tts_megakernel

Star

Qwen3-TTS inference with CUDA megakernels

python agent cuda tts llm pipecat qwen qwen3 megakernel

Updated May 30, 2026
Python

Improve this page

Add a description, image, and links to the megakernel topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the megakernel topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

megakernel

Here are 5 public repositories matching this topic...

Luce-Org / lucebox-hub

RightNow-AI / AutoMegaKernel

SunayHegde2006 / Air.rs

BoundlessWindMoon / minivllm

bhupinders / qwen3_tts_megakernel

Improve this page

Add this topic to your repo