Releases: brontoguana/krasis

v0.1.66-rc7

27 Apr 16:14

v0.1.66-rc7 Pre-release

Pre-release candidate with native HQQ attention modes and release CI sidecar packaging fixes.

v0.1.66-rc6

27 Apr 14:17

v0.1.66-rc6 Pre-release

Pre-release candidate with native HQQ attention modes, HQQ8 launcher default, HQQ4SC installer option, llama-witness sequence validation support, and release CI fixes for CUDA sidecars and Blackwell FLA sidecars.

v0.1.66-rc5

27 Apr 14:13

v0.1.66-rc5 Pre-release

Pre-release candidate with native HQQ attention modes, HQQ8 launcher default, HQQ4SC installer option, llama-witness sequence validation support, and release CI fixes for CUDA sidecars and Blackwell FLA sidecars.

v0.1.66-rc4

27 Apr 14:09

v0.1.66-rc4 Pre-release

Pre-release candidate with native HQQ attention modes, HQQ8 launcher default, HQQ4SC installer option, llama-witness sequence validation support, and release CI sidecar packaging fixes.

v0.1.66-rc3

27 Apr 13:55

v0.1.66-rc3 Pre-release

Pre-release candidate with native HQQ attention modes, HQQ8 launcher default, HQQ4SC installer option, llama-witness sequence validation support, and release CI CUDA driver-stub linking fixes.

v0.1.66-rc2

18 Apr 05:34

v0.1.66-rc2 Pre-release

Recreated rc2 on commit 418e3ce after switching the manylinux FLA link step to the resolved CUDA stub file path directly.

v0.1.66-rc1

07 Apr 23:06

v0.1.66-rc1 Pre-release

Pre-release for multi-GPU testing.

Changes since v0.1.65-rc6:

  • 122B FLA fix: multi-H cubins and scratch buffer sizing
  • Cross-compiled FLA kernels for sm80/sm89/sm90/sm120
  • FLA kernel arg signature and block size fix
  • Arch-specific FLA .so files ship in wheel (no first-run JIT)
  • GPU arch auto-detection with forward/backward compatibility fallback
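
The release notes do not describe how the arch fallback is actually implemented. As a rough illustration only, the sketch below picks a prebuilt binary for a detected compute capability using the standard CUDA binary-compatibility rule (a cubin runs on devices with the same major version and an equal or higher minor version). The function name `pick_prebuilt_arch` and the `SHIPPED_ARCHS` list are hypothetical; only the four arch names come from the notes above, and any cross-major fallback the project may perform is not modeled here.

```python
from typing import List, Optional, Tuple

# Architectures named in the release notes (sm80/sm89/sm90/sm120),
# written as (major, minor) compute capabilities. Hypothetical constant.
SHIPPED_ARCHS: List[Tuple[int, int]] = [(8, 0), (8, 9), (9, 0), (12, 0)]


def pick_prebuilt_arch(
    device_cc: Tuple[int, int],
    shipped: List[Tuple[int, int]] = SHIPPED_ARCHS,
) -> Optional[str]:
    """Return the 'smXY' tag of the best prebuilt binary for device_cc, or None.

    Assumed policy (not taken from the project): use the newest shipped arch
    that has the same major version as the device and a minor version that
    is not newer than the device's.
    """
    major, minor = device_cc
    candidates = [cc for cc in shipped if cc[0] == major and cc[1] <= minor]
    if not candidates:
        # No binary-compatible prebuilt kernel; the caller must handle this.
        return None
    best = max(candidates)
    return f"sm{best[0]}{best[1]}"
```

Under this assumed policy, a device reporting compute capability 8.6 would load the sm80 build, while a 7.5 device would get `None` and need some other path.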

v0.1.65-rc6

29 Mar 22:22

v0.1.65-rc6 Pre-release

Pre-release with installed-package sidecar fixes and an FP8-only KV cache on Ampere.

v0.1.65-rc5

29 Mar 22:11

v0.1.65-rc5 Pre-release

Pre-release with vendored CUDA sidecars injected into release wheels, force-reinstall handling in the pre-release installer, and an FP8-only KV cache on Ampere and in the interactive launcher.

v0.1.65-rc4

29 Mar 22:03

v0.1.65-rc4 Pre-release

Pre-release with release-wheel sidecar injection, force-reinstall handling in the pre-release installer, and an FP8-only KV cache on Ampere and in the interactive launcher.