S₀ Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models
lora mamba hybrid-model fine-tuning state-space-model peft humaneval qwen gated-delta-net recurrent-state
-
Updated
Apr 8, 2026 - Python