Skip to content

Pinned Loading

  1. aurora-release aurora-release Public

    Aurora optimizer release

    Python 146 6

  2. nsa-release nsa-release Public

    An efficient implementation of the NSA (Native Sparse Attention) kernel

    Python 135 6

  3. momoe-release momoe-release Public

    Memory optimized Mixture of Experts

    Python 78 7

  4. nitrobrew-release nitrobrew-release Public

    Fused KL divergence from hidden states for knowledge distillation

    Python 18

Repositories

Showing 10 of 11 repositories

Top languages

Loading…

Most used topics

Loading…