Training models with ternary quantized weights using PyTorch
Updated Jun 12, 2019 - Python
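As a rough illustration of what ternary weight training looks like in PyTorch, here is a minimal sketch. It uses the Ternary Weight Networks heuristic (threshold at 0.7× the mean absolute weight, scale α from the surviving magnitudes) with a straight-through estimator; the function and class names are my own, not from any repo listed here.

```python
import torch

def ternarize(w: torch.Tensor, threshold_factor: float = 0.7) -> torch.Tensor:
    """Quantize weights to {-alpha, 0, +alpha} (TWN-style heuristic)."""
    delta = threshold_factor * w.abs().mean()                 # pruning threshold
    mask = (w.abs() > delta).float()                          # keep large weights
    alpha = (w.abs() * mask).sum() / mask.sum().clamp(min=1)  # per-tensor scale
    return alpha * torch.sign(w) * mask

class TernaryLinear(torch.nn.Linear):
    """Linear layer whose forward pass uses ternarized weights; the
    straight-through estimator lets gradients update the full-precision
    shadow weights."""
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        w_q = ternarize(self.weight)
        # forward sees w_q, backward flows through self.weight unchanged
        w_ste = self.weight + (w_q - self.weight).detach()
        return torch.nn.functional.linear(x, w_ste, self.bias)
```

In training, the full-precision weights are kept as the optimizer state and only the ternarized copies are used in the forward pass.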
Colab-friendly BitNet distillation engine: collect knowledge-distillation (KD) traces from a teacher model, train a ternary Mini-BitNet student, and dry-run the memory footprint of a 7B-scale model. Supports multiple model providers plus Google Drive/S3 storage.
PILON (Primitive-Induced Linear Operator Network) explores a compositional weight parameterization for transformer FFN layers. The goal is to replace dense FFN matrices with shared low-rank primitives plus learned composition weights.
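The description above suggests a weight parameterization of the form W = Σ_k c_k · U_k V_kᵀ. The following is a hypothetical sketch of that idea, assuming a shared bank of low-rank primitives and per-layer composition weights; all names, shapes, and the activation choice are my assumptions, not taken from the PILON repository.

```python
import torch

class PrimitiveFFN(torch.nn.Module):
    """Hypothetical compositional FFN: the dense (d_model x d_ff) matrix is
    replaced by a mixture of shared low-rank primitives U_k @ V_k, combined
    with learned per-layer composition weights c_k. (Sketch only; in a real
    model the primitive bank would be shared across layers.)"""

    def __init__(self, d_model: int, d_ff: int, n_primitives: int = 8, rank: int = 4):
        super().__init__()
        # shared low-rank primitive bank
        self.U = torch.nn.Parameter(torch.randn(n_primitives, d_model, rank) * 0.02)
        self.V = torch.nn.Parameter(torch.randn(n_primitives, rank, d_ff) * 0.02)
        # learned composition weights for this layer
        self.c = torch.nn.Parameter(torch.full((n_primitives,), 1.0 / n_primitives))
        self.proj = torch.nn.Linear(d_ff, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # W = sum_k c_k * (U_k @ V_k), shape (d_model, d_ff)
        W = torch.einsum("k,kdr,krf->df", self.c, self.U, self.V)
        return self.proj(torch.nn.functional.gelu(x @ W))
```

With n_primitives · rank · (d_model + d_ff) parameters per bank instead of d_model · d_ff per layer, the parameterization can be much smaller when the bank is shared.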