- Implementation of Mistral 7b using PyTorch
- Look at the model params
Sliding window attention - Rolling buffer cache - Prefill and chunking - MoE -
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Sliding window attention - Rolling buffer cache - Prefill and chunking - MoE -