First of all, thank you for the great work on this project!
I was wondering if it might be possible to share or release the 2-bit, 3-bit, and 4-bit quantized versions of the LLaMA 3.1-70B and LLaMA 2-70B models. Having these available would be incredibly helpful for my usage.
Thank you very much for your time and for sharing your excellent work with the community!
First of all, thank you for the great work on this project!
I was wondering if it might be possible to share or release the 2-bit, 3-bit, and 4-bit quantized versions of the LLaMA 3.1-70B and LLaMA 2-70B models. Having these available would be incredibly helpful for my usage.
Thank you very much for your time and for sharing your excellent work with the community!