Experimental! https://github.com/ggerganov/llama.cpp/pull/3586