https://github.com/qwopqwop200/GPTQ-for-LLaMa GPTQ quantization is far more accurate.
https://github.com/qwopqwop200/GPTQ-for-LLaMa
GPTQ quantization is far more accurate.