HuggingFace_transformer/docs/source/en
Vivek Khandelwal 2963e196ee Add support for loading GPTQ models on CPU (#26719)
* Add support for loading GPTQ models on CPU

Currently, GPTQ-quantized models can only be loaded on a CUDA
device. The new `gptq_supports_cpu` attribute checks whether the
installed auto_gptq version supports loading the model on the CPU.
The larger variants of the model are hard to load, run, or trace on
the GPU, which is the rationale for adding this attribute.

Signed-off-by: Vivek Khandelwal <vivek@nod-labs.com>

* Update quantization.md

* Update quantization.md

* Update quantization.md
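
The version check described in the commit message could look roughly like the sketch below. This is an illustration, not the code from the PR: the function names are hypothetical, and the `0.4.2` threshold (the last release assumed to be CUDA-only) is an assumption that the commit message does not state. Only the standard library is used, so pre-release and local version suffixes are simply ignored rather than parsed precisely.

```python
import importlib.metadata
import re


def parse_release(version_string):
    """Return the leading numeric release segment as a tuple of ints.

    Suffixes such as '.dev0' or '+cu118' are dropped, so '0.5.0.dev0'
    becomes (0, 5, 0).
    """
    match = re.match(r"\d+(?:\.\d+)*", version_string)
    if match is None:
        raise ValueError(f"unrecognized version string: {version_string!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def supports_cpu(installed, last_cuda_only="0.4.2"):
    """Return True if `installed` is newer than `last_cuda_only`.

    ASSUMPTION: '0.4.2' as the last CUDA-only release is a placeholder
    chosen for this sketch; check the library's release notes.
    """
    a = parse_release(installed)
    b = parse_release(last_cuda_only)
    # Pad the shorter tuple with zeros so '0.4' compares as '0.4.0'.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a > b


def installed_gptq_supports_cpu(package="auto-gptq"):
    """Check the installed distribution; False if it is not installed."""
    try:
        installed = importlib.metadata.version(package)
    except importlib.metadata.PackageNotFoundError:
        return False
    return supports_cpu(installed)
```

A caller could store the result once (for example as a `gptq_supports_cpu` attribute) and consult it before deciding whether a CUDA device is required to load the quantized model.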
2023-10-31 13:45:23 +00:00