Vivek Khandelwal
2963e196ee
Add support for loading GPTQ models on CPU ( #26719 )
...
* Add support for loading GPTQ models on CPU
Right now, we can only load the GPTQ Quantized model on the CUDA
device. The attribute `gptq_supports_cpu` checks if the current
auto_gptq version is the one which has the cpu support for the
model or not.
The larger variants of the model are hard to load/run/trace on
the GPU and that's the rationale behind adding this attribute.
Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com >
* Update quantization.md
* Update quantization.md
* Update quantization.md
2023-10-31 13:45:23 +00:00
..
2023-08-25 11:56:17 +01:00
2023-10-31 13:45:23 +00:00
2023-10-31 11:21:02 +00:00
2023-10-23 14:49:48 +02:00
2023-10-30 21:42:19 +01:00
2023-09-04 11:15:12 +01:00
2023-09-25 12:58:11 +02:00
2023-09-19 00:41:16 +02:00
2023-09-18 13:33:01 +02:00
2023-06-20 18:07:47 -04:00
2023-09-04 11:16:49 +01:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-09-04 11:15:12 +01:00
2023-10-06 12:04:40 +01:00
2023-09-04 11:16:34 +01:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:15:12 +01:00
2023-09-05 12:27:20 +01:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-09-29 09:43:39 +02:00
2023-10-24 13:10:06 -04:00
2023-09-12 18:01:22 +01:00
2023-10-30 13:32:17 +01:00
2023-09-29 09:42:21 +02:00
2023-10-16 16:09:50 +02:00
2023-10-19 16:53:28 +01:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:15:12 +01:00
2023-06-20 18:07:47 -04:00
2023-08-18 07:58:59 +02:00
2023-06-20 18:07:47 -04:00
2023-08-18 19:08:03 +02:00
2023-07-25 07:44:24 -04:00
2023-06-20 18:07:47 -04:00
2023-09-22 17:42:10 +02:00
2023-10-31 11:21:02 +00:00
2023-06-20 18:07:47 -04:00
2023-08-10 13:25:00 +02:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-10-24 13:10:06 -04:00
2023-09-22 17:42:10 +02:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-07-24 08:57:24 -04:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-09-29 18:32:37 +01:00
2023-08-18 10:17:44 +02:00
2023-10-04 15:13:37 +02:00
2023-10-20 11:16:37 +02:00
2023-08-16 08:03:23 +02:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-07-25 22:10:06 +02:00
2023-06-20 18:07:47 -04:00
2023-10-24 16:49:26 +02:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-08-03 14:17:30 -07:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-09-05 12:27:20 +01:00
2023-06-20 18:07:47 -04:00