HuggingFace_transformer/docs/source/en/main_classes at 2963e196ee1fb9ab2c677221ca5e135568604662 - HuggingFace_transformer - Gitea: Git with SSUM

SUMIN/HuggingFace_transformer

Files

History

Vivek Khandelwal 2963e196ee Add support for loading GPTQ models on CPU (#26719 )

* Add support for loading GPTQ models on CPU

Right now, we can only load the GPTQ Quantized model on the CUDA
device. The attribute `gptq_supports_cpu` checks if the current
auto_gptq version is the one which has the cpu support for the
model or not.
The larger variants of the model are hard to load/run/trace on
the GPU and that's the rationale behind adding this attribute.

Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>

* Update quantization.md

* Update quantization.md

* Update quantization.md

2023-10-31 13:45:23 +00:00

..

agent.md

[doc] Always call it Agents for consistency (#25958 )

2023-09-05 12:27:20 +01:00

callback.md

Update docs to explain disabling callbacks using report_to (#26155 )

2023-10-11 07:50:23 -04:00

configuration.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

data_collator.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

deepspeed.md

Fix Typo: table in deepspeed.md (#26705 )

2023-10-10 11:50:10 +02:00

feature_extractor.md

Fixed typos (#26810 )

2023-10-16 09:52:29 +02:00

image_processor.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

keras_callbacks.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

logging.md

Warnings controlled by logger level (#26527 )

2023-10-12 10:48:38 +02:00

model.md

Fix typo 'submosules' (#24809 )

2023-07-13 16:56:53 +01:00

onnx.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

optimizer_schedules.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

output.md

Translating en/main_classes folder docs to Japanese 🇯🇵 (#26894 )

2023-10-30 09:39:14 -07:00

pipelines.md

[docs] Add MaskGenerationPipeline in docs (#27063 )

2023-10-25 19:31:36 +02:00

processors.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

quantization.md

Add support for loading GPTQ models on CPU (#26719 )

2023-10-31 13:45:23 +00:00

text_generation.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

tokenizer.md

Tweaks to Chat Templates docs (#26168 )

2023-09-15 12:50:57 +01:00

trainer.md

Translating en/main_classes folder docs to Japanese 🇯🇵 (#26894 )

2023-10-30 09:39:14 -07:00