HuggingFace_transformer/docs/source/en at 2963e196ee1fb9ab2c677221ca5e135568604662 - HuggingFace_transformer - Gitea: Git with SSUM

SUMIN/HuggingFace_transformer

Files

History

Vivek Khandelwal 2963e196ee Add support for loading GPTQ models on CPU (#26719 )

* Add support for loading GPTQ models on CPU

Right now, we can only load the GPTQ Quantized model on the CUDA
device. The attribute `gptq_supports_cpu` checks if the current
auto_gptq version is the one which has the cpu support for the
model or not.
The larger variants of the model are hard to load/run/trace on
the GPU and that's the rationale behind adding this attribute.

Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>

* Update quantization.md

* Update quantization.md

* Update quantization.md

2023-10-31 13:45:23 +00:00

..

Generate: add missing logits processors docs (#25653 )

2023-08-25 11:56:17 +01:00

Add support for loading GPTQ models on CPU (#26719 )

2023-10-31 13:45:23 +00:00

Add flash attention for gpt_bigcode (#26479 )

2023-10-31 11:21:02 +00:00

Add Seamless M4T model (#25693 )

2023-10-23 14:49:48 +02:00

_config.py

…

_toctree.yml

[KOSMOS-2] Update docs (#27157 )

2023-10-30 21:42:19 +01:00

accelerate.md

Fix typos (#25936 )

2023-09-04 11:15:12 +01:00

add_new_model.md

Update add_new_model.md (#26365 )

2023-09-25 12:58:11 +02:00

add_new_pipeline.md

Update add_new_pipeline.md (#26197 )

2023-09-19 00:41:16 +02:00

add_tensorflow_model.md

Remove utils/documentation_tests.txt (#26213 )

2023-09-18 13:33:01 +02:00

attention.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

autoclass_tutorial.md

Update autoclass_tutorial.md (#25929 )

2023-09-04 11:16:49 +01:00

benchmarks.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

bertology.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

big_models.md

Fix typos (#25936 )

2023-09-04 11:15:12 +01:00

chat_templating.md

Update chat template docs with more tips on writing a template (#26625 )

2023-10-06 12:04:40 +01:00

community.md

Update community.md (#25928 )

2023-09-04 11:16:34 +01:00

contributing.md

…

create_a_model.md

Fix typos (#25936 )

2023-09-04 11:15:12 +01:00

custom_models.md

Fix typos (#25936 )

2023-09-04 11:15:12 +01:00

custom_tools.md

[doc] Always call it Agents for consistency (#25958 )

2023-09-05 12:27:20 +01:00

debugging.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

fast_tokenizers.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

generation_strategies.md

[docs] navigation improvement between text gen pipelines and text gen params (#26477 )

2023-09-29 09:43:39 +02:00

glossary.md

[docs] Performance docs refactor p.2 (#26791 )

2023-10-24 13:10:06 -04:00

hpo_train.md

enable optuna multi-objectives feature (#25969 )

2023-09-12 18:01:22 +01:00

index.md

Add Kosmos-2 model (#24709 )

2023-10-30 13:32:17 +01:00

installation.md

[docs] Update offline mode docs (#26478 )

2023-09-29 09:42:21 +02:00

llm_tutorial_optimization.md

Add LLM doc (#26058 )

2023-10-16 16:09:50 +02:00

llm_tutorial.md

Generate: update basic llm tutorial (#26937 )

2023-10-19 16:53:28 +01:00

model_memory_anatomy.md

Fix typos (#25936 )

2023-09-04 11:15:12 +01:00

model_sharing.md

Fix typos (#25936 )

2023-09-04 11:15:12 +01:00

model_summary.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

multilingual.md

Fix typo in example code (#25583 )

2023-08-18 07:58:59 +02:00

notebooks.md

…

pad_truncation.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

peft.md

[PEFT] Peft integration alternative design (#25077 )

2023-08-18 19:08:03 +02:00

perf_hardware.md

🌐 [i18n-KO] Translated perf_hardware.md to Korean (#24966 )

2023-07-25 07:44:24 -04:00

perf_infer_cpu.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

perf_infer_gpu_many.md

[core ] Integrate Flash attention 2 in most used models (#25598 )

2023-09-22 17:42:10 +02:00

perf_infer_gpu_one.md

Add flash attention for gpt_bigcode (#26479 )

2023-10-31 11:21:02 +00:00

perf_infer_special.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

perf_torch_compile.md

Fix rendering for torch.compile() docs (#25432 )

2023-08-10 13:25:00 +02:00

perf_train_cpu_many.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

perf_train_cpu.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

perf_train_gpu_many.md

[docs] Performance docs refactor p.2 (#26791 )

2023-10-24 13:10:06 -04:00

perf_train_gpu_one.md

[core ] Integrate Flash attention 2 in most used models (#25598 )

2023-09-22 17:42:10 +02:00

perf_train_special.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

perf_train_tpu_tf.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

perf_train_tpu.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

performance.md

[docs] Performance docs tidy up, part 1 (#23963 )

2023-07-24 08:57:24 -04:00

perplexity.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

philosophy.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

pipeline_tutorial.md

[ASR Pipe] Improve docs and error messages (#26476 )

2023-09-29 18:32:37 +01:00

pipeline_webserver.md

Suggestions on Pipeline_webserver (#25570 )

2023-08-18 10:17:44 +02:00

pr_checks.md

Docstring check (#26052 )

2023-10-04 15:13:37 +02:00

preprocessing.md

fix set_transform link docs (#26856 )

2023-10-20 11:16:37 +02:00

quicktour.md

[TYPO] fix typo/format in quicktour.md (#25519 )

2023-08-16 08:03:23 +02:00

run_scripts.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

sagemaker.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

serialization.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

task_summary.md

Fix doctest (#25031 )

2023-07-25 22:10:06 +02:00

tasks_explained.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

testing.md

Device agnostic testing (#25870 )

2023-10-24 16:49:26 +02:00

tf_xla.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

tflite.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

tokenizer_summary.md

Fix typo: Roberta -> RoBERTa (#25302 )

2023-08-03 14:17:30 -07:00

torchscript.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

training.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00

transformers_agents.md

[doc] Always call it Agents for consistency (#25958 )

2023-09-05 12:27:20 +01:00

troubleshooting.md

Migrate doc files to Markdown. (#24376 )

2023-06-20 18:07:47 -04:00