Files
HuggingFace_transformer/docs/source/en
Jerry Zhang 78d78cdf8a Add TorchAOHfQuantizer (#32306)
* Add TorchAOHfQuantizer

Summary:
Enable loading torchao quantized model in huggingface.

Test Plan:
local test

Reviewers:

Subscribers:

Tasks:

Tags:

* Fix a few issues

* style

* Added tests and addressed some comments about dtype conversion

* fix torch_dtype warning message

* fix tests

* style

* TorchAOConfig -> TorchAoConfig

* enable offload + fix memory with multi-gpu

* update torchao version requirement to 0.4.0

* better comments

* add torch.compile to torchao README, add perf number link

---------

Co-authored-by: Marc Sun <marc@huggingface.co>
2024-08-14 16:14:24 +02:00
..
2024-08-08 13:43:14 -07:00
2024-08-14 16:14:24 +02:00
2024-08-07 11:42:52 +02:00
2024-07-08 11:52:47 +01:00
2023-12-20 10:37:23 -08:00
2024-07-08 11:52:47 +01:00
2023-11-13 14:20:54 +01:00
2024-08-12 08:22:47 +02:00
2024-08-06 10:24:19 +05:00
2024-04-18 12:49:43 -04:00
2024-07-08 11:52:47 +01:00
2024-08-08 15:47:24 +02:00
2024-06-12 11:33:00 +01:00