Jerry Zhang
78d78cdf8a
Add TorchAOHfQuantizer ( #32306 )
...
* Add TorchAOHfQuantizer
Summary:
Enable loading torchao quantized model in huggingface.
Test Plan:
local test
Reviewers:
Subscribers:
Tasks:
Tags:
* Fix a few issues
* style
* Added tests and addressed some comments about dtype conversion
* fix torch_dtype warning message
* fix tests
* style
* TorchAOConfig -> TorchAoConfig
* enable offload + fix memory with multi-gpu
* update torchao version requirement to 0.4.0
* better comments
* add torch.compile to torchao README, add perf number link
---------
Co-authored-by: Marc Sun <marc@huggingface.co >
2024-08-14 16:14:24 +02:00
..
2024-08-07 10:03:05 +05:00
2024-08-14 16:14:24 +02:00
2024-08-12 20:20:17 +01:00
2024-08-14 16:14:24 +02:00
2024-08-08 13:43:14 -07:00
2024-04-08 14:21:16 +01:00
2024-05-28 18:29:22 +02:00
2024-08-14 16:14:24 +02:00
2024-04-24 09:38:18 +02:00
2024-04-16 15:34:04 +01:00
2024-08-07 11:42:52 +02:00
2024-02-08 14:13:35 -08:00
2024-02-16 08:16:58 +01:00
2024-02-16 08:16:58 +01:00
2024-04-01 18:47:32 -07:00
2024-08-12 16:20:14 +01:00
2024-02-16 08:16:58 +01:00
2024-07-23 17:47:51 +01:00
2024-06-06 22:02:38 +01:00
2024-02-12 10:48:31 -08:00
2024-02-02 08:45:00 +01:00
2024-07-08 11:52:47 +01:00
2023-12-20 10:37:23 -08:00
2024-08-07 16:34:46 +01:00
2024-06-03 14:55:10 +01:00
2024-07-08 11:52:47 +01:00
2023-11-13 14:20:54 +01:00
2024-08-12 08:22:47 +02:00
2024-05-29 11:55:43 +01:00
2024-08-06 10:24:19 +05:00
2024-07-29 10:52:13 +01:00
2024-07-08 11:52:47 +01:00
2024-04-30 18:14:12 +01:00
2024-04-18 12:49:43 -04:00
2024-07-30 15:49:14 +01:00
2024-03-23 18:29:39 -07:00
2024-02-16 08:16:58 +01:00
2023-12-08 10:32:18 -08:00
2024-05-30 16:47:35 +02:00
2024-07-08 11:52:47 +01:00
2024-02-02 08:45:00 +01:00
2024-08-08 15:47:24 +02:00
2024-07-29 10:50:43 +01:00
2024-02-16 08:16:58 +01:00
2024-02-16 08:16:58 +01:00
2024-06-18 11:00:26 -07:00
2024-07-04 13:20:49 -04:00
2024-02-16 08:16:58 +01:00
2023-10-31 09:44:51 -07:00
2024-02-16 08:16:58 +01:00
2023-11-06 19:45:03 +00:00
2024-07-05 17:21:50 +01:00
2024-02-16 08:16:58 +01:00
2024-02-02 08:45:00 +01:00
2024-07-09 10:38:29 +01:00
2024-06-12 11:33:00 +01:00
2024-04-29 10:57:51 +01:00
2023-11-06 19:45:03 +00:00
2024-02-16 08:16:58 +01:00
2024-04-16 11:58:55 +02:00
2024-02-26 08:18:15 -08:00
2024-08-07 11:01:33 -07:00
2024-07-29 10:50:43 +01:00
2024-02-16 08:16:58 +01:00
2024-06-03 16:52:23 -07:00
2024-02-16 08:16:58 +01:00
2024-08-13 13:20:28 +01:00
2024-05-14 18:45:06 +01:00
2024-02-16 08:16:58 +01:00