Andrei Panferov
64c05eecd6
HIGGS Quantization Support (#34997)
* higgs init
* working with crunches
* per-model workspaces
* style
* style 2
* tests and style
* higgs tests passing
* protecting torch import
* removed torch.Tensor type annotations
* torch.nn.Module inheritance fix maybe
* hide inputs inside quantizer calls
* style structure something
* Update src/transformers/quantizers/quantizer_higgs.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* reworked num_sms
* Update src/transformers/integrations/higgs.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* revamped device checks
* docstring upd
* Update src/transformers/quantizers/quantizer_higgs.py
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* edited tests and device map assertions
* minor edits
* updated flute cuda version in docker
* Added p=1 and 2,3bit HIGGS
* flute version check update
* incorporated `modules_to_not_convert`
* less hardcoding
* Fixed comment
* Added docs
* Fixed gemma support
* example in docs
* fixed torch_dtype for HIGGS
* Update docs/source/en/quantization/higgs.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Collection link
* dequantize interface
* newer flute version, torch.compile support
* unittest message fix
* docs update compile
* isort
* ValueError instead of assert
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
2024-12-23 16:54:49 +01:00
..
2024-12-20 12:08:12 +01:00
2024-12-23 16:54:49 +01:00
2024-12-19 14:45:52 -08:00
2024-12-23 16:54:49 +01:00
2024-12-20 09:17:28 -08:00
2024-11-28 16:04:05 +01:00
2024-05-28 18:29:22 +02:00
2024-12-23 16:54:49 +01:00
2024-09-09 10:47:24 +02:00
2024-09-27 17:15:13 +02:00
2024-12-20 09:22:44 -08:00
2024-12-02 15:26:34 +00:00
2024-12-02 15:26:34 +00:00
2024-02-08 14:13:35 -08:00
2024-12-04 09:18:44 -08:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-04-01 18:47:32 -07:00
2024-12-12 16:53:21 -08:00
2024-09-09 10:47:24 +02:00
2024-07-23 17:47:51 +01:00
2024-06-06 22:02:38 +01:00
2024-08-26 13:15:43 +02:00
2024-09-09 10:47:24 +02:00
2024-07-08 11:52:47 +01:00
2023-06-20 18:07:47 -04:00
2023-12-20 10:37:23 -08:00
2024-12-05 17:07:33 +01:00
2024-11-21 11:37:34 +01:00
2024-07-08 11:52:47 +01:00
2024-10-07 10:08:20 +02:00
2024-10-02 14:08:46 +01:00
2024-12-19 14:03:35 +01:00
2024-11-26 09:37:18 -08:00
2024-12-04 07:48:34 -08:00
2024-12-20 09:45:53 +01:00
2024-11-27 07:47:28 -08:00
2024-10-02 09:12:21 +01:00
2024-09-24 03:40:56 -06:00
2024-11-11 07:09:31 -08:00
2024-03-23 18:29:39 -07:00
2024-12-20 09:16:02 -08:00
2024-02-16 08:16:58 +01:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-12-04 09:18:44 -08:00
2024-12-17 09:33:50 -08:00
2024-12-19 14:03:35 +01:00
2024-12-03 10:53:45 -08:00
2024-11-26 09:23:44 -08:00
2024-11-26 09:23:44 -08:00
2024-12-02 15:26:34 +00:00
2024-09-25 19:20:51 +02:00
2024-02-16 08:16:58 +01:00
2024-09-09 10:47:24 +02:00
2024-11-18 19:51:49 +01:00
2024-11-26 09:23:34 -08:00
2023-11-06 19:45:03 +00:00
2024-12-04 09:18:44 -08:00
2024-02-16 08:16:58 +01:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-12-04 09:18:44 -08:00
2024-09-12 10:16:12 -07:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-04-16 11:58:55 +02:00
2024-11-18 18:42:28 +00:00
2024-11-18 09:59:11 -08:00
2024-07-29 10:50:43 +01:00
2024-02-16 08:16:58 +01:00
2024-11-25 18:44:09 +01:00
2024-06-03 16:52:23 -07:00
2024-09-09 10:47:24 +02:00
2024-11-18 18:41:12 +00:00
2024-12-04 09:18:44 -08:00
2024-02-16 08:16:58 +01:00