Mohamed Mekkouri
efe72fe21f
Adding FP8 Quantization to transformers (#36026)
* first commit
* adding kernels
* fix create_quantized_param
* fix quantization logic
* end2end
* fix style
* fix imports
* fix consistency
* update
* fix style
* update
* udpate after review
* make style
* update
* update
* fix
* update
* fix docstring
* update
* update after review
* update
* fix scheme
* update
* update
* fix
* update
* fix docstring
* add source
* fix test
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
2025-02-13 13:01:19 +01:00
..
2024-09-18 11:07:51 +02:00
2024-07-17 20:24:10 +01:00
2024-07-05 08:13:46 +02:00
2023-06-20 18:07:47 -04:00
2025-02-13 12:01:28 +01:00
2025-02-05 08:19:31 -08:00
2024-10-11 14:38:35 +02:00
2023-10-16 09:52:29 +02:00
2024-12-15 14:00:36 -05:00
2023-06-20 18:07:47 -04:00
2024-09-09 10:47:24 +02:00
2024-07-16 09:32:01 -04:00
2023-06-20 18:07:47 -04:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-10-31 15:48:11 -04:00
2023-11-06 19:45:03 +00:00
2025-02-13 13:01:19 +01:00
2024-10-23 21:18:52 +01:00
2024-11-04 16:37:51 +01:00
2024-09-09 10:47:24 +02:00