Mohamed Mekkouri
efe72fe21f
Adding FP8 Quantization to transformers (#36026)
* first commit
* adding kernels
* fix create_quantized_param
* fix quantization logic
* end2end
* fix style
* fix imports
* fix consistency
* update
* fix style
* update
* udpate after review
* make style
* update
* update
* fix
* update
* fix docstring
* update
* update after review
* update
* fix scheme
* update
* update
* fix
* update
* fix docstring
* add source
* fix test
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
2025-02-13 13:01:19 +01:00
..
2024-12-20 12:08:12 +01:00
2025-02-13 13:01:19 +01:00
2025-02-13 12:20:53 +01:00
2025-02-13 13:01:19 +01:00
2025-02-13 12:01:28 +01:00
2024-11-28 16:04:05 +01:00
2024-05-28 18:29:22 +02:00
2025-02-13 13:01:19 +01:00
2024-09-09 10:47:24 +02:00
2024-09-27 17:15:13 +02:00
2024-12-20 09:22:44 -08:00
2025-01-26 15:26:38 -08:00
2024-12-02 15:26:34 +00:00
2024-02-08 14:13:35 -08:00
2024-12-04 09:18:44 -08:00
2024-09-09 10:47:24 +02:00
2024-04-01 18:47:32 -07:00
2025-01-26 15:26:38 -08:00
2024-09-09 10:47:24 +02:00
2022-04-04 10:25:46 -04:00
2024-07-23 17:47:51 +01:00
2024-06-06 22:02:38 +01:00
2024-08-26 13:15:43 +02:00
2025-02-05 08:19:31 -08:00
2025-02-05 08:19:31 -08:00
2023-06-20 18:07:47 -04:00
2025-01-02 11:29:46 +01:00
2025-01-26 15:26:38 -08:00
2025-01-03 14:50:07 +01:00
2024-07-08 11:52:47 +01:00
2025-02-12 12:45:11 +01:00
2024-10-02 14:08:46 +01:00
2025-02-10 11:32:45 +00:00
2025-01-27 08:49:28 -08:00
2025-02-05 08:22:33 -08:00
2025-01-06 08:54:31 -08:00
2024-11-27 07:47:28 -08:00
2025-02-05 08:20:02 -08:00
2024-09-24 03:40:56 -06:00
2024-11-11 07:09:31 -08:00
2024-03-23 18:29:39 -07:00
2025-01-21 17:53:30 +01:00
2024-02-16 08:16:58 +01:00
2022-04-04 10:25:46 -04:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-12-04 09:18:44 -08:00
2025-02-12 15:53:27 +01:00
2025-02-10 11:32:45 +00:00
2024-12-03 10:53:45 -08:00
2024-11-26 09:23:44 -08:00
2024-11-26 09:23:44 -08:00
2025-02-05 08:19:31 -08:00
2025-02-04 11:01:49 +01:00
2024-02-16 08:16:58 +01:00
2024-09-09 10:47:24 +02:00
2024-11-18 19:51:49 +01:00
2024-11-26 09:23:34 -08:00
2023-11-06 19:45:03 +00:00
2024-12-04 09:18:44 -08:00
2024-02-16 08:16:58 +01:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2025-01-22 14:32:27 +00:00
2024-09-12 10:16:12 -07:00
2024-09-09 10:47:24 +02:00
2025-02-07 12:41:52 -08:00
2025-01-26 15:26:38 -08:00
2024-11-18 18:42:28 +00:00
2024-11-18 09:59:11 -08:00
2024-07-29 10:50:43 +01:00
2024-02-16 08:16:58 +01:00
2024-11-25 18:44:09 +01:00
2024-06-03 16:52:23 -07:00
2024-09-09 10:47:24 +02:00
2025-02-12 15:33:43 +01:00
2024-12-04 09:18:44 -08:00
2024-02-16 08:16:58 +01:00