Mohamed Mekkouri
b262680af4
Release - Conda / build_and_package (push) Has been cancelled
Secret Leaks / trufflehog (push) Has been cancelled
Add Bitnet model (#37742)
* Adding BitNet b1.58 Model
* Add testing code for BitNet
* Fix format issues
* Fix docstring format issues
* Fix docstring
* Fix docstring
* Fix: weight back to uint8
* Fix
* Fix format issues
* Remove copy comments
* Add model link to the docstring
* Fix: set tie_word_embeddings default to false
* Update
* Generate modeling file
* Change config name for automatically generating modeling file.
* Generate modeling file
* Fix class name
* Change testing branch
* Remove unused param
* Fix config docstring
* Add docstring for BitNetQuantConfig.
* Fix docstring
* Update docs/source/en/model_doc/bitnet.md
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/model_doc/bitnet.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update bitnet config
* Update explanation between online and offline mode
* Remove space
* revert changes
* more revert
* spaces
* update
* fix-copies
* doc fix
* fix minor nits
* empty
* small nit
* empty
---------
Co-authored-by: Shuming Ma <shumingma@pku.edu.cn>
Co-authored-by: shumingma <shmingm@gmail.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
2025-04-28 15:08:46 +02:00
..
2025-04-22 11:33:31 +01:00
2025-04-28 15:08:46 +02:00
2025-04-28 15:08:46 +02:00
2025-04-24 18:19:38 +02:00
2025-04-23 13:31:33 -04:00
2024-11-28 16:04:05 +01:00
2024-05-28 18:29:22 +02:00
2025-04-28 15:08:46 +02:00
2025-03-03 10:33:46 -08:00
2025-03-13 14:16:37 -04:00
2025-03-03 10:33:46 -08:00
2025-04-11 18:42:37 +01:00
2025-03-28 18:00:35 +01:00
2024-02-08 14:13:35 -08:00
2025-03-03 10:33:46 -08:00
2025-03-24 14:08:29 +00:00
2025-03-07 13:09:02 +00:00
2025-04-10 14:42:32 +02:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2024-09-09 10:47:24 +02:00
2022-04-04 10:25:46 -04:00
2025-03-03 10:33:46 -08:00
2025-03-24 14:08:29 +00:00
2025-03-03 10:33:46 -08:00
2025-03-04 13:47:41 +00:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2025-03-04 13:47:41 +00:00
2025-04-18 12:50:17 -07:00
2025-03-11 15:29:14 +01:00
2024-07-08 11:52:47 +01:00
2025-03-24 14:08:29 +00:00
2025-03-11 09:41:41 -07:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2025-03-31 09:50:49 +02:00
2025-04-07 15:19:47 +02:00
2025-04-21 09:01:11 -07:00
2025-04-03 14:15:53 +01:00
2025-03-11 09:41:41 -07:00
2025-03-04 13:47:41 +00:00
2024-09-24 03:40:56 -06:00
2025-03-03 10:33:46 -08:00
2024-03-23 18:29:39 -07:00
2025-03-03 10:33:46 -08:00
2025-03-18 14:00:54 -04:00
2022-04-04 10:25:46 -04:00
2025-03-03 10:33:46 -08:00
2024-09-09 10:47:24 +02:00
2025-03-03 10:33:46 -08:00
2025-03-04 13:47:41 +00:00
2025-03-03 10:33:46 -08:00
2025-03-31 10:55:47 +02:00
2025-04-18 18:57:33 +02:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2025-04-17 14:54:44 +01:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2024-11-26 09:23:34 -08:00
2023-11-06 19:45:03 +00:00
2025-03-03 10:33:46 -08:00
2025-03-04 13:47:41 +00:00
2025-04-15 08:35:05 -07:00
2024-09-09 10:47:24 +02:00
2025-03-25 11:34:21 -07:00
2025-03-03 10:33:46 -08:00
2025-03-11 13:47:38 +00:00
2025-03-03 10:33:46 -08:00
2025-03-10 13:14:19 -07:00
2025-01-26 15:26:38 -08:00
2024-11-18 18:42:28 +00:00
2025-03-04 13:47:41 +00:00
2025-03-03 10:33:46 -08:00
2025-03-03 10:33:46 -08:00
2024-06-03 16:52:23 -07:00
2025-04-11 18:42:37 +01:00
2025-03-03 10:33:46 -08:00
2025-04-10 17:44:09 +02:00
2025-03-03 10:33:46 -08:00
2024-02-16 08:16:58 +01:00