Mohamed Mekkouri
b262680af4
Release - Conda / build_and_package (push) Has been cancelled
Secret Leaks / trufflehog (push) Has been cancelled
Add Bitnet model (#37742)
* Adding BitNet b1.58 Model
* Add testing code for BitNet
* Fix format issues
* Fix docstring format issues
* Fix docstring
* Fix docstring
* Fix: weight back to uint8
* Fix
* Fix format issues
* Remove copy comments
* Add model link to the docstring
* Fix: set tie_word_embeddings default to false
* Update
* Generate modeling file
* Change config name for automatically generating modeling file.
* Generate modeling file
* Fix class name
* Change testing branch
* Remove unused param
* Fix config docstring
* Add docstring for BitNetQuantConfig.
* Fix docstring
* Update docs/source/en/model_doc/bitnet.md
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/model_doc/bitnet.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update bitnet config
* Update explanation between online and offline mode
* Remove space
* revert changes
* more revert
* spaces
* update
* fix-copies
* doc fix
* fix minor nits
* empty
* small nit
* empty
---------
Co-authored-by: Shuming Ma <shumingma@pku.edu.cn>
Co-authored-by: shumingma <shmingm@gmail.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
2025-04-28 15:08:46 +02:00
..
2024-07-17 20:24:10 +01:00
2025-03-06 17:35:30 +01:00
2023-06-20 18:07:47 -04:00
2025-02-13 12:01:28 +01:00
2025-02-05 08:19:31 -08:00
2024-10-11 14:38:35 +02:00
2023-10-16 09:52:29 +02:00
2024-12-15 14:00:36 -05:00
2023-06-20 18:07:47 -04:00
2024-09-09 10:47:24 +02:00
2024-07-16 09:32:01 -04:00
2023-06-20 18:07:47 -04:00
2025-03-19 18:29:40 +00:00
2024-09-09 10:47:24 +02:00
2025-03-03 10:33:46 -08:00
2024-10-31 15:48:11 -04:00
2023-11-06 19:45:03 +00:00
2025-04-28 15:08:46 +02:00
2024-10-23 21:18:52 +01:00
2024-11-04 16:37:51 +01:00
2024-09-09 10:47:24 +02:00