FEAT : Adding BitNet quantization method to HFQuantizer (#33410)

* rebasing changes * fixing style * adding some doc to functions * remove bitblas * change dtype * fixing check_code_quality * fixing import order * adding doc to tree * Small update on BitLinear * adding some tests * sorting imports * small update * reformatting * reformatting * reformatting with ruff * adding assert * changes after review * update disk offloading * adapting after review * Update after review * add is_serializable back * fixing style * adding serialization test * make style * small updates after review
2024-10-09 17:51:41 +02:00
parent 48461c0fe2
commit 36d410dab6
11 changed files with 745 additions and 1 deletions
--- a/docs/source/en/main_classes/quantization.md
+++ b/docs/source/en/main_classes/quantization.md
@@ -68,3 +68,7 @@ Learn how to quantize models in the [Quantization](../quantization) guide.
 ## TorchAoConfig

 [[autodoc]] TorchAoConfig
+
+## BitNetConfig
+
+[[autodoc]] BitNetConfig