FEAT : Adding VPTQ quantization method to HFQuantizer (#34770)

* init vptq

* add integration

* add vptq support

fix readme

* add tests && format

* format

* address comments

* format

* format

* address comments

* format

* address comments

* remove debug code

* Revert "remove debug code"

This reverts commit ed3b3eaaba82caf58cb3aa6e865d98e49650cf66.

* fix test

---------

Co-authored-by: Yang Wang <wyatuestc@gmail.com>
This commit is contained in:
wejoncy
2024-12-20 16:45:53 +08:00
committed by GitHub
parent 5a2aedca1e
commit 4e27a4009d
21 changed files with 647 additions and 3 deletions

View File

@@ -157,6 +157,8 @@
# title: AWQ
# - local: quantization/aqlm
# title: AQLM
# - local: quantization/vptq
# title: VPTQ
# - local: quantization/quanto
# title: Quanto
# - local: quantization/eetq