HuggingFace_transformer/tests/quantization
Penut Chen 1c122a46dc Support dequantizing GGUF FP16 format (#31783)
* support gguf fp16

* support gguf bf16 with pytorch

* add gguf f16 test

* remove bf16
2024-07-24 17:59:59 +02:00