Files
HuggingFace_transformer/tests/quantization
Vladislav Bronzov 9d200cfbee Add gguf support for bloom (#33473)
* add bloom arch support for gguf

* apply format

* small refactoring, bug fix in GGUF_TENSOR_MAPPING naming

* optimize bloom GGUF_TENSOR_MAPPING

* implement reverse reshaping for bloom gguf

* add qkv weights test

* add q_8 test for bloom
2024-09-27 12:13:40 +02:00
..
2024-06-26 21:59:08 +01:00
2024-09-27 12:13:40 +02:00