Add gguf support for bloom (#33473)

* add bloom arch support for gguf

* apply format

* small refactoring, bug fix in GGUF_TENSOR_MAPPING naming

* optimize bloom GGUF_TENSOR_MAPPING

* implement reverse reshaping for bloom gguf

* add qkv weights test

* add q_8 test for bloom
This commit is contained in:
Vladislav Bronzov
2024-09-27 12:13:40 +02:00
committed by GitHub
parent 3e039d3827
commit 9d200cfbee
6 changed files with 140 additions and 8 deletions

View File

@@ -80,6 +80,7 @@ For now the supported model architectures are the architectures that have been v
- Qwen2
- Qwen2Moe
- Phi3
- Bloom
## Example usage