Add gguf support for bloom (#33473)
* add bloom arch support for gguf * apply format * small refactoring, bug fix in GGUF_TENSOR_MAPPING naming * optimize bloom GGUF_TENSOR_MAPPING * implement reverse reshaping for bloom gguf * add qkv weights test * add q_8 test for bloom
This commit is contained in:
committed by
GitHub
parent
3e039d3827
commit
9d200cfbee
@@ -80,6 +80,7 @@ For now the supported model architectures are the architectures that have been v
|
||||
- Qwen2
|
||||
- Qwen2Moe
|
||||
- Phi3
|
||||
- Bloom
|
||||
|
||||
## Example usage
|
||||
|
||||
|
||||
Reference in New Issue
Block a user