Add gguf support for StableLM (#33793)

* add stablelm gguf architecture support

* add additional quantization tests

* resolve merge conflict, add weight conversion tests for fp16
This commit is contained in:
Vladislav Bronzov
2024-10-09 12:16:13 +02:00
committed by GitHub
parent e783f12f20
commit faa0f63b93
4 changed files with 107 additions and 5 deletions

View File

@@ -82,6 +82,7 @@ For now the supported model architectures are the architectures that have been v
- Phi3
- Bloom
- Falcon
- StableLM
## Example usage