Add gguf support for StableLM (#33793)

* add stablelm gguf architecture support * add additional quantization tests * resolve merge conflict, add weight conversion tests for fp16
2024-10-09 12:16:13 +02:00
parent e783f12f20
commit faa0f63b93
4 changed files with 107 additions and 5 deletions
--- a/docs/source/en/gguf.md
+++ b/docs/source/en/gguf.md
@@ -82,6 +82,7 @@ For now the supported model architectures are the architectures that have been v
 - Phi3
 - Bloom
 - Falcon
+- StableLM

 ## Example usage