Add GGUF for Mamba (#34200)

* add mamba architecture for gguf

* add logic for weights conversion, some fixes and refactoring

* add lm_head layers, unit test refactoring

* more fixes for tests

* remove lm_head creation

* remove unused comments
Vladislav Bronzov
2024-10-30 16:52:17 +01:00
committed by GitHub
parent eab6c491d4
commit 5251fe6271
4 changed files with 93 additions and 2 deletions

@@ -86,6 +86,7 @@ For now the supported model architectures are the architectures that have been v
- GPT2
- Starcoder2
- T5
- Mamba
## Example usage