Add Gemma2 GGUF support (#34002)

* initial setup for ggml.py

* initial setup of GGUFGemma2Converter class

* Add gemma2 model to gguf.md doc

* Partial work on GGUF_TENSOR_MAPPING

* initial setup of GGUF_TENSOR_MAPPING for Gemma2

* refactor: rename GemmaConvert class to GemmaConverter for naming consistency

* feat: complete gemma2 tensor mapping implementation

* feat: add initial implementation of GGUFGemmaConverter

* feat: complete GGUFGemmaConverter implementation

* feat: add test code for gemma2

* refactor: minor code cleanup

* refactor: minor code cleanup

* fix: resolve suggestions

* Update tests/quantization/ggml/test_ggml.py

Co-authored-by: Isotr0py <2037008807@qq.com>

---------

Co-authored-by: Isotr0py <2037008807@qq.com>
This commit is contained in:
Yijun Lee
2025-01-03 22:50:07 +09:00
committed by GitHub
parent 1fe2d53d4e
commit e5fd865eba
5 changed files with 180 additions and 3 deletions

View File

@@ -88,6 +88,7 @@ For now the supported model architectures are the architectures that have been v
- T5
- Mamba
- Nemotron
- Gemma2
## Example usage