Add Gemma2 GGUF support (#34002)

* initial setup for ggml.py

* initial setup of GGUFGemma2Converter class

* Add gemma2 model to gguf.md doc

* Partial work on GGUF_TENSOR_MAPPING

* initial setup of GGUF_TENSOR_MAPPING for Gemma2

* refactor: rename GemmaConvert class to GemmaConverter for naming consistency

* feat: complete gemma2 tensor mapping implementation

* feat: add initial implementation of GGUFGemmaConverter

* feat: complete GGUFGemmaConverter implementation

* feat: add test code for gemma2

* refactor: minor code cleanup

* refactor: minor code cleanup

* fix: resolve suggestions

* Update tests/quantization/ggml/test_ggml.py

Co-authored-by: Isotr0py <2037008807@qq.com>

---------

Co-authored-by: Isotr0py <2037008807@qq.com>

This commit is contained in:

Yijun Lee

2025-01-03 22:50:07 +09:00

committed by

GitHub

parent 1fe2d53d4e

commit e5fd865eba

5 changed files with 180 additions and 3 deletions

									
										1

docs/source/en/gguf.md
									
												View File
												
				@@ -88,6 +88,7 @@ For now the supported model architectures are the architectures that have been v

				- T5

				- Mamba

				- Nemotron

				- Gemma2

				## Example usage

Add Gemma2 GGUF support (#34002)

1 docs/source/en/gguf.md Unescape Escape View File

1

docs/source/en/gguf.md

View File