Files
HuggingFace_transformer/tests/quantization
Vladislav Bronzov 5d11de4a2f Add Qwen2Moe GGUF loading support (#33264)
* update gguf doc, config and tensor mapping

* add qwen2moe architecture support, GGUFQwen2MoeConverter and q4 unit tests

* apply code style fixes

* reformat files

* assign GGUFQwen2Converter to qwen2_moe
2024-09-05 17:42:03 +02:00
..
2024-06-26 21:59:08 +01:00
2024-07-22 20:21:59 +02:00