Add Qwen2Moe GGUF loading support (#33264)
* update gguf doc, config and tensor mapping * add qwen2moe architecture support, GGUFQwen2MoeConverter and q4 unit tests * apply code style fixes * reformat files * assign GGUFQwen2Converter to qwen2_moe
This commit is contained in:
committed by
GitHub
parent
132e87500e
commit
5d11de4a2f
@@ -78,6 +78,7 @@ For now the supported model architectures are the architectures that have been v
|
||||
- LLaMa
|
||||
- Mistral
|
||||
- Qwen2
|
||||
- Qwen2Moe
|
||||
|
||||
## Example usage
|
||||
|
||||
|
||||
Reference in New Issue
Block a user