Add Qwen2 GGUF loading support (#31175)

* add qwen2 gguf support * Update docs * fix qwen2 tokenizer * add qwen2 gguf test * fix typo in qwen2 gguf test * format code * Remove mistral, clarify the error message * format code * add typing and update docstring
2024-06-03 21:55:10 +08:00
parent df848acc5d
commit e4628434d8
6 changed files with 82 additions and 15 deletions
--- a/docs/source/en/gguf.md
+++ b/docs/source/en/gguf.md
@@ -63,6 +63,7 @@ For now the supported model architectures are the architectures that have been v

 - LLaMa
 - Mistral
+- Qwen2

 ## Example usage