Add Qwen2 GGUF loading support (#31175)

* add qwen2 gguf support

* Update docs

* fix qwen2 tokenizer

* add qwen2 gguf test

* fix typo in qwen2 gguf test

* format code

* Remove mistral, clarify the error message

* format code

* add typing and update docstring
This commit is contained in:
Isotr0py
2024-06-03 21:55:10 +08:00
committed by GitHub
parent df848acc5d
commit e4628434d8
6 changed files with 82 additions and 15 deletions

View File

@@ -63,6 +63,7 @@ For now the supported model architectures are the architectures that have been v
- LLaMa
- Mistral
- Qwen2
## Example usage