Add support for GGUF Phi-3 (#31844)

* Update docs for GGUF supported models

* Add tensor mappings and define class GGUFPhi3Converter

* Fix tokenizer

* Working version

* Attempt to fix some CI failures

* Run ruff format

* Add vocab, merges, decoder methods like LlamaConverter

* Resolve conflicts since Qwen2Moe was added to gguf

- I missed one place when resolving conflict
- I also made a mistake with tests_ggml.py and now has been fixed to reflect
its master version.
This commit is contained in:
Alazar
2024-09-10 04:32:38 -07:00
committed by GitHub
parent 8e8e7d8558
commit 96429e74a8
5 changed files with 122 additions and 1 deletions

View File

@@ -79,6 +79,7 @@ For now the supported model architectures are the architectures that have been v
- Mistral
- Qwen2
- Qwen2Moe
- Phi3
## Example usage