Add support for GGUF Phi-3 (#31844)

* Update docs for GGUF supported models
* Add tensor mappings and define class GGUFPhi3Converter
* Fix tokenizer
* Working version
* Attempt to fix some CI failures
* Run ruff format
* Add vocab, merges, decoder methods like LlamaConverter
* Resolve conflicts since Qwen2Moe was added to gguf
  - I missed one place when resolving the conflict
  - I also made a mistake with tests_ggml.py, which has now been fixed to reflect its master version
2024-09-10 04:32:38 -07:00
parent 8e8e7d8558
commit 96429e74a8
5 changed files with 122 additions and 1 deletions
--- a/docs/source/en/gguf.md
+++ b/docs/source/en/gguf.md
@@ -79,6 +79,7 @@ For now the supported model architectures are the architectures that have been v
 - Mistral
 - Qwen2
 - Qwen2Moe
+- Phi3

 ## Example usage