Add Nemotron GGUF Loading Support (#34725)

* Add Nemotron GGUF Loading Support

* Fix the Nemotron architecture assignment

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
farrosalferro
2024-11-21 19:37:34 +09:00
committed by GitHub
parent d4e1acbb7c
commit c57eafdaa1
3 changed files with 68 additions and 0 deletions

@@ -87,6 +87,7 @@ For now the supported model architectures are the architectures that have been v
- Starcoder2
- T5
- Mamba
- Nemotron

## Example usage