eustlb
967045082f
Add voxtral (#39429)
* draft
* draft update (conversion working)
* mend
* draft update
* draft update: working generate
* refactor
* VoxtralProcessor draft
* processor update
* update convert_tekken_tokenizer
* refactor processor
* update convert
* make style
* better handle prefil
* make style
* add tests
* add mistral_common audio loading
* processor update
* revert changes
* audio utils update
* add audio to apply chat template mistral update
* voxtral processor update
* fix
* udpate converstion script
* make mistral tokenier from pretrain work from local dir
* fix udpates
* add integration tests
* add batched version
* processor docstring
* make style
* revert convert_tekken_tokenizer changes
* revert processing_qwen2.5 changes
* add multi-turn test
* processor improvements
* address review changes
* Update src/transformers/tokenization_mistral_common.py
Co-authored-by: Julien Denize <40604584+juliendenize@users.noreply.github.com>
* update audio utils
* nits
* integration test update
* correct _support
* update tests
* test update
* update integration tests
* fix
* fix
* fix
* add test_apply_chat_template_with_audio
* add model doc
* model doc
* nit
* doc uptade
* nit
* processor improvement
* ensure default is 3B
* nits
* make
* make
* convert modular
* update checkpoint
* fix test
* make
* make
* autos
* make
* make
* nit
* nit
* nit
---------
Co-authored-by: Julien Denize <40604584+juliendenize@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2025-07-18 00:02:04 +00:00
..
2025-06-17 19:37:18 +01:00
2025-06-25 17:29:10 +00:00
2025-07-18 00:02:04 +00:00
2025-06-17 19:37:18 +01:00
2025-06-13 11:07:09 +00:00
2024-11-04 09:40:30 -08:00
2025-07-08 10:20:52 +02:00
2025-06-25 17:29:10 +00:00
2025-07-09 09:29:51 -07:00
2025-06-13 11:07:09 +00:00
2025-06-17 19:37:18 +01:00
2024-12-17 09:32:00 -08:00
2023-11-08 08:35:20 -05:00
2025-06-17 19:37:18 +01:00
2024-04-08 14:21:16 +01:00