eustlb
6bdd4ec952
Release - Conda / build_and_package (push) Has been cancelled
Secret Leaks / trufflehog (push) Has been cancelled
Add kyutai stt (#38909)
* first draft
* cleaner version
* udpate tests + modeling
* add tests
* init
* udpate test_modeling_common
* fix tests
* csm Processor draft
* convertion update
* mimi cache padding convolutions draft
* mimi streaming udpates
* update mimi padding cache test
* udpate cache padding mimi test
* make style mimi
* updates generate moshi asr
* moshi asr integration tests (single + batched)
* update tests
* update conversion script
* good default sliding window value
* udpdate generate
* update test checkpoint
* nit
* fix mimi
* fix codec prefix
* revert
* revert
* update config
* update config
* unnecessary mimi input restriction
* remove delay in tokens
* remove _prepare_4d_causal_attention_mask_with_cache_position and _update_causal_mask
* test update
* modular update
* make style
* nit
* rename
* create codec model generation config at init
* remove delay
* max_new_tokens/length warning
* correct conv1 padding cache import for modular
* nit
* fix on encoder_past_key_values
* convert modular
* move frame_size to config
* move frame_size to config
* update test name
* handle first token is bos
* better handling of max_new_tokens
* fix
* fix batch size in test input prep
* update docstring
* convert modular
* make style
* make style
* add feature extractor
* correct modular convention name for feature_extraction file
* update convertion script
* doc processor
* update doc
* udpate init
* update model type
* fixes
* update tests
* fix
* make
* add doc
* nit
* fix
* doc
* auto mappings
* doc
* nit
* convert modular
* doc
* nit
* extend _keep_in_fp32_modules to enforce fp32
* renaming to stt
* doc update + test update
* doc fixes
* doc fix
* doc fix
* fix musicgen tests
* fix musicgen tests
* make style
* fix musicgen tests
* correct frame_rate config param for mimi
* update mimi test
* revert update mimi test
* enforce cpu test
* move cache init in cache class
* convert modular
* docstring update
* update model id
* feature_extractor -> feature_extraction (SEW)
* convert modular
* update model id
2025-06-24 18:01:15 +02:00
..
2025-05-12 11:55:51 +02:00
2021-02-15 07:55:10 -05:00
2024-05-22 06:40:15 +02:00
2025-05-24 19:15:02 +02:00
2025-03-21 13:08:47 +01:00
2025-05-29 11:08:23 +00:00
2025-05-06 06:47:43 +02:00
2025-06-18 14:38:08 +01:00
2025-06-17 19:37:18 +01:00
2025-06-23 10:39:41 -04:00
2024-05-22 06:40:15 +02:00
2025-06-17 19:37:18 +01:00
2025-06-17 19:37:18 +01:00
2023-03-13 19:11:19 +01:00
2025-06-19 15:22:59 +01:00
2025-06-13 13:44:07 +01:00
2025-06-17 19:37:18 +01:00
2025-06-13 17:37:46 +02:00
2021-02-15 07:55:10 -05:00
2025-03-06 13:12:30 +00:00
2024-08-27 11:58:27 +01:00
2025-06-17 19:37:18 +01:00
2025-06-17 19:37:18 +01:00
2025-03-25 16:00:11 +01:00
2025-05-09 11:45:03 +02:00
2024-04-15 15:08:09 +02:00
2025-04-02 14:39:57 +02:00
2024-01-31 15:58:17 +01:00
2025-03-25 16:00:11 +01:00
2023-02-03 12:57:02 -05:00
2025-06-04 11:38:25 +02:00
2025-06-20 16:10:35 +00:00
2024-08-27 11:58:27 +01:00
2024-04-12 10:01:28 +02:00
2024-05-22 06:40:15 +02:00
2025-06-24 18:01:15 +02:00
2025-06-13 12:02:27 -07:00
2025-06-17 19:37:18 +01:00
2025-06-17 19:37:18 +01:00
2025-03-25 16:00:11 +01:00
2025-05-30 11:19:42 +02:00
2025-06-17 19:37:18 +01:00
2025-06-23 10:56:51 +02:00
2025-05-24 19:15:02 +02:00
2025-03-25 16:00:11 +01:00
2024-09-03 16:53:21 +02:00
2025-03-11 13:47:38 +00:00
2024-06-10 15:16:58 +02:00
2025-06-13 12:02:27 -07:00
2024-05-22 06:40:15 +02:00
2025-03-13 15:12:44 +00:00
2024-04-24 22:32:42 +02:00
2025-06-17 19:37:18 +01:00
2025-06-17 19:37:18 +01:00
2024-07-22 14:14:47 +01:00