Yoach Lacombe
9ba021ea75
Moshi integration (#33624)
* clean mimi commit
* some nits suggestions from Arthur
* make fixup
* first moshi WIP
* converting weights working + configuration + generation configuration
* finalize converting script - still missing tokenizer and FE and processor
* fix saving model w/o default config
* working generation
* use GenerationMixin instead of inheriting
* add delay pattern mask
* fix right order: moshi codes then user codes
* unconditional inputs + generation config
* get rid of MoshiGenerationConfig
* blank user inputs
* update convert script:fix conversion, add tokenizer, feature extractor and bf16
* add and correct Auto classes
* update modeling code, configuration and tests
* make fixup
* fix some copies
* WIP: add integration tests
* add dummy objects
* propose better readiblity and code organisation
* update tokenization tests
* update docstrigns, eval and modeling
* add .md
* make fixup
* add MoshiForConditionalGeneration to ignore Auto
* revert mimi changes
* re
* further fix
* Update moshi.md
* correct md formating
* move prepare causal mask to class
* fix copies
* fix depth decoder causal
* fix and correct some tests
* make style and update .md
* correct config checkpoitn
* Update tests/models/moshi/test_tokenization_moshi.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update tests/models/moshi/test_tokenization_moshi.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* make style
* Update src/transformers/models/moshi/__init__.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fixup
* change firm in copyrights
* udpate config with nested dict
* replace einsum
* make style
* change split to True
* add back splt=False
* remove tests in convert
* Update tests/models/moshi/test_modeling_moshi.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* add default config repo + add model to FA2 docstrings
* remove logits float
* fix some tokenization tests and ignore some others
* make style tokenization tests
* update modeling with sliding window + update modeling tests
* [run-slow] moshi
* remove prepare for generation frol CausalLM
* isort
* remove copied from
* ignore offload tests
* update causal mask and prepare 4D mask aligned with recent changes
* further test refine + add back prepare_inputs_for_generation for depth decoder
* correct conditional use of prepare mask
* update slow integration tests
* fix multi-device forward
* remove previous solution to device_map
* save_load is flaky
* fix generate multi-devices
* fix device
* move tensor to int
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Marc Sun <marc@huggingface.co>
2024-10-16 11:21:49 +02:00
..
2024-10-07 18:11:13 +02:00
2024-09-24 21:28:19 -04:00
2024-10-09 16:46:11 +01:00
2024-10-09 16:46:11 +01:00
2024-09-23 18:28:36 +01:00
2024-06-07 19:40:26 +01:00
2024-06-26 21:59:08 +01:00
2024-08-23 11:12:53 +01:00
2024-06-26 21:59:08 +01:00
2024-03-13 14:53:27 +01:00
2024-10-07 18:11:13 +02:00
2024-10-07 18:11:13 +02:00
2024-03-13 14:53:27 +01:00
2024-03-13 14:53:27 +01:00
2024-03-13 14:53:27 +01:00
2024-10-07 18:11:13 +02:00
2024-10-09 16:46:11 +01:00
2024-08-16 11:20:41 +01:00
2024-06-19 10:18:08 +01:00
2024-08-23 11:12:53 +01:00
2024-10-09 16:46:11 +01:00
2024-10-02 16:43:43 +02:00
2024-10-10 16:32:08 +01:00
2024-09-19 12:04:24 +02:00
2024-10-02 16:43:43 +02:00
2024-10-09 16:46:11 +01:00
2024-06-26 21:59:08 +01:00
2024-08-28 10:26:00 +02:00
2024-06-26 21:59:08 +01:00
2024-10-03 14:01:24 +01:00
2024-09-25 19:05:01 +01:00
2024-07-23 14:54:38 +08:00
2024-09-25 19:05:01 +01:00
2024-09-25 19:05:01 +01:00
2024-09-26 19:38:20 +02:00
2024-06-26 21:59:08 +01:00
2024-09-24 16:40:48 +01:00
2024-10-03 14:01:24 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-08-06 11:33:05 +01:00
2024-06-26 21:59:08 +01:00
2024-10-09 16:46:11 +01:00
2024-06-26 21:59:08 +01:00
2024-10-09 16:46:11 +01:00
2024-06-19 10:18:08 +01:00
2024-10-11 14:43:03 +02:00
2024-07-23 14:54:38 +08:00
2024-07-29 10:52:13 +01:00
2024-07-26 10:33:02 +02:00
2024-07-26 10:33:02 +02:00
2024-08-23 11:12:53 +01:00
2024-06-26 21:59:08 +01:00
2024-08-06 11:33:05 +01:00
2024-08-13 16:16:30 +02:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-09-21 01:58:00 +01:00
2024-10-07 18:11:13 +02:00
2023-06-29 10:17:36 +01:00
2024-10-02 16:43:43 +02:00
2024-03-25 10:33:38 +01:00
2024-08-06 11:33:05 +01:00
2024-08-27 11:58:27 +01:00
2024-10-07 18:11:13 +02:00
2024-10-03 14:01:24 +01:00
2024-09-25 12:21:53 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-10-09 16:46:11 +01:00
2024-09-24 16:40:48 +01:00
2024-06-26 21:59:08 +01:00
2024-10-09 16:46:11 +01:00
2024-09-09 13:20:27 +01:00
2024-10-09 16:46:11 +01:00
2024-06-19 10:18:08 +01:00
2024-06-26 21:59:08 +01:00
2024-05-22 06:40:15 +02:00
2024-10-02 16:43:43 +02:00
2024-10-11 10:16:31 +02:00
2024-10-11 10:16:31 +02:00
2024-09-25 19:05:01 +01:00
2024-08-06 11:33:05 +01:00
2024-07-24 17:36:32 +01:00
2024-09-24 16:40:48 +01:00
2024-09-24 16:40:48 +01:00
2024-09-09 10:59:04 +02:00
2024-09-09 10:59:04 +02:00
2024-07-24 17:36:32 +01:00
2024-10-09 16:46:11 +01:00
2024-10-11 11:51:49 +01:00
2024-10-11 11:51:49 +01:00
2024-09-20 10:16:43 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-09-05 16:17:34 +02:00
2024-07-23 14:54:38 +08:00
2024-07-25 15:12:23 +02:00
2024-10-16 09:25:26 +02:00
2024-10-16 09:25:26 +02:00
2024-10-16 09:25:26 +02:00
2024-10-11 15:24:01 +02:00
2024-09-25 12:21:53 +01:00
2024-09-24 21:28:19 -04:00
2024-09-24 21:28:19 -04:00
2024-10-05 16:03:12 +02:00
2024-10-11 11:51:49 +01:00
2024-10-16 09:25:26 +02:00
2024-06-26 21:59:08 +01:00
2024-09-19 19:28:04 +01:00
2024-10-09 16:46:11 +01:00
2024-09-13 11:58:38 +01:00
2024-10-09 16:46:11 +01:00
2024-08-06 11:33:05 +01:00
2024-10-09 16:46:11 +01:00
2024-10-11 11:51:49 +01:00
2024-10-15 16:19:18 +02:00
2024-09-24 21:28:19 -04:00
2024-09-19 12:04:24 +02:00
2024-10-02 16:43:43 +02:00
2024-10-09 16:46:11 +01:00
2024-07-16 09:32:01 -04:00
2024-10-09 16:46:11 +01:00
2024-07-26 10:33:02 +02:00
2024-10-09 16:46:11 +01:00
2024-10-01 09:28:40 +02:00
2024-09-20 11:40:57 +01:00
2024-08-23 11:12:53 +01:00
2024-10-09 16:46:11 +01:00
2024-09-10 11:19:56 +01:00
2024-06-26 21:59:08 +01:00
2024-10-09 16:46:11 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-09-09 13:20:27 +01:00
2024-10-16 11:21:49 +02:00
2024-10-14 08:53:32 +02:00
2024-10-14 08:53:32 +02:00
2024-10-09 14:02:56 +02:00
2024-08-05 15:19:42 +01:00
2024-07-26 10:33:02 +02:00
2024-08-06 11:33:05 +01:00
2024-08-06 11:33:05 +01:00
2024-08-23 11:12:53 +01:00
2024-06-07 19:40:26 +01:00
2024-10-16 11:21:49 +02:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-10-09 11:23:50 +01:00
2024-10-09 16:46:11 +01:00
2024-10-03 14:01:24 +01:00
2024-10-03 14:01:24 +01:00
2024-10-09 16:46:11 +01:00
2024-10-07 10:56:24 +02:00
2024-10-04 19:00:45 +02:00
2024-06-26 21:59:08 +01:00
2024-10-09 16:46:11 +01:00
2024-08-06 11:33:05 +01:00
2024-05-22 06:40:15 +02:00
2024-10-11 11:51:49 +01:00
2024-10-11 11:51:49 +01:00
2024-10-01 23:52:03 +02:00
2024-10-09 16:46:11 +01:00
2024-10-09 16:46:11 +01:00
2024-10-10 11:49:34 +02:00
2024-08-06 11:33:05 +01:00
2024-09-09 13:20:27 +01:00
2024-10-09 12:15:48 +01:00
2024-06-26 21:59:08 +01:00
2024-06-07 19:40:26 +01:00
2024-08-23 11:12:53 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-10-11 11:51:49 +01:00
2024-10-09 16:46:11 +01:00
2024-10-10 11:50:39 +02:00
2024-10-04 22:45:37 +02:00
2024-03-13 14:53:27 +01:00
2024-10-01 23:52:03 +02:00
2024-10-01 23:52:03 +02:00
2024-10-09 16:46:11 +01:00
2024-08-06 11:33:05 +01:00
2024-07-23 14:54:38 +08:00
2024-10-09 16:46:11 +01:00
2024-08-06 11:33:05 +01:00
2024-06-26 21:59:08 +01:00
2024-10-14 08:53:32 +02:00
2024-08-08 15:47:24 +02:00
2024-10-14 08:53:32 +02:00
2024-10-02 16:43:43 +02:00
2024-09-19 11:55:26 +01:00
2024-10-09 16:46:11 +01:00
2024-10-10 14:44:05 +02:00
2024-10-07 18:11:13 +02:00
2024-06-21 01:48:10 -07:00
2024-10-07 18:11:13 +02:00
2024-10-07 18:11:13 +02:00
2024-10-07 18:11:13 +02:00
2024-10-09 16:46:11 +01:00
2024-10-09 16:46:11 +01:00
2024-08-06 11:33:05 +01:00
2024-09-24 16:40:48 +01:00
2024-10-09 16:46:11 +01:00
2024-10-03 14:01:24 +01:00
2024-10-03 14:01:24 +01:00
2024-06-26 21:59:08 +01:00
2024-06-11 15:47:38 +01:00
2024-09-17 08:05:21 +02:00
2024-09-17 08:05:21 +02:00
2024-08-06 11:33:05 +01:00
2024-09-25 12:21:53 +01:00
2024-10-03 14:01:24 +01:00
2024-10-03 14:01:24 +01:00
2024-10-09 16:46:11 +01:00
2024-07-26 10:33:02 +02:00
2024-10-11 11:51:49 +01:00
2024-10-09 16:46:11 +01:00
2024-06-19 10:18:08 +01:00
2024-06-07 19:40:26 +01:00
2024-06-19 10:18:08 +01:00
2024-08-06 11:33:05 +01:00
2024-06-26 18:46:48 +01:00
2024-08-06 11:48:32 +01:00
2024-10-09 16:46:11 +01:00
2024-06-26 21:59:08 +01:00
2024-10-09 16:46:11 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-08-06 11:33:05 +01:00
2024-09-23 12:47:32 -04:00
2024-10-09 16:46:11 +01:00
2024-07-23 14:54:38 +08:00
2024-07-23 14:54:38 +08:00
2024-10-03 14:01:24 +01:00
2024-06-19 10:18:08 +01:00
2024-10-11 15:24:01 +02:00
2024-08-06 11:33:05 +01:00
2024-08-06 11:33:05 +01:00
2024-10-15 16:19:18 +02:00
2024-09-25 12:21:53 +01:00
2024-09-25 12:21:53 +01:00
2024-05-22 06:40:15 +02:00
2024-10-07 18:11:13 +02:00
2024-09-30 18:47:13 +02:00
2024-06-07 19:40:26 +01:00
2024-06-26 21:59:08 +01:00
2024-08-06 11:33:05 +01:00
2024-10-03 14:01:24 +01:00
2024-10-15 11:27:54 +02:00
2024-10-02 10:41:08 +02:00
2024-10-02 10:41:08 +02:00
2024-07-23 14:54:38 +08:00
2024-09-26 19:38:20 +02:00
2024-06-17 17:29:13 +01:00
2024-07-23 14:54:38 +08:00
2024-10-09 16:46:11 +01:00
2024-09-25 19:05:01 +01:00
2024-09-24 16:40:48 +01:00
2024-10-09 16:46:11 +01:00
2024-08-28 10:26:00 +02:00
2024-10-09 16:46:11 +01:00
2024-10-09 16:46:11 +01:00
2024-10-09 16:46:11 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-10-04 22:28:05 +02:00
2024-08-06 11:33:05 +01:00
2022-05-03 14:42:02 +02:00