eustlb
af2d7caff3
Add Moonshine (#34784)
* config draft
* full encoder forward
* full decoder forward
* fix sdpa and FA2
* fix sdpa and FA2
* moonshine model
* moonshine model forward
* fix attention with past_key_values
* add MoonshineForConditionalGeneration
* fix cache handling and causality for cross attention
* no causal attention mask for the encoder
* model addition (imports etc)
* small nit
* nits
* Update src/transformers/models/moonshine/convert_usefulsensors_to_hf.py
Co-authored-by: Joshua Lochner <admin@xenova.com>
* add rope_theta
* nits
* model doc
* Update src/transformers/models/auto/configuration_auto.py
Co-authored-by: Joshua Lochner <admin@xenova.com>
* imports
* add MODEL_FOR_SPEECH_SEQ_2_SEQ_MAPPING_NAMES
* updates modular
* make
* make fix-copies
* ruff check examples fix
* fix check_modular_conversion
* nit
* nits
* nits
* copied from -> imports
* imports fix
* integrate attention refacto
* modular edge case
* remove encoder
* convolutions params in config
* run modular_model_converter
* make
* Update docs/source/en/model_doc/moonshine.md
Co-authored-by: Joshua Lochner <admin@xenova.com>
* MoonshineModelTest
* correct typo
* make style
* integration tests
* make
* modular convert
* name conversion update (up_proj -> fc1 etc)
* update config
* update MLP
* update attention
* update encoder layer
* update decoder layer
* update convolutions parameters
* update encoder
* remove INPUTS_DOCSTRING
* update decoder
* update conditional generation
* update pretrained model
* imports
* modular converted
* update doc
* fix
* typo
* update doc
* update license
* update init
* split config in file
* two classes for MLP
* attention from GLM
* from GlmRotaryEmbedding
* split MLP
* apply arthur's review suggestions
* apply arthur's review suggestions
* apply arthur's review suggestions
* auto feature extractor
* convert modular
* fix + make
* convert modular
* make
* unsplit config
* use correct checkpoint
* wrap generate
* update tests
* typos
* make
* typo
* update doc
---------
Co-authored-by: Joshua Lochner <admin@xenova.com>
2025-01-10 11:03:36 +01:00
..
2024-12-20 12:08:12 +01:00
2024-12-23 16:54:49 +01:00
2025-01-10 11:03:36 +01:00
2024-12-23 16:54:49 +01:00
2025-01-09 11:20:36 +01:00
2024-11-28 16:04:05 +01:00
2024-05-28 18:29:22 +02:00
2025-01-10 11:03:36 +01:00
2024-09-09 10:47:24 +02:00
2024-09-27 17:15:13 +02:00
2024-12-20 09:22:44 -08:00
2024-12-02 15:26:34 +00:00
2024-12-02 15:26:34 +00:00
2024-02-08 14:13:35 -08:00
2024-12-04 09:18:44 -08:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-04-01 18:47:32 -07:00
2024-12-12 16:53:21 -08:00
2024-09-09 10:47:24 +02:00
2024-07-23 17:47:51 +01:00
2024-06-06 22:02:38 +01:00
2024-08-26 13:15:43 +02:00
2024-09-09 10:47:24 +02:00
2025-01-09 09:50:51 -08:00
2023-06-20 18:07:47 -04:00
2025-01-02 11:29:46 +01:00
2025-01-08 17:08:02 +00:00
2025-01-03 14:50:07 +01:00
2024-07-08 11:52:47 +01:00
2024-10-07 10:08:20 +02:00
2024-10-02 14:08:46 +01:00
2025-01-10 11:03:36 +01:00
2024-11-26 09:37:18 -08:00
2024-12-04 07:48:34 -08:00
2025-01-06 08:54:31 -08:00
2024-11-27 07:47:28 -08:00
2025-01-08 13:36:30 +00:00
2024-09-24 03:40:56 -06:00
2024-11-11 07:09:31 -08:00
2024-03-23 18:29:39 -07:00
2024-12-20 09:16:02 -08:00
2024-02-16 08:16:58 +01:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-12-04 09:18:44 -08:00
2024-12-17 09:33:50 -08:00
2025-01-10 11:03:36 +01:00
2024-12-03 10:53:45 -08:00
2024-11-26 09:23:44 -08:00
2024-11-26 09:23:44 -08:00
2024-12-02 15:26:34 +00:00
2024-09-25 19:20:51 +02:00
2024-02-16 08:16:58 +01:00
2024-09-09 10:47:24 +02:00
2024-11-18 19:51:49 +01:00
2024-11-26 09:23:34 -08:00
2023-11-06 19:45:03 +00:00
2024-12-04 09:18:44 -08:00
2024-02-16 08:16:58 +01:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-12-04 09:18:44 -08:00
2024-09-12 10:16:12 -07:00
2024-09-09 10:47:24 +02:00
2024-09-09 10:47:24 +02:00
2024-04-16 11:58:55 +02:00
2024-11-18 18:42:28 +00:00
2024-11-18 09:59:11 -08:00
2024-07-29 10:50:43 +01:00
2024-02-16 08:16:58 +01:00
2024-11-25 18:44:09 +01:00
2024-06-03 16:52:23 -07:00
2024-09-09 10:47:24 +02:00
2024-11-18 18:41:12 +00:00
2024-12-04 09:18:44 -08:00
2024-02-16 08:16:58 +01:00