HuggingFace_transformer/docs/source/en/model_doc
eustlb af2d7caff3 Add Moonshine (#34784)
* config draft

* full encoder forward

* full decoder forward

* fix sdpa and FA2

* fix sdpa and FA2

* moonshine model

* moonshine model forward

* fix attention with past_key_values

* add MoonshineForConditionalGeneration

* fix cache handling and causality for cross attention

* no causal attention mask for the encoder

* model addition (imports etc)

* small nit

* nits

* Update src/transformers/models/moonshine/convert_usefulsensors_to_hf.py

Co-authored-by: Joshua Lochner <admin@xenova.com>

* add rope_theta

* nits

* model doc

* Update src/transformers/models/auto/configuration_auto.py

Co-authored-by: Joshua Lochner <admin@xenova.com>

* imports

* add MODEL_FOR_SPEECH_SEQ_2_SEQ_MAPPING_NAMES

* updates modular

* make

* make fix-copies

* ruff check examples fix

* fix check_modular_conversion

* nit

* nits

* nits

* copied from -> imports

* imports fix

* integrate attention refacto

* modular edge case

* remove encoder

* convolutions params in config

* run modular_model_converter

* make

* Update docs/source/en/model_doc/moonshine.md

Co-authored-by: Joshua Lochner <admin@xenova.com>

* MoonshineModelTest

* correct typo

* make style

* integration tests

* make

* modular convert

* name conversion update (up_proj -> fc1 etc)

* update config

* update MLP

* update attention

* update encoder layer

* update decoder layer

* update convolutions parameters

* update encoder

* remove INPUTS_DOCSTRING

* update decoder

* update conditional generation

* update pretrained model

* imports

* modular converted

* update doc

* fix

* typo

* update doc

* update license

* update init

* split config in file

* two classes for MLP

* attention from GLM

* from GlmRotaryEmbedding

* split MLP

* apply arthur's review suggestions

* apply arthur's review suggestions

* apply arthur's review suggestions

* auto feature extractor

* convert modular

* fix + make

* convert modular

* make

* unsplit config

* use correct checkpoint

* wrap generate

* update tests

* typos

* make

* typo

* update doc

---------

Co-authored-by: Joshua Lochner <admin@xenova.com>
2025-01-10 11:03:36 +01:00