Files
HuggingFace_transformer/docs/source/en
Niklas Muennighoff ecd61c6286 Add OLMoE (#32406)
* Add OLMoE

* Add OLMoE

* Updates

* Make norm optional; add keys

* Add output

* Add

* Fix dtype

* Fix eos config

* Update

* Add OLMoE

* Fix OLMoE path

* Format

* Format

* Rmv copy statement

* Rmv copy statement

* Format

* Add copies

* Cp rotary

* Fix aming

* Fix naming

* Update RoPE integration; num_logits_to_keep; Add copy statements

* Add eps to config

* Format

* Add aux loss

* Adapt router_aux_loss_coef

* Update md

* Adapt

* adapt tests
2024-09-03 18:43:12 +02:00
..
2024-09-03 18:43:12 +02:00
2024-09-03 18:43:12 +02:00
2023-09-04 11:15:12 +01:00
2024-09-02 09:56:20 +02:00
2022-04-04 10:25:46 -04:00
2024-07-08 11:52:47 +01:00
2023-12-20 10:37:23 -08:00
2024-07-08 11:52:47 +01:00
2023-11-13 14:20:54 +01:00
2024-09-03 18:43:12 +02:00
2022-04-04 10:25:46 -04:00
2024-07-08 11:52:47 +01:00
2024-09-03 18:43:12 +02:00
2024-06-12 11:33:00 +01:00