Files
HuggingFace_transformer/tests/models
Arthur 4a04b4ccca [Mixtral] Fix loss + nits (#28115)
* default config should not use sliding window

* update the doc

* nits

* add a proper test

* update

* update

* update expected value

* Update src/transformers/tokenization_utils_fast.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* convert to float

* average then N**2

* comment

* revert nit

* good to fo

* fixup

* Update tests/models/mixtral/test_modeling_mixtral.py

Co-authored-by: Lysandre Debut <hi@lysand.re>

* revert unrelated change

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Lysandre Debut <hi@lysand.re>
2023-12-19 17:31:54 +01:00
..
2023-12-18 15:26:03 +00:00
2023-11-28 08:38:32 +00:00
2023-11-28 17:40:01 +01:00
2023-10-24 16:49:26 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-11-16 17:43:19 +01:00
2023-06-16 15:40:49 +01:00
2023-12-18 15:26:03 +00:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-12-07 10:00:08 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-05-24 13:52:52 +01:00
2023-11-08 13:26:02 +00:00
2023-11-16 17:43:19 +01:00
2023-08-02 20:22:36 +02:00
2023-09-26 07:06:04 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-05 10:12:25 +02:00
2023-09-05 10:12:25 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-12-04 09:40:42 +01:00
2023-08-16 17:45:02 +01:00
2023-12-18 15:26:03 +00:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2022-05-03 14:42:02 +02:00