HuggingFace_transformer/docs/source/en
Arthur f5d45d89c4 🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288)
* Protect ParallelInterface

* early error out on output attention setting for no warning in modeling

* modular update

* fixup

* update model tests

* update

* oups

* set model's config

* more cases

* ??

* properly fix

* fixup

* update

* last ones

* update

* fix?

* fix wrong merge commit

* fix hub test

* nits

* wow I am tired

* updates

* fix pipeline!

---------

Co-authored-by: Lysandre <hi@lysand.re>
2025-05-23 17:17:38 +02:00
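
The early error named in the commit title can be illustrated with a minimal, self-contained sketch. It is not the code from this commit: `SketchConfig` and its attributes are hypothetical names, and the rule that only the `"eager"` attention implementation can return attention weights is an assumption made for the example.

```python
class SketchConfig:
    """Hypothetical config class used only to illustrate the early-error idea."""

    def __init__(self, attn_implementation="eager", output_attentions=False):
        self.attn_implementation = attn_implementation
        # This assignment goes through the property setter below, so an
        # unsupported combination fails as soon as the config is built.
        self.output_attentions = output_attentions

    @property
    def output_attentions(self):
        return self._output_attentions

    @output_attentions.setter
    def output_attentions(self, value):
        # Assumption for this sketch: only "eager" attention can return weights.
        if value and self.attn_implementation != "eager":
            raise ValueError(
                "output_attentions=True is not supported with "
                f"attn_implementation={self.attn_implementation!r}; use 'eager'."
            )
        self._output_attentions = bool(value)


# Supported combination: the flag is stored.
ok = SketchConfig(attn_implementation="eager", output_attentions=True)

# Unsupported combination: the error surfaces at configuration time,
# not later inside the model's forward pass.
try:
    SketchConfig(attn_implementation="sdpa", output_attentions=True)
except ValueError as err:
    message = str(err)
```

Validating in the setter means the check also runs when the flag is flipped on an existing config object, so the modeling code would not need its own warning for this case.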