Poedator
a0779b9e19
Llama: fix custom 4D masks, v2 (#30348)
* 4d mask fixes
* Update custom 4D mask logic
* test moved to mixin
* extra tests 4d mask
* upd 4d mask and StaticCache handling
* added Mask4DTestHard to mistral tests
* post-rebase fixes
* test fixes for StaticCache
* make fix-copies
* upd 1 after #30476
* fix common tests
* rm elif attention_mask.dim() == 4:
* tests combined, fixed, mixtral supported
* bigbird style chg reverted
* rm if attention_mask.dim() == 2
* modeling_llama formatting chg
---------
Co-authored-by: Joao Gante <joao@huggingface.co>
2024-05-13 13:46:06 +02:00
..
2024-05-07 12:59:49 +02:00
2022-02-23 15:46:28 -05:00
2023-10-09 11:04:57 +02:00
2024-04-18 12:49:43 -04:00
2024-04-18 12:49:43 -04:00
2024-03-19 14:43:02 +00:00
2024-04-22 13:15:28 +01:00
2024-05-09 18:01:57 +01:00
2024-05-13 13:46:06 +02:00
2024-04-25 12:07:21 +01:00
2024-02-29 03:56:16 +01:00
2024-05-07 10:17:27 +01:00
2024-05-13 11:41:03 +02:00
2023-12-07 10:00:08 +01:00
2024-02-16 08:16:58 +01:00
2024-03-25 10:33:38 +01:00
2024-05-08 17:54:49 +01:00
2024-04-26 18:21:47 +01:00
2023-12-20 18:33:17 +00:00
2024-03-06 10:57:04 +00:00
2023-11-15 14:10:39 +01:00
2024-03-15 14:18:41 +00:00
2023-06-15 07:30:24 -04:00
2024-03-15 14:18:41 +00:00
2024-02-20 16:20:20 +01:00
2024-03-15 14:18:41 +00:00
2023-11-10 15:35:27 +00:00
2024-05-13 13:46:06 +02:00
2024-04-15 09:36:06 +01:00
2024-01-23 10:28:23 +01:00
2024-01-30 17:26:36 +00:00
2024-03-21 14:04:11 +00:00
2024-05-13 13:46:06 +02:00
2024-02-05 14:50:07 +00:00
2024-01-19 09:59:14 +00:00
2023-09-05 10:12:25 +02:00
2024-04-15 09:36:06 +01:00
2024-03-15 14:18:41 +00:00