Arthur
2da82e432d
Multiple llama4 fixes (#37353)
...
* update for fixes
* more fixes
* fix dynamic cache?
* style
* fix both training and generating. Eager seems alright
* dynamic does not work
* fix most cases: use_cache or not, eager or not, no default cache (e.g. not training but you want to get cache states)
* should be final fixes
* fix more stuff no cat
* style
* fix
* style
* final style
* quality
* fix
* revert
2025-04-08 11:14:49 +02:00