Arthur
07884817e4
[BC 4.37 -> 4.38] for Llama family, memory and speed ( #29753 )
...
* attempt to fix
* the actual fix that works with compilation!
* this?
* temporary update
* nit?
* dispatcg to memory efficient?
* update both models that have static cache support
* fix copies fix compile
* make sure fix
* fix cohere and gemma
* fix beams?
* nit
* slipped through the cracks
* nit
* nits
* update
* fix-copies
* skip failing tests
* nits
2024-03-20 18:47:26 -04:00
..
2022-02-23 15:46:28 -05:00
2023-10-09 11:04:57 +02:00
2024-02-16 17:18:45 +05:30
2024-03-13 17:44:35 +00:00
2024-03-20 10:46:45 -04:00
2024-03-13 22:03:02 +05:30
2024-03-20 10:46:59 -04:00
2024-03-20 18:47:26 -04:00
2023-03-02 12:08:43 -05:00
2024-02-29 03:56:16 +01:00
2024-03-08 11:11:10 +00:00
2024-03-15 11:51:29 -04:00
2023-12-07 10:00:08 +01:00
2024-02-16 08:16:58 +01:00
2024-02-16 08:16:58 +01:00
2023-06-26 09:58:14 -04:00
2024-03-19 11:40:23 +01:00
2024-03-18 13:06:12 +00:00
2023-12-20 18:33:17 +00:00
2024-03-06 10:57:04 +00:00
2023-11-15 14:10:39 +01:00
2024-03-15 14:18:41 +00:00
2023-06-15 07:30:24 -04:00
2024-03-15 14:18:41 +00:00
2024-02-20 16:20:20 +01:00
2024-03-15 14:18:41 +00:00
2023-11-10 15:35:27 +00:00
2024-03-20 10:48:07 -04:00
2024-01-26 18:20:39 +00:00
2024-01-23 10:28:23 +01:00
2024-01-30 17:26:36 +00:00
2024-03-15 14:18:41 +00:00
2024-03-20 10:46:59 -04:00
2024-02-05 14:50:07 +00:00
2024-01-19 09:59:14 +00:00
2023-09-05 10:12:25 +02:00
2024-03-19 15:13:56 +01:00
2024-03-15 14:18:41 +00:00