Longjie Zheng
616bb11d48
Add torch.compile for Mistral (#30642)
* first version
* fix sliding window
* fix style
* add sliding window cache
* fix style
* address comments
* fix test
* fix style
* move sliding window check inside cache init
* revert changes on irrelevant files & add comment on SlidingWindowCache
* address comments & fix style
fix style
* update causal mask
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] llama
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] mistral
* revert CI from a10 to t4
* wrap up
2024-05-20 16:27:24 +02:00
..
2024-05-07 12:59:49 +02:00
2022-02-23 15:46:28 -05:00
2023-10-09 11:04:57 +02:00
2024-05-15 10:02:31 -04:00
2024-05-13 18:14:36 +02:00
2024-03-19 14:43:02 +00:00
2024-04-22 13:15:28 +01:00
2024-05-14 13:31:39 +05:00
2024-05-20 16:27:24 +02:00
2024-04-25 12:07:21 +01:00
2024-02-29 03:56:16 +01:00
2024-05-20 11:38:32 +02:00
2024-05-15 17:17:09 +02:00
2023-12-07 10:00:08 +01:00
2024-02-16 08:16:58 +01:00
2024-03-25 10:33:38 +01:00
2024-05-20 09:21:40 -04:00
2024-04-26 18:21:47 +01:00
2020-01-06 15:11:12 +01:00
2023-12-20 18:33:17 +00:00
2024-03-06 10:57:04 +00:00
2023-11-15 14:10:39 +01:00
2024-03-15 14:18:41 +00:00
2023-06-15 07:30:24 -04:00
2024-03-15 14:18:41 +00:00
2024-02-20 16:20:20 +01:00
2024-03-15 14:18:41 +00:00
2023-11-10 15:35:27 +00:00
2024-05-20 10:36:57 +02:00
2024-05-16 10:56:11 +01:00
2024-01-23 10:28:23 +01:00
2024-05-13 15:59:46 +01:00
2024-03-21 14:04:11 +00:00
2024-05-13 13:46:06 +02:00
2024-02-05 14:50:07 +00:00
2024-01-19 09:59:14 +00:00
2023-09-05 10:12:25 +02:00
2024-04-15 09:36:06 +01:00
2024-03-15 14:18:41 +00:00