Longjie Zheng
616bb11d48
Add torch.compile for Mistral (#30642)
* first version
* fix sliding window
* fix style
* add sliding window cache
* fix style
* address comments
* fix test
* fix style
* move sliding window check inside cache init
* revert changes on irrelevant files & add comment on SlidingWindowCache
* address comments & fix style
fix style
* update causal mask
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] llama
* [run-slow] mistral
* [run-slow] mistral
* [run-slow] mistral
* revert CI from a10 to t4
* wrap up
2024-05-20 16:27:24 +02:00
..
2024-04-29 10:57:51 +01:00
2024-05-20 16:27:24 +02:00
2024-05-10 09:29:26 -07:00
2024-04-08 14:21:16 +01:00
2024-04-16 11:58:55 +02:00
2024-05-01 15:47:05 +01:00
2024-05-09 22:57:52 +02:00
2024-05-16 14:32:21 +01:00
2024-04-23 16:06:20 +01:00
2024-05-01 15:47:05 +01:00
2024-04-08 14:21:16 +01:00
2023-11-08 08:35:20 -05:00
2024-05-07 12:59:49 +02:00
2024-04-08 14:21:16 +01:00