Raushan Turganbay
7a25f8dfdb
[qwen2-vl] fix FA2 inference ( #39121 )
...
* fix FA2
* update is causal flag and remove mask for FA2
* update for FA2 with varlen path
* how the tests were passing with different devices?
* add comment and ref to the PR
* move mask preparation to base pretrained model
* seq len is the first dim, not second
* fix copies to fix GLM4V
2025-07-01 10:18:37 +00:00
..
2025-04-08 14:12:08 +02:00
2025-06-30 15:25:36 +02:00
2025-06-25 14:31:20 +00:00
2025-04-28 14:20:45 +01:00
2025-06-26 16:25:00 +01:00
2025-06-23 10:56:51 +02:00
2025-06-26 18:36:56 +02:00
2025-07-01 10:18:37 +00:00
2025-06-25 17:29:10 +00:00
2025-06-27 18:33:11 +02:00
2025-06-27 19:25:32 +01:00
2025-06-30 11:49:03 +02:00
2025-06-13 11:07:09 +00:00
2025-06-11 17:28:06 +01:00
2025-06-30 11:49:03 +02:00
2025-06-26 16:25:00 +01:00
2025-06-25 17:29:10 +00:00
2025-06-26 16:25:00 +01:00
2025-06-13 16:22:12 +01:00
2025-04-08 14:12:08 +02:00
2025-04-09 11:48:49 +02:00
2025-04-08 14:12:08 +02:00
2025-06-23 14:17:25 +00:00
2025-06-26 16:25:00 +01:00
2025-07-01 10:34:53 +02:00
2025-06-17 19:37:18 +01:00
2025-06-12 09:34:30 +00:00
2025-06-25 17:29:10 +00:00
2025-06-26 16:25:00 +01:00
2025-03-17 16:09:46 +01:00
2025-06-12 09:34:30 +00:00