HuggingFace_transformer/tests/models
Anton Vlasjuk badc71b9f6 🔴[Attention] Attention refactor for Whisper-based models (#38235)
* start refactoring whisper

* revert for now

* first step

* carry over attn fixes

* check if this works

* whisper has an off by one somewhere - cutting mask in any interface

* make it based on interface

* remove some tests that were skipped but now work

* some fixes for whisper tests

* interface changes

* change the order of fix

* some attention adjustments for eager + TP

* fix scaling

* mask changes

* why does whisper contain those extra seq lens?

* fix from config for fa2 as input_ids is invalid

* fix another test

* another fix

* disable flex attn due to compile issues

* copies and refactor for qwen audio since it somewhat relies on whisper

* fix scaling and smaller things

* retrigger

* new new interface version + more fixups

* adjust qwen

* add comment

* forgot this one

* change copies as whisper cuts on the mask

* add guard

* add flex attention

* switch to new mask function + add skips for torchscript

* remove old api with cache position

* last changes?

* trigger ci
2025-05-28 13:32:38 +02:00