Files
HuggingFace_transformer/tests/models
Cyril Vallez 3f860dba55 Fix mask slicing for models with HybridCache (#35681)
* correctly slice

* check mask

* Update modular_gemma2.py

* fix

* add tests

* fix typo

* finally fix mask slicing

* Finally correctly slice in all cases!!

* add test for all attention functions

* small fix in tests

* trick around dynamo tracing issue

* last update

* more robust

* kwargs propagation

* make it explicit for checkpointing

* apply modular
2025-01-28 14:35:00 +01:00
..
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-12-18 16:53:39 +01:00
2024-06-26 21:59:08 +01:00
2025-01-17 12:10:43 +00:00
2025-01-22 09:41:04 +00:00
2024-12-20 14:36:31 +01:00
2025-01-13 18:41:15 +01:00
2024-06-26 21:59:08 +01:00
2025-01-20 11:15:39 +01:00
2024-06-26 21:59:08 +01:00
2024-10-07 10:56:24 +02:00
2024-06-26 21:59:08 +01:00
2025-01-23 11:23:00 +01:00
2025-01-20 10:32:39 +00:00
2024-11-29 11:58:11 +00:00
2024-06-26 21:59:08 +01:00
2025-01-08 16:02:14 +00:00
2025-01-27 10:51:23 +01:00
2022-05-03 14:42:02 +02:00