HuggingFace_transformer/tests/models
Raushan Turganbay c8524aeb07 [cache] make all classes cache compatible finally (#38635)
* dump

* push other models

* fix simple greedy generation

* xmod

* add fsmt and clean up some mentions of old cache format

* gpt-bigcode now follows standards

* delete tuple cache reference in generation

* fix some models

* fix some models

* fix mambas and support cache in tapas

* fix some more tests

* fix copies

* delete `_reorder_cache`

* another fix copies

* fix typos and delete unnecessary test

* fix rag generate, needs special cache reordering

* fix tapas and superglue

* reformer create special cache

* recurrent gemma `reorder_cache` was a no-op, delete

* fix-copies

* fix blip and musicgen pipeline tests

* fix reformer

* fix reformer, again...

* delete `_supports_cache_class`

* delete `supports_quantized_cache`

* fix failing tests

* fix copies

* some minor clean up

* style

* style

* fix copies

* fix tests

* fix copies

* create causal mask now needs positions?

* fix copies

* style

* Update tests/test_modeling_common.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* clean-up of non-generative model after merging main

* check `is_decoder` for cache

* delete transpose for scores

* remove tuple cache from docs everywhere

* fix tests

* fix copies

* fix copies once more

* properly deprecate `encoder_attention_mask` in Bert-like models

* import `deprecate_kwarg` where needed

* fix copies again

* fix copies

* delete `next_decoder_cache`

* fix copies asks to update for PLM

* fix copies

* rebasing had a few new models, fix them and merge asap!

* fix copies once more

* fix slow tests

* fix tests and update PLM checkpoint

* add read token and revert accidentally removed line

* oh come on, style

* just skip it, read token has no access to PLM yet

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2025-07-16 14:00:17 +02:00