Files
HuggingFace_transformer/docs/source/en/model_doc
Sanchit Gandhi a9701953ff [whisper] static kv cache (#31166)
* make work with cache abstraction

* correct for static cache

* hacks for compile

* make fast

* fix

* fix pos ids

* generate

* fix sdpa

* fix sdpa cache pos

* fix fa2

* clean fa2

* integrate cache into generate

* make style

* copies

* more copies

* update eager

* update sdpa

* update fa2

* simplify

* use cache pos

* always compute cross-cache for debug

* avoid recompiles
Co-authored-by: Arthur Zucker <arthur@huggingface.co>

* fix fix

* fix fix fix

* more fix

* try encoder-decoder cache (too messy)

* revert encoder-decoder cache

* check cross-attn cache

* use enc-dec dataclass

* use richer enc-dec dataclass

* clean-up

* revert static cache changes

* small fixes

* revert to cpu flag

* fix copies

* add static slow test

* past k/v docstring

* more docstrings

* cache_position docstrings

* add to docs

* add enc-dec cache to docs

* make style

* fix after rebase

* fix beam

* style

* fix generation strategies

* fix most decoder-only tests

* style

* skip test

* more clean up

* small docstrings

* Apply suggestions from code review

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* add todo

* only crop self-attn

* check cache in mixin

* style

* fix re-compile after rebase

* move `is_updated` logic to enc-dec wrapper

* revert back

* revert cache back

* finalise design

* fix

* fix fix

* style

* Update src/transformers/cache_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* deprecate

* updates

* final updates

* style

* style

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-07-02 13:24:15 +01:00
..
2024-04-26 16:23:44 +01:00
2024-06-04 18:29:45 +02:00
2023-11-06 19:45:03 +00:00
2023-11-10 13:49:10 +00:00
2024-03-15 14:29:11 +01:00
2024-05-28 18:07:07 +01:00
2023-11-23 17:44:08 +00:00
2023-11-23 17:44:08 +00:00
2024-05-28 18:07:07 +01:00
2024-03-12 10:16:21 +00:00
2024-06-27 17:36:19 +02:00
2024-06-19 09:40:57 +02:00
2024-05-21 19:38:02 +02:00
2024-05-13 15:59:46 +01:00
2024-04-18 11:04:02 +02:00
2024-05-14 16:32:01 +02:00
2024-05-28 18:07:07 +01:00
2023-10-30 21:42:19 +01:00
2023-12-20 14:25:07 +05:30
2024-06-27 10:32:51 -07:00
2024-05-28 18:07:07 +01:00
2024-05-28 18:07:07 +01:00
2024-05-28 18:07:07 +01:00
2024-04-17 17:59:07 +02:00
2024-05-14 08:36:11 +02:00
2024-02-19 15:22:29 +01:00
2024-02-19 15:22:29 +01:00
2024-03-13 19:05:20 +00:00
2024-05-28 18:07:07 +01:00
2024-05-28 18:07:07 +01:00
2023-11-06 19:45:03 +00:00
2023-07-13 11:46:54 -04:00
2024-05-31 12:34:29 +02:00
2024-04-19 18:31:43 +01:00
2023-11-06 19:45:03 +00:00
2024-05-28 18:07:07 +01:00
2023-11-23 17:02:16 +00:00
2023-12-15 20:16:47 +01:00
2024-06-11 15:47:38 +01:00
2024-07-02 13:24:15 +01:00
2024-05-28 18:07:07 +01:00