Poedator
7c62e69326
GPT2Model StaticCache support (#35761)
* initial GPT2 changes
* causal_mask support
* return_legacy_cache
* cleanup
* fix1
* outputs shape fixes
* gpt2 return fix
* pkv, attn fixes
* fix dual_head
* is_causal arg fix
* decision transformer updated
* style fix
* batch_size from inputs_embeds
* DecisionTransformerModel fixes
* cross-attn support + cache warning
* x-attn @decision
* EDCache proper init
* simplified logic in `if use_cache:` for GPT2Model
* @deprecate_kwarg for DecisionTr attn fwd
* @deprecate_kwarg in gpt2
* deprecation version updated to 4.51
* kwargs in gradient_checkpointing_fn
* rename next_cache to past_key_values
* attention_mask prep
* +cache_position in GPT2DoubleHeadsModel
* undo kwargs in gradient checkpointing
* moved up `if self.gradient_checkpointing`
* consistency in decision_transformer
* pastkv, cache_pos in grad_checkpt args
* rm _reorder_cache
* output_attentions streamlined
* decision_transformer consistency
* return_legacy_cache improved
* ClvpForCausalLM used for legacy cache test now
* is_causal fixed
* attn_output cleanup
* consistency @ decision_transformer
* Updated deprecation notice version to 4.52
* upd deprecation
* consistent legacy cache code in decision transformers
* next_cache -> past_kv in decision_tr
* cache support flags in decision_transf
* rm legacy cache warning
* consistency in cache init for decision transf
* no Static Cache for Decision Transformer
---------
Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>
2025-04-24 14:46:35 +02:00
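The entries above keep returning to three ideas: a preallocated ("static") cache, writes addressed by `cache_position`, and conversion back to the legacy tuple format (`return_legacy_cache`). A toy sketch of how those pieces fit together might look like the following. It is not the code from this commit and not the transformers `StaticCache` class: `ToyStaticCache` and its methods are hypothetical names, and plain Python lists stand in for tensors.

```python
# Illustrative sketch only: a toy static key/value cache with hypothetical names.
from dataclasses import dataclass, field


@dataclass
class ToyStaticCache:
    """Preallocated per-layer key/value slots with a fixed maximum length."""

    num_layers: int
    max_len: int
    keys: list = field(init=False)
    values: list = field(init=False)

    def __post_init__(self):
        # Buffers are allocated once; their size never changes during decoding.
        self.keys = [[None] * self.max_len for _ in range(self.num_layers)]
        self.values = [[None] * self.max_len for _ in range(self.num_layers)]

    def update(self, layer, cache_position, new_keys, new_values):
        """Write new entries at the given positions and return the full buffers."""
        for pos, k, v in zip(cache_position, new_keys, new_values):
            if not 0 <= pos < self.max_len:
                raise IndexError(f"position {pos} is outside max_len {self.max_len}")
            self.keys[layer][pos] = k
            self.values[layer][pos] = v
        return self.keys[layer], self.values[layer]

    def seq_length(self, layer=0):
        """Count the filled slots in one layer."""
        return sum(k is not None for k in self.keys[layer])

    def to_legacy(self):
        """Tuple-of-tuples view that keeps only the filled slots."""
        return tuple(
            (
                tuple(k for k in self.keys[i] if k is not None),
                tuple(v for v in self.values[i] if v is not None),
            )
            for i in range(self.num_layers)
        )


cache = ToyStaticCache(num_layers=2, max_len=8)
# Prefill: three prompt tokens written at positions 0..2 of layer 0.
cache.update(0, cache_position=[0, 1, 2], new_keys=["k0", "k1", "k2"], new_values=["v0", "v1", "v2"])
# One decode step: a single new token written at position 3.
cache.update(0, cache_position=[3], new_keys=["k3"], new_values=["v3"])
```

Because the buffers keep a fixed size, the position of each write has to be passed in explicitly, which is presumably why `cache_position` is threaded through the forward passes listed above.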