Cyril Vallez
dc11a3cbb2
[core] Refactor the Cache logic to make it simpler and more general (#39797)
* Simplify the logic quite a bit
* Update cache_utils.py
* continue work
* continue simplifying a lot
* style
* Update cache_utils.py
* offloading much simpler
* style
* Update cache_utils.py
* update inits
* Update cache_utils.py
* consistency
* Update cache_utils.py
* update generate
* style
* fix
* fix
* add early_initialization
* fix
* fix mamba caches
* update
* fix
* fix
* fix
* fix tests
* fix configs
* revert
* fix tests
* alright
* Update modeling_gptj.py
* fix the constructors
* cache tests
* Update test_cache_utils.py
* fix
* simplify
* back to before -> avoid compile bug
* doc
* mistral test
* llama4 test dtype
* Update test_modeling_llama4.py
* CIs
* Finally find a nice impl
* Update cache_utils.py
* Update cache_utils.py
* add lazy methods in autodoc
* typo
* better doc
* Add detailed docstring for lazy init
* CIs
* style
* fix
2025-08-08 14:47:21 +02:00