Cyril Vallez
dc11a3cbb2
[core] Refactor the Cache logic to make it simpler and more general (#39797)
* Simplify the logic quite a bit
* Update cache_utils.py
* continue work
* continue simplifying a lot
* style
* Update cache_utils.py
* offloading much simpler
* style
* Update cache_utils.py
* update inits
* Update cache_utils.py
* consistency
* Update cache_utils.py
* update generate
* style
* fix
* fix
* add early_initialization
* fix
* fix mamba caches
* update
* fix
* fix
* fix
* fix tests
* fix configs
* revert
* fix tests
* alright
* Update modeling_gptj.py
* fix the constructors
* cache tests
* Update test_cache_utils.py
* fix
* simplify
* back to before -> avoid compile bug
* doc
* mistral test
* llama4 test dtype
* Update test_modeling_llama4.py
* CIs
* Finally find a nice impl
* Update cache_utils.py
* Update cache_utils.py
* add lazy methods in autodoc
* typo
* better doc
* Add detailed docstring for lazy init
* CIs
* style
* fix
2025-08-08 14:47:21 +02:00