Cyril Vallez
dc11a3cbb2
[core] Refactor the Cache logic to make it simpler and more general (#39797)
* Simplify the logic quite a bit
* Update cache_utils.py
* continue work
* continue simplifying a lot
* style
* Update cache_utils.py
* offloading much simpler
* style
* Update cache_utils.py
* update inits
* Update cache_utils.py
* consistemncy
* Update cache_utils.py
* update generate
* style
* fix
* fix
* add early_initialization
* fix
* fix mamba caches
* update
* fix
* fix
* fix
* fix tests
* fix configs
* revert
* fix tests
* alright
* Update modeling_gptj.py
* fix the constructors
* cache tests
* Update test_cache_utils.py
* fix
* simplify
* back to before -> avoid compile bug
* doc
* mistral test
* llama4 test dtype
* Update test_modeling_llama4.py
* CIs
* Finally find a nice impl
* Update cache_utils.py
* Update cache_utils.py
* add lazy methods in autodoc
* typo
* better doc
* Add detailed docstring for lazy init
* CIs
* style
* fix
2025-08-08 14:47:21 +02:00
..
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2025-08-08 14:47:21 +02:00
2023-06-20 18:07:47 -04:00
2025-06-09 14:07:31 +00:00
2025-07-16 12:45:46 +02:00
2025-05-22 11:38:26 +02:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-12-04 10:04:28 -08:00