Cyril Vallez
dc11a3cbb2
[core] Refactor the Cache logic to make it simpler and more general (#39797)
* Simplify the logic quite a bit
* Update cache_utils.py
* continue work
* continue simplifying a lot
* style
* Update cache_utils.py
* offloading much simpler
* style
* Update cache_utils.py
* update inits
* Update cache_utils.py
* consistemncy
* Update cache_utils.py
* update generate
* style
* fix
* fix
* add early_initialization
* fix
* fix mamba caches
* update
* fix
* fix
* fix
* fix tests
* fix configs
* revert
* fix tests
* alright
* Update modeling_gptj.py
* fix the constructors
* cache tests
* Update test_cache_utils.py
* fix
* simplify
* back to before -> avoid compile bug
* doc
* mistral test
* llama4 test dtype
* Update test_modeling_llama4.py
* CIs
* Finally find a nice impl
* Update cache_utils.py
* Update cache_utils.py
* add lazy methods in autodoc
* typo
* better doc
* Add detailed docstring for lazy init
* CIs
* style
* fix
2025-08-08 14:47:21 +02:00
..
2025-08-08 14:47:21 +02:00
2025-07-29 08:04:14 -07:00
2025-07-31 09:14:14 -07:00
2025-07-09 09:29:51 -07:00
2025-06-17 19:37:18 +01:00
2024-04-08 14:21:16 +01:00
2025-08-06 10:12:34 +00:00
2023-06-20 18:07:47 -04:00
2025-07-29 17:05:26 +01:00
2025-04-30 12:15:43 +01:00
2024-04-16 15:34:04 +01:00
2025-08-05 08:20:13 -07:00
2024-08-07 11:25:19 -07:00
2024-02-16 08:16:58 +01:00
2025-04-30 12:15:43 +01:00
2024-08-27 09:25:41 -07:00
2025-07-25 14:10:04 +02:00
2023-09-27 13:47:44 -07:00
2025-04-30 12:15:43 +01:00
2025-07-22 09:00:42 -07:00
2023-06-20 18:07:47 -04:00
2024-08-08 09:40:03 -07:00
2025-07-31 09:14:14 -07:00
2025-04-14 14:16:07 +01:00
2025-07-07 09:12:55 -07:00
2025-07-29 08:09:16 -07:00
2024-10-02 14:08:46 +01:00
2023-06-20 18:07:47 -04:00
2025-06-13 11:07:09 +00:00
2024-11-26 09:37:18 -08:00
2025-08-08 14:47:21 +02:00
2025-07-31 09:14:14 -07:00
2023-11-20 17:14:23 +00:00
2025-06-13 11:07:09 +00:00
2025-07-25 14:10:04 +02:00
2024-10-08 18:30:41 -07:00
2024-02-06 11:15:44 -08:00
2025-06-13 11:07:09 +00:00
2024-02-16 08:16:58 +01:00
2023-07-25 16:04:14 +02:00
2025-07-21 09:14:15 -07:00
2025-06-13 11:07:09 +00:00
2024-02-16 08:16:58 +01:00
2024-02-16 08:16:58 +01:00
2025-06-13 11:07:09 +00:00
2025-07-29 08:08:57 -07:00
2024-11-04 09:41:44 -08:00
2024-02-16 08:16:58 +01:00
2023-11-06 19:45:03 +00:00
2025-07-29 08:04:30 -07:00
2024-02-16 08:16:58 +01:00
2024-02-16 08:16:58 +01:00
2023-08-17 08:03:17 +02:00
2024-12-17 09:32:00 -08:00
2025-07-25 14:10:04 +02:00
2024-02-16 08:16:58 +01:00
2025-05-01 08:44:12 -07:00
2025-06-25 17:29:10 +00:00
2024-02-16 08:16:58 +01:00
2025-06-13 11:07:09 +00:00
2024-02-16 08:16:58 +01:00
2025-06-13 12:03:49 +00:00
2024-04-18 12:49:43 -04:00
2024-02-16 08:16:58 +01:00