Cyril Vallez
4ded9a4113
🚨🚨 Fix and simplify attention implementation dispatch and subconfigs handling (#39423)
* first try
* Update modeling_utils.py
* Update modeling_utils.py
* big refactor
* Update modeling_utils.py
* style
* docstrings and simplify inner workings of configs
* remove all trace of _internal
* Update modeling_utils.py
* fix logic error
* Update modeling_utils.py
* recursive on config
* Update configuration_utils.py
* fix
* Update configuration_dpt.py
* Update configuration_utils.py
* Update configuration_utils.py
* Update modeling_idefics.py
* Update modeling_utils.py
* fix for old models
* more old models fixup
* Update modeling_utils.py
* Update configuration_utils.py
* Remove outdated test
* remove the deepcopy!! 🥵🥵
* Update test_modeling_gpt_bigcode.py
* fix qwen dispatch
* restrict to only models supporting it
* style
* switch name
* Update modeling_utils.py
* Update modeling_utils.py
* add tests!
* fix
* rypo
* remove bad copies
* fix
* Update modeling_utils.py
* additional check
* Update modeling_utils.py
* Update modeling_utils.py
* Update modeling_utils.py
* Update modeling_utils.py
* Update modeling_utils.py
* fix
* skip
2025-07-18 13:41:54 +02:00