Arthur
e314395277
Refactor flash attention implementation in transformers (#31446)
* dumb commit
* nit
* update
* something like this
* unpack in modeling utils
* safe import
* oups
* update
* nits
* diff convert gemma
* update
* start propagating
* update other modeling code as well
* update for sliding window models
* nits
* more init cleanups
* styling
* fixup
* noice
* pass fixup
* typo typing_extension -> typing_extensions
* torch.nn.functionnal -> torch.nn.functional
* add to import structure
* unpack
* simplify a bit more for this first version
* nut
* update
* update
* nit
* ease the import of `Unpack`
* remove useless `use_sliding_window`
* no qua please
* protect import?
* style
* [run-slow]
* [run slow] llama,gemma,mistral,mixtral
* remove extra kwargs
* fix llama
* address review comments
* apply diff_model_converter to modeling_gemma.py
* remove cache_position 1
* remove cache_position 2
* some cleaning
* refactor gemma2 as well
* apply review comments
* rename file to modeling_flash_attention_utils.py
* siglip refactor
* remove dead code
* is the hub down?
* still down?
* fix siglip
* fix gemma2
* fatal: Could not read from remote repository.
* fix typo in softcap implementation
* flaky
* Failed: Timeout >120.0s
---------
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>
2024-07-11 20:37:31 +08:00