Files
HuggingFace_transformer/tests/models
bytebarde be3fd8a262 [Flash Attention 2] Add flash attention 2 for GPT-J (#28295)
* initial implementation of flash attention for gptj

* modify flash attention and overwrite test_flash_attn_2_generate_padding_right

* update flash attention support list

* remove the copy line in the `CodeGenBlock`

* address copy mechanism

* Update src/transformers/models/gptj/modeling_gptj.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Add GPTJ attention classes

* add expected outputs in the gptj test

* Ensure repo consistency with 'make fix-copies'

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-03-13 08:43:00 +01:00
..
2023-10-24 16:49:26 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-06-16 15:40:49 +01:00
2023-08-02 20:22:36 +02:00
2024-01-11 16:18:27 +01:00
2023-08-02 20:22:36 +02:00
2024-01-31 14:19:02 +01:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-26 07:06:04 +02:00
2023-08-02 20:22:36 +02:00
2024-01-11 16:18:27 +01:00
2023-12-21 15:14:46 +00:00
2023-08-02 20:22:36 +02:00
2024-02-26 18:17:19 +00:00
2023-09-05 10:12:25 +02:00
2023-09-05 10:12:25 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2024-01-11 16:18:27 +01:00
2023-08-16 17:45:02 +01:00
2024-01-11 16:18:27 +01:00
2024-01-11 16:18:27 +01:00
2023-08-02 20:22:36 +02:00
2022-05-03 14:42:02 +02:00