HuggingFace_transformer/docs/source/en
fxmarty 1da1302ec8 Flash Attention 2 support for RoCm (#27611)
* support FA2

* fix typo

* fix broken tests

* fix more test errors

* left/right

* fix bug

* more test

* typo

* fix layout flash attention falcon

* do not support this case

* use allclose instead of equal

* fix various bugs with flash attention

* bump

* fix test

* fix mistral

* use skiptest instead of return that may be misleading

* add fix causal arg flash attention

* fix copies

* more explicit comment

* still use self.is_causal

* fix causal argument

* comment

* fixes

* update documentation

* add link

* wrong test

* simplify FA2 RoCm requirements

* update opt

* make flash_attn_uses_top_left_mask attribute private and precise comment

* better error handling

* fix copy & mistral

* Update src/transformers/modeling_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/modeling_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/modeling_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/utils/import_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* use is_flash_attn_greater_or_equal_2_10 instead of is_flash_attn_greater_or_equal_210

* fix merge

* simplify

* inline args

---------

Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-12-04 21:52:17 +09:00