fxmarty
1da1302ec8
Flash Attention 2 support for ROCm (#27611)
* support FA2
* fix typo
* fix broken tests
* fix more test errors
* left/right
* fix bug
* more test
* typo
* fix layout flash attention falcon
* do not support this case
* use allclose instead of equal
* fix various bugs with flash attention
* bump
* fix test
* fix mistral
* use skiptest instead of return that may be misleading
* add fix causal arg flash attention
* fix copies
* more explicit comment
* still use self.is_causal
* fix causal argument
* comment
* fixes
* update documentation
* add link
* wrong test
* simplify FA2 ROCm requirements
* update opt
* make flash_attn_uses_top_left_mask attribute private and precise comment
* better error handling
* fix copy & mistral
* Update src/transformers/modeling_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/modeling_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/modeling_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/utils/import_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* use is_flash_attn_greater_or_equal_2_10 instead of is_flash_attn_greater_or_equal_210
* fix merge
* simplify
* inline args
---------
Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-12-04 21:52:17 +09:00