fxmarty
1da1302ec8
Flash Attention 2 support for RoCm (#27611)
* support FA2
* fix typo
* fix broken tests
* fix more test errors
* left/right
* fix bug
* more test
* typo
* fix layout flash attention falcon
* do not support this case
* use allclose instead of equal
* fix various bugs with flash attention
* bump
* fix test
* fix mistral
* use skiptest instead of return that may be misleading
* add fix causal arg flash attention
* fix copies
* more explicit comment
* still use self.is_causal
* fix causal argument
* comment
* fixes
* update documentation
* add link
* wrong test
* simplify FA2 RoCm requirements
* update opt
* make flash_attn_uses_top_left_mask attribute private and precise comment
* better error handling
* fix copy & mistral
* Update src/transformers/modeling_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/modeling_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/modeling_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/utils/import_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* use is_flash_attn_greater_or_equal_2_10 instead of is_flash_attn_greater_or_equal_210
* fix merge
* simplify
* inline args
---------
Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-12-04 21:52:17 +09:00
..
2023-08-25 11:56:17 +01:00
2023-11-28 08:41:47 -08:00
2023-12-04 21:52:17 +09:00
2023-11-30 20:24:43 +01:00
2023-11-16 17:43:19 +01:00
2023-11-23 15:58:21 +00:00
2023-11-30 20:24:43 +01:00
2023-09-04 11:15:12 +01:00
2023-09-25 12:58:11 +02:00
2023-09-19 00:41:16 +02:00
2023-09-18 13:33:01 +02:00
2023-06-20 18:07:47 -04:00
2023-09-04 11:16:49 +01:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-09-04 11:15:12 +01:00
2023-11-27 18:40:10 +00:00
2023-09-04 11:16:34 +01:00
2022-04-04 10:25:46 -04:00
2023-11-06 19:45:03 +00:00
2023-11-27 09:38:18 +01:00
2023-09-05 12:27:20 +01:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-09-29 09:43:39 +02:00
2023-10-24 13:10:06 -04:00
2023-11-13 14:20:54 +01:00
2023-11-30 20:24:43 +01:00
2023-09-29 09:42:21 +02:00
2023-11-21 10:48:14 +00:00
2023-10-19 16:53:28 +01:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:15:12 +01:00
2023-06-20 18:07:47 -04:00
2023-08-18 07:58:59 +02:00
2022-04-04 10:25:46 -04:00
2023-06-20 18:07:47 -04:00
2023-11-14 10:32:57 +01:00
2023-11-27 16:26:33 +00:00
2023-10-31 09:44:51 -07:00
2023-12-04 21:52:17 +09:00
2023-08-10 13:25:00 +02:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-11-27 16:26:33 +00:00
2023-11-25 00:59:17 +09:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-10-31 09:44:51 -07:00
2023-06-20 18:07:47 -04:00
2023-11-06 19:45:03 +00:00
2023-09-29 18:32:37 +01:00
2023-08-18 10:17:44 +02:00
2023-10-04 15:13:37 +02:00
2023-11-17 13:44:09 -08:00
2023-11-28 08:41:47 -08:00
2023-08-16 08:03:23 +02:00
2023-11-27 16:26:33 +00:00
2023-11-06 19:45:03 +00:00
2023-06-20 18:07:47 -04:00
2023-07-25 22:10:06 +02:00
2023-06-20 18:07:47 -04:00
2023-10-24 16:49:26 +02:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-08-03 14:17:30 -07:00
2023-06-20 18:07:47 -04:00
2023-11-24 12:41:16 +01:00
2023-09-05 12:27:20 +01:00
2023-06-20 18:07:47 -04:00