Wang, Yi
9323d0873c
use the enable_gqa param in torch.nn.functional.scaled_dot_product_at… (#39412)
* use the enable_gqa param in torch.nn.functional.scaled_dot_product_attention
* ci failure fix
* add check
* fix ci failure
* refine code, extend to cuda
* refine code
* fix review comments
* refine the PR
---------
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>
2025-07-21 14:46:43 +02:00